When Should Dense Retrievers Be Updated in Evolving Corpora? Detecting Out-of-Distribution Corpora Using GradNormIR
Published in ACL 2025 Findings, 2025
This work addresses the challenge of maintaining dense retriever performance in evolving corpora. We propose GradNormIR, an unsupervised method that detects out-of-distribution shifts in document collections using gradient norms, enabling timely updates of dense retrievers without manual intervention. Experiments on BEIR benchmark demonstrate that our approach effectively identifies when retriever updates are necessary, maintaining and improving performance as corpus distributions change.
| Paper (arXiv) | Paper (ACL Anthology) | Code |
Recommended citation: Dayoon Ko, Jinyoung Kim, Sohyeon Kim, Jinhyuk Kim, Jaehoon Lee, Seonghak Song, Minyoung Lee, Gunhee Kim. (2025). "When Should Dense Retrievers Be Updated in Evolving Corpora? Detecting Out-of-Distribution Corpora Using GradNormIR." Findings of ACL 2025.
Download Paper
