Nandan Thakur
9 papers · 2021–2026 · 4 conferences · across top CS/AI conferences
Achievements
Cross-Pollinator (15)
Hot Topic Early Bird
Conference Polyglot (4)
Renaissance Researcher (6)
Interdisciplinary Bridge
Taxonomy Completionist (19)
Keyword Pioneer
Mega-Team (82)
Unstoppable (5)
Conferences
NAACL (4)
ACL (2)
EMNLP (2)
ICLR (1)
Keywords
information retrieval (4)
retrieval-augmented generation (3)
domain adaptation (2)
multilingual NLP (2)
query generation (2)
dense retrieval (2)
knowledge retrieval (2)
large language model (2)
evaluation benchmark (1)
semantic embedding (1)
multilingual retrieval (1)
language model robustness (1)
LLM evaluation (1)
data augmentation (1)
synthetic training data (1)
pseudo labeling (1)
robustness evaluation (1)
training data (1)
multilingual evaluation (1)
hallucination rate (1)
Papers
BrowseComp-Plus: A Fair and Disentangled Evaluation Benchmark for Deep Search Agents
ACL 2026
Hard Negatives, Hard Lessons: Revisiting Training Data Quality for Robust Information Retrieval with LLMs
EMNLP 2025
MMTEB: Massive Multilingual Text Embedding Benchmark
ICLR 2025
MIRAGE-Bench: Automatic Multilingual Benchmark Arena for Retrieval-Augmented Generation Systems
NAACL 2025
Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval
NAACL 2024
“Knowing When You Don’t Know”: A Multilingual Relevance Assessment Dataset for Robust Retrieval-Augmented Generation
EMNLP 2024
Evaluating Embedding APIs for Information Retrieval
ACL 2023
GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval
NAACL 2022
Augmented SBERT: Data Augmentation Method for Improving Bi-Encoders for Pairwise Sentence Scoring Tasks
NAACL 2021