Crystina Zhang
10 papers · 2024–2026 · 4 conferences · across top CS/AI conferences
Achievements
🐝 Cross-Pollinator (15)
🌉 Interdisciplinary Bridge
🗺️ Taxonomy Completionist (22)
🧭 Keyword Pioneer
🌍 Conference Polyglot (3)
🌈 Renaissance Researcher (7)
👥 Mega-Team (82)
⚡ Prolific Year (5)
❓ The Questioner
Conferences
EMNLP (4)
NAACL (3)
ACL (2)
ICLR (1)
Keywords
retrieval-augmented generation (2)
multilingual language model (2)
information retrieval (2)
multilingual nlp (1)
prompt engineering (1)
language model robustness (1)
question answering (1)
ranking aggregation (1)
listwise ranking (1)
diffusion model (1)
multimodal dataset (1)
vision language model (1)
evaluation benchmark (1)
vision-language model (1)
text-to-image generation (1)
cross-lingual transfer (1)
multi-image understanding (1)
hallucination rate (1)
out-of-domain generalization (1)
visual question answering (1)
Papers
BrowseComp-Plus: A Fair and Disentangled Evaluation Benchmark for Deep Search Agents
ACL 2026
The Role of Mixed-Language Documents for Multilingual Large Language Model Pretraining
ACL 2026
Hard Negatives, Hard Lessons: Revisiting Training Data Quality for Robust Information Retrieval with LLMs
EMNLP 2025
MMTEB: Massive Multilingual Text Embedding Benchmark
ICLR 2025
Tomato, Tomahto, Tomate: Do Multilingual Language Models Understand Based on Subword-Level Semantic Concepts?
NAACL 2025
Found in the Middle: Permutation Self-Consistency Improves Listwise Ranking in Large Language Models
NAACL 2024
FoodieQA: A Multimodal Dataset for Fine-Grained Understanding of Chinese Food Culture
EMNLP 2024
“Knowing When You Don’t Know”: A Multilingual Relevance Assessment Dataset for Robust Retrieval-Augmented Generation
EMNLP 2024
Words Worth a Thousand Pictures: Measuring and Understanding Perceptual Variability in Text-to-Image Generation
EMNLP 2024
CELI: Simple yet Effective Approach to Enhance Out-of-Domain Generalization of Cross-Encoders
NAACL 2024