Raoyuan Zhao
6 papers · 2024–2026 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓
🌍
Conference Polyglot
(2)
🌉
Interdisciplinary Bridge
🧭
Keyword Pioneer
🐝
Cross-Pollinator
(15)
❓
The Questioner
(2)
Conferences
EMNLP (3)
ACL (2)
EACL (1)
Top co-authors
Keywords
large language model
(4)
prompt engineering
(1)
chain-of-thought reasoning
(1)
model evaluation
(1)
knowledge probing
(1)
synthetic data generation
(1)
instruction tuning
(1)
model comparison
(1)
synthetic datum
(1)
failure detection
(1)
cultural awareness
(1)
behavioral testing
(1)
multilingual reasoning
(1)
reasoning trace
(1)
multilingual evaluation
(1)
controlled generation
(1)
nlp evaluation
(1)
knowledge gap
(1)
faithfulness analysis
(1)
typographical error
(1)
Papers
A Comprehensive Evaluation of Multilingual Chain-of-Thought Reasoning: Performance, Consistency, and Faithfulness Across Languages
EACL 2026
Evaluating Robustness of Large Language Models Against Multilingual Typographical Errors
ACL 2026
What’s the Difference? Supporting Users in Identifying the Effects of Prompt and Model Changes Through Token Patterns
ACL 2025
MAKIEval: A Multilingual Automatic WiKidata-based Framework for Cultural Awareness Evaluation for LLMs
EMNLP 2025
Do We Know What LLMs Don’t Know? A Study of Consistency in Knowledge Probing
EMNLP 2025
SynthEval: Hybrid Behavioral Testing of NLP Models with Synthetic CheckLists
EMNLP 2024