Zehan Qi
12 papers · 2024–2026 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+4 more ↓ Show less ↑
๐ Cross-Pollinator (15) ๐บ๏ธ Taxonomy Completionist (29) ๐งญ Keyword Pioneer ๐ฃ Hot Topic Early Bird ๐ Conference Polyglot (4)
๐
Interdisciplinary Bridge
๐ฅ
Mega-Team
(28)
๐
Century Club
(10)
โก
Prolific Year
(7)
Conferences
ACL (4)
EMNLP (3)
ICLR (2)
NIPS (2)
EACL (1)
Top co-authors
Keywords
large language model
(6)
language model
(2)
benchmark evaluation
(2)
question answering
(1)
bias mitigation
(1)
prompt engineering
(1)
model robustness
(1)
natural language queries
(1)
software engineering
(1)
knowledge graph
(1)
supervised fine-tuning
(1)
code generation
(1)
autonomous agent
(1)
scaling law
(1)
adversarial robustness
(1)
retrieval-augmented generation
(1)
chain-of-thought prompting
(1)
bias reduction
(1)
reasoning capability
(1)
zero-shot learning
(1)
Papers
DebateQA: Evaluating Question Answering on Debatable Knowledge
EACL 2026
KARL: Reinforcement Learning for LLM Agents on Multi-Turn Knowledge-Intensive Agentic Tasks
ACL 2026
A Survey of Post-Training Scaling in Large Language Models
ACL 2025
WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning
ICLR 2025
VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
ICLR 2025
Bias and Volatility: A Statistical Framework for Evaluating Large Language Model's Stereotypes and the Associated Generation Inconsistency
NIPS 2024
LONG2RAG: Evaluating Long-Context & Long-Form Retrieval-Augmented Generation with Key Point Recall
EMNLP 2024
Knowledge Conflicts for LLMs: A Survey
EMNLP 2024
MR-Ben: A Meta-Reasoning Benchmark for Evaluating System-2 Thinking in LLMs
NIPS 2024
NaturalCodeBench: Examining Coding Performance Mismatch on HumanEval and Natural User Queries
ACL 2024
Preemptive Answer โAttacksโ on Chain-of-Thought Reasoning
ACL 2024
Walking in Othersโ Shoes: How Perspective-Taking Guides Large Language Models in Reducing Toxicity and Bias
EMNLP 2024