conftrace_

Zehan Qi

12 papers · 2024–2026 · 5 conferences · across top CS/AI conferences

Achievements

🐝 Cross-Pollinator (15) 🗺️ Taxonomy Completionist (29) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🌍 Conference Polyglot (4)

🌉 Interdisciplinary Bridge 👥 Mega-Team (28) 💎 Century Club (10) ⚡ Prolific Year (7)

Conferences

ACL (4) EMNLP (3) ICLR (2) NIPS (2) EACL (1)

Top co-authors

Rongwu Xu (6) Xiao Liu (6) Wei Xu (6) Yuxiao Dong (5) Jie Tang (5) Zhijiang Guo (4) Yifan Xu (3) Shuntian Yao (3) Xueqiao Sun (3) Hanyu Lai (3)

Keywords

large language model (6) language model (2) benchmark evaluation (2) question answering (1) bias mitigation (1) prompt engineering (1) model robustness (1) natural language queries (1) software engineering (1) knowledge graph (1) supervised fine-tuning (1) code generation (1) autonomous agent (1) scaling law (1) adversarial robustness (1) retrieval-augmented generation (1) chain-of-thought prompting (1) bias reduction (1) reasoning capability (1) zero-shot learning (1)

Papers

DebateQA: Evaluating Question Answering on Debatable Knowledge · EACL 2026

KARL: Reinforcement Learning for LLM Agents on Multi-Turn Knowledge-Intensive Agentic Tasks · ACL 2026

A Survey of Post-Training Scaling in Large Language Models · ACL 2025

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning · ICLR 2025

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents · ICLR 2025

Bias and Volatility: A Statistical Framework for Evaluating Large Language Model's Stereotypes and the Associated Generation Inconsistency · NIPS 2024

LONG2RAG: Evaluating Long-Context & Long-Form Retrieval-Augmented Generation with Key Point Recall · EMNLP 2024

Knowledge Conflicts for LLMs: A Survey · EMNLP 2024

MR-Ben: A Meta-Reasoning Benchmark for Evaluating System-2 Thinking in LLMs · NIPS 2024

NaturalCodeBench: Examining Coding Performance Mismatch on HumanEval and Natural User Queries · ACL 2024

Preemptive Answer “Attacks” on Chain-of-Thought Reasoning · ACL 2024

Walking in Others’ Shoes: How Perspective-Taking Guides Large Language Models in Reducing Toxicity and Bias · EMNLP 2024