Haonan Li
29 papers · 2019–2026 · 12 top CS/AI conferences
Achievements
Cross-Pollinator (8)
Interdisciplinary Bridge
Academic Marathon (6)
Conference Polyglot (11)
Renaissance Researcher (5)
Taxonomy Completionist (38)
Keyword Pioneer
Mega-Team (43)
Grand Slam
Dynamic Duo (19)
Topic Evolution
Unstoppable (7)
Prolific Year (6)
Keyword Collector (100)
Century Club (23)
Conferences
ACL (9)
EMNLP (4)
NAACL (4)
EACL (3)
COLING (2)
AAAI (1)
AACL (1)
ICLR (1)
ICML (1)
IJCNLP (1)
NIPS (1)
SEMEVAL (1)
Keywords
large language model (11)
pre-trained language model (3)
benchmark evaluation (3)
risk assessment (2)
question answering (2)
machine reading comprehension (2)
Chinese language (2)
instruction tuning (2)
Arabic language (2)
multiple-choice question (2)
contrastive learning (2)
multitask learning (2)
harmful content detection (2)
named entity recognition (1)
chain-of-thought reasoning (1)
text classification (1)
claim verification (1)
sentiment analysis (1)
multilingual NLP (1)
language model evaluation (1)
Papers
Nanda Family: Open-Weights Generative Large Language Models for Hindi
EACL 2026
FinChain: A Symbolic Benchmark for Verifiable Chain-of-Thought Financial Reasoning
ACL 2026
Multilingual Idioms in Sentences and Conversations Across High-, Medium-, and Low-Resource Languages
ACL 2026
CoQuIR: A Comprehensive Benchmark for Code Quality-Aware Information Retrieval
ACL 2026
Control Illusion: The Failure of Instruction Hierarchies in Large Language Models
AAAI 2026
SCALAR: Scientific Citation-based Live Assessment of Long-context Academic Reasoning
EACL 2026
BALSAM: A Platform for Benchmarking Arabic Large Language Models
EMNLP 2025
Loki: An Open-Source Tool for Fact Verification
COLING 2025
ToolGen: Unified Tool Retrieval and Calling via Generation
ICLR 2025
Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples
ICML 2025
NAT: Enhancing Agent Tuning with Negative Samples
NAACL 2025
Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability
NAACL 2025
Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification
ACL 2024
CMMLU: Measuring massive multitask language understanding in Chinese
ACL 2024
ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic
ACL 2024
A Chinese Dataset for Evaluating the Safeguards in Large Language Models
ACL 2024
Demystifying Instruction Mixing for Fine-tuning Large Language Models
ACL 2024
Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs
NIPS 2024
EXAMS-V: A Multi-Discipline Multilingual Multimodal Exam Benchmark for Evaluating Vision Language Models
ACL 2024
Do-Not-Answer: Evaluating Safeguards in LLMs
EACL 2024
Location Aware Modular Biencoder for Tourism Question Answering
IJCNLP 2023
Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLU
EMNLP 2023
Location Aware Modular Biencoder for Tourism Question Answering
AACL 2023
Sentiment-Aware Word and Sentence Level Pre-training for Sentiment Analysis
EMNLP 2022
MultiSpanQA: A Dataset for Multi-Span Question Answering
NAACL 2022
CULG: Commercial Universal Language Generation
NAACL 2022
KFCNet: Knowledge Filtering and Contrastive Learning for Generative Commonsense Reasoning
EMNLP 2021
Target Word Masking for Location Metonymy Resolution
COLING 2020
UniMelb at SemEval-2019 Task 12: Multi-model combination for toponym resolution
SEMEVAL 2019