Wenhao Huang

40 papers · 2019–2026 · 11 conferences · across top CS/AI conferences

Achievements

🏃 Academic Marathon (6) 🌍 Conference Polyglot (10) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (12)

🌈 Renaissance Researcher (10) 🗺️ Taxonomy Completionist (82) 🧬 Topic Evolution 👥 Mega-Team (32) 🤝 Dynamic Duo (19) 🔬 Deep Specialist (11) 🚀 Conference Pioneer 💎 Century Club (37) 🗃️ Keyword Collector (185) ❓ The Questioner (4) ⚡ Prolific Year (12)

Conferences

ACL (11) EMNLP (8) ICLR (5) AAAI (4) COLING (3) CVPR (2) EACL (2) NAACL (2) ICCV (1) IJCNLP (1) NIPS (1)

Top co-authors

Ge Zhang (22) Chenghua Lin (12) Jiaheng Liu (12) Jie Fu (11) Xingwei Qu (10) Yanghua Xiao (9) Yizhi Li (8) Yiming Liang (8) Jiaqing Liang (7) Tianyu Zheng (7)

Keywords

large language model (14) benchmark evaluation (6) multimodal large language model (5) information extraction (4) multimodal learning (4) higher-order perception (2) visual question answering (2) distant supervision (2) representation learning (2) transfer learning (2) instruction tuning (2) instruction following (2) noisy label (2) multimodal understanding (2) reinforcement learning (2) prompt engineering (2) text generation (2) positive unlabeled learning (2) relation extraction (2) vision-language model (2)

Papers

MMRA: A Benchmark for Evaluating Multi-Granularity and Multi-Image Relational Association Capabilities in Large Visual Language Models · EACL 2026
COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values · EACL 2026
CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization · ACL 2026
LIME: Less Is More for MLLM Evaluation · ACL 2025
MIO: A Foundation Model on Multimodal Tokens · EMNLP 2025
MARS-Bench: A Multi-turn Athletic Real-world Scenario Benchmark for Dialogue Evaluation · EMNLP 2025
SuPreME: A Supervised Pre-training Framework for Multimodal ECG Representation Learning · EMNLP 2025
SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models · ICCV 2025
COIG-CQIA: Quality is All You Need for Chinese Instruction Fine-tuning · NAACL 2025
KOR-Bench: Benchmarking Language Models on Knowledge-Orthogonal Reasoning Tasks · ICLR 2025
Steering Protein Family Design through Profile Bayesian Flow · ICLR 2025
Long Video Diffusion Generation with Segmented Cross-Attention and Content-Rich Video Data Curation · CVPR 2025
MuPT: A Generative Symbolic Music Pretrained Transformer · ICLR 2025
Can MLLMs Understand the Deep Implication Behind Chinese Images? · ACL 2025
PopAlign: Diversifying Contrasting Patterns for a More Comprehensive Alignment · ACL 2025
Is There a One-Model-Fits-All Approach to Information Extraction? Revisiting Task Definition Biases · EMNLP 2024
PositionID: LLMs can Control Lengths, Copy and Paste with Explicit Positional Awareness · EMNLP 2024
MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training · ICLR 2024
MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning · ICLR 2024
AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation · EMNLP 2024
Xiezhi: An Ever-Updating Benchmark for Holistic Domain Knowledge Evaluation · AAAI 2024
Can Large Language Models Understand Real-World Complex Instructions? · AAAI 2024
PsychoGAT: A Novel Psychological Measurement Paradigm through Interactive Fiction Games with LLM Agents · ACL 2024
ChatMusician: Understanding and Generating Music Intrinsically with LLM · ACL 2024
CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models · ACL 2024
SciMMIR: Benchmarking Scientific Multi-modal Information Retrieval · ACL 2024
RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models · ACL 2024
CMDAG: A Chinese Metaphor Dataset with Annotated Grounds as CoT for Boosting Metaphor Generation · COLING 2024
Improving Recall of Large Language Models: A Model Collaboration Approach for Relational Triple Extraction · COLING 2024
MORE-3S: Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces · COLING 2024
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI · CVPR 2024
II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models · NIPS 2024
MMTE: Corpus and Metrics for Evaluating Machine Translation Quality of Metaphorical Language · EMNLP 2024
DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence? · EMNLP 2024
MusiLingo: Bridging Music and Text with Pre-trained Language Models for Music Captioning and Query Response · NAACL 2024
Adaptive Ordered Information Extraction with Deep Reinforcement Learning · ACL 2023
Revisiting the Negative Data of Distantly Supervised Relation Extraction · ACL 2021
Revisiting the Negative Data of Distantly Supervised Relation Extraction · IJCNLP 2021
Text Assisted Insight Ranking Using Context-Aware Memory Network · AAAI 2019
Learning Personalized End-to-End Goal-Oriented Dialog · AAAI 2019