Wenhao Huang
40 papers · 2019–2026 · 11 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
π Academic Marathon (6) π Conference Polyglot (10) π§ Keyword Pioneer π Interdisciplinary Bridge π Cross-Pollinator (12)
π
Cross-Pollinator
(12)
π
Renaissance Researcher
(10)
πΊοΈ
Taxonomy Completionist
(82)
π§¬
Topic Evolution
π₯
Mega-Team
(32)
π€
Dynamic Duo
(19)
π¬
Deep Specialist
(11)
π
Conference Pioneer
π
Century Club
(37)
ποΈ
Keyword Collector
(185)
β
The Questioner
(4)
β‘
Prolific Year
(12)
Conferences
ACL (11)
EMNLP (8)
ICLR (5)
AAAI (4)
COLING (3)
CVPR (2)
EACL (2)
NAACL (2)
ICCV (1)
IJCNLP (1)
NIPS (1)
Top co-authors
Keywords
large language model
(14)
benchmark evaluation
(6)
multimodal large language model
(5)
information extraction
(4)
multimodal learning
(4)
higher-order perception
(2)
visual question answering
(2)
distant supervision
(2)
representation learning
(2)
transfer learning
(2)
instruction tuning
(2)
instruction following
(2)
noisy label
(2)
multimodal understanding
(2)
reinforcement learning
(2)
prompt engineering
(2)
text generation
(2)
positive unlabeled learning
(2)
relation extraction
(2)
vision-language model
(2)
Papers
MMRA: A Benchmark for Evaluating Multi-Granularity and Multi-Image Relational Association Capabilities in Large Visual Language Models
EACL 2026
COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values
EACL 2026
CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization
ACL 2026
LIME: Less Is More for MLLM Evaluation
ACL 2025
MIO: A Foundation Model on Multimodal Tokens
EMNLP 2025
MARS-Bench: A Multi-turn Athletic Real-world Scenario Benchmark for Dialogue Evaluation
EMNLP 2025
SuPreME: A Supervised Pre-training Framework for Multimodal ECG Representation Learning
EMNLP 2025
SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models
ICCV 2025
COIG-CQIA: Quality is All You Need for Chinese Instruction Fine-tuning
NAACL 2025
KOR-Bench: Benchmarking Language Models on Knowledge-Orthogonal Reasoning Tasks
ICLR 2025
Steering Protein Family Design through Profile Bayesian Flow
ICLR 2025
Long Video Diffusion Generation with Segmented Cross-Attention and Content-Rich Video Data Curation
CVPR 2025
MuPT: A Generative Symbolic Music Pretrained Transformer
ICLR 2025
Can MLLMs Understand the Deep Implication Behind Chinese Images?
ACL 2025
PopAlign: Diversifying Contrasting Patterns for a More Comprehensive Alignment
ACL 2025
Is There a One-Model-Fits-All Approach to Information Extraction? Revisiting Task Definition Biases
EMNLP 2024
PositionID: LLMs can Control Lengths, Copy and Paste with Explicit Positional Awareness
EMNLP 2024
MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training
ICLR 2024
MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning
ICLR 2024
AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation
EMNLP 2024
Xiezhi: An Ever-Updating Benchmark for Holistic Domain Knowledge Evaluation
AAAI 2024
Can Large Language Models Understand Real-World Complex Instructions?
AAAI 2024
PsychoGAT: A Novel Psychological Measurement Paradigm through Interactive Fiction Games with LLM Agents
ACL 2024
ChatMusician: Understanding and Generating Music Intrinsically with LLM
ACL 2024
CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models
ACL 2024
SciMMIR: Benchmarking Scientific Multi-modal Information Retrieval
ACL 2024
RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models
ACL 2024
CMDAG: A Chinese Metaphor Dataset with Annotated Grounds as CoT for Boosting Metaphor Generation
COLING 2024
Improving Recall of Large Language Models: A Model Collaboration Approach for Relational Triple Extraction
COLING 2024
MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces
COLING 2024
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
CVPR 2024
II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models
NIPS 2024
MMTE: Corpus and Metrics for Evaluating Machine Translation Quality of Metaphorical Language
EMNLP 2024
DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence?
EMNLP 2024
MusiLingo: Bridging Music and Text with Pre-trained Language Models for Music Captioning and Query Response
NAACL 2024
Adaptive Ordered Information Extraction with Deep Reinforcement Learning
ACL 2023
Revisiting the Negative Data of Distantly Supervised Relation Extraction
ACL 2021
Revisiting the Negative Data of Distantly Supervised Relation Extraction
IJCNLP 2021
Text Assisted Insight Ranking Using Context-Aware Memory Network
AAAI 2019
Learning Personalized End-to-End Goal-Oriented Dialog
AAAI 2019