Benyou Wang

66 papers · 2018–2026 · 11 conferences · across top CS/AI conferences

Achievements

+15 more ↓

🗺️ Taxonomy Completionist (15) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (6) 🌍 Conference Polyglot (11)

🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (15) 🐣 Hot Topic Early Bird 🏆 Grand Slam 👑 Triple Crown 🏆 Keyword Champion (2) 🤝 Dynamic Duo (13) 👥 Mega-Team (34) 🔬 Deep Specialist (15) 🧬 Topic Evolution ⚡ Prolific Year (13) 🗃️ Keyword Collector (257) 💎 Century Club (61) 🔥 Unstoppable (8) ❓ The Questioner (5)

Conferences

ACL (19) EMNLP (14) NAACL (11) NIPS (7) ICLR (6) COLING (2) ICML (2) IJCAI (2) AAAI (1) ICCV (1) UAI (1)

Top co-authors

Junying Chen (13) Xiang Wan (12) Xidong Wang (9) Shunian Chen (9) Anningzhe Gao (8) Haizhou Li (8) Zhihong Chen (7) Jianquan Li (7) Zhenyang Cai (7) Yan Hu (6)

Research topics

Models (1) Applications (1) Reinforcement Learning (1) Linguistics (1)

Keywords

large language model (19) multimodal large language model (7) benchmark evaluation (6) multimodal learning (6) medical imaging (5) reinforcement learning (5) dialogue system (4) vision-language model (4) visual question answering (4) multi-task learning (4) reinforcement learning from human feedback (3) word embedding (3) contrastive learning (3) knowledge distillation (3) retrieval-augmented generation (3) sentiment analysis (3) domain adaptation (2) affective computing (2) cross-lingual alignment (2) instruction following (2)

Papers

Human or LLM as Standardized Patients? A Comparative Study in Medical Education ACL 2026 Towards Fine-grained Audio Captioning with Multimodal Contextual Fusion ACL 2026 S2S-Arena: Evaluating Paralinguistic Instruction Following in Speech-to-Speech Models ACL 2026 Probing Audio-Visual Reasoning in Multimodal Language Models through the Lens of Audio ACL 2026 Cut Your Losses! Learning to Prune Paths Early for Efficient Parallel Reasoning ACL 2026 Towards Understanding Fine-Tuning Mechanisms of LLMs via Circuit Analysis ICML 2025 MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria NAACL 2025 Is Your LLM Outdated? A Deep Look at Temporal Generalization NAACL 2025 LLMs for Mathematical Modeling: Towards Bridging the Gap between Natural and Mathematical Languages NAACL 2025 Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion ACL 2025 Exploring Compositional Generalization of Multimodal LLMs for Medical Imaging ACL 2025 Soundwave: Less is More for Speech-Text Alignment in LLMs ACL 2025 Leveraging Unit Language Guidance to Advance Speech Modeling in Textless Speech-to-Speech Translation ACL 2025 CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis ACL 2025 Towards Medical Complex Reasoning with LLMs through Medical Verifiable Problems ACL 2025 Unlocking LLMs’ Self-Improvement Capacity with Autonomous Learning for Domain Adaptation ACL 2025 Less is More: A Simple yet Effective Token Reduction Method for Efficient Multi-modal LLMs COLING 2025 RAG-Instruct: Boosting LLMs with Diverse Retrieval-Augmented Instructions EMNLP 2025 From Word to World: Evaluate and Mitigate Culture Bias in LLMs via Word Association Test EMNLP 2025 Model Unlearning via Sparse Autoencoder Subspace Guided Projections EMNLP 2025 Can Multimodal LLMs See Materials Clearly? A Multimodal Benchmark on Materials Characterization EMNLP 2025 DRBO: Mitigating the Bottleneck Effect via Dynamic Reward Balancing in Multi-reward LLM Optimization EMNLP 2025 Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM EMNLP 2025 LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via a Hybrid Architecture EMNLP 2025 Periodical Moving Average Accelerates Gradient Accumulation for Post-Training UAI 2025 Omni-MATH: A Universal Olympiad Level Mathematic Benchmark for Large Language Models ICLR 2025 Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts ICLR 2025 Smurfs: Multi-Agent System using Context-Efficient DFSDT for Tool Planning NAACL 2025 UCL-Bench: A Chinese User-Centric Legal Benchmark for Large Language Models NAACL 2025 UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models NAACL 2025 Huatuo-26M, a Large-scale Chinese Medical QA Dataset NAACL 2025 Humans or LLMs as the Judge? A Study on Judgement Bias EMNLP 2024 Alignment at Pre-training! Towards Native Alignment for Arabic LLMs NIPS 2024 OVM, Outcome-supervised Value Models for Planning in Mathematical Reasoning NAACL 2024 Rethinking the Uniformity Metric in Self-Supervised Learning ICLR 2024 CMB: A Comprehensive Medical Benchmark in Chinese NAACL 2024 AceGPT, Localizing Large Language Models in Arabic NAACL 2024 MathScale: Scaling Instruction Tuning for Mathematical Reasoning ICML 2024 FinBen: A Holistic Financial Benchmark for Large Language Models NIPS 2024 VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment EMNLP 2024 Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale EMNLP 2024 PlatoLM: Teaching LLMs in Multi-Round Dialogue via a User Simulator ACL 2024 Exploring the Potential of Dense Information in Multimodal Alignment ACL 2024 GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI NIPS 2024 Lifting the Curse of Capacity Gap in Distilling Language Models ACL 2023 Can Language Models Make Fun? A Case Study in Chinese Comical Crosstalk ACL 2023 On the Difference of BERT-style and CLIP-style Text Encoders ACL 2023 One Cannot Stand for Everyone! Leveraging Multiple User Simulators to train Task-oriented Dialogue Systems ACL 2023 Med-UniC: Unifying Cross-Lingual Medical Vision-Language Pre-Training by Diminishing Bias NIPS 2023 HuatuoGPT, Towards Taming Language Model to Be a Doctor EMNLP 2023 CMMA: Benchmarking Multi-Affection Detection in Chinese Multi-Modal Conversations NIPS 2023 Towards Unifying Medical Vision-and-Language Pre-Training via Soft Prompts ICCV 2023 Effective Open Intent Classification with K-center Contrastive Learning and Adjustable Decision Boundary AAAI 2023 Exploring extreme parameter compression for pre-trained language models ICLR 2022 MorphTE: Injecting Morphology in Tensorized Embeddings NIPS 2022 DPTDR: Deep Prompt Tuning for Dense Passage Retrieval COLING 2022 Hypoformer: Hybrid Decomposition Transformer for Edge-friendly Neural Machine Translation EMNLP 2022 What Does Your Smile Mean? Jointly Detecting Multi-Modal Sarcasm and Sentiment Using Quantum Probability EMNLP 2021 Word2Fun: Modelling Words as Functions for Diachronic Word Representation NIPS 2021 On Position Embeddings in BERT ICLR 2021 Encoding word order in complex embeddings ICLR 2020 A Multi-task Learning Framework for Opinion Triplet Extraction EMNLP 2020 CNM: An Interpretable Complex-valued Network for Matching NAACL 2019 A Multi-task Learning Approach for Image Captioning IJCAI 2018 PLASTIC: Prioritize Long and Short-term Information in Top-n Recommendation using Adversarial Training IJCAI 2018 Quantum-Inspired Complex Word Embedding ACL 2018