Benyou Wang
66 papers · 2018–2026 · 11 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (15) π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (6) π Conference Polyglot (11)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(15)
π£
Hot Topic Early Bird
π
Grand Slam
π
Triple Crown
π
Keyword Champion
(2)
π€
Dynamic Duo
(13)
π₯
Mega-Team
(34)
π¬
Deep Specialist
(15)
π§¬
Topic Evolution
β‘
Prolific Year
(13)
ποΈ
Keyword Collector
(257)
π
Century Club
(61)
π₯
Unstoppable
(8)
β
The Questioner
(5)
Conferences
ACL (19)
EMNLP (14)
NAACL (11)
NIPS (7)
ICLR (6)
COLING (2)
ICML (2)
IJCAI (2)
AAAI (1)
ICCV (1)
UAI (1)
Top co-authors
Research topics
Keywords
large language model
(19)
multimodal large language model
(7)
benchmark evaluation
(6)
multimodal learning
(6)
medical imaging
(5)
reinforcement learning
(5)
dialogue system
(4)
vision-language model
(4)
visual question answering
(4)
multi-task learning
(4)
reinforcement learning from human feedback
(3)
word embedding
(3)
contrastive learning
(3)
knowledge distillation
(3)
retrieval-augmented generation
(3)
sentiment analysis
(3)
domain adaptation
(2)
affective computing
(2)
cross-lingual alignment
(2)
instruction following
(2)
Papers
Human or LLM as Standardized Patients? A Comparative Study in Medical Education
ACL 2026
Towards Fine-grained Audio Captioning with Multimodal Contextual Fusion
ACL 2026
S2S-Arena: Evaluating Paralinguistic Instruction Following in Speech-to-Speech Models
ACL 2026
Probing Audio-Visual Reasoning in Multimodal Language Models through the Lens of Audio
ACL 2026
Cut Your Losses! Learning to Prune Paths Early for Efficient Parallel Reasoning
ACL 2026
Towards Understanding Fine-Tuning Mechanisms of LLMs via Circuit Analysis
ICML 2025
MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria
NAACL 2025
Is Your LLM Outdated? A Deep Look at Temporal Generalization
NAACL 2025
LLMs for Mathematical Modeling: Towards Bridging the Gap between Natural and Mathematical Languages
NAACL 2025
Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion
ACL 2025
Exploring Compositional Generalization of Multimodal LLMs for Medical Imaging
ACL 2025
Soundwave: Less is More for Speech-Text Alignment in LLMs
ACL 2025
Leveraging Unit Language Guidance to Advance Speech Modeling in Textless Speech-to-Speech Translation
ACL 2025
CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis
ACL 2025
Towards Medical Complex Reasoning with LLMs through Medical Verifiable Problems
ACL 2025
Unlocking LLMsβ Self-Improvement Capacity with Autonomous Learning for Domain Adaptation
ACL 2025
Less is More: A Simple yet Effective Token Reduction Method for Efficient Multi-modal LLMs
COLING 2025
RAG-Instruct: Boosting LLMs with Diverse Retrieval-Augmented Instructions
EMNLP 2025
From Word to World: Evaluate and Mitigate Culture Bias in LLMs via Word Association Test
EMNLP 2025
Model Unlearning via Sparse Autoencoder Subspace Guided Projections
EMNLP 2025
Can Multimodal LLMs See Materials Clearly? A Multimodal Benchmark on Materials Characterization
EMNLP 2025
DRBO: Mitigating the Bottleneck Effect via Dynamic Reward Balancing in Multi-reward LLM Optimization
EMNLP 2025
Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM
EMNLP 2025
LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via a Hybrid Architecture
EMNLP 2025
Periodical Moving Average Accelerates Gradient Accumulation for Post-Training
UAI 2025
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark for Large Language Models
ICLR 2025
Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts
ICLR 2025
Smurfs: Multi-Agent System using Context-Efficient DFSDT for Tool Planning
NAACL 2025
UCL-Bench: A Chinese User-Centric Legal Benchmark for Large Language Models
NAACL 2025
UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models
NAACL 2025
Huatuo-26M, a Large-scale Chinese Medical QA Dataset
NAACL 2025
Humans or LLMs as the Judge? A Study on Judgement Bias
EMNLP 2024
Alignment at Pre-training! Towards Native Alignment for Arabic LLMs
NIPS 2024
OVM, Outcome-supervised Value Models for Planning in Mathematical Reasoning
NAACL 2024
Rethinking the Uniformity Metric in Self-Supervised Learning
ICLR 2024
CMB: A Comprehensive Medical Benchmark in Chinese
NAACL 2024
AceGPT, Localizing Large Language Models in Arabic
NAACL 2024
MathScale: Scaling Instruction Tuning for Mathematical Reasoning
ICML 2024
FinBen: A Holistic Financial Benchmark for Large Language Models
NIPS 2024
VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment
EMNLP 2024
Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale
EMNLP 2024
PlatoLM: Teaching LLMs in Multi-Round Dialogue via a User Simulator
ACL 2024
Exploring the Potential of Dense Information in Multimodal Alignment
ACL 2024
GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI
NIPS 2024
Lifting the Curse of Capacity Gap in Distilling Language Models
ACL 2023
Can Language Models Make Fun? A Case Study in Chinese Comical Crosstalk
ACL 2023
On the Difference of BERT-style and CLIP-style Text Encoders
ACL 2023
One Cannot Stand for Everyone! Leveraging Multiple User Simulators to train Task-oriented Dialogue Systems
ACL 2023
Med-UniC: Unifying Cross-Lingual Medical Vision-Language Pre-Training by Diminishing Bias
NIPS 2023
HuatuoGPT, Towards Taming Language Model to Be a Doctor
EMNLP 2023
CMMA: Benchmarking Multi-Affection Detection in Chinese Multi-Modal Conversations
NIPS 2023
Towards Unifying Medical Vision-and-Language Pre-Training via Soft Prompts
ICCV 2023
Effective Open Intent Classification with K-center Contrastive Learning and Adjustable Decision Boundary
AAAI 2023
Exploring extreme parameter compression for pre-trained language models
ICLR 2022
MorphTE: Injecting Morphology in Tensorized Embeddings
NIPS 2022
DPTDR: Deep Prompt Tuning for Dense Passage Retrieval
COLING 2022
Hypoformer: Hybrid Decomposition Transformer for Edge-friendly Neural Machine Translation
EMNLP 2022
What Does Your Smile Mean? Jointly Detecting Multi-Modal Sarcasm and Sentiment Using Quantum Probability
EMNLP 2021
Word2Fun: Modelling Words as Functions for Diachronic Word Representation
NIPS 2021
On Position Embeddings in BERT
ICLR 2021
Encoding word order in complex embeddings
ICLR 2020
A Multi-task Learning Framework for Opinion Triplet Extraction
EMNLP 2020
CNM: An Interpretable Complex-valued Network for Matching
NAACL 2019
A Multi-task Learning Approach for Image Captioning
IJCAI 2018
PLASTIC: Prioritize Long and Short-term Information in Top-n Recommendation using Adversarial Training
IJCAI 2018
Quantum-Inspired Complex Word Embedding
ACL 2018