Wangchunshu Zhou

52 papers · 2019–2026 · 11 conferences · across top CS/AI conferences

Achievements

+15 more ↓

🧭 Keyword Pioneer 🌍 Conference Polyglot (11) 🗺️ Taxonomy Completionist (10) 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (6)

🗺️ Taxonomy Completionist (10) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🤝 Dynamic Duo (14) 👑 Triple Crown 🏆 Grand Slam 👥 Mega-Team (29) 🔬 Deep Specialist (10) 🧬 Topic Evolution ⚡ Prolific Year (10) ❓ The Questioner (2) 🗃️ Keyword Collector (218) 🔥 Unstoppable (7) 📈 Trend Setter 💎 Century Club (51)

Conferences

EMNLP (17) ACL (15) ICLR (4) ICML (4) NIPS (3) COLING (2) EACL (2) NAACL (2) AAAI (1) ECCV (1) IJCNLP (1)

Top co-authors

Ke Xu (14) Tao Ge (10) Furu Wei (9) Canwen Xu (7) Yuchen Eleanor Jiang (6) Julian McAuley (6) Wenhao Huang (6) Ming Zhou (5) Jie Fu (5) Jiaheng Liu (5)

Keywords

large language model (7) knowledge distillation (7) model compression (6) transfer learning (5) benchmark evaluation (5) text generation (4) self-supervised learning (4) pretrained language model (3) machine translation (3) language model (3) vision-language model (3) natural language understanding (2) efficient inference (2) contrastive learning (2) few-shot learning (2) instruction tuning (2) question answering (2) representation learning (2) grammatical error correction (2) prompt engineering (2)

Papers

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values EACL 2026 PopAlign: Diversifying Contrasting Patterns for a More Comprehensive Alignment ACL 2025 MIO: A Foundation Model on Multimodal Tokens EMNLP 2025 OAgents: An Empirical Study of Building Effective Agents EMNLP 2025 OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use ACL 2025 M+: Extending MemoryLLM with Scalable Long-Term Memory ICML 2025 ChemAgent: Self-updating Memories in Large Language Models Improves Chemical Reasoning ICLR 2025 MIMIR: A Customizable Agent Tuning Platform for Enhanced Scientific Applications EMNLP 2024 SmartTrim: Adaptive Tokens and Attention Pruning for Efficient Vision-Language Models COLING 2024 Struc-Bench: Are Large Language Models Good at Generating Complex Structured Tabular Data? NAACL 2024 CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models ACL 2024 OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models ICML 2024 RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models ACL 2024 AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning ACL 2024 LoraRetriever: Input-Aware LoRA Retrieval and Composition for Mixed Tasks in the Wild ACL 2024 How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs ECCV 2024 PositionID: LLMs can Control Lengths, Copy and Paste with Explicit Positional Awareness EMNLP 2024 Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models EMNLP 2023 To Repeat or Not To Repeat: Insights from Scaling LLM under Token-Crisis NIPS 2023 Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training ACL 2023 Learning to Predict Persona Information for Dialogue Personalization without Explicit Persona Description ACL 2023 Commonsense Knowledge Transfer for Pre-trained Language Models ACL 2023 Modular Transformers: Compressing Transformers into Modularized Layers for Flexible Efficient Inference ACL 2023 EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive Pruning ACL 2023 Poor Man’s Quality Estimation: Predicting Reference-Based MT Metrics Without the Reference EACL 2023 Evaluating Large Language Models on Controlled Generation Tasks EMNLP 2023 Doolittle: Benchmarks and Corpora for Academic Writing Formalization EMNLP 2023 Let’s Synthesize Step by Step: Iterative Dataset Synthesis with Large Language Models by Extrapolating Errors from Small Models EMNLP 2023 Findings of the WMT 2023 Shared Task on Machine Translation with Terminologies EMNLP 2023 Write and Paint: Generative Vision-Language Models are Unified Modal Learners ICLR 2023 Controlled Text Generation with Natural Language Instructions ICML 2023 VLUE: A Multi-Task Multi-Dimension Benchmark for Evaluating Vision-Language Pre-training ICML 2022 Contextual Representation Learning beyond Masked Language Modeling ACL 2022 BERT Learns to Teach: Knowledge Distillation with Meta Learning ACL 2022 Efficiently Tuned Parameters Are Task Embeddings EMNLP 2022 Pre-training Text-to-Text Transformers for Concept-centric Common Sense ICLR 2021 Learning from Perturbations: Diverse and Informative Dialogue Generation with Inverse Adversarial Training IJCNLP 2021 Blow the Dog Whistle: A Chinese Dataset for Cant Understanding with Common Sense and World Knowledge NAACL 2021 Learning from Perturbations: Diverse and Informative Dialogue Generation with Inverse Adversarial Training ACL 2021 Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting EMNLP 2021 Beyond Preserved Accuracy: Evaluating Loyalty and Robustness of BERT Compression EMNLP 2021 Self-Adversarial Learning with Comparative Discrimination for Text Generation ICLR 2020 Learning to Compare for Better Training and Evaluation of Open Domain Natural Language Generation Models AAAI 2020 Improving Grammatical Error Correction with Machine Translation Pairs EMNLP 2020 Towards Interpretable Natural Language Understanding with Explanations as Latent Variables NIPS 2020 BERT Loses Patience: Fast and Robust Inference with Early Exit NIPS 2020 Pseudo-Bidirectional Decoding for Local Sequence Transduction EMNLP 2020 CommonGen: A Constrained Text Generation Challenge for Generative Commonsense Reasoning EMNLP 2020 Scheduled DropHead: A Regularization Method for Transformer Models EMNLP 2020 Connecting the Dots Between Fact Verification and Fake News Detection COLING 2020 BERT-of-Theseus: Compressing BERT by Progressive Module Replacing EMNLP 2020 BERT-based Lexical Substitution ACL 2019