Wangchunshu Zhou
52 papers · 2019–2026 · 11 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (11)
Interdisciplinary Bridge
Academic Marathon (6)
Taxonomy Completionist (10)
Keyword Pioneer
Hot Topic Early Bird
Dynamic Duo (14)
Triple Crown
Grand Slam
Mega-Team (29)
Deep Specialist (10)
Topic Evolution
Prolific Year (10)
The Questioner (2)
Keyword Collector (218)
Unstoppable (7)
Trend Setter
Century Club (51)
Conferences
EMNLP (17)
ACL (15)
ICLR (4)
ICML (4)
NIPS (3)
COLING (2)
EACL (2)
NAACL (2)
AAAI (1)
ECCV (1)
IJCNLP (1)
Keywords
large language model (7)
knowledge distillation (7)
model compression (6)
transfer learning (5)
benchmark evaluation (5)
text generation (4)
self-supervised learning (4)
pretrained language model (3)
machine translation (3)
language model (3)
vision-language model (3)
natural language understanding (2)
efficient inference (2)
contrastive learning (2)
few-shot learning (2)
instruction tuning (2)
question answering (2)
representation learning (2)
grammatical error correction (2)
prompt engineering (2)
Papers
COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values
EACL 2026
PopAlign: Diversifying Contrasting Patterns for a More Comprehensive Alignment
ACL 2025
MIO: A Foundation Model on Multimodal Tokens
EMNLP 2025
OAgents: An Empirical Study of Building Effective Agents
EMNLP 2025
OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use
ACL 2025
M+: Extending MemoryLLM with Scalable Long-Term Memory
ICML 2025
ChemAgent: Self-updating Memories in Large Language Models Improves Chemical Reasoning
ICLR 2025
MIMIR: A Customizable Agent Tuning Platform for Enhanced Scientific Applications
EMNLP 2024
SmartTrim: Adaptive Tokens and Attention Pruning for Efficient Vision-Language Models
COLING 2024
Struc-Bench: Are Large Language Models Good at Generating Complex Structured Tabular Data?
NAACL 2024
CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models
ACL 2024
OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models
ICML 2024
RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models
ACL 2024
AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning
ACL 2024
LoraRetriever: Input-Aware LoRA Retrieval and Composition for Mixed Tasks in the Wild
ACL 2024
How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs
ECCV 2024
PositionID: LLMs can Control Lengths, Copy and Paste with Explicit Positional Awareness
EMNLP 2024
Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models
EMNLP 2023
To Repeat or Not To Repeat: Insights from Scaling LLM under Token-Crisis
NIPS 2023
Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training
ACL 2023
Learning to Predict Persona Information for Dialogue Personalization without Explicit Persona Description
ACL 2023
Commonsense Knowledge Transfer for Pre-trained Language Models
ACL 2023
Modular Transformers: Compressing Transformers into Modularized Layers for Flexible Efficient Inference
ACL 2023
EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive Pruning
ACL 2023
Poor Man’s Quality Estimation: Predicting Reference-Based MT Metrics Without the Reference
EACL 2023
Evaluating Large Language Models on Controlled Generation Tasks
EMNLP 2023
Doolittle: Benchmarks and Corpora for Academic Writing Formalization
EMNLP 2023
Let’s Synthesize Step by Step: Iterative Dataset Synthesis with Large Language Models by Extrapolating Errors from Small Models
EMNLP 2023
Findings of the WMT 2023 Shared Task on Machine Translation with Terminologies
EMNLP 2023
Write and Paint: Generative Vision-Language Models are Unified Modal Learners
ICLR 2023
Controlled Text Generation with Natural Language Instructions
ICML 2023
VLUE: A Multi-Task Multi-Dimension Benchmark for Evaluating Vision-Language Pre-training
ICML 2022
Contextual Representation Learning beyond Masked Language Modeling
ACL 2022
BERT Learns to Teach: Knowledge Distillation with Meta Learning
ACL 2022
Efficiently Tuned Parameters Are Task Embeddings
EMNLP 2022
Pre-training Text-to-Text Transformers for Concept-centric Common Sense
ICLR 2021
Learning from Perturbations: Diverse and Informative Dialogue Generation with Inverse Adversarial Training
IJCNLP 2021
Blow the Dog Whistle: A Chinese Dataset for Cant Understanding with Common Sense and World Knowledge
NAACL 2021
Learning from Perturbations: Diverse and Informative Dialogue Generation with Inverse Adversarial Training
ACL 2021
Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting
EMNLP 2021
Beyond Preserved Accuracy: Evaluating Loyalty and Robustness of BERT Compression
EMNLP 2021
Self-Adversarial Learning with Comparative Discrimination for Text Generation
ICLR 2020
Learning to Compare for Better Training and Evaluation of Open Domain Natural Language Generation Models
AAAI 2020
Improving Grammatical Error Correction with Machine Translation Pairs
EMNLP 2020
Towards Interpretable Natural Language Understanding with Explanations as Latent Variables
NIPS 2020
BERT Loses Patience: Fast and Robust Inference with Early Exit
NIPS 2020
Pseudo-Bidirectional Decoding for Local Sequence Transduction
EMNLP 2020
CommonGen: A Constrained Text Generation Challenge for Generative Commonsense Reasoning
EMNLP 2020
Scheduled DropHead: A Regularization Method for Transformer Models
EMNLP 2020
Connecting the Dots Between Fact Verification and Fake News Detection
COLING 2020
BERT-of-Theseus: Compressing BERT by Progressive Module Replacing
EMNLP 2020
BERT-based Lexical Substitution
ACL 2019