Zhengyan Zhang

25 papers · 2017–2025 · 9 conferences · across top CS/AI conferences

Achievements

+10 more ↓

🏃 Academic Marathon (8) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (9) 🐝 Cross-Pollinator (9)

🌍 Conference Polyglot (9) 🏃 Academic Marathon (8) 🌈 Renaissance Researcher (5) 🤝 Dynamic Duo (24) 🧬 Topic Evolution 📈 Trend Setter ⚡ Prolific Year (7) 💎 Century Club (25) 🗃️ Keyword Collector (116) 🔥 Unstoppable (7)

Conferences

ACL (11) EMNLP (4) COLING (3) IJCNLP (2) AAAI (1) ICML (1) IJCAI (1) NAACL (1) NIPS (1)

Top co-authors

Zhiyuan Liu (24) Maosong Sun (23) Xu Han (13) Yankai Lin (9) Chaojun Xiao (8) Jie Zhou (8) Xiaozhi Wang (5) Yasheng Wang (4) Fanchao Qi (4) Ruobing Xie (4)

Keywords

model compression (7) pre-trained language model (5) large language model (4) mixture of expert (3) domain adaptation (3) knowledge injection (2) text classification (2) model quantization (2) parameter efficient (2) adversarial learning (2) efficient computing (2) question answering (2) parameter efficiency (2) prompt tuning (2) attention mechanism (2) transfer learning (2) parameter-efficient learning (2) knowledge distillation (2) inference efficiency (2) transformer model (2)

Papers

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention ACL 2025 ProSparse: Introducing and Enhancing Intrinsic Activation Sparsity within Large Language Models COLING 2025 InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context Memory NIPS 2024 Robust and Scalable Model Editing for Large Language Models COLING 2024 Exploring the Benefit of Activation Sparsity in Pre-training ICML 2024 Variator: Accelerating Pre-trained Models with Plug-and-Play Compression Modules EMNLP 2023 READIN: A Chinese Multi-Task Benchmark with Realistic and Diverse Input Noises ACL 2023 Plug-and-Play Knowledge Injection for Pre-trained Language Models ACL 2023 Plug-and-Play Document Modules for Pre-trained Models ACL 2023 Emergent Modularity in Pre-trained Transformers ACL 2023 Prompt Tuning for Discriminative Pre-trained Language Models ACL 2022 Finding Skill Neurons in Pre-trained Transformer-based Language Models EMNLP 2022 Knowledge Inheritance for Pre-trained Language Models NAACL 2022 BMCook: A Task-agnostic Compression Toolkit for Big Models EMNLP 2022 BMInf: An Efficient Toolkit for Big Model Inference and Tuning ACL 2022 Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models COLING 2022 MoEfication: Transformer Feed-forward Layers are Mixtures of Experts ACL 2022 Better Robustness by More Coverage: Adversarial and Mixup Data Augmentation for Robust Finetuning ACL 2021 Better Robustness by More Coverage: Adversarial and Mixup Data Augmentation for Robust Finetuning IJCNLP 2021 Adversarial Language Games for Advanced Natural Language Intelligence AAAI 2021 Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger IJCNLP 2021 Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger ACL 2021 Train No Evil: Selective Masking for Task-Guided Pre-Training EMNLP 2020 ERNIE: Enhanced Language Representation with Informative Entities ACL 2019 TransNet: Translation-Based Network Representation Learning for Social Relation Extraction IJCAI 2017