Zhengyan Zhang
25 papers · 2017–2025 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
🏃 Academic Marathon (8) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (9) 🐝 Cross-Pollinator (9)
🌍
Conference Polyglot
(9)
🏃
Academic Marathon
(8)
🌈
Renaissance Researcher
(5)
🤝
Dynamic Duo
(24)
🧬
Topic Evolution
📈
Trend Setter
⚡
Prolific Year
(7)
💎
Century Club
(25)
🗃️
Keyword Collector
(116)
🔥
Unstoppable
(7)
Conferences
ACL (11)
EMNLP (4)
COLING (3)
IJCNLP (2)
AAAI (1)
ICML (1)
IJCAI (1)
NAACL (1)
NIPS (1)
Top co-authors
Keywords
model compression
(7)
pre-trained language model
(5)
large language model
(4)
mixture of expert
(3)
domain adaptation
(3)
knowledge injection
(2)
text classification
(2)
model quantization
(2)
parameter efficient
(2)
adversarial learning
(2)
efficient computing
(2)
question answering
(2)
parameter efficiency
(2)
prompt tuning
(2)
attention mechanism
(2)
transfer learning
(2)
parameter-efficient learning
(2)
knowledge distillation
(2)
inference efficiency
(2)
transformer model
(2)
Papers
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention
ACL 2025
ProSparse: Introducing and Enhancing Intrinsic Activation Sparsity within Large Language Models
COLING 2025
InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context Memory
NIPS 2024
Robust and Scalable Model Editing for Large Language Models
COLING 2024
Exploring the Benefit of Activation Sparsity in Pre-training
ICML 2024
Variator: Accelerating Pre-trained Models with Plug-and-Play Compression Modules
EMNLP 2023
READIN: A Chinese Multi-Task Benchmark with Realistic and Diverse Input Noises
ACL 2023
Plug-and-Play Knowledge Injection for Pre-trained Language Models
ACL 2023
Plug-and-Play Document Modules for Pre-trained Models
ACL 2023
Emergent Modularity in Pre-trained Transformers
ACL 2023
Prompt Tuning for Discriminative Pre-trained Language Models
ACL 2022
Finding Skill Neurons in Pre-trained Transformer-based Language Models
EMNLP 2022
Knowledge Inheritance for Pre-trained Language Models
NAACL 2022
BMCook: A Task-agnostic Compression Toolkit for Big Models
EMNLP 2022
BMInf: An Efficient Toolkit for Big Model Inference and Tuning
ACL 2022
Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models
COLING 2022
MoEfication: Transformer Feed-forward Layers are Mixtures of Experts
ACL 2022
Better Robustness by More Coverage: Adversarial and Mixup Data Augmentation for Robust Finetuning
ACL 2021
Better Robustness by More Coverage: Adversarial and Mixup Data Augmentation for Robust Finetuning
IJCNLP 2021
Adversarial Language Games for Advanced Natural Language Intelligence
AAAI 2021
Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger
IJCNLP 2021
Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger
ACL 2021
Train No Evil: Selective Masking for Task-Guided Pre-Training
EMNLP 2020
ERNIE: Enhanced Language Representation with Informative Entities
ACL 2019
TransNet: Translation-Based Network Representation Learning for Social Relation Extraction
IJCAI 2017