Yian Zhang
6 papers · 2020–2025 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+2 more ↓ Show less ↑
π Conference Polyglot (4) π Interdisciplinary Bridge π§ Keyword Pioneer πΊοΈ Taxonomy Completionist (24) π Academic Marathon (5)
π
Cross-Pollinator
(13)
β
The Questioner
(3)
Conferences
EMNLP (3)
ACL (1)
ICML (1)
IJCNLP (1)
Top co-authors
Keywords
language model
(3)
commonsense knowledge
(2)
natural language understanding
(2)
self-supervised learning
(1)
language modeling
(1)
instruction following
(1)
language model alignment
(1)
unsupervised parsing
(1)
semantic knowledge
(1)
probing analysis
(1)
instruction tuning
(1)
model alignment
(1)
constituency parsing
(1)
inductive bia
(1)
learning curve
(1)
downstream task
(1)
self-supervised pretraining
(1)
pretrained language model
(1)
pre-training datum
(1)
syntactic structure
(1)
Papers
Position: Language model developers should report train-test overlap
ICML 2025
Dancing in Chains: Reconciling Instruction Following and Faithfulness in Language Models
EMNLP 2024
When Do You Need Billions of Words of Pretraining Data?
ACL 2021
When Do You Need Billions of Words of Pretraining Data?
IJCNLP 2021
Learning Which Features Matter: RoBERTa Acquires a Preference for Linguistic Generalizations (Eventually)
EMNLP 2020
Latent Tree Learning with Ordered Neurons: What Parses Does It Produce?
EMNLP 2020