Katherine Lee
16 papers · 2004–2025 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+7 more ↓ Show less ↑
π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (23) π Conference Polyglot (8) π Academic Marathon (21) π§ Keyword Pioneer
π£
Hot Topic Early Bird
π
Conference Polyglot
(8)
π
Grand Slam
π
Keyword Champion
(4)
π₯
Mega-Team
(67)
π
Century Club
(16)
β‘
Prolific Year
(5)
Conferences
ICLR (4)
ACL (2)
ICML (2)
JMLR (2)
NAACL (2)
NIPS (2)
AAAI (1)
COLING (1)
Top co-authors
Keywords
language model
(5)
training datum
(4)
large language model
(2)
binary classification
(1)
transfer learning
(1)
ensemble learning
(1)
knowledge distillation
(1)
model distillation
(1)
model evaluation
(1)
fair classification
(1)
machine learning
(1)
ensemble method
(1)
fairness metric
(1)
data quality
(1)
membership inference
(1)
sensitive information
(1)
personally identifiable information
(1)
temporal shift
(1)
multi-step reasoning
(1)
data memorization
(1)
Papers
Measuring memorization in language models via probabilistic extraction
NAACL 2025
Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training
ACL 2025
Scalable Extraction of Training Data from Aligned, Production Language Models
ICLR 2025
Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon
ICLR 2025
Exploring and Mitigating Adversarial Manipulation of Voting-Based Leaderboards
ICML 2025
Arbitrariness and Social Prediction: The Confounding Role of Variance in Fair Classification
AAAI 2024
A Pretrainerβs Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity
NAACL 2024
Stealing part of a production language model
ICML 2024
PaLM: Scaling Language Modeling with Pathways
JMLR 2023
Students Parrot Their Teachers: Membership Inference on Model Distillation
NIPS 2023
Counterfactual Memorization in Neural Language Models
NIPS 2023
Measuring Forgetting of Memorized Training Examples
ICLR 2023
Quantifying Memorization Across Neural Language Models
ICLR 2023
Deduplicating Training Data Makes Language Models Better
ACL 2022
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
JMLR 2020
Analysis and Detection of Reading Miscues for Interactive Literacy Tutors
COLING 2004