Stella Biderman
31 papers · 2021–2025 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
🐝 Cross-Pollinator (12) 🌍 Conference Polyglot (7) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌈 Renaissance Researcher (8)
🧭
Keyword Pioneer
🐣
Hot Topic Early Bird
🔬
Deep Specialist
(10)
👥
Mega-Team
(54)
👑
Triple Crown
🔥
Unstoppable
(5)
⚡
Prolific Year
(10)
💎
Century Club
(31)
❓
The Questioner
(2)
🗃️
Keyword Collector
(111)
Conferences
NIPS (8)
ICML (6)
ACL (5)
ICLR (5)
EMNLP (4)
NAACL (2)
ECCV (1)
Top co-authors
Keywords
large language model
(11)
transformer architecture
(4)
multilingual language model
(4)
cross-lingual transfer
(2)
model scaling
(2)
instruction tuning
(2)
zero-shot generalization
(2)
few-shot learning
(2)
neural network
(2)
representation learning
(2)
zero-shot learning
(1)
multilingual nlp
(1)
attention mechanism
(1)
prompt engineering
(1)
reproducible research
(1)
neural network compression
(1)
symbolic reasoning
(1)
multilingual summarization
(1)
offline reinforcement learning
(1)
language model training
(1)
Papers
Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?
ICML 2025
Bridging the Data Provenance Gap Across Text, Speech, and Video
ICLR 2025
KMMLU: Measuring Massive Multitask Language Understanding in Korean
NAACL 2025
Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon
ICLR 2025
PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs
ICLR 2025
Grokking Group Multiplication with Cosets
ICML 2024
A Walsh Hadamard Derived Linear Vector Symbolic Architecture
NIPS 2024
Stay on Topic with Classifier-Free Guidance
ICML 2024
LLM Circuit Analyses Are Consistent Across Training and Scale
NIPS 2024
Position: On the Societal Impact of Open Foundation Models
ICML 2024
Consent in Crisis: The Rapid Decline of the AI Data Commons
NIPS 2024
Re-Evaluating Evaluation for Multilingual Summarization
EMNLP 2024
Llemma: An Open Language Model for Mathematics
ICLR 2024
trlX: A Framework for Large Scale Reinforcement Learning from Human Feedback
EMNLP 2023
The Goldilocks of Pragmatic Understanding: Fine-Tuning Strategy Matters for Implicature Resolution by LLMs
NIPS 2023
Emergent and Predictable Memorization in Large Language Models
NIPS 2023
LEACE: Perfect linear concept erasure in closed form
NIPS 2023
BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting
ACL 2023
Crosslingual Generalization through Multitask Finetuning
ACL 2023
GAIA Search: Hugging Face and Pyserini Interoperability for NLP Training Data Exploration
ACL 2023
RWKV: Reinventing RNNs for the Transformer Era
EMNLP 2023
Recasting Self-Attention with Holographic Reduced Representations
ICML 2023
Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling
ICML 2023
BigBio: A Framework for Data-Centric Biomedical Natural Language Processing
NIPS 2022
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
NIPS 2022
What Language Model to Train if You Have One Million GPU Hours?
EMNLP 2022
VQGAN-CLIP: Open Domain Image Generation and Editing with Natural Language Guidance
ECCV 2022
GPT-NeoX-20B: An Open-Source Autoregressive Language Model
ACL 2022
You reap what you sow: On the Challenges of Bias Evaluation Under Multilingual Settings
ACL 2022
Multitask Prompted Training Enables Zero-Shot Task Generalization
ICLR 2022
Towards a Model-Theoretic View of Narratives
NAACL 2021