Stella Biderman

31 papers · 2021–2025 · 7 conferences · across top CS/AI conferences

Achievements

+10 more ↓

🐝 Cross-Pollinator (12) 🌍 Conference Polyglot (7) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌈 Renaissance Researcher (8)

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🔬 Deep Specialist (10) 👥 Mega-Team (54) 👑 Triple Crown 🔥 Unstoppable (5) ⚡ Prolific Year (10) 💎 Century Club (31) ❓ The Questioner (2) 🗃️ Keyword Collector (111)

Conferences

NIPS (8) ICML (6) ACL (5) ICLR (5) EMNLP (4) NAACL (2) ECCV (1)

Top co-authors

Hailey Schoelkopf (9) Edward Raff (8) Lintang Sutawika (7) Niklas Muennighoff (6) Manan Dey (5) Shayne Longpre (5) M Saiful Bari (4) Zheng Xin Yong (4) Quentin Anthony (4) Thomas Wang (4)

Keywords

large language model (11) transformer architecture (4) multilingual language model (4) cross-lingual transfer (2) model scaling (2) instruction tuning (2) zero-shot generalization (2) few-shot learning (2) neural network (2) representation learning (2) zero-shot learning (1) multilingual nlp (1) attention mechanism (1) prompt engineering (1) reproducible research (1) neural network compression (1) symbolic reasoning (1) multilingual summarization (1) offline reinforcement learning (1) language model training (1)

Papers

Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive? ICML 2025 Bridging the Data Provenance Gap Across Text, Speech, and Video ICLR 2025 KMMLU: Measuring Massive Multitask Language Understanding in Korean NAACL 2025 Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon ICLR 2025 PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs ICLR 2025 Grokking Group Multiplication with Cosets ICML 2024 A Walsh Hadamard Derived Linear Vector Symbolic Architecture NIPS 2024 Stay on Topic with Classifier-Free Guidance ICML 2024 LLM Circuit Analyses Are Consistent Across Training and Scale NIPS 2024 Position: On the Societal Impact of Open Foundation Models ICML 2024 Consent in Crisis: The Rapid Decline of the AI Data Commons NIPS 2024 Re-Evaluating Evaluation for Multilingual Summarization EMNLP 2024 Llemma: An Open Language Model for Mathematics ICLR 2024 trlX: A Framework for Large Scale Reinforcement Learning from Human Feedback EMNLP 2023 The Goldilocks of Pragmatic Understanding: Fine-Tuning Strategy Matters for Implicature Resolution by LLMs NIPS 2023 Emergent and Predictable Memorization in Large Language Models NIPS 2023 LEACE: Perfect linear concept erasure in closed form NIPS 2023 BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting ACL 2023 Crosslingual Generalization through Multitask Finetuning ACL 2023 GAIA Search: Hugging Face and Pyserini Interoperability for NLP Training Data Exploration ACL 2023 RWKV: Reinventing RNNs for the Transformer Era EMNLP 2023 Recasting Self-Attention with Holographic Reduced Representations ICML 2023 Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling ICML 2023 BigBio: A Framework for Data-Centric Biomedical Natural Language Processing NIPS 2022 The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset NIPS 2022 What Language Model to Train if You Have One Million GPU Hours? EMNLP 2022 VQGAN-CLIP: Open Domain Image Generation and Editing with Natural Language Guidance ECCV 2022 GPT-NeoX-20B: An Open-Source Autoregressive Language Model ACL 2022 You reap what you sow: On the Challenges of Bias Evaluation Under Multilingual Settings ACL 2022 Multitask Prompted Training Enables Zero-Shot Task Generalization ICLR 2022 Towards a Model-Theoretic View of Narratives NAACL 2021