Alexandra Olteanu

14 papers · 2021–2025 · 5 conferences · across top CS/AI conferences

Achievements

+8 more ↓

🐝 Cross-Pollinator (6) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (5) 🌈 Renaissance Researcher (5)

🌍 Conference Polyglot (5) 🌈 Renaissance Researcher (5) 🤝 Dynamic Duo (10) 👥 Mega-Team (20) 🔥 Unstoppable (5) 💎 Century Club (14) 🗃️ Keyword Collector (57) ❓ The Questioner

Conferences

ACL (8) IJCNLP (2) NAACL (2) EMNLP (1) ICML (1)

Top co-authors

Su Lin Blodgett (10) Adam Trischler (6) Hanna Wallach (6) Kaheer Suleman (5) Jackie Chi Kit CHEUNG (4) Emily Sheng (3) Ian Porada (3) Alexandra Chouldechova (2) Chad Atalla (2) Yu Lu Liu (2)

Keywords

coreference resolution (4) stereotyping detection (3) world knowledge (2) fairness benchmark (2) measurement validity (2) nlp evaluation (2) semantic plausibility (2) measurement modeling (2) text generation (2) measurement model (2) natural language generation (2) evaluation practice (1) representational harm (1) pretrained language model (1) commonsense reasoning (1) text summarization (1) intervention mechanism (1) ai safety (1) bias evaluation (1) natural language understanding (1)

Papers

Dehumanizing Machines: Mitigating Anthropomorphic Behaviors in Text Generation Systems ACL 2025 Position: Evaluating Generative AI Systems Is a Social Science Measurement Challenge ICML 2025 Understanding and Meeting Practitioner Needs When Measuring Representational Harms Caused by LLM-Based Systems ACL 2025 “One-Size-Fits-All”? Examining Expectations around What Constitute “Fair” or “Good” NLG System Behaviors NAACL 2024 ECBD: Evidence-Centered Benchmark Design for NLP ACL 2024 Challenges to Evaluating the Generalization of Coreference Resolution Models: A Measurement Modeling Perspective ACL 2024 Responsible AI Considerations in Text Summarization Research: A Review of Current Practices EMNLP 2023 FairPrism: Evaluating Fairness-Related Harms in Text Generation ACL 2023 The KITMUS Test: Evaluating Knowledge Integration from Multiple Sources ACL 2023 Deconstructing NLG Evaluation: Evaluation Practices, Assumptions, and Their Implications NAACL 2022 Stereotyping Norwegian Salmon: An Inventory of Pitfalls in Fairness Benchmark Datasets ACL 2021 ADEPT: An Adjective-Dependent Plausibility Task ACL 2021 Stereotyping Norwegian Salmon: An Inventory of Pitfalls in Fairness Benchmark Datasets IJCNLP 2021 ADEPT: An Adjective-Dependent Plausibility Task IJCNLP 2021