Alexandra Olteanu
14 papers · 2021–2025 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
π Cross-Pollinator (6) π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (5) π Renaissance Researcher (5)
π
Conference Polyglot
(5)
π
Renaissance Researcher
(5)
π€
Dynamic Duo
(10)
π₯
Mega-Team
(20)
π₯
Unstoppable
(5)
π
Century Club
(14)
ποΈ
Keyword Collector
(57)
β
The Questioner
Conferences
ACL (8)
IJCNLP (2)
NAACL (2)
EMNLP (1)
ICML (1)
Top co-authors
Keywords
coreference resolution
(4)
stereotyping detection
(3)
world knowledge
(2)
fairness benchmark
(2)
measurement validity
(2)
nlp evaluation
(2)
semantic plausibility
(2)
measurement modeling
(2)
text generation
(2)
measurement model
(2)
natural language generation
(2)
evaluation practice
(1)
representational harm
(1)
pretrained language model
(1)
commonsense reasoning
(1)
text summarization
(1)
intervention mechanism
(1)
ai safety
(1)
bias evaluation
(1)
natural language understanding
(1)
Papers
Dehumanizing Machines: Mitigating Anthropomorphic Behaviors in Text Generation Systems
ACL 2025
Position: Evaluating Generative AI Systems Is a Social Science Measurement Challenge
ICML 2025
Understanding and Meeting Practitioner Needs When Measuring Representational Harms Caused by LLM-Based Systems
ACL 2025
βOne-Size-Fits-Allβ? Examining Expectations around What Constitute βFairβ or βGoodβ NLG System Behaviors
NAACL 2024
ECBD: Evidence-Centered Benchmark Design for NLP
ACL 2024
Challenges to Evaluating the Generalization of Coreference Resolution Models: A Measurement Modeling Perspective
ACL 2024
Responsible AI Considerations in Text Summarization Research: A Review of Current Practices
EMNLP 2023
FairPrism: Evaluating Fairness-Related Harms in Text Generation
ACL 2023
The KITMUS Test: Evaluating Knowledge Integration from Multiple Sources
ACL 2023
Deconstructing NLG Evaluation: Evaluation Practices, Assumptions, and Their Implications
NAACL 2022
Stereotyping Norwegian Salmon: An Inventory of Pitfalls in Fairness Benchmark Datasets
ACL 2021
ADEPT: An Adjective-Dependent Plausibility Task
ACL 2021
Stereotyping Norwegian Salmon: An Inventory of Pitfalls in Fairness Benchmark Datasets
IJCNLP 2021
ADEPT: An Adjective-Dependent Plausibility Task
IJCNLP 2021