Asma Ghandeharioun
11 papers · 2018–2025 · 7 conferences · across top CS/AI conferences
Achievements
Academic Marathon (7)
Keyword Pioneer
Interdisciplinary Bridge
Conference Polyglot (7)
Cross-Pollinator (8)
Renaissance Researcher (6)
Taxonomy Completionist (22)
Grand Slam
Topic Pioneer
Topic Evolution
The Questioner (2)
Trend Setter
Keyword Collector (53)
Century Club (11)
Conferences
NIPS (4)
ICML (2)
AAAI (1)
AISTATS (1)
EMNLP (1)
ICLR (1)
NAACL (1)
Keywords
large language model (2)
open-domain dialog (2)
sentiment analysis (1)
attention mechanism (1)
knowledge editing (1)
in-context learning (1)
model editing (1)
language model (1)
harmful content (1)
hierarchical reinforcement learning (1)
reward function (1)
scalable inference (1)
hierarchical structure (1)
human feedback (1)
semantic coherence (1)
conversational ai (1)
credit assignment (1)
mechanistic interpretability (1)
deep generative model (1)
policy gradient (1)
Papers
Racing Thoughts: Explaining Contextualization Errors in Large Language Models
NAACL 2025
Patchscopes: A Unifying Framework for Inspecting Hidden Representations of Language Models
ICML 2024
Who's asking? User personas and the mechanics of latent misalignment
NIPS 2024
Interpretability Illusions in the Generalization of Simplified Models
ICML 2024
Does Localization Inform Editing? Surprising Differences in Causality-Based Localization vs. Knowledge Editing in Language Models
NIPS 2023
Post Hoc Explanations of Language Models Can Improve Language Models
NIPS 2023
DISSECT: Disentangled Simultaneous Explanations via Concept Traversals
ICLR 2022
Hierarchical Reinforcement Learning for Open-Domain Dialog
AAAI 2020
Human-centric dialog training via offline reinforcement learning
EMNLP 2020
Approximating Interactive Human Evaluation with Self-Play for Open-Domain Dialog Systems
NIPS 2019
Multimodal Prediction and Personalization of Photo Edits with Deep Generative Models
AISTATS 2018