Fernanda Viégas
5 papers · 2018–2025 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+1 more ↓ Show less ↑
🌍 Conference Polyglot (4) 🏃 Academic Marathon (7) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird
🐝
Cross-Pollinator
(15)
Conferences
ICML (2)
ICCV (1)
ICLR (1)
NIPS (1)
Top co-authors
Keywords
image classification
(2)
model interpretability
(1)
saliency map
(1)
attention head
(1)
integrated gradient
(1)
attribution method
(1)
activation steering
(1)
inference-time intervention
(1)
large language model
(1)
neural network
(1)
concept activation vector
(1)
testing with cav
(1)
region-based attribution
(1)
Papers
When Bad Data Leads to Good Models
ICML 2025
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
NIPS 2023
Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task
ICLR 2023
XRAI: Better Attributions Through Regions
ICCV 2019
Interpretability Beyond Feature Attribution: Quantitative Testing with Concept Activation Vectors (TCAV)
ICML 2018