conftrace_

Fernanda Viégas

5 papers · 2018–2025 · 4 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+1 more ↓

🌍 Conference Polyglot (4) 🏃 Academic Marathon (7) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird

🐝 Cross-Pollinator (15)

Conferences

ICML (2) ICCV (1) ICLR (1) NIPS (1)

Top co-authors

Martin Wattenberg (4) Kenneth Li (3) Hanspeter Pfister (2) Michael Terry (1) Justin Gilmer (1) Oam Patel (1) David Bau (1) James Wexler (1) Rory sayres (1) Aspen K Hopkins (1)

Keywords

image classification (2) model interpretability (1) saliency map (1) attention head (1) integrated gradient (1) attribution method (1) activation steering (1) inference-time intervention (1) large language model (1) neural network (1) concept activation vector (1) testing with cav (1) region-based attribution (1)

Papers

When Bad Data Leads to Good Models ICML 2025 Inference-Time Intervention: Eliciting Truthful Answers from a Language Model NIPS 2023 Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task ICLR 2023 XRAI: Better Attributions Through Regions ICCV 2019 Interpretability Beyond Feature Attribution: Quantitative Testing with Concept Activation Vectors (TCAV) ICML 2018