conftrace_

Piotr Mardziel

4 papers · 2020–2021 · 3 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+2 more ↓

🌍 Conference Polyglot (3) 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (11) 🧭 Keyword Pioneer

🐣 Hot Topic Early Bird 🐝 Cross-Pollinator (15)

Conferences

NIPS (2) AAAI (1) ACL (1)

Top co-authors

Anupam Datta (4) Kaiji Lu (2) Zifan Wang (2) Matt Fredrikson (2) Sanghamitra Dutta (1) Shakul Ramkumar (1) Klas Leino (1) Haofan Wang (1) Pulkit Grover (1) Praveen Venkatesh (1)

Keywords

information theory (1) algorithmic fairness (1) model robustness (1) attention mechanism (1) neural network interpretability (1) feature attribution (1) lipschitz continuity (1) adversarial attack (1) recurrent neural network (1) language model (1) partial information decomposition (1) counterfactual reasoning (1) counterfactual fairness (1) causal analysis (1) information flow (1) syntactic structure (1) lstm language model (1) subject-verb agreement (1) gradient-based attribution (1) influence path (1)

Papers

Influence Patterns for Explaining Information Flow in BERT NIPS 2021 Smoothed Geometry for Robust Attribution NIPS 2020 An Information-Theoretic Quantification of Discrimination with Exempt Features AAAI 2020 Influence Paths for Characterizing Subject-Verb Number Agreement in LSTM Language Models ACL 2020