Piotr Mardziel
4 papers · 2020–2021 · 3 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (3)
Renaissance Researcher (5)
Interdisciplinary Bridge
Taxonomy Completionist (11)
Keyword Pioneer
Hot Topic Early Bird
Cross-Pollinator (15)
Conferences
NIPS (2)
AAAI (1)
ACL (1)
Top co-authors
Keywords
information theory (1)
algorithmic fairness (1)
model robustness (1)
attention mechanism (1)
neural network interpretability (1)
feature attribution (1)
Lipschitz continuity (1)
adversarial attack (1)
recurrent neural network (1)
language model (1)
partial information decomposition (1)
counterfactual reasoning (1)
counterfactual fairness (1)
causal analysis (1)
information flow (1)
syntactic structure (1)
LSTM language model (1)
subject-verb agreement (1)
gradient-based attribution (1)
influence path (1)
Papers
Influence Patterns for Explaining Information Flow in BERT
NIPS 2021
Smoothed Geometry for Robust Attribution
NIPS 2020
An Information-Theoretic Quantification of Discrimination with Exempt Features
AAAI 2020
Influence Paths for Characterizing Subject-Verb Number Agreement in LSTM Language Models
ACL 2020