Shahar Katz
7 papers · 2023–2026 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+3 more ↓ Show less ↑
π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (3) π Cross-Pollinator (12) πΊοΈ Taxonomy Completionist (14)
π£
Hot Topic Early Bird
π
Keyword Champion
(2)
π
Trend Setter
Conferences
ACL (2)
EMNLP (2)
AAAI (1)
EACL (1)
NAACL (1)
Top co-authors
Keywords
attention mechanism
(3)
language model
(2)
generative pre-trained transformer
(2)
jailbreak attack
(2)
gradient descent
(1)
support vector machine
(1)
hidden state
(1)
model fine-tuning
(1)
attention head
(1)
hidden state analysis
(1)
attention masking
(1)
linear representation
(1)
information flow
(1)
gradient analysis
(1)
random forest classifier
(1)
causal masking
(1)
prefill phase
(1)
transformer interpretability
(1)
weight modification
(1)
vocabulary space
(1)
Papers
TensorLens: End-to-End Transformer Analysis via High-Order Attention Tensors
ACL 2026
Safeguarding Language Models via Self-Destruct Trapdoor
EACL 2026
AlignTree: Efficient Defense Against LLM Jailbreak Attacks
AAAI 2026
Reversed Attention: On The Gradient Descent Of Attention Layers In GPT
NAACL 2025
Segment-Based Attention Masking for GPTs
ACL 2025
Backward Lens: Projecting Language Model Gradients into the Vocabulary Space
EMNLP 2024
VISIT: Visualizing and Interpreting the Semantic Information Flow of Transformers
EMNLP 2023