Xuansheng Wu
9 papers · 2023–2026 · 7 top CS/AI conferences
Achievements
Conference Polyglot (5)
Interdisciplinary Bridge
Taxonomy Completionist (13)
Keyword Pioneer
Cross-Pollinator (15)
Conferences
EMNLP (2)
NAACL (2)
AAAI (1)
ACL (1)
EACL (1)
ICML (1)
NeurIPS (1)
Keywords
large language model (4)
sparse autoencoder (3)
model steering (2)
medical imaging (1)
instruction following (1)
neural network analysis (1)
feature disentanglement (1)
instruction tuning (1)
backdoor attack (1)
diffusion model (1)
latent representation (1)
adversarial defense (1)
language model (1)
vision-language model (1)
model explanation (1)
feed-forward network (1)
mechanistic interpretability (1)
linear probing (1)
latent feature (1)
gradient analysis (1)
Papers
AutoSCORE: Enhancing Automated Scoring with Multi-Agent Large Language Models via Structured Component Recognition
AAAI 2026
Denoising Concept Vectors with Sparse Autoencoders for Improved Language Model Steering
EACL 2026
A Survey on Sparse Autoencoders: Interpreting the Internal Mechanisms of Large Language Models
EMNLP 2025
Concept-Centric Token Interpretation for Vector-Quantized Generative Models
ICML 2025
Beyond Input Activations: Identifying Influential Latents by Gradient Sparse Autoencoders
EMNLP 2025
LMOD: A Large Multimodal Ophthalmology Dataset and Benchmark for Large Vision-Language Models
NAACL 2025
InFoBench: Evaluating Instruction Following Ability in Large Language Models
ACL 2024
From Language Modeling to Instruction Following: Understanding the Behavior Shift in LLMs after Instruction Tuning
NAACL 2024
Black-box Backdoor Defense via Zero-shot Image Purification
NeurIPS 2023