conftrace_

Xuansheng Wu

9 papers · 2023–2026 · 7 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

🌍 Conference Polyglot (5) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (13) 🧭 Keyword Pioneer 🐝 Cross-Pollinator (15)

Conferences

EMNLP (2) NAACL (2) AAAI (1) ACL (1) EACL (1) ICML (1) NIPS (1)

Top co-authors

Ninghao Liu (8) Mengnan Du (5) Haiyan Zhao (3) Jin Sun (2) Dong Yu (2) Dong Shu (2) Yucheng Shi (2) Wenlin Yao (2) Xiaoyang Wang (2) Kaiqiang Song (1)

Keywords

large language model (4) sparse autoencoder (3) model steering (2) medical imaging (1) instruction following (1) neural network analysis (1) feature disentanglement (1) instruction tuning (1) backdoor attack (1) diffusion model (1) latent representation (1) adversarial defense (1) language model (1) vision-language model (1) model explanation (1) feed-forward network (1) mechanistic interpretability (1) linear probing (1) latent feature (1) gradient analysis (1)

Papers

AutoSCORE: Enhancing Automated Scoring with Multi-Agent Large Language Models via Structured Component Recognition AAAI 2026 Denoising Concept Vectors with Sparse Autoencoders for Improved Language Model Steering EACL 2026 A Survey on Sparse Autoencoders: Interpreting the Internal Mechanisms of Large Language Models EMNLP 2025 Concept-Centric Token Interpretation for Vector-Quantized Generative Models ICML 2025 Beyond Input Activations: Identifying Influential Latents by Gradient Sparse Autoencoders EMNLP 2025 LMOD: A Large Multimodal Ophthalmology Dataset and Benchmark for Large Vision-Language Models NAACL 2025 InFoBench: Evaluating Instruction Following Ability in Large Language Models ACL 2024 From Language Modeling to Instruction Following: Understanding the Behavior Shift in LLMs after Instruction Tuning NAACL 2024 Black-box Backdoor Defense via Zero-shot Image Purification NIPS 2023