Xingwu Sun
27 papers · 2018–2026 · 11 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
๐ Academic Marathon (7) ๐ Interdisciplinary Bridge ๐งญ Keyword Pioneer ๐ Conference Polyglot (11) ๐ Cross-Pollinator (5)
๐
Conference Polyglot
(11)
๐
Academic Marathon
(7)
๐
Renaissance Researcher
(8)
๐ค
Dynamic Duo
(19)
๐ฅ
Mega-Team
(26)
๐งฌ
Topic Evolution
โก
Prolific Year
(5)
๐
Century Club
(26)
๐๏ธ
Keyword Collector
(137)
๐ฅ
Unstoppable
(5)
Conferences
EMNLP (6)
AAAI (5)
ACL (4)
NAACL (4)
ICML (2)
COLING (1)
CVPR (1)
ICCV (1)
IJCAI (1)
IJCNLP (1)
NIPS (1)
Top co-authors
Keywords
neural network
(3)
large language model
(3)
document retrieval
(3)
pseudo query
(2)
transformer architecture
(2)
adversarial attack
(2)
evaluation benchmark
(2)
mixture of expert
(2)
multimodal large language model
(2)
parameter efficiency
(2)
multimodal learning
(2)
visual question answering
(2)
dense retrieval
(2)
state space model
(2)
vision-language model
(2)
direct preference optimization
(1)
preference learning
(1)
catastrophic forgetting
(1)
hyperparameter optimization
(1)
contrastive learning
(1)
Papers
TransMamba: A Sequence-Level Hybrid Transformer-Mamba Language Model
AAAI 2026
Sparsifying Mamba
EMNLP 2025
The Security Threat of Compressed Projectors in Large Vision-Language Models
EMNLP 2025
Hybrid-Tower: Fine-grained Pseudo-query Interaction and Generation for Text-to-Video Retrieval
ICCV 2025
Enhancing Contrastive Learning Inspired by the Philosophy of โThe Blind Men and the Elephantโ
AAAI 2025
Continuous Speech Tokenizer in Text To Speech
NAACL 2025
Language Models โGrokโ to Copy
NAACL 2025
Exploring Forgetting in Large Language Model Pre-Training
ACL 2025
Mitigating Hallucination in Multimodal Large Language Model via Hallucination-targeted Direct Preference Optimization
ACL 2025
PhD: A ChatGPT-Prompted Visual Hallucination Evaluation Dataset
CVPR 2025
QAVA: Query-Agnostic Visual Attack to Large Vision-Language Models
NAACL 2025
Scaling Laws for FloatingโPoint Quantization Training
ICML 2025
HMoE: Heterogeneous Mixture of Experts for Language Modeling
EMNLP 2025
Autonomy-of-Experts Models
ICML 2025
Surge Phenomenon in Optimal Learning Rate and Batch Size Scaling
NIPS 2024
DINGO: Towards Diverse and Fine-Grained Instruction-Following Evaluation
AAAI 2024
Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning
AAAI 2024
LightVLP: A Lightweight Vision-Language Pre-training via Gated Interactive Masked AutoEncoders
COLING 2024
SeeDRec: Sememe-based Diffusion for Sequential Recommendation
IJCAI 2024
TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different Modalities
ACL 2023
An Anchor-based Relative Position Embedding Method for Cross-Modal Tasks
EMNLP 2022
Improving Document Representations by Generating Pseudo Query Embeddings for Dense Retrieval
IJCNLP 2021
TITA: A Two-stage Interaction and Topic-Aware Text Matching Model
NAACL 2021
Improving Document Representations by Generating Pseudo Query Embeddings for Dense Retrieval
ACL 2021
Enhancing Document Ranking with Task-adaptive Training and Segmented Token Recovery Mechanism
EMNLP 2021
A Bidirectional Multi-paragraph Reading Model for Zero-shot Entity Linking
AAAI 2021
Answer-focused and Position-aware Neural Question Generation
EMNLP 2018