Xingwu Sun

27 papers · 2018–2026 · 11 conferences · across top CS/AI conferences

Achievements

+10 more ↓

🏃 Academic Marathon (7) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (11) 🐝 Cross-Pollinator (5)

🌍 Conference Polyglot (11) 🏃 Academic Marathon (7) 🌈 Renaissance Researcher (8) 🤝 Dynamic Duo (19) 👥 Mega-Team (26) 🧬 Topic Evolution ⚡ Prolific Year (5) 💎 Century Club (26) 🗃️ Keyword Collector (137) 🔥 Unstoppable (5)

Conferences

EMNLP (6) AAAI (5) ACL (4) NAACL (4) ICML (2) COLING (1) CVPR (1) ICCV (1) IJCAI (1) IJCNLP (1) NIPS (1)

Top co-authors

Zhanhui Kang (20) Ruobing Xie (16) Di Wang (7) Beihong Jin (5) Hongyin Tang (5) Shuaipeng Li (5) Fuzheng Zhang (5) Zhen Yang (4) Fengzong Lian (4) Xirong Li (3)

Keywords

neural network (3) large language model (3) document retrieval (3) pseudo query (2) transformer architecture (2) adversarial attack (2) evaluation benchmark (2) mixture of expert (2) multimodal large language model (2) parameter efficiency (2) multimodal learning (2) visual question answering (2) dense retrieval (2) state space model (2) vision-language model (2) direct preference optimization (1) preference learning (1) catastrophic forgetting (1) hyperparameter optimization (1) contrastive learning (1)

Papers

TransMamba: A Sequence-Level Hybrid Transformer-Mamba Language Model AAAI 2026 Sparsifying Mamba EMNLP 2025 The Security Threat of Compressed Projectors in Large Vision-Language Models EMNLP 2025 Hybrid-Tower: Fine-grained Pseudo-query Interaction and Generation for Text-to-Video Retrieval ICCV 2025 Enhancing Contrastive Learning Inspired by the Philosophy of “The Blind Men and the Elephant” AAAI 2025 Continuous Speech Tokenizer in Text To Speech NAACL 2025 Language Models “Grok” to Copy NAACL 2025 Exploring Forgetting in Large Language Model Pre-Training ACL 2025 Mitigating Hallucination in Multimodal Large Language Model via Hallucination-targeted Direct Preference Optimization ACL 2025 PhD: A ChatGPT-Prompted Visual Hallucination Evaluation Dataset CVPR 2025 QAVA: Query-Agnostic Visual Attack to Large Vision-Language Models NAACL 2025 Scaling Laws for Floating–Point Quantization Training ICML 2025 HMoE: Heterogeneous Mixture of Experts for Language Modeling EMNLP 2025 Autonomy-of-Experts Models ICML 2025 Surge Phenomenon in Optimal Learning Rate and Batch Size Scaling NIPS 2024 DINGO: Towards Diverse and Fine-Grained Instruction-Following Evaluation AAAI 2024 Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning AAAI 2024 LightVLP: A Lightweight Vision-Language Pre-training via Gated Interactive Masked AutoEncoders COLING 2024 SeeDRec: Sememe-based Diffusion for Sequential Recommendation IJCAI 2024 TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different Modalities ACL 2023 An Anchor-based Relative Position Embedding Method for Cross-Modal Tasks EMNLP 2022 Improving Document Representations by Generating Pseudo Query Embeddings for Dense Retrieval IJCNLP 2021 TITA: A Two-stage Interaction and Topic-Aware Text Matching Model NAACL 2021 Improving Document Representations by Generating Pseudo Query Embeddings for Dense Retrieval ACL 2021 Enhancing Document Ranking with Task-adaptive Training and Segmented Token Recovery Mechanism EMNLP 2021 A Bidirectional Multi-paragraph Reading Model for Zero-shot Entity Linking AAAI 2021 Answer-focused and Position-aware Neural Question Generation EMNLP 2018