Xiangmin Xu

27 papers · 2016–2026 · 8 conferences · across top CS/AI conferences

Achievements

🏃 Academic Marathon (9) 🌍 Conference Polyglot (8) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🐝 Cross-Pollinator (11) 🤝 Dynamic Duo (17) 🧬 Topic Evolution 🔥 Unstoppable (5) 🚀 Conference Pioneer 📈 Trend Setter 🗃️ Keyword Collector (120) ⚡ Prolific Year (10) 💎 Century Club (26)

Conferences

INTERSPEECH (7) EMNLP (5) CVPR (4) ACL (3) MICCAI (3) AAAI (2) ICCV (2) NAACL (1)

Top co-authors

Xiaofen Xing (17) Yirong Chen (5) Weibin Zhang (4) Xin Zhang (4) Changxing Ding (4) Jingkai Lin (3) Tong Xiong (3) Yawen Zeng (2) Wenyu Tao (2) Chunmei Qing (2)

Keywords

speech emotion recognition (5) attention mechanism (3) retrieval-augmented generation (3) large language model (3) multimodal large language model (3) emotion classification (2) transformer architecture (2) multi-agent system (2) psychological counseling (2) feature extraction (2) convolutional neural network (2) dialogue system (2) self-supervised learning (1) image generation (1) feature distribution (1) object detection (1) semantic segmentation (1) information retrieval (1) domain adaptation (1) image classification (1)

Papers

DAVID: Dual-stage Adaptive Vision-text Integrated Decoupling for Multimodal KV Cache Eviction · AAAI 2026
PsyDT: Using LLMs to Construct the Digital Twin of Psychological Counselor with Personalized Counseling Style for Psychological Counseling · ACL 2025
TreeRAG: Unleashing the Power of Hierarchical Storage for Enhanced Knowledge Retrieval in Long Documents · ACL 2025
Drawing Developmental Trajectory from Cortical Surface Reconstruction · ICCV 2025
Modeling Thousands of Human Annotators for Generalizable Text-to-Image Person Re-identification · CVPR 2025
SAKI-RAG: Mitigating Context Fragmentation in Long-Document RAG via Sentence-level Attention Knowledge Integration · EMNLP 2025
TailorRPA: A Retrieval-Based Framework for Eliciting Personalized and Coherent Role-Playing Agents in General Domain · EMNLP 2025
CATCH: A Novel Data Synthesis Framework for High Therapy Fidelity and Memory-Driven Planning Chain of Thought in AI Counseling · EMNLP 2025
QuantAgents: Towards Multi-agent Financial System via Simulated Trading · EMNLP 2025
Bilateral Collaboration with Large Vision-Language Models for Open Vocabulary Human-Object Interaction Detection · ICCV 2025
Heterogeneous Masked Attention-Guided Path Convolution for Functional Brain Network Analysis · MICCAI 2025
Fetal MRI Reconstruction by Global Diffusion and Consistent Implicit Representation · MICCAI 2024
VideoCoT: A Video Chain-of-Thought Dataset with Active Annotation Tool · ACL 2024
Texture-Preserving Diffusion Models for High-Fidelity Virtual Try-On · CVPR 2024
Disentangled Pre-training for Human-Object Interaction Detection · CVPR 2024
DropFormer: A Dynamic Noise-Dropping Transformer for Speech Emotion Recognition · INTERSPEECH 2024
Cortical Surface Reconstruction from 2D MRI with Segmentation-Constrained Super-Resolution and Representation Learning · MICCAI 2024
Superpoint Transformer for 3D Scene Instance Segmentation · AAAI 2023
Exploring Downstream Transfer of Self-Supervised Features for Speech Emotion Recognition · INTERSPEECH 2023
Multi-Scale Temporal Transformer For Speech Emotion Recognition · INTERSPEECH 2023
SoulChat: Improving LLMs’ Empathy, Listening, and Comfort Abilities through Fine-tuning with Multi-turn Empathy Conversations · EMNLP 2023
SpeechFormer: A Hierarchical Efficient Framework Incorporating the Characteristics of Speech · INTERSPEECH 2022
Modeling Compositionality with Dependency Graph for Dialogue Generation · NAACL 2022
Adaptive Domain-Aware Representation Learning for Speech Emotion Recognition · INTERSPEECH 2020
Improving Training of Deep Neural Networks via Singular Value Bounding · CVPR 2017
ResNet and Model Fusion for Automatic Spoofing Detection · INTERSPEECH 2017
Improved Music Genre Classification with Convolutional Neural Networks · INTERSPEECH 2016