Xiangmin Xu
27 papers · 2016–2026 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+11 more ↓ Show less ↑
🏃 Academic Marathon (9) 🌍 Conference Polyglot (8) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird
🐝
Cross-Pollinator
(11)
🌍
Conference Polyglot
(8)
🏃
Academic Marathon
(9)
🤝
Dynamic Duo
(17)
🧬
Topic Evolution
🔥
Unstoppable
(5)
🚀
Conference Pioneer
📈
Trend Setter
🗃️
Keyword Collector
(120)
⚡
Prolific Year
(10)
💎
Century Club
(26)
Conferences
INTERSPEECH (7)
EMNLP (5)
CVPR (4)
ACL (3)
MICCAI (3)
AAAI (2)
ICCV (2)
NAACL (1)
Top co-authors
Keywords
speech emotion recognition
(5)
attention mechanism
(3)
retrieval-augmented generation
(3)
large language model
(3)
multimodal large language model
(3)
emotion classification
(2)
transformer architecture
(2)
multi-agent system
(2)
psychological counseling
(2)
feature extraction
(2)
convolutional neural network
(2)
dialogue system
(2)
self-supervised learning
(1)
image generation
(1)
feature distribution
(1)
object detection
(1)
semantic segmentation
(1)
information retrieval
(1)
domain adaptation
(1)
image classification
(1)
Papers
DAVID: Dual-stage Adaptive Vision-text Integrated Decoupling for Multimodal KV Cache Eviction
AAAI 2026
PsyDT: Using LLMs to Construct the Digital Twin of Psychological Counselor with Personalized Counseling Style for Psychological Counseling
ACL 2025
TreeRAG: Unleashing the Power of Hierarchical Storage for Enhanced Knowledge Retrieval in Long Documents
ACL 2025
Drawing Developmental Trajectory from Cortical Surface Reconstruction
ICCV 2025
Modeling Thousands of Human Annotators for Generalizable Text-to-Image Person Re-identification
CVPR 2025
SAKI-RAG: Mitigating Context Fragmentation in Long-Document RAG via Sentence-level Attention Knowledge Integration
EMNLP 2025
TailorRPA: A Retrieval-Based Framework for Eliciting Personalized and Coherent Role-Playing Agents in General Domain
EMNLP 2025
CATCH: A Novel Data Synthesis Framework for High Therapy Fidelity and Memory-Driven Planning Chain of Thought in AI Counseling
EMNLP 2025
QuantAgents: Towards Multi-agent Financial System via Simulated Trading
EMNLP 2025
Bilateral Collaboration with Large Vision-Language Models for Open Vocabulary Human-Object Interaction Detection
ICCV 2025
Heterogeneous Masked Attention-Guided Path Convolution for Functional Brain Network Analysis
MICCAI 2025
Fetal MRI Reconstruction by Global Diffusion and Consistent Implicit Representation
MICCAI 2024
VideoCoT: A Video Chain-of-Thought Dataset with Active Annotation Tool
ACL 2024
Texture-Preserving Diffusion Models for High-Fidelity Virtual Try-On
CVPR 2024
Disentangled Pre-training for Human-Object Interaction Detection
CVPR 2024
DropFormer: A Dynamic Noise-Dropping Transformer for Speech Emotion Recognition
INTERSPEECH 2024
Cortical Surface Reconstruction from 2D MRI with Segmentation-Constrained Super-Resolution and Representation Learning
MICCAI 2024
Superpoint Transformer for 3D Scene Instance Segmentation
AAAI 2023
Exploring Downstream Transfer of Self-Supervised Features for Speech Emotion Recognition
INTERSPEECH 2023
Multi-Scale Temporal Transformer For Speech Emotion Recognition
INTERSPEECH 2023
SoulChat: Improving LLMs’ Empathy, Listening, and Comfort Abilities through Fine-tuning with Multi-turn Empathy Conversations
EMNLP 2023
SpeechFormer: A Hierarchical Efficient Framework Incorporating the Characteristics of Speech
INTERSPEECH 2022
Modeling Compositionality with Dependency Graph for Dialogue Generation
NAACL 2022
Adaptive Domain-Aware Representation Learning for Speech Emotion Recognition
INTERSPEECH 2020
Improving Training of Deep Neural Networks via Singular Value Bounding
CVPR 2017
ResNet and Model Fusion for Automatic Spoofing Detection
INTERSPEECH 2017
Improved Music Genre Classification with Convolutional Neural Networks
INTERSPEECH 2016