Rui Liu

61 papers · 2017–2026 · 16 top CS/AI conferences

Achievements

🌍 Conference Polyglot (16) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (12) 🧭 Keyword Pioneer 🏃 Academic Marathon (8)

🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (16) 🔬 Deep Specialist (12) 🏆 Grand Slam 🧬 Topic Evolution 🏆 Keyword Champion (3) 🗃️ Keyword Collector (259) 🚀 Conference Pioneer 💎 Century Club (52) 🔥 Unstoppable (9) 📈 Trend Setter ⚡ Prolific Year (7)

Conferences

AAAI (11) INTERSPEECH (7) CVPR (6) ICCV (6) ACL (5) EMNLP (5) ICML (5) IJCAI (4) NIPS (3) COLING (2) IJCNLP (2) ACML (1) ECCV (1) ICLR (1) NAACL (1) OSDI (1)

Top co-authors

Haizhou Li (9) Wenguan Wang (7) Yi Yang (7) Zheng Lin (6) Hongsheng Li (6) Weiping Wang (6) Guanglai Gao (5) Barzan Mozafari (4) Yifan Hu (4) Xiaogang Wang (4)

Keywords

large language model (5) vision-language navigation (4) multimodal learning (4) speech synthesis (4) domain adaptation (3) contrastive learning (3) conversational speech synthesis (3) emotion recognition (3) neural network (3) aspect-based sentiment (2) communication efficiency (2) affective computing (2) speech generation (2) multi-task learning (2) latent space (2) speech emotion recognition (2) sentiment classification (2) depth estimation (2) scene understanding (2) model compression (2)

Papers

MMAC: A Multilingual, Multimodal Alignment Framework for Cultural Grounding Evaluation ACL 2026
What You See Is What You Reach: Towards Spatial Navigation with High-Level Human Instructions AAAI 2026
Codebook-Centric Deep Hashing: End-to-End Joint Learning of Semantic Hash Centers and Neural Hash Function AAAI 2026
TellWhisper: Tell Whisper Who Speaks When ACL 2026
MetaGDPO: Alleviating Catastrophic Forgetting with Metacognitive Knowledge Through Group Direct Preference Optimization AAAI 2026
Towards Authentic Movie Dubbing with Retrieve-Augmented Director-Actor Interaction Learning AAAI 2026
MMMamba: A Versatile Cross-Modal in Context Fusion Framework for Pan-Sharpening and Zero-Shot Image Enhancement AAAI 2026
MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning ACL 2026
VBF++: Variational Bayesian Fusion with Context-Aware Priors and Recommendation-Guided Adversarial Refinement for Multimodal Video Recommendation AAAI 2026
LM-Searcher: Cross-domain Neural Architecture Search with LLMs via Unified Numerical Encoding EMNLP 2025
Multi-modal and Multi-scale Spatial Environment Understanding for Immersive Visual Text-to-Speech AAAI 2025
Chain-Talker: Chain Understanding and Rendering for Empathetic Conversational Speech Synthesis ACL 2025
Scene Map-based Prompt Tuning for Navigation Instruction Generation CVPR 2025
Multimodal Fine-grained Context Interaction Graph Modeling for Conversational Speech Synthesis EMNLP 2025
Dual-Path Counterfactual Integration for Multimodal Aspect-Based Sentiment Classification EMNLP 2025
RoboAnnotatorX: A Comprehensive and Universal Annotation Framework for Accurate Understanding of Long-horizon Robot Demonstration ICCV 2025
3D Gaussian Map with Open-Set Semantic Grouping for Vision-Language Navigation ICCV 2025
Underwater Visual SLAM with Depth Uncertainty and Medium Modeling ICCV 2025
AffectGPT: A New Dataset, Model, and Benchmark for Emotion Understanding with Multimodal Large Language Models ICML 2025
OV-MER: Towards Open-Vocabulary Multimodal Emotion Recognition ICML 2025
Volumetric Environment Representation for Vision-Language Navigation CVPR 2024
Navigation Instruction Generation with BEV Perception and Large Language Models ECCV 2024
Emotion-Aware Speech Self-Supervised Representation Learning with Intensity Knowledge INTERSPEECH 2024
Vision-Language Navigation with Energy-Based Policy NIPS 2024
Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling AAAI 2024
Infrared Small Target Detection with Scale and Location Sensitivity CVPR 2024
FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency INTERSPEECH 2024
Coverage-centric Coreset Selection for High Pruning Rates ICLR 2023
AdaEmbed: Adaptive Embedding for Large-Scale Recommendation Models OSDI 2023
Bird's-Eye-View Scene Graph for Vision-Language Navigation ICCV 2023
Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion INTERSPEECH 2023
Explicit Intensity Control for Accented Text-to-speech INTERSPEECH 2023
Aspect Is Not You Need: No-aspect Differential Sentiment Framework for Aspect-based Sentiment Analysis NAACL 2022
Transformer with Memory Replay AAAI 2022
Gating Dropout: Communication-efficient Regularization for Sparsely Activated Transformers ICML 2022
Communication-efficient Distributed Learning for Large Batch Optimization ICML 2022
Neutral Utterances are Also Causes: Enhancing Conversational Causal Emotion Entailment with Social Commonsense Knowledge IJCAI 2022
Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning INTERSPEECH 2022
Target Really Matters: Target-aware Contrastive Learning and Consistency Regularization for Few-shot Stance Detection COLING 2022
HiABP: Hierarchical Initialized ABP for Unsupervised Representation Learning AAAI 2021
Vector-Decomposed Disentanglement for Domain-Invariant Object Detection ICCV 2021
SSMF: Shifting Seasonal Matrix Factorization NIPS 2021
FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting ICCV 2021
DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network CVPR 2021
Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability INTERSPEECH 2021
Enhancing Zero-shot and Few-shot Stance Detection with Commonsense Knowledge Graph IJCNLP 2021
Temporal Difference Learning as Gradient Splitting ICML 2021
Enhancing Zero-shot and Few-shot Stance Detection with Commonsense Knowledge Graph ACL 2021
Efficient Attention Calibration Network for Real-Time Semantic Segmentation ACML 2020
HyperNews: Simultaneous News Recommendation and Active-Time Prediction via a Double-Task Deep Neural Network IJCAI 2020
StereoGAN: Bridging Synthetic-to-Real Domain Gap by Joint Optimization of Domain Translation and Stereo Matching CVPR 2020
Adam with Bandit Sampling for Deep Learning NIPS 2020
VEST: A System for Vulnerability Exploit Scoring & Timing IJCAI 2019
Ranking and Sampling in Open-Domain Question Answering IJCNLP 2019
A Bandit Approach to Maximum Inner Product Search AAAI 2019
Ranking and Sampling in Open-Domain Question Answering EMNLP 2019
Conditional Adversarial Generative Flow for Controllable Image Synthesis CVPR 2019
A LSTM Approach with Sub-Word Embeddings for Mongolian Phrase Break Prediction COLING 2018
Discrete Factorization Machines for Fast Feature-based Recommendation IJCAI 2018
Improving Mongolian Phrase Break Prediction by Using Syllable and Morphological Embeddings with BiLSTM Model INTERSPEECH 2018
Structural Embedding of Syntactic Trees for Machine Comprehension EMNLP 2017