Rui Liu
61 papers · 2017–2026 · 16 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+13 more ↓ Show less ↑
π Conference Polyglot (16) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (12) π§ Keyword Pioneer π Academic Marathon (8)
π§
Keyword Pioneer
π
Interdisciplinary Bridge
π
Conference Polyglot
(16)
π¬
Deep Specialist
(12)
π
Grand Slam
π§¬
Topic Evolution
π
Keyword Champion
(3)
ποΈ
Keyword Collector
(259)
π
Conference Pioneer
π
Century Club
(52)
π₯
Unstoppable
(9)
π
Trend Setter
β‘
Prolific Year
(7)
Conferences
AAAI (11)
INTERSPEECH (7)
CVPR (6)
ICCV (6)
ACL (5)
EMNLP (5)
ICML (5)
IJCAI (4)
NIPS (3)
COLING (2)
IJCNLP (2)
ACML (1)
ECCV (1)
ICLR (1)
NAACL (1)
OSDI (1)
Top co-authors
Keywords
large language model
(5)
vision-language navigation
(4)
multimodal learning
(4)
speech synthesis
(4)
domain adaptation
(3)
contrastive learning
(3)
conversational speech synthesis
(3)
emotion recognition
(3)
neural network
(3)
aspect-based sentiment
(2)
communication efficiency
(2)
affective computing
(2)
speech generation
(2)
multi-task learning
(2)
latent space
(2)
speech emotion recognition
(2)
sentiment classification
(2)
depth estimation
(2)
scene understanding
(2)
model compression
(2)
Papers
MMAC: A Multilingual, Multimodal Alignment Framework for Cultural Grounding Evaluation
ACL 2026
What You See Is What You Reach: Towards Spatial Navigation with High-Level Human Instructions
AAAI 2026
Codebook-Centric Deep Hashing: End-to-End Joint Learning of Semantic Hash Centers and Neural Hash Function
AAAI 2026
TellWhisper: Tell Whisper Who Speaks When
ACL 2026
MetaGDPO: Alleviating Catastrophic Forgetting with Metacognitive Knowledge Through Group Direct Preference Optimization
AAAI 2026
Towards Authentic Movie Dubbing with Retrieve-Augmented Director-Actor Interaction Learning
AAAI 2026
MMMamba: A Versatile Cross-Modal in Context Fusion Framework for Pan-Sharpening and Zero-Shot Image Enhancement
AAAI 2026
MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning
ACL 2026
VBF++: Variational Bayesian Fusion with Context-Aware Priors and Recommendation-Guided Adversarial Refinement for Multimodal Video Recommendation
AAAI 2026
LM-Searcher: Cross-domain Neural Architecture Search with LLMs via Unified Numerical Encoding
EMNLP 2025
Multi-modal and Multi-scale Spatial Environment Understanding for Immersive Visual Text-to-Speech
AAAI 2025
Chain-Talker: Chain Understanding and Rendering for Empathetic Conversational Speech Synthesis
ACL 2025
Scene Map-based Prompt Tuning for Navigation Instruction Generation
CVPR 2025
Multimodal Fine-grained Context Interaction Graph Modeling for Conversational Speech Synthesis
EMNLP 2025
Dual-Path Counterfactual Integration for Multimodal Aspect-Based Sentiment Classification
EMNLP 2025
RoboAnnotatorX: A Comprehensive and Universal Annotation Framework for Accurate Understanding of Long-horizon Robot Demonstration
ICCV 2025
3D Gaussian Map with Open-Set Semantic Grouping for Vision-Language Navigation
ICCV 2025
Underwater Visual SLAM with Depth Uncertainty and Medium Modeling
ICCV 2025
AffectGPT: A New Dataset, Model, and Benchmark for Emotion Understanding with Multimodal Large Language Models
ICML 2025
OV-MER: Towards Open-Vocabulary Multimodal Emotion Recognition
ICML 2025
Volumetric Environment Representation for Vision-Language Navigation
CVPR 2024
Navigation Instruction Generation with BEV Perception and Large Language Models
ECCV 2024
Emotion-Aware Speech Self-Supervised Representation Learning with Intensity Knowledge
INTERSPEECH 2024
Vision-Language Navigation with Energy-Based Policy
NIPS 2024
Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling
AAAI 2024
Infrared Small Target Detection with Scale and Location Sensitivity
CVPR 2024
FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency
INTERSPEECH 2024
Coverage-centric Coreset Selection for High Pruning Rates
ICLR 2023
AdaEmbed: Adaptive Embedding for Large-Scale Recommendation Models
OSDI 2023
Bird's-Eye-View Scene Graph for Vision-Language Navigation
ICCV 2023
Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion
INTERSPEECH 2023
Explicit Intensity Control for Accented Text-to-speech
INTERSPEECH 2023
Aspect Is Not You Need: No-aspect Differential Sentiment Framework for Aspect-based Sentiment Analysis
NAACL 2022
Transformer with Memory Replay
AAAI 2022
Gating Dropout: Communication-efficient Regularization for Sparsely Activated Transformers
ICML 2022
Communication-efficient Distributed Learning for Large Batch Optimization
ICML 2022
Neutral Utterances are Also Causes: Enhancing Conversational Causal Emotion Entailment with Social Commonsense Knowledge
IJCAI 2022
Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning
INTERSPEECH 2022
Target Really Matters: Target-aware Contrastive Learning and Consistency Regularization for Few-shot Stance Detection
COLING 2022
HiABP: Hierarchical Initialized ABP for Unsupervised Representation Learning
AAAI 2021
Vector-Decomposed Disentanglement for Domain-Invariant Object Detection
ICCV 2021
SSMF: Shifting Seasonal Matrix Factorization
NIPS 2021
FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting
ICCV 2021
DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network
CVPR 2021
Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability
INTERSPEECH 2021
Enhancing Zero-shot and Few-shot Stance Detection with Commonsense Knowledge Graph
IJCNLP 2021
Temporal Difference Learning as Gradient Splitting
ICML 2021
Enhancing Zero-shot and Few-shot Stance Detection with Commonsense Knowledge Graph
ACL 2021
Efficient Attention Calibration Network for Real-Time Semantic Segmentation
ACML 2020
HyperNews: Simultaneous News Recommendation and Active-Time Prediction via a Double-Task Deep Neural Network
IJCAI 2020
StereoGAN: Bridging Synthetic-to-Real Domain Gap by Joint Optimization of Domain Translation and Stereo Matching
CVPR 2020
Adam with Bandit Sampling for Deep Learning
NIPS 2020
VEST: A System for Vulnerability Exploit Scoring & Timing
IJCAI 2019
Ranking and Sampling in Open-Domain Question Answering
IJCNLP 2019
A Bandit Approach to Maximum Inner Product Search
AAAI 2019
Ranking and Sampling in Open-Domain Question Answering
EMNLP 2019
Conditional Adversarial Generative Flow for Controllable Image Synthesis
CVPR 2019
A LSTM Approach with Sub-Word Embeddings for Mongolian Phrase Break Prediction
COLING 2018
Discrete Factorization Machines for Fast Feature-based Recommendation
IJCAI 2018
Improving Mongolian Phrase Break Prediction by Using Syllable and Morphological Embeddings with BiLSTM Model
INTERSPEECH 2018
Structural Embedding of Syntactic Trees for Machine Comprehension
EMNLP 2017