Rui Hu
31 papers · 2019–2026 · 11 top CS/AI conferences
Achievements
Conference Polyglot (11)
Academic Marathon (6)
Interdisciplinary Bridge
Keyword Pioneer
Cross-Pollinator (12)
Renaissance Researcher (6)
Taxonomy Completionist (55)
Topic Evolution
Mega-Team (22)
Dynamic Duo (10)
Grand Slam
Keyword Collector (115)
Prolific Year (12)
Conference Pioneer
Century Club (27)
Conferences
CVPR (9)
ACL (4)
ECCV (4)
AAAI (3)
ICLR (3)
ICCV (2)
WACV (2)
ICML (1)
IJCAI (1)
MICCAI (1)
NIPS (1)
Keywords
multimodal large language model (5)
federated learning (4)
attention mechanism (3)
vision-language model (3)
object detection (2)
semantic segmentation (2)
depth estimation (2)
backdoor attack (2)
instance segmentation (2)
mask prediction (2)
multimodal learning (2)
image segmentation (1)
deformable convolution (1)
knowledge distillation (1)
differential privacy (1)
domain adaptation (1)
policy optimization (1)
reinforcement learning (1)
autonomous driving (1)
document parsing (1)
Papers
XMark: Reliable Multi-Bit Watermarking for LLM-Generated Texts
ACL 2026
VAPO: End-to-end Slide-Enhanced Speech Recognition with Omni-modal Large Language Models
ACL 2026
Q Cache: Visual Attention Is Valuable in Less than Half of Decode Layers for Multimodal Large Language Model
AAAI 2026
LENS: Learning to Segment Anything with Unified Reinforced Reasoning
AAAI 2026
Achieving Byzantine-Resilient Federated Learning via Layer-Adaptive Sparsified Model Aggregation
WACV 2025
Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding
CVPR 2025
ODE: Open-Set Evaluation of Hallucinations in Multimodal Large Language Models
CVPR 2025
BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices
CVPR 2025
Detecting Backdoor Attacks in Federated Learning via Direction Alignment Inspection
CVPR 2025
Identify Backdoored Model in Federated Learning via Individual Unlearning
WACV 2025
Finite-Time Analysis of Discrete-Time Stochastic Interpolants
ICML 2025
ST3: Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming
AAAI 2025
Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks
ICLR 2025
GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding
ICCV 2025
MEraser: An Effective Fingerprint Erasure Approach for Large Language Models
ACL 2025
Investigating and Enhancing Vision-Audio Capability in Omnimodal Large Language Models
ACL 2025
FashionR2R: Texture-preserving Rendered-to-Real Image Translation with Diffusion Models
NIPS 2024
FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance
ECCV 2024
Copilot4D: Learning Unsupervised World Models for Autonomous Driving via Discrete Diffusion
ICLR 2024
Volumetric Conditional Score-based Residual Diffusion Model for PET/MR Denoising
MICCAI 2024
Hybrid Local SGD for Federated Learning with Heterogeneous Communications
ICLR 2022
Rethinking Closed-Loop Training for Autonomous Driving
ECCV 2022
Federated Learning with Sparsification-Amplified Privacy and Adaptive Optimization
IJCAI 2021
Learning Lane Graph Representations for Motion Forecasting
ECCV 2020
Conditional Entropy Coding for Efficient Video Compression
ECCV 2020
PnPNet: End-to-End Perception and Prediction With Tracking in the Loop
CVPR 2020
PolyTransform: Deep Polygon Transformer for Instance Segmentation
CVPR 2020
Deep Rigid Instance Scene Flow
CVPR 2019
UPSNet: A Unified Panoptic Segmentation Network
CVPR 2019
Multi-Task Multi-Sensor Fusion for 3D Object Detection
CVPR 2019
DeepPruner: Learning Efficient Stereo Matching via Differentiable PatchMatch
ICCV 2019