Rui Hu

31 papers · 2019–2026 · 11 conferences · across top CS/AI conferences

Achievements

+11 more ↓

🌍 Conference Polyglot (11) 🏃 Academic Marathon (6) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (12)

🐝 Cross-Pollinator (12) 🌈 Renaissance Researcher (6) 🗺️ Taxonomy Completionist (55) 🧬 Topic Evolution 👥 Mega-Team (22) 🤝 Dynamic Duo (10) 🏆 Grand Slam 🗃️ Keyword Collector (115) ⚡ Prolific Year (12) 🚀 Conference Pioneer 💎 Century Club (27)

Conferences

CVPR (9) ACL (4) ECCV (4) AAAI (3) ICLR (3) ICCV (2) WACV (2) ICML (1) IJCAI (1) MICCAI (1) NIPS (1)

Top co-authors

Raquel Urtasun (10) Yuwen Xiong (5) Jiedong Zhuang (4) Zikai Zhang (4) Jiahao Xu (4) Xiaoxin Chen (4) Wei-Chiu Ma (4) Bin Yang (3) Yun Chen (3) Jitao Sang (3)

Research topics

Privacy (1)

Keywords

multimodal large language model (5) federated learning (4) attention mechanism (3) vision-language model (3) object detection (2) semantic segmentation (2) depth estimation (2) backdoor attack (2) instance segmentation (2) mask prediction (2) multimodal learning (2) image segmentation (1) deformable convolution (1) knowledge distillation (1) differential privacy (1) domain adaptation (1) policy optimization (1) reinforcement learning (1) autonomous driving (1) document parsing (1)

Papers

XMark: Reliable Multi-Bit Watermarking for LLM-Generated Texts ACL 2026 VAPO: End-to-end Slide-Enhanced Speech Recognition with Omni-modal Large Language Models ACL 2026 Q Cache: Visual Attention Is Valuable in Less than Half of Decode Layers for Multimodal Large Language Model AAAI 2026 LENS: Learning to Segment Anything with Unified Reinforced Reasoning AAAI 2026 Achieving Byzantine-Resilient Federated Learning via Layer-Adaptive Sparsified Model Aggregation WACV 2025 Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding CVPR 2025 ODE: Open-Set Evaluation of Hallucinations in Multimodal Large Language Models CVPR 2025 BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices CVPR 2025 Detecting Backdoor Attacks in Federated Learning via Direction Alignment Inspection CVPR 2025 Identify Backdoored Model in Federated Learning via Individual Unlearning WACV 2025 Finite-Time Analysis of Discrete-Time Stochastic Interpolants ICML 2025 ST3: Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming AAAI 2025 Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks ICLR 2025 GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding ICCV 2025 MEraser: An Effective Fingerprint Erasure Approach for Large Language Models ACL 2025 Investigating and Enhancing Vision-Audio Capability in Omnimodal Large Language Models ACL 2025 FashionR2R: Texture-preserving Rendered-to-Real Image Translation with Diffusion Models NIPS 2024 FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance ECCV 2024 Copilot4D: Learning Unsupervised World Models for Autonomous Driving via Discrete Diffusion ICLR 2024 Volumetric Conditional Score-based Residual Diffusion Model for PET/MR Denoising MICCAI 2024 Hybrid Local SGD for Federated Learning with Heterogeneous Communications ICLR 2022 Rethinking Closed-Loop Training for Autonomous Driving ECCV 2022 Federated Learning with Sparsification-Amplified Privacy and Adaptive Optimization IJCAI 2021 Learning Lane Graph Representations for Motion Forecasting ECCV 2020 Conditional Entropy Coding for Efficient Video Compression ECCV 2020 PnPNet: End-to-End Perception and Prediction With Tracking in the Loop CVPR 2020 PolyTransform: Deep Polygon Transformer for Instance Segmentation CVPR 2020 Deep Rigid Instance Scene Flow CVPR 2019 UPSNet: A Unified Panoptic Segmentation Network CVPR 2019 Multi-Task Multi-Sensor Fusion for 3D Object Detection CVPR 2019 DeepPruner: Learning Efficient Stereo Matching via Differentiable PatchMatch ICCV 2019