Lu Sheng

44 papers · 2017–2026 · 9 top CS/AI conferences

Achievements

🌉 Interdisciplinary Bridge 🏃 Academic Marathon (8) 🌍 Conference Polyglot (9) 🌈 Renaissance Researcher (7) 🗺️ Taxonomy Completionist (73) 🔬 Deep Specialist (11) 🏆 Grand Slam 🤝 Dynamic Duo (18) 🧬 Topic Evolution ⚡ Prolific Year (7) 🔥 Unstoppable (9) 💎 Century Club (41) 🗃️ Keyword Collector (192) 🚀 Conference Pioneer 📈 Trend Setter

Conferences

CVPR (19) AAAI (7) ICCV (7) ECCV (6) ICLR (1) ICML (1) IJCAI (1) NIPS (1) WACV (1)

Top co-authors

Jing Shao (19) Xiaogang Wang (10) Dong Xu (10) Zehuan Huang (6) Junjie Yan (5) Zhenfei Yin (5) Xihui Liu (5) Guojun Yin (5) Wanli Ouyang (5) Jing Zhang (4)

Research topics

Privacy (1)

Keywords

image generation (6) diffusion model (5) point cloud (5) multi-modal learning (4) generative model (3) zero-shot learning (3) depth estimation (3) 3d reconstruction (3) 3d vision (3) convolutional neural network (2) domain adaptation (2) 3d object detection (2) 3d generation (2) visual grounding (2) embodied ai (2) object detection (2) multimodal learning (2) self-supervised learning (2) optical flow (2) text-to-image model (2)

Papers

IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks · AAAI 2026
InterMoE: Individual-Specific 3D Human Interaction Generation via Dynamic Temporal-Selective MoE · AAAI 2026
Personalize Anything for Free with Diffusion Transformer · AAAI 2026
Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection · CVPR 2025
MV-Adapter: Multi-View Consistent Image Generation Made Easy · ICCV 2025
WorldSimBench: Towards Video Generation Models as World Simulators · ICML 2025
T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation · CVPR 2025
MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation · CVPR 2025
Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion · CVPR 2025
MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active Perception · CVPR 2024
Multi-Modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation · AAAI 2024
Data-Free Generalized Zero-Shot Learning · AAAI 2024
Self-Supervised Monocular Depth Estimation in the Dark: Towards Data Distribution Compensation · IJCAI 2024
Octavius: Mitigating Task Interference in MLLMs via LoRA-MoE · ICLR 2024
EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion · CVPR 2024
VL-SAT: Visual-Linguistic Semantics Assisted Training for 3D Semantic Scene Graph Prediction in Point Cloud · CVPR 2023
Siamese DETR · CVPR 2023
LAMM: Language-Assisted Multi-Modal Instruction-Tuning Dataset, Framework, and Benchmark · NIPS 2023
X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation · ECCV 2022
Improving RGB-D Point Cloud Registration by Learning Multi-Scale Local Linear Transformation · ECCV 2022
3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds · CVPR 2022
DanceFormer: Music Conditioned 3D Dance Generation with Parametric Motion Transformer · AAAI 2022
SketchSampler: Sketch-Based 3D Reconstruction via View-Dependent Depth Sampling · ECCV 2022
IncreACO: Incrementally Learned Automatic Check-Out With Photorealistic Exemplar Augmentation · WACV 2021
StyleFormer: Real-Time Arbitrary Style Transfer via Parametric Style Composition · ICCV 2021
3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds · ICCV 2021
Back-Tracing Representative Points for Voting-Based 3D Object Detection in Point Clouds · CVPR 2021
ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis · CVPR 2021
Morphing and Sampling Network for Dense Point Cloud Completion · AAAI 2020
Thinking in Frequency: Face Forgery Detection by Mining Frequency-aware Clues · ECCV 2020
Powering One-shot Topological NAS with Stabilized Share-parameter Proxy · ECCV 2020
Unsupervised Collaborative Learning of Keyframe Detection and Visual Odometry Towards Monocular Deep SLAM · ICCV 2019
Improving Pedestrian Attribute Recognition With Weakly-Supervised Multi-Scale Attribute-Specific Localization · ICCV 2019
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval · ICCV 2019
GS3D: An Efficient 3D Object Detection Framework for Autonomous Driving · CVPR 2019
Semantics Disentangling for Text-To-Image Generation · CVPR 2019
Video Generation From Single Semantic Label Map · CVPR 2019
Context and Attribute Grounded Dense Captioning · CVPR 2019
Exploring Disentangled Feature Representation Beyond Face Identification · CVPR 2018
Optical Flow Guided Feature: A Fast and Robust Motion Representation for Video Action Recognition · CVPR 2018
Zoom-Net: Mining Deep Feature Interactions for Visual Relationship Recognition · ECCV 2018
Avatar-Net: Multi-Scale Zero-Shot Style Transfer by Feature Decoration · CVPR 2018
HydraPlus-Net: Attentive Deep Features for Pedestrian Analysis · ICCV 2017
A Generative Model for Depth-Based Robust 3D Facial Pose Tracking · CVPR 2017