Lu Sheng
44 papers · 2017–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+13 more ↓ Show less ↑
π Interdisciplinary Bridge π Academic Marathon (8) π Conference Polyglot (9) π Renaissance Researcher (7) πΊοΈ Taxonomy Completionist (73)
π
Conference Polyglot
(9)
π
Academic Marathon
(8)
πΊοΈ
Taxonomy Completionist
(73)
π¬
Deep Specialist
(11)
π
Grand Slam
π€
Dynamic Duo
(18)
π§¬
Topic Evolution
β‘
Prolific Year
(7)
π₯
Unstoppable
(9)
π
Century Club
(41)
ποΈ
Keyword Collector
(192)
π
Conference Pioneer
π
Trend Setter
Conferences
CVPR (19)
AAAI (7)
ICCV (7)
ECCV (6)
ICLR (1)
ICML (1)
IJCAI (1)
NIPS (1)
WACV (1)
Top co-authors
Research topics
Keywords
image generation
(6)
diffusion model
(5)
point cloud
(5)
multi-modal learning
(4)
generative model
(3)
zero-shot learning
(3)
depth estimation
(3)
3d reconstruction
(3)
3d vision
(3)
convolutional neural network
(2)
domain adaptation
(2)
3d object detection
(2)
3d generation
(2)
visual grounding
(2)
embodied ai
(2)
object detection
(2)
multimodal learning
(2)
self-supervised learning
(2)
optical flow
(2)
text-to-image model
(2)
Papers
IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks
AAAI 2026
InterMoE: Individual-Specific 3D Human Interaction Generation via Dynamic Temporal-Selective MoE
AAAI 2026
Personalize Anything for Free with Diffusion Transformer
AAAI 2026
Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection
CVPR 2025
MV-Adapter: Multi-View Consistent Image Generation Made Easy
ICCV 2025
WorldSimBench: Towards Video Generation Models as World Simulators
ICML 2025
T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation
CVPR 2025
MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation
CVPR 2025
Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion
CVPR 2025
MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active Perception
CVPR 2024
Multi-Modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation
AAAI 2024
Data-Free Generalized Zero-Shot Learning
AAAI 2024
Self-Supervised Monocular Depth Estimation in the Dark: Towards Data Distribution Compensation
IJCAI 2024
Octavius: Mitigating Task Interference in MLLMs via LoRA-MoE
ICLR 2024
EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion
CVPR 2024
VL-SAT: Visual-Linguistic Semantics Assisted Training for 3D Semantic Scene Graph Prediction in Point Cloud
CVPR 2023
Siamese DETR
CVPR 2023
LAMM: Language-Assisted Multi-Modal Instruction-Tuning Dataset, Framework, and Benchmark
NIPS 2023
X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation
ECCV 2022
Improving RGB-D Point Cloud Registration by Learning Multi-Scale Local Linear Transformation
ECCV 2022
3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds
CVPR 2022
DanceFormer: Music Conditioned 3D Dance Generation with Parametric Motion Transformer
AAAI 2022
SketchSampler: Sketch-Based 3D Reconstruction via View-Dependent Depth Sampling
ECCV 2022
IncreACO: Incrementally Learned Automatic Check-Out With Photorealistic Exemplar Augmentation
WACV 2021
StyleFormer: Real-Time Arbitrary Style Transfer via Parametric Style Composition
ICCV 2021
3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds
ICCV 2021
Back-Tracing Representative Points for Voting-Based 3D Object Detection in Point Clouds
CVPR 2021
ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis
CVPR 2021
Morphing and Sampling Network for Dense Point Cloud Completion
AAAI 2020
Thinking in Frequency: Face Forgery Detection by Mining Frequency-aware Clues
ECCV 2020
Powering One-shot Topological NAS with Stabilized Share-parameter Proxy
ECCV 2020
Unsupervised Collaborative Learning of Keyframe Detection and Visual Odometry Towards Monocular Deep SLAM
ICCV 2019
Improving Pedestrian Attribute Recognition With Weakly-Supervised Multi-Scale Attribute-Specific Localization
ICCV 2019
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval
ICCV 2019
GS3D: An Efficient 3D Object Detection Framework for Autonomous Driving
CVPR 2019
Semantics Disentangling for Text-To-Image Generation
CVPR 2019
Video Generation From Single Semantic Label Map
CVPR 2019
Context and Attribute Grounded Dense Captioning
CVPR 2019
Exploring Disentangled Feature Representation Beyond Face Identification
CVPR 2018
Optical Flow Guided Feature: A Fast and Robust Motion Representation for Video Action Recognition
CVPR 2018
Zoom-Net: Mining Deep Feature Interactions for Visual Relationship Recognition
ECCV 2018
Avatar-Net: Multi-Scale Zero-Shot Style Transfer by Feature Decoration
CVPR 2018
HydraPlus-Net: Attentive Deep Features for Pedestrian Analysis
ICCV 2017
A Generative Model for Depth-Based Robust 3D Facial Pose Tracking
CVPR 2017