Jiwen Lu
182 papers · 2013–2025 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
🏃 Academic Marathon (12) 🧭 Keyword Pioneer 🌍 Conference Polyglot (8) 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (11)
🐝
Cross-Pollinator
(11)
🧭
Keyword Pioneer
🏃
Academic Marathon
(12)
🌟
Keyword Trendsetter Combo
(5)
🏠
Conference Loyalist
(76)
🔬
Deep Specialist
(38)
🏆
Keyword Champion
🧬
Topic Evolution
🤝
Dynamic Duo
(151)
⚡
Prolific Year
(22)
🗃️
Keyword Collector
(605)
❓
The Questioner
📈
Trend Setter
🔥
Unstoppable
(13)
🚀
Conference Pioneer
💎
Century Club
(182)
Conferences
CVPR (76)
ICCV (44)
ECCV (34)
NIPS (14)
ICLR (7)
AAAI (3)
CORL (3)
IJCAI (1)
Top co-authors
Research topics
Keywords
point cloud
(13)
metric learning
(13)
representation learning
(11)
model compression
(10)
diffusion model
(10)
neural network
(9)
deep metric learning
(9)
3d vision
(9)
attention mechanism
(8)
vision transformer
(8)
contrastive learning
(7)
image generation
(7)
face recognition
(7)
video understanding
(7)
semantic segmentation
(7)
convolutional neural network
(6)
action recognition
(6)
autonomous driving
(6)
depth estimation
(6)
feature embedding
(6)
Papers
EfficientLLaVA: Generalizable Auto-Pruning for Large Vision-language Models
CVPR 2025
EmbodiedSAM: Online Segment Any 3D Thing in Real Time
ICLR 2025
Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding
CVPR 2025
EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding
ICCV 2025
Learning Counterfactually Decoupled Attention for Open-World Model Attribution
ICCV 2025
SpectralAR: Spectral Autoregressive Visual Generation
ICCV 2025
PlaneRAS: Learning Planar Primitives for 3D Plane Recovery
ICCV 2025
SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs
ICCV 2025
IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation
ICCV 2025
D3QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection
ICCV 2025
Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution
ICLR 2025
UniGoal: Towards Universal Zero-shot Goal-oriented Navigation
CVPR 2025
GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction
CVPR 2025
GaussianFormer-2: Probabilistic Gaussian Superposition for Efficient 3D Occupancy Prediction
CVPR 2025
UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting
CVPR 2025
GC-VLN: Instruction as Graph Constraints for Training-free Vision-and-Language Navigation
CORL 2025
MoTo: A Zero-shot Plug-in Interaction-aware Navigation for General Mobile Manipulation
CORL 2025
InstaRevive: One-Step Image Enhancement via Dynamic Score Matching
ICLR 2025
ThinkBot: Embodied Instruction Following with Thought Chain Reasoning
ICLR 2025
3D Small Object Detection with Dynamic Spatial Pruning
ECCV 2024
DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery
CVPR 2024
FlowIE: Efficient Image Enhancement via Rectified Flow
CVPR 2024
MirageRoom: 3D Scene Segmentation with 2D Pre-trained Models by Mirage Projection
CVPR 2024
Segment and Caption Anything
CVPR 2024
Memory-based Adapters for Online 3D Scene Perception
CVPR 2024
Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression
CVPR 2024
Towards Accurate Post-training Quantization for Diffusion Models
CVPR 2024
Narrative Action Evaluation with Prompt-Guided Multimodal Interaction
CVPR 2024
Path Choice Matters for Clear Attributions in Path Methods
ICLR 2024
Learning Dual-Level Deformable Implicit Representation for Real-World Scale Arbitrary Super-Resolution
ECCV 2024
ProtoComp: Diverse Point Cloud Completion with Controllable Prototype
ECCV 2024
DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
ECCV 2024
ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation
ECCV 2024
GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction
ECCV 2024
Efficient Inference of Vision Instruction-Following Models with Elastic Cache
ECCV 2024
OccWorld: Learning a 3D Occupancy World Model for Autonomous Driving
ECCV 2024
SpatialFormer: Towards Generalizable Vision Transformers with Explicit Spatial Understanding
ECCV 2024
DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic Compensation
ECCV 2024
LowRankOcc: Tensor Decomposition and Low-Rank Recovery for Vision-based 3D Semantic Occupancy Prediction
CVPR 2024
X-3D: Explicit 3D Structure Modeling for Point Cloud Recognition
CVPR 2024
MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating Vision-Language Transformer
CVPR 2024
SelfOcc: Self-Supervised Vision-Based 3D Occupancy Prediction
CVPR 2024
FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner
NIPS 2024
SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation
NIPS 2024
GeoLRM: Geometry-Aware Large Reconstruction Model for High-Quality 3D Gaussian Generation
NIPS 2024
XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation
NIPS 2024
Bridging the Divide: Reconsidering Softmax and Linear Attention
NIPS 2024
Q-VLM: Post-training Quantization for Large Vision-Language Models
NIPS 2024
FLAG3D: A 3D Fitness Activity Dataset With Language Instruction
CVPR 2023
Bort: Towards Explainable Neural Networks with Bounded Orthogonal Constraint
ICLR 2023
A Simple Baseline for Multi-Camera 3D Object Detection
AAAI 2023
GAIN: On the Generalization of Instructional Action Understanding
ICLR 2023
Unleashing Text-to-Image Diffusion Models for Visual Perception
ICCV 2023
SurroundOcc: Multi-camera 3D Occupancy Prediction for Autonomous Driving
ICCV 2023
OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception
ICCV 2023
CLIP-Cluster: CLIP-Guided Attribute Hallucination for Face Clustering
ICCV 2023
TCOVIS: Temporally Consistent Online Video Instance Segmentation
ICCV 2023
Skip-Plan: Procedure Planning in Instructional Videos via Condensed Action Space Learning
ICCV 2023
Token-Label Alignment for Vision Transformers
ICCV 2023
Take-A-Photo: 3D-to-2D Generative Pre-training of Point Cloud Models
ICCV 2023
OPERA: Omni-Supervised Representation Learning with Hierarchical Supervisions
ICCV 2023
DiffSwap: High-Fidelity and Controllable Face Swapping via 3D-Aware Masked Diffusion
CVPR 2023
DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation
CVPR 2023
Binarizing Sparse Convolutional Networks for Efficient Point Cloud Analysis
CVPR 2023
Diffusion-SDF: Text-To-Shape via Voxelized Diffusion
CVPR 2023
Tri-Perspective View for Vision-Based 3D Semantic Occupancy Prediction
CVPR 2023
MCUFormer: Deploying Vision Tranformers on Microcontrollers with Limited Memory
NIPS 2023
UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models
NIPS 2023
Deep Factorized Metric Learning
CVPR 2023
LOGO: A Long-Form Video Dataset for Group Action Quality Assessment
CVPR 2023
Uncertainty-Aware Representation Learning for Action Segmentation
IJCAI 2022
HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions
NIPS 2022
P2P: Tuning Pre-trained Image Models for Point Cloud Analysis with Point-to-Pixel Prompting
NIPS 2022
OrdinalCLIP: Learning Rank Prompts for Language-Guided Ordinal Regression
NIPS 2022
SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation
CORL 2022
HyperDet3D: Learning a Scene-Conditioned 3D Object Detector
CVPR 2022
Stochastic Trajectory Prediction via Motion Indeterminacy Diffusion
CVPR 2022
Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos
CVPR 2022
FineDiving: A Fine-Grained Dataset for Procedure-Aware Action Quality Assessment
CVPR 2022
Back to Reality: Weakly-Supervised 3D Object Detection With Shape-Guided Label Enhancement
CVPR 2022
Dimension Embeddings for Monocular 3D Object Detection
CVPR 2022
Point-BERT: Pre-Training 3D Point Cloud Transformers With Masked Point Modeling
CVPR 2022
DenseCLIP: Language-Guided Dense Prediction With Context-Aware Prompting
CVPR 2022
Attributable Visual Similarity Learning
CVPR 2022
SemAffiNet: Semantic-Affine Transformation for Point Cloud Segmentation
CVPR 2022
Shapley-NAS: Discovering Operation Contribution for Neural Architecture Search
CVPR 2022
Spike Transformer: Monocular Depth Estimation for Spiking Camera
ECCV 2022
Shap-CAM: Visual Explanations for Convolutional Neural Networks Based on Shapley Value
ECCV 2022
Label2Label: A Language Modeling Framework for Multi-Attribute Learning
ECCV 2022
Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis
ECCV 2022
Learning Series-Parallel Lookup Tables for Efficient Image Super-Resolution
ECCV 2022
AMixer: Adaptive Weight Mixing for Self-Attention Free Vision Transformers
ECCV 2022
Dynamic Metric Learning with Cross-Level Concept Distillation
ECCV 2022
LiDAR Distillation: Bridging the Beam-Induced Domain Gap for 3D Object Detection
ECCV 2022
RandomRooms: Unsupervised Pre-Training From Synthetic Shapes and Randomized Layouts for 3D Object Detection
ICCV 2021
Frequency-Aware Spatiotemporal Transformers for Video Inpainting Detection
ICCV 2021
Personalized Trajectory Prediction via Distribution Discrimination
ICCV 2021
PoinTr: Diverse Point Cloud Completion With Geometry-Aware Transformers
ICCV 2021
Instance Similarity Learning for Unsupervised Feature Representation
ICCV 2021
Group-Aware Contrastive Regression for Action Quality Assessment
ICCV 2021
Pseudo Facial Generation With Extreme Poses for Face Recognition
CVPR 2021
WebFace260M: A Benchmark Unveiling the Power of Million-Scale Deep Face Recognition
CVPR 2021
Learning Probabilistic Ordinal Embeddings for Uncertainty-Aware Regression
CVPR 2021
Counterfactual Attention Learning for Fine-Grained Visual Categorization and Re-Identification
ICCV 2021
Multi-Proxy Wasserstein Classifier for Image Classification
AAAI 2021
SIMPLE: SIngle-network with Mimicking and Point Learning for Bottom-up Human Pose Estimation
AAAI 2021
DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification
NIPS 2021
Global Filter Networks for Image Classification
NIPS 2021
Human Trajectory Prediction via Counterfactual Analysis
ICCV 2021
Structure-Aware Face Clustering on a Large-Scale Graph With 107 Nodes
CVPR 2021
Objects Are Different: Flexible Monocular 3D Object Detection
CVPR 2021
Deep Compositional Metric Learning
CVPR 2021
Meta-Mining Discriminative Samples for Kinship Verification
CVPR 2021
Gait Recognition in the Wild: A Benchmark
ICCV 2021
Towards Interpretable Deep Metric Learning With Structural Matching
ICCV 2021
Generalizable Mixed-Precision Quantization via Attribution Rank Preservation
ICCV 2021
NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-View Stereo
ICCV 2021
Deep Relational Metric Learning
ICCV 2021
PV-RAFT: Point-Voxel Correlation Fields for Scene Flow Estimation of Point Clouds
CVPR 2021
Self-Supervised Video Hashing via Bidirectional Transformers
CVPR 2021
Deep Credible Metric Learning for Unsupervised Domain Adaptation Person Re-identification
ECCV 2020
Temporal Coherence or Temporal Motion: Which is More Critical for Video-based Person Re-identification?
ECCV 2020
MetaDistiller: Network Self-Boosting via Meta-Learned Top-Down Distillation
ECCV 2020
Graph-Based Social Relation Reasoning
ECCV 2020
Reinforced Axial Refinement Network for Monocular 3D Object Detection
ECCV 2020
Structural Deep Metric Learning for Room Layout Estimation
ECCV 2020
Deep Hashing with Active Pairwise Supervision
ECCV 2020
Rotation-robust Intersection over Union for 3D Object Detection
ECCV 2020
Spatial Geometric Reasoning for Room Layout Estimation via Deep Reinforcement Learning
ECCV 2020
BiDet: An Efficient Binarized Object Detector
CVPR 2020
Uncertainty-Aware Score Distribution Learning for Action Quality Assessment
CVPR 2020
Structure-Preserving Super Resolution With Gradient Guidance
CVPR 2020
Deep Face Super-Resolution With Iterative Collaboration Between Attentive Recovery and Landmark Estimation
CVPR 2020
Global-Local Bidirectional Reasoning for Unsupervised Representation Learning of 3D Point Clouds
CVPR 2020
Deep Metric Learning via Adaptive Learnable Assessment
CVPR 2020
Deep Fitting Degree Scoring Network for Monocular 3D Object Detection
CVPR 2019
Conditional Single-View Shape Generation for Multi-View Stereo Reconstruction
CVPR 2019
Enhanced Bayesian Compression via Deep Reinforcement Learning
CVPR 2019
Deep Embedding Learning With Discriminative Sampling Policy
CVPR 2019
UniformFace: Learning Deep Equidistributed Representation for Face Recognition
CVPR 2019
COIN: A Large-Scale Dataset for Comprehensive Instructional Video Analysis
CVPR 2019
DensePoint: Learning Densely Contextual Representation for Efficient Point Cloud Processing
ICCV 2019
Neighborhood Preserving Hashing for Scalable Video Retrieval
ICCV 2019
Deep Meta Metric Learning
ICCV 2019
Self-Critical Attention Learning for Person Re-Identification
ICCV 2019
BridgeNet: A Continuity-Aware Probabilistic Network for Age Estimation
CVPR 2019
Structural Relational Reasoning of Point Clouds
CVPR 2019
Learning Channel-Wise Interactions for Binary Convolutional Neural Networks
CVPR 2019
Spherical Fractal Convolutional Neural Networks for Point Cloud Recognition
CVPR 2019
Hardness-Aware Deep Metric Learning
CVPR 2019
Deep Reinforcement Learning with Iterative Shift for Visual Tracking
ECCV 2018
Graininess-Aware Deep Feature Learning for Pedestrian Detection
ECCV 2018
GraphBit: Bitwise Interaction Mining via Deep Reinforcement Learning
CVPR 2018
Deep Hashing via Discrepancy Minimization
CVPR 2018
Learning Globally Optimized Object Detector via Policy Gradient
CVPR 2018
Deep Progressive Reinforcement Learning for Skeleton-Based Action Recognition
CVPR 2018
Deep Adversarial Metric Learning
CVPR 2018
Relaxation-Free Deep Hashing via Policy Gradient
ECCV 2018
Collaborative Deep Reinforcement Learning for Multi-Object Tracking
ECCV 2018
Deep Variational Metric Learning
ECCV 2018
Dual-Agent Deep Reinforcement Learning for Deformable Face Tracking
ECCV 2018
Part-Activated Deep Reinforcement Learning for Action Prediction
ECCV 2018
Learning Deep Binary Descriptor With Multi-Quantization
CVPR 2017
Attention-Aware Deep Reinforcement Learning for Video Face Recognition
ICCV 2017
Runtime Neural Pruning
NIPS 2017
3DCNN-DQN-RNN: A Deep Reinforcement Learning Framework for Semantic Parsing of Large-Scale 3D Point Clouds
ICCV 2017
Cross-Modal Deep Variational Hashing
ICCV 2017
Learning Discriminative Aggregation Network for Video-Based Face Recognition
ICCV 2017
Consistent-Aware Deep Learning for Person Re-Identification in a Camera Network
CVPR 2017
Modality and Component Aware Feature Fusion For RGB-D Scene Classification
CVPR 2016
Learning Compact Binary Descriptors With Unsupervised Deep Neural Networks
CVPR 2016
Local Subspace Collaborative Tracking
ICCV 2015
Multiple Feature Fusion via Weighted Entropy for Visual Tracking
ICCV 2015
MMSS: Multi-Modal Sharable and Specific Feature Learning for RGB-D Object Recognition
ICCV 2015
Multi-View Complementary Hash Tables for Nearest Neighbor Search
ICCV 2015
Deep Transfer Metric Learning
CVPR 2015
Deep Hashing for Compact Binary Codes Learning
CVPR 2015
Multi-Manifold Deep Metric Learning for Image Set Classification
CVPR 2015
Simultaneous Local Binary Feature Learning and Encoding for Face Recognition
ICCV 2015
Discriminative Deep Metric Learning for Face Verification in the Wild
CVPR 2014
Image Set Classification Using Holistic Multiple Order Statistics Features and Localized Multi-kernel Metric Learning
ICCV 2013
Robust Feature Set Matching for Partial Face Recognition
ICCV 2013