conftrace_

Jiwen Lu

182 papers · 2013–2025 · 8 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+16 more ↓

🏃 Academic Marathon (12) 🧭 Keyword Pioneer 🌍 Conference Polyglot (8) 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (11)

🐝 Cross-Pollinator (11) 🧭 Keyword Pioneer 🏃 Academic Marathon (12) 🌟 Keyword Trendsetter Combo (5) 🏠 Conference Loyalist (76) 🔬 Deep Specialist (38) 🏆 Keyword Champion 🧬 Topic Evolution 🤝 Dynamic Duo (151) ⚡ Prolific Year (22) 🗃️ Keyword Collector (605) ❓ The Questioner 📈 Trend Setter 🔥 Unstoppable (13) 🚀 Conference Pioneer 💎 Century Club (182)

Conferences

CVPR (76) ICCV (44) ECCV (34) NIPS (14) ICLR (7) AAAI (3) CORL (3) IJCAI (1)

Top co-authors

Jie Zhou (151) Yongming Rao (42) Wenzhao Zheng (30) Yansong Tang (24) Ziwei Wang (24) Yueqi Duan (19) Zheng Zhu (18) Xiuwei Xu (16) Wenliang Zhao (16) Guangyi Chen (13)

Research topics

Keywords

point cloud (13) metric learning (13) representation learning (11) model compression (10) diffusion model (10) neural network (9) deep metric learning (9) 3d vision (9) attention mechanism (8) vision transformer (8) contrastive learning (7) image generation (7) face recognition (7) video understanding (7) semantic segmentation (7) convolutional neural network (6) action recognition (6) autonomous driving (6) depth estimation (6) feature embedding (6)

Papers

EfficientLLaVA: Generalizable Auto-Pruning for Large Vision-language Models CVPR 2025 EmbodiedSAM: Online Segment Any 3D Thing in Real Time ICLR 2025 Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding CVPR 2025 EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding ICCV 2025 Learning Counterfactually Decoupled Attention for Open-World Model Attribution ICCV 2025 SpectralAR: Spectral Autoregressive Visual Generation ICCV 2025 PlaneRAS: Learning Planar Primitives for 3D Plane Recovery ICCV 2025 SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs ICCV 2025 IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation ICCV 2025 D3QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection ICCV 2025 Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution ICLR 2025 UniGoal: Towards Universal Zero-shot Goal-oriented Navigation CVPR 2025 GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction CVPR 2025 GaussianFormer-2: Probabilistic Gaussian Superposition for Efficient 3D Occupancy Prediction CVPR 2025 UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting CVPR 2025 GC-VLN: Instruction as Graph Constraints for Training-free Vision-and-Language Navigation CORL 2025 MoTo: A Zero-shot Plug-in Interaction-aware Navigation for General Mobile Manipulation CORL 2025 InstaRevive: One-Step Image Enhancement via Dynamic Score Matching ICLR 2025 ThinkBot: Embodied Instruction Following with Thought Chain Reasoning ICLR 2025 3D Small Object Detection with Dynamic Spatial Pruning ECCV 2024 DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery CVPR 2024 FlowIE: Efficient Image Enhancement via Rectified Flow CVPR 2024 MirageRoom: 3D Scene Segmentation with 2D Pre-trained Models by Mirage Projection CVPR 2024 Segment and Caption Anything CVPR 2024 Memory-based Adapters for Online 3D Scene Perception CVPR 2024 Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression CVPR 2024 Towards Accurate Post-training Quantization for Diffusion Models CVPR 2024 Narrative Action Evaluation with Prompt-Guided Multimodal Interaction CVPR 2024 Path Choice Matters for Clear Attributions in Path Methods ICLR 2024 Learning Dual-Level Deformable Implicit Representation for Real-World Scale Arbitrary Super-Resolution ECCV 2024 ProtoComp: Diverse Point Cloud Completion with Controllable Prototype ECCV 2024 DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving ECCV 2024 ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation ECCV 2024 GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction ECCV 2024 Efficient Inference of Vision Instruction-Following Models with Elastic Cache ECCV 2024 OccWorld: Learning a 3D Occupancy World Model for Autonomous Driving ECCV 2024 SpatialFormer: Towards Generalizable Vision Transformers with Explicit Spatial Understanding ECCV 2024 DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic Compensation ECCV 2024 LowRankOcc: Tensor Decomposition and Low-Rank Recovery for Vision-based 3D Semantic Occupancy Prediction CVPR 2024 X-3D: Explicit 3D Structure Modeling for Point Cloud Recognition CVPR 2024 MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating Vision-Language Transformer CVPR 2024 SelfOcc: Self-Supervised Vision-Based 3D Occupancy Prediction CVPR 2024 FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner NIPS 2024 SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation NIPS 2024 GeoLRM: Geometry-Aware Large Reconstruction Model for High-Quality 3D Gaussian Generation NIPS 2024 XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation NIPS 2024 Bridging the Divide: Reconsidering Softmax and Linear Attention NIPS 2024 Q-VLM: Post-training Quantization for Large Vision-Language Models NIPS 2024 FLAG3D: A 3D Fitness Activity Dataset With Language Instruction CVPR 2023 Bort: Towards Explainable Neural Networks with Bounded Orthogonal Constraint ICLR 2023 A Simple Baseline for Multi-Camera 3D Object Detection AAAI 2023 GAIN: On the Generalization of Instructional Action Understanding ICLR 2023 Unleashing Text-to-Image Diffusion Models for Visual Perception ICCV 2023 SurroundOcc: Multi-camera 3D Occupancy Prediction for Autonomous Driving ICCV 2023 OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception ICCV 2023 CLIP-Cluster: CLIP-Guided Attribute Hallucination for Face Clustering ICCV 2023 TCOVIS: Temporally Consistent Online Video Instance Segmentation ICCV 2023 Skip-Plan: Procedure Planning in Instructional Videos via Condensed Action Space Learning ICCV 2023 Token-Label Alignment for Vision Transformers ICCV 2023 Take-A-Photo: 3D-to-2D Generative Pre-training of Point Cloud Models ICCV 2023 OPERA: Omni-Supervised Representation Learning with Hierarchical Supervisions ICCV 2023 DiffSwap: High-Fidelity and Controllable Face Swapping via 3D-Aware Masked Diffusion CVPR 2023 DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation CVPR 2023 Binarizing Sparse Convolutional Networks for Efficient Point Cloud Analysis CVPR 2023 Diffusion-SDF: Text-To-Shape via Voxelized Diffusion CVPR 2023 Tri-Perspective View for Vision-Based 3D Semantic Occupancy Prediction CVPR 2023 MCUFormer: Deploying Vision Tranformers on Microcontrollers with Limited Memory NIPS 2023 UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models NIPS 2023 Deep Factorized Metric Learning CVPR 2023 LOGO: A Long-Form Video Dataset for Group Action Quality Assessment CVPR 2023 Uncertainty-Aware Representation Learning for Action Segmentation IJCAI 2022 HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions NIPS 2022 P2P: Tuning Pre-trained Image Models for Point Cloud Analysis with Point-to-Pixel Prompting NIPS 2022 OrdinalCLIP: Learning Rank Prompts for Language-Guided Ordinal Regression NIPS 2022 SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation CORL 2022 HyperDet3D: Learning a Scene-Conditioned 3D Object Detector CVPR 2022 Stochastic Trajectory Prediction via Motion Indeterminacy Diffusion CVPR 2022 Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos CVPR 2022 FineDiving: A Fine-Grained Dataset for Procedure-Aware Action Quality Assessment CVPR 2022 Back to Reality: Weakly-Supervised 3D Object Detection With Shape-Guided Label Enhancement CVPR 2022 Dimension Embeddings for Monocular 3D Object Detection CVPR 2022 Point-BERT: Pre-Training 3D Point Cloud Transformers With Masked Point Modeling CVPR 2022 DenseCLIP: Language-Guided Dense Prediction With Context-Aware Prompting CVPR 2022 Attributable Visual Similarity Learning CVPR 2022 SemAffiNet: Semantic-Affine Transformation for Point Cloud Segmentation CVPR 2022 Shapley-NAS: Discovering Operation Contribution for Neural Architecture Search CVPR 2022 Spike Transformer: Monocular Depth Estimation for Spiking Camera ECCV 2022 Shap-CAM: Visual Explanations for Convolutional Neural Networks Based on Shapley Value ECCV 2022 Label2Label: A Language Modeling Framework for Multi-Attribute Learning ECCV 2022 Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis ECCV 2022 Learning Series-Parallel Lookup Tables for Efficient Image Super-Resolution ECCV 2022 AMixer: Adaptive Weight Mixing for Self-Attention Free Vision Transformers ECCV 2022 Dynamic Metric Learning with Cross-Level Concept Distillation ECCV 2022 LiDAR Distillation: Bridging the Beam-Induced Domain Gap for 3D Object Detection ECCV 2022 RandomRooms: Unsupervised Pre-Training From Synthetic Shapes and Randomized Layouts for 3D Object Detection ICCV 2021 Frequency-Aware Spatiotemporal Transformers for Video Inpainting Detection ICCV 2021 Personalized Trajectory Prediction via Distribution Discrimination ICCV 2021 PoinTr: Diverse Point Cloud Completion With Geometry-Aware Transformers ICCV 2021 Instance Similarity Learning for Unsupervised Feature Representation ICCV 2021 Group-Aware Contrastive Regression for Action Quality Assessment ICCV 2021 Pseudo Facial Generation With Extreme Poses for Face Recognition CVPR 2021 WebFace260M: A Benchmark Unveiling the Power of Million-Scale Deep Face Recognition CVPR 2021 Learning Probabilistic Ordinal Embeddings for Uncertainty-Aware Regression CVPR 2021 Counterfactual Attention Learning for Fine-Grained Visual Categorization and Re-Identification ICCV 2021 Multi-Proxy Wasserstein Classifier for Image Classification AAAI 2021 SIMPLE: SIngle-network with Mimicking and Point Learning for Bottom-up Human Pose Estimation AAAI 2021 DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification NIPS 2021 Global Filter Networks for Image Classification NIPS 2021 Human Trajectory Prediction via Counterfactual Analysis ICCV 2021 Structure-Aware Face Clustering on a Large-Scale Graph With 107 Nodes CVPR 2021 Objects Are Different: Flexible Monocular 3D Object Detection CVPR 2021 Deep Compositional Metric Learning CVPR 2021 Meta-Mining Discriminative Samples for Kinship Verification CVPR 2021 Gait Recognition in the Wild: A Benchmark ICCV 2021 Towards Interpretable Deep Metric Learning With Structural Matching ICCV 2021 Generalizable Mixed-Precision Quantization via Attribution Rank Preservation ICCV 2021 NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-View Stereo ICCV 2021 Deep Relational Metric Learning ICCV 2021 PV-RAFT: Point-Voxel Correlation Fields for Scene Flow Estimation of Point Clouds CVPR 2021 Self-Supervised Video Hashing via Bidirectional Transformers CVPR 2021 Deep Credible Metric Learning for Unsupervised Domain Adaptation Person Re-identification ECCV 2020 Temporal Coherence or Temporal Motion: Which is More Critical for Video-based Person Re-identification? ECCV 2020 MetaDistiller: Network Self-Boosting via Meta-Learned Top-Down Distillation ECCV 2020 Graph-Based Social Relation Reasoning ECCV 2020 Reinforced Axial Refinement Network for Monocular 3D Object Detection ECCV 2020 Structural Deep Metric Learning for Room Layout Estimation ECCV 2020 Deep Hashing with Active Pairwise Supervision ECCV 2020 Rotation-robust Intersection over Union for 3D Object Detection ECCV 2020 Spatial Geometric Reasoning for Room Layout Estimation via Deep Reinforcement Learning ECCV 2020 BiDet: An Efficient Binarized Object Detector CVPR 2020 Uncertainty-Aware Score Distribution Learning for Action Quality Assessment CVPR 2020 Structure-Preserving Super Resolution With Gradient Guidance CVPR 2020 Deep Face Super-Resolution With Iterative Collaboration Between Attentive Recovery and Landmark Estimation CVPR 2020 Global-Local Bidirectional Reasoning for Unsupervised Representation Learning of 3D Point Clouds CVPR 2020 Deep Metric Learning via Adaptive Learnable Assessment CVPR 2020 Deep Fitting Degree Scoring Network for Monocular 3D Object Detection CVPR 2019 Conditional Single-View Shape Generation for Multi-View Stereo Reconstruction CVPR 2019 Enhanced Bayesian Compression via Deep Reinforcement Learning CVPR 2019 Deep Embedding Learning With Discriminative Sampling Policy CVPR 2019 UniformFace: Learning Deep Equidistributed Representation for Face Recognition CVPR 2019 COIN: A Large-Scale Dataset for Comprehensive Instructional Video Analysis CVPR 2019 DensePoint: Learning Densely Contextual Representation for Efficient Point Cloud Processing ICCV 2019 Neighborhood Preserving Hashing for Scalable Video Retrieval ICCV 2019 Deep Meta Metric Learning ICCV 2019 Self-Critical Attention Learning for Person Re-Identification ICCV 2019 BridgeNet: A Continuity-Aware Probabilistic Network for Age Estimation CVPR 2019 Structural Relational Reasoning of Point Clouds CVPR 2019 Learning Channel-Wise Interactions for Binary Convolutional Neural Networks CVPR 2019 Spherical Fractal Convolutional Neural Networks for Point Cloud Recognition CVPR 2019 Hardness-Aware Deep Metric Learning CVPR 2019 Deep Reinforcement Learning with Iterative Shift for Visual Tracking ECCV 2018 Graininess-Aware Deep Feature Learning for Pedestrian Detection ECCV 2018 GraphBit: Bitwise Interaction Mining via Deep Reinforcement Learning CVPR 2018 Deep Hashing via Discrepancy Minimization CVPR 2018 Learning Globally Optimized Object Detector via Policy Gradient CVPR 2018 Deep Progressive Reinforcement Learning for Skeleton-Based Action Recognition CVPR 2018 Deep Adversarial Metric Learning CVPR 2018 Relaxation-Free Deep Hashing via Policy Gradient ECCV 2018 Collaborative Deep Reinforcement Learning for Multi-Object Tracking ECCV 2018 Deep Variational Metric Learning ECCV 2018 Dual-Agent Deep Reinforcement Learning for Deformable Face Tracking ECCV 2018 Part-Activated Deep Reinforcement Learning for Action Prediction ECCV 2018 Learning Deep Binary Descriptor With Multi-Quantization CVPR 2017 Attention-Aware Deep Reinforcement Learning for Video Face Recognition ICCV 2017 Runtime Neural Pruning NIPS 2017 3DCNN-DQN-RNN: A Deep Reinforcement Learning Framework for Semantic Parsing of Large-Scale 3D Point Clouds ICCV 2017 Cross-Modal Deep Variational Hashing ICCV 2017 Learning Discriminative Aggregation Network for Video-Based Face Recognition ICCV 2017 Consistent-Aware Deep Learning for Person Re-Identification in a Camera Network CVPR 2017 Modality and Component Aware Feature Fusion For RGB-D Scene Classification CVPR 2016 Learning Compact Binary Descriptors With Unsupervised Deep Neural Networks CVPR 2016 Local Subspace Collaborative Tracking ICCV 2015 Multiple Feature Fusion via Weighted Entropy for Visual Tracking ICCV 2015 MMSS: Multi-Modal Sharable and Specific Feature Learning for RGB-D Object Recognition ICCV 2015 Multi-View Complementary Hash Tables for Nearest Neighbor Search ICCV 2015 Deep Transfer Metric Learning CVPR 2015 Deep Hashing for Compact Binary Codes Learning CVPR 2015 Multi-Manifold Deep Metric Learning for Image Set Classification CVPR 2015 Simultaneous Local Binary Feature Learning and Encoding for Face Recognition ICCV 2015 Discriminative Deep Metric Learning for Face Verification in the Wild CVPR 2014 Image Set Classification Using Holistic Multiple Order Statistics Features and Localized Multi-kernel Metric Learning ICCV 2013 Robust Feature Set Matching for Partial Face Recognition ICCV 2013