conftrace_

Xinggang Wang

81 papers · 2011–2026 · 9 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+15 more ↓

🏃 Academic Marathon (14) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (9) 🐝 Cross-Pollinator (10)

🐝 Cross-Pollinator (10) 🌈 Renaissance Researcher (6) 🗺️ Taxonomy Completionist (88) 🏠 Conference Loyalist (29) 🧬 Topic Evolution 🏆 Grand Slam 🤝 Dynamic Duo (50) 🏆 Keyword Champion 🔬 Deep Specialist (20) 🚀 Conference Pioneer 🗃️ Keyword Collector (289) 💎 Century Club (75) 🔥 Unstoppable (11) 📈 Trend Setter ⚡ Prolific Year (13)

Conferences

CVPR (29) ECCV (13) ICCV (13) AAAI (11) ICLR (7) NIPS (4) ICML (2) ACML (1) CORL (1)

Top co-authors

Wenyu Liu (54) Qian Zhang (17) Tianheng Cheng (16) Chang Huang (12) Jiemin Fang (11) Bencheng Liao (9) Shaoyu Chen (9) Bin Feng (8) Yuxin Fang (8) Lingxi Xie (6)

Research topics

Keywords

object detection (12) vision transformer (9) instance segmentation (7) image classification (6) semantic segmentation (5) vision-language model (4) knowledge distillation (4) diffusion model (4) convolutional neural network (4) generative model (4) model compression (4) query-based detection (4) transformer architecture (4) attention mechanism (4) video instance segmentation (4) weakly supervised learning (3) transfer learning (3) image generation (3) point cloud (3) deep neural network (3)

Papers

LENS: Learning to Segment Anything with Unified Reinforced Reasoning AAAI 2026 Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices AAAI 2026 MolSight: Optical Chemical Structure Recognition with SMILES Pretraining, Multi-Granularity Learning and Reinforcement Learning AAAI 2026 Few-step Flow for 3D Generation via Marginal-Data Transport Distillation AAAI 2026 Gait Recognition via Collaborating Discriminative and Generative Diffusion Models AAAI 2026 Phys-Liquid: A Physics-Informed Dataset for Estimating 3D Geometry and Volume of Transparent Deformable Liquids AAAI 2026 GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding ICCV 2025 Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation CVPR 2025 Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models CVPR 2025 DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving CVPR 2025 ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention AAAI 2025 GaraMoSt: Parallel Multi-Granularity Motion and Structural Modeling for Efficient Multi-Frame Interpolation in DSA Images AAAI 2025 ControlAR: Controllable Image Generation with Autoregressive Models ICLR 2025 JudgeLM: Fine-tuned Large Language Models are Scalable Judges ICLR 2025 MaTVLM: Hybrid Mamba-Transformer for Efficient Vision-Language Modeling ICCV 2025 GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding CVPR 2025 DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention CVPR 2025 "Towards Dual Transparent Liquid Level Estimation in Biomedical Lab: Dataset, Methods and Practice" ECCV 2024 Occupancy as Set of Points ECCV 2024 Causality-inspired Discriminative Feature Learning in Triple Domains for Gait Recognition ECCV 2024 Visual Text Generation in the Wild ECCV 2024 Co-Student: Collaborating Strong and Weak Students for Sparsely Annotated Object Detection ECCV 2024 Lane Graph as Path: Continuity-preserving Path-wise Modeling for Online Lane Graph Construction ECCV 2024 Cascade-Zero123: One Image to Highly Consistent 3D with Self-Prompted Nearby Views ECCV 2024 YOLO-World: Real-Time Open-Vocabulary Object Detection CVPR 2024 GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models CVPR 2024 Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model ICML 2024 MobileInst: Video Instance Segmentation on the Mobile AAAI 2024 FasterDiT: Towards Faster Diffusion Transformers Training without Architecture Modification NIPS 2024 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering CVPR 2024 Symphonize 3D Semantic Scene Completion with Contextual Instance Queries CVPR 2024 MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction ICLR 2023 Circuit as Set of Points NIPS 2023 PD-Quant: Post-Training Quantization Based on Prediction Difference Metric CVPR 2023 EVA: Exploring the Limits of Masked Visual Representation Learning at Scale CVPR 2023 BoxTeacher: Exploring High-Quality Pseudo Labels for Weakly Supervised Instance Segmentation CVPR 2023 RILS: Masked Visual Reconstruction in Language Semantic Space CVPR 2023 Boosting Low-Data Instance Segmentation by Unsupervised Pre-Training With Saliency Prompt CVPR 2023 TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance ICCV 2023 Query6DoF: Learning Sparse Queries as Implicit Shape Prior for Category-Level 6DoF Pose Estimation ICCV 2023 Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection ICCV 2023 VAD: Vectorized Scene Representation for Efficient Autonomous Driving ICCV 2023 Graph Contrastive Learning for Skeleton-based Action Recognition ICLR 2023 Corrupted Image Modeling for Self-Supervised Visual Pre-Training ICLR 2023 Robust Multi-Object Tracking by Marginal Inference ECCV 2022 AiATrack: Attention in Attention for Transformer Visual Tracking ECCV 2022 Sparse Instance Activation for Real-Time Instance Segmentation CVPR 2022 AziNorm: Exploiting the Radial Symmetry of Point Cloud for Azimuth-Normalized 3D Perception CVPR 2022 MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens CVPR 2022 Bag of Instances Aggregation Boosts Self-supervised Distillation ICLR 2022 Vision-based Uneven BEV Representation Learning with Polar Rasterization and Surface Estimation CORL 2022 TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation CVPR 2022 Temporally Efficient Vision Transformer for Video Instance Segmentation CVPR 2022 ByteTrack: Multi-Object Tracking by Associating Every Detection Box ECCV 2022 Weakly-Supervised Instance Segmentation via Class-Agnostic Learning With Salient Images CVPR 2021 Instances As Queries ICCV 2021 Context-Sensitive Temporal Feature Learning for Gait Recognition ICCV 2021 Crossover Learning for Fast Online Video Instance Segmentation ICCV 2021 Hierarchical Aggregation for 3D Instance Segmentation ICCV 2021 You Only Look at One Sequence: Rethinking Transformer in Vision through Object Detection NIPS 2021 Human De-Occlusion: Invisible Perception and Recovery for Humans CVPR 2021 Diversity Transfer Network for Few-Shot Learning AAAI 2020 Fast Neural Network Adaptation via Parameter Remapping and Architecture Search ICLR 2020 Boundary-preserving Mask R-CNN ECCV 2020 Densely Connected Search Space for More Flexible Neural Architecture Search CVPR 2020 Direct Object Recognition Without Line-Of-Sight Using Optical Coherence CVPR 2019 Detect or Track: Towards Cost-Effective Video Object Detection/Tracking AAAI 2019 Mask Scoring R-CNN CVPR 2019 RENAS: Reinforced Evolutionary Neural Architecture Search CVPR 2019 CCNet: Criss-Cross Attention for Semantic Segmentation ICCV 2019 Deep Multi-instance Learning with Dynamic Pooling ACML 2018 Weakly Supervised Region Proposal Network and Object Detection ECCV 2018 Weakly-Supervised Semantic Segmentation Network With Deep Seeded Region Growing CVPR 2018 Mancs: A Multi-task Attentional Network with Curriculum Sampling for Person Re-identification ECCV 2018 Object-Level Proposals ICCV 2017 Multiple Instance Detection Network With Online Instance Classifier Refinement CVPR 2017 Robust Scene Text Recognition With Automatic Rectification CVPR 2016 DeepContour: A Deep Convolutional Feature Learned by Positive-Sharing Loss for Contour Detection CVPR 2015 Relaxed Multiple-Instance SVM With Application to Object Discovery ICCV 2015 Max-Margin Multiple-Instance Dictionary Learning ICML 2013 Maximal Cliques that Satisfy Hard Constraints with Application to Deformable Object Model Learning NIPS 2011