conftrace_

Chunhua Shen

205 papers · 2008–2026 · across 11 top CS/AI conferences

Achievements

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🗺️ Taxonomy Completionist (15) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (11)

🏃 Academic Marathon (17) 🏠 Conference Loyalist (21) 🌟 Keyword Trendsetter Combo (5) 🏆 Grand Slam 👑 Triple Crown 🤝 Dynamic Duo (44) 🌱 Topic Pioneer 🔬 Deep Specialist (26) 🧬 Topic Evolution 🏆 Keyword Champion (13) 🗃️ Keyword Collector (707) ⚡ Prolific Year (18) ❓ The Questioner (4) 💎 Century Club (202) 📈 Trend Setter 🔥 Unstoppable (14) 🚀 Conference Pioneer

Conferences

CVPR (85) ICCV (37) NIPS (21) ECCV (19) AAAI (11) ICLR (11) IJCAI (8) ICML (6) ACL (4) WACV (2) JMLR (1)

Top co-authors

Hao Chen (45) Anton van den Hengel (44) Ian Reid (20) Xinlong Wang (20) Guosheng Lin (18) Zhi Tian (17) Lingqiao Liu (17) Peng Wang (16) Bohan Zhuang (14) Qi Wu (12)

Keywords

semantic segmentation (33) convolutional neural network (23) object detection (18) attention mechanism (14) instance segmentation (13) representation learning (9) image segmentation (9) metric learning (9) visual question answering (9) depth estimation (9) image classification (8) few-shot learning (8) model compression (8) neural network (7) knowledge distillation (6) point cloud (6) conditional random field (6) neural architecture search (6) multimodal learning (6) self-supervised learning (6)

Papers

ODYSSEY: Open-World Quadrupeds Exploration and Manipulation for Long-Horizon Tasks AAAI 2026
Efficient Self-Evaluation for Diffusion Language Models via Sequence Regeneration ACL 2026
Beyond Hard Masks: Progressive Token Evolution for Diffusion Language Models ACL 2026
Revisiting Convolution Architecture in the Realm of DNA Foundation Models ICLR 2025
Depth Any Video with Scalable Synthetic Data ICLR 2025
MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation CVPR 2025
Boltzmann-Aligned Inverse Folding Model as a Predictor of Mutational Effects on Protein-Protein Interactions ICLR 2025
TG-LLaVA: Text Guided LLaVA via Learnable Latent Embeddings AAAI 2025
Framer: Interactive Frame Interpolation ICLR 2025
What Matters When Repurposing Diffusion Models for General Dense Perception Tasks? ICLR 2025
Aether: Geometric-Aware Unified World Modeling ICCV 2025
SurfaceSplat: Connecting Surface Reconstruction and Gaussian Splatting ICCV 2025
Fine-grained Abnormality Prompt Learning for Zero-shot Anomaly Detection ICCV 2025
Unified Open-World Segmentation with Multi-Modal Prompts ICCV 2025
SMSTracker: Tri-path Score Mask Sigma Fusion for Multi-Modal Tracking ICCV 2025
MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences ICLR 2025
Seeing the Unseen: Composing Outliers for Compositional Zero-Shot Learning IJCAI 2025
POMATO: Marrying Pointmap Matching with Temporal Motions for Dynamic 3D Reconstruction ICCV 2025
SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories CVPR 2025
PerturboLLaVA: Reducing Multimodal Hallucinations with Perturbative Visual Training ICLR 2025
Physics Aware Neural Networks for Unsupervised Binding Energy Prediction ICML 2025
Generative Active Learning for Long-tailed Instance Segmentation ICML 2024
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data CVPR 2024
VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks ECCV 2024
FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior ECCV 2024
FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition CVPR 2024
Traffic Scene Parsing through the TSP6K Dataset CVPR 2024
A Simple Image Segmentation Framework via In-Context Examples NIPS 2024
Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation NIPS 2024
Retrieval-Augmented Primitive Representations for Compositional Zero-Shot Learning AAAI 2024
PointAttN: You Only Need Attention for Point Cloud Completion AAAI 2024
LoRAPrune: Structured Pruning Meets Low-Rank Parameter-Efficient Fine-Tuning ACL 2024
Enhanced Visual Instruction Tuning with Synthesized Image-Dialogue Data ACL 2024
Floating Anchor Diffusion Model for Multi-motif Scaffolding ICML 2024
On the Trajectory Regularity of ODE-based Diffusion Sampling ICML 2024
De novo Protein Design Using Geometric Vector Field Networks ICLR 2024
Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching ICLR 2024
Object-Aware Inversion and Reassembly for Image Editing ICLR 2024
Zolly: Zoom Focal Length Correctly for Perspective-Distorted Human Mesh Reconstruction ICCV 2023
Images Speak in Images: A Generalist Painter for In-Context Visual Learning CVPR 2023
Learning Conditional Attributes for Compositional Zero-Shot Learning CVPR 2023
DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models NIPS 2023
Generative Prompt Model for Weakly Supervised Object Localization ICCV 2023
SegPrompt: Boosting Open-World Segmentation via Category-Level Prompt Learning ICCV 2023
FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models ICCV 2023
Conditional Positional Encodings for Vision Transformers ICLR 2023
DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models ICCV 2023
SegGPT: Towards Segmenting Everything in Context ICCV 2023
Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image ICCV 2023
Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering ICCV 2023
CTVIS: Consistent Training for Online Video Instance Segmentation ICCV 2023
FoPro: Few-Shot Guided Robust Webly-Supervised Prototypical Learning AAAI 2023
Point-Teaching: Weakly Semi-supervised Object Detection with Point Annotations AAAI 2023
A Survey on Efficient Training of Transformers IJCAI 2023
Poseur: Direct Human Pose Regression with Transformers ECCV 2022
Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval NIPS 2022
SegViT: Semantic Segmentation with Plain Vision Transformers NIPS 2022
Hierarchical Normalization for Robust Monocular Depth Estimation NIPS 2022
Multi-dataset Training of Transformers for Robust Action Recognition NIPS 2022
DENSE: Data-Free One-Shot Federated Learning NIPS 2022
Adv-Attribute: Inconspicuous and Transferable Adversarial Attack on Face Recognition NIPS 2022
Fully Convolutional One-Stage 3D Object Detection on LiDAR Range Images NIPS 2022
FreeSOLO: Learning To Segment Objects Without Annotations CVPR 2022
RigidFlow: Self-Supervised Scene Flow Learning on Point Clouds by Local Rigidity Prior CVPR 2022
Catching Both Gray and Black Swans: Open-Set Supervised Anomaly Detection CVPR 2022
Retrieval Augmented Classification for Long-Tail Visual Recognition CVPR 2022
Boosting Robustness of Image Matting With Context Assembling and Strong Data Augmentation CVPR 2022
TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation CVPR 2022
PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining NIPS 2022
DisCo: Remedying Self-Supervised Learning on Lightweight Models with Distilled Contrastive Learning ECCV 2022
Efficient Decoder-Free Object Detection with Transformers ECCV 2022
PointInst3D: Segmenting 3D Instances by Points ECCV 2022
Generic Perceptual Loss for Modeling Structured Output Dependencies CVPR 2021
Learning Spatial-Semantic Relationship for Facial Attribute Recognition With Limited Labeled Data CVPR 2021
Feature Decomposition and Reconstruction Learning for Effective Facial Expression Recognition CVPR 2021
End-to-End Video Instance Segmentation With Transformers CVPR 2021
Dense Contrastive Learning for Self-Supervised Visual Pre-Training CVPR 2021
FCPose: Fully Convolutional Multi-Person Pose Estimation With Dynamic Instance-Aware Convolutions CVPR 2021
BoxInst: High-Performance Instance Segmentation With Box Annotations CVPR 2021
HCRF-Flow: Scene Flow From Point Clouds With Continuous High-Order CRFs and Position-Aware Flow Embedding CVPR 2021
Learning Affinity-Aware Upsampling for Deep Image Matting CVPR 2021
SA-BNN: State-Aware Binary Neural Network AAAI 2021
Diverse Knowledge Distillation for End-to-End Person Search AAAI 2021
Occluded Person Re-Identification With Single-Scale Global Representations ICCV 2021
Meta Navigator: Search for a Good Adaptation Policy for Few-Shot Learning ICCV 2021
A Simple Baseline for Semi-Supervised Semantic Segmentation With Strong Data Augmentation ICCV 2021
Channel-Wise Knowledge Distillation for Dense Prediction ICCV 2021
BV-Person: A Large-Scale Dataset for Bird-View Person Re-Identification ICCV 2021
FATNN: Fast and Accurate Ternary Neural Networks ICCV 2021
Twins: Revisiting the Design of Spatial Attention in Vision Transformers NIPS 2021
Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation NIPS 2021
DoDNet: Learning To Segment Multi-Organ and Tumors From Multiple Partially Labeled Datasets CVPR 2021
Learning To Recover 3D Scene Shape From a Single Image CVPR 2021
Graph Attention Tracking CVPR 2021
AQD: Towards Accurate Quantized Object Detection CVPR 2021
DyCo3D: Robust Instance Segmentation of 3D Point Clouds Through Dynamic Convolution CVPR 2021
AE TextSpotter: Learning Visual and Linguistic Representation for Ambiguous Text Spotting ECCV 2020
SOLOv2: Dynamic and Fast Instance Segmentation NIPS 2020
V-PROM: A Benchmark for Visual Reasoning Using Visual Progressive Matrices AAAI 2020
Task-Aware Monocular Depth Estimation for 3D Object Detection AAAI 2020
Training Quantized Neural Networks With a Full-Precision Auxiliary Module CVPR 2020
Memory-Efficient Hierarchical Neural Architecture Search for Image Denoising CVPR 2020
BlendMask: Top-Down Meets Bottom-Up for Instance Segmentation CVPR 2020
DeepEMD: Few-Shot Image Classification With Differentiable Earth Mover's Distance and Structured Classifiers CVPR 2020
ABCNet: Real-Time Scene Text Spotting With Adaptive Bezier-Curve Network CVPR 2020
Context Prior for Scene Segmentation CVPR 2020
Mask Encoding for Single Shot Instance Segmentation CVPR 2020
NAS-FCOS: Fast Neural Architecture Search for Object Detection CVPR 2020
On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering CVPR 2020
REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments CVPR 2020
Self-Trained Deep Ordinal Regression for End-to-End Video Anomaly Detection CVPR 2020
PolarMask: Single Shot Instance Segmentation With Polar Representation CVPR 2020
Conditional Convolutions for Instance Segmentation ECCV 2020
Representative Graph Neural Network ECCV 2020
Soft Expert Reward Learning for Vision-and-Language Navigation ECCV 2020
Weighing Counts: Sequential Crowd Counting by Reinforcement Learning ECCV 2020
Efficient Semantic Video Segmentation with Per-frame Inference ECCV 2020
Scene Text Image Super-resolution in the wild ECCV 2020
Segmenting Transparent Objects in the Wild ECCV 2020
Learning and Memorizing Representative Prototypes for 3D Point Cloud Semantic and Instance Segmentation ECCV 2020
SOLO: Segmenting Objects by Locations ECCV 2020
Instance-Aware Embedding for Point Cloud Instance Segmentation ECCV 2020
Unsupervised Representation Learning by Predicting Random Distances IJCAI 2020
Architecture Search of Dynamic Cells for Semantic Video Segmentation WACV 2020
Template-Based Automatic Search of Compact Semantic Segmentation Architectures WACV 2020
Exploiting Temporal Consistency for Real-Time Video Depth Estimation ICCV 2019
Indices Matter: Learning to Index for Deep Image Matting ICCV 2019
Enforcing Geometric Constraints of Virtual Normal for Depth Prediction ICCV 2019
Self-Training With Progressive Augmentation for Unsupervised Cross-Domain Person Re-Identification ICCV 2019
From Open Set to Closed Set: Counting Objects by Spatial Divide-and-Conquer ICCV 2019
Efficient and Accurate Arbitrary-Shaped Text Detection With Pixel Aggregation Network ICCV 2019
FCOS: Fully Convolutional One-Stage Object Detection ICCV 2019
Multi-marginal Wasserstein GAN NIPS 2019
Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition AAAI 2019
Mind Your Neighbours: Image Annotation With Metadata Neighbourhood Graph Co-Attention Networks CVPR 2019
Neighbourhood Watch: Referring Expression Comprehension via Language-Guided Graph Attention Networks CVPR 2019
Attention-Guided Network for Ghost-Free High Dynamic Range Imaging CVPR 2019
Decoders Matter for Semantic Segmentation: Data-Dependent Decoding Enables Flexible Feature Aggregation CVPR 2019
Associatively Segmenting Instances and Semantics in Point Clouds CVPR 2019
CANet: Class-Agnostic Segmentation Networks With Iterative Refinement and Attentive Few-Shot Learning CVPR 2019
Visual Question Answering as Reading Comprehension CVPR 2019
Fast Neural Architecture Search of Compact Semantic Segmentation Models via Auxiliary Cells CVPR 2019
Light-Weight Hybrid Convolutional Network for Liver Tumor Segmentation IJCAI 2019
Knowledge Adaptation for Efficient Semantic Segmentation CVPR 2019
Unsupervised Scale-consistent Depth and Ego-motion Learning from Monocular Video NIPS 2019
Structured Binary Neural Networks for Accurate Image Classification and Semantic Segmentation CVPR 2019
Bootstrapping the Performance of Webly Supervised Semantic Segmentation CVPR 2018
Adversarial Learning with Local Coordinate Coding ICML 2018
VITAL: VIsual Tracking via Adversarial Learning CVPR 2018
Towards Effective Low-Bitwidth Convolutional Neural Networks CVPR 2018
Salient Object Detection by Lossless Feature Reflection IJCAI 2018
Learning to Predict Crisp Boundaries ECCV 2018
Goal-Oriented Visual Question Generation via Intermediate Rewards ECCV 2018
Repulsion Loss: Detecting Pedestrians in a Crowd CVPR 2018
Visual Question Answering With Memory-Augmented Networks CVPR 2018
Are You Talking to Me? Reasoned Visual Dialog Generation Through Adversarial Learning CVPR 2018
An End-to-End TextSpotter With Explicit Alignment and Attention CVPR 2018
Parallel Attention: A Unified Framework for Visual Object Discovery Through Dialogs and Queries CVPR 2018
FSRNet: End-to-End Learning Face Super-Resolution With Facial Priors CVPR 2018
Monocular Relative Depth Perception With Web Stereo Data Supervision CVPR 2018
Sequential Person Recognition in Photo Albums With a Recurrent Network CVPR 2017
Towards Context-Aware Interaction Recognition for Visual Relationship Detection ICCV 2017
When Unsupervised Domain Adaptation Meets Tensor Representations ICCV 2017
Adversarial PoseNet: A Structure-Aware Convolutional Network for Human Pose Estimation ICCV 2017
Towards End-To-End Text Spotting With Convolutional Recurrent Neural Networks ICCV 2017
Multi-Attention Network for One Shot Learning CVPR 2017
From Motion Blur to Motion Flow: A Deep Learning Solution for Removing Heterogeneous Motion Blur CVPR 2017
RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation CVPR 2017
Attend in Groups: A Weakly-Supervised Deep Learning Framework for Learning From Web Data CVPR 2017
The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions CVPR 2017
Explicit Knowledge-based Reasoning for Visual Question Answering IJCAI 2017
Learning Multi-level Region Consistency with Dense Multi-label Networks for Semantic Segmentation IJCAI 2017
Deep Descriptor Transforming for Image Co-Localization IJCAI 2017
What Value Do Explicit High Level Concepts Have in Vision to Language Problems? CVPR 2016
Efficient Piecewise Training of Deep Structured Models for Semantic Segmentation CVPR 2016
Ask Me Anything: Free-Form Visual Question Answering Based on Knowledge From External Sources CVPR 2016
Fast Training of Triplet-Based Deep Binary Embedding Networks CVPR 2016
Image Restoration Using Very Deep Convolutional Encoder-Decoder Networks with Symmetric Skip Connections NIPS 2016
Less Is More: Zero-Shot Learning From Online Textual Documents With Noise Suppression CVPR 2016
What's Wrong With That Object? Identifying Images of Unusual Objects by Modelling the Detection Score Distribution CVPR 2016
Mid-Level Deep Pattern Mining CVPR 2015
Deep Convolutional Neural Fields for Depth Estimation From a Single Image CVPR 2015
Deeply Learning the Messages in Message Passing Inference NIPS 2015
Hyperspectral Compressive Sensing Using Manifold-Structured Sparsity Prior ICCV 2015
The Treasure Beneath Convolutional Layers: Cross-Convolutional-Layer Pooling for Image Classification CVPR 2015
Learning Graph Structure for Multi-Label Image Classification via Clique Generation CVPR 2015
Efficient SDP Inference for Fully-Connected CRFs Based on Low-Rank Decomposition CVPR 2015
Learning to Rank in Person Re-Identification With Metric Ensembles CVPR 2015
Depth and Surface Normal Estimation From Monocular Images Using Regression on Deep Features and Hierarchical CRFs CVPR 2015
Supervised Discrete Hashing CVPR 2015
Encoding High Dimensional Local Features by Sparse Coding Based Fisher Vectors NIPS 2014
Fast Supervised Hashing with Decision Trees for High-Dimensional Data CVPR 2014
Efficient Pedestrian Detection by Directly Optimizing the Partial Area under the ROC Curve ICCV 2013
Dictionary Learning and Sparse Coding on Grassmann Manifolds: An Extrinsic Solution ICCV 2013
Contextual Hypergraph Modeling for Salient Object Detection ICCV 2013
A General Two-Step Approach to Learning-Based Hashing ICCV 2013
Learning Hash Functions Using Column Generation ICML 2013
Inductive Hashing on Manifolds CVPR 2013
Bilinear Programming for Human Activity Recognition with Unknown MRF Graphs CVPR 2013
A Fast Semidefinite Approach to Solving Binary Quadratic Problems CVPR 2013
Part-Based Visual Tracking with Online Latent Structural Learning CVPR 2013
Learning Compact Binary Codes for Visual Tracking CVPR 2013
Positive Semidefinite Metric Learning Using Boosting-like Algorithms JMLR 2012
Positive Semidefinite Metric Learning with Boosting NIPS 2009
PSDBoost: Matrix-Generation Linear Programming for Positive Semidefinite Matrices Learning NIPS 2008