Qi Tian
219 papers · 2012–2026 · 12 top CS/AI conferences
Achievements
Academic Marathon (13)
Conference Polyglot (12)
Keyword Pioneer
Interdisciplinary Bridge
Cross-Pollinator (4)
Keyword Trendsetter Combo (7)
Conference Loyalist (29)
Dynamic Duo (60)
Topic Pioneer
Deep Specialist (26)
Topic Evolution
Triple Crown
Keyword Champion (2)
Grand Slam
Keyword Collector (734)
Century Club (213)
Trend Setter
Unstoppable (14)
Conference Pioneer
The Questioner (2)
Prolific Year (21)
Conferences
CVPR (86)
ICCV (35)
ECCV (29)
AAAI (23)
ICLR (16)
NIPS (12)
IJCAI (6)
ICML (5)
EMNLP (3)
WACV (2)
ACL (1)
COLING (1)
Keywords
transfer learning (15)
convolutional neural network (13)
person re-identification (12)
model compression (12)
image classification (12)
neural architecture search (11)
object detection (10)
domain adaptation (10)
semantic segmentation (9)
neural network (8)
image retrieval (8)
unsupervised learning (8)
self-supervised learning (8)
few-shot learning (7)
diffusion model (7)
representation learning (7)
knowledge distillation (7)
vision transformer (7)
image restoration (6)
contrastive learning (6)
Papers
WorldGrow: Generating Infinite 3D World
AAAI 2026
Dereflection Any Image with Diffusion Priors and Diversified Data
AAAI 2026
O-DisCo-Edit: Object Distortion Control for Unified Realistic Video Editing
AAAI 2026
A Principle-Driven Adaptive Policy for Group Cognitive Stimulation Dialogue for Elderly with Cognitive Impairment
AAAI 2026
SFedHIFI: Fire Rate-Based Heterogeneous Information Fusion for Spiking Federated Learning
AAAI 2026
Few-step Flow for 3D Generation via Marginal-Data Transport Distillation
AAAI 2026
Tackling View-Dependent Semantics in 3D Language Gaussian Splatting
ICML 2025
CLIP-Adapted Region-to-Text Learning for Generative Open-Vocabulary Semantic Segmentation
ICCV 2025
METEOR: Multi-Encoder Collaborative Token Pruning for Efficient Vision Language Models
ICCV 2025
Incorporating Visual Experts to Resolve the Information Loss in Multimodal Large Language Models
IJCAI 2025
IM-Zero: Instance-level Motion Controllable Video Generation in a Zero-shot Manner
CVPR 2025
Segment Any 3D Gaussians
AAAI 2025
Infinite-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation
AAAI 2025
Boosting Segment Anything Model Towards Open-Vocabulary Learning
AAAI 2025
Optimize Incompatible Parameters Through Compatibility-aware Knowledge Integration
AAAI 2025
Efficient Multi-modal Long Context Learning for Training-free Adaptation
ICML 2025
Incremental Transformer: Efficient Encoder for Incremented Text Over MRC and Conversation Tasks
COLING 2025
Enhancing Pre-trained Representation Classifiability can Boost its Interpretability
ICLR 2025
C-CLIP: Multimodal Continual Learning for Vision-Language Model
ICLR 2025
Towards Multiple Character Image Animation Through Enhancing Implicit Decoupling
ICLR 2025
SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation
ICLR 2025
Aligning Human Motion Generation with Human Perceptions
ICLR 2025
4D Gaussian Splatting for Real-Time Dynamic Scene Rendering
CVPR 2024
Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model
CVPR 2024
Parameter Efficient Fine-tuning via Cross Block Orchestration for Segment Anything Model
CVPR 2024
ControlVideo: Training-free Controllable Text-to-video Generation
ICLR 2024
Towards 3D Molecule-Text Interpretation in Language Models
ICLR 2024
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models
ICLR 2024
Inner Classifier-Free Guidance and Its Taylor Expansion for Diffusion Models
ICLR 2024
BarLeRIa: An Efficient Tuning Framework for Referring Image Segmentation
ICLR 2024
LogFormer: A Pre-train and Tuning Pipeline for Log Anomaly Detection
AAAI 2024
LION: Implicit Vision Prompt Tuning
AAAI 2024
AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation
ECCV 2024
DomainFusion: Generalizing To Unseen Domains with Latent Diffusion Models
ECCV 2024
Cascade-Zero123: One Image to Highly Consistent 3D with Self-Prompted Nearby Views
ECCV 2024
UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding
ECCV 2024
Hybrid Distillation: Connecting Masked Autoencoders with Contrastive Learners
ICLR 2024
HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data
CVPR 2024
OVMR: Open-Vocabulary Recognition with Multi-Modal References
CVPR 2024
GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models
CVPR 2024
GaussianEditor: Editing 3D Gaussians Delicately with Text Instructions
CVPR 2024
Improving Image Restoration through Removing Degradations in Textual Representations
CVPR 2024
Federated Domain Generalization With Generalization Adjustment
CVPR 2023
Being Comes From Not-Being: Open-Vocabulary Text-to-Motion Generation With Wordless Training
CVPR 2023
Adapting Shortcut With Normalizing Flow: An Efficient Tuning Framework for Visual Recognition
CVPR 2023
Open-Set Fine-Grained Retrieval via Prompting Vision-Language Evaluator
CVPR 2023
Distilling Vision-Language Pre-Training To Collaborate With Weakly-Supervised Temporal Action Localization
CVPR 2023
Parameter-efficient Tuning of Large-scale Multimodal Foundation Model
NIPS 2023
Segment Anything in 3D with NeRFs
NIPS 2023
AiluRus: A Scalable ViT Framework for Dense Prediction
NIPS 2023
Learning to Parameterize Visual Attributes for Open-set Fine-grained Retrieval
NIPS 2023
Low-Light Video Enhancement with Synthetic Event Guidance
AAAI 2023
Fine-Grained Retrieval Prompt Tuning
AAAI 2023
ShiftDDPMs: Exploring Conditional Diffusion Models by Shifting Diffusion Trajectories
AAAI 2023
DE-net: Dynamic Text-Guided Image Editing Adversarial Networks
AAAI 2023
Learning from Good Trajectories in Offline Multi-Agent Reinforcement Learning
AAAI 2023
Reasoning over Hierarchical Question Decomposition Tree for Explainable Question Answering
ACL 2023
SDDM: Score-Decomposed Diffusion Models on Manifolds for Unpaired Image-to-Image Translation
ICML 2023
Continual Vision-Language Representation Learning with Off-Diagonal Information
ICML 2023
HiViT: A Simpler and More Efficient Design of Hierarchical Vision Transformer
ICLR 2023
Progressively Compressed Auto-Encoder for Self-supervised Representation Learning
ICLR 2023
The KFIoU Loss for Rotated Object Detection
ICLR 2023
USAGE: A Unified Seed Area Generation Paradigm for Weakly Supervised Semantic Segmentation
ICCV 2023
Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation
ICCV 2023
Focus on Your Target: A Dual Teacher-Student Framework for Domain-Adaptive Semantic Segmentation
ICCV 2023
Gradient-Regulated Meta-Prompt Learning for Generalizable Vision-Language Models
ICCV 2023
Probabilistic Tree-of-thought Reasoning for Answering Knowledge-intensive Complex Questions
EMNLP 2023
Visual Recognition by Request
CVPR 2023
Integrally Pre-Trained Transformer Pyramid Networks
CVPR 2023
TAPE: Task-Agnostic Prior Embedding for Image Restoration
ECCV 2022
Fine-Grained Semantically Aligned Vision-Language Pre-Training
NIPS 2022
ConfounderGAN: Protecting Image Data Privacy with Causal Confounder
NIPS 2022
SiamTrans: Zero-Shot Multi-Frame Image Restoration with Pre-trained Siamese Transformers
AAAI 2022
Can Semantic Labels Assist Self-Supervised Visual Representation Learning?
AAAI 2022
DATA: Domain-Aware and Task-Aware Self-Supervised Learning
CVPR 2022
HyperDet3D: Learning a Scene-Conditioned 3D Object Detector
CVPR 2022
Contextual Similarity Distillation for Asymmetric Image Retrieval
CVPR 2022
MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens
CVPR 2022
One-Bit Active Query With Contrastive Pairs
CVPR 2022
Partial Class Activation Attention for Semantic Segmentation
CVPR 2022
Wnet: Audio-Guided Video Object Segmentation via Wavelet-Based Cross-Modal Denoising Networks
CVPR 2022
DeeCap: Dynamic Early Exiting for Efficient Image Captioning
CVPR 2022
Learning To Learn by Jointly Optimizing Neural Architecture and Weights
CVPR 2022
Domain-Agnostic Prior for Transfer Semantic Segmentation
CVPR 2022
Skeleton-Parted Graph Scattering Networks for 3D Human Motion Prediction
ECCV 2022
Cornerformer: Purifying Instances for Corner-Based Detectors
ECCV 2022
Active Pointly-Supervised Instance Segmentation
ECCV 2022
A Transformer-Based Decoder for Semantic Segmentation with Multi-level Context Mining
ECCV 2022
SdAE: Self-Distillated Masked Autoencoder
ECCV 2022
Vibration-Based Uncertainty Estimation for Learning from Limited Supervision
ECCV 2022
MVP: Multimodality-Guided Visual Pre-training
ECCV 2022
GraphQ IR: Unifying the Semantic Parsing of Graph Query Languages with One Intermediate Representation
EMNLP 2022
ParaMac: A General Unsupervised Paraphrase Generation Framework Leveraging Semantic Constraints and Diversifying Mechanisms
EMNLP 2022
Bag of Instances Aggregation Boosts Self-supervised Distillation
ICLR 2022
Pixel Difference Networks for Efficient Edge Detection
ICCV 2021
Rethinking Rotated Object Detection with Gaussian Wasserstein Distance Loss
ICML 2021
Towards Compact CNNs via Collaborative Compression
CVPR 2021
UnrealPerson: An Adaptive Pipeline Towards Costless Person Re-Identification
CVPR 2021
CondenseNet V2: Sparse Feature Reactivation for Deep Networks
CVPR 2021
TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization
ICCV 2021
Shape Self-Correction for Unsupervised Point Cloud Understanding
ICCV 2021
Divide and Conquer for Single-Frame Temporal Action Localization
ICCV 2021
Visformer: The Vision-Friendly Transformer
ICCV 2021
A Fourier-Based Framework for Domain Generalization
CVPR 2021
ATSO: Asynchronous Teacher-Student Optimization for Semi-Supervised Image Segmentation
CVPR 2021
Fitting the Search Space of Weight-sharing NAS with Graph Convolutional Networks
AAAI 2021
Greedy Gradient Ensemble for Robust Visual Question Answering
ICCV 2021
Handwritten Chinese Font Generation With Collaborative Stroke Refinement
WACV 2021
Appending Adversarial Frames for Universal Video Attack
WACV 2021
Dual Distribution Alignment Network for Generalizable Person Re-Identification
AAAI 2021
Learning High-Precision Bounding Box for Rotated Object Detection via Kullback-Leibler Divergence
NIPS 2021
Rectifying the Shortcut Learning of Background for Few-Shot Learning
NIPS 2021
Omni-GAN: On the Secrets of cGANs and Beyond
ICCV 2021
Foreground Activation Maps for Weakly Supervised Object Localization
ICCV 2021
Differentiable Convolution Search for Point Cloud Processing
ICCV 2021
Video Super-Resolution with Recurrent Structure-Detail Network
ECCV 2020
Wavelet-Based Dual-Branch Network for Image Demoiréing
ECCV 2020
API-Net: Robust Generative Classifier via a Single Discriminator
ECCV 2020
Reinforced Axial Refinement Network for Monocular 3D Object Detection
ECCV 2020
FTL: A universal framework for training low-bit DNNs via Feature Transfer
ECCV 2020
Extract and Merge: Superpixel Segmentation with Regional Attributes
ECCV 2020
A Structured Latent Variable Recurrent Network With Stochastic Attention For Generating Weibo Comments
IJCAI 2020
Single Camera Training for Person Re-Identification
AAAI 2020
Adversarial Domain Adaptation with Domain Mixup
AAAI 2020
Interpretable Visual Reasoning via Probabilistic Formulation under Natural Supervision
ECCV 2020
Polar Relative Positional Encoding for Video-Language Segmentation
IJCAI 2020
AdderNet: Do We Really Need Multiplications in Deep Learning?
CVPR 2020
CARS: Continuous Evolution for Efficient Neural Architecture Search
CVPR 2020
Creating Something From Nothing: Unsupervised Knowledge Distillation for Cross-Modal Hashing
CVPR 2020
Learning to Select Base Classes for Few-Shot Classification
CVPR 2020
Towards Discriminability and Diversity: Batch Nuclear-Norm Maximization Under Label Insufficient Situations
CVPR 2020
A Semi-Supervised Assessor of Neural Architectures
CVPR 2020
Joint Demosaicing and Denoising With Self Guidance
CVPR 2020
Polishing Decision-Based Adversarial Noise With a Customized Sampling
CVPR 2020
Frequency Domain Compact 3D Convolutional Neural Networks
CVPR 2020
Unsupervised Person Re-Identification via Softened Similarity Learning
CVPR 2020
Dynamic Multiscale Graph Neural Networks for 3D Skeleton Based Human Motion Prediction
CVPR 2020
GhostNet: More Features From Cheap Operations
CVPR 2020
Corner Proposal Network for Anchor-free, Two-stage Object Detection
ECCV 2020
Circumventing Outliers of AutoAugment with Knowledge Distillation
ECCV 2020
Social Adaptive Module for Weakly-supervised Group Activity Recognition
ECCV 2020
Bottom-Up Temporal Action Localization with Mutual Regularization
ECCV 2020
Large-Scale Few-Shot Learning via Multi-Modal Knowledge Discovery
ECCV 2020
CooGAN: A Memory-Efficient Framework for High-Resolution Facial Attribute Editing
ECCV 2020
Rethinking the Distribution Gap of Person Re-identification with Camera-based Batch Normalization
ECCV 2020
Self-Adaptively Learning to Demoiré from Focused and Defocused Image Pairs
NIPS 2020
One-bit Supervision for Image Classification
NIPS 2020
PC-DARTS: Partial Channel Connections for Memory-Efficient Architecture Search
ICLR 2020
Spatial-Temporal Graph Convolutional Network for Video-Based Person Re-Identification
CVPR 2020
Projection & Probability-Driven Black-Box Attack
CVPR 2020
Transformation GAN for Unsupervised Image Synthesis and Representation Learning
CVPR 2020
Video Super-Resolution With Temporal Group Attention
CVPR 2020
FM2u-Net: Face Morphological Multi-Branch Network for Makeup-Invariant Face Verification
CVPR 2020
Rethinking Performance Estimation in Neural Architecture Search
CVPR 2020
Gradually Vanishing Bridge for Adversarial Domain Adaptation
CVPR 2020
Label Decoupling Framework for Salient Object Detection
CVPR 2020
Cross-Domain Detection via Graph-Induced Prototype Alignment
CVPR 2020
Learning Temporal Co-Attention Models for Unsupervised Video Action Localization
CVPR 2020
Noise-Aware Fully Webly Supervised Object Detection
CVPR 2020
Network Adjustment: Channel Search Guided by FLOPs Utilization Ratio
CVPR 2020
Multinomial Distribution Learning for Effective Neural Architecture Search
ICCV 2019
Co-Evolutionary Compression for Unpaired Image Translation
ICCV 2019
Accelerate CNN via Recursive Bayesian Pruning
ICCV 2019
Data-Free Learning of Student Networks
ICCV 2019
Global-Local Temporal Representations for Video Person Re-Identification
ICCV 2019
Universal Perturbation Attack Against Image Retrieval
ICCV 2019
CenterNet: Keypoint Triplets for Object Detection
ICCV 2019
Dynamic Points Agglomeration for Hierarchical Point Sets Learning
ICCV 2019
AVT: Unsupervised Learning of Transformation Equivariant Representations by Autoencoding Variational Transformations
ICCV 2019
Iterative Reorganization With Weak Spatial Constraints: Solving Arbitrary Jigsaw Puzzles for Unsupervised Representation Learning
CVPR 2019
Variational Convolutional Neural Network Pruning
CVPR 2019
Towards Visual Feature Translation
CVPR 2019
Modeling Point Clouds With Self-Attention and Gumbel Subset Sampling
CVPR 2019
Structural Relational Reasoning of Point Clouds
CVPR 2019
Learning Channel-Wise Interactions for Binary Convolutional Neural Networks
CVPR 2019
Dense Temporal Convolution Network for Sign Language Translation
IJCAI 2019
Information Competing Process for Learning Diversified Representations
NIPS 2019
Actional-Structural Graph Convolutional Networks for Skeleton-Based Action Recognition
CVPR 2019
BridgeNet: A Continuity-Aware Probabilistic Network for Age Estimation
CVPR 2019
Deep Modular Co-Attention Networks for Visual Question Answering
CVPR 2019
Learning to Learn Image Classifiers With Visual Analogy
CVPR 2019
Deep Fitting Degree Scoring Network for Monocular 3D Object Detection
CVPR 2019
Progressive Differentiable Architecture Search: Bridging the Depth Gap Between Search and Evaluation
ICCV 2019
Person Transfer GAN to Bridge Domain Gap for Person Re-Identification
CVPR 2018
Rethinking Diversified and Discriminative Proposal Generation for Visual Grounding
IJCAI 2018
Collaborative Deep Reinforcement Learning for Multi-Object Tracking
ECCV 2018
The Unmanned Aerial Vehicle Benchmark: Object Detection and Tracking
ECCV 2018
Beyond Part Models: Person Retrieval with Refined Part Pooling (and A Strong Convolutional Baseline)
ECCV 2018
Deep Hashing via Discrepancy Minimization
CVPR 2018
Multi-Cue Correlation Filters for Robust Visual Tracking
CVPR 2018
Zigzag Learning for Weakly Supervised Object Detection
CVPR 2018
Ensemble Diffusion for Retrieval
ICCV 2017
Adaptively Unified Semi-supervised Learning for Cross-Modal Retrieval
IJCAI 2017
Person Re-Identification in the Wild
CVPR 2017
Scalable Person Re-Identification on Supervised Smoothed Manifold
CVPR 2017
Task-Driven Dynamic Fusion: Reducing Ambiguity in Video Description
CVPR 2017
Multimodal Gaussian Process Latent Variable Models With Harmonization
ICCV 2017
Pose-Driven Deep Convolutional Model for Person Re-Identification
ICCV 2017
SORT: Second-Order Response Transform for Visual Recognition
ICCV 2017
Cascaded Interactional Targeting Network for Egocentric Video Analysis
CVPR 2016
InterActive: Inter-Layer Activeness Propagation
CVPR 2016
DisturbLabel: Regularizing CNN on the Loss Layer
CVPR 2016
Picking Deep Filter Responses for Fine-Grained Image Recognition
CVPR 2016
Multi-Task Learning With Low Rank Attribute Embedding for Person Re-Identification
ICCV 2015
Query-Adaptive Late Fusion for Image Search and Person Re-Identification
CVPR 2015
Interaction Part Mining: A Mid-Level Approach for Fine-Grained Action Recognition
CVPR 2015
Similarity Gaussian Process Latent Variable Model for Multi-Modal Data Analysis
ICCV 2015
RIDE: Reversal Invariant Descriptor Enhancement
ICCV 2015
Scalable Person Re-Identification: A Benchmark
ICCV 2015
Semi-supervised Relational Topic Model for Weakly Annotated Image Recognition in Social Media
CVPR 2014
Packing and Padding: Coupled Multi-index for Accurate Image Retrieval
CVPR 2014
Cross-Scale Cost Aggregation for Stereo Matching
CVPR 2014
Orientational Pyramid Matching for Recognizing Indoor Scenes
CVPR 2014
Bayes Merging of Multiple Vocabularies for Scalable Image Retrieval
CVPR 2014
Lp-Norm IDF for Large Scale Image Search
CVPR 2013
Binary Code Ranking with Weighted Hamming Distance
CVPR 2013
Semantic-Aware Co-indexing for Image Retrieval
ICCV 2013
Hierarchical Part Matching for Fine-Grained Visual Categorization
ICCV 2013
Super-Bit Locality-Sensitive Hashing
NIPS 2012