Ting Yao
92 papers · 2015–2026 · 11 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+13 more ↓ Show less ↑
π Cross-Pollinator (10) π Academic Marathon (10) π Conference Polyglot (11) π§ Keyword Pioneer π Renaissance Researcher (7)
π
Renaissance Researcher
(7)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(107)
π
Conference Loyalist
(43)
π¬
Deep Specialist
(13)
π€
Dynamic Duo
(77)
π
Grand Slam
π
Conference Pioneer
π
Trend Setter
ποΈ
Keyword Collector
(360)
π
Century Club
(91)
π₯
Unstoppable
(11)
β‘
Prolific Year
(11)
Conferences
CVPR (43)
ECCV (12)
ICCV (12)
AAAI (6)
ACL (5)
IJCAI (4)
NIPS (4)
ICML (3)
COLING (1)
ICLR (1)
IJCNLP (1)
Top co-authors
Keywords
diffusion model
(10)
representation learning
(8)
action recognition
(7)
video captioning
(7)
convolutional neural network
(7)
image captioning
(6)
domain adaptation
(6)
image generation
(5)
transfer learning
(5)
video understanding
(5)
object detection
(4)
semantic segmentation
(4)
long short-term memory
(4)
video recognition
(4)
contrastive learning
(4)
recurrent neural network
(4)
transformer architecture
(4)
metric learning
(3)
video generation
(3)
self-supervised learning
(3)
Papers
FreeInpaint: Tuning-free Prompt Alignment and Visual Rationality Enhancement in Image Inpainting
AAAI 2026
Aligning Global Semantics and Local Textures in Generative Video Enhancement
ICCV 2025
Bone Soups: A Seek-and-Soup Model Merging Approach for Controllable Multi-Objective Generation
ACL 2025
Denoising Token Prediction in Masked Autoregressive Models
ICCV 2025
Hierarchical Masked Autoregressive Models with Low-Resolution Token Pivots
ICML 2025
Pursuing Temporal-Consistent Video Virtual Try-On via Dynamic Pose Interaction
CVPR 2025
MotionPro: A Precise Motion Controller for Image-to-Video Generation
CVPR 2025
Ouroboros-Diffusion: Exploring Consistent Content Generation in Tuning-free Long Video Diffusion
AAAI 2025
Discriminative Policy Optimization for Token-Level Reward Models
ICML 2025
Incorporating Visual Correspondence into Diffusion Model for Virtual Try-On
ICLR 2025
VP3D: Unleashing 2D Visual Prompt for Text-to-3D Generation
CVPR 2024
SD-DiT: Unleashing the Power of Self-supervised Discrimination in Diffusion Transformer
CVPR 2024
Learning Spatial Adaptation and Temporal Coherence in Diffusion Models for Video Super-Resolution
CVPR 2024
VideoStudio: Generating Consistent-Content and Multi-Scene Videos
ECCV 2024
DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation
ECCV 2024
Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning
ECCV 2024
Prompt Refinement with Image Pivot for Text-to-Image Generation
ACL 2024
Improving Virtual Try-On with Garment-focused Diffusion Models
ECCV 2024
Improving Text-guided Object Inpainting with Semantic Pre-inpainting
ECCV 2024
Boosting Diffusion Models with Moving Average Sampling in Frequency Domain
CVPR 2024
TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models
CVPR 2024
Semantic-Conditional Diffusion Networks for Image Captioning
CVPR 2023
Knowledge Transfer in Incremental Learning for Multilingual Neural Machine Translation
ACL 2023
Learning To Generate Language-Supervised and Open-Vocabulary Scene Graph Using Pre-Trained Visual-Semantic Space
CVPR 2023
3D Human Pose Estimation With Spatio-Temporal Criss-Cross Attention
CVPR 2023
AnchorFormer: Point Cloud Completion From Discriminative Nodes
CVPR 2023
Transforming Radiance Field With Lipschitz Network for Photorealistic 3D Scene Stylization
CVPR 2023
Learning Orthogonal Prototypes for Generalized Few-Shot Semantic Segmentation
CVPR 2023
HGNet: Learning Hierarchical Geometry From Points, Edges, and Surfaces
CVPR 2023
Modality-Agnostic Debiasing for Single Domain Generalization
CVPR 2023
PointClustering: Unsupervised Point Cloud Pre-Training Using Transformation Invariance in Clustering
CVPR 2023
ObjectFusion: Multi-modal 3D Object Detection with Object-Centric Fusion
ICCV 2023
Learning Neural Implicit Surfaces with Object-Aware Radiance Fields
ICCV 2023
DPTDR: Deep Prompt Tuning for Dense Passage Retrieval
COLING 2022
Dynamic Temporal Filtering In Video Models
ECCV 2022
Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning
ECCV 2022
SPE-Net: Boosting Point Cloud Analysis via Rotation Robustness Enhancement
ECCV 2022
Exploring Structure-Aware Transformer Over Interaction Proposals for Human-Object Interaction Detection
CVPR 2022
Comprehending and Ordering Semantics for Image Captioning
CVPR 2022
Stand-Alone Inter-Frame Attention in Video Models
CVPR 2022
MLP-3D: A MLP-Like 3D Architecture With Grouped Time Mixing
CVPR 2022
Out-of-Distribution Detection via Conditional Kernel Independence Model
NIPS 2022
Generalized One-shot Domain Adaptation of Generative Adversarial Networks
NIPS 2022
Responsive Listening Head Generation: A Benchmark Dataset and Baseline
ECCV 2022
SeCo: Exploring Sequence Supervision for Unsupervised Representation Learning
AAAI 2021
Multi-Lingual Question Generation with Language Agnostic Language Model
ACL 2021
Boosting Video Representation Learning With Multi-Faceted Integration
CVPR 2021
Improving Self-supervised Learning with Automated Unsupervised Outlier Arbitration
NIPS 2021
Representing Videos As Discriminative Sub-Graphs for Action Recognition
CVPR 2021
Multi-Lingual Question Generation with Language Agnostic Language Model
IJCNLP 2021
A Style and Semantic Memory Mechanism for Domain Generalization
ICCV 2021
Motion-Focused Contrastive Learning of Video Representations
ICCV 2021
Condensing a Sequence to One Informative Frame for Video Recognition
ICCV 2021
Scheduled Sampling in Vision-Language Pretraining with Decoupled Encoder-Decoder Network
AAAI 2021
Optimization Planning for 3D ConvNets
ICML 2021
Transferring and Regularizing Prediction for Semantic Segmentation
CVPR 2020
X-Linear Attention Networks for Image Captioning
CVPR 2020
Learning a Unified Sample Weighting Network for Object Detection
CVPR 2020
Joint Contrastive Learning with Infinite Possibilities
NIPS 2020
Learning to Localize Actions from Moments
ECCV 2020
A Self-Training Method for Machine Reading Comprehension with Soft Evidence Extraction
ACL 2020
ReCO: A Large Scale Chinese Reading Comprehension Dataset on Opinion
AAAI 2020
Exploring Category-Agnostic Clusters for Open-Set Domain Adaptation
CVPR 2020
Relation Distillation Networks for Video Object Detection
ICCV 2019
Deep Learning for Video Captioning: A Review
IJCAI 2019
Transferrable Prototypical Networks for Unsupervised Domain Adaptation
CVPR 2019
Exploring Object Relation in Mean Teacher for Cross-Domain Detection
CVPR 2019
Gaussian Temporal Awareness Networks for Action Localization
CVPR 2019
Learning Spatio-Temporal Representation With Local and Global Diffusion
CVPR 2019
Pointing Novel Objects in Image Captioning
CVPR 2019
Customizable Architecture Search for Semantic Segmentation
CVPR 2019
Temporal Deformable Convolutional Encoder-Decoder Networks for Video Captioning
AAAI 2019
Convolutional Auto-encoding of Sentence Topics for Image Paragraph Generation
IJCAI 2019
Hierarchy Parsing for Image Captioning
ICCV 2019
Fully Convolutional Adaptation Networks for Semantic Segmentation
CVPR 2018
Exploring Visual Relationship for Image Captioning
ECCV 2018
Recurrent Tubelet Proposal and Recognition Networks for Action Detection
ECCV 2018
Memory Matching Networks for One-Shot Image Recognition
CVPR 2018
Jointly Localizing and Describing Events for Dense Video Captioning
CVPR 2018
Video Captioning With Transferred Semantic Attributes
CVPR 2017
Boosting Image Captioning With Attributes
ICCV 2017
Deep Quantization: Encoding Convolutional Activations With Deep Generative Model
CVPR 2017
Learning Spatio-Temporal Representation With Pseudo-3D Residual Networks
ICCV 2017
Incorporating Copying Mechanism in Image Captioning for Learning Novel Objects
CVPR 2017
You Lead, We Exceed: Labor-Free Video Concept Learning by Jointly Exploiting Web Videos and Images
CVPR 2016
Jointly Modeling Embedding and Translation to Bridge Video and Language
CVPR 2016
MSR-VTT: A Large Video Description Dataset for Bridging Video and Language
CVPR 2016
Deep Semantic-Preserving and Ranking-Based Hashing for Image Retrieval
IJCAI 2016
Learning Deep Intrinsic Video Representation by Exploring Temporal Coherence and Graph Structure
IJCAI 2016
Highlight Detection With Pairwise Deep Ranking for First-Person Video Summarization
CVPR 2016
Semi-Supervised Domain Adaptation With Subspace Learning for Visual Recognition
CVPR 2015
Learning Query and Image Similarities With Ranking Canonical Correlation Analysis
ICCV 2015