Papers
T2VLAD: Global-Local Sequence Alignment for Text-Video Retrieval
Xiaohan Wang, Linchao Zhu, Yi Yang
Tackling the Ill-Posedness of Super-Resolution Through Adaptive Target Generation
Younghyun Jo, Seoung Wug Oh, Peter Vajda et al.
Taming Transformers for High-Resolution Image Synthesis
Patrick Esser, Robin Rombach, Bjorn Ommer
Tangent Space Backpropagation for 3D Transformation Groups
Zachary Teed, Jia Deng
TAP: Text-Aware Pre-Training for Text-VQA and Text-Caption
Zhengyuan Yang, Yijuan Lu, Jianfeng Wang et al.
Target-Aware Object Discovery and Association for Unsupervised Video Multi-Object Segmentation
Tianfei Zhou, Jianwu Li, Xueyi Li et al.
Task-Aware Variational Adversarial Active Learning
Kwanyoung Kim, Dongwon Park, Kwang In Kim et al.
Taskology: Utilizing Task Relations at Scale
Yao Lu, Soren Pirk, Jan Dlabal et al.
Task Programming: Learning Data Efficient Behavior Representations
Jennifer J. Sun, Ann Kennedy, Eric Zhan et al.
TDN: Temporal Difference Networks for Efficient Action Recognition
Limin Wang, Zhan Tong, Bin Ji et al.
Teachers Do More Than Teach: Compressing Image-to-Image Models
Qing Jin, Jian Ren, Oliver J. Woodford et al.
TearingNet: Point Cloud Autoencoder To Learn Topology-Friendly Representations
Jiahao Pang, Duanshun Li, Dong Tian
TediGAN: Text-Guided Diverse Face Image Generation and Manipulation
Weihao Xia, Yujiu Yang, Jing-Hao Xue et al.
Temporal Action Segmentation From Timestamp Supervision
Zhe Li, Yazan Abu Farha, Jurgen Gall
Temporal Context Aggregation Network for Temporal Action Proposal Refinement
Zhiwu Qing, Haisheng Su, Weihao Gan et al.
Temporally-Weighted Hierarchical Clustering for Unsupervised Action Segmentation
Saquib Sarfraz, Naila Murray, Vivek Sharma et al.
Temporal Modulation Network for Controllable Space-Time Video Super-Resolution
Gang Xu, Jun Xu, Zhen Li et al.
Temporal Query Networks for Fine-Grained Video Understanding
Chuhan Zhang, Ankush Gupta, Andrew Zisserman
Temporal-Relational CrossTransformers for Few-Shot Action Recognition
Toby Perrett, Alessandro Masullo, Tilo Burghardt et al.
TesseTrack: End-to-End Learnable Multi-Person Articulated 3D Pose Tracking
N Dinesh Reddy, Laurent Guigues, Leonid Pishchulin et al.
Test-Time Fast Adaptation for Dynamic Scene Deblurring via Meta-Auxiliary Learning
Zhixiang Chi, Yang Wang, Yuanhao Yu et al.
TextOCR: Towards Large-Scale End-to-End Reasoning for Arbitrary-Shaped Scene Text
Amanpreet Singh, Guan Pang, Mandy Toh et al.
The Affective Growth of Computer Vision
Norman Makoto Su, David J. Crandall
The Blessings of Unlabeled Background in Untrimmed Videos
Yuan Liu, Jingyuan Chen, Zhenfang Chen et al.
The Heterogeneity Hypothesis: Finding Layer-Wise Differentiated Network Architectures
Yawei Li, Wen Li, Martin Danelljan et al.