Papers
Towards Efficient Use of Multi-Scale Features in Transformer-Based Object Detectors
Gongjie Zhang, Zhipeng Luo, Zichen Tian et al.
Towards End-to-End Generative Modeling of Long Videos With Memory-Efficient Bidirectional Transformers
Jaehoon Yoo, Semin Kim, Doyup Lee et al.
Towards Fast Adaptation of Pretrained Contrastive Models for Multi-Channel Video-Language Retrieval
Xudong Lin, Simran Tiwari, Shiyuan Huang et al.
Towards Flexible Multi-Modal Document Models
Naoto Inoue, Kotaro Kikuchi, Edgar Simo-Serra et al.
Towards Generalisable Video Moment Retrieval: Visual-Dynamic Injection to Image-Text Pre-Training
Dezhao Luo, Jiabo Huang, Shaogang Gong et al.
Towards High-Quality and Efficient Video Super-Resolution via Spatial-Temporal Data Overfitting
Gen Li, Jie Ji, Minghai Qin et al.
Towards Modality-Agnostic Person Re-Identification With Descriptive Query
Cuiqun Chen, Mang Ye, Ding Jiang
Towards Open-World Segmentation of Parts
Tai-Yu Pan, Qing Liu, Wei-Lun Chao et al.
Towards Practical Plug-and-Play Diffusion Models
Hyojun Go, Yunsung Lee, Jin-Young Kim et al.
Towards Professional Level Crowd Annotation of Expert Domain Data
Pei Wang, Nuno Vasconcelos
Towards Robust Tampered Text Detection in Document Image: New Dataset and New Solution
Chenfan Qu, Chongyu Liu, Yuliang Liu et al.
Towards Scalable Neural Representation for Diverse Videos
Bo He, Xitong Yang, Hanyu Wang et al.
Towards Stable Human Pose Estimation via Cross-View Fusion and Foot Stabilization
Li’an Zhuo, Jian Cao, Qi Wang et al.
Toward Stable, Interpretable, and Lightweight Hyperspectral Super-Resolution
Wen-jin Guo, Weiying Xie, Kai Jiang et al.
Towards Transferable Targeted Adversarial Examples
Zhibo Wang, Hongshan Yang, Yunhe Feng et al.
Towards Trustable Skin Cancer Diagnosis via Rewriting Model's Decision
Siyuan Yan, Zhen Yu, Xuelin Zhang et al.
Towards Unbiased Volume Rendering of Neural Implicit Surfaces With Geometry Priors
Yongqiang Zhang, Zhipeng Hu, Haoqian Wu et al.
Towards Unified Scene Text Spotting Based on Sequence Generation
Taeho Kil, Seonghyeon Kim, Sukmin Seo et al.
Towards Universal Fake Image Detectors That Generalize Across Generative Models
Utkarsh Ojha, Yuheng Li, Yong Jae Lee
Towards Unsupervised Object Detection From LiDAR Point Clouds
Lunjun Zhang, Anqi Joyce Yang, Yuwen Xiong et al.
Toward Verifiable and Reproducible Human Evaluation for Text-to-Image Generation
Mayu Otani, Riku Togashi, Yu Sawai et al.
TRACE: 5D Temporal Regression of Avatars With Dynamic Cameras in 3D Environments
Yu Sun, Qian Bao, Wu Liu et al.
Trace and Pace: Controllable Pedestrian Animation via Guided Trajectory Diffusion
Davis Rempe, Zhengyi Luo, Xue Bin Peng et al.
Tracking Multiple Deformable Objects in Egocentric Videos
Mingzhen Huang, Xiaoxing Li, Jun Hu et al.