Papers
Towards Artistic Image Aesthetics Assessment: A Large-Scale Dataset and a New Method
Ran Yi, Haoyuan Tian, Zhihao Gu et al.
Towards a Smaller Student: Capacity Dynamic Distillation for Efficient Image Retrieval
Yi Xie, Huaidong Zhang, Xuemiao Xu et al.
Towards Benchmarking and Assessing Visual Naturalness of Physical World Adversarial Attacks
Simin Li, Shuning Zhang, Gujun Chen et al.
Towards Better Decision Forests: Forest Alternating Optimization
Miguel Á. Carreira-Perpiñán, Magzhan Gabidolla, Arman Zharmagambetov
Towards Better Gradient Consistency for Neural Signed Distance Functions via Level Set Alignment
Baorui Ma, Junsheng Zhou, Yu-Shen Liu et al.
Towards Better Stability and Adaptability: Improve Online Self-Training for Model Adaptation in Semantic Segmentation
Dong Zhao, Shuang Wang, Qi Zang et al.
Towards Bridging the Performance Gaps of Joint Energy-Based Models
Xiulong Yang, Qing Su, Shihao Ji
Towards Building Self-Aware Object Detectors via Reliable Uncertainty Quantification and Calibration
Kemal Oksuz, Tom Joy, Puneet K. Dokania
Towards Compositional Adversarial Robustness: Generalizing Adversarial Training to Composite Semantic Perturbations
Lei Hsiung, Yun-Yun Tsai, Pin-Yu Chen et al.
Towards Domain Generalization for Multi-View 3D Object Detection in Bird-Eye-View
Shuo Wang, Xinhai Zhao, Hai-Ming Xu et al.
Towards Effective Adversarial Textured 3D Meshes on Physical Face Recognition
Xiao Yang, Chang Liu, Longlong Xu et al.
Towards Effective Visual Representations for Partial-Label Learning
Shiyu Xia, Jiaqi Lv, Ning Xu et al.
Towards Efficient Use of Multi-Scale Features in Transformer-Based Object Detectors
Gongjie Zhang, Zhipeng Luo, Zichen Tian et al.
Towards End-to-End Generative Modeling of Long Videos With Memory-Efficient Bidirectional Transformers
Jaehoon Yoo, Semin Kim, Doyup Lee et al.
Towards Fast Adaptation of Pretrained Contrastive Models for Multi-Channel Video-Language Retrieval
Xudong Lin, Simran Tiwari, Shiyuan Huang et al.
Towards Flexible Multi-Modal Document Models
Naoto Inoue, Kotaro Kikuchi, Edgar Simo-Serra et al.
Towards Generalisable Video Moment Retrieval: Visual-Dynamic Injection to Image-Text Pre-Training
Dezhao Luo, Jiabo Huang, Shaogang Gong et al.
Towards High-Quality and Efficient Video Super-Resolution via Spatial-Temporal Data Overfitting
Gen Li, Jie Ji, Minghai Qin et al.
Towards Modality-Agnostic Person Re-Identification With Descriptive Query
Cuiqun Chen, Mang Ye, Ding Jiang
Towards Open-World Segmentation of Parts
Tai-Yu Pan, Qing Liu, Wei-Lun Chao et al.
Towards Practical Plug-and-Play Diffusion Models
Hyojun Go, Yunsung Lee, Jin-Young Kim et al.
Towards Professional Level Crowd Annotation of Expert Domain Data
Pei Wang, Nuno Vasconcelos
Towards Robust Tampered Text Detection in Document Image: New Dataset and New Solution
Chenfan Qu, Chongyu Liu, Yuliang Liu et al.
Towards Scalable Neural Representation for Diverse Videos
Bo He, Xitong Yang, Hanyu Wang et al.