Papers
4,428 papers found
Tables Guide Vision: Learning to See the Heart through Tabular Data
Marta Hasny, Maxime Di Folco, Keno Bressem et al.
TacticalCalib: End-to-End 6-DoF Camera Pose Regression for Tactical Camera Calibration
Liang Fan, Xiaoqian Liu, Zhi Chen et al.
TalkingHeadBench: A Multi-Modal Benchmark & Analysis of Talking-Head DeepFake Detection
Xinqi Xiong, Prakrut Patel, Qingyuan Fan et al.
TalkingPose: Efficient Face and Gesture Animation with Feedback-guided Diffusion Model
Alireza Javanmardi, Pragati Jaiswal, Tewodros Amberbir Habtegebrial et al.
TA-Prompting: Enhancing Video Large Language Models for Dense Video Captioning via Temporal Anchors
Wei-Yuan Cheng, Kai-Po Chang, Chi-Pin Huang et al.
TaxonRL: Reinforcement Learning with Intermediate Rewards for Interpretable Fine-Grained Visual Reasoning
Maximilian von Klinski, Maximilian Schall
TED-4DGS: Temporally Activated and Embedding-based Deformation for 4DGS Compression
Cheng-Yuan Ho, He-Bi Yang, Jui-Chiu Chiang et al.
Temporal Object Captioning for Street Scene Videos from LiDAR Tracks
Vignesh Gopinathan, Urs Zimmermann, Michael Arnold et al.
Test-Time Adaptation for Video Highlight Detection Using Meta-Auxiliary Learning and Cross-Modality Hallucinations
Zahidul Islam, Sujoy Paul, Mrigank Rochan
Test-Time Adaptation through Semantically-guided Feature Decomposition for Few-shot Chest X-ray Diagnosis
Jayant Mahawar, Angshuman Paul
Test Time Adaptation Using Adaptive Quantile Recalibration
Paria Mehrbod, Pedro Vianna, Geraldin Nanfack et al.
Test-Time Consistency in Vision Language Models
Shih-Han Chou, Shivam Chandhok, James J. Little et al.
Text Slider: Efficient and Plug-and-Play Continuous Concept Control for Image/Video Synthesis via LoRA Adapters
Pin-Yen Chiu, I-Sheng Fang, Jun-Cheng Chen
The Perceptual Observatory Characterizing Robustness and Grounding in MLLMs
Tejas Anvekar, Fenil Bardoliya, Pavan K. Turaga et al.
TiCLS: Tightly Coupled Language Text Spotter
Leeje Jang, Yijun Lin, Yao-Yi Chiang et al.
TimeRefine: Temporal Grounding with Time Refining Video LLM
Xizi Wang, Feng Cheng, Ziyang Wang et al.
Timestamp Query Transformer for Temporal Action Segmentation
Tieqiao Wang, Sinisa Todorovic
TM-Adapter: Temporal Merge Adapter for Efficient Global Temporal Modeling
Woo Joo Hahm, Seungwoo Jang, Hyeon Tak Kim et al.
TopoRec: Point Cloud Recognition Using Topological Data Analysis
Anirban Ghosh, Iliya Kulbaka, Ian Dahlin et al.
Towards Egocentric 3D Hand Pose Estimation in Unseen Domains
Wiktor Mucha, Michael Wray, Martin Kampel
Towards Fast and Scalable Normal Integration using Continuous Components
Francesco Milano, Jen Jen Chung, Lionel Ott et al.
Towards Fine-Grained Adaptation of CLIP via a Self-Trained Alignment Score
Eman Ali, Sathira Silva, Chetan Arora et al.
Towards High-Fidelity, Identity-Preserving Real-Time Makeup Transfer: Decoupling Style Generation
Lydia Chau, Zhi Yu, Ruowei Jiang
Towards Photorealistic Style Transfer with Multimodal Guidance and Robustness to Content Images in Arbitrary Styles
Ruikai Zhou, Yating Liu, Yi Xu
Towards Reliable Test-Time Adaptation: Style Invariance as a Correctness Likelihood
Gilhyun Nam, Taewon Kim, Joonhyun Jeong et al.