Co-occurring keywords
Papers
Temporal Working Memory: Query-Guided Segment Refinement for Enhanced Multimodal Understanding
NAACL 2025
TSAM: Temporal SAM Augmented with Multimodal Prompts for Referring Audio-Visual Segmentation
CVPR 2025
FASTer: Focal token Acquiring-and-Scaling Transformer for Long-term 3D Objection Detection
CVPR 2025
TCPFormer: Learning Temporal Correlation with Implicit Pose Proxy for 3D Human Pose Estimation
AAAI 2025