Papers
PVO: Panoptic Visual Odometry
Weicai Ye, Xinyue Lan, Shuo Chen et al.
PVT-SSD: Single-Stage 3D Object Detector With Point-Voxel Transformer
Honghui Yang, Wenxiao Wang, Minghao Chen et al.
PyPose: A Library for Robot Learning With Physics-Based Optimization
Chen Wang, Dasong Gao, Kuan Xu et al.
PyramidFlow: High-Resolution Defect Contrastive Localization Using Pyramid Normalizing Flow
Jiarui Lei, Xiaobo Hu, Yue Wang et al.
Q-DETR: An Efficient Low-Bit Quantized Detection Transformer
Sheng Xu, Yanjing Li, Mingbao Lin et al.
Q: How To Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!
Zaid Khan, Vijay Kumar BG, Samuel Schulter et al.
QPGesture: Quantization-Based and Phase-Guided Motion Matching for Natural Speech-Driven Gesture Generation
Sicheng Yang, Zhiyong Wu, Minglei Li et al.
Quality-Aware Pre-Trained Models for Blind Image Quality Assessment
Kai Zhao, Kun Yuan, Ming Sun et al.
QuantArt: Quantizing Image Style Transfer Towards High Visual Fidelity
Siyu Huang, Jie An, Donglai Wei et al.
Quantitative Manipulation of Custom Attributes on 3D-Aware Image Synthesis
Hoseok Do, EunKyung Yoo, Taehyeong Kim et al.
Quantum-Inspired Spectral-Spatial Pyramid Network for Hyperspectral Image Classification
Jie Zhang, Yongshan Zhang, Yicong Zhou
Quantum Multi-Model Fitting
Matteo Farina, Luca Magri, Willi Menapace et al.
Query-Centric Trajectory Prediction
Zikang Zhou, Jianping Wang, Yung-Hui Li et al.
Query-Dependent Video Representation for Moment Retrieval and Highlight Detection
WonJun Moon, Sangeek Hyun, SangUk Park et al.
R2Former: Unified Retrieval and Reranking Transformer for Place Recognition
Sijie Zhu, Linjie Yang, Chen Chen et al.
RaBit: Parametric Modeling of 3D Biped Cartoon Characters With a Topological-Consistent Dataset
Zhongjin Luo, Shengcai Cai, Jinguo Dong et al.
RA-CLIP: Retrieval Augmented Contrastive Language-Image Pre-Training
Chen-Wei Xie, Siyang Sun, Xiong Xiong et al.
Randomized Adversarial Training via Taylor Expansion
Gaojie Jin, Xinping Yi, Dengyu Wu et al.
Range-Nullspace Video Frame Interpolation With Focalized Motion Estimation
Zhiyang Yu, Yu Zhang, Dongqing Zou et al.
RangeViT: Towards Vision Transformers for 3D Semantic Segmentation in Autonomous Driving
Angelika Ando, Spyros Gidaris, Andrei Bursuc et al.
Ranking Regularization for Critical Rare Classes: Minimizing False Positives at a High True Positive Rate
Kiarash Mohammadi, He Zhao, Mengyao Zhai et al.
Rate Gradient Approximation Attack Threats Deep Spiking Neural Networks
Tong Bu, Jianhao Ding, Zecheng Hao et al.
Rawgment: Noise-Accounted RAW Augmentation Enables Recognition in a Wide Variety of Environments
Masakazu Yoshimura, Junji Otsuka, Atsushi Irie et al.
Raw Image Reconstruction With Learned Compact Metadata
Yufei Wang, Yi Yu, Wenhan Yang et al.