Papers
4,428 papers found
Language Integration in Fine-Tuning Multimodal Large Language Models for Image-Based Regression
Roy H. Jennings, Genady Paikin, Roy Shaul et al.
Large Sign Language Models: Toward 3D American Sign Language Translation
Sen Zhang, Xiaoxiao He, Di Liu et al.
LASER: Lip Landmark Assisted Speaker Detection for Robustness
Le Thien Phuc Nguyen, Zhuoran Yu, Yong Jae Lee
LASOR: Towards Clinically Transparent and Explainable Ophthalmic Report Generation via Lesion-Aware Segmentation
Jian Park, Hyunseon Won, JeeEun Kim et al.
Latent Uncertainty-Aware Multi-View SDF Scan Completion
Faezeh Zakeri, Lukas Ruppert, Raphael Braun et al.
Layout Anything: One Transformer for Universal Room Layout Estimation
Md Sohag Mia, Muhammad Abdullah Adnan
Learnable Query-Enhanced Pose Transformation
Yi-Zhen Wang, Hong-Han Shuai
Learning Action Hierarchies via Hybrid Geometric Diffusion
Arjun Ramesh Kaushik, Nalini K. Ratha, Venu Govindaraju
Learning Beyond Labels: Self-Supervised Handwritten Text Recognition
Shree Mitra, Ajoy Mondal, C.V. Jawahar
Learning Compact Video Representations for Efficient Long-form Video Understanding in Large Multimodal Models
Yuxiao Chen, Jue Wang, Zhikang Zhang et al.
Learning from Unknown for Open-Set Test-Time Adaptation
Taki Hasan Rafi, Amit Agarwal, Hitesh L. Patel et al.
Learning Group Actions In Disentangled Latent Image Representations
Farhana Hossain Swarnali, Miaomiao Zhang, Tonmoy Hossain
Learning Spatio-temporal Feature Representations for Video-based Gaze Estimation
Alexandre Personnic, Mihai Bace
Learning Subglacial Bed Topography from Sparse Radar with Physics-Guided Residuals
Bayu Adhi Tama, Jianwu Wang, Vandana Janeja et al.
Learning to Animate Images from A Few Videos to Portray Delicate Human Actions
Haoxin Li, Yingchen Yu, Qilong Wu et al.
Learning Unified Spatio-temporal Representations for Efficient Compressed Video Understanding
Shristi Das Biswas, Efstathia Soufleri, Arani Roy et al.
LENVIZ: A High-Resolution Low-Exposure Night Vision Benchmark Dataset
Manjushree Aithal, Rosaura G VidalMata, Manikandtan Kartha et al.
Leveraging Pretrained Representations for Cross-Modal Point Cloud Completion
Kshitij Kale, Hrishikesh U, V sreenidhe et al.
Leveraging Semantic Attribute Binding for Free-Lunch Color Control in Diffusion Models
Héctor Laria, Alexandra Gomez-Villa, Jiang Qin et al.
Leveraging Sparsity for Privacy in Collaborative Inference
Maximilian Andreas Hoefler, Karsten Mueller, Wojciech Samek
LiDAR-DHMT: LiDAR-Adaptive Dual Hierarchical Mask Transformer for Robust Freespace Detection and Semantic Segmentation
Siyu Chen, Ting Han, Changshe Zhang et al.
LightGazeNet: A Lightweight GNN-based Architecture for Gaze Estimation
Heena Patel, Anirban Chowdhury, Pooja Jigar Choksy et al.
LighthouseGS: Indoor Structure-aware 3D Gaussian Splatting for Panorama-Style Mobile Captures
Seungoh Han, Jaehoon Jang, Hyunsu Kim et al.
Line Art Colorization with Offset Prior-based Diffusion Model
Xuan Zhu, Miao Cao, Fang-Lue Zhang et al.