Papers
11,955 papers found
Instruction Tuning-free Visual Token Complement for Multimodal LLMs
Dongsheng Wang, Jiequan Cui, Miaoge Li et al.
Instructive Decoding: Instruction-Tuned Large Language Models are Self-Refiner from Noisy Instructions
Taehyeon Kim, Joonkee Kim, Gihun Lee et al.
InstructPix2NeRF: Instructed 3D Portrait Editing from a Single Image
Jianhui Li, Shilong Liu, Zidong Liu et al.
InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior
Chenguo Lin, Yadong MU
Integrating Planning and Deep Reinforcement Learning via Automatic Induction of Task Substructures
Jung-Chun Liu, Chi-Hsien Chang, Shao-Hua Sun et al.
Integration of Global and Local Representations for Fine-grained Cross-modal Alignment
Seungwan Jin, Hoyoung Choi, Taehyung Noh et al.
Intelligent Switching for Reset-Free RL
Darshan Patil, Janarthanan Rajendran, Glen Berseth et al.
Internal Cross-layer Gradients for Extending Homogeneity to Heterogeneity in Federated Learning
Yun-Hin Chan, Rui Zhou, Running Zhao et al.
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation
Yi Wang, Yinan He, Yizhuo Li et al.
InternVideo2: Scaling Foundation Models for Multimodal Video Understanding
Yi Wang, Kunchang Li, Xinhao Li et al.
InterpGNN: Understand and Improve Generalization Ability of Transdutive GNNs through the Lens of Interplay between Train and Test Nodes
Jiawei Sun, Kailai Li, Ruoxin Chen et al.
Interpretable Diffusion via Information Decomposition
Xianghao Kong, Ollie Liu, Han Li et al.
Interpretable Meta-Learning of Physical Systems
Matthieu Blanke, Marc Lelarge
Interpretable Sparse System Identification: Beyond Recent Deep Learning Techniques on Time-Series Prediction
Xiaoyi Liu, Duxin Chen, Wenjia Wei et al.
Interpretable Temporal Class Activation Representation for Audio Spoofing Detection
Menglu Li, Xiao-Ping Zhang
Interpreting CLIP's Image Representation via Text-Based Decomposition
Yossi Gandelsman, Alexei A Efros, Jacob Steinhardt
Interpreting Robustness Proofs of Deep Neural Networks
Debangshu Banerjee, Avaljot Singh, Gagandeep Singh
Interventional Fairness on Partially Known Causal Graphs: A Constrained Optimization Approach
Aoqi Zuo, Yiqing Li, Susan Wei et al.
Intriguing Properties of Data Attribution on Diffusion Models
Xiaosen Zheng, Tianyu Pang, Chao Du et al.
Intriguing Properties of Generative Classifiers
Priyank Jaini, Kevin Clark, Robert Geirhos
Introducing Routing Functions to Vision-Language Parameter-Efficient Fine-Tuning with Low-Rank Bottlenecks
Tingyu Qu, Tinne Tuytelaars, Marie-Francine Moens
Invariance-based Learning of Latent Dynamics
Kai Lagemann, Christian Lagemann, Sach Mukherjee
Inverse Approximation Theory for Nonlinear Recurrent Neural Networks
Shida Wang, Zhong Li, Qianxiao Li
Investigating the Benefits of Projection Head for Representation Learning
Yihao Xue, Eric Gan, Jiayi Ni et al.
INViTE: INterpret and Control Vision-Language Models with Text Explanations
Haozhe Chen, Junfeng Yang, Carl Vondrick et al.