Papers
Is Behavior Cloning All You Need? Understanding Horizon in Imitation Learning
Dylan J. Foster, Adam Block, Dipendra Misra
Is Cross-validation the Gold Standard to Estimate Out-of-sample Model Performance?
Garud Iyengar, Henry Lam, Tianyu Wang
Is Function Similarity Over-Engineered? Building a Benchmark
Rebecca Saul, Chang Liu, Noah Fleischmann et al.
Is Knowledge Power? On the (Im)possibility of Learning from Strategic Interactions
Nivasini Ananthakrishnan, Nika Haghtalab, Chara Podimata et al.
Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning?
Yang Dai, Oubo Ma, Longfei Zhang et al.
Is Multiple Object Tracking a Matter of Specialization?
Gianluca Mancusi, Mattia Bernardi, Aniello Panariello et al.
Is O(log N) practical? Near-Equivalence Between Delay Robustness and Bounded Regret in Bandits and RL
Enoch H. Kang, P. R. Kumar
Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models.
Athanasios Tragakis, Marco Aversa, Chaitanya Kaul et al.
Is Programming by Example Solved by LLMs?
Wen-Ding Li, Kevin Ellis
Is Score Matching Suitable for Estimating Point Processes?
Haoqun Cao, Zizhuo Meng, Tianjun Ke et al.
Is the MMI Criterion Necessary for Interpretability? Degenerating Non-causal Features to Plain Noise for Self-Rationalization
Wei Liu, Zhiying Deng, Zhongyu Niu et al.
Is Value Learning Really the Main Bottleneck in Offline RL?
Seohong Park, Kevin Frans, Sergey Levine et al.
Is Your HD Map Constructor Reliable under Sensor Corruptions?
Xiaoshuai Hao, Mengchuan Wei, Yifan Yang et al.
Is Your LiDAR Placement Optimized for 3D Scene Understanding?
Ye Li, Lingdong Kong, Hanjiang Hu et al.
Iteration Head: A Mechanistic Study of Chain-of-Thought
Vivien Cabannes, Charles Arnal, Wassim Bouaziz et al.
Iteratively Refined Behavior Regularization for Offline Reinforcement Learning
Yi Ma, Jianye Hao, Xiaohan Hu et al.
Iteratively Refined Early Interaction Alignment for Subgraph Matching based Graph Retrieval
Ashwin Ramachandran, Vaibhav Raj, Indrayumna Roy et al.
Iterative Methods via Locally Evolving Set Process
Baojian Zhou, Yifan Sun, Reza Babanezhad Harikandeh et al.
Iterative Reasoning Preference Optimization
Richard Yuanzhe Pang, Weizhe Yuan, Kyunghyun Cho et al.
iVideoGPT: Interactive VideoGPTs are Scalable World Models
Jialong Wu, Shaofeng Yin, Ningya Feng et al.
IWBVT: Instance Weighting-based Bias-Variance Trade-off for Crowdsourcing
Wenjun Zhang, Liangxiao Jiang, Chaoqun Li
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models
Patrick Chao, Edoardo Debenedetti, Alexander Robey et al.
Jailbreaking Large Language Models Against Moderation Guardrails via Cipher Characters
Haibo Jin, Andy Zhou, Joe D. Menke et al.
JaxMARL: Multi-Agent RL Environments and Algorithms in JAX
Alexander Rutherford, Benjamin Ellis, Matteo Gallici et al.
JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis Models
Kun Zhou, Beichen Zhang, Jiapeng Wang et al.