Papers
Occupancy-based Policy Gradient: Estimation, Convergence, and Optimality
Audrey Huang, Nan Jiang
Octopus: A Multi-modal LLM with Parallel Recognition and Sequential Understanding
Chuyang Zhao, Yuxin Song, Junru Chen et al.
OctreeOcc: Efficient and Multi-Granularity Occupancy Prediction Using Octree Queries
Yuhang Lu, Xinge Zhu, Tai Wang et al.
ODGEN: Domain-specific Object Detection Data Generation with Diffusion Models
Jingyuan Zhu, Shiyu Li, Yuxuan Liu et al.
ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian Splattings
Suyoung Lee, Jaeyoung Chung, Jaeyoo Huh et al.
ODRL: A Benchmark for Off-Dynamics Reinforcement Learning
Jiafei Lyu, Kang Xu, Jiacheng Xu et al.
Off-Dynamics Reinforcement Learning via Domain Adaptation and Reward Augmented Imitation
Yihong Guo, Yixuan Wang, Yuanyuan Shi et al.
Offline Behavior Distillation
Shiye Lei, Sen Zhang, Dacheng Tao
Offline Multitask Representation Learning for Reinforcement Learning
Haque Ishfaq, Thanh Nguyen-Tang, Songtao Feng et al.
Offline Oracle-Efficient Learning for Contextual MDPs via Layerwise Exploration-Exploitation Tradeoff
Jian Qian, Haichen Hu, David Simchi-Levi
Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression
Yixiu Mao, Qi Wang, Chen Chen et al.
Off-policy estimation with adaptively collected data: the power of online learning
Jeonghwan Lee, Cong Ma
Off-Policy Selection for Initiating Human-Centric Experimental Design
Ge Gao, Xi Yang, Qitong Gao et al.
Off to new Shores: A Dataset & Benchmark for (near-)coastal Flood Inundation Forecasting
Brandon Victor, Mathilde Letard, Peter Naylor et al.
Oja's Algorithm for Streaming Sparse PCA
Syamantak Kumar, Purnamrita Sarkar
OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
Zhen Huang, Zengzhi Wang, Shijie Xia et al.
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding
Tao Zhang, Xiangtai Li, Hao Fei et al.
Omnigrasp: Grasping Diverse Objects with Simulated Humanoids
Zhengyi Luo, Jinkun Cao, Sammy Christen et al.
OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents
Zihao Wang, Shaofei Cai, Zhancun Mu et al.
OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation
Junke Wang, Yi Jiang, Zehuan Yuan et al.
On $f$-Divergence Principled Domain Adaptation: An Improved Framework
Ziqiao Wang, Yongyi Mao
On Affine Homotopy between Language Encoders
Robin S. M. Chan, Reda Boumasmoud, Anej Svete et al.
On Causal Discovery in the Presence of Deterministic Relations
Loka Li, Haoyue Dai, Hanin Al Ghothani et al.
Once Read is Enough: Domain-specific Pretraining-free Language Models with Cluster-guided Sparse Experts for Long-tail Domain Knowledge
Fang Dong, Mengyi Chen, Jixian Zhou et al.
On conditional diffusion models for PDE simulations
Aliaksandra Shysheya, Cristiana Diaconu, Federico Bergamin et al.