Papers
Offline Opponent Modeling with Truncated Q-driven Instant Policy Refinement
Yuheng Jing, Kai Li, Bingyun Liu et al.
Offline-to-Online Reinforcement Learning with Classifier-Free Diffusion Generation
Xiao Huang, Xu Liu, Enze Zhang et al.
Off-Policy Actor-Critic for Adversarial Observation Robustness: Virtual Alternative Training via Symmetric Policy Evaluation
Kosuke Nakanishi, Akihiro Kubo, Yuji Yasui et al.
Off-Policy Evaluation under Nonignorable Missing Data
Han Wang, Yang Xu, Wenbin Lu et al.
Of Mice and Machines: A Comparison of Learning Between Real World Mice and RL Agents
Shuo Han, German Espinosa, Junda Huang et al.
Olica: Efficient Structured Pruning of Large Language Models without Retraining
Jiujun He, Huazhen Lin
O-MAPL: Offline Multi-agent Preference Learning
The Viet Bui, Tien Anh Mai, Thanh Hong Nguyen
OmiAD: One-Step Adaptive Masked Diffusion Model for Multi-class Anomaly Detection via Adversarial Distillation
Yaoxuan Feng, Wenchao Chen, Yuxin Li et al.
Omni-Angle Assault: An Invisible and Powerful Physical Adversarial Attack on Face Recognition
Shuai Yuan, Hongwei Li, Rui Zhang et al.
OmniArch: Building Foundation Model for Scientific Computing
Tianyu Chen, Haoyi Zhou, Ying Li et al.
OmniAudio: Generating Spatial Audio from 360-Degree Video
Huadai Liu, Tianyi Luo, Kaicheng Luo et al.
OmniBal: Towards Fast Instruction-Tuning for Vision-Language Models via Omniverse Computation Balance
Yongqiang Yao, Jingru Tan, Feizhao Zhang et al.
On-Device Collaborative Language Modeling via a Mixture of Generalists and Specialists
Dongyang Fan, Bettina Messmer, Nikita Doikov et al.
On Differential Privacy for Adaptively Solving Search Problems via Sketching
Shiyuan Feng, Ying Feng, George Zhaoqi Li et al.
One Arrow, Two Hawks: Sharpness-aware Minimization for Federated Learning via Global Model Trajectory
Yuhang Li, Tong Liu, Yangguang Cui et al.
One Diffusion Step to Real-World Super-Resolution via Flow Trajectory Distillation
Jianze Li, Jiezhang Cao, Yong Guo et al.
One-dimensional Path Convolution
Xuanshu Luo, Martin Werner
One Example Shown, Many Concepts Known! Counterexample-Driven Conceptual Reasoning in Mathematical LLMs
Yinghui Li, Jiayi Kuang, Haojing Huang et al.
On Efficient Estimation of Distributional Treatment Effects under Covariate-Adaptive Randomization
Undral Byambadalai, Tomu Hirata, Tatsushi Oka et al.
OneForecast: A Universal Framework for Global and Regional Weather Forecasting
Yuan Gao, Hao Wu, Ruiqi Shu et al.
One Image is Worth a Thousand Words: A Usability Preservable Text-Image Collaborative Erasing Framework
Feiran Li, Qianqian Xu, Shilong Bao et al.
One Leaf Reveals the Season: Occlusion-Based Contrastive Learning with Semantic-Aware Views for Efficient Visual Representation
Xiaoyu Yang, Lijian Xu, Hongsheng Li et al.
One-Pass Feature Evolvable Learning with Theoretical Guarantees
Cun-Yuan Xing, Meng-Zhang Qian, Wu-Yang Chen et al.
One-Shot Heterogeneous Federated Learning with Local Model-Guided Diffusion Models
Mingzhao Yang, Shangchao Su, Bin Li et al.
One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation
Zhendong Wang, Max Li, Ajay Mandlekar et al.