Papers
OmniBench: A Comprehensive Benchmark Integrating Real-World, Time-sensitive, and Multi-Hop Questions with a Multi-Dimensional Hybrid Evaluation Framework
Wenjie Wang, Yufeng Jiang, Ge Sun et al.
OmniDPO: A Preference Optimization Framework to Address Omni-Modal Hallucination
Junzhe Chen, Tianshu Zhang, Shiyu Huang et al.
Omni-Effects: Unified and Spatially-Controllable Visual Effects Generation
Fangyuan Mao, Aiming Hao, Jintao Chen et al.
Omni-Embed-Audio: Leveraging Multimodal LLMs for Robust Audio-Text Retrieval
HaeJun Yoo, Yongseop Shin, Insung Lee et al.
OmniEvent: Unified Event Representation Learning
Weiqi Yan, Chenlu Lin, Youbiao Wang et al.
Omni-I2C: A Holistic Benchmark for High-Fidelity Image-to-Code Generation
Jiawei Zhou, Chi Zhang, Xiang Feng et al.
OmniNet: A Multi-Modality Neural Network for Robust Remote Respiratory Rate Measurement from Facial Video
Tsai-Ni Lin, An-Sheng Liu, Li-Chen Fu
OmniOData: Unleashing Small Language Models for OData Query Generation with Synthetic Data and Reinforcement Learning
Tao Bai, Zhaochen Li, Hongxin Shao et al.
OmniPT: Unleashing the Potential of Large Vision Language Models for Pedestrian Tracking and Understanding
Teng Fu, Mengyang Zhao, Ke Niu et al.
Omni-RewardBench: Toward a Comprehensive Evaluation of Generative Reward Models Across Modalities
Chi-Min Chan, Yujin Zhou, Pengcheng Wen et al.
OmniScale: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo
Qianli Ma, Yaowei Zheng, Zhelun Shi et al.
OmniSparse: Training-Aware Fine-Grained Sparse Attention for Long-Video MLLMs
Feng Chen, Yefei He, Shaoxuan He et al.
OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding
Dianbing Xi, Jiepeng Wang, Yuanzhi Liang et al.
On Characterizations for Language Generation: Interplay of Hallucinations, Breadth, and Stability
Alkis Kalavasis, Anay Mehrotra, Grigoris Velegkas
OncoCoT: A Temporal-causal Chain-of-Thought Dataset for Oncologic Decision-Making
Peiru Yang, Yudong Li, Shiting Wang et al.
On Condorcet’s Jury Theorem with Abstention
Reshef Meir, Ganesh Ghalme
On Coresets for End-to-end Learning from Crowds
Hang Yang, Zhiwu Li, Witold Pedrycz
One2Seq: One-Token Wise Decoder for Efficient Scene Text Recognition
Zhibin Ma, Pengwen Dai, Wei Zhuo et al.
One Battle After Another: Probing LLMs’ Limits on Multi-Turn Instruction Following with a Benchmark Evolving Framework
Qi Jia, Ye Shen, Xiujie Song et al.
One-by-One Stainer: A Fast and Hallucination Resilient Domain Adaptation Method for Histopathology
Karel Moens, Jonas De Vylder, Tinne Tuytelaars et al.
One Cognitive Loop Is Enough: SODA unlocks Pure-Text Spatial Reasoning in Large Language Models
Shunwen Bai, Jiahuan Zhang, Haoran Huang et al.
One-Cycle Structured Pruning via Stability-Driven Subnetwork Search
Deepak Ghimire, Dayoung Kil, Seonghwan Jeong et al.
OnEDIT: Online Editing with Decoupled Implicit Task for Large Language Models
Chae-Won Lee, Jae-Hong Lee, Ji-Hun Kang et al.
OneFont: A Unified Agent for End-to-End Font Creation
Yingxin Lai, Yufei Liu, Guoqing Yang et al.
One for All: Synthesis-Free Fingerprint Learning for Attribution of In-the-Wild Synthetic Images
Jianwei Fei, Yunshu Dai, Peipeng Yu et al.