Papers
Revisiting Noise Resilience Strategies in Gesture Recognition: Short-Term Enhancement in sEMG Analysis
Weiyu Guo, Ziyue Qiao, Ying Sun et al.
Revisiting Non-Acyclic GFlowNets in Discrete Environments
Nikita Morozov, Ian Maksimov, Daniil Tiapkin et al.
Revisiting the Predictability of Performative, Social Events
Juan Carlos Perdomo
Revisiting Unbiased Implicit Variational Inference
Tobias Pielok, Bernd Bischl, David Rügamer
Revolve: Optimizing AI Systems by Tracking Response Evolution in Textual Optimization
Peiyan Zhang, Haibo Jin, Leyang Hu et al.
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
Shenao Zhang, Zhihan Liu, Boyi Liu et al.
Reward-free World Models for Online Imitation Learning
Shangzhe Li, Zhiao Huang, Hao Su
Reward-Guided Iterative Refinement in Diffusion Models at Test-Time with Applications to Protein and DNA Design
Masatoshi Uehara, Xingyu Su, Yulai Zhao et al.
Reward-Guided Prompt Evolving in Reinforcement Learning for LLMs
Ziyu Ye, Rishabh Agarwal, Tianqi Liu et al.
Reward-Guided Speculative Decoding for Efficient LLM Reasoning
Baohao Liao, Yuhui Xu, Hanze Dong et al.
Reward Modeling with Ordinal Feedback: Wisdom of the Crowd
Shang Liu, Yu Pan, Guanting Chen et al.
Reward Translation via Reward Machine in Semi-Alignable MDPs
Yun Hua, Haosheng Chen, Wenhao Li et al.
Rhomboid Tiling for Geometric Graph Deep Learning
Yipeng Zhang, Longlong Li, Kelin Xia
Riemannian Diffusion Adaptation for Distributed Optimization on Manifolds
Xiuheng Wang, Ricardo Augusto Borsoi, Cédric Richard et al.
Riemann Tensor Neural Networks: Learning Conservative Systems with Physics-Constrained Networks
Anas Jnini, Lorenzo Breschi, Flavio Vella
RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers
Min Zhao, Guande He, Yixiao Chen et al.
Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift
Seongho Son, William Bankes, Sayak Ray Chowdhury et al.
Right Time to Learn: Promoting Generalization via Bio-inspired Spacing Effect in Knowledge Distillation
Guanglong Sun, Hongwei Yan, Liyuan Wang et al.
Ringmaster ASGD: The First Asynchronous SGD with Optimal Time Complexity
Arto Maranjyan, Alexander Tyurin, Peter Richtárik
R.I.P.: Better Models by Survival of the Fittest Prompts
Ping Yu, Weizhe Yuan, Olga Golovneva et al.
RISE: Radius of Influence based Subgraph Extraction for 3D Molecular Graph Explanation
Jingxiang Qu, Wenhan Gao, Jiaxing Zhang et al.
Risk and cross validation in ridge regression with correlated samples
Alexander Atanasov, Jacob A Zavatone-Veth, Cengiz Pehlevan
Risk-Sensitive Theory of Mind: Coordinating with Agents of Unknown Bias using Cumulative Prospect Theory
Mason O. Smith, Wenlong Zhang
RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning
Jonas Gehring, Kunhao Zheng, Jade Copet et al.
RLTHF: Targeted Human Feedback for LLM Alignment
Yifei Xu, Tusher Chakraborty, Emre Kiciman et al.