Papers
Non-Asymptotic Length Generalization
Thomas Chen, Tengyu Ma, Zhiyuan Li
Nonlinearly Preconditioned Gradient Methods under Generalized Smoothness
Konstantinos Oikonomidis, Jan Quan, Emanuel Laude et al.
Nonlinear transformers can perform inference-time feature learning
Naoki Nishikawa, Yujin Song, Kazusato Oko et al.
Nonparametric Identification of Latent Concepts
Yujia Zheng, Shaoan Xie, Kun Zhang
Nonparametric Modern Hopfield Models
Jerry Yao-Chieh Hu, Bo-Yu Chen, Dennis Wu et al.
Nonparametric Teaching for Graph Property Learners
Chen Zhang, Weixin Bu, Zeyi Ren et al.
Non-stationary Diffusion For Probabilistic Time Series Forecasting
Weiwei Ye, Zhuopeng Xu, Ning Gui
Non-stationary Online Learning for Curved Losses: Improved Dynamic Regret via Mixability
Yu-Jie Zhang, Peng Zhao, Masashi Sugiyama
Non-Stationary Predictions May Be More Informative: Exploring Pseudo-Labels with a Two-Phase Pattern of Training Dynamics
Hongbin Pei, Jingxin Hai, Yu Li et al.
No-Regret is not enough! Bandits with General Constraints through Adaptive Regret Minimization
Martino Bernasconi, Matteo Castiglioni, Andrea Celli
Normalizing Flows are Capable Generative Models
Shuangfei Zhai, Ruixiang Zhang, Preetum Nakkiran et al.
No Soundness in the Real World: On the Challenges of the Verification of Deployed Neural Networks
Attila Szász, Balázs Bánhelyi, Márk Jelasity
Not all solutions are created equal: An analytical dissociation of functional and representational similarity in deep linear neural networks
Lukas Braun, Erin Grant, Andrew M Saxe
Not All Tokens Matter All The Time: Dynamic Token Aggregation Towards Efficient Detection Transformers
Jiacheng Cheng, Xiwen Yao, Xiang Yuan et al.
Not All Wrong is Bad: Using Adversarial Examples for Unlearning
Ali Ebrahimpour-Boroojeny, Hari Sundaram, Varun Chandrasekaran
No Task Left Behind: Isotropic Model Merging with Common and Task-Specific Subspaces
Daniel Marczak, Simone Magistri, Sebastian Cygert et al.
Novelty Detection in Reinforcement Learning with World Models
Geigh Zollicoffer, Kenneth Eaton, Jonathan C Balloch et al.
NTK-DFL: Enhancing Decentralized Federated Learning in Heterogeneous Settings via Neural Tangent Kernel
Gabriel Thompson, Kai Yue, Chau-Wai Wong et al.
NTPP: Generative Speech Language Modeling for Dual-Channel Spoken Dialogue via Next-Token-Pair Prediction
Qichao Wang, Ziqiao Meng, Wenqian Cui et al.
Objective drives the consistency of representational similarity across datasets
Laure Ciernik, Lorenz Linhardt, Marco Morik et al.
Observation Interference in Partially Observable Assistance Games
Scott Emmons, Caspar Oesterheld, Vincent Conitzer et al.
Occult: Optimizing Collaborative Communications across Experts for Accelerated Parallel MoE Training and Inference
Shuqing Luo, Pingzhi Li, Jie Peng et al.
Offline Learning for Combinatorial Multi-armed Bandits
Xutong Liu, Xiangxiang Dai, Jinhang Zuo et al.
Offline Model-based Optimization for Real-World Molecular Discovery
Dong-Hee Shin, Young-Han Son, Hyun Jung Lee et al.