Papers
How Do Nonlinear Transformers Learn and Generalize in In-Context Learning?
Hongkang Li, Meng Wang, Songtao Lu et al.
How do Transformers Perform In-Context Autoregressive Learning ?
Michael Eli Sander, Raja Giryes, Taiji Suzuki et al.
How Far Can Fairness Constraints Help Recover From Biased Data?
Mohit Sharma, Amit Deshpande
How Flawed Is ECE? An Analysis via Logit Smoothing
Muthu Chidambaram, Holden Lee, Colin Mcswiggen et al.
How Free is Parameter-Free Stochastic Optimization?
Amit Attia, Tomer Koren
How Graph Neural Networks Learn: Lessons from Training Dynamics
Chenxiao Yang, Qitian Wu, David Wipf et al.
How Interpretable Are Interpretable Graph Neural Networks?
Yongqiang Chen, Yatao Bian, Bo Han et al.
How Language Model Hallucinations Can Snowball
Muru Zhang, Ofir Press, William Merrill et al.
How Learning by Reconstruction Produces Uninformative Features For Perception
Randall Balestriero, Yann Lecun
How Private are DP-SGD Implementations?
Lynn Chua, Badih Ghazi, Pritish Kamath et al.
How Smooth Is Attention?
Valérie Castin, Pierre Ablin, Gabriel Peyré
How Spurious Features are Memorized: Precise Analysis for Random and NTK Features
Simone Bombari, Marco Mondelli
How to Escape Sharp Minima with Random Perturbations
Kwangjun Ahn, Ali Jadbabaie, Suvrit Sra
How to Explore with Belief: State Entropy Maximization in POMDPs
Riccardo Zamboni, Duilio Cirino, Marcello Restelli et al.
How to Leverage Diverse Demonstrations in Offline Imitation Learning
Sheng Yue, Jiani Liu, Xingyuan Hua et al.
How to Make the Gradients Small Privately: Improved Rates for Differentially Private Non-Convex Optimization
Andrew Lowy, Jonathan Ullman, Stephen Wright
How to Trace Latent Generative Model Generated Images without Artificial Watermark?
Zhenting Wang, Vikash Sehwag, Chen Chen et al.
How Transformers Learn Causal Structure with Gradient Descent
Eshaan Nichani, Alex Damian, Jason D. Lee
How Uniform Random Weights Induce Non-uniform Bias: Typical Interpolating Neural Networks Generalize with Narrow Teachers
Gon Buzaglo, Itamar Harel, Mor Shpigel Nacson et al.
How Universal Polynomial Bases Enhance Spectral Graph Neural Networks: Heterophily, Over-smoothing, and Over-squashing
Keke Huang, Yu Guang Wang, Ming Li et al.
How Well Can LLMs Negotiate? NegotiationArena Platform and Analysis
Federico Bianchi, Patrick John Chia, Mert Yuksekgonul et al.
Human Alignment of Large Language Models through Online Preference Optimisation
Daniele Calandriello, Zhaohan Daniel Guo, Remi Munos et al.
Human-like Category Learning by Injecting Ecological Priors from Large Language Models into Neural Networks
Akshay Kumar Jagadish, Julian Coda-Forno, Mirko Thalmann et al.
HumanTOMATO: Text-aligned Whole-body Motion Generation
Shunlin Lu, Ling-Hao Chen, Ailing Zeng et al.
Human vs. Generative AI in Content Creation Competition: Symbiosis or Conflict?
Fan Yao, Chuanhao Li, Denis Nekipelov et al.