Papers
The Ripple Effect: On Unforeseen Complications of Backdoor Attacks
Rui Zhang, Yun Shen, Hongwei Li et al.
Thermalizer: Stable autoregressive neural emulation of spatiotemporal chaos
Christian Pedersen, Laure Zanna, Joan Bruna
The Role of Randomness in Stability
Max Hopkins, Shay Moran
The Role of Sparsity for Length Generalization in LLMs
Noah Golowich, Samy Jelassi, David Brandfonbrener et al.
The Sample Complexity of Online Strategic Decision Making with Information Asymmetry and Knowledge Transportability
Jiachen Hu, Rui Ai, Han Zhong et al.
The Sharpness Disparity Principle in Transformers for Accelerating Language Model Pre-Training
Jinbo Wang, Mingze Wang, Zhanpeng Zhou et al.
The Sparse-Plus-Low-Rank Quasi-Newton Method for Entropic-Regularized Optimal Transport
Chenrui Wang, Yixuan Qiu
The Surprising Agreement Between Convex Optimization Theory and Learning-Rate Scheduling for Large Model Training
Fabian Schaipp, Alexander Hägele, Adrien Taylor et al.
The Surprising Effectiveness of Test-Time Training for Few-Shot Learning
Ekin Akyürek, Mehul Damani, Adam Zweiger et al.
The Synergy of LLMs & RL Unlocks Offline Learning of Generalizable Language-Conditioned Policies with Low-fidelity Data
Thomas Pouplin, Kasia Kobalczyk, Hao Sun et al.
The underlying structures of self-attention: symmetry, directionality, and emergent dynamics in Transformer training
Matteo Saponati, Pascal Josef Sager, Pau Vilimelis Aceituno et al.
The Underlying Universal Statistical Structure of Natural Datasets
Noam Itzhak Levi, Yaron Oz
The Value of Prediction in Identifying the Worst-Off
Unai Fischer-Abaigar, Christoph Kern, Juan Carlos Perdomo
Thickness-aware E(3)-Equivariant 3D Mesh Neural Networks
Sungwon Kim, Namkyeong Lee, Yunyoung Doh et al.
Thinking LLMs: General Instruction Following with Thought Generation
Tianhao Wu, Janice Lan, Weizhe Yuan et al.
Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization
Zishun Yu, Tengyu Xu, Di Jin et al.
Think Twice, Act Once: A Co-Evolution Framework of LLM and RL for Large-Scale Decision Making
Xu Wan, Wenyue Xu, Chao Yang et al.
Three-Dimensional Trajectory Prediction with 3DMoTraj Dataset
Hao Zhou, Xu Yang, Mingyu Fan et al.
Tight and Fast Bounds for Multi-Label Learning
Yi-Fan Zhang, Min-Ling Zhang
Tightening Causal Bounds via Covariate-Aware Optimal Transport
Sirui Lin, Zijun Gao, Jose Blanchet et al.
Tilted Sharpness-Aware Minimization
Tian Li, Tianyi Zhou, Jeff Bilmes
Time-Aware World Model for Adaptive Prediction and Control
Anh N Nhu, Sanghyun Son, Ming Lin
TimeBase: The Power of Minimalism in Efficient Long-term Time Series Forecasting
Qihe Huang, Zhengyang Zhou, Kuo Yang et al.
TimeBridge: Non-Stationarity Matters for Long-term Time Series Forecasting
Peiyuan Liu, Beiliang Wu, Yifan Hu et al.
TimeDART: A Diffusion Autoregressive Transformer for Self-Supervised Time Series Representation
Daoyu Wang, Mingyue Cheng, Zhiding Liu et al.