Papers
Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs
Yeonhong Park, Jake Hyun, Sanglyul Cho et al.
AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API Calls
Yu Du, Fangyun Wei, Hongyang Zhang
A Persuasive Approach to Combating Misinformation
Safwan Hossain, Andjela Mladenovic, Yiling Chen et al.
Applying language models to algebraic topology: generating simplicial cycles using multi-labeling in Wu’s formula
Kirill Brilliantov, Fedor Pavutnitskiy, Dmitry Pasechnyuk et al.
Approximate Nearest Neighbor Search with Window Filters
Joshua Engels, Ben Landrum, Shangdi Yu et al.
A Primal-Dual Algorithm for Offline Constrained Reinforcement Learning with Linear MDPs
Kihyuk Hong, Ambuj Tewari
A Probabilistic Approach to Learning the Degree of Equivariance in Steerable CNNs
Lars Veefkind, Gabriele Cesa
A Provable Decision Rule for Out-of-Distribution Detection
Xinsong Ma, Xin Zou, Weiwei Liu
A Provably Effective Method for Pruning Experts in Fine-tuned Sparse Mixture-of-Experts
Mohammed Nowaz Rabbani Chowdhury, Meng Wang, Kaoutar El Maghraoui et al.
APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference
Bowen Zhao, Hannaneh Hajishirzi, Qingqing Cao
AquaLoRA: Toward White-box Protection for Customized Stable Diffusion Models via Watermark LoRA
Weitao Feng, Wenbo Zhou, Jiyan He et al.
A Rate-Distortion View of Uncertainty Quantification
Ifigeneia Apostolopoulou, Benjamin Eysenbach, Frank Nielsen et al.
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL
Yifei Zhou, Andrea Zanette, Jiayi Pan et al.
A Resilient and Accessible Distribution-Preserving Watermark for Large Language Models
Yihan Wu, Zhengmian Hu, Junfeng Guo et al.
Arrows of Time for Large Language Models
Vassilis Papadopoulos, Jérémie Wenger, Clément Hongler
ArtWhisperer: A Dataset for Characterizing Human-AI Interactions in Artistic Creations
Kailas Vodrahalli, James Zou
A sampling theory perspective on activations for implicit neural representations
Hemanth Saratchandran, Sameera Ramasinghe, Violetta Shevchenko et al.
A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models
Taehong Moon, Moonseok Choi, Eunggu Yun et al.
A Single-Loop Robust Policy Gradient Method for Robust Markov Decision Processes
Zhenwei Lin, Chenyu Xue, Qi Deng et al.
A Sober Look at LLMs for Material Discovery: Are They Actually Good for Bayesian Optimization Over Molecules?
Agustinus Kristiadi, Felix Strieth-Kalthoff, Marta Skreta et al.
A Space Group Symmetry Informed Network for O(3) Equivariant Crystal Tensor Prediction
Keqiang Yan, Alexandra Saxton, Xiaofeng Qian et al.
A Sparsity Principle for Partially Observable Causal Representation Learning
Danru Xu, Dingling Yao, Sebastien Lachapelle et al.
Assessing Large Language Models on Climate Information
Jannis Bulian, Mike S. Schäfer, Afra Amini et al.
Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications
Boyi Wei, Kaixuan Huang, Yangsibo Huang et al.
A Statistical Framework for Data-dependent Retrieval-Augmented Models
Soumya Basu, Ankit Singh Rawat, Manzil Zaheer