Papers
How to set AdamW’s weight decay as you scale model and dataset size
Xi Wang, Laurence Aitchison
How to Synthesize Text Data without Model Collapse?
Xuekai Zhu, Daixuan Cheng, Hengli Li et al.
How to Train Your Multi-Exit Model? Analyzing the Impact of Training Strategies
Piotr Kubaty, Bartosz Wójcik, Bartłomiej Tomasz Krzepkowski et al.
How Transformers Learn Regular Language Recognition: A Theoretical Study on Training Dynamics and Implicit Bias
Ruiquan Huang, Yingbin Liang, Jing Yang
How Transformers Learn Structured Data: Insights From Hierarchical Filtering
Jerome Garnier-Brun, Marc Mezard, Emanuele Moscato et al.
HPS: Hard Preference Sampling for Human Preference Alignment
Xiandong Zou, Wanyu Lin, Yuchen Li et al.
H-Tuning: Toward Low-Cost and Efficient ECG-based Cardiovascular Disease Detection with Pre-Trained Models
Rushuang Zhou, Yuanting Zhang, Yining Dong
Human-Aligned Image Models Improve Visual Decoding from the Brain
Nona Rajabi, Antonio H. Ribeiro, Miguel Vasco et al.
Human Body Restoration with One-Step Diffusion Model and A New Benchmark
Jue Gong, Jingkai Wang, Zheng Chen et al.
Human Cognition-Inspired Hierarchical Fuzzy Learning Machine
Junbiao Cui, Qin Yue, Jianqing Liang et al.
Hybrid Batch Normalisation: Resolving the Dilemma of Batch Normalisation in Federated Learning
Hongyao Chen, Tianyang Xu, Xiaojun Wu et al.
HybridGS: High-Efficiency Gaussian Splatting Data Compression using Dual-Channel Sparse Representation and Point Cloud Encoder
Qi Yang, Le Yang, Geert Van Der Auwera et al.
Hybrid Quantum-Classical Multi-Agent Pathfinding
Thore Gerlach, Loong Kuan Lee, Frederic Barbaresco et al.
Hybrid Spiking Vision Transformer for Object Detection with Event Cameras
Qi Xu, Jie Deng, Jiangrong Shen et al.
Hyperband-based Bayesian Optimization for Black-box Prompt Selection
Lennart Schneider, Martin Wistuba, Aaron Klein et al.
Hyperbolic-PDE GNN: Spectral Graph Neural Networks in the Perspective of A System of Hyperbolic Partial Differential Equations
Juwei Yue, Haikuo Li, Jiawei Sheng et al.
Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning
Yiran Wang, Chenshu Liu, Yunfan Li et al.
HyperIMTS: Hypergraph Neural Network for Irregular Multivariate Time Series Forecasting
Boyuan Li, Yicheng Luo, Zhen Liu et al.
HyperIV: Real-time Implied Volatility Smoothing
Yongxin Yang, Wenqi Chen, Chao Shu et al.
HyperNear: Unnoticeable Node Injection Attacks on Hypergraph Neural Networks
Tingyi Cai, Yunliang Jiang, Ming Li et al.
Hyperspherical Normalization for Scalable Deep Reinforcement Learning
Hojoon Lee, Youngdo Lee, Takuma Seno et al.
Hyper-Transforming Latent Diffusion Models
Ignacio Peis, Batuhan Koyuncu, Isabel Valera et al.
HyperTree Planning: Enhancing LLM Reasoning via Hierarchical Thinking
Runquan Gui, Zhihai Wang, Jie Wang et al.
Hypo3D: Exploring Hypothetical Reasoning in 3D
Ye Mao, Weixun Luo, Junpeng Jing et al.