Papers
Why (and When) does Local SGD Generalize Better than SGD?
Xinran Gu, Kaifeng Lyu, Longbo Huang et al.
WikiWhy: Answering and Explaining Cause-and-Effect Questions
Matthew Ho, Aditya Sharma, Justin Chang et al.
WiNeRT: Towards Neural Ray Tracing for Wireless Channel Modelling and Differentiable Simulations
Tribhuvanesh Orekondy, Pratik Kumar, Shreya Kadambi et al.
Winning Both the Accuracy of Floating Point Activation and the Simplicity of Integer Arithmetic
Yulhwa Kim, Jaeyong Jang, Jehun Lee et al.
Win: Weight-Decay-Integrated Nesterov Acceleration for Adaptive Gradient Algorithms
Pan Zhou, Xingyu Xie, Shuicheng Yan
Words are all you need? Language as an approximation for human similarity judgments
Raja Marjieh, Pol Van Rijn, Ilia Sucholutsky et al.
Write and Paint: Generative Vision-Language Models are Unified Modal Learners
Shizhe Diao, Wangchunshu Zhou, Xinsong Zhang et al.
Your Contrastive Learning Is Secretly Doing Stochastic Neighbor Embedding
Tianyang Hu, Zhili Liu, Fengwei Zhou et al.
Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model
Yinhuai Wang, Jiwen Yu, Jian Zhang
Zeroth-Order Optimization with Trajectory-Informed Derivative Estimation
Yao Shu, Zhongxiang Dai, Weicong Sng et al.
ZiCo: Zero-shot NAS via inverse Coefficient of Variation on Gradients
Guihong Li, Yuedong Yang, Kartikeya Bhardwaj et al.
$\beta$-Intact-VAE: Identifying and Estimating Causal Effects under Limited Overlap
Pengzhou Abel Wu, Kenji Fukumizu
$\mathrm{SO}(2)$-Equivariant Reinforcement Learning
Dian Wang, Robin Walters, Robert Platt
$\pi$BO: Augmenting Acquisition Functions with User Beliefs for Bayesian Optimization
Carl Hvarfner, Danny Stoll, Artur Souza et al.
8-bit Optimizers via Block-wise Quantization
Tim Dettmers, Mike Lewis, Sam Shleifer et al.
Ab-Initio Potential Energy Surfaces by Pairing GNNs with Neural Wave Functions
Nicholas Gao, Stephan Günnemann
A Biologically Interpretable Graph Convolutional Network to Link Genetic Risk Pathways and Imaging Phenotypes of Disease
Sayan Ghosal, Qiang Chen, Giulio Pergola et al.
Accelerated Policy Learning with Parallel Differentiable Simulation
Jie Xu, Viktor Makoviychuk, Yashraj Narang et al.
Acceleration of Federated Learning with Alleviated Forgetting in Local Training
Chencheng Xu, Zhiwei Hong, Minlie Huang et al.
A Class of Short-term Recurrence Anderson Mixing Methods and Their Applications
Fuchao Wei, Chenglong Bao, Yang Liu
A Comparison of Hamming Errors of Representative Variable Selection Methods
Tracy Ke, Longlin Wang
A Conditional Point Diffusion-Refinement Paradigm for 3D Point Cloud Completion
Zhaoyang Lyu, Zhifeng Kong, Xudong Xu et al.
Active Hierarchical Exploration with Stable Subgoal Representation Learning
Siyuan Li, Jin Zhang, Jianhao Wang et al.
Actor-critic is implicitly biased towards high entropy optimal policies
Yuzheng Hu, Ziwei Ji, Matus Telgarsky
Actor-Critic Policy Optimization in a Large-Scale Imperfect-Information Game
Haobo Fu, Weiming Liu, Shuang Wu et al.