Papers
11,015 papers found
A Design Space Study for LISTA and Beyond
Tianjian Meng, Xiaohan Chen, Yifan Jiang et al.
A Diffusion Theory For Deep Learning Dynamics: Stochastic Gradient Descent Exponentially Favors Flat Minima
Zeke Xie, Issei Sato, Masashi Sugiyama
A Discriminative Gaussian Mixture Model with Sparsity
Hideaki Hayashi, Seiichi Uchida
A Distributional Approach to Controlled Text Generation
Muhammad Khalifa, Hady Elsahar, Marc Dymetman
Adversarially Guided Actor-Critic
Yannis Flet-Berliac, Johan Ferret, Olivier Pietquin et al.
Adversarially-Trained Deep Nets Transfer Better: Illustration on Image Classification
Francisco Utrera, Evan Kravitz, N. Benjamin Erichson et al.
Adversarial score matching and improved sampling for image generation
Alexia Jolicoeur-Martineau, Rémi Piché-Taillefer, Ioannis Mitliagkas et al.
A Geometric Analysis of Deep Generative Image Models and Its Applications
Binxu Wang, Carlos R Ponce
A Good Image Generator Is What You Need for High-Resolution Video Synthesis
Yu Tian, Jian Ren, Menglei Chai et al.
A Gradient Flow Framework For Analyzing Network Pruning
Ekdeep Singh Lubana, Robert P. Dick
A Hypergradient Approach to Robust Regression without Correspondence
Yujia Xie, Yixiu Mao, Simiao Zuo et al.
A Learning Theoretic Perspective on Local Explainability
Jeffrey Li, Vaishnavh Nagarajan, Gregory Plumb et al.
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
Mohit Shridhar, Xingdi Yuan, Marc-Alexandre Cote et al.
Aligning AI With Shared Human Values
Dan Hendrycks, Collin Burns, Steven Basart et al.
A Mathematical Exploration of Why Language Models Help Solve Downstream Tasks
Nikunj Saunshi, Sadhika Malladi, Sanjeev Arora
Analyzing the Expressive Power of Graph Neural Networks in a Spectral Perspective
Muhammet Balcilar, Guillaume Renton, Pierre Héroux et al.
Anatomy of Catastrophic Forgetting: Hidden Representations and Task Semantics
Vinay Venkatesh Ramasesh, Ethan Dyer, Maithra Raghu
Anchor & Transform: Learning Sparse Embeddings for Large Vocabularies
Paul Pu Liang, Manzil Zaheer, Yuan Wang et al.
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov et al.
ANOCE: Analysis of Causal Effects with Multiple Mediators via Constrained Structural Learning
Hengrui Cai, Rui Song, Wenbin Lu
Answering Complex Open-Domain Questions with Multi-Hop Dense Retrieval
Wenhan Xiong, Xiang Li, Srini Iyer et al.
An Unsupervised Deep Learning Approach for Real-World Image Denoising
Dihan Zheng, Sia Huat Tan, Xiaowen Zhang et al.
Anytime Sampling for Autoregressive Models via Ordered Autoencoding
Yilun Xu, Yang Song, Sahaj Garg et al.
A PAC-Bayesian Approach to Generalization Bounds for Graph Neural Networks
Renjie Liao, Raquel Urtasun, Richard Zemel
A Panda? No, It's a Sloth: Slowdown Attacks on Adaptive Multi-Exit Neural Network Inference
Sanghyun Hong, Yigitcan Kaya, Ionuț-Vlad Modoranu et al.