Papers
11,015 papers found
Alternating Differentiation for Optimization Layers
Haixiang Sun, Ye Shi, Jingya Wang et al.
A Message Passing Perspective on Learning Dynamics of Contrastive Learning
Yifei Wang, Qi Zhang, Tianqi Du et al.
A Minimalist Dataset for Systematic Generalization of Perception, Syntax, and Semantics
Qing Li, Siyuan Huang, Yining Hong et al.
A Mixture-of-Expert Approach to RL-based Dialogue Management
Yinlam Chow, Azamat Tulepbergenov, Ofir Nachum et al.
A Model or 603 Exemplars: Towards Memory-Efficient Class-Incremental Learning
Da-Wei Zhou, Qi-Wei Wang, Han-Jia Ye et al.
Amortised Invariance Learning for Contrastive Self-Supervision
Ruchika Chavhan, Jan Stuehmer, Calum Heggan et al.
A Multi-Grained Self-Interpretable Symbolic-Neural Model For Single/Multi-Labeled Text Classification
Xiang Hu, XinYu KONG, Kewei Tu
An Adaptive Policy to Employ Sharpness-Aware Minimization
Weisen Jiang, Hansi Yang, Yu Zhang et al.
An Additive Instance-Wise Approach to Multi-class Model Interpretation
Vy Vo, Van Nguyen, Trung Le et al.
Analog Bits: Generating Discrete Data using Diffusion Models with Self-Conditioning
Ting Chen, Ruixiang ZHANG, Geoffrey Hinton
Analogy-Forming Transformers for Few-Shot 3D Parsing
Nikolaos Gkanatsios, Mayank Singh, Zhaoyuan Fang et al.
Analyzing Tree Architectures in Ensembles via Neural Tangent Kernel
Ryuichi Kanoh, Mahito Sugiyama
Anamnesic Neural Differential Equations with Orthogonal Polynomial Projections
Edward De Brouwer, Rahul G Krishnan
An efficient encoder-decoder architecture with top-down attention for speech separation
Kai Li, Runxuan Yang, Xiaolin Hu
An Equal-Size Hard EM Algorithm for Diverse Dialogue Generation
Yuqiao Wen, Yongchang Hao, Yanshuai Cao et al.
A Neural Mean Embedding Approach for Back-door and Front-door Adjustment
Liyuan Xu, Arthur Gretton
A new characterization of the edge of stability based on a sharpness measure aware of batch gradient distribution
Sungyoon Lee, Cheongjae Jang
An Exact Poly-Time Membership-Queries Algorithm for Extracting a Three-Layer ReLU Network
Amit Daniely, Elad Granot
An Extensible Multi-modal Multi-task Object Dataset with Materials
Trevor Scott Standley, Ruohan Gao, Dawn Chen et al.
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
Rinon Gal, Yuval Alaluf, Yuval Atzmon et al.
Anisotropic Message Passing: Graph Neural Networks with Directional and Long-Range Interactions
Moritz Thürlemann, Sereina Riniker
A Non-Asymptotic Analysis of Oversmoothing in Graph Neural Networks
Xinyi Wu, Zhengdao Chen, William Wei Wang et al.
A Non-monotonic Self-terminating Language Model
Eugene Choi, Kyunghyun Cho, Cheolhyoung Lee
Anti-Symmetric DGN: a stable architecture for Deep Graph Networks
Alessio Gravina, Davide Bacciu, Claudio Gallicchio
AnyDA: Anytime Domain Adaptation
Omprakash Chakraborty, Aadarsh Sahoo, Rameswar Panda et al.