Papers
Softmax Output Approximation for Activation Memory-Efficient Training of Attention-based Networks
NeurIPS 2023
Graph Attention Retrospective
JMLR 2023
Semi-Structured Object Sequence Encoders
EMNLP 2023
Block-State Transformers
NeurIPS 2023
Enhancing Argument Structure Extraction with Efficient Leverage of Contextual Information
EMNLP 2023
Energy Transformer
NeurIPS 2023
Neural Functional Transformers
NeurIPS 2023
An Improved End-to-End Audio-Visual Speech Recognition Model
INTERSPEECH 2023