Papers
SepTr: Separable Transformer for Audio Spectrogram Processing
INTERSPEECH 2022
CT-SAT: Contextual Transformer for Sequential Audio Tagging
INTERSPEECH 2022
ATST: Audio Representation Learning with Teacher-Student Transformer
INTERSPEECH 2022
Low-Latency Online Streaming VideoQA Using Audio-Visual Transformers
INTERSPEECH 2022
E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR
INTERSPEECH 2022