conftrace_

← Architectures

Deep Learning › Architectures ›

Transformers

9,294 papers

Papers per year

Papers

Improving Transformer-based Conversational ASR by Inter-Sentential Attention Mechanism INTERSPEECH 2022

A Deep Learning Platform for Language Education Research and Development INTERSPEECH 2022

Span Classification with Structured Information for Disfluency Detection in Spoken Utterances INTERSPEECH 2022

FiLM Conditioning with Enhanced Feature to the Transformer-based End-to-End Noisy Speech Recognition INTERSPEECH 2022

SepTr: Separable Transformer for Audio Spectrogram Processing INTERSPEECH 2022

Similarity and Content-based Phonetic Self Attention for Speech Recognition INTERSPEECH 2022

CT-SAT: Contextual Transformer for Sequential Audio Tagging INTERSPEECH 2022

iCNN-Transformer: An improved CNN-Transformer with Channel-spatial Attention and Keyword Prediction for Automated Audio Captioning INTERSPEECH 2022

ATST: Audio Representation Learning with Teacher-Student Transformer INTERSPEECH 2022

KaraTuner: Towards End-to-End Natural Pitch Correction for Singing Voice in Karaoke INTERSPEECH 2022

Towards high-fidelity singing voice conversion with acoustic reference and contrastive predictive coding INTERSPEECH 2022

Towards Improving the Expressiveness of Singing Voice Synthesis with BERT Derived Semantic Information INTERSPEECH 2022

RefTextLAS: Reference Text Biased Listen, Attend, and Spell Model For Accurate Reading Evaluation INTERSPEECH 2022

CoCA-MDD: A Coupled Cross-Attention based Framework for Streaming Mispronunciation Detection and Diagnosis INTERSPEECH 2022

Automatic Prosody Evaluation of L2 English Read Speech in Reference to Accent Dictionary with Transformer Encoder INTERSPEECH 2022

Low-Latency Online Streaming VideoQA Using Audio-Visual Transformers INTERSPEECH 2022

Microphone Array Channel Combination Algorithms for Overlapped Speech Detection INTERSPEECH 2022

Improving Hypernasality Estimation with Automatic Speech Recognition in Cleft Palate Speech INTERSPEECH 2022

Conformer Based Elderly Speech Recognition System for Alzheimer’s Disease Detection INTERSPEECH 2022

K-Wav2vec 2.0: Automatic Speech Recognition based on Joint Decoding of Graphemes and Syllables INTERSPEECH 2022

E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR INTERSPEECH 2022

An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks INTERSPEECH 2022

Investigation of Ensemble features of Self-Supervised Pretrained Models for Automatic Speech Recognition INTERSPEECH 2022

Tiny-Sepformer: A Tiny Time-Domain Transformer Network For Speech Separation INTERSPEECH 2022

WA-Transformer: Window Attention-based Transformer with Two-stage Strategy for Multi-task Audio Source Separation INTERSPEECH 2022