Pengyuan Zhang
30 papers · 2017–2024 · 1 conference · across top CS/AI conferences
Achievements
Jump to papers ↓+9 more ↓ Show less ↑
π§ Keyword Pioneer π Renaissance Researcher (5) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (14) π Academic Marathon (7)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π
Conference Loyalist
(30)
π€
Dynamic Duo
(18)
π§¬
Topic Evolution
π
Century Club
(30)
β‘
Prolific Year
(10)
ποΈ
Keyword Collector
(114)
π₯
Unstoppable
(6)
Conferences
INTERSPEECH (30)
Top co-authors
Keywords
automatic speech recognition
(8)
attention mechanism
(4)
acoustic model
(4)
convolutional neural network
(4)
speech separation
(3)
recurrent neural network
(3)
bidirectional lstm
(3)
end-to-end speech recognition
(3)
speaker diarization
(3)
speech recognition
(3)
speaker embedding
(3)
speaker verification
(3)
multi-task learning
(2)
deep neural network
(2)
variational autoencoder
(2)
connectionist temporal classification
(2)
adversarial learning
(2)
speech synthesis
(2)
speech enhancement
(2)
voice conversion
(2)
Papers
Expressive paragraph text-to-speech synthesis with multi-step variational autoencoder
INTERSPEECH 2024
Improving Copy-Synthesis Anti-Spoofing Training Method with Rhythm and Speaker Perturbation
INTERSPEECH 2024
Wav2vec-S: Semi-Supervised Pre-Training for Low-Resource ASR
INTERSPEECH 2022
Beam-Guided TasNet: An Iterative Speech Separation Framework with Multi-Channel Output
INTERSPEECH 2022
CTA-RNN: Channel and Temporal-wise Attention RNN leveraging Pre-trained ASR Embeddings for Speech Emotion Recognition
INTERSPEECH 2022
SASV Based on Pre-trained ASV System and Integrated Scoring Module
INTERSPEECH 2022
Improving Recognition of Out-of-vocabulary Words in E2E Code-switching ASR by Fusing Speech Generation Methods
INTERSPEECH 2022
Decoupled Federated Learning for ASR with Non-IID Data
INTERSPEECH 2022
Robust Cough Feature Extraction and Classification Method for COVID-19 Cough Detection Based on Vocalization Characteristics
INTERSPEECH 2022
Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational(RAMC) Speech Dataset
INTERSPEECH 2022
Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization
INTERSPEECH 2022
NAS-SCAE: Searching Compact Attention-based Encoders For End-to-end Automatic Speech Recognition
INTERSPEECH 2022
LinearSpeech: Parallel Text-to-Speech with Linear Complexity
INTERSPEECH 2021
TVQVC: Transformer Based Vector Quantized Variational Autoencoder with CTC Loss for Voice Conversion
INTERSPEECH 2021
Incorporating Cross-Speaker Style Transfer for Multi-Language Text-to-Speech
INTERSPEECH 2021
Improved Speech Enhancement Using a Complex-Domain GAN with Fused Time-Domain and Time-Frequency Domain Constraints
INTERSPEECH 2021
The Effect of Silence and Dual-Band Fusion in Anti-Spoofing System
INTERSPEECH 2021
Adaptive Margin Circle Loss for Speaker Verification
INTERSPEECH 2021
Speaker Diarization System Based on DPCA Algorithm for Fearless Steps Challenge Phase-2
INTERSPEECH 2020
Improved Guided Source Separation Integrated with a Strong Back-End for the CHiME-6 Dinner Party Scenario
INTERSPEECH 2020
Domain Adaptation Using Class Similarity for Robust Speech Recognition
INTERSPEECH 2020
Online Hybrid CTC/Attention Architecture for End-to-End Speech Recognition
INTERSPEECH 2019
Speaker-Invariant Feature-Mapping for Distant Speech Recognition via Adversarial Teacher-Student Learning
INTERSPEECH 2019
Multi-Accent Adaptation Based on Gate Mechanism
INTERSPEECH 2019
Character-Aware Sub-Word Level Language Modeling for Uyghur and Turkish ASR
INTERSPEECH 2019
Target Speaker Recovery and Recognition Network with Average x-Vector and Global Training
INTERSPEECH 2019
Investigation on the Combination of Batch Normalization and Dropout in BLSTM-based Acoustic Modeling for ASR
INTERSPEECH 2018
Improving Language Modeling with an Adversarial Critic for Automatic Speech Recognition
INTERSPEECH 2018
Deep Convolutional Neural Network with Scalogram for Audio Scene Modeling
INTERSPEECH 2018
Attention-Based LSTM with Multi-Task Learning for Distant Speech Recognition
INTERSPEECH 2017