Pengyuan Zhang

30 papers · 2017–2024 · 1 conference · across top CS/AI conferences

Achievements

🧭 Keyword Pioneer 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (14) 🏃 Academic Marathon (7)

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🏠 Conference Loyalist (30) 🤝 Dynamic Duo (18) 🧬 Topic Evolution 💎 Century Club (30) ⚡ Prolific Year (10) 🗃️ Keyword Collector (114) 🔥 Unstoppable (6)

Conferences

INTERSPEECH (30)

Top co-authors

Yonghong Yan (18) · Gaofeng Cheng (7) · Wenchao Wang (5) · Hangting Chen (5) · Li Wang (4) · Han Zhu (4) · Zengqiang Shang (4) · Yuxiang Zhang (3) · Zhihua Huang (3) · Ta Li (3)

Keywords

automatic speech recognition (8) · attention mechanism (4) · acoustic model (4) · convolutional neural network (4) · speech separation (3) · recurrent neural network (3) · bidirectional lstm (3) · end-to-end speech recognition (3) · speaker diarization (3) · speech recognition (3) · speaker embedding (3) · speaker verification (3) · multi-task learning (2) · deep neural network (2) · variational autoencoder (2) · connectionist temporal classification (2) · adversarial learning (2) · speech synthesis (2) · speech enhancement (2) · voice conversion (2)

Papers

Expressive paragraph text-to-speech synthesis with multi-step variational autoencoder · INTERSPEECH 2024
Improving Copy-Synthesis Anti-Spoofing Training Method with Rhythm and Speaker Perturbation · INTERSPEECH 2024
Wav2vec-S: Semi-Supervised Pre-Training for Low-Resource ASR · INTERSPEECH 2022
Beam-Guided TasNet: An Iterative Speech Separation Framework with Multi-Channel Output · INTERSPEECH 2022
CTA-RNN: Channel and Temporal-wise Attention RNN leveraging Pre-trained ASR Embeddings for Speech Emotion Recognition · INTERSPEECH 2022
SASV Based on Pre-trained ASV System and Integrated Scoring Module · INTERSPEECH 2022
Improving Recognition of Out-of-vocabulary Words in E2E Code-switching ASR by Fusing Speech Generation Methods · INTERSPEECH 2022
Decoupled Federated Learning for ASR with Non-IID Data · INTERSPEECH 2022
Robust Cough Feature Extraction and Classification Method for COVID-19 Cough Detection Based on Vocalization Characteristics · INTERSPEECH 2022
Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational(RAMC) Speech Dataset · INTERSPEECH 2022
Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization · INTERSPEECH 2022
NAS-SCAE: Searching Compact Attention-based Encoders For End-to-end Automatic Speech Recognition · INTERSPEECH 2022
LinearSpeech: Parallel Text-to-Speech with Linear Complexity · INTERSPEECH 2021
TVQVC: Transformer Based Vector Quantized Variational Autoencoder with CTC Loss for Voice Conversion · INTERSPEECH 2021
Incorporating Cross-Speaker Style Transfer for Multi-Language Text-to-Speech · INTERSPEECH 2021
Improved Speech Enhancement Using a Complex-Domain GAN with Fused Time-Domain and Time-Frequency Domain Constraints · INTERSPEECH 2021
The Effect of Silence and Dual-Band Fusion in Anti-Spoofing System · INTERSPEECH 2021
Adaptive Margin Circle Loss for Speaker Verification · INTERSPEECH 2021
Speaker Diarization System Based on DPCA Algorithm for Fearless Steps Challenge Phase-2 · INTERSPEECH 2020
Improved Guided Source Separation Integrated with a Strong Back-End for the CHiME-6 Dinner Party Scenario · INTERSPEECH 2020
Domain Adaptation Using Class Similarity for Robust Speech Recognition · INTERSPEECH 2020
Online Hybrid CTC/Attention Architecture for End-to-End Speech Recognition · INTERSPEECH 2019
Speaker-Invariant Feature-Mapping for Distant Speech Recognition via Adversarial Teacher-Student Learning · INTERSPEECH 2019
Multi-Accent Adaptation Based on Gate Mechanism · INTERSPEECH 2019
Character-Aware Sub-Word Level Language Modeling for Uyghur and Turkish ASR · INTERSPEECH 2019
Target Speaker Recovery and Recognition Network with Average x-Vector and Global Training · INTERSPEECH 2019
Investigation on the Combination of Batch Normalization and Dropout in BLSTM-based Acoustic Modeling for ASR · INTERSPEECH 2018
Improving Language Modeling with an Adversarial Critic for Automatic Speech Recognition · INTERSPEECH 2018
Deep Convolutional Neural Network with Scalogram for Audio Scene Modeling · INTERSPEECH 2018
Attention-Based LSTM with Multi-Task Learning for Distant Speech Recognition · INTERSPEECH 2017