Bin Ma

37 papers · 2005–2025 · 7 conferences · across top CS/AI conferences

Achievements

+14 more ↓

🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (10) 🌍 Conference Polyglot (7)

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🏃 Academic Marathon (20) 🏠 Conference Loyalist (27) 🤝 Dynamic Duo (15) 👥 Mega-Team (20) 🔬 Deep Specialist (11) 🏆 Keyword Champion (2) 🚀 Conference Pioneer ⚡ Prolific Year (8) 🗃️ Keyword Collector (145) 💎 Century Club (37) 🔥 Unstoppable (6) 📈 Trend Setter

Conferences

INTERSPEECH (27) ACL (3) EMNLP (2) ICML (2) AAAI (1) ACML (1) IJCNLP (1)

Top co-authors

Chongjia Ni (15) Eng Siong Chng (14) Cheung-Chi Leung (11) Haizhou Li (10) Dianwen Ng (9) Trung Hieu Nguyen (7) Shengkui Zhao (7) Yukun Ma (6) Lei Xie (6) Chong Zhang (6)

Research topics

Privacy (1)

Keywords

automatic speech recognition (7) multi-task learning (3) speech recognition (3) bottleneck feature (3) convolutional neural network (3) self-supervised learning (3) mispronunciation detection (2) speaker adaptation (2) speech transformer (2) multi-modal learning (2) generative adversarial network (2) voice conversion (2) connectionist temporal classification (2) representation learning (2) cross-lingual transfer (2) acoustic model (2) keyword spotting (2) speaker verification (2) deep neural network (2) transformer model (2)

Papers

Pixel2Feature Attack (P2FA): Rethinking the Perturbed Space to Enhance Adversarial Transferability ICML 2025 Speed Master: Quick or Slow Play to Attack Speaker Recognition AAAI 2025 Multi-band Frequency Reconstruction for Neural Psychoacoustic Coding ICML 2025 Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis INTERSPEECH 2024 Towards Audio Codec-based Speech Separation INTERSPEECH 2024 Small Footprint Multi-channel Network for Keyword Spotting with Centroid Based Awareness INTERSPEECH 2023 Adapter-tuning with Effective Token-dependent Representation Shift for Automatic Speech Recognition INTERSPEECH 2023 Dual Acoustic Linguistic Self-supervised Representation Learning for Cross-Domain Speech Recognition INTERSPEECH 2023 A Unified Recognition and Correction Model under Noisy and Accent Speech Conditions INTERSPEECH 2023 Dual-Memory Multi-Modal Learning for Continual Spoken Keyword Spotting with Confidence Selection and Diversity Enhancement INTERSPEECH 2023 ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention INTERSPEECH 2023 A Unified Speaker Adaptation Approach for ASR EMNLP 2021 ExNN-SMOTE: Extended Natural Neighbors Based SMOTE to Deal with Imbalanced Data ACML 2021 Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion INTERSPEECH 2020 Cross Attention with Monotonic Alignment for Speech Transformer INTERSPEECH 2020 Universal Speech Transformer INTERSPEECH 2020 Speech Transformer with Speaker Aware Persistent Memory INTERSPEECH 2020 Fast Learning for Non-Parallel Many-to-Many Voice Conversion with Residual Star Generative Adversarial Networks INTERSPEECH 2019 Multi-Task Multi-Network Joint-Learning of Deep Residual Networks and Cycle-Consistency Generative Adversarial Networks for Robust Speech Recognition INTERSPEECH 2019 Constrained Output Embeddings for End-to-End Code-Switching Speech Recognition with Only Monolingual Data INTERSPEECH 2019 Towards Language-Universal Mandarin-English Speech Recognition INTERSPEECH 2019 Alibaba Speech Translation Systems for IWSLT 2018 EMNLP 2018 Learning Acoustic Word Embeddings with Temporal Context for Query-by-Example Speech Search INTERSPEECH 2018 Multi-Task Learning for Mispronunciation Detection on Singapore Children’s Mandarin Speech INTERSPEECH 2017 An Integrated Solution for Snoring Sound Classification Using Bhattacharyya Distance Based GMM Supervectors with SVM, Feature Selection with Random Forest and Spectrogram with CNN INTERSPEECH 2017 Toward High-Performance Language-Independent Query-by-Example Spoken Term Detection for MediaEval 2015: Post-Evaluation Analysis INTERSPEECH 2016 Rapid Update of Multilingual Deep Neural Network for Low-Resource Keyword Search INTERSPEECH 2016 The 2015 NIST Language Recognition Evaluation: The Shared View of I2R, Fantastic4 and SingaMS INTERSPEECH 2016 Context Aware Mispronunciation Detection for Mandarin Pronunciation Training INTERSPEECH 2016 SingaKids-Mandarin: Speech Corpus of Singaporean Children Speaking Mandarin Chinese INTERSPEECH 2016 Unsupervised Bottleneck Features for Low-Resource Query-by-Example Spoken Term Detection INTERSPEECH 2016 Learning Neural Network Representations Using Cross-Lingual Bottleneck Features with Word-Pair Information INTERSPEECH 2016 Joint Speaker and Lexical Modeling for Short-Term Characterization of Speaker INTERSPEECH 2016 Broadcast News Story Segmentation Using Manifold Learning on Latent Topic Distributions ACL 2013 Using Cross-Entity Inference to Improve Event Extraction ACL 2011 Thread Cleaning and Merging for Microblog Topic Detection IJCNLP 2011 A Phonotactic Language Model for Spoken Language Identification ACL 2005