Bin Ma
37 papers · 2005–2025 · 7 top CS/AI conferences
Achievements
Hot Topic Early Bird
Keyword Pioneer
Interdisciplinary Bridge
Taxonomy Completionist (10)
Conference Polyglot (7)
Academic Marathon (20)
Conference Loyalist (27)
Dynamic Duo (15)
Mega-Team (20)
Deep Specialist (11)
Keyword Champion (2)
Conference Pioneer
Prolific Year (8)
Keyword Collector (145)
Century Club (37)
Unstoppable (6)
Trend Setter
Conferences
INTERSPEECH (27)
ACL (3)
EMNLP (2)
ICML (2)
AAAI (1)
ACML (1)
IJCNLP (1)
Research topics
Keywords
automatic speech recognition (7)
multi-task learning (3)
speech recognition (3)
bottleneck feature (3)
convolutional neural network (3)
self-supervised learning (3)
mispronunciation detection (2)
speaker adaptation (2)
speech transformer (2)
multi-modal learning (2)
generative adversarial network (2)
voice conversion (2)
connectionist temporal classification (2)
representation learning (2)
cross-lingual transfer (2)
acoustic model (2)
keyword spotting (2)
speaker verification (2)
deep neural network (2)
transformer model (2)
Papers
Pixel2Feature Attack (P2FA): Rethinking the Perturbed Space to Enhance Adversarial Transferability
ICML 2025
Speed Master: Quick or Slow Play to Attack Speaker Recognition
AAAI 2025
Multi-band Frequency Reconstruction for Neural Psychoacoustic Coding
ICML 2025
Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis
INTERSPEECH 2024
Towards Audio Codec-based Speech Separation
INTERSPEECH 2024
Small Footprint Multi-channel Network for Keyword Spotting with Centroid Based Awareness
INTERSPEECH 2023
Adapter-tuning with Effective Token-dependent Representation Shift for Automatic Speech Recognition
INTERSPEECH 2023
Dual Acoustic Linguistic Self-supervised Representation Learning for Cross-Domain Speech Recognition
INTERSPEECH 2023
A Unified Recognition and Correction Model under Noisy and Accent Speech Conditions
INTERSPEECH 2023
Dual-Memory Multi-Modal Learning for Continual Spoken Keyword Spotting with Confidence Selection and Diversity Enhancement
INTERSPEECH 2023
ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention
INTERSPEECH 2023
A Unified Speaker Adaptation Approach for ASR
EMNLP 2021
ExNN-SMOTE: Extended Natural Neighbors Based SMOTE to Deal with Imbalanced Data
ACML 2021
Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion
INTERSPEECH 2020
Cross Attention with Monotonic Alignment for Speech Transformer
INTERSPEECH 2020
Universal Speech Transformer
INTERSPEECH 2020
Speech Transformer with Speaker Aware Persistent Memory
INTERSPEECH 2020
Fast Learning for Non-Parallel Many-to-Many Voice Conversion with Residual Star Generative Adversarial Networks
INTERSPEECH 2019
Multi-Task Multi-Network Joint-Learning of Deep Residual Networks and Cycle-Consistency Generative Adversarial Networks for Robust Speech Recognition
INTERSPEECH 2019
Constrained Output Embeddings for End-to-End Code-Switching Speech Recognition with Only Monolingual Data
INTERSPEECH 2019
Towards Language-Universal Mandarin-English Speech Recognition
INTERSPEECH 2019
Alibaba Speech Translation Systems for IWSLT 2018
EMNLP 2018
Learning Acoustic Word Embeddings with Temporal Context for Query-by-Example Speech Search
INTERSPEECH 2018
Multi-Task Learning for Mispronunciation Detection on Singapore Children's Mandarin Speech
INTERSPEECH 2017
An Integrated Solution for Snoring Sound Classification Using Bhattacharyya Distance Based GMM Supervectors with SVM, Feature Selection with Random Forest and Spectrogram with CNN
INTERSPEECH 2017
Toward High-Performance Language-Independent Query-by-Example Spoken Term Detection for MediaEval 2015: Post-Evaluation Analysis
INTERSPEECH 2016
Rapid Update of Multilingual Deep Neural Network for Low-Resource Keyword Search
INTERSPEECH 2016
The 2015 NIST Language Recognition Evaluation: The Shared View of I2R, Fantastic4 and SingaMS
INTERSPEECH 2016
Context Aware Mispronunciation Detection for Mandarin Pronunciation Training
INTERSPEECH 2016
SingaKids-Mandarin: Speech Corpus of Singaporean Children Speaking Mandarin Chinese
INTERSPEECH 2016
Unsupervised Bottleneck Features for Low-Resource Query-by-Example Spoken Term Detection
INTERSPEECH 2016
Learning Neural Network Representations Using Cross-Lingual Bottleneck Features with Word-Pair Information
INTERSPEECH 2016
Joint Speaker and Lexical Modeling for Short-Term Characterization of Speaker
INTERSPEECH 2016
Broadcast News Story Segmentation Using Manifold Learning on Latent Topic Distributions
ACL 2013
Using Cross-Entity Inference to Improve Event Extraction
ACL 2011
Thread Cleaning and Merging for Microblog Topic Detection
IJCNLP 2011
A Phonotactic Language Model for Spoken Language Identification
ACL 2005