Eng Siong Chng
66 papers · 2010–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (19) π§ Keyword Pioneer π Renaissance Researcher (5) π Interdisciplinary Bridge π£ Hot Topic Early Bird
π
Cross-Pollinator
(14)
πΊοΈ
Taxonomy Completionist
(19)
π§
Keyword Pioneer
π
Conference Loyalist
(44)
π€
Dynamic Duo
(18)
π§¬
Topic Evolution
π₯
Mega-Team
(20)
π¬
Deep Specialist
(16)
π
Keyword Champion
(2)
π₯
Unstoppable
(11)
π
Trend Setter
β‘
Prolific Year
(8)
π
Century Club
(65)
ποΈ
Keyword Collector
(61)
π
Conference Pioneer
Conferences
INTERSPEECH (44)
ACL (8)
EMNLP (7)
AAAI (2)
ICCV (1)
IJCAI (1)
NAACL (1)
NIPS (1)
WACV (1)
Top co-authors
Research topics
Keywords
automatic speech recognition
(13)
speech recognition
(9)
multimodal learning
(8)
representation learning
(6)
multi-task learning
(5)
speech enhancement
(5)
audio-visual speech recognition
(4)
domain adaptation
(4)
keyword spotting
(4)
large language model
(4)
self-supervised learning
(4)
speech separation
(4)
deep neural network
(3)
acoustic model
(3)
multimodal fusion
(3)
deepfake detection
(3)
knowledge distillation
(3)
data augmentation
(3)
catastrophic forgetting
(3)
speaker verification
(3)
Papers
A-V Representation Learning via Audio Shift Prediction for Multimodal Deepfake Detection and Temporal Localization
WACV 2026
Evaluating the Expressive Appropriateness of Speech in Rich Contexts
ACL 2026
Intra-modal and Cross-modal Synchronization for Audio-visual Deepfake Detection and Temporal Localization
ICCV 2025
InTriage: Intelligent Telephone Triage in Pre-Hospital Emergency Care
EMNLP 2025
CS-Sum: A Benchmark for Code-Switching Dialogue Summarization and the Limits of Large Language Models
EMNLP 2025
DiaSynth: Synthetic Dialogue Generation Framework for Low Resource Dialogue Applications
NAACL 2025
Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models
NIPS 2024
Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model
EMNLP 2024
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators
ACL 2024
Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models
ACL 2024
Noise-aware Speech Enhancement using Diffusion Probabilistic Model
INTERSPEECH 2024
Towards Audio Codec-based Speech Separation
INTERSPEECH 2024
Investigating ASR Error Correction with Large Language Model and Multilingual 1-best Hypotheses
INTERSPEECH 2024
Continual Learning Optimizations for Auto-regressive Decoder of Multilingual ASR systems
INTERSPEECH 2024
Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection
INTERSPEECH 2024
A Unified Recognition and Correction Model under Noisy and Accent Speech Conditions
INTERSPEECH 2023
Leveraging Modality-Specific Representations for Audio-Visual Speech Recognition via Reinforcement Learning
AAAI 2023
MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition
ACL 2023
Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition
ACL 2023
UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning
ACL 2023
CASSI: Contextual and Semantic Structure-based Interpolation Augmentation for Low-Resource NER
EMNLP 2023
Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition
IJCAI 2023
Dual Acoustic Linguistic Self-supervised Representation Learning for Cross-Domain Speech Recognition
INTERSPEECH 2023
Small Footprint Multi-channel Network for Keyword Spotting with Centroid Based Awareness
INTERSPEECH 2023
Adapter-tuning with Effective Token-dependent Representation Shift for Automatic Speech Recognition
INTERSPEECH 2023
ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention
INTERSPEECH 2023
Self-supervised Learning Representation based Accent Recognition with Persistent Accent Memory
INTERSPEECH 2023
Blind Estimation of Room Impulse Response from Monaural Reverberant Speech with Segmental Generative Neural Network
INTERSPEECH 2023
Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition
INTERSPEECH 2023
Dual-Memory Multi-Modal Learning for Continual Spoken Keyword Spotting with Confidence Selection and Diversity Enhancement
INTERSPEECH 2023
A Neural State-Space Modeling Approach to Efficient Speech Separation
INTERSPEECH 2023
DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognition
INTERSPEECH 2022
Estimation of speaker age and height from speech signal using bi-encoder transformer mixture model
INTERSPEECH 2022
Interactive Auido-text Representation for Automated Audio Captioning with Contrastive Learning
INTERSPEECH 2022
Rainbow Keywords: Efficient Incremental Learning for Online Spoken Keyword Spotting
INTERSPEECH 2022
Overlapped Speech Detection Based on Spectral and Spatial Feature Fusion
INTERSPEECH 2021
GDPNet: Refining Latent Multi-View Graph for Relation Extraction
AAAI 2021
E2E-Based Multi-Task Learning Approach to Joint Speech and Accent Recognition
INTERSPEECH 2021
A Unified Speaker Adaptation Approach for ASR
EMNLP 2021
Multi-Task Learning for End-to-End Noise-Robust Bandwidth Extension
INTERSPEECH 2020
Universal Speech Transformer
INTERSPEECH 2020
Speech Transformer with Speaker Aware Persistent Memory
INTERSPEECH 2020
Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences
EMNLP 2020
Cross Attention with Monotonic Alignment for Speech Transformer
INTERSPEECH 2020
SpEx+: A Complete Time Domain Speaker Extraction Network
INTERSPEECH 2020
Monolingual Data Selection Analysis for English-Mandarin Hybrid Code-Switching Speech Recognition
INTERSPEECH 2020
Speaker and Phoneme-Aware Speech Bandwidth Extension with Residual Dual-Path Network
INTERSPEECH 2020
Target Speaker Extraction for Multi-Talker Speaker Verification
INTERSPEECH 2019
Constrained Output Embeddings for End-to-End Code-Switching Speech Recognition with Only Monolingual Data
INTERSPEECH 2019
On the End-to-End Solution to Mandarin-English Code-Switching Speech Recognition
INTERSPEECH 2019
Enriching Rare Word Representations in Neural Language Models by Embedding Matrix Augmentation
INTERSPEECH 2019
A Speaker-Dependent WaveNet for Voice Conversion with Non-Parallel Data
INTERSPEECH 2019
Unsupervised and Efficient Vocabulary Expansion for Recurrent Neural Network Language Models in ASR
INTERSPEECH 2018
A Shifted Delta Coefficient Objective for Monaural Speech Separation Using Multi-task Learning
INTERSPEECH 2018
Mandarin-English Code-switching Speech Recognition
INTERSPEECH 2018
Study of Semi-supervised Approaches to Improving English-Mandarin Code-Switching Speech Recognition
INTERSPEECH 2018
Named-Entity Tagging and Domain adaptation for Better Customized Translation
ACL 2018
Weighted Spatial Covariance Matrix Estimation for MUSIC Based TDOA Estimation of Speech Source
INTERSPEECH 2017
Rescoring Hypothesized Detections of Out-of-Vocabulary Keywords Using Subword Samples
INTERSPEECH 2016
A DNN-HMM Approach to Story Segmentation
INTERSPEECH 2016
An Investigation of Spoofing Speech Detection Under Additive Noise and Reverberant Conditions
INTERSPEECH 2016
The 2015 NIST Language Recognition Evaluation: The Shared View of I2R, Fantastic4 and SingaMS
INTERSPEECH 2016
Toward High-Performance Language-Independent Query-by-Example Spoken Term Detection for MediaEval 2015: Post-Evaluation Analysis
INTERSPEECH 2016
Semi-Supervised and Cross-Lingual Knowledge Transfer Learnings for DNN Hybrid Acoustic Models Under Low-Resource Conditions
INTERSPEECH 2016
Modeling of term-distance and term-occurrence information for improving n-gram language model performance
ACL 2013
Non-Isomorphic Forest Pair Translation
EMNLP 2010