conftrace_

Eng Siong Chng

66 papers · 2010–2026 · 9 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓
+15 more ↓ πŸ—ΊοΈ Taxonomy Completionist (19) 🧭 Keyword Pioneer 🌈 Renaissance Researcher (5) πŸŒ‰ Interdisciplinary Bridge 🐣 Hot Topic Early Bird
🐝 Cross-Pollinator (14) πŸ—ΊοΈ Taxonomy Completionist (19) 🧭 Keyword Pioneer 🏠 Conference Loyalist (44) 🀝 Dynamic Duo (18) 🧬 Topic Evolution πŸ‘₯ Mega-Team (20) πŸ”¬ Deep Specialist (16) πŸ† Keyword Champion (2) πŸ”₯ Unstoppable (11) πŸ“ˆ Trend Setter ⚑ Prolific Year (8) πŸ’Ž Century Club (65) πŸ—ƒοΈ Keyword Collector (61) πŸš€ Conference Pioneer

Conferences

INTERSPEECH (44) ACL (8) EMNLP (7) AAAI (2) ICCV (1) IJCAI (1) NAACL (1) NIPS (1) WACV (1)

Papers

A-V Representation Learning via Audio Shift Prediction for Multimodal Deepfake Detection and Temporal Localization WACV 2026 Evaluating the Expressive Appropriateness of Speech in Rich Contexts ACL 2026 Intra-modal and Cross-modal Synchronization for Audio-visual Deepfake Detection and Temporal Localization ICCV 2025 InTriage: Intelligent Telephone Triage in Pre-Hospital Emergency Care EMNLP 2025 CS-Sum: A Benchmark for Code-Switching Dialogue Summarization and the Limits of Large Language Models EMNLP 2025 DiaSynth: Synthetic Dialogue Generation Framework for Low Resource Dialogue Applications NAACL 2025 Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models NIPS 2024 Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model EMNLP 2024 GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators ACL 2024 Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models ACL 2024 Noise-aware Speech Enhancement using Diffusion Probabilistic Model INTERSPEECH 2024 Towards Audio Codec-based Speech Separation INTERSPEECH 2024 Investigating ASR Error Correction with Large Language Model and Multilingual 1-best Hypotheses INTERSPEECH 2024 Continual Learning Optimizations for Auto-regressive Decoder of Multilingual ASR systems INTERSPEECH 2024 Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection INTERSPEECH 2024 A Unified Recognition and Correction Model under Noisy and Accent Speech Conditions INTERSPEECH 2023 Leveraging Modality-Specific Representations for Audio-Visual Speech Recognition via Reinforcement Learning AAAI 2023 MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition ACL 2023 Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition ACL 2023 UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning ACL 2023 CASSI: Contextual and Semantic Structure-based Interpolation Augmentation for Low-Resource NER EMNLP 2023 Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition IJCAI 2023 Dual Acoustic Linguistic Self-supervised Representation Learning for Cross-Domain Speech Recognition INTERSPEECH 2023 Small Footprint Multi-channel Network for Keyword Spotting with Centroid Based Awareness INTERSPEECH 2023 Adapter-tuning with Effective Token-dependent Representation Shift for Automatic Speech Recognition INTERSPEECH 2023 ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention INTERSPEECH 2023 Self-supervised Learning Representation based Accent Recognition with Persistent Accent Memory INTERSPEECH 2023 Blind Estimation of Room Impulse Response from Monaural Reverberant Speech with Segmental Generative Neural Network INTERSPEECH 2023 Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition INTERSPEECH 2023 Dual-Memory Multi-Modal Learning for Continual Spoken Keyword Spotting with Confidence Selection and Diversity Enhancement INTERSPEECH 2023 A Neural State-Space Modeling Approach to Efficient Speech Separation INTERSPEECH 2023 DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognition INTERSPEECH 2022 Estimation of speaker age and height from speech signal using bi-encoder transformer mixture model INTERSPEECH 2022 Interactive Auido-text Representation for Automated Audio Captioning with Contrastive Learning INTERSPEECH 2022 Rainbow Keywords: Efficient Incremental Learning for Online Spoken Keyword Spotting INTERSPEECH 2022 Overlapped Speech Detection Based on Spectral and Spatial Feature Fusion INTERSPEECH 2021 GDPNet: Refining Latent Multi-View Graph for Relation Extraction AAAI 2021 E2E-Based Multi-Task Learning Approach to Joint Speech and Accent Recognition INTERSPEECH 2021 A Unified Speaker Adaptation Approach for ASR EMNLP 2021 Multi-Task Learning for End-to-End Noise-Robust Bandwidth Extension INTERSPEECH 2020 Universal Speech Transformer INTERSPEECH 2020 Speech Transformer with Speaker Aware Persistent Memory INTERSPEECH 2020 Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences EMNLP 2020 Cross Attention with Monotonic Alignment for Speech Transformer INTERSPEECH 2020 SpEx+: A Complete Time Domain Speaker Extraction Network INTERSPEECH 2020 Monolingual Data Selection Analysis for English-Mandarin Hybrid Code-Switching Speech Recognition INTERSPEECH 2020 Speaker and Phoneme-Aware Speech Bandwidth Extension with Residual Dual-Path Network INTERSPEECH 2020 Target Speaker Extraction for Multi-Talker Speaker Verification INTERSPEECH 2019 Constrained Output Embeddings for End-to-End Code-Switching Speech Recognition with Only Monolingual Data INTERSPEECH 2019 On the End-to-End Solution to Mandarin-English Code-Switching Speech Recognition INTERSPEECH 2019 Enriching Rare Word Representations in Neural Language Models by Embedding Matrix Augmentation INTERSPEECH 2019 A Speaker-Dependent WaveNet for Voice Conversion with Non-Parallel Data INTERSPEECH 2019 Unsupervised and Efficient Vocabulary Expansion for Recurrent Neural Network Language Models in ASR INTERSPEECH 2018 A Shifted Delta Coefficient Objective for Monaural Speech Separation Using Multi-task Learning INTERSPEECH 2018 Mandarin-English Code-switching Speech Recognition INTERSPEECH 2018 Study of Semi-supervised Approaches to Improving English-Mandarin Code-Switching Speech Recognition INTERSPEECH 2018 Named-Entity Tagging and Domain adaptation for Better Customized Translation ACL 2018 Weighted Spatial Covariance Matrix Estimation for MUSIC Based TDOA Estimation of Speech Source INTERSPEECH 2017 Rescoring Hypothesized Detections of Out-of-Vocabulary Keywords Using Subword Samples INTERSPEECH 2016 A DNN-HMM Approach to Story Segmentation INTERSPEECH 2016 An Investigation of Spoofing Speech Detection Under Additive Noise and Reverberant Conditions INTERSPEECH 2016 The 2015 NIST Language Recognition Evaluation: The Shared View of I2R, Fantastic4 and SingaMS INTERSPEECH 2016 Toward High-Performance Language-Independent Query-by-Example Spoken Term Detection for MediaEval 2015: Post-Evaluation Analysis INTERSPEECH 2016 Semi-Supervised and Cross-Lingual Knowledge Transfer Learnings for DNN Hybrid Acoustic Models Under Low-Resource Conditions INTERSPEECH 2016 Modeling of term-distance and term-occurrence information for improving n-gram language model performance ACL 2013 Non-Isomorphic Forest Pair Translation EMNLP 2010