Tom Ko

30 papers · 2018–2025 · 8 top CS/AI conferences

Achievements

🧭 Keyword Pioneer 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (15) 🐣 Hot Topic Early Bird 🤝 Dynamic Duo (11) 👥 Mega-Team (62) 📈 Trend Setter 🔥 Unstoppable (8) 🗃️ Keyword Collector (134) ⚡ Prolific Year (11) 💎 Century Club (30)

Conferences

INTERSPEECH (15) ACL (7) ICLR (3) AAAI (1) COLING (1) CVPR (1) EMNLP (1) IJCAI (1)

Top co-authors

Yu Zhang (11) Mingxuan Wang (10) Qiushi Huang (7) Qianqian Dong (6) Xubo Liu (5) Lilian Tang (4) Junyi Ao (4) Wenwu Wang (4) Bo Wu (3) Chutong Meng (3)

Research topics

Synthesis (1)

Keywords

speech translation (6) speech recognition (4) data augmentation (4) attention mechanism (3) automated machine learning (2) few-shot learning (2) personalized dialogue (2) speaker verification (2) masked language modeling (2) deep neural network (2) representation learning (2) multimodal learning (2) knowledge distillation (2) machine translation (2) contrastive learning (2) dialogue generation (2) speech synthesis (2) speaker embedding (2) model-agnostic meta-learning (2) speech-to-speech translation (2)

Papers

HiRA: Parameter-Efficient Hadamard High-Rank Adaptation for Large Language Models ICLR 2025
ComLoRA: A Competitive Learning Approach for Enhancing LoRA ICLR 2025
Selective Prompting Tuning for Personalized Conversations with LLMs ACL 2024
Parameter-Efficient Transfer Learning for End-to-end Speech Translation COLING 2024
RepCodec: A Speech Representation Codec for Speech Tokenization ACL 2024
PolyVoice: Language Models for Speech to Speech Translation ICLR 2024
Learning Retrieval Augmentation for Personalized Dialogue Generation EMNLP 2023
Personalized Dialogue Generation with Persona-Adaptive Attention AAAI 2023
CTC-based Non-autoregressive Speech Translation ACL 2023
MOSPC: MOS Prediction Based on Pairwise Comparison ACL 2023
DUB: Discrete Unit Back-translation for Speech Translation ACL 2023
Findings of the IWSLT 2023 Evaluation Campaign ACL 2023
Leveraging per Image-Token Consistency for Vision-Language Pre-Training CVPR 2023
Recent Advances in Direct Speech-to-text Translation IJCAI 2023
GigaST: A 10,000-hour Pseudo Speech Translation Corpus INTERSPEECH 2023
Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention INTERSPEECH 2023
CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning INTERSPEECH 2023
Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data INTERSPEECH 2022
A Study of Modeling Rising Intonation in Cantonese Neural Speech Synthesis INTERSPEECH 2022
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT INTERSPEECH 2022
Leveraging Pseudo-labeled Data to Improve Direct Speech-to-Speech Translation INTERSPEECH 2022
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing ACL 2022
Auto-KWS 2021 Challenge: Task, Datasets, and Baselines INTERSPEECH 2021
Token-Level Supervised Contrastive Learning for Punctuation Restoration INTERSPEECH 2021
A Meta-Learning Approach for User-Defined Spoken Term Classification with Varying Classes and Examples INTERSPEECH 2021
AutoSpeech 2020: The Second Automated Machine Learning Challenge for Speech Classification INTERSPEECH 2020
An Investigation of Few-Shot Learning in Spoken Term Classification INTERSPEECH 2020
Mixup Learning Strategies for Text-Independent Speaker Verification INTERSPEECH 2019
Long Distance Voice Channel Diagnosis Using Deep Neural Networks INTERSPEECH 2018
Self-Attentive Speaker Embeddings for Text-Independent Speaker Verification INTERSPEECH 2018