Tom Ko
30 papers · 2018–2025 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
π§ Keyword Pioneer π Renaissance Researcher (5) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (15) π£ Hot Topic Early Bird
πΊοΈ
Taxonomy Completionist
(15)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π€
Dynamic Duo
(11)
π₯
Mega-Team
(62)
π
Trend Setter
π₯
Unstoppable
(8)
ποΈ
Keyword Collector
(134)
β‘
Prolific Year
(11)
π
Century Club
(30)
Conferences
INTERSPEECH (15)
ACL (7)
ICLR (3)
AAAI (1)
COLING (1)
CVPR (1)
EMNLP (1)
IJCAI (1)
Top co-authors
Research topics
Keywords
speech translation
(6)
speech recognition
(4)
data augmentation
(4)
attention mechanism
(3)
automated machine learning
(2)
few-shot learning
(2)
personalized dialogue
(2)
speaker verification
(2)
masked language modeling
(2)
deep neural network
(2)
representation learning
(2)
multimodal learning
(2)
knowledge distillation
(2)
machine translation
(2)
contrastive learning
(2)
dialogue generation
(2)
speech synthesis
(2)
speaker embedding
(2)
model-agnostic meta-learning
(2)
speech-to-speech translation
(2)
Papers
HiRA: Parameter-Efficient Hadamard High-Rank Adaptation for Large Language Models
ICLR 2025
ComLoRA: A Competitive Learning Approach for Enhancing LoRA
ICLR 2025
Selective Prompting Tuning for Personalized Conversations with LLMs
ACL 2024
Parameter-Efficient Transfer Learning for End-to-end Speech Translation
COLING 2024
RepCodec: A Speech Representation Codec for Speech Tokenization
ACL 2024
PolyVoice: Language Models for Speech to Speech Translation
ICLR 2024
Learning Retrieval Augmentation for Personalized Dialogue Generation
EMNLP 2023
Personalized Dialogue Generation with Persona-Adaptive Attention
AAAI 2023
CTC-based Non-autoregressive Speech Translation
ACL 2023
MOSPC: MOS Prediction Based on Pairwise Comparison
ACL 2023
DUB: Discrete Unit Back-translation for Speech Translation
ACL 2023
FINDINGS OF THE IWSLT 2023 EVALUATION CAMPAIGN
ACL 2023
Leveraging per Image-Token Consistency for Vision-Language Pre-Training
CVPR 2023
Recent Advances in Direct Speech-to-text Translation
IJCAI 2023
GigaST: A 10,000-hour Pseudo Speech Translation Corpus
INTERSPEECH 2023
Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention
INTERSPEECH 2023
CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning
INTERSPEECH 2023
Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data
INTERSPEECH 2022
A Study of Modeling Rising Intonation in Cantonese Neural Speech Synthesis
INTERSPEECH 2022
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
INTERSPEECH 2022
Leveraging Pseudo-labeled Data to Improve Direct Speech-to-Speech Translation
INTERSPEECH 2022
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing
ACL 2022
Auto-KWS 2021 Challenge: Task, Datasets, and Baselines
INTERSPEECH 2021
Token-Level Supervised Contrastive Learning for Punctuation Restoration
INTERSPEECH 2021
A Meta-Learning Approach for User-Defined Spoken Term Classification with Varying Classes and Examples
INTERSPEECH 2021
AutoSpeech 2020: The Second Automated Machine Learning Challenge for Speech Classification
INTERSPEECH 2020
An Investigation of Few-Shot Learning in Spoken Term Classification
INTERSPEECH 2020
Mixup Learning Strategies for Text-Independent Speaker Verification
INTERSPEECH 2019
Long Distance Voice Channel Diagnosis Using Deep Neural Networks
INTERSPEECH 2018
Self-Attentive Speaker Embeddings for Text-Independent Speaker Verification
INTERSPEECH 2018