Qingkai Fang
15 papers · 2022–2025 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
π Cross-Pollinator (8) π Conference Polyglot (4) π Interdisciplinary Bridge π§ Keyword Pioneer π Renaissance Researcher (7)
π
Renaissance Researcher
(7)
πΊοΈ
Taxonomy Completionist
(31)
π€
Dynamic Duo
(15)
β
The Questioner
β‘
Prolific Year
(5)
ποΈ
Keyword Collector
(68)
π
Century Club
(15)
π
Trend Setter
Conferences
ACL (10)
EMNLP (2)
ICLR (2)
NIPS (1)
Top co-authors
Keywords
speech translation
(4)
speech-to-speech translation
(4)
neural machine translation
(3)
machine translation
(3)
speech synthesis
(3)
non-autoregressive translation
(2)
zero-shot learning
(2)
contrastive learning
(2)
speech-to-text translation
(2)
multimodal machine translation
(2)
representation learning
(2)
multi-task learning
(2)
end-to-end model
(2)
cross-modal learning
(2)
ctc decoding
(2)
low-resource language
(2)
domain adaptation
(1)
speech recognition
(1)
image generation
(1)
cross-modal representation
(1)
Papers
LLaMA-Omni 2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis
ACL 2025
LLaMA-Omni: Seamless Speech Interaction with Large Language Models
ICLR 2025
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token
ICLR 2025
Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data?
ACL 2024
A Non-autoregressive Generation Framework for End-to-End Simultaneous Speech-to-Any Translation
ACL 2024
StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning
ACL 2024
CTC-based Non-autoregressive Textless Speech-to-Speech Translation
ACL 2024
DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation
NIPS 2023
CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation
ACL 2023
Understanding and Bridging the Modality Gap for Speech Translation
ACL 2023
Bridging the Gap between Synthetic and Authentic Images for Multimodal Machine Translation
EMNLP 2023
Back Translation for Speech-to-text Translation Without Transcripts
ACL 2023
STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation
ACL 2022
Neural Machine Translation with Phrase-Level Universal Visual Representations
ACL 2022
Low-resource Neural Machine Translation with Cross-modal Alignment
EMNLP 2022