Qingkai Fang

15 papers · 2022–2025 · 4 conferences · across top CS/AI conferences

Achievements

🐝 Cross-Pollinator (8) 🌍 Conference Polyglot (4) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌈 Renaissance Researcher (7)

🗺️ Taxonomy Completionist (31) 🤝 Dynamic Duo (15) ❓ The Questioner ⚡ Prolific Year (5) 🗃️ Keyword Collector (68) 💎 Century Club (15) 📈 Trend Setter

Conferences

ACL (10) EMNLP (2) ICLR (2) NIPS (1)

Top co-authors

Yang Feng (15) Shaolei Zhang (6) Yan Zhou (5) Zhengrui Ma (5) Shoutao Guo (4) Min Zhang (4) Zhe Yang (2) Mingxuan Wang (1) Wenyu Guo (1) Rong Ye (1)

Keywords

speech translation (4) speech-to-speech translation (4) neural machine translation (3) machine translation (3) speech synthesis (3) non-autoregressive translation (2) zero-shot learning (2) contrastive learning (2) speech-to-text translation (2) multimodal machine translation (2) representation learning (2) multi-task learning (2) end-to-end model (2) cross-modal learning (2) ctc decoding (2) low-resource language (2) domain adaptation (1) speech recognition (1) image generation (1) cross-modal representation (1)

Papers

LLaMA-Omni 2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis · ACL 2025
LLaMA-Omni: Seamless Speech Interaction with Large Language Models · ICLR 2025
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token · ICLR 2025
Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data? · ACL 2024
A Non-autoregressive Generation Framework for End-to-End Simultaneous Speech-to-Any Translation · ACL 2024
StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning · ACL 2024
CTC-based Non-autoregressive Textless Speech-to-Speech Translation · ACL 2024
DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation · NIPS 2023
CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation · ACL 2023
Understanding and Bridging the Modality Gap for Speech Translation · ACL 2023
Bridging the Gap between Synthetic and Authentic Images for Multimodal Machine Translation · EMNLP 2023
Back Translation for Speech-to-text Translation Without Transcripts · ACL 2023
STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation · ACL 2022
Neural Machine Translation with Phrase-Level Universal Visual Representations · ACL 2022
Low-resource Neural Machine Translation with Cross-modal Alignment · EMNLP 2022