Lu Lu

18 papers · 2023–2026 · 7 conferences · across top CS/AI conferences

Achievements

+8 more ↓

🗺️ Taxonomy Completionist (14) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (5) 🌍 Conference Polyglot (7)

🌉 Interdisciplinary Bridge 🤝 Dynamic Duo (10) 👑 Triple Crown 🏆 Grand Slam 🗃️ Keyword Collector (82) ⚡ Prolific Year (6) ❓ The Questioner 💎 Century Club (17)

Conferences

INTERSPEECH (7) AAAI (2) ACL (2) ICLR (2) ICML (2) NIPS (2) IJCAI (1)

Top co-authors

Zejun Ma (10) Jun Zhang (7) Yuxuan Wang (6) Chao Zhang (4) Wenyi Yu (4) Wei Li (4) Xianzhao Chen (4) Guangzhi Sun (3) Changli Tang (3) Tian Tan (3)

Keywords

domain adaptation (3) attention mechanism (2) large language model (2) data augmentation (2) automatic speech recognition (2) multimodal large language model (2) word error rate (2) neural network optimization (1) intent classification (1) knowledge distillation (1) physics-informed neural network (1) spoken language understanding (1) cross-modal learning (1) speech analysis (1) stochastic process (1) multimodal learning (1) instruction tuning (1) deep neural network (1) cross-modal representation (1) speech recognition (1)

Papers

Q Cache: Visual Attention Is Valuable in Less than Half of Decode Layers for Multimodal Large Language Model AAAI 2026 QualiSpeech: A Speech Quality Assessment Dataset with Natural Language Reasoning and Descriptions ACL 2025 ST3: Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming AAAI 2025 SALMONN: Towards Generic Hearing Abilities for Large Language Models ICLR 2024 Challenges in Training PINNs: A Loss Landscape Perspective ICML 2024 video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models ICML 2024 SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words NIPS 2024 PINNacle: A Comprehensive Benchmark of Physics-Informed Neural Networks for Solving PDEs NIPS 2024 PolyVoice: Language Models for Speech to Speech Translation ICLR 2024 MINT: Boosting Audio-Language Model via Multi-Target Pre-Training and Instruction Tuning INTERSPEECH 2024 Dual-Pipeline with Low-Rank Adaptation for New Language Integration in Multilingual ASR INTERSPEECH 2024 Can Large Language Models Understand Spatial Audio? INTERSPEECH 2024 Knowledge Distillation Approach for Efficient Internal Language Model Estimation INTERSPEECH 2023 Language-specific Boundary Learning for Improving Mandarin-English Code-switching Speech Recognition INTERSPEECH 2023 CIF-PT: Bridging Speech and Text Representations for Spoken Language Understanding via Continuous Integrate-and-Fire Pre-Training ACL 2023 AudioQR: Deep Neural Audio Watermarks For QR Code IJCAI 2023 Text-only Domain Adaptation using Unified Speech-Text Representation in Transducer INTERSPEECH 2023 Random Utterance Concatenation Based Data Augmentation for Improving Short-video Speech Recognition INTERSPEECH 2023