Lu Lu
18 papers · 2023–2026 · 7 top CS/AI conferences
Achievements
Taxonomy Completionist (14)
Keyword Pioneer
Interdisciplinary Bridge
Renaissance Researcher (5)
Conference Polyglot (7)
Dynamic Duo (10)
Triple Crown
Grand Slam
Keyword Collector (82)
Prolific Year (6)
The Questioner
Century Club (17)
Conferences
INTERSPEECH (7)
AAAI (2)
ACL (2)
ICLR (2)
ICML (2)
NIPS (2)
IJCAI (1)
Keywords
domain adaptation (3)
attention mechanism (2)
large language model (2)
data augmentation (2)
automatic speech recognition (2)
multimodal large language model (2)
word error rate (2)
neural network optimization (1)
intent classification (1)
knowledge distillation (1)
physics-informed neural network (1)
spoken language understanding (1)
cross-modal learning (1)
speech analysis (1)
stochastic process (1)
multimodal learning (1)
instruction tuning (1)
deep neural network (1)
cross-modal representation (1)
speech recognition (1)
Papers
Q Cache: Visual Attention Is Valuable in Less than Half of Decode Layers for Multimodal Large Language Model
AAAI 2026
QualiSpeech: A Speech Quality Assessment Dataset with Natural Language Reasoning and Descriptions
ACL 2025
ST3: Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming
AAAI 2025
SALMONN: Towards Generic Hearing Abilities for Large Language Models
ICLR 2024
Challenges in Training PINNs: A Loss Landscape Perspective
ICML 2024
video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models
ICML 2024
SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words
NIPS 2024
PINNacle: A Comprehensive Benchmark of Physics-Informed Neural Networks for Solving PDEs
NIPS 2024
PolyVoice: Language Models for Speech to Speech Translation
ICLR 2024
MINT: Boosting Audio-Language Model via Multi-Target Pre-Training and Instruction Tuning
INTERSPEECH 2024
Dual-Pipeline with Low-Rank Adaptation for New Language Integration in Multilingual ASR
INTERSPEECH 2024
Can Large Language Models Understand Spatial Audio?
INTERSPEECH 2024
Knowledge Distillation Approach for Efficient Internal Language Model Estimation
INTERSPEECH 2023
Language-specific Boundary Learning for Improving Mandarin-English Code-switching Speech Recognition
INTERSPEECH 2023
CIF-PT: Bridging Speech and Text Representations for Spoken Language Understanding via Continuous Integrate-and-Fire Pre-Training
ACL 2023
AudioQR: Deep Neural Audio Watermarks For QR Code
IJCAI 2023
Text-only Domain Adaptation using Unified Speech-Text Representation in Transducer
INTERSPEECH 2023
Random Utterance Concatenation Based Data Augmentation for Improving Short-video Speech Recognition
INTERSPEECH 2023