Jian Luan
41 papers · 2019–2026 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
π Conference Polyglot (8) π§ Keyword Pioneer π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (20) π Academic Marathon (6)
π
Academic Marathon
(6)
π
Cross-Pollinator
(14)
π
Renaissance Researcher
(8)
π
Keyword Champion
(2)
π€
Dynamic Duo
(23)
π§¬
Topic Evolution
β‘
Prolific Year
(16)
π₯
Unstoppable
(5)
π
Century Club
(36)
π
Trend Setter
ποΈ
Keyword Collector
(198)
β
The Questioner
Conferences
ACL (17)
EMNLP (7)
INTERSPEECH (7)
AAAI (3)
NAACL (3)
COLING (2)
ICCV (1)
IJCAI (1)
Top co-authors
Research topics
Keywords
large language model
(13)
model compression
(6)
reinforcement learning
(4)
attention mechanism
(3)
language model
(3)
multimodal learning
(3)
in-context learning
(3)
data augmentation
(3)
task completion
(2)
transformer architecture
(2)
mobile agent
(2)
simultaneous translation
(2)
few-shot learning
(2)
polyphonic music
(2)
speech synthesis
(2)
vision-language model
(2)
visual reasoning
(2)
knowledge distillation
(2)
instruction tuning
(2)
multimodal large language model
(2)
Papers
AV-Edit: Multimodal Generative Sound Effect Editing via Audio-Visual Semantic Joint Control
AAAI 2026
End-to-End Optimization of LLM-Driven Multi-Agent Search Systems via Heterogeneous-Group-Based Reinforcement Learning
ACL 2026
VecInfer: Efficient LLM Inference with Low-Bit KV Cache via Outlier-Suppressed Vector Quantization
ACL 2026
Doc-V*: Coarse-to-Fine Interactive Visual Reasoning for Multi-Page Document VQA
ACL 2026
Attention Basin: Why Contextual Position Matters in Large Language Models
ACL 2026
Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study
NAACL 2025
ReachAgent: Enhancing Mobile Agent via Page Reaching and Operation
NAACL 2025
Browsing Like Human: A Multimodal Web Agent with Experiential Fast-and-Slow Thinking
ACL 2025
Demystifying Small Language Models for Edge Deployment
ACL 2025
BacktrackAgent: Enhancing GUI Agent with Error Detection and Backtracking Mechanism
EMNLP 2025
HoPE: A Novel Positional Encoding Without Long-Term Decay for Enhanced Context Awareness and Extrapolation
ACL 2025
Weaving Context Across Images: Improving Vision-Language Models through Focus-Centric Visual Chains
ACL 2025
More is not always better? Enhancing Many-Shot In-Context Learning with Differentiated and Reweighting Objectives
ACL 2025
TailorKV: A Hybrid Framework for Long-Context Inference via Tailored KV Cache Optimization
ACL 2025
PMSS: Pretrained Matrices Skeleton Selection for LLM Fine-tuning
COLING 2025
Stability and Generalization of Zeroth-Order Decentralized Stochastic Gradient Descent with Changing Topology
AAAI 2025
Global Eye: Breaking the βFixed Thinking Patternβ during the Instruction Expansion Process
ACL 2025
MAKAR: a Multi-Agent framework based Knowledge-Augmented Reasoning for Grounded Multimodal Named Entity Recognition
EMNLP 2025
SPO: Self Preference Optimization with Self Regularization
EMNLP 2025
Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs
ICCV 2025
Theoretical Insights into Fine-Tuning Attention Mechanism: Generalization and Optimization
IJCAI 2025
MobileVLM: A Vision-Language Model for Better Intra- and Inter-UI Understanding
EMNLP 2024
Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents
ACL 2024
DetermLR: Augmenting LLM-based Logical Reasoning from Indeterminacy to Determinacy
ACL 2024
Pruning Large Language Models to Intra-module Low-rank Architecture with Transitional Activations
ACL 2024
A Comprehensive Evaluation of Quantization Strategies for Large Language Models
ACL 2024
ToolRerank: Adaptive and Hierarchy-Aware Reranking for Tool Retrieval
COLING 2024
ToolPlanner: A Tool Augmented LLM for Multi Granularity Instructions with Path Planning and Feedback
EMNLP 2024
Mixture of Diverse Size Experts
EMNLP 2024
The Xiaomi AI Labβs Speech Translation Systems for IWSLT 2023 Offline Task, Simultaneous Task and Speech-to-Speech Task
ACL 2023
BERT-ERC: Fine-Tuning BERT Is Enough for Emotion Recognition in Conversation
AAAI 2023
Exploring All-In-One Knowledge Distillation Framework for Neural Machine Translation
EMNLP 2023
Exploring Better Text Image Translation with Multimodal Codebook
ACL 2023
Improving Bilingual TTS Using Language And Phonology Embedding With Embedding Strength Modulator
INTERSPEECH 2023
LightClone: Speaker-guided Parallel Subnet Selection for Few-shot Voice Cloning
INTERSPEECH 2023
BIT-Xiaomiβs System for AutoSimTrans 2022
NAACL 2022
Transfer Learning for Improving Singing-Voice Detection in Polyphonic Instrumental Music
INTERSPEECH 2020
Re-Weighted Interval Loss for Handling Data Imbalance Problem of End-to-End Keyword Spotting
INTERSPEECH 2020
Adversarially Trained Multi-Singer Sequence-to-Sequence Singing Synthesizer
INTERSPEECH 2020
XiaoiceSing: A High-Quality and Integrated Singing Voice Synthesis System
INTERSPEECH 2020
Vocal Pitch Extraction in Polyphonic Music Using Convolutional Residual Network
INTERSPEECH 2019