Bei Li

53 papers · 2017–2026 · 9 top CS/AI conferences

Achievements

🐣 Hot Topic Early Bird 🌍 Conference Polyglot (9) 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (8) 🐝 Cross-Pollinator (12)

🗺️ Taxonomy Completionist (68) 🧭 Keyword Pioneer 🧬 Topic Evolution 🏆 Grand Slam 🔬 Deep Specialist (18) 🤝 Dynamic Duo (33) 🗃️ Keyword Collector (197) ❓ The Questioner ⚡ Prolific Year (12) 💎 Century Club (44) 🔥 Unstoppable (9)

Conferences

ACL (21) EMNLP (16) AAAI (6) COLING (3) ICLR (2) ICML (2) IJCNLP (1) INTERSPEECH (1) NIPS (1)

Top co-authors

Tong Xiao (41) Jingbo Zhu (41) Yongyu Mu (13) Chenglong Wang (12) Jingang Wang (9) Ziyang Wang (7) Tong Zheng (7) Quan Du (7) Chunliang Zhang (7) Ye Lin (6)

Keywords

neural machine translation (11) machine translation (10) knowledge distillation (10) large language model (6) model compression (6) transformer architecture (5) neural network optimization (4) cross-lingual transfer (4) reinforcement learning (3) reward model (3) transformer model (3) sequence generation (3) neural network (3) in-context learning (3) abstractive summarization (3) parameter-efficient fine-tuning (2) vision-language model (2) proximal policy optimization (2) image captioning (2) language modeling (2)

Papers

NiuTrans.LMT: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs · ACL 2026
On the Emotion Understanding of Synthesized Speech · ACL 2026
Probing Preference Representations: A Multi-Dimensional Evaluation and Analysis Method for Reward Models · AAAI 2026
GRAM-R²: Self-Training Generative Foundation Reward Models for Reward Reasoning · AAAI 2026
SageLM: A Multi-aspect and Explainable Large Language Model for Speech Judgement · AAAI 2026
Tuning Medical Foundation Models for Inner Ear Temporal CT Analysis with Plug-and-play Domain Knowledge Aggregator · AAAI 2026
RouteLMT: Learned Sample Routing for Hybrid LLM Translation Deployment · ACL 2026
LANG: Reinforcement Learning for Multilingual Reasoning with Language-Adaptive Hint Guidance · ACL 2026
MTR-Suite: A Framework for Evaluating and Synthesizing Conversational Retrieval Benchmarks · ACL 2026
Selecting Demonstrations for Many-Shot In-Context Learning via Gradient Matching · ACL 2025
Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation · ACL 2025
Disentangling Preference Representation and Text Generation for Efficient Individual Preference Alignment · COLING 2025
SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment · COLING 2025
ReMamba: Equip Mamba with Effective Long-Sequence Modeling · EMNLP 2025
TCPO: Thought-Centric Preference Optimization for Effective Embodied Decision-making · EMNLP 2025
IIET: Efficient Numerical Transformer via Implicit Iterative Euler Method · EMNLP 2025
GRAM: A Generative Foundation Reward Model for Reward Generalization · ICML 2025
Earlier Tokens Contribute More: Learning Direct Preference Optimization From Temporal Decay Perspective · ICLR 2025
Language-Specific Layer Matters: Efficient Multilingual Enhancement for Large Vision-Language Models · EMNLP 2025
Revealing the Parallel Multilingual Learning within Large Language Models · EMNLP 2024
ESRL: Efficient Sampling-Based Reinforcement Learning for Sequence Generation · AAAI 2024
EIT: Enhanced Interactive Transformer · ACL 2024
PartialFormer: Modeling Part Instead of Whole for Machine Translation · ACL 2024
Hybrid Alignment Training for Large Language Models · ACL 2024
3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset · COLING 2024
Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation · EMNLP 2024
Forgetting Curve: A Reliable Method for Evaluating Memorization Capability for Long-Context Models · EMNLP 2024
Scaling Laws Across Model Architectures: A Comparative Analysis of Dense and MoE Models in Large Language Models · EMNLP 2024
Predictor-Corrector Enhanced Transformers with Exponential Moving Average Coefficient Learning · NIPS 2024
CodeAgent: Autonomous Communicative Agents for Code Review · EMNLP 2024
Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers · ICLR 2024
Rethinking and Improving Multi-task Learning for End-to-end Speech Translation · EMNLP 2023
Augmenting Large Language Model Translators via Translation Memories · ACL 2023
ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning · ACL 2023
TranSFormer: Slow-Fast Transformer for Machine Translation · ACL 2023
Incorporating Probing Signals into Multimodal Machine Translation via Visual Question-Answering Pairs · EMNLP 2023
ODE Transformer: An Ordinary Differential Equation-Inspired Model for Sequence Generation · ACL 2022
On Vision Features in Multimodal Machine Translation · ACL 2022
Learning Multiscale Transformer Models for Sequence Generation · ICML 2022
The NiuTrans’s Submission to the IWSLT22 English-to-Chinese Offline Speech Translation Task · ACL 2022
The NiuTrans System for the WMT 2021 Efficiency Task · EMNLP 2021
The NiuTrans Machine Translation Systems for WMT21 · EMNLP 2021
Weight Distillation: Transferring the Knowledge in Neural Network Parameters · IJCNLP 2021
Learning Light-Weight Translation Models from Deep Transformer · AAAI 2021
Weight Distillation: Transferring the Knowledge in Neural Network Parameters · ACL 2021
The NiuTrans System for WNGT 2020 Efficiency Task · ACL 2020
The NiuTrans Machine Translation Systems for WMT20 · EMNLP 2020
Shallow-to-Deep Training for Neural Machine Translation · EMNLP 2020
Does Multi-Encoder Help? A Case Study on Context-Aware Neural Machine Translation · ACL 2020
Learning Deep Transformer Models for Machine Translation · ACL 2019
The NiuTrans Machine Translation Systems for WMT19 · ACL 2019
The NiuTrans Machine Translation System for WMT18 · EMNLP 2018
Mechanisms of Tone Sandhi Rule Application by Non-Native Speakers · INTERSPEECH 2017