Bei Li
53 papers · 2017–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
π£ Hot Topic Early Bird π Conference Polyglot (9) π Interdisciplinary Bridge π Academic Marathon (8) π Cross-Pollinator (12)
πΊοΈ
Taxonomy Completionist
(68)
π§
Keyword Pioneer
π
Conference Polyglot
(9)
π§¬
Topic Evolution
π
Grand Slam
π¬
Deep Specialist
(18)
π€
Dynamic Duo
(33)
ποΈ
Keyword Collector
(197)
β
The Questioner
β‘
Prolific Year
(12)
π
Century Club
(44)
π₯
Unstoppable
(9)
Conferences
ACL (21)
EMNLP (16)
AAAI (6)
COLING (3)
ICLR (2)
ICML (2)
IJCNLP (1)
INTERSPEECH (1)
NIPS (1)
Top co-authors
Keywords
neural machine translation
(11)
machine translation
(10)
knowledge distillation
(10)
large language model
(6)
model compression
(6)
transformer architecture
(5)
neural network optimization
(4)
cross-lingual transfer
(4)
reinforcement learning
(3)
reward model
(3)
transformer model
(3)
sequence generation
(3)
neural network
(3)
in-context learning
(3)
abstractive summarization
(3)
parameter-efficient fine-tuning
(2)
vision-language model
(2)
proximal policy optimization
(2)
image captioning
(2)
language modeling
(2)
Papers
NiuTrans.LMT: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs
ACL 2026
On the Emotion Understanding of Synthesized Speech
ACL 2026
Probing Preference Representations: A Multi-Dimensional Evaluation and Analysis Method for Reward Models
AAAI 2026
GRAM-RΒ²: Self-Training Generative Foundation Reward Models for Reward Reasoning
AAAI 2026
SageLM: A Multi-aspect and Explainable Large Language Model for Speech Judgement
AAAI 2026
Tuning Medical Foundation Models for Inner Ear Temporal CT Analysis with Plug-and-play Domain Knowledge Aggregator
AAAI 2026
RouteLMT: Learned Sample Routing for Hybrid LLM Translation Deployment
ACL 2026
LANG: Reinforcement Learning for Multilingual Reasoning with Language-Adaptive Hint Guidance
ACL 2026
MTR-Suite: A Framework for Evaluating and Synthesizing Conversational Retrieval Benchmarks
ACL 2026
Selecting Demonstrations for Many-Shot In-Context Learning via Gradient Matching
ACL 2025
Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation
ACL 2025
Disentangling Preference Representation and Text Generation for Efficient Individual Preference Alignment
COLING 2025
SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment
COLING 2025
ReMamba: Equip Mamba with Effective Long-Sequence Modeling
EMNLP 2025
TCPO: Thought-Centric Preference Optimization for Effective Embodied Decision-making
EMNLP 2025
IIET: Efficient Numerical Transformer via Implicit Iterative Euler Method
EMNLP 2025
GRAM: A Generative Foundation Reward Model for Reward Generalization
ICML 2025
Earlier Tokens Contribute More: Learning Direct Preference Optimization From Temporal Decay Perspective
ICLR 2025
Language-Specific Layer Matters: Efficient Multilingual Enhancement for Large Vision-Language Models
EMNLP 2025
Revealing the Parallel Multilingual Learning within Large Language Models
EMNLP 2024
ESRL: Efficient Sampling-Based Reinforcement Learning for Sequence Generation
AAAI 2024
EIT: Enhanced Interactive Transformer
ACL 2024
PartialFormer: Modeling Part Instead of Whole for Machine Translation
ACL 2024
Hybrid Alignment Training for Large Language Models
ACL 2024
3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset
COLING 2024
Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation
EMNLP 2024
Forgetting Curve: A Reliable Method for Evaluating Memorization Capability for Long-Context Models
EMNLP 2024
Scaling Laws Across Model Architectures: A Comparative Analysis of Dense and MoE Models in Large Language Models
EMNLP 2024
Predictor-Corrector Enhanced Transformers with Exponential Moving Average Coefficient Learning
NIPS 2024
CodeAgent: Autonomous Communicative Agents for Code Review
EMNLP 2024
Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
ICLR 2024
Rethinking and Improving Multi-task Learning for End-to-end Speech Translation
EMNLP 2023
Augmenting Large Language Model Translators via Translation Memories
ACL 2023
ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning
ACL 2023
TranSFormer: Slow-Fast Transformer for Machine Translation
ACL 2023
Incorporating Probing Signals into Multimodal Machine Translation via Visual Question-Answering Pairs
EMNLP 2023
ODE Transformer: An Ordinary Differential Equation-Inspired Model for Sequence Generation
ACL 2022
On Vision Features in Multimodal Machine Translation
ACL 2022
Learning Multiscale Transformer Models for Sequence Generation
ICML 2022
The NiuTransβs Submission to the IWSLT22 English-to-Chinese Offline Speech Translation Task
ACL 2022
The NiuTrans System for the WMT 2021 Efficiency Task
EMNLP 2021
The NiuTrans Machine Translation Systems for WMT21
EMNLP 2021
Weight Distillation: Transferring the Knowledge in Neural Network Parameters
IJCNLP 2021
Learning Light-Weight Translation Models from Deep Transformer
AAAI 2021
Weight Distillation: Transferring the Knowledge in Neural Network Parameters
ACL 2021
The NiuTrans System for WNGT 2020 Efficiency Task
ACL 2020
The NiuTrans Machine Translation Systems for WMT20
EMNLP 2020
Shallow-to-Deep Training for Neural Machine Translation
EMNLP 2020
Does Multi-Encoder Help? A Case Study on Context-Aware Neural Machine Translation
ACL 2020
Learning Deep Transformer Models for Machine Translation
ACL 2019
The NiuTrans Machine Translation Systems for WMT19
ACL 2019
The NiuTrans Machine Translation System for WMT18
EMNLP 2018
Mechanisms of Tone Sandhi Rule Application by Non-Native Speakers
INTERSPEECH 2017