Tong Xiao
134 papers · 2009–2026 · across 14 top CS/AI conferences
Achievements
Keyword Pioneer
Taxonomy Completionist (19)
Renaissance Researcher (5)
Interdisciplinary Bridge
Conference Polyglot (14)
Academic Marathon (16)
Conference Loyalist (27)
Dynamic Duo (94)
Grand Slam
Topic Pioneer
Deep Specialist (40)
Topic Evolution
Keyword Champion (3)
Trend Setter
Keyword Collector (423)
Century Club (123)
Unstoppable (17)
Prolific Year (14)
The Questioner (3)
Conferences
ACL (47)
EMNLP (27)
CVPR (13)
AAAI (11)
COLING (10)
IJCAI (6)
IJCNLP (6)
ICCV (4)
ICML (3)
ICLR (2)
NIPS (2)
ECCV (1)
INTERSPEECH (1)
NAACL (1)
Research topics
Keywords
neural machine translation (25)
machine translation (16)
knowledge distillation (16)
model compression (14)
speech translation (11)
large language model (10)
person re-identification (7)
neural network (6)
language modeling (5)
transformer architecture (5)
transformer model (5)
neural network optimization (5)
multi-task learning (4)
sequence generation (4)
metric learning (4)
in-context learning (4)
neural architecture search (4)
transfer learning (4)
automatic speech recognition (4)
cross-lingual transfer (4)
Papers
CoMeT: Collaborative Memory Transformer for Efficient Long Context Modeling
ACL 2026
MTR-Suite: A Framework for Evaluating and Synthesizing Conversational Retrieval Benchmarks
ACL 2026
NiuTrans.LMT: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs
ACL 2026
On the Emotion Understanding of Synthesized Speech
ACL 2026
Empirical Analysis of Decoding Biases in Masked Diffusion Models
ACL 2026
Probing Preference Representations: A Multi-Dimensional Evaluation and Analysis Method for Reward Models
AAAI 2026
WaveEx: Accelerating Flow Matching-based Speech Generation via Wavelet-guided Extrapolation
AAAI 2026
GRAM-R²: Self-Training Generative Foundation Reward Models for Reward Reasoning
AAAI 2026
SageLM: A Multi-aspect and Explainable Large Language Model for Speech Judgement
AAAI 2026
RouteLMT: Learned Sample Routing for Hybrid LLM Translation Deployment
ACL 2026
LANG: Reinforcement Learning for Multilingual Reasoning with Language-Adaptive Hint Guidance
ACL 2026
Enhancing Speech Large Language Models with Prompt-Aware Mixture of Audio Encoders
EMNLP 2025
HEAL: A Hypothesis-Based Preference-Aware Analysis Framework
EMNLP 2025
Language-Specific Layer Matters: Efficient Multilingual Enhancement for Large Vision-Language Models
EMNLP 2025
Position IDs Matter: An Enhanced Position Layout for Efficient Context Compression in Large Language Models
EMNLP 2025
SAM Encoder Breach by Adversarial Simplicial Complex Triggers Downstream Model Failures
ICCV 2025
RoVRM: A Robust Visual Reward Model Optimized via Auxiliary Textual Preference Data
AAAI 2025
Enhancing Neural Machine Translation Through Target Language Data: A kNN-LM Approach for Domain Adaptation
ACL 2025
Lost in Literalism: How Supervised Training Shapes Translationese in LLMs
ACL 2025
Alleviating Hallucinations from Knowledge Misalignment in Large Language Models via Selective Abstention Learning
ACL 2025
Leveraging Unit Language Guidance to Advance Speech Modeling in Textless Speech-to-Speech Translation
ACL 2025
Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation
ACL 2025
SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment
COLING 2025
UGPhysics: A Comprehensive Benchmark for Undergraduate Physics Reasoning with Large Language Models
ICML 2025
Apollo: An Exploration of Video Understanding in Large Multimodal Models
CVPR 2025
Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation
CVPR 2025
Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs
CVPR 2025
GRAM: A Generative Foundation Reward Model for Reward Generalization
ICML 2025
Can LLMs Solve Longer Math Word Problems Better?
ICLR 2025
TLDR: Token-Level Detective Reward Model for Large Vision Language Models
ICLR 2025
IIET: Efficient Numerical Transformer via Implicit Iterative Euler Method
EMNLP 2025
Step-level Verifier-guided Hybrid Test-Time Scaling for Large Language Models
EMNLP 2025
SocraticLM: Exploring Socratic Personalized Teaching with Large Language Models
NIPS 2024
Exploiting Target Language Data for Neural Machine Translation Beyond Back Translation
ACL 2024
ESRL: Efficient Sampling-Based Reinforcement Learning for Sequence Generation
AAAI 2024
RankPrompt: Step-by-Step Comparisons Make Language Models Better Reasoners
COLING 2024
Predictor-Corrector Enhanced Transformers with Exponential Moving Average Coefficient Learning
NIPS 2024
Recent Advances in End-to-End Simultaneous Speech Translation
IJCAI 2024
Learning to Solve Geometry Problems via Simulating Human Dual-Reasoning Process
IJCAI 2024
Fairy: Fast Parallelized Instruction-Guided Video-to-Video Synthesis
CVPR 2024
EIT: Enhanced Interactive Transformer
ACL 2024
Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation
EMNLP 2024
Forgetting Curve: A Reliable Method for Evaluating Memorization Capability for Long-Context Models
EMNLP 2024
Revealing the Parallel Multilingual Learning within Large Language Models
EMNLP 2024
Teaching Language Models to Self-Improve by Learning from Language Feedback
ACL 2024
PartialFormer: Modeling Part Instead of Whole for Machine Translation
ACL 2024
Revisiting Interpolation Augmentation for Speech-to-Text Generation
ACL 2024
Hybrid Alignment Training for Large Language Models
ACL 2024
TranSFormer: Slow-Fast Transformer for Machine Translation
ACL 2023
Bridging the Granularity Gap for Acoustic Modeling
ACL 2023
MobileNMT: Enabling Translation in 15MB and 30ms
ACL 2023
Improving Autoregressive Grammatical Error Correction with Non-autoregressive Models
ACL 2023
Improving End-to-End Speech Translation by Leveraging Auxiliary Speech and Text Data
AAAI 2023
Information Magnitude Based Dynamic Sub-sampling for Speech-to-text
INTERSPEECH 2023
Rethinking and Improving Multi-task Learning for End-to-end Speech Translation
EMNLP 2023
Incorporating Probing Signals into Multimodal Machine Translation via Visual Question-Answering Pairs
EMNLP 2023
Recent Advances in Direct Speech-to-text Translation
IJCAI 2023
The NiuTrans End-to-End Speech Translation System for IWSLT23 English-to-Chinese Offline Task
ACL 2023
Augmenting Large Language Model Translators via Translation Memories
ACL 2023
CTC-based Non-autoregressive Speech Translation
ACL 2023
Modality Adaption or Regularization? A Case Study on End-to-End Speech Translation
ACL 2023
Prompting Neural Machine Translation with Translation Memories
AAAI 2023
The NiuTrans Machine Translation Systems for WMT22
EMNLP 2022
Multi-Path Transformer is Better: A Case Study on Neural Machine Translation
EMNLP 2022
On Vision Features in Multimodal Machine Translation
ACL 2022
ODE Transformer: An Ordinary Differential Equation-Inspired Model for Sequence Generation
ACL 2022
The NiuTrans's Submission to the IWSLT22 English-to-Chinese Offline Speech Translation Task
ACL 2022
Improved Knowledge Distillation for Pre-trained Language Models via Knowledge Selection
EMNLP 2022
Learning Multiscale Transformer Models for Sequence Generation
ICML 2022
The NiuTrans Machine Translation Systems for WMT21
EMNLP 2021
An Efficient Transformer Decoder with Compressed Sub-layers
AAAI 2021
Non-Autoregressive Translation by Learning Target Categorical Codes
NAACL 2021
The NiuTrans End-to-End Speech Translation System for IWSLT 2021 Offline Task
IJCNLP 2021
Stacked Acoustic-and-Textual Encoding: Integrating the Pre-trained Models into Speech Translation Encoders
IJCNLP 2021
Weight Distillation: Transferring the Knowledge in Neural Network Parameters
IJCNLP 2021
Learning Light-Weight Translation Models from Deep Transformer
AAAI 2021
RankNAS: Efficient Neural Architecture Search by Pairwise Ranking
EMNLP 2021
Weight Distillation: Transferring the Knowledge in Neural Network Parameters
ACL 2021
Stacked Acoustic-and-Textual Encoding: Integrating the Pre-trained Models into Speech Translation Encoders
ACL 2021
The NiuTrans End-to-End Speech Translation System for IWSLT 2021 Offline Task
ACL 2021
Bag of Tricks for Optimizing Transformer Efficiency
EMNLP 2021
The NiuTrans System for the WMT 2021 Efficiency Task
EMNLP 2021
The NiuTrans System for WNGT 2020 Efficiency Task
ACL 2020
The NiuTrans System for the WMT20 Quality Estimation Shared Task
EMNLP 2020
Dynamic Curriculum Learning for Low-Resource Neural Machine Translation
COLING 2020
Layer-Wise Multi-View Learning for Neural Machine Translation
COLING 2020
A Simple and Effective Approach to Robust Unsupervised Bilingual Dictionary Induction
COLING 2020
Towards Fully 8-bit Integer Inference for the Transformer Model
IJCAI 2020
Geometric Correspondence Fields: Learned Differentiable Rendering for 3D Pose Refinement in the Wild
ECCV 2020
Shallow-to-Deep Training for Neural Machine Translation
EMNLP 2020
Training Flexible Depth Model by Multi-Task Learning for Neural Machine Translation
EMNLP 2020
The NiuTrans Machine Translation Systems for WMT20
EMNLP 2020
Neural Machine Translation with Joint Representation
AAAI 2020
MOOCCube: A Large-scale Data Repository for NLP Applications in MOOCs
ACL 2020
Does Multi-Encoder Help? A Case Study on Context-Aware Neural Machine Translation
ACL 2020
Learning Architectures from an Extended Search Space for Language Modeling
ACL 2020
Improved Differentiable Architecture Search for Language Modeling and Named Entity Recognition
IJCNLP 2019
Learning Deep Transformer Models for Machine Translation
ACL 2019
Shared-Private Bilingual Word Embeddings for Neural Machine Translation
ACL 2019
The NiuTrans Machine Translation Systems for WMT19
ACL 2019
Improved Differentiable Architecture Search for Language Modeling and Named Entity Recognition
EMNLP 2019
Order-Aware Generative Modeling Using the 3D-Craft Dataset
ICCV 2019
Sharing Attention Weights for Fast Transformer
IJCAI 2019
Deep Group-Shuffling Random Walk for Person Re-Identification
CVPR 2018
Video Person Re-Identification With Competitive Snippet-Similarity Aggregation and Co-Attentive Snippet Embedding
CVPR 2018
A Simple and Effective Approach to Coverage-Aware Neural Machine Translation
ACL 2018
End-to-End Deep Kronecker-Product Matching for Person Re-Identification
CVPR 2018
The NiuTrans Machine Translation System for WMT18
EMNLP 2018
Multi-layer Representation Fusion for Neural Machine Translation
COLING 2018
Identity-Aware Textual-Visual Matching With Latent Co-Attention
ICCV 2017
Object Detection in Videos With Tubelet Proposal Networks
CVPR 2017
Implicit Syntactic Features for Target-dependent Sentiment Analysis
IJCNLP 2017
Towards Bidirectional Hierarchical Representations for Attention-based Neural Machine Translation
EMNLP 2017
Learning Deep Neural Networks for Vehicle Re-ID With Visual-Spatio-Temporal Path Proposals
ICCV 2017
Fast Parallel Training of Neural Language Models
IJCAI 2017
Joint Detection and Identification Feature Learning for Person Search
CVPR 2017
Person Search With Natural Language Description
CVPR 2017
Learning Deep Feature Representations With Domain Guided Dropout for Person Re-Identification
CVPR 2016
NiuParser: A Chinese Syntactic and Semantic Parsing Toolkit
IJCNLP 2015
Learning From Massive Noisy Labeled Data for Image Classification
CVPR 2015
NiuParser: A Chinese Syntactic and Semantic Parsing Toolkit
ACL 2015
A Hybrid Approach to Skeleton-based Translation
ACL 2014
DeepReID: Deep Filter Pairing Neural Network for Person Re-Identification
CVPR 2014
Effective Incorporation of Source Syntax into Hierarchical Phrase-based Translation
COLING 2014
Easy-First POS Tagging and Dependency Parsing with Beam Search
ACL 2013
Learning Better Rule Extraction with Translation Span Alignment
ACL 2012
NiuTrans: An Open Source Toolkit for Phrase-based and Syntax-based Machine Translation
ACL 2012
Easy-First Chinese POS Tagging and Dependency Parsing
COLING 2012
Improving Decoding Generalization for Tree-to-String Translation
ACL 2011
Heterogeneous Parsing via Collaborative Decoding
COLING 2010
An Empirical Study of Translation Rule Extraction with Multiple Parsers
COLING 2010
Boosting-Based System Combination for Machine Translation
ACL 2010
Better Synchronous Binarization for Machine Translation
EMNLP 2009
The Feature Subspace Method for SMT System Combination
EMNLP 2009