Baosong Yang
61 papers · 2017–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
π£ Hot Topic Early Bird π§ Keyword Pioneer πΊοΈ Taxonomy Completionist (12) π Interdisciplinary Bridge π Conference Polyglot (9)
πΊοΈ
Taxonomy Completionist
(12)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π
Conference Loyalist
(22)
π€
Dynamic Duo
(26)
π
Grand Slam
π¬
Deep Specialist
(30)
π§¬
Topic Evolution
π₯
Unstoppable
(9)
β‘
Prolific Year
(11)
β
The Questioner
(2)
ποΈ
Keyword Collector
(281)
π
Trend Setter
π
Century Club
(60)
π
Conference Pioneer
Conferences
EMNLP (22)
ACL (19)
NAACL (7)
AAAI (5)
COLING (2)
IJCNLP (2)
NIPS (2)
ICLR (1)
ICML (1)
Top co-authors
Keywords
neural machine translation
(20)
machine translation
(14)
large language model
(10)
quality estimation
(5)
attention mechanism
(5)
contrastive learning
(4)
representation learning
(4)
natural language generation
(4)
self-attention network
(4)
language model
(4)
pretrained language model
(3)
unsupervised learning
(3)
multi-head attention
(3)
domain adaptation
(3)
multilingual model
(3)
multilingual translation
(3)
text generation
(3)
embedding learning
(3)
self-supervised learning
(3)
cross-lingual transfer
(3)
Papers
A Data-Efficient Path to Multilingual LLMs: Language Expansion via Post-training PARAMπ₯ Integration into Upcycled MoE
ACL 2026
Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation
ICLR 2025
CultureSynth: A Hierarchical Taxonomy-Guided and Retrieval-Augmented Framework for Cultural Question-Answer Synthesis
EMNLP 2025
Enhancing LLM Language Adaption through Cross-lingual In-Context Pre-training
EMNLP 2025
Translationese-index: Using Likelihood Ratios for Graded and Generalizable Measurement of Translationese
EMNLP 2025
ConText: Driving In-context Learning for Text Removal and Segmentation
ICML 2025
NOVA-63: Native Omni-lingual Versatile Assessments of 63 Disciplines
EMNLP 2025
Unveiling Language-Specific Features in Large Language Models via Sparse Autoencoders
ACL 2025
P-MMEval: A Parallel Multilingual Multitask Benchmark for Consistent Evaluation of LLMs
EMNLP 2025
Locate-and-Focus: Enhancing Terminology Translation in Speech Language Models
ACL 2025
Enhancing Machine Translation with Self-Supervised Preference Data
ACL 2025
From English to Second Language Mastery: Enhancing LLMs with Cross-Lingual Continued Instruction Tuning
ACL 2025
Efficient k-Nearest-Neighbor Machine Translation with Dynamic Retrieval
ACL 2024
Final Submission of SJTULoveFiction to Literary Task
EMNLP 2024
SJTU System Description for the WMT24 Low-Resource Languages of Spain Task
EMNLP 2024
Large Language Model for Multi-Domain Translation: Benchmarking and Domain CoT Fine-tuning
EMNLP 2024
AnyTrans: Translate AnyText in the Image with Large Scale Models
EMNLP 2024
mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval
EMNLP 2024
MoNMT: Modularly Leveraging Monolingual and Bilingual Knowledge for Neural Machine Translation
COLING 2024
Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning
NIPS 2024
Meta-Reasoning: Semantics-Symbol Deconstruction for Large Language Models
ACL 2024
Unifying Discrete and Continuous Representations for Unsupervised Paraphrase Generation
EMNLP 2023
Tailor: A Soft-Prompt-Based Approach to Attribute-Based Controlled Text Generation
ACL 2023
Fantastic Expressions and Where to Find Them: Chinese Simile Generation with Multiple Constraints
ACL 2023
Bridging the Domain Gaps in Context Representations for k-Nearest Neighbor Neural Machine Translation
ACL 2023
MMNMT: Modularizing Multilingual Neural Machine Translation with Flexibly Assembled MoE and Dense Blocks
EMNLP 2023
EMMA-X: An EM-like Multilingual Pre-training Algorithm for Cross-lingual Representation Learning
NIPS 2023
Dynamic Voting for Efficient Reasoning in Large Language Models
EMNLP 2023
UniTE: Unified Translation Evaluation
ACL 2022
Should We Rely on Entity Mentions for Relation Extraction? Debiasing Relation Extraction with Counterfactual Analysis
NAACL 2022
Frequency-Aware Contrastive Learning for Neural Machine Translation
AAAI 2022
GCPG: A General Framework for Controllable Paraphrase Generation
ACL 2022
Competency-Aware Neural Machine Translation: Can Machine Translation Know its Own Translation Quality?
EMNLP 2022
WR-One2Set: Towards Well-Calibrated Keyphrase Generation
EMNLP 2022
Alibaba-Translate Chinaβs Submission for WMT2022 Metrics Shared Task
EMNLP 2022
Alibaba-Translate Chinaβs Submission for WMT 2022 Quality Estimation Shared Task
EMNLP 2022
Dangling-Aware Entity Alignment with Mixed High-Order Proximities
NAACL 2022
Bridging the Gap between Training and Inference: Multi-Candidate Optimization for Diverse Neural Machine Translation
NAACL 2022
KGR4: Retrieval, Retrospect, Refine and Rethink for Commonsense Generation
AAAI 2022
Attention Mechanism with Energy-Friendly Operations
ACL 2022
Unsupervised Preference-Aware Language Identification
ACL 2022
Towards User-Driven Neural Machine Translation
ACL 2021
RoBLEURT Submission for WMT2021 Metrics Task
EMNLP 2021
Bridging Subword Gaps in Pretrain-Finetune Paradigm for Natural Language Generation
ACL 2021
Towards User-Driven Neural Machine Translation
IJCNLP 2021
Bridging Subword Gaps in Pretrain-Finetune Paradigm for Natural Language Generation
IJCNLP 2021
Multi-Hop Transformer for Document-Level Machine Translation
NAACL 2021
Domain Transfer based Data Augmentation for Neural Query Translation
COLING 2020
Unsupervised Neural Dialect Translation with Commonality and Diversity Modeling
AAAI 2020
Neuron Interaction Based Representation Composition for Neural Machine Translation
AAAI 2020
Self-Paced Learning for Neural Machine Translation
EMNLP 2020
Uncertainty-Aware Curriculum Learning for Neural Machine Translation
ACL 2020
Assessing the Ability of Self-Attention Networks to Learn Word Order
ACL 2019
Modeling Recurrence for Transformer
NAACL 2019
Context-Aware Self-Attention Networks
AAAI 2019
Information Aggregation for Multi-Head Attention with Routing-by-Agreement
NAACL 2019
Leveraging Local and Global Patterns for Self-Attention Networks
ACL 2019
Convolutional Self-Attention Networks
NAACL 2019
Multi-Head Attention with Disagreement Regularization
EMNLP 2018
Modeling Localness for Self-Attention Networks
EMNLP 2018
Towards Bidirectional Hierarchical Representations for Attention-based Neural Machine Translation
EMNLP 2017