Shujian Huang
124 papers · 2010–2026 · 14 conferences · across top CS/AI conferences
Achievements
🌍 Conference Polyglot (14)
🧭 Keyword Pioneer
🌉 Interdisciplinary Bridge
🗺️ Taxonomy Completionist (13)
🏃 Academic Marathon (15)
🏠 Conference Loyalist (33)
🤝 Dynamic Duo (95)
🏆 Grand Slam
🔬 Deep Specialist (45)
🏆 Keyword Champion (2)
📈 Trend Setter
⚡ Prolific Year (14)
❓ The Questioner (2)
🗃️ Keyword Collector (417)
💎 Century Club (120)
🔥 Unstoppable (11)
Conferences
ACL (36)
EMNLP (32)
NAACL (14)
AAAI (12)
IJCNLP (8)
COLING (6)
IJCAI (6)
CONLL (2)
ICLR (2)
ICML (2)
AACL (1)
ACML (1)
EACL (1)
NIPS (1)
Keywords
neural machine translation (25)
large language model (22)
machine translation (15)
domain adaptation (9)
knowledge distillation (8)
quality estimation (8)
cross-lingual transfer (8)
neural network (7)
representation learning (6)
transfer learning (5)
word embedding (5)
preference optimization (4)
multilingual reasoning (4)
word-level prediction (4)
multi-task learning (3)
multilingual nlp (3)
instruction tuning (3)
k-nearest neighbor (3)
text generation (3)
zero-shot learning (3)
Papers
Reasoning While Asking: Transforming Reasoning Large Language Models from Passive Solvers to Proactive Inquirers
ACL 2026
Improving Long-Context Translation via Self-Supervised Dual Learning
ACL 2026
A Data-Efficient Path to Multilingual LLMs: Language Expansion via Post-training PARAM𝛥 Integration into Upcycled MoE
ACL 2026
How Does Alignment Enhance LLMs’ Multilingual Capabilities? A Language Neurons Perspective
AAAI 2026
TRANS-ZERO: Self-Play Incentivizes Large Language Models for Multilingual Translation Without Parallel Data
ACL 2025
SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment
COLING 2025
Investigating and Scaling up Code-Switching for Multilingual Language Model Pre-Training
ACL 2025
Why Not Act on What You Know? Unleashing Safety Potential of LLMs via Self-Aware Guard Enhancement
ACL 2025
MoE-LPR: Multilingual Extension of Large Language Models Through Mixture-of-Experts with Language Priors Routing
AAAI 2025
SDGO: Self-Discrimination-Guided Optimization for Consistent Safety in Large Language Models
EMNLP 2025
R-PRM: Reasoning-Driven Process Reward Modeling
EMNLP 2025
Alleviating Distribution Shift in Synthetic Data for Machine Translation Quality Estimation
ACL 2025
Process-based Self-Rewarding Language Models
ACL 2025
LLM’s Weakness in NER Doesn’t Stop It from Enhancing a Stronger SLM
NAACL 2025
Large Language Models Are Cross-Lingual Knowledge-Free Reasoners
NAACL 2025
BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models
EMNLP 2025
EnAnchored-X2X: English-Anchored Optimization for Many-to-Many Translation
EMNLP 2025
Understanding LLMs’ Cross-Lingual Context Retrieval: How Good It Is And Where It Comes From
EMNLP 2025
Self-Evolution Knowledge Distillation for LLM-based Machine Translation
COLING 2025
Elucidating the Design Space of Multimodal Protein Language Models
ICML 2025
DPLM-2: A Multimodal Diffusion Protein Language Model
ICLR 2025
Exploring the Factual Consistency in Dialogue Comprehension of Large Language Models
NAACL 2024
kNN-BOX: A Unified Framework for Nearest Neighbor Generation
EACL 2024
PreAlign: Boosting Cross-Lingual Transfer by Early Establishment of Multilingual Alignment
EMNLP 2024
Diffusion Language Models Are Versatile Protein Learners
ICML 2024
Multilingual Machine Translation with Large Language Models: Empirical Results and Analysis
NAACL 2024
MT-PATCHER: Selective and Extendable Knowledge Distillation from Large Language Models for Machine Translation
NAACL 2024
Multilingual Pretraining and Instruction Tuning Improve Cross-Lingual Knowledge Alignment, But Only Shallowly
NAACL 2024
Getting More from Less: Large Language Models are Good Spontaneous Multilingual Learners
EMNLP 2024
Formality is Favored: Unraveling the Learning Preferences of Large Language Models on Data with Conflicting Knowledge
EMNLP 2024
Large Language Models are Limited in Out-of-Context Knowledge Reasoning
EMNLP 2024
EfficientRAG: Efficient Retriever for Multi-Hop Question Answering
EMNLP 2024
Multilingual Contrastive Decoding via Language-Agnostic Layers Skipping
EMNLP 2024
MAPO: Advancing Multilingual Reasoning through Multilingual-Alignment-as-Preference Optimization
ACL 2024
Measuring Meaning Composition in the Human Brain with Composition Scores from Large Language Models
ACL 2024
Lost in the Source Language: How Large Language Models Evaluate the Quality of Machine Translation
ACL 2024
Question Translation Training for Better Multilingual Reasoning
ACL 2024
MultiSQL: A Schema-Integrated Context-Dependent Text2SQL Dataset with Diverse SQL Operations
ACL 2024
A Wolf in Sheep’s Clothing: Generalized Nested Jailbreak Prompts can Fool Large Language Models Easily
NAACL 2024
Only 5% Attention Is All You Need: Efficient Long-range Document-level Neural Machine Translation
IJCNLP 2023
Addressing Linguistic Bias through a Contrastive Analysis of Academic Writing in the NLP Domain
EMNLP 2023
BLEURT Has Universal Translations: An Analysis of Automatic Metrics by Minimum Risk Training
ACL 2023
Roles of Scaling and Instruction Tuning in Language Perception: Model vs. Human Attention
EMNLP 2023
Unify Word-level and Span-level Tasks: NJUNLP’s Participation for the WMT2023 Quality Estimation Shared Task
EMNLP 2023
Denoising Pre-training for Machine Translation Quality Estimation with Curriculum Learning
AAAI 2023
Only 5% Attention Is All You Need: Efficient Long-range Document-level Neural Machine Translation
AACL 2023
CoP: Factual Inconsistency Detection by Controlling the Preference
AAAI 2023
Local Interpretation of Transformer Based on Linear Decomposition
ACL 2023
INK: Injecting kNN Knowledge in Nearest Neighbor Machine Translation
ACL 2023
What Knowledge Is Needed? Towards Explainable Memory for kNN-MT Domain Adaptation
ACL 2023
Selective Knowledge Distillation for Non-Autoregressive Neural Machine Translation
AAAI 2023
Improved Pseudo Data for Machine Translation Quality Estimation with Constrained Beam Search
EMNLP 2023
IMTLab: An Open-Source Platform for Building, Evaluating, and Diagnosing Interactive Machine Translation Systems
EMNLP 2023
Probing Cross-modal Semantics Alignment Capability from the Textual Perspective
EMNLP 2022
Non-parametric Online Learning from Human Feedback for Neural Machine Translation
AAAI 2022
BiTIIMT: A Bilingual Text-infilling Method for Interactive Machine Translation
ACL 2022
latent-GLAT: Glancing at Latent Variables for Parallel Text Generation
ACL 2022
Rethinking Document-level Neural Machine Translation
ACL 2022
Towards Multi-label Unknown Intent Detection
COLING 2022
Alleviating the Inequality of Attention Heads for Neural Machine Translation
COLING 2022
Learning from Adjective-Noun Pairs: A Knowledge-enhanced Framework for Target-Oriented Multimodal Sentiment Classification
COLING 2022
Analyzing the Intensity of Complaints on Social Media
NAACL 2022
Helping the Weak Makes You Strong: Simple Multi-Task Learning Improves Non-Autoregressive Translators
EMNLP 2022
Structure-Unified M-Tree Coding Solver for Math Word Problem
EMNLP 2022
NJUNLP’s Participation for the WMT2022 Quality Estimation Shared Task
EMNLP 2022
CrossQE: HW-TSC 2022 Submission for the Quality Estimation Shared Task
EMNLP 2022
HW-TSC’s Participation at WMT 2021 Quality Estimation Shared Task
EMNLP 2021
When is Char Better Than Subword: A Systematic Study of Segmentation Algorithms for Neural Machine Translation
ACL 2021
Adaptive Nearest Neighbor Machine Translation
ACL 2021
Adaptive Nearest Neighbor Machine Translation
IJCNLP 2021
Automated Cross-prompt Scoring of Essay Traits
AAAI 2021
DirectQE: Direct Pretraining for Machine Translation Quality Estimation
AAAI 2021
When is Char Better Than Subword: A Systematic Study of Segmentation Algorithms for Neural Machine Translation
IJCNLP 2021
Energy-based Unknown Intent Detection with Data Manipulation
IJCNLP 2021
Non-Autoregressive Translation by Learning Target Categorical Codes
NAACL 2021
Energy-based Unknown Intent Detection with Data Manipulation
ACL 2021
Duplex Sequence-to-Sequence Learning for Reversible Machine Translation
NIPS 2021
Learning Kernel-Smoothed Machine Translation with Retrieved Examples
EMNLP 2021
Meta-LMTC: Meta-Learning for Large-Scale Multi-Label Text Classification
EMNLP 2021
Non-Parametric Unsupervised Domain Adaptation for Neural Machine Translation
EMNLP 2021
NJU’s submission to the WMT20 QE Shared Task
EMNLP 2020
Generating Diverse Translation by Manipulating Multi-Head Attention
AAAI 2020
GRET: Global Representation Enhanced Transformer
AAAI 2020
Acquiring Knowledge from Pre-Trained Model to Neural Machine Translation
AAAI 2020
Latent Opinions Transfer Network for Target-Oriented Opinion Words Extraction
AAAI 2020
Dialogue State Tracking with Explicit Slot Connection Modeling
ACL 2020
Explicit Semantic Decomposition for Definition Generation
ACL 2020
A Reinforced Generation of Adversarial Examples for Neural Machine Translation
ACL 2020
RPD: A Distance Function Between Word Embeddings
ACL 2020
A Simple and Effective Approach to Robust Unsupervised Bilingual Dictionary Induction
COLING 2020
Mirror-Generative Neural Machine Translation
ICLR 2020
Towards Making the Most of Context in Neural Machine Translation
IJCAI 2020
Fine-grained Knowledge Fusion for Sequence Labeling Domain Adaptation
EMNLP 2019
Correct-and-Memorize: Learning to Translate from Interactive Revisions
IJCAI 2019
Utilizing Non-Parallel Text for Style Transfer by Making Partial Comparisons
IJCAI 2019
Dynamic Past and Future for Neural Machine Translation
IJCNLP 2019
Fine-grained Knowledge Fusion for Sequence Labeling Domain Adaptation
IJCNLP 2019
Dynamic Past and Future for Neural Machine Translation
EMNLP 2019
Online Distilling from Checkpoints for Neural Machine Translation
NAACL 2019
Target-oriented Opinion Words Extraction with Target-fused Neural Sequence Labeling
NAACL 2019
Exploiting Noisy Data in Distant Supervision Relation Classification
NAACL 2019
Learning Representation Mapping for Relation Detection in Knowledge Base Question Answering
ACL 2019
Generating Sentences from Disentangled Syntactic and Semantic Spaces
ACL 2019
Unsupervised Bilingual Lexicon Induction via Latent Variable Models
EMNLP 2018
Combining Character and Word Information in Neural Machine Translation Using a Multi-Level Attention
NAACL 2018
Neural Machine Translation with Word Predictions
EMNLP 2017
Improved Neural Machine Translation with a Syntax-Aware Encoder and Decoder
ACL 2017
Word-Context Character Embeddings for Chinese Word Segmentation
EMNLP 2017
Top-Rank Enhanced Listwise Optimization for Statistical Machine Translation
CONLL 2017
Deep Matrix Factorization Models for Recommender Systems
IJCAI 2017
AGRA: An Analysis-Generation-Ranking Framework for Automatic Abbreviation from Paper Titles
IJCAI 2017
Chunk-Based Bi-Scale Decoder for Neural Machine Translation
ACL 2017
PRIMT: A Pick-Revise Framework for Interactive Machine Translation
NAACL 2016
A Search-Based Dynamic Reranking Model for Dependency Parsing
ACL 2016
Tree-State Based Rule Selection Models for Hierarchical Phrase-Based Machine Translation
IJCAI 2016
A Neural Probabilistic Structured-Prediction Model for Transition-Based Dependency Parsing
IJCNLP 2015
A Unified Framework for Jointly Learning Distributed Representations of Word and Attributes
ACML 2015
A Neural Probabilistic Structured-Prediction Model for Transition-Based Dependency Parsing
ACL 2015
Non-linear Learning for Statistical Machine Translation
ACL 2015
Graph-Based Collective Lexical Selection for Statistical Machine Translation
EMNLP 2015
Non-linear Learning for Statistical Machine Translation
IJCNLP 2015
Enhancing Statistical Machine Translation with Character Alignment
ACL 2012
Dealing with Spurious Ambiguity in Learning ITG-based Word Alignment
ACL 2011
Improving Word Alignment by Semi-Supervised Ensemble
CONLL 2010