Shaohan Huang
63 papers · 2017–2026 · 12 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+13 more ↓ Show less ↑
π Conference Polyglot (12) π§ Keyword Pioneer π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (13) π Academic Marathon (8)
πΊοΈ
Taxonomy Completionist
(13)
π§
Keyword Pioneer
π
Academic Marathon
(8)
π€
Dynamic Duo
(57)
π
Grand Slam
π¬
Deep Specialist
(20)
π§¬
Topic Evolution
π
Trend Setter
π
Conference Pioneer
β‘
Prolific Year
(15)
ποΈ
Keyword Collector
(231)
π
Century Club
(59)
π₯
Unstoppable
(9)
Conferences
ACL (21)
EMNLP (14)
NIPS (6)
AAAI (5)
ICLR (4)
IJCNLP (4)
COLING (3)
EACL (2)
AACL (1)
ICCV (1)
ICML (1)
JMLR (1)
Top co-authors
Research topics
Keywords
large language model
(9)
text generation
(7)
cross-lingual transfer
(7)
reinforcement learning
(5)
language model
(5)
multimodal large language model
(5)
cross-lingual language model
(5)
sentence embedding
(4)
knowledge distillation
(4)
multilingual model
(4)
masked language modeling
(3)
domain adaptation
(3)
contrastive learning
(3)
representation learning
(3)
human preference
(3)
zero-shot learning
(3)
mixture of expert
(3)
transformer architecture
(3)
machine translation
(3)
direct preference optimization
(3)
Papers
SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks
ACL 2026
Reasoning with Exploration: An Entropy Perspective
AAAI 2026
Towards Stable and Effective Reinforcement Learning for Mixture-of-Experts
ACL 2026
VFA: Empowering Multilingual MLLMs via Vision-Free Adaptation
ACL 2026
Context-DPO: Aligning Language Models for Context-Faithfulness
ACL 2025
GeAR: Generation Augmented Retrieval
ACL 2025
Rethinking DPO-style Diffusion Aligning Frameworks
ICCV 2025
BitNet: 1-bit Pre-training for Large Language Models
JMLR 2025
On Domain-Adaptive Post-Training for Multimodal Large Language Models
EMNLP 2025
NL2Lean: Translating Natural Language into Lean 4 through Multi-Aspect Reinforcement Learning
EMNLP 2025
Textual Aesthetics in Large Language Models
EMNLP 2025
HD-Eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition
ACL 2024
You Only Cache Once: Decoder-Decoder Architectures for Language Models
NIPS 2024
Multimodal Large Language Models Make Text-to-Image Generative Models Align Better
NIPS 2024
Multi-Head Mixture-of-Experts
NIPS 2024
Boosting Text-to-Video Generative Model with MLLMs Feedback
NIPS 2024
Text Diffusion with Reinforced Conditioning
AAAI 2024
Se2: Sequential Example Selection for In-Context Learning
ACL 2024
ResLoRA: Identity Residual Mapping in Low-Rank Adaption
ACL 2024
Calibrating LLM-Based Evaluator
COLING 2024
Instruction Pre-Training: Language Models are Supervised Multitask Learners
EMNLP 2024
Scaling Sentence Embeddings with Large Language Models
EMNLP 2024
Kosmos-G: Generating Images in Context with Multimodal Large Language Models
ICLR 2024
Mixture of LoRA Experts
ICLR 2024
Adapting Large Language Models via Reading Comprehension
ICLR 2024
Grounding Multimodal Large Language Models to the World
ICLR 2024
UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation
EMNLP 2023
A Length-Extrapolatable Transformer
ACL 2023
GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator
ACL 2023
Dual-Alignment Pre-training for Cross-lingual Sentence Embedding
ACL 2023
MoEC: Mixture of Expert Clusters
AAAI 2023
Magneto: A Foundation Transformer
ICML 2023
Language Is Not All You Need: Aligning Perception with Language Models
NIPS 2023
Pre-training Language Model as a Multi-perspective Course Learner
ACL 2023
Democratizing Reasoning Ability: Tailored Learning from Large Language Model
EMNLP 2023
Beyond English-Centric Bitexts for Better Multilingual Language Representation Learning
ACL 2023
On the Representation Collapse of Sparse Mixture of Experts
NIPS 2022
PromptBERT: Improving BERT Sentence Embeddings with Prompts
EMNLP 2022
CROP: Zero-shot Cross-lingual Named Entity Recognition with Multilingual Labeled Sequence Translation
EMNLP 2022
THE-X: Privacy-Preserving Transformer Inference with Homomorphic Encryption
ACL 2022
Snapshot-Guided Domain Adaptation for ELECTRA
EMNLP 2022
XLM-E: Cross-lingual Language Model Pre-training via ELECTRA
ACL 2022
Improving Pretrained Cross-Lingual Language Models via Self-Labeled Word Alignment
IJCNLP 2021
MiniLMv2: Multi-Head Self-Attention Relation Distillation for Compressing Pretrained Transformers
ACL 2021
Adapt-and-Distill: Developing Small, Fast and Effective Pretrained Language Models for Domains
ACL 2021
Adapt-and-Distill: Developing Small, Fast and Effective Pretrained Language Models for Domains
IJCNLP 2021
Pseudo-Label Guided Unsupervised Domain Adaptation of Contextual Embeddings
EACL 2021
mT6: Multilingual Pretrained Text-to-Text Transformer with Translation Pairs
EMNLP 2021
Allocating Large Vocabulary Capacity for Cross-Lingual Language Model Pre-Training
EMNLP 2021
Multilingual Machine Translation Systems from Microsoft for WMT21 Shared Task
EMNLP 2021
MiniLMv2: Multi-Head Self-Attention Relation Distillation for Compressing Pretrained Transformers
IJCNLP 2021
Improving Pretrained Cross-Lingual Language Models via Self-Labeled Word Alignment
ACL 2021
Consistency Regularization for Cross-Lingual Fine-Tuning
ACL 2021
Consistency Regularization for Cross-Lingual Fine-Tuning
IJCNLP 2021
Language Generation with Multi-Hop Reasoning on Commonsense Knowledge Graph
EMNLP 2020
DocBank: A Benchmark Dataset for Document Layout Analysis
COLING 2020
Unsupervised Fine-tuning for Text Clustering
COLING 2020
Generating Commonsense Explanation by Extracting Bridge Concepts from Reasoning Paths
AACL 2020
Dictionary-Guided Editing Networks for Paraphrase Generation
AAAI 2019
Response Generation by Context-Aware Prototype Editing
AAAI 2019
Neural Document Summarization by Jointly Learning to Score and Select Sentences
ACL 2018
SuperAgent: A Customer Service Chatbot for E-commerce Websites
ACL 2017
Learning to Generate Product Reviews from Attributes
EACL 2017