Hao Yang
191 papers · 2012–2026 · 18 conferences · across top CS/AI conferences
Achievements
🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (28) 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (7) 🌍 Conference Polyglot (18)
🌉 Interdisciplinary Bridge
🌍 Conference Polyglot (18)
🗺️ Taxonomy Completionist (28)
🏠 Conference Loyalist (60)
👥 Mega-Team (22)
🏆 Grand Slam
🔬 Deep Specialist (60)
🧬 Topic Evolution
🏆 Keyword Champion (14)
🤝 Dynamic Duo (56)
🗃️ Keyword Collector (71)
❓ The Questioner
💎 Century Club (187)
📈 Trend Setter
🚀 Conference Pioneer
🔥 Unstoppable (11)
⚡ Prolific Year (37)
Conferences
EMNLP (60)
ACL (33)
CVPR (21)
AAAI (17)
ICCV (9)
COLING (8)
INTERSPEECH (8)
NAACL (6)
IJCAI (6)
NIPS (4)
ECCV (4)
SEMEVAL (3)
ICLR (3)
ICML (2)
MICCAI (2)
AACL (2)
WACV (2)
ACML (1)
Research topics
Keywords
machine translation (45)
neural machine translation (34)
large language model (24)
automatic speech recognition (17)
quality estimation (14)
domain adaptation (14)
back translation (14)
transfer learning (10)
data augmentation (10)
ensemble learning (10)
transformer architecture (9)
knowledge distillation (9)
transformer model (8)
multilingual translation (7)
representation learning (7)
named entity recognition (7)
contrastive learning (6)
knowledge graph (6)
speech translation (6)
document-level translation (5)
Papers
CARE-Bench: A Benchmark of Diverse Client Simulations Guided by Expert Principles for Evaluating LLMs in Psychological Counseling
AAAI 2026
DAPointMamba: Domain Adaptive Point Mamba for Point Cloud Completion
AAAI 2026
PointDGRWKV: Generalizing RWKV-like Architecture to Unseen Domains for Point Cloud Classification
AAAI 2026
Learning Compact Video Representations for Efficient Long-form Video Understanding in Large Multimodal Models
WACV 2026
MAVERIX: Multimodal Audio-Visual Evaluation and Recognition IndeX
AAAI 2026
Look Beyond Feeling: Unveiling Latent Needs from Implicit Expressions for Proactive Emotional Support
EMNLP 2025
Invariant Deep Uplift Modeling for Incentive Assignment in Online Marketing via Probability of Necessity and Sufficiency
ICML 2025
Enhancing Numerical Prediction of MLLMs with Soft Labeling
ICCV 2025
UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer
ICCV 2025
ZeroStereo: Zero-shot Stereo Matching from Single Images
ICCV 2025
Met2Net: A Decoupled Two-Stage Spatio-Temporal Forecasting Model for Complex Meteorological Systems
ICCV 2025
Test-Time Adaptation on Noisy Data via Model-Pruning-Based Filtering and Flatness-Aware Entropy Minimization
AAAI 2025
SRDC: Semantics-based Ransomware Detection and Classification with LLM-assisted Pre-training
AAAI 2025
Function-to-Style Guidance of LLMs for Code Translation
ICML 2025
PointDGMamba: Domain Generalization of Point Cloud Classification via Generalized State Space Model
AAAI 2025
Reshaping Representation Space to Balance the Safety and Over-rejection in Large Audio Language Models
EMNLP 2025
Enhancing Speech Large Language Models with Prompt-Aware Mixture of Audio Encoders
EMNLP 2025
Generative Annotation for ASR Named Entity Correction
EMNLP 2025
VQA-Augmented Machine Translation with Cross-Modal Contrastive Learning
EMNLP 2025
Imagination and Contemplation: A Balanced Framework for Semantic-Augmented Multimodal Machine Translation
EMNLP 2025
M-Ped: Multi-Prompt Ensemble Decoding for Large Language Models
EMNLP 2025
An Evaluation Resource for Grounding Translation Errors
EMNLP 2025
HW-TSC’s Submissions to the WMT 2025 Segment-level Quality Score Prediction Task
EMNLP 2025
Alleviating Distribution Shift in Synthetic Data for Machine Translation Quality Estimation
ACL 2025
Two Intermediate Translations Are Better Than One: Fine-tuning LLMs for Document-level Translation Refinement
ACL 2025
Basic Reading Distillation
ACL 2025
Combining the Best of Both Worlds: A Method for Hybrid NMT and LLM Translation
ACL 2025
TableDreamer: Progressive and Weakness-guided Data Synthesis from Scratch for Table Instruction Tuning
ACL 2025
Multimodal Machine Translation with Text-Image In-depth Questioning
ACL 2025
DoCIA: An Online Document-Level Context Incorporation Agent for Speech Translation
ACL 2025
From Observation to Understanding: Front-Door Adjustments with Uncertainty Calibration for Enhancing Egocentric Reasoning in LVLMs
ACL 2025
YNU-HPCC at SemEval-2025 Task 11: Bridging the Gap in Text-Based Emotion Using Multiple Prediction Headers
ACL 2025
MLLM-LLaVA-FL: Multimodal Large Language Model Assisted Federated Learning
WACV 2025
Enhancing Large Language Models for Document-Level Translation Post-Editing Using Monolingual Data
COLING 2025
HW-TSC at Multilingual Counterspeech Generation
COLING 2025
YNU-HPCC at SemEval-2025 Task 11: Bridging the Gap in Text-Based Emotion Using Multiple Prediction Headers
SEMEVAL 2025
Stephanie: Step-by-Step Dialogues for Mimicking Human Interactions in Social Conversations
NAACL 2025
Audio Is the Achilles’ Heel: Red Teaming Audio Large Multimodal Models
NAACL 2025
A Diffusion-Driven Temporal Super-Resolution and Spatial Consistency Enhancement Framework for 4D MRI imaging
MICCAI 2025
End-to-End Learnable Psychiatric Scale Guided Risky Post Screening for Depression Detection on Social Media
EMNLP 2025
Taming Text-to-Image Synthesis for Novices: User-centric Prompt Generation via Multi-turn Guidance
EMNLP 2025
Scaling up Image Segmentation across Data and Tasks
CVPR 2025
Goku: Flow Based Video Generative Foundation Models
CVPR 2025
HW-TSC 2024 Submission for the SemEval-2024 Task 1: Semantic Textual Relatedness (STR)
SEMEVAL 2024
Language-driven All-in-one Adverse Weather Removal
CVPR 2024
LDP: Language-driven Dual-Pixel Image Defocus Deblurring Network
CVPR 2024
HW-TSC 2024 Submission for the SemEval-2024 Task 1: Semantic Textual Relatedness (STR)
NAACL 2024
A Novel Paradigm Boosting Translation Capabilities of Large Language Models
NAACL 2024
THRONE: An Object-based Hallucination Benchmark for the Free-form Generations of Large Vision-Language Models
CVPR 2024
Swin-UMamba: Mamba-based UNet with ImageNet-based pretraining
MICCAI 2024
RASU: Retrieval Augmented Speech Understanding through Generative Modeling
INTERSPEECH 2024
AvatarVerse: High-Quality & Stable 3D Avatar Creation from Text and Pose
AAAI 2024
Moderate Message Passing Improves Calibration: A Universal Way to Mitigate Confidence Bias in Graph Neural Networks
AAAI 2024
Translate Meanings, Not Just Words: IdiomKB’s Role in Optimizing Idiomatic Translation with Language Models
AAAI 2024
Multilingual Transfer and Domain Adaptation for Low-Resource Languages of Spain
EMNLP 2024
CB-Whisper: Contextual Biasing Whisper Using Open-Vocabulary Keyword-Spotting
COLING 2024
CHisIEC: An Information Extraction Corpus for Ancient Chinese History
COLING 2024
An End-to-End Speech Summarization Using Large Language Model
INTERSPEECH 2024
Evaluation Dataset for Lexical Translation Consistency in Chinese-to-English Document-level Translation
COLING 2024
Using Large Language Model for End-to-End Chinese ASR and NER
INTERSPEECH 2024
A Multitask Training Approach to Enhance Whisper with Open-Vocabulary Keyword Spotting
INTERSPEECH 2024
An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation
ECCV 2024
Submodular-based In-context Example Selection for LLMs-based Machine Translation
COLING 2024
Speaker-Smoothed kNN Speaker Adaptation for End-to-End ASR
INTERSPEECH 2024
Context-aware and Style-related Incremental Decoding Framework for Discourse-Level Literary Translation
EMNLP 2024
Exploring the Traditional NMT Model and Large Language Model for Chat Translation
EMNLP 2024
Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation
EMNLP 2024
Cross-Domain Audio Deepfake Detection: Dataset and Analysis
EMNLP 2024
Towards Probing Speech-Specific Risks in Large Multimodal Models: A Taxonomy, Benchmark, and Insights
EMNLP 2024
DeMPT: Decoding-enhanced Multi-phase Prompt Tuning for Making LLMs Be Better Context-aware Translators
EMNLP 2024
Choose the Final Translation from NMT and LLM Hypotheses Using MBR Decoding: HW-TSC’s Submission to the WMT24 General MT Shared Task
EMNLP 2024
HW-TSC 2024 Submission for the Quality Estimation Shared Task
EMNLP 2024
HW-TSC’s Participation in the WMT 2024 QEAPE Task
EMNLP 2024
Machine Translation Advancements of Low-Resource Indian Languages by Transfer Learning
EMNLP 2024
Enhancing Hyperbolic Knowledge Graph Embeddings via Lorentz Transformations
ACL 2024
Pause-Aware Automatic Dubbing using LLM and Voice Cloning
ACL 2024
Improving the Quality of IWLST 2024 Cascade Offline Speech Translation and Speech-to-Speech Translation via Translation Hypothesis Ensembling with NMT models and Large Language Models
ACL 2024
HW-TSC’s Speech to Text Translation System for IWSLT 2024 in Indic track
ACL 2024
HW-TSC’s Submissions To the IWSLT2024 Low-resource Speech Translation Tasks
ACL 2024
HW-TSC’s Simultaneous Speech Translation System for IWSLT 2024
ACL 2024
HW-TSC’s submission to the IWSLT 2024 Subtitling track
ACL 2024
HW-TSC at TextGraphs-17 Shared Task: Enhancing Inference Capabilities of LLMs with Knowledge Graphs
ACL 2024
The Path to Continuous Domain Adaptation Improvements by HW-TSC for the WMT23 Biomedical Translation Shared Task
EMNLP 2023
Empowering a Metric with LLM-assisted Named Entity Annotation: HW-TSC’s Submission to the WMT23 Metrics Shared Task
EMNLP 2023
Unify Word-level and Span-level Tasks: NJUNLP’s Participation for the WMT2023 Quality Estimation Shared Task
EMNLP 2023
HW-TSC 2023 Submission for the Quality Estimation Shared Task
EMNLP 2023
HW-TSC’s Participation in the WMT 2023 Automatic Post Editing Shared Task
EMNLP 2023
Local and Global Logit Adjustments for Long-Tailed Learning
ICCV 2023
InterFormer: Real-time Interactive Image Segmentation
ICCV 2023
Length-Aware NMT and Adaptive Duration for Automatic Dubbing
ACL 2023
Improving Neural Machine Translation Formality Control with Domain Adaptation and Reranking-based Transductive Learning
ACL 2023
HW-TSC at IWSLT2023: Break the Quality Ceiling of Offline Track via Pre-Training and Domain Adaptation
ACL 2023
The HW-TSC’s Speech-to-Speech Translation System for IWSLT 2023
ACL 2023
The HW-TSC’s Simultaneous Speech-to-Text Translation System for IWSLT 2023 Evaluation
ACL 2023
The HW-TSC’s Simultaneous Speech-to-Speech Translation System for IWSLT 2023 Evaluation
ACL 2023
Denoising Pre-training for Machine Translation Quality Estimation with Curriculum Learning
AAAI 2023
ContraNeRF: Generalizable Neural Radiance Fields for Synthetic-to-Real Novel View Synthesis via Contrastive Learning
CVPR 2023
MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining
CVPR 2023
A Meta-Learning Approach to Predicting Performance and Data Requirements
CVPR 2023
Guided Recommendation for Model Fine-Tuning
CVPR 2023
Investigating Pre-trained Audio Encoders in the Low-Resource Condition
INTERSPEECH 2023
WhiSLU: End-to-End Spoken Language Understanding with Whisper
INTERSPEECH 2023
Stochastic Feature Averaging for Learning with Long-Tailed Noisy Labels
IJCAI 2023
Probabilistic Masked Attention Networks for Explainable Sequential Recommendation
IJCAI 2023
From Trainable Negative Depth to Edge Heterophily in Graphs
NIPS 2023
Your representations are in the network: composable and parallel adaptation for large scale models
NIPS 2023
PRED: Pre-training via Semantic Rendering on LiDAR Point Clouds
NIPS 2023
FreeEnricher: Enriching Face Landmarks without Additional Cost
AAAI 2023
SwiftAvatar: Efficient Auto-Creation of Parameterized Stylized Character on Arbitrary Avatar Engines
AAAI 2023
Text Style Transfer Back-Translation
ACL 2023
Prompt Tuning for Unified Multimodal Pretrained Models
ACL 2023
Lexical Translation Inconsistency-Aware Document-Level Translation Repair
ACL 2023
Improved Pseudo Data for Machine Translation Quality Estimation with Constrained Beam Search
EMNLP 2023
SmartSpanNER: Making SpanNER Robust in Low Resource Scenarios
EMNLP 2023
Chain-of-Thought Reasoning in Tabular Language Models
EMNLP 2023
INarIG: Iterative Non-autoregressive Instruct Generation Model For Word-Level Auto Completion
EMNLP 2023
Treating General MT Shared Task as a Multi-Domain Adaptation Problem: HW-TSC’s Submission to the WMT23 General MT Shared Task
EMNLP 2023
Multifaceted Challenge Set for Evaluating Machine Translation Performance
EMNLP 2023
HW-TSC’s Submissions to the WMT23 Discourse-Level Literary Translation Shared Task
EMNLP 2023
General Facial Representation Learning in a Visual-Linguistic Manner
CVPR 2022
Noninvasive Lung Cancer Early Detection via Deep Methylation Representation Learning
AAAI 2022
Exploring Entity Interactions for Few-Shot Relation Learning (Student Abstract)
AAAI 2022
Part Represents Whole: Improving the Evaluation of Machine Translation System Using Entropy Enhanced Metrics
AACL 2022
Sentiment Word Aware Multimodal Refinement for Multimodal Sentiment Analysis with ASR Errors
ACL 2022
Capture Human Disagreement Distributions by Calibrated Networks for Natural Language Inference
ACL 2022
The HW-TSC’s Offline Speech Translation System for IWSLT 2022 Evaluation
ACL 2022
The HW-TSC’s Simultaneous Speech Translation System for IWSLT 2022 Evaluation
ACL 2022
The HW-TSC’s Speech to Speech Translation System for IWSLT 2022 Evaluation
ACL 2022
HW-TSC’s Participation in the IWSLT 2022 Isometric Spoken Language Translation
ACL 2022
HwTscSU’s Submissions on WAT 2022 Shared Task
COLING 2022
Omni-DETR: Omni-Supervised Object Detection With Transformers
CVPR 2022
Large-Scale Pre-Training for Person Re-Identification With Noisy Labels
CVPR 2022
Rethinking Few-Shot Object Detection on a Multi-Domain Benchmark
ECCV 2022
Real-Time Neural Character Rendering with Pose-Guided Multiplane Images
ECCV 2022
Face-Sensitive Image-to-Emotional-Text Cross-modal Translation for Multimodal Aspect-based Sentiment Analysis
EMNLP 2022
Modeling Consistency Preference via Lexical Chains for Document-level Neural Machine Translation
EMNLP 2022
Self-supervised Rewiring of Pre-trained Speech Encoders: Towards Faster Fine-tuning with Less Labels in Speech Processing
EMNLP 2022
RedApt: An Adaptor for wav2vec 2 Encoding Faster and Smaller Speech Translation without Quality Compromise
EMNLP 2022
HW-TSC’s Submissions to the WMT 2022 General Machine Translation Shared Task
EMNLP 2022
Exploring Robustness of Machine Translation Metrics: A Study of Twenty-Two Automatic Metrics in the WMT22 Metric Task
EMNLP 2022
Partial Could Be Better than Whole. HW-TSC 2022 Submission for the Metrics Shared Task
EMNLP 2022
NJUNLP’s Participation for the WMT2022 Quality Estimation Shared Task
EMNLP 2022
CrossQE: HW-TSC 2022 Submission for the Quality Estimation Shared Task
EMNLP 2022
HW-TSC’s Submission for the WMT22 Efficiency Task
EMNLP 2022
HW-TSC Translation Systems for the WMT22 Biomedical Translation Task
EMNLP 2022
HW-TSC Translation Systems for the WMT22 Chat Translation Task
EMNLP 2022
HW-TSC Systems for WMT22 Very Low Resource Supervised MT Task
EMNLP 2022
HW-TSC’s Submissions to the WMT22 Word-Level Auto Completion Task
EMNLP 2022
Normalization of Language Embeddings for Cross-Lingual Alignment
ICLR 2022
M-Adapter: Modality Adaptation for End-to-End Speech-to-Text Translation
INTERSPEECH 2022
Neighbors Are Not Strangers: Improving Non-Autoregressive Translation under Low-Frequency Lexical Constraints
NAACL 2022
HW-TSC at SemEval-2022 Task 7: Ensemble Model Based on Pretrained Models for Identifying Plausible Clarifications
NAACL 2022
HW-TSC at SemEval-2022 Task 7: Ensemble Model Based on Pretrained Models for Identifying Plausible Clarifications
SEMEVAL 2022
HW-TSC’s Submissions to the WMT21 Biomedical Translation Task
EMNLP 2021
HW-TSC’s Participation in the WMT 2021 Large-Scale Multilingual Translation Task
EMNLP 2021
On Position Embeddings in BERT
ICLR 2021
HW-TSC’s Participation in the WMT 2021 Triangular MT Shared Task
EMNLP 2021
HW-TSC’s Participation in the WMT 2021 News Translation Shared Task
EMNLP 2021
Online Credit Payment Fraud Detection via Structure-Aware Hierarchical Recurrent Neural Network
IJCAI 2021
How Length Prediction Influence the Performance of Non-Autoregressive Translation?
EMNLP 2021
Delayed Propagation Transformer: A Universal Computation Engine towards Practical Control in Cyber-Physical Systems
NIPS 2021
Style-Based Point Generator With Adversarial Rendering for Point Cloud Completion
CVPR 2021
Unsupervised Pre-Training for Person Re-Identification
CVPR 2021
Beating Attackers At Their Own Games: Adversarial Example Detection Using Adversarial Gradient Directions
AAAI 2021
ADNet: Leveraging Error-Bias Towards Normal Direction in Face Alignment
ICCV 2021
Adversarial Example Detection Using Latent Neighborhood Graph
ICCV 2021
HW-TSC’s Participation at WMT 2021 Quality Estimation Shared Task
EMNLP 2021
HW-TSC’s Participation in the WMT 2021 Efficiency Shared Task
EMNLP 2021
Huawei’s Submissions to the WMT20 Biomedical Translation Task
EMNLP 2020
HW-TSC’s Participation in the WMT 2020 News Translation Shared Task
EMNLP 2020
Modelling Long-distance Node Relations for KBQA with Global Dynamic Graph
COLING 2020
Advancing High Fidelity Identity Swapping for Forgery Detection
CVPR 2020
HW-TSC’s Participation in the WAT 2020 Indic Languages Multilingual Task
AACL 2020
Rethinking the Hyperparameters for Fine-tuning
ICLR 2020
Face X-Ray for More General Face Forgery Detection
CVPR 2020
HGMAN: Multi-Hop and Multi-Answer Question Answering Based on Heterogeneous Knowledge Graph (Student Abstract)
AAAI 2020
HW-TSC’s Participation at WMT 2020 Quality Estimation Shared Task
EMNLP 2020
The HW-TSC Video Speech Translation System at IWSLT 2020
ACL 2020
HW-TSC’s Participation at WMT 2020 Automatic Post Editing Shared Task
EMNLP 2020
Detecting 11K Classes: Large Scale Object Detection Without Fine-Grained Bounding Boxes
ICCV 2019
The Pupil Has Become the Master: Teacher-Student Model-Based Word Embedding Distillation with Ensemble Learning
IJCAI 2019
Face Parsing With RoI Tanh-Warping
CVPR 2019
Mask-Guided Portrait Editing With Conditional GANs
CVPR 2019
Position Focused Attention Network for Image-Text Matching
IJCAI 2019
An End-to-End Multi-task Learning Model for Fact Checking
EMNLP 2018
Zero-Annotation Object Detection with Web Knowledge Transfer
ECCV 2018
MIML-FCN+: Multi-Instance Multi-Label Learning via Fully Convolutional Networks With Privileged Information
CVPR 2017
Exploit Bounding Box Annotations for Multi-Label Object Recognition
CVPR 2016
Efficient 3D Room Shape Recovery From a Single Panorama
CVPR 2016
Reduced Heteroscedasticity Linear Regression for Nyström Approximation
IJCAI 2013
Practical Large Scale Classification with Additive Kernels
ACML 2012