Yuhao Zhang
49 papers · 2017–2026 · 15 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+14 more ↓ Show less ↑
π§ Keyword Pioneer πΊοΈ Taxonomy Completionist (17) π Interdisciplinary Bridge π Renaissance Researcher (7) π Conference Polyglot (15)
π
Interdisciplinary Bridge
π
Conference Polyglot
(15)
πΊοΈ
Taxonomy Completionist
(17)
π€
Dynamic Duo
(14)
π
Grand Slam
π§¬
Topic Evolution
π
Keyword Champion
(2)
π
Conference Pioneer
π₯
Unstoppable
(9)
β‘
Prolific Year
(7)
ποΈ
Keyword Collector
(239)
π
Century Club
(47)
β
The Questioner
π
Trend Setter
Conferences
ACL (16)
EMNLP (14)
NAACL (4)
AAAI (3)
ICML (2)
CONLL (1)
EACL (1)
ECCV (1)
ICLR (1)
IJCNLP (1)
INTERSPEECH (1)
MLHC (1)
NIPS (1)
NSDI (1)
OSDI (1)
Top co-authors
Keywords
speech translation
(10)
named entity recognition
(5)
transfer learning
(5)
dependency parsing
(4)
large language model
(4)
multi-task learning
(4)
automatic speech recognition
(4)
knowledge distillation
(4)
contrastive learning
(3)
information retrieval
(3)
relation extraction
(3)
speech recognition
(3)
end-to-end model
(3)
language model
(3)
radiology report
(3)
reinforcement learning
(3)
multimodal learning
(2)
text generation
(2)
text summarization
(2)
information extraction
(2)
Papers
Atom-level Adaptive Receptive Fields: A Pruning-Based Encoder for 2D Molecular Graphs (Student Abstract)
AAAI 2026
AFT-Tab: Adversarial Fine-Tuning for Tabular Data Synthesis with Long Text Columns
ACL 2026
Leveraging Unit Language Guidance to Advance Speech Modeling in Textless Speech-to-Speech Translation
ACL 2025
Achieving Wire-Latency Storage Systems by Exploiting Hardware ACKs
NSDI 2025
Soundwave: Less is More for Speech-Text Alignment in LLMs
ACL 2025
SPFT-SQL: Enhancing Large Language Model for Text-to-SQL Parsing by Self-Play Fine-Tuning
EMNLP 2025
Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models
ICLR 2025
Enhancing Speech Large Language Models with Prompt-Aware Mixture of Audio Encoders
EMNLP 2025
Stripeless Data Placement for Erasure-Coded In-Memory Storage
OSDI 2025
DSQG-Syn: Synthesizing High-quality Data for Text-to-SQL Parsing by Domain Specific Question Generation
NAACL 2025
DragVideo: Interactive Drag-style Video Editing
ECCV 2024
RAG-QA Arena: Evaluating Domain Robustness for Long-form Retrieval Augmented Question Answering
EMNLP 2024
CodeFort: Robust Training for Code Generation Models
EMNLP 2024
Dancing in Chains: Reconciling Instruction Following and Faithfulness in Language Models
EMNLP 2024
Ensuring Safe and High-Quality Outputs: A Guideline Library Approach for Language Models
NAACL 2024
Tokenization Consistency Matters for Generative Models on Extractive NLP Tasks
EMNLP 2023
CTC-based Non-autoregressive Speech Translation
ACL 2023
Generate then Select: Open-ended Visual Question Answering Guided by World Knowledge
ACL 2023
RobustQA: Benchmarking the Robustness of Domain Adaptation for Open-Domain Question Answering
ACL 2023
Improving Cross-task Generalization of Unified Table-to-text Models with Compositional Task Configurations
ACL 2023
Bridging the Granularity Gap for Acoustic Modeling
ACL 2023
The NiuTrans End-to-End Speech Translation System for IWSLT23 English-to-Chinese Offline Task
ACL 2023
Improving End-to-End Speech Translation by Leveraging Auxiliary Speech and Text Data
AAAI 2023
Information Magnitude Based Dynamic Sub-sampling for Speech-to-text
INTERSPEECH 2023
Rethinking and Improving Multi-task Learning for End-to-end Speech Translation
EMNLP 2023
A Query-Parallel Machine Reading Comprehension Framework for Low-resource NER
EMNLP 2023
The NiuTransβs Submission to the IWSLT22 English-to-Chinese Offline Speech Translation Task
ACL 2022
A Contrastive Framework for Learning Sentence Representations from Pairwise and Triple-wise Perspective in Angular Space
ACL 2022
Contrastive Learning of Medical Visual Representations from Paired Images and Text
MLHC 2022
BagFlip: A Certified Defense Against Data Poisoning
NIPS 2022
Overview of the MEDIQA 2021 Shared Task on Summarization in the Medical Domain
NAACL 2021
Stacked Acoustic-and-Textual Encoding: Integrating the Pre-trained Models into Speech Translation Encoders
ACL 2021
Do Syntax Trees Help Pre-trained Transformers Extract Information?
EACL 2021
Certified Robustness to Programmable Transformations in LSTMs
EMNLP 2021
Online Selection Problems against Constrained Adversary
ICML 2021
Stacked Acoustic-and-Textual Encoding: Integrating the Pre-trained Models into Speech Translation Encoders
IJCNLP 2021
Improving Factual Completeness and Consistency of Image-to-Text Radiology Report Generation
NAACL 2021
Optimizing the Factual Correctness of a Summary: A Study of Summarizing Radiology Reports
ACL 2020
Stanza: A Python Natural Language Processing Toolkit for Many Human Languages
ACL 2020
Learning Architectures from an Extended Search Space for Language Modeling
ACL 2020
The NiuTrans Machine Translation Systems for WMT20
EMNLP 2020
Stay Hungry, Stay Focused: Generating Informative and Specific Questions in Information-Seeking Conversations
EMNLP 2020
Robustness to Programmable String Transformations via Augmented Abstract Training
ICML 2020
The NiuTrans Machine Translation Systems for WMT19
ACL 2019
Multi-Perspective Relevance Matching with Hierarchical ConvNets for Social Media Search
AAAI 2019
Universal Dependency Parsing from Scratch
CONLL 2018
Learning to Summarize Radiology Findings
EMNLP 2018
Graph Convolution over Pruned Dependency Trees Improves Relation Extraction
EMNLP 2018
Position-aware Attention and Supervised Data Improve Slot Filling
EMNLP 2017