Yuhao Zhang

49 papers · 2017–2026 · 15 conferences · across top CS/AI conferences

Achievements

+14 more ↓

🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (17) 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (7) 🌍 Conference Polyglot (15)

🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (15) 🗺️ Taxonomy Completionist (17) 🤝 Dynamic Duo (14) 🏆 Grand Slam 🧬 Topic Evolution 🏆 Keyword Champion (2) 🚀 Conference Pioneer 🔥 Unstoppable (9) ⚡ Prolific Year (7) 🗃️ Keyword Collector (239) 💎 Century Club (47) ❓ The Questioner 📈 Trend Setter

Conferences

ACL (16) EMNLP (14) NAACL (4) AAAI (3) ICML (2) CONLL (1) EACL (1) ECCV (1) ICLR (1) IJCNLP (1) INTERSPEECH (1) MLHC (1) NIPS (1) NSDI (1) OSDI (1)

Top co-authors

Jingbo Zhu (14) Tong Xiao (14) Chen Xu (11) Peng Qi (9) Christopher D. Manning (8) Xiaoqian Liu (5) Zhiheng Huang (4) Bei Li (4) Lan Liu (4) William Yang Wang (4)

Keywords

speech translation (10) named entity recognition (5) transfer learning (5) dependency parsing (4) large language model (4) multi-task learning (4) automatic speech recognition (4) knowledge distillation (4) contrastive learning (3) information retrieval (3) relation extraction (3) speech recognition (3) end-to-end model (3) language model (3) radiology report (3) reinforcement learning (3) multimodal learning (2) text generation (2) text summarization (2) information extraction (2)

Papers

Atom-level Adaptive Receptive Fields: A Pruning-Based Encoder for 2D Molecular Graphs (Student Abstract) AAAI 2026 AFT-Tab: Adversarial Fine-Tuning for Tabular Data Synthesis with Long Text Columns ACL 2026 Leveraging Unit Language Guidance to Advance Speech Modeling in Textless Speech-to-Speech Translation ACL 2025 Achieving Wire-Latency Storage Systems by Exploiting Hardware ACKs NSDI 2025 Soundwave: Less is More for Speech-Text Alignment in LLMs ACL 2025 SPFT-SQL: Enhancing Large Language Model for Text-to-SQL Parsing by Self-Play Fine-Tuning EMNLP 2025 Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models ICLR 2025 Enhancing Speech Large Language Models with Prompt-Aware Mixture of Audio Encoders EMNLP 2025 Stripeless Data Placement for Erasure-Coded In-Memory Storage OSDI 2025 DSQG-Syn: Synthesizing High-quality Data for Text-to-SQL Parsing by Domain Specific Question Generation NAACL 2025 DragVideo: Interactive Drag-style Video Editing ECCV 2024 RAG-QA Arena: Evaluating Domain Robustness for Long-form Retrieval Augmented Question Answering EMNLP 2024 CodeFort: Robust Training for Code Generation Models EMNLP 2024 Dancing in Chains: Reconciling Instruction Following and Faithfulness in Language Models EMNLP 2024 Ensuring Safe and High-Quality Outputs: A Guideline Library Approach for Language Models NAACL 2024 Tokenization Consistency Matters for Generative Models on Extractive NLP Tasks EMNLP 2023 CTC-based Non-autoregressive Speech Translation ACL 2023 Generate then Select: Open-ended Visual Question Answering Guided by World Knowledge ACL 2023 RobustQA: Benchmarking the Robustness of Domain Adaptation for Open-Domain Question Answering ACL 2023 Improving Cross-task Generalization of Unified Table-to-text Models with Compositional Task Configurations ACL 2023 Bridging the Granularity Gap for Acoustic Modeling ACL 2023 The NiuTrans End-to-End Speech Translation System for IWSLT23 English-to-Chinese Offline Task ACL 2023 Improving End-to-End Speech Translation by Leveraging Auxiliary Speech and Text Data AAAI 2023 Information Magnitude Based Dynamic Sub-sampling for Speech-to-text INTERSPEECH 2023 Rethinking and Improving Multi-task Learning for End-to-end Speech Translation EMNLP 2023 A Query-Parallel Machine Reading Comprehension Framework for Low-resource NER EMNLP 2023 The NiuTrans’s Submission to the IWSLT22 English-to-Chinese Offline Speech Translation Task ACL 2022 A Contrastive Framework for Learning Sentence Representations from Pairwise and Triple-wise Perspective in Angular Space ACL 2022 Contrastive Learning of Medical Visual Representations from Paired Images and Text MLHC 2022 BagFlip: A Certified Defense Against Data Poisoning NIPS 2022 Overview of the MEDIQA 2021 Shared Task on Summarization in the Medical Domain NAACL 2021 Stacked Acoustic-and-Textual Encoding: Integrating the Pre-trained Models into Speech Translation Encoders ACL 2021 Do Syntax Trees Help Pre-trained Transformers Extract Information? EACL 2021 Certified Robustness to Programmable Transformations in LSTMs EMNLP 2021 Online Selection Problems against Constrained Adversary ICML 2021 Stacked Acoustic-and-Textual Encoding: Integrating the Pre-trained Models into Speech Translation Encoders IJCNLP 2021 Improving Factual Completeness and Consistency of Image-to-Text Radiology Report Generation NAACL 2021 Optimizing the Factual Correctness of a Summary: A Study of Summarizing Radiology Reports ACL 2020 Stanza: A Python Natural Language Processing Toolkit for Many Human Languages ACL 2020 Learning Architectures from an Extended Search Space for Language Modeling ACL 2020 The NiuTrans Machine Translation Systems for WMT20 EMNLP 2020 Stay Hungry, Stay Focused: Generating Informative and Specific Questions in Information-Seeking Conversations EMNLP 2020 Robustness to Programmable String Transformations via Augmented Abstract Training ICML 2020 The NiuTrans Machine Translation Systems for WMT19 ACL 2019 Multi-Perspective Relevance Matching with Hierarchical ConvNets for Social Media Search AAAI 2019 Universal Dependency Parsing from Scratch CONLL 2018 Learning to Summarize Radiology Findings EMNLP 2018 Graph Convolution over Pruned Dependency Trees Improves Relation Extraction EMNLP 2018 Position-aware Attention and Supervised Data Improve Slot Filling EMNLP 2017