Dawei Zhu
38 papers · 2020–2026 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
π Academic Marathon (5) π Conference Polyglot (8) π Interdisciplinary Bridge π§ Keyword Pioneer π Cross-Pollinator (9)
π
Cross-Pollinator
(9)
π
Renaissance Researcher
(8)
πΊοΈ
Taxonomy Completionist
(87)
π₯
Mega-Team
(82)
π€
Dynamic Duo
(13)
π₯
Unstoppable
(6)
β
The Questioner
(2)
π
Century Club
(36)
ποΈ
Keyword Collector
(184)
β‘
Prolific Year
(11)
Conferences
EMNLP (16)
ACL (8)
COLING (4)
NAACL (3)
AAAI (2)
EACL (2)
ICLR (2)
INTERSPEECH (1)
Top co-authors
Keywords
large language model
(11)
text classification
(4)
document-level translation
(4)
language model
(4)
translation quality
(3)
named entity recognition
(3)
model evaluation
(3)
few-shot learning
(3)
label noise
(3)
machine translation
(3)
distant supervision
(2)
information bottleneck
(2)
knowledge graph
(2)
noisy label
(2)
natural language processing
(2)
text generation
(2)
weakly supervised learning
(2)
noisy label learning
(2)
prompt engineering
(2)
low-resource language
(2)
Papers
DocLens: A Tool-Augmented Multi-Agent Framework for Long Visual Document Understanding
ACL 2026
What Does LLM Refinement Actually Improve? A Systematic Study on Document-Level Literary Translation
ACL 2026
Hierarchical Memory Organization for Wikipedia Generation
ACL 2025
LongAttn: Selecting Long-context Training Data via Token-level Attention
ACL 2025
From Calculation to Adjudication: Examining LLM Judges on Mathematical Reasoning Tasks
ACL 2025
PricingLogic: Evaluating LLMs Reasoning on Complex Tourism Pricing Tasks
EMNLP 2025
Same evaluation, more tokens: On the effect of input length for machine translation evaluation using Large Language Models
EMNLP 2025
Fine-Grained and Multi-Dimensional Metrics for Document-Level Machine Translation
NAACL 2025
WIKIGENBENCH:Exploring Full-length Wikipedia Generation under Real-World Scenario
COLING 2025
EERPD: Leveraging Emotion and Emotion Regulation for Improving Personality Detection
COLING 2025
InternLM-Law: An Open-Sourced Chinese Legal Large Language Model
COLING 2025
Findings of the WMT25 Terminology Translation Task: Terminology is Useful Especially for Good MTs
EMNLP 2025
More Tokens, Lower Precision: Towards the Optimal Token-Precision Trade-off in KV Cache Compression
EMNLP 2025
Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision
EMNLP 2025
AFRIDOC-MT: Document-level MT Corpus for African Languages
EMNLP 2025
MMTEB: Massive Multilingual Text Embedding Benchmark
ICLR 2025
Language models can learn implicit multi-hop reasoning, but only if they have lots of training data
EMNLP 2025
To Preserve or To Compress: An In-Depth Study of Connector Selection in Multimodal Large Language Models
EMNLP 2024
Large Language Models are not Fair Evaluators
ACL 2024
Fine-Tuning Large Language Models to Translate: Will a Touch of Noisy Data in Misaligned Languages Suffice?
EMNLP 2024
LongEmbed: Extending Embedding Models for Long Context Retrieval
EMNLP 2024
The Accuracy Paradox in RLHF: When Better Reward Models Donβt Yield Better Language Models
EMNLP 2024
LawBench: Benchmarking Legal Knowledge of Large Language Models
EMNLP 2024
Assessing βImplicitβ Retrieval Robustness of Large Language Models
EMNLP 2024
AgentBank: Towards Generalized LLM Agents via Fine-Tuning on 50000+ Interaction Trajectories
EMNLP 2024
PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training
ICLR 2024
CoUDA: Coherence Evaluation via Unified Data Augmentation
NAACL 2024
A Preference-driven Paradigm for Enhanced Translation with Large Language Models
NAACL 2024
GraphPrompt: Graph-Based Prompt Templates for Biomedical Synonym Prediction
AAAI 2023
Meta Self-Refinement for Robust Learning with Weak Supervision
EACL 2023
Weaker Than You Think: A Critical Look at Weakly Supervised Learning
ACL 2023
InfoCL: Alleviating Catastrophic Forgetting in Continual Text Classification from An Information Theoretic Perspective
EMNLP 2023
Is BERT Robust to Label Noise? A Study on Learning with Noisy Labels in Text Classification
ACL 2022
ConFiguRe: Exploring Discourse-level Chinese Figures of Speech
COLING 2022
ROXANNE Research Platform: Automate Criminal Investigations
INTERSPEECH 2021
Analysing the Noise Model Error for Realistic Noisy Label Data
AAAI 2021
Neural Data-to-Text Generation with LM-based Text Augmentation
EACL 2021
Transfer Learning and Distant Supervision for Multilingual Transformer Models: A Study on African Languages
EMNLP 2020