Xinyu Hu
21 papers · 2023–2026 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+7 more ↓ Show less ↑
π Cross-Pollinator (14) π Conference Polyglot (5) π Interdisciplinary Bridge π§ Keyword Pioneer π Renaissance Researcher (8)
πΊοΈ
Taxonomy Completionist
(51)
π
Interdisciplinary Bridge
π€
Dynamic Duo
(13)
β
The Questioner
β‘
Prolific Year
(6)
ποΈ
Keyword Collector
(89)
π
Century Club
(18)
Conferences
ACL (9)
EMNLP (5)
NAACL (3)
ICLR (2)
AAAI (1)
COLING (1)
Top co-authors
Keywords
large language model
(9)
natural language generation
(6)
evaluation metric
(3)
automatic evaluation
(2)
retrieval-augmented generation
(2)
hallucination detection
(2)
elo rating
(1)
data augmentation
(1)
mathematical reasoning
(1)
llm evaluation
(1)
synthetic data generation
(1)
evaluation framework
(1)
noisy channel model
(1)
monte carlo tree search
(1)
preference alignment
(1)
multimodal large language model
(1)
foundation model
(1)
text generation
(1)
evaluation benchmark
(1)
hidden state
(1)
Papers
SCOPE: Intrinsic Semantic Space Control for Mitigating Copyright Infringement in LLMs
AAAI 2026
HAD: HAllucination Detection Language Models Based on a Comprehensive Hallucination Taxonomy
ACL 2026
LEDOM: Reverse Language Model
ACL 2026
Evaluating Self-Generated Documents for Enhancing Retrieval-Augmented Generation with Large Language Models
NAACL 2025
Re-evaluating Automatic LLM System Ranking for Alignment with Human Preference
NAACL 2025
STORM-BORN: A Challenging Mathematical Derivations Dataset Curated via a Human-in-the-Loop Multi-Agent Framework
ACL 2025
ICR Probe: Tracking Hidden State Dynamics for Reliable Hallucination Detection in LLMs
ACL 2025
A Dual-Perspective NLG Meta-Evaluation Framework with Automatic Benchmark and Better Interpretability
ACL 2025
GRNFormer: A Biologically-Guided Framework for Integrating Gene Regulatory Networks into RNA Foundation Models
ACL 2025
MC-MKE: A Fine-Grained Multimodal Knowledge Editing Benchmark Emphasizing Modality Consistency
ACL 2025
DAMON: A Dialogue-Aware MCTS Framework for Jailbreaking Large Language Models
EMNLP 2025
Towards A βNovelβ Benchmark: Evaluating Literary Fiction with Large Language Models
ACL 2025
Analyzing and Evaluating Correlation Measures in NLG Meta-Evaluation
NAACL 2025
Themis: A Reference-free NLG Evaluation Language Model with Flexibility and Interpretability
EMNLP 2024
Error-Robust Retrieval for Chinese Spelling Check
COLING 2024
Are LLM-based Evaluators Confusing NLG Quality Criteria?
ACL 2024
Task Oriented In-Domain Data Augmentation
EMNLP 2024
Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing
ICLR 2024
Solving Inverse Problems with Latent Diffusion Models via Hard Data Consistency
ICLR 2024
Exploring Context-Aware Evaluation Metrics for Machine Translation
EMNLP 2023
Exploring Discourse Structure in Document-level Machine Translation
EMNLP 2023