Xiangru Tang
30 papers · 2020–2025 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
π Renaissance Researcher (6) π Interdisciplinary Bridge π Academic Marathon (5) π Conference Polyglot (6) πΊοΈ Taxonomy Completionist (64)
π
Academic Marathon
(5)
πΊοΈ
Taxonomy Completionist
(64)
π
Cross-Pollinator
(15)
π₯
Mega-Team
(32)
π€
Dynamic Duo
(12)
π₯
Unstoppable
(6)
π
Century Club
(30)
ποΈ
Keyword Collector
(113)
β‘
Prolific Year
(6)
β
The Questioner
Conferences
EMNLP (11)
ACL (8)
NAACL (5)
ICLR (4)
COLING (1)
SEMEVAL (1)
Top co-authors
Keywords
large language model
(12)
benchmark evaluation
(5)
retrieval-augmented generation
(3)
factual consistency
(3)
prompt engineering
(3)
question answering
(3)
table-to-text generation
(3)
few-shot learning
(3)
in-context learning
(2)
multi-task learning
(2)
text classification
(2)
commonsense reasoning
(2)
dialogue summarization
(2)
medical reasoning
(2)
natural language generation
(2)
zero-shot learning
(2)
data contamination
(2)
claim verification
(1)
adversarial robustness
(1)
self-supervised learning
(1)
Papers
OpenHands: An Open Platform for AI Software Developers as Generalist Agents
ICLR 2025
OAgents: An Empirical Study of Building Effective Agents
EMNLP 2025
Self-Supervised Prompt Optimization
EMNLP 2025
Improving Context Fidelity via Native Retrieval-Augmented Reasoning
EMNLP 2025
Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards
EMNLP 2025
ChemAgent: Self-updating Memories in Large Language Models Improves Chemical Reasoning
ICLR 2025
FinDVer: Explainable Claim Verification over Long and Hybrid-content Financial Documents
EMNLP 2024
DocMath-Eval: Evaluating Math Reasoning Capabilities of LLMs in Understanding Long and Specialized Documents
ACL 2024
MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning
ACL 2024
Unveiling the Spectrum of Data Contamination in Language Model: A Survey from Detection to Remediation
ACL 2024
Struc-Bench: Are Large Language Models Good at Generating Complex Structured Tabular Data?
NAACL 2024
Investigating Data Contamination in Modern Benchmarks for Large Language Models
NAACL 2024
OpenT2T: An Open-Source Toolkit for Table-to-Text Generation
EMNLP 2024
MIMIR: A Customizable Agent Tuning Platform for Enhanced Scientific Applications
EMNLP 2024
PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes
EMNLP 2024
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs
ICLR 2024
OctoPack: Instruction Tuning Code Large Language Models
ICLR 2024
QTSumm: Query-Focused Summarization over Tabular Data
EMNLP 2023
RWKV: Reinventing RNNs for the Transformer Era
EMNLP 2023
GersteinLab at MEDIQA-Chat 2023: Clinical Note Summarization from Doctor-Patient Conversations through Fine-tuning and In-context Learning
ACL 2023
Aligning Factual Consistency for Clinical Studies Summarization through Reinforcement Learning
ACL 2023
Crosslingual Generalization through Multitask Finetuning
ACL 2023
RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations
ACL 2023
Investigating Table-to-Text Generation Capabilities of Large Language Models in Real-World Information Seeking Scenarios
EMNLP 2023
PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts
ACL 2022
CONFIT: Toward Faithful Dialogue Summarization with Linguistically-Informed Contrastive Fine-tuning
NAACL 2022
Investigating Crowdsourcing Protocols for Evaluating the Factual Consistency of Summaries
NAACL 2022
DART: Open-Domain Structured Data Record to Text Generation
NAACL 2021
CUHK at SemEval-2020 Task 4: CommonSense Explanation, Reasoning and Prediction with Multi-task Learning
COLING 2020
CUHK at SemEval-2020 Task 4: CommonSense Explanation, Reasoning and Prediction with Multi-task Learning
SEMEVAL 2020