Xiangru Tang

30 papers · 2020–2025 · 6 conferences · across top CS/AI conferences

Achievements

+10 more ↓

🌈 Renaissance Researcher (6) 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (5) 🌍 Conference Polyglot (6) 🗺️ Taxonomy Completionist (64)

🏃 Academic Marathon (5) 🗺️ Taxonomy Completionist (64) 🐝 Cross-Pollinator (15) 👥 Mega-Team (32) 🤝 Dynamic Duo (12) 🔥 Unstoppable (6) 💎 Century Club (30) 🗃️ Keyword Collector (113) ⚡ Prolific Year (6) ❓ The Questioner

Conferences

EMNLP (11) ACL (8) NAACL (5) ICLR (4) COLING (1) SEMEVAL (1)

Top co-authors

Yilun Zhao (12) Arman Cohan (12) Mark Gerstein (9) Dragomir Radev (7) Linyong Nan (6) Wangchunshu Zhou (4) Niklas Muennighoff (3) Yanjun Shao (3) Chunyuan Deng (3) Haowei Zhang (2)

Keywords

large language model (12) benchmark evaluation (5) retrieval-augmented generation (3) factual consistency (3) prompt engineering (3) question answering (3) table-to-text generation (3) few-shot learning (3) in-context learning (2) multi-task learning (2) text classification (2) commonsense reasoning (2) dialogue summarization (2) medical reasoning (2) natural language generation (2) zero-shot learning (2) data contamination (2) claim verification (1) adversarial robustness (1) self-supervised learning (1)

Papers

OpenHands: An Open Platform for AI Software Developers as Generalist Agents ICLR 2025 OAgents: An Empirical Study of Building Effective Agents EMNLP 2025 Self-Supervised Prompt Optimization EMNLP 2025 Improving Context Fidelity via Native Retrieval-Augmented Reasoning EMNLP 2025 Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards EMNLP 2025 ChemAgent: Self-updating Memories in Large Language Models Improves Chemical Reasoning ICLR 2025 FinDVer: Explainable Claim Verification over Long and Hybrid-content Financial Documents EMNLP 2024 DocMath-Eval: Evaluating Math Reasoning Capabilities of LLMs in Understanding Long and Specialized Documents ACL 2024 MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning ACL 2024 Unveiling the Spectrum of Data Contamination in Language Model: A Survey from Detection to Remediation ACL 2024 Struc-Bench: Are Large Language Models Good at Generating Complex Structured Tabular Data? NAACL 2024 Investigating Data Contamination in Modern Benchmarks for Large Language Models NAACL 2024 OpenT2T: An Open-Source Toolkit for Table-to-Text Generation EMNLP 2024 MIMIR: A Customizable Agent Tuning Platform for Enhanced Scientific Applications EMNLP 2024 PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes EMNLP 2024 ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs ICLR 2024 OctoPack: Instruction Tuning Code Large Language Models ICLR 2024 QTSumm: Query-Focused Summarization over Tabular Data EMNLP 2023 RWKV: Reinventing RNNs for the Transformer Era EMNLP 2023 GersteinLab at MEDIQA-Chat 2023: Clinical Note Summarization from Doctor-Patient Conversations through Fine-tuning and In-context Learning ACL 2023 Aligning Factual Consistency for Clinical Studies Summarization through Reinforcement Learning ACL 2023 Crosslingual Generalization through Multitask Finetuning ACL 2023 RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations ACL 2023 Investigating Table-to-Text Generation Capabilities of Large Language Models in Real-World Information Seeking Scenarios EMNLP 2023 PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts ACL 2022 CONFIT: Toward Faithful Dialogue Summarization with Linguistically-Informed Contrastive Fine-tuning NAACL 2022 Investigating Crowdsourcing Protocols for Evaluating the Factual Consistency of Summaries NAACL 2022 DART: Open-Domain Structured Data Record to Text Generation NAACL 2021 CUHK at SemEval-2020 Task 4: CommonSense Explanation, Reasoning and Prediction with Multi-task Learning COLING 2020 CUHK at SemEval-2020 Task 4: CommonSense Explanation, Reasoning and Prediction with Multi-task Learning SEMEVAL 2020