Research Explorer

Jointly Extracting Interventions, Outcomes, and Findings from RCT Reports with LLMs

Somin Wadhwa, Jay DeYoung, Benjamin Nye et al.

2023 MLHC

LLMSYN: Generating Synthetic Electronic Health Records Without Patient-Level Data

Yijie Hao, Huan He, Joyce C. Ho

2024 MLHC

Leveraging LLMs for Multimodal Medical Time Series Analysis

Nimeesha Chan, Felix Parker, William C Bennett et al.

2024 MLHC

ConTextual: Improving Clinical Text Summarization in LLMs with Context-preserving Token Filtering and Knowledge Graphs

Fahmida Liza Piya, Rahmatollah Beheshti

2025 MLHC

FactEHR: A Dataset for Evaluating Factuality in Clinical Notes Using LLMs

Monica Munnangi, Akshay Swaminathan, Jason Alan Fries et al.

2025 MLHC

Evaluation of Multi-Agent LLMs in Multidisciplinary Team Decision-Making for Challenging Cancer Cases

Jaesik Kim, Byounghan Lee, Kyung-Ah Sohn et al.

2025 MLHC

Enhancing Adaptive Behavioral Interventions with LLM Inference from Participant Described States

Karine Karine, Benjamin M. Marlin

2025 MLHC

Does Domain-Specific Retrieval Augmented Generation Help LLMs Answer Consumer Health Questions?

Chase M Fensore, Rodrigo M Carrillo-Larco, Megha Shah et al.

2025 MLHC

LLMs Are Few-Shot In-Context Low-Resource Language Learners

Samuel Cahyawijaya, Holy Lovenia, Pascale Fung

2024 NAACL

FLAP: Flow-Adhering Planning with Constrained Decoding in LLMs

Shamik Roy, Sailik Sengupta, Daniele Bonadiman et al.

2024 NAACL

E5: Zero-shot Hierarchical Table Analysis using Augmented LLMs via Explain, Extract, Execute, Exhibit and Extrapolate

Zhehao Zhang, Yan Gao, Jian-Guang Lou

2024 NAACL

SELF-GUARD: Empower the LLM to Safeguard Itself

Zezhong Wang, Fangkai Yang, Lu Wang et al.

2024 NAACL

MART: Improving LLM Safety with Multi-round Automatic Red-Teaming

Suyu Ge, Chunting Zhou, Rui Hou et al.

2024 NAACL

Are Multilingual LLMs Culturally-Diverse Reasoners? An Investigation into Multicultural Proverbs and Sayings

Chen Cecilia Liu, Fajri Koto, Timothy Baldwin et al.

2024 NAACL

From Language Modeling to Instruction Following: Understanding the Behavior Shift in LLMs after Instruction Tuning

Xuansheng Wu, Wenlin Yao, Jianshu Chen et al.

2024 NAACL

How Trustworthy are Open-Source LLMs? An Assessment under Malicious Demonstrations Shows their Vulnerabilities

Lingbo Mo, Boshi Wang, Muhao Chen et al.

2024 NAACL

Do Localization Methods Actually Localize Memorized Data in LLMs? A Tale of Two Benchmarks

Ting-Yun Chang, Jesse Thomason, Robin Jia

2024 NAACL

Confronting LLMs with Traditional ML: Rethinking the Fairness of Large Language Models in Tabular Classifications

Yanchen Liu, Srishti Gautam, Jiaqi Ma et al.

2024 NAACL

Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks

Chonghua Wang, Haodong Duan, Songyang Zhang et al.

2024 NAACL

On-the-fly Definition Augmentation of LLMs for Biomedical NER

Monica Munnangi, Sergey Feldman, Byron Wallace et al.

2024 NAACL

Can Knowledge Graphs Reduce Hallucinations in LLMs? : A Survey

Garima Agrawal, Tharindu Kumarage, Zeyad Alghamdi et al.

2024 NAACL

TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization

Liyan Tang, Igor Shalyminov, Amy Wong et al.

2024 NAACL

Flames: Benchmarking Value Alignment of LLMs in Chinese

Kexin Huang, Xiangyang Liu, Qianyu Guo et al.

2024 NAACL

Fake Alignment: Are LLMs Really Aligned Well?

Yixu Wang, Yan Teng, Kexin Huang et al.

2024 NAACL

AudioChatLlama: Towards General-Purpose Speech Abilities for LLMs

Yassir Fathullah, Chunyang Wu, Egor Lakomkin et al.

2024 NAACL
