Papers
Jointly Extracting Interventions, Outcomes, and Findings from RCT Reports with LLMs
Somin Wadhwa, Jay DeYoung, Benjamin Nye et al.
LLMSYN: Generating Synthetic Electronic Health Records Without Patient-Level Data
Yijie Hao, Huan He, Joyce C. Ho
Leveraging LLMs for Multimodal Medical Time Series Analysis
Nimeesha Chan, Felix Parker, William C Bennett et al.
ConTextual: Improving Clinical Text Summarization in LLMs with Context-preserving Token Filtering and Knowledge Graphs
Fahmida Liza Piya, Rahmatollah Beheshti
FactEHR: A Dataset for Evaluating Factuality in Clinical Notes Using LLMs
Monica Munnangi, Akshay Swaminathan, Jason Alan Fries et al.
Evaluation of Multi-Agent LLMs in Multidisciplinary Team Decision-Making for Challenging Cancer Cases
Jaesik Kim, Byounghan Lee, Kyung-Ah Sohn et al.
Enhancing Adaptive Behavioral Interventions with LLM Inference from Participant Described States
Karine Karine, Benjamin M. Marlin
Does Domain-Specific Retrieval Augmented Generation Help LLMs Answer Consumer Health Questions?
Chase M Fensore, Rodrigo M Carrillo-Larco, Megha Shah et al.
LLMs Are Few-Shot In-Context Low-Resource Language Learners
Samuel Cahyawijaya, Holy Lovenia, Pascale Fung
FLAP: Flow-Adhering Planning with Constrained Decoding in LLMs
Shamik Roy, Sailik Sengupta, Daniele Bonadiman et al.
E5: Zero-shot Hierarchical Table Analysis using Augmented LLMs via Explain, Extract, Execute, Exhibit and Extrapolate
Zhehao Zhang, Yan Gao, Jian-Guang Lou
SELF-GUARD: Empower the LLM to Safeguard Itself
Zezhong Wang, Fangkai Yang, Lu Wang et al.
MART: Improving LLM Safety with Multi-round Automatic Red-Teaming
Suyu Ge, Chunting Zhou, Rui Hou et al.
Are Multilingual LLMs Culturally-Diverse Reasoners? An Investigation into Multicultural Proverbs and Sayings
Chen Cecilia Liu, Fajri Koto, Timothy Baldwin et al.
From Language Modeling to Instruction Following: Understanding the Behavior Shift in LLMs after Instruction Tuning
Xuansheng Wu, Wenlin Yao, Jianshu Chen et al.
How Trustworthy are Open-Source LLMs? An Assessment under Malicious Demonstrations Shows their Vulnerabilities
Lingbo Mo, Boshi Wang, Muhao Chen et al.
Do Localization Methods Actually Localize Memorized Data in LLMs? A Tale of Two Benchmarks
Ting-Yun Chang, Jesse Thomason, Robin Jia
Confronting LLMs with Traditional ML: Rethinking the Fairness of Large Language Models in Tabular Classifications
Yanchen Liu, Srishti Gautam, Jiaqi Ma et al.
Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks
Chonghua Wang, Haodong Duan, Songyang Zhang et al.
On-the-fly Definition Augmentation of LLMs for Biomedical NER
Monica Munnangi, Sergey Feldman, Byron Wallace et al.
Can Knowledge Graphs Reduce Hallucinations in LLMs? : A Survey
Garima Agrawal, Tharindu Kumarage, Zeyad Alghamdi et al.
TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization
Liyan Tang, Igor Shalyminov, Amy Wong et al.
Flames: Benchmarking Value Alignment of LLMs in Chinese
Kexin Huang, Xiangyang Liu, Qianyu Guo et al.
Fake Alignment: Are LLMs Really Aligned Well?
Yixu Wang, Yan Teng, Kexin Huang et al.
AudioChatLlama: Towards General-Purpose Speech Abilities for LLMs
Yassir Fathullah, Chunyang Wu, Egor Lakomkin et al.