Papers
Named Entity Inference Attacks on Clinical LLMs: Exploring Privacy Risks and the Impact of Mitigation Strategies
Adam Sutton, Xi Bai, Kawsar Noor et al.
Line of Duty: Evaluating LLM Self-Knowledge via Consistency in Feasibility Boundaries
Sahil Kale, Vijaykant Nadadur
Multi-lingual Multi-turn Automated Red Teaming for LLMs
Abhishek Singhania, Christophe Dupuy, Shivam Sadashiv Mangale et al.
Summary the Savior: Harmful Keyword and Query-based Summarization for LLM Jailbreak Defense
Shagoto Rahman, Ian Harris
Monte Carlo Temperature: a robust sampling strategy for LLM’s uncertainty quantification methods
Nicola Cecere, Andrea Bacciu, Ignacio Fernández-Tobías et al.
A Calibrated Reflection Approach for Enhancing Confidence Estimation in LLMs
Umesh Bodhwani, Yuan Ling, Shujing Dong et al.
Investigating and Addressing Hallucinations of LLMs in Tasks Involving Negation
Neeraj Varshney, Satyam Raj, Venkatesh Mishra et al.
Beyond LLMs A Linguistic Approach to Causal Graph Generation from Narrative Texts
Zehan Li, Ruhua Pan, Xinyu Pi
Narrative Studio: Visual narrative exploration using LLMs and Monte Carlo Tree Search
Parsa Ghaffari, Chris Hokamp
Speaker Identification and Dataset Construction Using LLMs: A Case Study on Japanese Narratives
Seiji Gobara, Hidetaka Kamigaito, Taro Watanabe
Automatic normalization of noisy technical reports with an LLM: What effects on a downstream task?
Mariame Maarouf, Ludovic Tanguy
On-Device LLMs for Home Assistant: Dual Role in Intent Detection and Response Generation
Rune Birkmose, Nathan Mørkeberg Reece, Esben Hofstedt Norvin et al.
Accelerating Design Space Exploration for LLM Training Systems with Multi-experiment Parallel Simulation
Fei Gui, Kaihui Gao, Li Chen et al.
Holmes: Localizing Irregularities in LLM Training with Mega-scale GPU Clusters
Zhiyi Yao, Pengbo Hu, Congcong Miao et al.
Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve
Amey Agrawal, Nitin Kedia, Ashish Panwar et al.
dLoRA: Dynamically Orchestrating Requests and Adapters for LoRA LLM Serving
Bingyang Wu, Ruidong Zhu, Zili Zhang et al.
WLB-LLM: Workload-Balanced 4D Parallelism for Large Language Model Training
Zheng Wang, Anna Cai, Xinfeng Xie et al.
DecDEC: A Systems Approach to Advancing Low-Bit LLM Quantization
Yeonhong Park, Jake Hyun, Hojoon Kim et al.
UMUTeam at SemEval-2023 Task 12: Ensemble Learning of LLMs applied to Sentiment Analysis for Low-resource African Languages
José Antonio García-Díaz, Camilo Caparros-laiz, Ángela Almela et al.
HalluSafe at SemEval-2024 Task 6: An NLI-based Approach to Make LLMs Safer by Better Detecting Hallucinations and Overgeneration Mistakes
Zahra Rahimi, Hamidreza Amirzadeh, Alireza Sohrabi et al.
iML at SemEval-2024 Task 2: Safe Biomedical Natural Language Interference for Clinical Trials with LLM Based Ensemble Inferencing
Abbas Akkasi, Adnan Khan, Mai A. Shaaban et al.
OUNLP at SemEval-2024 Task 9: Retrieval-Augmented Generation for Solving Brain Teasers with LLMs
Vineet Saravanan, Steven Wilson
GAVx at SemEval-2024 Task 10: Emotion Flip Reasoning via Stacked Instruction Finetuning of LLMs
Vy Nguyen, Xiuzhen Zhang
Halu-NLP at SemEval-2024 Task 6: MetaCheckGPT - A Multi-task Hallucination Detection using LLM uncertainty and meta-models
Rahul Mehta, Andrew Hoblitzell, Jack O’keefe et al.