Papers
Persistent Instability in LLM’s Personality Measurements: Effects of Scale, Reasoning, and Conversation History
Tommaso Tosato, Saskia Helbling, Yorguin-Jose Mantilla-Ramos et al.
Benchmarking Trustworthiness in Multimodal LLMs for Video Understanding
Youze Wang, Zijun Chen, Ruoyu Chen et al.
STAR-1: Safer Alignment of Reasoning LLMs with 1K Data
Zijun Wang, Haoqin Tu, Yuhan Wang et al.
CluCERT: Certifying LLM Robustness via Clustering-Guided Denoising Smoothing
Zixia Wang, Gaojie Jin, Jia Hu et al.
HumorReject: Decoupling LLM Safety from Refusal Prefix via a Little Humor
Zihui Wu, Haichang Gao, Jiacheng Luo et al.
MedAtlas: Evaluating LLMs for Multi-Round, Multi-Task Medical Reasoning Across Diverse Imaging Modalities and Clinical Text
Ronghao Xu, Zhen Huang, Yangbo Wei et al.
Differentiated Directional Intervention: A Framework for Evading LLM Safety Alignment
Peng Zhang, Peijie Sun
GEM: Generative Entropy-Guided Preference Modeling for Few-Shot Alignment of LLMs
Yiyang Zhao, Huiyu Bai, Xuejiao Zhao
Can LLMs Detect Their Confabulations? Estimating Reliability in Uncertainty-Aware Language Models
Tianyi Zhou, Johanne Medina, Sanjay Chawla
On the Feasibility of Using MultiModal LLMs to Execute AR Social Engineering Attacks
Ting Bi, Chenghang Ye, Zheyu Yang et al.
Can LLMs Identify Tax Abuse?
Andrew Blair-Stanek, Nils Holzenberger, Benjamin Van Durme
CO2-Meter: A Comprehensive Carbon Footprint Estimator for LLMs on Edge Devices
Zhenxiao Fu, Fan Chen, Lei Jiang
LocalBench: Benchmarking LLMs on County-Level Local Knowledge and Reasoning
Zihan Gao, Yifei Xu, Jacob Thebault-Spieker
Can LLMs Truly Embody Human Personality? Analyzing AI and Human Behavior Alignment in Dispute Resolution
Deuksin Kwon, Kaleen Shrestha, Bin Han et al.
Evaluating LLMs for Police Decision-Making: A Framework Based on Police Action Scenarios
Sangyub Lee, Heedou Kim, Hyeoncheol Kim
Should You Use LLMs to Simulate Opinions? Quality Checks for Early-Stage Deliberation
Terrence Neumann, Maria De-Arteaga, Sina Fazelpour
LLM Targeted Underperformance Disproportionately Impacts Vulnerable Users
Elinor Poole-Dayan, Deb Roy, Jad Kabbara
The Confidence Trap: Gender Bias and Predictive Certainty in LLMs
Ahmed Sabir, Markus Kängsepp, Rajesh Sharma
CARE-Bench: A Benchmark of Diverse Client Simulations Guided by Expert Principles for Evaluating LLMs in Psychological Counseling
Bichen Wang, Yixin Sun, Junzhe Wang et al.
LLM Safety in Judicial AI: A Stress Test of Social Media Influence on Real-World Judgments
Yixuan Xie, Yang He, Xiaoyu Yang et al.
Assessing Automated Fact-Checking for Medical LLM Responses with Knowledge Graphs
Shasha Zhou, Mingyu Huang, Jack Cole et al.
Is Word Sense Disambiguation Dead in the LLM Era?
Roberto Navigli