Papers
5,479 papers found
Can LLMs Identify Tax Abuse?
Andrew Blair-Stanek, Nils Holzenberger, Benjamin Van Durme
CO2-Meter: A Comprehensive Carbon Footprint Estimator for LLMs on Edge Devices
Zhenxiao Fu, Fan Chen, Lei Jiang
LocalBench: Benchmarking LLMs on County-Level Local Knowledge and Reasoning
Zihan Gao, Yifei Xu, Jacob Thebault-Spieker
Can LLMs Truly Embody Human Personality? Analyzing AI and Human Behavior Alignment in Dispute Resolution
Deuksin Kwon, Kaleen Shrestha, Bin Han et al.
Evaluating LLMs for Police Decision-Making: A Framework Based on Police Action Scenarios
Sangyub Lee, Heedou Kim, Hyeoncheol Kim
Should You Use LLMs to Simulate Opinions? Quality Checks for Early-Stage Deliberation
Terrence Neumann, Maria De-Arteaga, Sina Fazelpour
LLM Targeted Underperformance Disproportionately Impacts Vulnerable Users
Elinor Poole-Dayan, Deb Roy, Jad Kabbara
The Confidence Trap: Gender Bias and Predictive Certainty in LLMs
Ahmed Sabir, Markus Kängsepp, Rajesh Sharma
CARE-Bench: A Benchmark of Diverse Client Simulations Guided by Expert Principles for Evaluating LLMs in Psychological Counseling
Bichen Wang, Yixin Sun, Junzhe Wang et al.
LLM Safety in Judicial AI: A Stress Test of Social Media Influence on Real-World Judgments
Yixuan Xie, Yang He, Xiaoyu Yang et al.
Assessing Automated Fact-Checking for Medical LLM Responses with Knowledge Graphs
Shasha Zhou, Mingyu Huang, Jack Cole et al.
Is Word Sense Disambiguation Dead in the LLM Era?
Roberto Navigli
SARA: Leveraging LLM Agents and Jurisprudential Ontologies for Automated Legal Reasoning
Francisco C J Bonfim, Sara Pessoa SIlva, Alicia S Neves et al.
PRECISE: Reducing the Bias of LLM Evaluations Using Prediction-Powered Ranking Estimation
Abhishek Divekar, Anirban Majumder
Scalable and Efficient Large-Scale Log Analysis with LLMs: An IT Software Support Case Study
Pranjal Gupta, Karan Bhukar, Harshit Kumar et al.
Physics-Informed Autonomous LLM Agents for Explainable Power Electronics Modulation Design
Junhua Liu, Fanfan Lin, Xinze Li et al.
AquaSentinel: Next-Generation AI System Integrating Sensor Networks for Urban Underground Water Pipeline Anomaly Detection via Collaborative MoE-LLM Agent Architecture
Qiming Guo, Bishal Khatri, Wenbo Sun et al.
A Metacognitive Architecture for Correcting LLM Errors in AI Agents
Jisu Kim, Mahimul Islam, Ashok Goel
LLM4Sweat: A Trustworthy Large Language Model for Hyperhidrosis Support
Wenjie Lin, Jin Wei-Kocsis
TreeBridge: Aligning LLM Embeddings in Industrial Recommender Systems
Yabo Ni, Cao Yuanpeng, Wenhang Zhou et al.
Stratos: An End-to-End Distillation Pipeline for Customized LLMs Under Distributed Cloud Environments
Ziming Dai, Tuo Zhang, Fei Gao et al.