Papers

5,479 papers found

Can LLMs Identify Tax Abuse?

Andrew Blair-Stanek, Nils Holzenberger, Benjamin Van Durme

2026 AAAI

CO2-Meter: A Comprehensive Carbon Footprint Estimator for LLMs on Edge Devices

Zhenxiao Fu, Fan Chen, Lei Jiang

2026 AAAI

LocalBench: Benchmarking LLMs on County-Level Local Knowledge and Reasoning

Zihan Gao, Yifei Xu, Jacob Thebault-Spieker

2026 AAAI

Can LLMs Truly Embody Human Personality? Analyzing AI and Human Behavior Alignment in Dispute Resolution

Deuksin Kwon, Kaleen Shrestha, Bin Han et al.

2026 AAAI

Evaluating LLMs for Police Decision-Making: A Framework Based on Police Action Scenarios

Sangyub Lee, Heedou Kim, Hyeoncheol Kim

2026 AAAI

Should You Use LLMs to Simulate Opinions? Quality Checks for Early-Stage Deliberation

Terrence Neumann, Maria De-Arteaga, Sina Fazelpour

2026 AAAI

LLM Targeted Underperformance Disproportionately Impacts Vulnerable Users

Elinor Poole-Dayan, Deb Roy, Jad Kabbara

2026 AAAI

The Confidence Trap: Gender Bias and Predictive Certainty in LLMs

Ahmed Sabir, Markus Kängsepp, Rajesh Sharma

2026 AAAI

CARE-Bench: A Benchmark of Diverse Client Simulations Guided by Expert Principles for Evaluating LLMs in Psychological Counseling

Bichen Wang, Yixin Sun, Junzhe Wang et al.

2026 AAAI

LLM Safety in Judicial AI: A Stress Test of Social Media Influence on Real-World Judgments

Yixuan Xie, Yang He, Xiaoyu Yang et al.

2026 AAAI

Assessing Automated Fact-Checking for Medical LLM Responses with Knowledge Graphs

Shasha Zhou, Mingyu Huang, Jack Cole et al.

2026 AAAI

Democratizing LLM Efficiency: From Hyperscale Optimizations to Universal Deployability

Hen-Hsen Huang

2026 AAAI

Is Word Sense Disambiguation Dead in the LLM Era?

Roberto Navigli

2026 AAAI

Beyond Neuron-Level Sparsity: Achieving Faithful and Interpretable LLMs with Mixture of Decoders

Grigorios Chrysos

2026 AAAI

Breaking the Resource Monopoly: LLM Post-Training and Serving with Modest Data and Compute

Jiaxin Huang

2026 AAAI

Toward Controllable and Trustworthy LLM Reasoning: From Failure Mapping to Cognition-inspired Control and Real-world Impact

Ben Zhou

2026 AAAI

SARA: Leveraging LLM Agents and Jurisprudential Ontologies for Automated Legal Reasoning

Francisco C J Bonfim, Sara Pessoa SIlva, Alicia S Neves et al.

2026 AAAI

PRECISE: Reducing the Bias of LLM Evaluations Using Prediction-Powered Ranking Estimation

Abhishek Divekar, Anirban Majumder

2026 AAAI

Scalable and Efficient Large-Scale Log Analysis with LLMs: An IT Software Support Case Study

Pranjal Gupta, Karan Bhukar, Harshit Kumar et al.

2026 AAAI

Physics-Informed Autonomous LLM Agents for Explainable Power Electronics Modulation Design

Junhua Liu, Fanfan Lin, Xinze Li et al.

2026 AAAI

AquaSentinel: Next-Generation AI System Integrating Sensor Networks for Urban Underground Water Pipeline Anomaly Detection via Collaborative MoE-LLM Agent Architecture

Qiming Guo, Bishal Khatri, Wenbo Sun et al.

2026 AAAI

A Metacognitive Architecture for Correcting LLM Errors in AI Agents

Jisu Kim, Mahimul Islam, Ashok Goel

2026 AAAI

LLM4Sweat: A Trustworthy Large Language Model for Hyperhidrosis Support

Wenjie Lin, Jin Wei-Kocsis

2026 AAAI

TreeBridge: Aligning LLM Embeddings in Industrial Recommender Systems

Yabo Ni, Cao Yuanpeng, Wenhang Zhou et al.

2026 AAAI

Stratos: An End-to-End Distillation Pipeline for Customized LLMs Under Distributed Cloud Environments

Ziming Dai, Tuo Zhang, Fei Gao et al.

2026 AAAI