Research Explorer

To Err Is AI: A Case Study Informing LLM Flaw Reporting Practices

Sean McGregor, Allyson Ettinger, Nick Judd et al.

2025 AAAI

Can LLMs Reliably Simulate Human Learner Actions? A Simulation Authoring Framework for Open-Ended Learning Environments

Amogh Mannekote, Adam Davies, Jina Kang et al.

2025 AAAI

Advancing Intelligent Software Development and Trustworthy Models Through the Synergy of Software Engineering and LLMs

Guanqun Yang

2025 AAAI

Advancing Medical Multimodal Learning and Data Generation with Diffusion Model and LLM

Yuan Zhong

2025 AAAI

Leveraging Textual Memory and Key Frame Reasoning for Full Video Understanding Using Off-the-Shelf LLMs and VLMs (Student Abstract)

Harsh Dubey, Chulwoo Pack

2025 AAAI

Entity Only vs. Inline Approaches: Evaluating LLMs for Adverse Drug Event Detection in Clinical Text (Student Abstract)

Howard Prioleau, Saurav Aryal

2025 AAAI

ConceptSearch: Towards Efficient Program Search Using LLMs for Abstraction and Reasoning Corpus (ARC) (Student Abstract)

Kartik Singhal, Gautam Shroff

2025 AAAI

Domain-Informed Label Fusion Surpasses LLMs in Free-Living Activity Classification (Student Abstract)

Shovito Barua Soumma, Abdullah Mamun, Hassan Ghasemzadeh

2025 AAAI

Federated Learning with Heterogeneous LLMs: Integrating Small Student Client Models with a Large Hungry Model

Gautam Jajoo

2025 AAAI

Truth Behind the Scene: Designing Evaluations Benchmarks to Assess LLMs’ Task-Specific Understanding over Test-Taking Strategies

Thao Pham

2025 AAAI

An Automated Explainable Educational Assessment System Built on LLMs

Jiazheng Li, Artem Bobrov, David West et al.

2025 AAAI

TRACE-CS: A Synergistic Approach to Explainable Course Scheduling Using LLMs and Logic

Stylianos Loukas Vasileiou, William Yeoh

2025 AAAI

MathMistake Checker: A Comprehensive Demonstration for Step-by-Step Math Problem Mistake Finding by Prompt-Guided LLMs

Tianyang Zhang, Zhuoxuan Jiang, Haotian Zhang et al.

2025 AAAI

Exploring Automatic Evaluation Methods based on a Decoder-based LLM for Text Generation

Tomohito Kasahara, Daisuke Kawahara

2023 AACL

Few-Shot Adaptation for Parsing Contextual Utterances with LLMs

Kevin Lin, Patrick Xia, Hao Fang

2023 AACL

Characterised LLMs Affect its Evaluation of Summary and Translation

Yu-An Lu, Yu-Ting Lin

2023 AACL

Little Giants: Exploring the Potential of Small LLMs as Evaluation Metrics in Summarization in the Eval4NLP 2023 Shared Task

Neema Kotonya, Saran Krishnasamy, Joel Tetreault et al.

2023 AACL

ESG Impact Type Classification: Leveraging Strategic Prompt Engineering and LLM Fine-Tuning

Soumya Mishra

2023 AACL

“Dr LLM, what do I have?”: The Impact of User Beliefs and Prompt Formulation on Health Diagnoses

Wojciech Kusa, Edoardo Mosca, Aldo Lipani

2023 AACL

Do LLMs Need Inherent Reasoning Before Reinforcement Learning? A Study in Korean Self-Correction

Hongjin Kim, Jaewook Lee, Kiyoung Lee et al.

2025 AACL

Hidden in Plain Text: Emergence & Mitigation of Steganographic Collusion in LLMs

Yohan Mathew, Ollie Matthews, Robert McCarthy et al.

2025 AACL

Multilingual, Not Multicultural: Uncovering the Cultural Empathy Gap in LLMs through a Comparative Empathetic Dialogue Benchmark

Woojin Lee, Yujin Sim, Hongjin Kim et al.

2025 AACL

Chain-of-Query: Unleashing the Power of LLMs in SQL-Aided Table Understanding via Multi-Agent Collaboration

Songyuan Sui, Hongyi Liu, Serena Liu et al.

2025 AACL

ProofTeller: Exposing recency bias in LLM reasoning and its side effects on communication

Mayank Jobanputra, Alisa Kovtunova, Brisca Balthes et al.

2025 AACL

An Adversary-Resistant Multi-Agent LLM System via Credibility Scoring

Sana Ebrahimi, Mohsen Dehghankar, Abolfazl Asudeh

2025 AACL

Papers