Research Explorer

Bridging the Gap: Enhancing LLM Performance for Low-Resource African Languages with New Benchmarks, Fine-Tuning, and Cultural Adjustments

Tuka Alhanai, Adam Kasumovic, Mohammad M. Ghassemi et al.

2025 AAAI

Leveraging Computer Vision and Visual LLMs for Cost-Effective and Consistent Street Food Safety Assessment in Kolkata India

Alexey Chernikov, Klaus Ackermann, Caitlin Brown et al.

2025 AAAI

RTP-LX: Can LLMs Evaluate Toxicity in Multilingual Scenarios?

Adrian de Wynter, Ishaan Watts, Tua Wongsangaroonsri et al.

2025 AAAI

Reference-Based Post-OCR Processing with LLM for Precise Diacritic Text in Historical Document Recognition

Thao Do, Dinh Phu Tran, An Vo et al.

2025 AAAI

Cognitive Bias and Reassignment: Who Can Contribute High Quality LLM Data

Yunfan Gao, Yun Xiong, Zhongyuan Hu et al.

2025 AAAI

Language Ranker: A Metric for Quantifying LLM Performance Across High and Low-Resource Languages

Zihao Li, Yucheng Shi, Zirui Liu et al.

2025 AAAI

Multi-OphthaLingua: A Multilingual Benchmark for Assessing and Debiasing LLM Ophthalmological QA in LMICs

David Restrepo, Chenwei Wu, Zhengxu Tang et al.

2025 AAAI

Breaking the Resource Monopoly from Industries: Sustainable and Reliable LLM Serving by Recycling Outdated and Resource-Constrained GPUs

Tianlong Chen

2025 AAAI

CVE-LLM: Ontology-Assisted Automatic Vulnerability Evaluation Using Large Language Models

Rikhiya Ghosh, Hans-Martin von Stockhausen, Martin Schmitt et al.

2025 AAAI

ScriptSmith: A Unified LLM Framework for Enhancing IT Operations via Automated Bash Script Generation, Assessment, and Refinement

Pooja Aggarwal, Oishik Chatterjee, Ting Dai et al.

2025 AAAI

To Err Is AI: A Case Study Informing LLM Flaw Reporting Practices

Sean McGregor, Allyson Ettinger, Nick Judd et al.

2025 AAAI

Can LLMs Reliably Simulate Human Learner Actions? A Simulation Authoring Framework for Open-Ended Learning Environments

Amogh Mannekote, Adam Davies, Jina Kang et al.

2025 AAAI

Advancing Intelligent Software Development and Trustworthy Models Through the Synergy of Software Engineering and LLMs

Guanqun Yang

2025 AAAI

Advancing Medical Multimodal Learning and Data Generation with Diffusion Model and LLM

Yuan Zhong

2025 AAAI

Leveraging Textual Memory and Key Frame Reasoning for Full Video Understanding Using Off-the-Shelf LLMs and VLMs (Student Abstract)

Harsh Dubey, Chulwoo Pack

2025 AAAI

Entity Only vs. Inline Approaches: Evaluating LLMs for Adverse Drug Event Detection in Clinical Text (Student Abstract)

Howard Prioleau, Saurav Aryal

2025 AAAI

ConceptSearch: Towards Efficient Program Search Using LLMs for Abstraction and Reasoning Corpus (ARC) (Student Abstract)

Kartik Singhal, Gautam Shroff

2025 AAAI

Domain-Informed Label Fusion Surpasses LLMs in Free-Living Activity Classification (Student Abstract)

Shovito Barua Soumma, Abdullah Mamun, Hassan Ghasemzadeh

2025 AAAI

Federated Learning with Heterogeneous LLMs: Integrating Small Student Client Models with a Large Hungry Model

Gautam Jajoo

2025 AAAI

Truth Behind the Scene: Designing Evaluations Benchmarks to Assess LLMs’ Task-Specific Understanding over Test-Taking Strategies

Thao Pham

2025 AAAI

An Automated Explainable Educational Assessment System Built on LLMs

Jiazheng Li, Artem Bobrov, David West et al.

2025 AAAI

TRACE-CS: A Synergistic Approach to Explainable Course Scheduling Using LLMs and Logic

Stylianos Loukas Vasileiou, William Yeoh

2025 AAAI

MathMistake Checker: A Comprehensive Demonstration for Step-by-Step Math Problem Mistake Finding by Prompt-Guided LLMs

Tianyang Zhang, Zhuoxuan Jiang, Haotian Zhang et al.

2025 AAAI

Characterised LLMs Affect its Evaluation of Summary and Translation

Yu-An Lu, Yu-Ting Lin

2023 AACL

Little Giants: Exploring the Potential of Small LLMs as Evaluation Metrics in Summarization in the Eval4NLP 2023 Shared Task

Neema Kotonya, Saran Krishnasamy, Joel Tetreault et al.

2023 AACL

Papers