Papers
5,479 papers found
Bridging the Gap: Enhancing LLM Performance for Low-Resource African Languages with New Benchmarks, Fine-Tuning, and Cultural Adjustments
Tuka Alhanai, Adam Kasumovic, Mohammad M. Ghassemi et al.
Leveraging Computer Vision and Visual LLMs for Cost-Effective and Consistent Street Food Safety Assessment in Kolkata India
Alexey Chernikov, Klaus Ackermann, Caitlin Brown et al.
RTP-LX: Can LLMs Evaluate Toxicity in Multilingual Scenarios?
Adrian de Wynter, Ishaan Watts, Tua Wongsangaroonsri et al.
Reference-Based Post-OCR Processing with LLM for Precise Diacritic Text in Historical Document Recognition
Thao Do, Dinh Phu Tran, An Vo et al.
Cognitive Bias and Reassignment: Who Can Contribute High Quality LLM Data
Yunfan Gao, Yun Xiong, Zhongyuan Hu et al.
Language Ranker: A Metric for Quantifying LLM Performance Across High and Low-Resource Languages
Zihao Li, Yucheng Shi, Zirui Liu et al.
Multi-OphthaLingua: A Multilingual Benchmark for Assessing and Debiasing LLM Ophthalmological QA in LMICs
David Restrepo, Chenwei Wu, Zhengxu Tang et al.
CVE-LLM: Ontology-Assisted Automatic Vulnerability Evaluation Using Large Language Models
Rikhiya Ghosh, Hans-Martin von Stockhausen, Martin Schmitt et al.
ScriptSmith: A Unified LLM Framework for Enhancing IT Operations via Automated Bash Script Generation, Assessment, and Refinement
Pooja Aggarwal, Oishik Chatterjee, Ting Dai et al.
To Err Is AI: A Case Study Informing LLM Flaw Reporting Practices
Sean McGregor, Allyson Ettinger, Nick Judd et al.
Can LLMs Reliably Simulate Human Learner Actions? A Simulation Authoring Framework for Open-Ended Learning Environments
Amogh Mannekote, Adam Davies, Jina Kang et al.
Entity Only vs. Inline Approaches: Evaluating LLMs for Adverse Drug Event Detection in Clinical Text (Student Abstract)
Howard Prioleau, Saurav Aryal
ConceptSearch: Towards Efficient Program Search Using LLMs for Abstraction and Reasoning Corpus (ARC) (Student Abstract)
Kartik Singhal, Gautam Shroff
Domain-Informed Label Fusion Surpasses LLMs in Free-Living Activity Classification (Student Abstract)
Shovito Barua Soumma, Abdullah Mamun, Hassan Ghasemzadeh
An Automated Explainable Educational Assessment System Built on LLMs
Jiazheng Li, Artem Bobrov, David West et al.
TRACE-CS: A Synergistic Approach to Explainable Course Scheduling Using LLMs and Logic
Stylianos Loukas Vasileiou, William Yeoh
MathMistake Checker: A Comprehensive Demonstration for Step-by-Step Math Problem Mistake Finding by Prompt-Guided LLMs
Tianyang Zhang, Zhuoxuan Jiang, Haotian Zhang et al.
Characterised LLMs Affect its Evaluation of Summary and Translation
Yu-An Lu, Yu-Ting Lin
Little Giants: Exploring the Potential of Small LLMs as Evaluation Metrics in Summarization in the Eval4NLP 2023 Shared Task
Neema Kotonya, Saran Krishnasamy, Joel Tetreault et al.