Papers
BANER: Boundary-Aware LLMs for Few-Shot Named Entity Recognition
Quanjiang Guo, Yihong Dong, Ling Tian et al.
Let LLMs Take on the Latest Challenges! A Chinese Dynamic Question Answering Benchmark
Zhikun Xu, Yinghui Li, Ruixue Ding et al.
KG-FPQ: Evaluating Factuality Hallucination in LLMs with Knowledge Graph-based False Premise Questions
Yanxu Zhu, Jinlin Xiao, Yuhang Wang et al.
IberoBench: A Benchmark for LLM Evaluation in Iberian Languages
Irene Baucells, Javier Aula-Blasco, Iria de-Dios-Flores et al.
Counterfactual Debating with Preset Stances for Hallucination Elimination of LLMs
Yi Fang, Moxin Li, Wenjie Wang et al.
Evaluating the Consistency of LLM Evaluators
Noah Lee, Jiwoo Hong, James Thorne
Beyond Chain-of-Thought: A Survey of Chain-of-X Paradigms for LLMs
Yu Xia, Rui Wang, Xu Liu et al.
Data Augmentation for Cross-domain Parsing via Lightweight LLM Generation and Tree Hybridization
Ziyan Zhang, Yang Hou, Chen Gong et al.
OpenFactCheck: Building, Benchmarking Customized Fact-Checking Systems and Evaluating the Factuality of Claims and LLMs
Yuxia Wang, Minghan Wang, Hasan Iqbal et al.
Evaluating Model Alignment with Human Perception: A Study on Shitsukan in LLMs and LVLMs
Daiki Shiono, Ana Brassard, Yukiko Ishizuki et al.
BeefBot: Harnessing Advanced LLM and RAG Techniques for Providing Scientific and Technology Solutions to Beef Producers
Zhihao Zhang, Carrie-Ann Wilson, Rachel Hay et al.
EasyJudge: an Easy-to-use Tool for Comprehensive Response Evaluation of LLMs
Yijie Li, Yuan Sun
RAGthoven: A Configurable Toolkit for RAG-enabled LLM Experimentation
Gregor Karetka, Demetris Skottis, Lucia Dutková et al.
PDC & DM-SFT: A Road for LLM SQL Bug-Fix Enhancing
Yiwen Duan, Yonghong Yu, Xiaoming Zhao et al.
Automated Clinical Data Extraction with Knowledge Conditioned LLMs
Diya Li, Asim Kadav, Aijing Gao et al.
No Size Fits All: The Perils and Pitfalls of Leveraging LLMs Vary with Company Size
Ashok Urlana, Charaka Vinayak Kumar, Bala Mallikarjunarao Garlapati et al.
Fine-Tuning Medium-Scale LLMs for Joint Intent Classification and Slot Filling: A Data-Efficient and Cost-Effective Solution for SMEs
Maia Aguirre, Ariane Méndez, Arantza del Pozo et al.
LLM Evaluate: An Industry-Focused Evaluation Tool for Large Language Models
Harsh Saini, Md Tahmid Rahman Laskar, Cheng Chen et al.
Page Stream Segmentation with LLMs: Challenges and Applications in Insurance Document Automation
Hunter Heidenreich, Ratish Dalvi, Nikhil Verma et al.
CarMem: Enhancing Long-Term Memory in LLM Voice Assistants through Category-Bounding
Johannes Kirmayr, Lukas Stappen, Phillip Schneider et al.
Contextual ASR Error Handling with LLMs Augmentation for Goal-Oriented Conversational AI
Yuya Asano, Sabit Hassan, Paras Sharma et al.
Building a Family of Data Augmentation Models for Low-cost LLM Fine-tuning on the Cloud
Yuanhao Yue, Chengyu Wang, Jun Huang et al.
Where do LLMs Encode the Knowledge to Assess the Ambiguity?
Hancheol Park, Geonmin Kim
A Simple yet Efficient Prompt Compression Method for Text Classification Data Annotation Using LLM
Yiran Xie, Debin Xiao, Ping Wang et al.