Research Explorer

BANER: Boundary-Aware LLMs for Few-Shot Named Entity Recognition

Quanjiang Guo, Yihong Dong, Ling Tian et al.

2025 COLING

Let LLMs Take on the Latest Challenges! A Chinese Dynamic Question Answering Benchmark

Zhikun Xu, Yinghui Li, Ruixue Ding et al.

2025 COLING

KG-FPQ: Evaluating Factuality Hallucination in LLMs with Knowledge Graph-based False Premise Questions

Yanxu Zhu, Jinlin Xiao, Yuhang Wang et al.

2025 COLING

IberoBench: A Benchmark for LLM Evaluation in Iberian Languages

Irene Baucells, Javier Aula-Blasco, Iria de-Dios-Flores et al.

2025 COLING

Counterfactual Debating with Preset Stances for Hallucination Elimination of LLMs

Yi Fang, Moxin Li, Wenjie Wang et al.

2025 COLING

Evaluating the Consistency of LLM Evaluators

Noah Lee, Jiwoo Hong, James Thorne

2025 COLING

Beyond Chain-of-Thought: A Survey of Chain-of-X Paradigms for LLMs

Yu Xia, Rui Wang, Xu Liu et al.

2025 COLING

Data Augmentation for Cross-domain Parsing via Lightweight LLM Generation and Tree Hybridization

Ziyan Zhang, Yang Hou, Chen Gong et al.

2025 COLING

OpenFactCheck: Building, Benchmarking Customized Fact-Checking Systems and Evaluating the Factuality of Claims and LLMs

Yuxia Wang, Minghan Wang, Hasan Iqbal et al.

2025 COLING

Evaluating Model Alignment with Human Perception: A Study on Shitsukan in LLMs and LVLMs

Daiki Shiono, Ana Brassard, Yukiko Ishizuki et al.

2025 COLING

Streamlining Biomedical Research with Specialized LLMs

Linqing Chen

2025 COLING

BeefBot: Harnessing Advanced LLM and RAG Techniques for Providing Scientific and Technology Solutions to Beef Producers

Zhihao Zhang, Carrie-Ann Wilson, Rachel Hay et al.

2025 COLING

EasyJudge: an Easy-to-use Tool for Comprehensive Response Evaluation of LLMs

Yijie Li, Yuan Sun

2025 COLING

RAGthoven: A Configurable Toolkit for RAG-enabled LLM Experimentation

Gregor Karetka, Demetris Skottis, Lucia Dutková et al.

2025 COLING

PDC & DM-SFT: A Road for LLM SQL Bug-Fix Enhancing

Yiwen Duan, Yonghong Yu, Xiaoming Zhao et al.

2025 COLING

Automated Clinical Data Extraction with Knowledge Conditioned LLMs

Diya Li, Asim Kadav, Aijing Gao et al.

2025 COLING

No Size Fits All: The Perils and Pitfalls of Leveraging LLMs Vary with Company Size

Ashok Urlana, Charaka Vinayak Kumar, Bala Mallikarjunarao Garlapati et al.

2025 COLING

Fine-Tuning Medium-Scale LLMs for Joint Intent Classification and Slot Filling: A Data-Efficient and Cost-Effective Solution for SMEs

Maia Aguirre, Ariane Méndez, Arantza del Pozo et al.

2025 COLING

LLM Evaluate: An Industry-Focused Evaluation Tool for Large Language Models

Harsh Saini, Md Tahmid Rahman Laskar, Cheng Chen et al.

2025 COLING

Page Stream Segmentation with LLMs: Challenges and Applications in Insurance Document Automation

Hunter Heidenreich, Ratish Dalvi, Nikhil Verma et al.

2025 COLING

CarMem: Enhancing Long-Term Memory in LLM Voice Assistants through Category-Bounding

Johannes Kirmayr, Lukas Stappen, Phillip Schneider et al.

2025 COLING

Contextual ASR Error Handling with LLMs Augmentation for Goal-Oriented Conversational AI

Yuya Asano, Sabit Hassan, Paras Sharma et al.

2025 COLING

Building a Family of Data Augmentation Models for Low-cost LLM Fine-tuning on the Cloud

Yuanhao Yue, Chengyu Wang, Jun Huang et al.

2025 COLING

Where do LLMs Encode the Knowledge to Assess the Ambiguity?

Hancheol Park, Geonmin Kim

2025 COLING

A Simple yet Efficient Prompt Compression Method for Text Classification Data Annotation Using LLM

Yiran Xie, Debin Xiao, Ping Wang et al.

2025 COLING

Papers