Papers
5,479 papers found
LLM-Human Pipeline for Cultural Grounding of Conversations
Rajkumar Pujari, Dan Goldwasser
Unifying AI Tutor Evaluation: An Evaluation Taxonomy for Pedagogical Ability Assessment of LLM-Powered AI Tutors
Kaushal Kumar Maurya, Kv Aditya Srivatsa, Kseniia Petukhova et al.
Divergent Thoughts toward One Goal: LLM-based Multi-Agent Collaboration System for Electronic Design Automation
Haoyuan Wu, Haisheng Zheng, Zhuolun He et al.
ALERT: An LLM-powered Benchmark for Automatic Evaluation of Recommendation Explanations
Yichuan Li, Xinyang Zhang, Chenwei Zhang et al.
CVE-Bench: Benchmarking LLM-based Software Engineering Agent’s Ability to Repair Real-World CVE Vulnerabilities
Peiran Wang, Xiaogeng Liu, Chaowei Xiao
Hello Again! LLM-powered Personalized Agent for Long-term Dialogue
Hao Li, Chenghao Yang, An Zhang et al.
GuideLLM: Exploring LLM-Guided Conversation with Applications in Autobiography Interviewing
Jinhao Duan, Xinyu Zhao, Zhuoxuan Zhang et al.
SPeCtrum: A Grounded Framework for Multidimensional Identity Representation in LLM-Based Agent
Keyeun Lee, Seo Hyeong Kim, Seolhee Lee et al.
H-STAR: LLM-driven Hybrid SQL-Text Adaptive Reasoning on Tables
Nikhil Abhyankar, Vivek Gupta, Dan Roth et al.
IMRRF: Integrating Multi-Source Retrieval and Redundancy Filtering for LLM-based Fake News Detection
Dayang Li, Fanxiao Li, Bingbing Song et al.
Simulating Classroom Education with LLM-Empowered Agents
Zheyuan Zhang, Daniel Zhang-Li, Jifan Yu et al.
LLM-guided Plan and Retrieval: A Strategic Alignment for Interpretable User Satisfaction Estimation in Dialogue
Sangyeop Kim, Sohhyung Park, Jaewon Jung et al.
LLM-Supported Natural Language to Bash Translation
Finnian Westenfelder, Erik Hemberg, Stephen Moskal et al.
PromptOptMe: Error-Aware Prompt Compression for LLM-based MT Evaluation Metrics
Daniil Larionov, Steffen Eger
SALAD: Improving Robustness and Generalization through Contrastive Learning with Structure-Aware and LLM-Driven Augmented Data
Suyoung Bae, YunSeok Choi, Hyojun Kim et al.
ChaI-TeA: A Benchmark for Evaluating Autocompletion of Interactions with LLM-based Chatbots
Shani Goren, Oren Kalinsky, Tomer Stav et al.
Open Ko-LLM Leaderboard2: Bridging Foundational and Practical Evaluation for Korean LLMs
Hyeonwoo Kim, Dahyun Kim, Jihoo Kim et al.
CuriousLLM: Elevating Multi-Document Question Answering with LLM-Enhanced Knowledge Graph Reasoning
Zukang Yang, Zixuan Zhu, Jennifer Zhu
Towards Reliable Agents: Benchmarking Customized LLM-Based Retrieval-Augmented Generation Frameworks with Deployment Validation
Kevin Shukang Wang, Karel Joshua Harjono, Ramon Lawrence
RxLens: Multi-Agent LLM-powered Scan and Order for Pharmacy
Akshay Jagatap, Srujana Merugu, Prakash Mandayam Comar
CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
Kaixin Wu, Yixin Ji, Zeyuan Chen et al.
An Efficient Context-Dependent Memory Framework for LLM-Centric Agents
Pengyu Gao, Jinming Zhao, Xinyue Chen et al.
ChatCRS: Incorporating External Knowledge and Goal Guidance for LLM-based Conversational Recommender Systems
Chuang Li, Yang Deng, Hengchang Hu et al.
QuaLLM: An LLM-based Framework to Extract Quantitative Insights from Online Forums
Varun Nagaraj Rao, Eesha Agarwal, Samantha Dalal et al.
A Federated Framework for LLM-based Recommendation
Jujia Zhao, Wenjie Wang, Chen Xu et al.