Papers
AgentQuest: A Modular Benchmark Framework to Measure Progress and Improve LLM Agents
Luca Gioacchini, Giuseppe Siracusano, Davide Sanvito et al.
Exploring Inherent Biases in LLMs within Korean Social Context: A Comparative Analysis of ChatGPT and GPT-4
Seungyoon Lee, Dongjun Kim, Dahyun Jung et al.
Distilling Text Style Transfer With Self-Explanation From LLMs
Chiyu Zhang, Honglong Cai, Yuezhang Li et al.
Human-AI Interaction in the Age of LLMs
Diyi Yang, Sherry Tongshuang Wu, Marti A. Hearst
Efficiently Distilling LLMs for Edge Applications
Achintya Kundu, Yu Chin Fabian Lim, Aaron Chew et al.
Optimizing LLM Based Retrieval Augmented Generation Pipelines in the Financial Domain
Yiyun Zhao, Prateek Singh, Hanoz Bhathena et al.
Leveraging LLMs for Dialogue Quality Measurement
Jinghan Jia, Abi Komma, Timothy Leffel et al.
EIVEN: Efficient Implicit Attribute Value Extraction using Multimodal LLM
Henry Peng Zou, Gavin Heqing Yu, Ziwei Fan et al.
DIVKNOWQA: Assessing the Reasoning Ability of LLMs via Open-Domain Question Answering over Knowledge Base and Text
Wenting Zhao, Ye Liu, Tong Niu et al.
Reverse Chain: A Generic-Rule for LLMs to Master Multi-API Planning
Yinger Zhang, Hui Cai, Xierui Song et al.
Comparing Two Model Designs for Clinical Note Generation; Is an LLM a Useful Evaluator of Consistency?
Nathan Brake, Thomas Schaaf
DivTOD: Unleashing the Power of LLMs for Diversifying Task-Oriented Dialogue Representations
Weihao Zeng, Dayuan Fu, Keqing He et al.
Chart-based Reasoning: Transferring Capabilities from LLMs to VLMs
Victor Carbune, Hassan Mansoor, Fangyu Liu et al.
What Makes Math Word Problems Challenging for LLMs?
Kv Aditya Srivatsa, Ekaterina Kochmar
Pruning as a Domain-specific LLM Extractor
Nan Zhang, Yanchi Liu, Xujiang Zhao et al.
LLMRefine: Pinpointing and Refining Large Language Models via Fine-Grained Actionable Feedback
Wenda Xu, Daniel Deutsch, Mara Finkelstein et al.
More Samples or More Prompts? Exploring Effective Few-Shot In-Context Learning for LLMs with In-Context Sampling
Bingsheng Yao, Guiming Chen, Ruishi Zou et al.
Enhancing Perception: Refining Explanations of News Claims with LLM Conversations
Yi-Li Hsu, Jui-Ning Chen, Yang Fan Chiang et al.
Rethinking Machine Ethics – Can LLMs Perform Moral Reasoning through the Lens of Moral Theories?
Jingyan Zhou, Minda Hu, Junan Li et al.
Unleashing the Power of LLMs in Court View Generation by Stimulating Internal Knowledge and Incorporating External Knowledge
Yifei Liu, Yiquan Wu, Ang Li et al.
Enhancing the General Agent Capabilities of Low-Parameter LLMs through Tuning and Multi-Branch Reasoning
Qinhao Zhou, Zihan Zhang, Xiang Xiang et al.
BotChat: Evaluating LLMs’ Capabilities of Having Multi-Turn Dialogues
Haodong Duan, Jueqi Wei, Chonghua Wang et al.
WebWISE: Unlocking Web Interface Control for LLMs via Sequential Exploration
Heyi Tao, Sethuraman T V, Michal Shlapentokh-Rothman et al.
Tokenizer Choice For LLM Training: Negligible or Crucial?
Mehdi Ali, Michael Fromm, Klaudia Thellmann et al.
On Evaluating the Integration of Reasoning and Action in LLM Agents with Database Question Answering
Linyong Nan, Ellen Zhang, Weijin Zou et al.