Papers
2,781 papers found
Can LLMs Recognize Toxicity? A Structured Investigation Framework and Toxicity Metric
Hyukhun Koh, Dohyung Kim, Minwoo Lee et al.
How Reliable Are Automatic Evaluation Methods for Instruction-Tuned LLMs?
Ehsan Doostmohammadi, Oskar Holmström, Marco Kuhlmann
VideoINSTA: Zero-shot Long Video Understanding via Informative Spatial-Temporal Reasoning with LLMs
Ruotong Liao, Max Erler, Huiyu Wang et al.
CEAMC: Corpus and Empirical Study of Argument Analysis in Education via LLMs
Yupei Ren, Hongyi Wu, Zhaoguang Long et al.
LINKAGE: Listwise Ranking among Varied-Quality References for Non-Factoid QA Evaluation via LLMs
Sihui Yang, Keping Bi, Wanqing Cui et al.
Modeling Human Subjectivity in LLMs Using Explicit and Implicit Human Factors in Personas
Salvatore Giorgi, Tingting Liu, Ankit Aich et al.
Difficult Task Yes but Simple Task No: Unveiling the Laziness in Multimodal LLMs
Sihang Zhao, Youliang Yuan, Xiaoying Tang et al.
RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization
Xijie Huang, Zechun Liu, Shih-Yang Liu et al.
Is Compound Aspect-Based Sentiment Analysis Addressed by LLMs?
Yinhao Bai, Zhixin Han, Yuhua Zhao et al.
DyKnow: Dynamically Verifying Time-Sensitive Factual Knowledge in LLMs
Seyed Mahed Mousavi, Simone Alghisi, Giuseppe Riccardi
Exploring the Capability of Multimodal LLMs with Yonkoma Manga: The YManga Dataset and Its Challenging Tasks
Qi Yang, Jingjie Zeng, Liang Yang et al.
QPaug: Question and Passage Augmentation for Open-Domain Question Answering of LLMs
Minsang Kim, Cheoneum Park, Seung Jun Baek
LoRAExit: Empowering Dynamic Modulation of LLMs in Resource-limited Settings using Low-rank Adapters
Jiacheng Liu, Peng Tang, Xiaofeng Hou et al.
Do LLMs Think Fast and Slow? A Causal Study on Sentiment Analysis
Zhiheng Lyu, Zhijing Jin, Fernando Gonzalez Adauto et al.
Can LLMs Reason in the Wild with Programs?
Yuan Yang, Siheng Xiong, Ali Payani et al.
ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors
Zhexin Zhang, Yida Lu, Jingyuan Ma et al.
Multimodal Misinformation Detection by Learning from Synthetic Data with Multimodal LLMs
Fengzhu Zeng, Wenqian Li, Wei Gao et al.
Exploring Design Choices for Building Language-Specific LLMs
Atula Tejaswi, Nilesh Gupta, Eunsol Choi
Beyond Fine-tuning: Unleashing the Potential of Continuous Pretraining for Clinical LLMs
Clement Christophe, Tathagata Raha, Svetlana Maslenkova et al.
MedCare: Advancing Medical LLMs through Decoupling Clinical Alignment and Knowledge Aggregation
Yusheng Liao, Shuyang Jiang, Zhe Chen et al.
Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication
Weize Chen, Chenfei Yuan, Jiarui Yuan et al.
Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs
Wenhua Cheng, Weiwei Zhang, Haihao Shen et al.
Using LLMs to simulate students’ responses to exam questions
Luca Benedetto, Giovanni Aradelli, Antonia Donvito et al.
Will LLMs Sink or Swim? Exploring Decision-Making Under Pressure
Kyusik Kim, Hyeonseok Jeon, Jeongwoo Ryu et al.
“Vorbești Românește?” A Recipe to Train Powerful Romanian LLMs with English Instructions
Mihai Masala, Denis Ilie-Ablachim, Alexandru Dima et al.