Papers
6,952 papers found
LLM-Based Explicit Models of Opponents for Multi-Agent Games
XiaoPeng Yu, Wanpeng Zhang, Zongqing Lu
LLM-C3MOD: A Human-LLM Collaborative System for Cross-Cultural Hate Speech Moderation
Junyeong Park, Seogyeong Jeong, Seyoung Song et al.
LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language Models
Saaket Agashe, Yue Fan, Anthony Reyna et al.
LLM DEBATE OPPONENT : Counter-argument Generation focusing on Implicit and Critical Premises
Taisei Ozaki, Chihiro Nakagawa, Naoya Inoue et al.
LLM-Generated Passphrases That Are Secure and Easy to Remember
Jie S. Li, Jonas Geiping, Micah Goldblum et al.
LLM-guided Plan and Retrieval: A Strategic Alignment for Interpretable User Satisfaction Estimation in Dialogue
Sangyeop Kim, Sohhyung Park, Jaewon Jung et al.
LLM-Human Pipeline for Cultural Grounding of Conversations
Rajkumar Pujari, Dan Goldwasser
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers
Anton Razzhigaev, Matvey Mikhalchuk, Temurbek Rahmatullaev et al.
LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning
Shuguang Chen, Guang Lin
LLM Safety for Children
Prasanjit Rath, Hari Shrawgi, Parag Agrawal et al.
LLMs and Copyright Risks: Benchmarks and Mitigation Approaches
Denghui Zhang, Zhaozhuo Xu, Weijie Zhao
LLMs are Biased Teachers: Evaluating LLM Bias in Personalized Education
Iain Weissburg, Sathvika Anand, Sharon Levy et al.
LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias of LLMs
Do Xuan Long, Ngoc-Hai Nguyen, Tiviatis Sim et al.
LLMs Are Not Intelligent Thinkers: Introducing Mathematical Topic Tree Benchmark for Comprehensive Evaluation of LLMs
Arash Gholami Davoodi, Seyed Pouyan Mousavi Davoudi, Pouya Pezeshkpour
LLMs are not Zero-Shot Reasoners for Biomedical Information Extraction
Aishik Nagar, Viktor Schlegel, Thanh-Tung Nguyen et al.
LLMs as Meta-Reviewers’ Assistants: A Case Study
Eftekhar Hossain, Sanjeev Kumar Sinha, Naman Bansal et al.
LLMs for Extremely Low-Resource Finno-Ugric Languages
Taido Purason, Hele-Andra Kuulmets, Mark Fishel
LLM-Supported Natural Language to Bash Translation
Finnian Westenfelder, Erik Hemberg, Stephen Moskal et al.
LLMs vs Established Text Augmentation Techniques for Classification: When do the Benefits Outweight the Costs?
Jan Cegin, Jakub Simko, Peter Brusilovsky
LLM’s Weakness in NER Doesn’t Stop It from Enhancing a Stronger SLM
Weilu Xu, Renfei Dang, Shujian Huang
LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models
Kaichen Zhang, Bo Li, Peiyuan Zhang et al.
LMOD: A Large Multimodal Ophthalmology Dataset and Benchmark for Large Vision-Language Models
Zhenyue Qin, Yu Yin, Dylan Campbell et al.
LM-Pub-Quiz: A Comprehensive Framework for Zero-Shot Evaluation of Relational Knowledge in Language Models
Max Ploner, Jacek Wiland, Sebastian Pohl et al.