Papers
Instruction Tuning-free Visual Token Complement for Multimodal LLMs
Dongsheng Wang, Jiequan Cui, Miaoge Li et al.
Hyperpolyglot LLMs: Cross-Lingual Interpretability in Token Embeddings
Andrea W Wen-Yi, David Mimno
The Skipped Beat: A Study of Sociopragmatic Understanding in LLMs for 64 Languages
Chiyu Zhang, Khai Doan, Qisheng Liao et al.
BioPlanner: Automatic Evaluation of LLMs on Protocol Planning in Biology
Odhran O’Donoghue, Aleksandar Shtedritski, John Ginger et al.
API-Bank: A Comprehensive Benchmark for Tool-Augmented LLMs
Minghao Li, Yingxiu Zhao, Bowen Yu et al.
Can LLMs Facilitate Interpretation of Pre-trained Language Models?
Basel Mousi, Nadir Durrani, Fahim Dalvi
Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs Through a Global Prompt Hacking Competition
Sander Schulhoff, Jeremy Pinto, Anaum Khan et al.
Fine-tuned LLMs Know More, Hallucinate Less with Few-Shot Sequence-to-Sequence Semantic Parsing over Wikidata
Silei Xu, Shicheng Liu, Theo Culhane et al.
Personalized Distillation: Empowering Open-Sourced LLMs with Adaptive Learning for Code Generation
Hailin Chen, Amrita Saha, Steven Hoi et al.
EtiCor: Corpus for Analyzing LLMs for Etiquettes
Ashutosh Dwivedi, Pradhyumna Lavania, Ashutosh Modi
An Investigation of LLMs’ Inefficacy in Understanding Converse Relations
Chengwen Qi, Bowen Li, Binyuan Hui et al.
Have LLMs Advanced Enough? A Challenging Problem Solving Benchmark For Large Language Models
Daman Arora, Himanshu Singh, Mausam
Don’t Trust ChatGPT when your Question is not in English: A Study of Multilingual Abilities and Types of LLMs
Xiang Zhang, Senyu Li, Bradley Hauer et al.
EasyQuant: An Efficient Data-free Quantization Algorithm for LLMs
Hanlin Tang, Yifu Sun, Decheng Wu et al.
Learning Preference Model for LLMs via Automatic Preference Data Generation
Shijia Huang, Jianqiao Zhao, Yanyang Li et al.
SummEdits: Measuring LLM Ability at Factual Reasoning Through The Lens of Summarization
Philippe Laban, Wojciech Kryscinski, Divyansh Agarwal et al.
Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?
Cheng Zhang, Jianyi Cheng, Ilia Shumailov et al.
Appraising the Potential Uses and Harms of LLMs for Medical Systematic Reviews
Hye Yun, Iain Marshall, Thomas Trikalinos et al.
CoMPosT: Characterizing and Evaluating Caricature in LLM Simulations
Myra Cheng, Tiziano Piccardi, Diyi Yang
UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers
Jon Saad-Falcon, Omar Khattab, Keshav Santhanam et al.
Do LLMs Understand Social Knowledge? Evaluating the Sociability of Large Language Models with SocKET Benchmark
Minje Choi, Jiaxin Pei, Sagar Kumar et al.
Unlearn What You Want to Forget: Efficient Unlearning for LLMs
Jiaao Chen, Diyi Yang
Precedent-Enhanced Legal Judgment Prediction with LLM and Domain-Model Collaboration
Yiquan Wu, Siying Zhou, Yifei Liu et al.
Let’s Sample Step by Step: Adaptive-Consistency for Efficient Reasoning and Coding with LLMs
Pranjal Aggarwal, Aman Madaan, Yiming Yang et al.