Papers
Probing LLMs for Joint Encoding of Linguistic Categories
Giulio Starace, Konstantinos Papakostas, Rochelle Choenni et al.
POSQA: Probe the World Models of LLMs with Size Comparisons
Chang Shu, Jiuzhou Han, Fangyu Liu et al.
“You Are An Expert Linguistic Annotator”: Limits of LLMs as Analyzers of Abstract Meaning Representation
Allyson Ettinger, Jena Hwang, Valentina Pyatkin et al.
Guiding LLM to Fool Itself: Automatically Manipulating Machine Reading Comprehension Shortcut Triggers
Mosh Levy, Shauli Ravfogel, Yoav Goldberg
DeTiME: Diffusion-Enhanced Topic Modeling using Encoder-decoder based LLM
Weijie Xu, Wenxiang Hu, Fanyou Wu et al.
Democratizing LLMs: An Exploration of Cost-Performance Trade-offs in Self-Refined Open-Source Models
Sumuk Shashidhar, Abhinav Chinta, Vaibhav Sahai et al.
Zero-shot Topical Text Classification with LLMs - an Experimental Study
Shai Gretz, Alon Halfon, Ilya Shnayderman et al.
TCRA-LLM: Token Compression Retrieval Augmented Large Language Model for Inference Cost Reduction
Junyi Liu, Liangzhi Li, Tong Xiang et al.
LLM aided semi-supervision for efficient Extractive Dialog Summarization
Nishant Mishra, Gaurav Sahu, Iacer Calixto et al.
NLP Evaluation in trouble: On the Need to Measure LLM Data Contamination for each Benchmark
Oscar Sainz, Jon Campos, Iker García-Ferrero et al.
Can ChatGPT Defend its Belief in Truth? Evaluating LLM Reasoning via Debate
Boshi Wang, Xiang Yue, Huan Sun
Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMs
Hongru Wang, Rui Wang, Fei Mi et al.
BLM-s/lE: A structured dataset of English spray-load verb alternations for testing generalization in LLMs
Giuseppe Samo, Vivi Nastase, Chunyang Jiang et al.
Not All Languages Are Created Equal in LLMs: Improving Multilingual Capability by Cross-Lingual-Thought Prompting
Haoyang Huang, Tianyi Tang, Dongdong Zhang et al.
LLMs – the Good, the Bad or the Indispensable?: A Use Case on Legal Statute Prediction and Legal Judgment Prediction on Indian Court Cases
Shaurya Vats, Atharva Zope, Somsubhra De et al.
LLMaAA: Making Large Language Models as Active Annotators
Ruoyu Zhang, Yanzeng Li, Yongliang Ma et al.
SGP-TOD: Building Task Bots Effortlessly via Schema-Guided LLM Prompting
Xiaoying Zhang, Baolin Peng, Kun Li et al.
Ethical Reasoning over Moral Alignment: A Case and Framework for In-Context Ethical Policies in LLMs
Abhinav Rao, Aditi Khandelwal, Kumar Tanmay et al.
Beyond Testers’ Biases: Guiding Model Testing with Knowledge Bases using LLMs
Chenyang Yang, Rishabh Rustogi, Rachel Brower-Sinning et al.
Arabic Mini-ClimateGPT : A Climate Change and Sustainability Tailored Arabic LLM
Sahal Shaji Mullappilly, Abdelrahman Shaker, Omkar Thawakar et al.
TELeR: A General Taxonomy of LLM Prompts for Benchmarking Complex Tasks
Shubhra Kanti Karmaker Santu, Dongji Feng
PaRaDe: Passage Ranking using Demonstrations with LLMs
Andrew Drozdov, Honglei Zhuang, Zhuyun Dai et al.
A Confederacy of Models: a Comprehensive Evaluation of LLMs on Creative Writing
Carlos Gómez-Rodríguez, Paul Williams
Drilling Down into the Discourse Structure with LLMs for Long Document Question Answering
Inderjeet Nair, Shwetha Somasundaram, Apoorv Saxena et al.
Learning Interpretable Style Embeddings via Prompting LLMs
Ajay Patel, Delip Rao, Ansh Kothary et al.