Papers
Probing LLMs for hate speech detection: strengths and vulnerabilities
Sarthak Roy, Ashish Harshvardhan, Animesh Mukherjee et al.
Probing LLMs for Joint Encoding of Linguistic Categories
Giulio Starace, Konstantinos Papakostas, Rochelle Choenni et al.
POSQA: Probe the World Models of LLMs with Size Comparisons
Chang Shu, Jiuzhou Han, Fangyu Liu et al.
“You Are An Expert Linguistic Annotator”: Limits of LLMs as Analyzers of Abstract Meaning Representation
Allyson Ettinger, Jena Hwang, Valentina Pyatkin et al.
Guiding LLM to Fool Itself: Automatically Manipulating Machine Reading Comprehension Shortcut Triggers
Mosh Levy, Shauli Ravfogel, Yoav Goldberg
DeTiME: Diffusion-Enhanced Topic Modeling using Encoder-decoder based LLM
Weijie Xu, Wenxiang Hu, Fanyou Wu et al.
Democratizing LLMs: An Exploration of Cost-Performance Trade-offs in Self-Refined Open-Source Models
Sumuk Shashidhar, Abhinav Chinta, Vaibhav Sahai et al.
Zero-shot Topical Text Classification with LLMs - an Experimental Study
Shai Gretz, Alon Halfon, Ilya Shnayderman et al.
TCRA-LLM: Token Compression Retrieval Augmented Large Language Model for Inference Cost Reduction
Junyi Liu, Liangzhi Li, Tong Xiang et al.
LLM aided semi-supervision for efficient Extractive Dialog Summarization
Nishant Mishra, Gaurav Sahu, Iacer Calixto et al.
NLP Evaluation in trouble: On the Need to Measure LLM Data Contamination for each Benchmark
Oscar Sainz, Jon Campos, Iker García-Ferrero et al.
Can ChatGPT Defend its Belief in Truth? Evaluating LLM Reasoning via Debate
Boshi Wang, Xiang Yue, Huan Sun
Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMs
Hongru Wang, Rui Wang, Fei Mi et al.
BLM-s/lE: A structured dataset of English spray-load verb alternations for testing generalization in LLMs
Giuseppe Samo, Vivi Nastase, Chunyang Jiang et al.
Not All Languages Are Created Equal in LLMs: Improving Multilingual Capability by Cross-Lingual-Thought Prompting
Haoyang Huang, Tianyi Tang, Dongdong Zhang et al.
LLMs – the Good, the Bad or the Indispensable?: A Use Case on Legal Statute Prediction and Legal Judgment Prediction on Indian Court Cases
Shaurya Vats, Atharva Zope, Somsubhra De et al.
LLMaAA: Making Large Language Models as Active Annotators
Ruoyu Zhang, Yanzeng Li, Yongliang Ma et al.
SGP-TOD: Building Task Bots Effortlessly via Schema-Guided LLM Prompting
Xiaoying Zhang, Baolin Peng, Kun Li et al.
Ethical Reasoning over Moral Alignment: A Case and Framework for In-Context Ethical Policies in LLMs
Abhinav Rao, Aditi Khandelwal, Kumar Tanmay et al.
Beyond Testers’ Biases: Guiding Model Testing with Knowledge Bases using LLMs
Chenyang Yang, Rishabh Rustogi, Rachel Brower-Sinning et al.
Arabic Mini-ClimateGPT : A Climate Change and Sustainability Tailored Arabic LLM
Sahal Shaji Mullappilly, Abdelrahman Shaker, Omkar Thawakar et al.
TELeR: A General Taxonomy of LLM Prompts for Benchmarking Complex Tasks
Shubhra Kanti Karmaker Santu, Dongji Feng
PaRaDe: Passage Ranking using Demonstrations with LLMs
Andrew Drozdov, Honglei Zhuang, Zhuyun Dai et al.
A Confederacy of Models: a Comprehensive Evaluation of LLMs on Creative Writing
Carlos Gómez-Rodríguez, Paul Williams
Drilling Down into the Discourse Structure with LLMs for Long Document Question Answering
Inderjeet Nair, Shwetha Somasundaram, Apoorv Saxena et al.