Papers
Personas as a Way to Model Truthfulness in Language Models
Nitish Joshi, Javier Rando, Abulhair Saparov et al.
Persuasiveness of Generated Free-Text Rationales in Subjective Decisions: A Case Study on Pairwise Argument Ranking
Mohamed Elaraby, Diane Litman, Xiang Lorraine Li et al.
PFA-ERC: Psuedo-Future Augmented Dynamic Emotion Recognition in Conversations
Tanmay Khule, Rishabh Agrawal, Apurva Narayan
P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains
Simeng Han, Aaron Yu, Rui Shen et al.
PG-Story: Taxonomy, Dataset, and Evaluation for Ensuring Child-Safe Content for Story Generation
Alicia Y. Tsai, Shereen Oraby, Anjali Narayan-Chen et al.
PhiloGPT: A Philology-Oriented Large Language Model for Ancient Chinese Manuscripts with Dunhuang as Case Study
Yuqing Zhang, Baoyi He, Yihan Chen et al.
Phonetic and Lexical Discovery of Canine Vocalization
Theron S. Wang, Xingyuan Li, Chunhao Zhang et al.
Pioneering Reliable Assessment in Text-to-Image Knowledge Editing: Leveraging a Fine-Grained Dataset and an Innovative Criterion
Hengrui Gu, Kaixiong Zhou, Yili Wang et al.
Pitfalls and Outlooks in Using COMET
Vilém Zouhar, Pinzhen Chen, Tsz Kin Lam et al.
Pixology: Probing the Linguistic and Visual Capabilities of Pixel-based Language Models
Kushal Tatariya, Vladimir Araujo, Thomas Bauwens et al.
PizzaCommonSense: A Dataset for Commonsense Reasoning about Intermediate Steps in Cooking Recipes
Aissatou Diallo, Antonis Bikakis, Luke Dickens et al.
PKAD: Pretrained Knowledge is All You Need to Detect and Mitigate Textual Backdoor Attacks
Yu Chen, Qi Cao, Kaike Zhang et al.
Platform-Invariant Topic Modeling via Contrastive Learning to Mitigate Platform-Induced Bias
Minseo Koo, Doeun Kim, Sungwon Han et al.
Plausibly Problematic Questions in Multiple-Choice Benchmarks for Commonsense Reasoning
Shramay Palta, Nishant Balepur, Peter Rankel et al.
Please note that I’m just an AI: Analysis of Behavior Patterns of LLMs in (Non-)offensive Speech Identification
Esra Dönmez, Thang Vu, Agnieszka Falenska
Plot Twist: Multimodal Models Don’t Comprehend Simple Chart Details
Yasaman Razeghi, Ishita Dasgupta, Fangyu Liu et al.
Plug, Play, and Fuse: Zero-Shot Joint Decoding via Word-Level Re-ranking across Diverse Vocabularies
Sai Koneru, Matthias Huck, Miriam Exel et al.
Polish Coreference Corpus as an LLM Testbed: Evaluating Coreference Resolution within Instruction-Following Language Models by Instruction–Answer Alignment
Karol Saputa, Angelika Peljak-Łapińska, Maciej Ogrodniczuk
PolyWER: A Holistic Evaluation Framework for Code-Switched Speech Recognition
Karima Kadaoui, Maryam Al Ali, Hawau Olamide Toyin et al.
Position Engineering: Boosting Large Language Models through Positional Information Manipulation
Zhiyuan He, Huiqiang Jiang, Zilong Wang et al.
PositionID: LLMs can Control Lengths, Copy and Paste with Explicit Positional Awareness
Noah Wang, Feiyu Duan, Yibo Zhang et al.
Position Paper: Data-Centric AI in the Age of Large Language Models
Xinyi Xu, Zhaoxuan Wu, Rui Qiao et al.
POSIX: A Prompt Sensitivity Index For Large Language Models
Anwoy Chatterjee, H S V N S Kowndinya Renduchintala, Sumit Bhatia et al.
Post-edits Are Preferences Too
Nathaniel Berger, Miriam Exel, Matthias Huck et al.
PostMark: A Robust Blackbox Watermark for Large Language Models
Yapei Chang, Kalpesh Krishna, Amir Houmansadr et al.