Papers
Pro-Woman, Anti-Man? Identifying Gender Bias in Stance Detection
Yingjie Li, Yue Zhang
ProxyQA: An Alternative Framework for Evaluating Long-Form Text Generation with Large Language Models
Haochen Tan, Zhijiang Guo, Zhan Shi et al.
PRP-Graph: Pairwise Ranking Prompting to LLMs with Graph Aggregation for Effective Text Re-ranking
Jian Luo, Xuanang Chen, Ben He et al.
PRP: Propagating Universal Perturbations to Attack Large Language Model Guard-Rails
Neal Mangaokar, Ashish Hooda, Jihye Choi et al.
Pruning Large Language Models to Intra-module Low-rank Architecture with Transitional Activations
Bowen Shen, Zheng Lin, Daren Zha et al.
PsychoGAT: A Novel Psychological Measurement Paradigm through Interactive Fiction Games with LLM Agents
Qisen Yang, Zekun Wang, Honghui Chen et al.
PsySafe: A Comprehensive Framework for Psychological-based Attack, Defense, and Evaluation of Multi-agent System Safety
Zaibin Zhang, Yongting Zhang, Lijun Li et al.
P-TA: Using Proximal Policy Optimization to Enhance Tabular Data Augmentation via Large Language Models
Shuo Yang, Chenchen Yuan, Yao Rong et al.
PUB: A Pragmatics Understanding Benchmark for Assessing LLMs’ Pragmatics Capabilities
Settaluri Sravanthi, Meet Doshi, Pavan Tankala et al.
Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes
Sunjun Kweon, Junu Kim, Jiyoun Kim et al.
Pungene at DialAM-2024: Identification of Propositional and Illocutionary Relations
Sirawut Chaixanien, Eugene Choi, Shaden Shaar et al.
Pushing the Limits of Low-Resource NER Using LLM Artificial Data Generation
Joan Santoso, Patrick Sutanto, Billy Cahyadi et al.
Pushing the Limits of Zero-shot End-to-End Speech Translation
Ioannis Tsiamas, Gerard I. Gállego, José A. R. Fonollosa et al.
PuzzleVQA: Diagnosing Multimodal Reasoning Challenges of Language Models with Abstract Visual Patterns
Yew Ken Chia, Vernon Toh, Deepanway Ghosal et al.
PyFoma: a Python finite-state compiler module
Mans Hulden, Michael Ginn, Miikka Silfverberg et al.
PyramidInfer: Pyramid KV Cache Compression for High-throughput LLM Inference
Dongjie Yang, Xiaodong Han, Yan Gao et al.
QAES: First Publicly-Available Trait-Specific Annotations for Automated Scoring of Arabic Essays
May Bashendy, Salam Albatarni, Sohaila Eltanbouly et al.
Qalam: A Multimodal LLM for Arabic Optical Character and Handwriting Recognition
Gagan Bhatia, El Moatez Billah Nagoudi, Fakhraddin Alwajih et al.
QAVSA: Question Answering using Vector Symbolic Algebras
Ryan Laube, Chris Eliasmith
Quality-Aware Translation Models: Efficient Generation and Quality Estimation in a Single Model
Christian Tomani, David Vilar, Markus Freitag et al.
Quantifying Contamination in Evaluating Code Generation Capabilities of Language Models
Martin Riddell, Ansong Ni, Arman Cohan
Quantifying Generalizations: Exploring the Divide Between Human and LLMs’ Sensitivity to Quantification
Claudia Collacciani, Giulia Rambelli, Marianna Bolognesi
Quantifying the Persona Effect in LLM Simulations
Tiancheng Hu, Nigel Collier
Quantifying Uncertainty in Answers from any Language Model and Enhancing their Trustworthiness
Jiuhai Chen, Jonas Mueller
Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized Large Language Models
Zhengxin Zhang, Dan Zhao, Xupeng Miao et al.