Papers
5,479 papers found
A Fair Comparison without Translationese: English vs. Target-language Instructions for Multilingual LLMs
Taisei Enomoto, Hwichan Kim, Zhousi Chen et al.
Evaluating LLMs for Quotation Attribution in Literary Texts: A Case Study of LLaMa3
Gaspard Michel, Elena V. Epure, Romain Hennequin et al.
IdentifyMe: A Challenging Long-Context Mention Resolution Benchmark for LLMs
Kawshik Manikantan, Makarand Tapaswi, Vineet Gandhi et al.
Sociodemographic Prompting is Not Yet an Effective Approach for Simulating Subjective Judgments with LLMs
Huaman Sun, Jiaxin Pei, Minje Choi et al.
RuleR: Improving LLM Controllability by Rule-based Data Recycling
Ming Li, Han Chen, Chenguang Wang et al.
How LLMs React to Industrial Spatio-Temporal Data? Assessing Hallucination with a Novel Traffic Incident Benchmark Dataset
Qiang Li, Mingkun Tan, Xun Zhao et al.
Learning LLM Preference over Intra-Dialogue Pairs: A Framework for Utterance-level Understandings
Xuanqing Liu, Luyang Kong, Wei Niu et al.
Enhancing Function-Calling Capabilities in LLMs: Strategies for Prompt Formats, Data Integration, and Multilingual Translation
Yi-Chang Chen, Po-Chun Hsu, Chan-Jan Hsu et al.
Challenges and Remedies of Domain-Specific Classifiers as LLM Guardrails: Self-Harm as a Case Study
Bing Zhang, Guang-Jie Ren
Efficient Continual Pre-training of LLMs for Low-resource Languages
Arijit Nag, Soumen Chakrabarti, Animesh Mukherjee et al.
QueryShield: A Platform to Mitigate Enterprise Data Leakage in Queries to External LLMs
Nitin Ramrakhiyani, Delton Myalil, Sachin Pawar et al.
RevieWeaver: Weaving Together Review Insights by Leveraging LLMs and Semantic Similarity
Jiban Adhikary, Mohammad Alqudah, Arun Palghat Udayashankar
SweEval: Do LLMs Really Swear? A Safety Benchmark for Testing Limits for Enterprise Use
Hitesh Laxmichand Patel, Amit Agarwal, Arion Das et al.
Granite Guardian: Comprehensive LLM Safeguarding
Inkit Padhi, Manish Nagireddy, Giandomenico Cornacchia et al.
Break-Ideate-Generate (BrIdGe): Moving beyond Translations for Localization using LLMs
Swapnil Gupta, Lucas Pereira Carlini, Prateek Sircar et al.
Evaluating Bias in LLMs for Job-Resume Matching: Gender, Race, and Education
Hayate Iso, Pouya Pezeshkpour, Nikita Bhutani et al.
PLEX: Adaptive Parameter-Efficient Fine-Tuning for Code LLMs using Lottery-Tickets
Jaeseong Lee, Hojae Han, Jongyoon Kim et al.
LLM Safety for Children
Prasanjit Rath, Hari Shrawgi, Parag Agrawal et al.
Distill-C: Enhanced NL2SQL via Distilled Customization with LLMs
Cong Duy Vu Hoang, Gioacchino Tangari, Clemence Lanfranchi et al.
Chatbot Arena Estimate: towards a generalized performance benchmark for LLM capabilities
Lucas Spangher, Tianle Li, William F. Arnold et al.
Developing Japanese CLIP Models Leveraging an Open-weight LLM for Large-scale Dataset Translation
Issa Sugiura, Shuhei Kurita, Yusuke Oda et al.
Integrating Symbolic Execution into the Fine-Tuning of Code-Generating LLMs
Marina Sakharova, Abhinav Anand, Mira Mezini
(CPER) From Guessing to Asking: An Approach to Resolving Persona Knowledge Gap in LLMs during Multi-Turn Conversations
Sarvesh Baskar, Manas Gaur, Srinivasan Parthasarathy et al.