Papers
5,479 papers found
Position Paper: How Should We Responsibly Adopt LLMs in the Peer Review Process?
Juhwan Choi, JungMin Yun, Changhun Kim et al.
Continual Pretraining on Encrypted Synthetic Data for Privacy-Preserving LLMs
Honghao Liu, Xuhui Jiang, Chengjin Xu et al.
Do Diacritics Matter? Evaluating the Impact of Arabic Diacritics on Tokenization and LLM Benchmarks
Go Inoue, Bashar Alhafni, Nizar Habash et al.
VortexPIA: Indirect Prompt Injection Attack against LLMs for Efficient Extraction of User Privacy
Yu Cui, Sicheng Pan, Yifei Liu et al.
Skill Discovery for Software Scripting Automation via Offline Simulations with LLMs
Paiheng Xu, Gang Wu, Xiang Chen et al.
Shifting Perspectives: Steering Vectors for Robust Bias Mitigation in LLMs
Zara Siddique, Irtaza Khalid, Liam Turner et al.
Harmful Factuality: LLMs Correcting What They Shouldn’t
Mingchen Li, Hanzhi Zhang, Heng Fan et al.
Toward Beginner-Friendly LLMs for Language Learning: Controlling Difficulty in Conversation
Meiqing Jin, Liam Dugan, Chris Callison-Burch
CodeGuard: Improving LLM Guardrails in CS Education
Nishat Raihan, Noah Erdachew, Jayoti Devi et al.
ATOM: AdapTive and OptiMized dynamic temporal knowledge graph construction using LLMs
Yassir Lairgi, Ludovic Moncla, Khalid Benabdeslem et al.
Where do LLMs currently stand on biomedical NER in both clean and noisy settings ?
Christophe Ye, Cassie S. Mitchell
The Unintended Trade-off of AI Alignment: Balancing Hallucination Mitigation and Safety in LLMs
Omar Mahmoud, Ali Khalil, Thommen George Karimpanal et al.
The Model’s Language Matters: A Comparative Privacy Analysis of LLMs
Abhishek Kumar Mishra, Antoine Boutet, Lucas Magnana
LLMs Faithfully and Iteratively Compute Answers During CoT: A Systematic Analysis With Multi-step Arithmetics
Keito Kudo, Yoichi Aoki, Tatsuki Kuribayashi et al.
Intention-Adaptive LLM Fine-Tuning for Text Revision Generation
Zhexiong Liu, Diane Litman
Don’t Judge Code by Its Cover: Exploring Biases in LLM Judges for Code Evaluation
Jiwon Moon, Yerin Hwang, Dongryeol Lee et al.
CrowdSelect: SyntheticInstruction Data Selection with Multi-LLM Wisdom
Yisen Li, Lingfeng Yang, Wenxuan Shen et al.
Breaking the Illusion of Reasoning in Polish LLMs: Quality over Quantity of Thought
Dzmitry Pihulski, Mikołaj Langner, Jan Eliasz et al.
WebNovelBench: Placing LLM Novelists on the Web Novel Distribution
Liangtao Lin, Jun Zheng, Haidong Wang
Feature Drift: How Fine-Tuning Repurposes Representations in LLMs
Andrey V. Galichin, Anton Korznikov, Alexey Dontsov et al.
MEDAL: A Framework for Benchmarking LLMs as Multilingual Open-Domain Dialogue Evaluators
John Mendonça, Alon Lavie, Isabel Trancoso
Foundations of LLM Knowledge Materialization: Termination, Reproducibility, Robustness
Luca Giordano, Simon Razniewski
Bias in the East, Bias in the West: A Bilingual Analysis of LLM Political Bias on U.S.- and China-Related Issues
Ying Ying Lim, Paul Röttger
A Simple and Efficient Learning-Style Prompting for LLM Jailbreaking
Xuan Luo, Yue Wang, Zefeng He et al.
Aggregating Crowd of LLMs for Cost-Effective Data Annotation
Jiacheng Liu, Xiaofeng Hou