Papers
5,479 papers found
Soft Prompt Threats: Attacking Safety Alignment and Unlearning in Open-Source LLMs through the Embedding Space
Leo Schwinn, David Dobre, Sophie Xhonneux et al.
LoFiT: Localized Fine-tuning on LLM Representations
Fangcong Yin, Xi Ye, Greg Durrett
PrivAuditor: Benchmarking Data Protection Vulnerabilities in LLM Adaptation Techniques
Derui Zhu, Dingfan Chen, Xiongfei Wu et al.
Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training
Wenyu Du, Tongxu Luo, Zihan Qiu et al.
PertEval: Unveiling Real Knowledge Capacity of LLMs with Knowledge-Invariant Perturbations
Jiatong Li, Renjun Hu, Kunzhe Huang et al.
Kangaroo: Lossless Self-Speculative Decoding for Accelerating LLMs via Double Early Exiting
Fangcheng Liu, Yehui Tang, Zhenhua Liu et al.
UniTox: Leveraging LLMs to Curate a Unified Dataset of Drug-Induced Toxicity from FDA Labels
Jake Silberg, Kyle Swanson, Elana Simon et al.
Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference
Jiabao Ji, Yujian Liu, Yang Zhang et al.
$\texttt{Model-GLUE}$: Democratized LLM Scaling for A Large Model Zoo in the Wild
Xinyu Zhao, Guoheng Sun, Ruisi Cai et al.
Alignment at Pre-training! Towards Native Alignment for Arabic LLMs
Juhao Liang, Zhenyang Cai, Jianqing Zhu et al.
Benchmarking LLMs via Uncertainty Quantification
Fanghua Ye, Mingming Yang, Jianhui Pang et al.
Coevolving with the Other You: Fine-Tuning LLM with Sequential Cooperative Multi-Agent Reinforcement Learning
Hao Ma, Tianyi Hu, Zhiqiang Pu et al.
HYSYNTH: Context-Free LLM Approximation for Guiding Program Synthesis
Shraddha Barke, Emmanuel Anaya Gonzalez, Saketh Ram Kasibatla et al.
Enhancing Multiple Dimensions of Trustworthiness in LLMs via Sparse Activation Control
Yuxin Xiao, Chaoqun Wan, Yonggang Zhang et al.
DPIC: Decoupling Prompt and Intrinsic Characteristics for LLM Generated Text Detection
Xiao Yu, Yuang Qi, Kejiang Chen et al.
SpecExec: Massively Parallel Speculative Decoding For Interactive LLM Inference on Consumer Devices
Ruslan Svirschevski, Avner May, Zhuoming Chen et al.
Ad Auctions for LLMs via Retrieval Augmented Generation
MohammadTaghi Hajiaghayi, Sébastien Lahaie, Keivan Rezaei et al.
WaterMax: breaking the LLM watermark detectability-robustness-quality trade-off
Eva Giboulot, Teddy Furon
Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving
Aniket Didolkar, Anirudh Goyal, Nan Rosemary Ke et al.
Active Learning with LLMs for Partially Observed and Cost-Aware Scenarios
Nicolás Astorga, Tennison Liu, Nabeel Seedat et al.
Efficient multi-prompt evaluation of LLMs
Felipe Maia Polo, Ronald Xu, Lucas Weber et al.
SnapKV: LLM Knows What You are Looking for Before Generation
Yuhong Li, Yingbing Huang, Bowen Yang et al.
Efficient LLM Jailbreak via Adaptive Dense-to-sparse Constrained Optimization
Kai Hu, Weichen Yu, Yining Li et al.
Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs
Abhimanyu Hans, Yuxin Wen, Neel Jain et al.
SIRIUS : Contexual Sparisty with Correction for Efficient LLMs
Yang Zhou, Zhuoming Chen, Zhaozhuo Xu et al.