Papers
LLM Circuit Analyses Are Consistent Across Training and Scale
Curt Tigges, Michael Hanna, Qinan Yu et al.
Verified Code Transpilation with LLMs
Sahil Bhatia, Jie Qiu, Niranjan Hasabnis et al.
Exploiting LLM Quantization
Kazuki Egashira, Mark Vero, Robin Staab et al.
RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold
Amrith Setlur, Saurabh Garg, Xinyang (Young) Geng et al.
Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM Performance and Generalization
Mucong Ding, Chenghao Deng, Jocelyn Choo et al.
Is Programming by Example Solved by LLMs?
Wen-Ding Li, Kevin Ellis
Are More LLM Calls All You Need? Towards the Scaling Properties of Compound AI Systems
Lingjiao Chen, Jared Davis, Boris Hanin et al.
LLMs Can Evolve Continually on Modality for $\mathbb{X}$-Modal Reasoning
Jiazuo Yu, Haomiao Xiong, Lu Zhang et al.
CTIBench: A Benchmark for Evaluating LLMs in Cyber Threat Intelligence
Md Tanvirul Alam, Dipkamal Bhusal, Le Nguyen et al.
MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention
Huiqiang Jiang, Yucheng Li, Chengruidong Zhang et al.
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing
Ye Tian, Baolin Peng, Linfeng Song et al.
EAI: Emotional Decision-Making of LLMs in Strategic Games and Ethical Dilemmas
Mikhail Mozikov, Nikita Severin, Valeria Bodishtianu et al.
Can LLMs Implicitly Learn Numeric Parameter Constraints in Data Science APIs?
Yinlin Deng, Chunqiu Steven Xia, Zhezhen Cao et al.
DALD: Improving Logits-based Detector without Logits from Black-box LLMs
Cong Zeng, Shengkun Tang, Xianjun Yang et al.
Rethinking LLM Memorization through the Lens of Adversarial Compression
Avi Schwarzschild, Zhili Feng, Pratyush Maini et al.
Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing
Hao Fei, Shengqiong Wu, Hanwang Zhang et al.
NYU CTF Bench: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive Security
Minghao Shao, Sofija Jancheska, Meet Udeshi et al.
To Believe or Not to Believe Your LLM: Iterative Prompting for Estimating Epistemic Uncertainty
Yasin Abbasi Yadkori, Ilja Kuzborskij, András György et al.
CLAVE: An Adaptive Framework for Evaluating Values of LLM Generated Responses
Jing Yao, Xiaoyuan Yi, Xing Xie
Efficient LLM Scheduling by Learning to Rank
Yichao Fu, Siqi Zhu, Runlong Su et al.
S$^{2}$FT: Efficient, Scalable and Generalizable LLM Fine-tuning by Structured Sparsity
Xinyu Yang, Jixuan Leng, Geyang Guo et al.
Tree of Attacks: Jailbreaking Black-Box LLMs Automatically
Anay Mehrotra, Manolis Zampetakis, Paul Kassianik et al.
Make Your LLM Fully Utilize the Context
Shengnan An, Zexiong Ma, Zeqi Lin et al.
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs
Rui Yang, Ruomeng Ding, Yong Lin et al.
The ALCHEmist: Automated Labeling 500x CHEaper than LLM Data Annotators
Tzu-Heng Huang, Catherine Cao, Vaishnavi Bhargava et al.