Research Explorer

LLM Circuit Analyses Are Consistent Across Training and Scale

Curt Tigges, Michael Hanna, Qinan Yu et al.

2024 NIPS

Verified Code Transpilation with LLMs

Sahil Bhatia, Jie Qiu, Niranjan Hasabnis et al.

2024 NIPS

Exploiting LLM Quantization

Kazuki Egashira, Mark Vero, Robin Staab et al.

2024 NIPS

RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold

Amrith Setlur, Saurabh Garg, Xinyang (Young) Geng et al.

2024 NIPS

Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM Performance and Generalization

Mucong Ding, Chenghao Deng, Jocelyn Choo et al.

2024 NIPS

Is Programming by Example Solved by LLMs?

Wen-Ding Li, Kevin Ellis

2024 NIPS

Are More LLM Calls All You Need? Towards the Scaling Properties of Compound AI Systems

Lingjiao Chen, Jared Davis, Boris Hanin et al.

2024 NIPS

LLMs Can Evolve Continually on Modality for $\mathbb{X}$-Modal Reasoning

Jiazuo Yu, Haomiao Xiong, Lu Zhang et al.

2024 NIPS

CTIBench: A Benchmark for Evaluating LLMs in Cyber Threat Intelligence

Md Tanvirul Alam, Dipkamal Bhusal, Le Nguyen et al.

2024 NIPS

MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention

Huiqiang Jiang, Yucheng Li, Chengruidong Zhang et al.

2024 NIPS

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

Ye Tian, Baolin Peng, Linfeng Song et al.

2024 NIPS

EAI: Emotional Decision-Making of LLMs in Strategic Games and Ethical Dilemmas

Mikhail Mozikov, Nikita Severin, Valeria Bodishtianu et al.

2024 NIPS

Can LLMs Implicitly Learn Numeric Parameter Constraints in Data Science APIs?

Yinlin Deng, Chunqiu Steven Xia, Zhezhen Cao et al.

2024 NIPS

DALD: Improving Logits-based Detector without Logits from Black-box LLMs

Cong Zeng, Shengkun Tang, Xianjun Yang et al.

2024 NIPS

Rethinking LLM Memorization through the Lens of Adversarial Compression

Avi Schwarzschild, Zhili Feng, Pratyush Maini et al.

2024 NIPS

Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing

Hao Fei, Shengqiong Wu, Hanwang Zhang et al.

2024 NIPS

NYU CTF Bench: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive Security

Minghao Shao, Sofija Jancheska, Meet Udeshi et al.

2024 NIPS

To Believe or Not to Believe Your LLM: Iterative Prompting for Estimating Epistemic Uncertainty

Yasin Abbasi Yadkori, Ilja Kuzborskij, András György et al.

2024 NIPS

CLAVE: An Adaptive Framework for Evaluating Values of LLM Generated Responses

Jing Yao, Xiaoyuan Yi, Xing Xie

2024 NIPS

Efficient LLM Scheduling by Learning to Rank

Yichao Fu, Siqi Zhu, Runlong Su et al.

2024 NIPS

S$^{2}$FT: Efficient, Scalable and Generalizable LLM Fine-tuning by Structured Sparsity

Xinyu Yang, Jixuan Leng, Geyang Guo et al.

2024 NIPS

Tree of Attacks: Jailbreaking Black-Box LLMs Automatically

Anay Mehrotra, Manolis Zampetakis, Paul Kassianik et al.

2024 NIPS

Make Your LLM Fully Utilize the Context

Shengnan An, Zexiong Ma, Zeqi Lin et al.

2024 NIPS

Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs

Rui Yang, Ruomeng Ding, Yong Lin et al.

2024 NIPS

The ALCHEmist: Automated Labeling 500x CHEaper than LLM Data Annotators

Tzu-Heng Huang, Catherine Cao, Vaishnavi Bhargava et al.

2024 NIPS

Papers