Research Explorer

Scissorhands: Exploiting the Persistence of Importance Hypothesis for LLM KV Cache Compression at Test Time

Zichang Liu, Aditya Desai, Fangshuo Liao et al.

2023 NIPS

SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs

Lijun Yu, Yong Cheng, Zhiruo Wang et al.

2023 NIPS

D4: Improving LLM Pretraining via Document De-Duplication and Diversification

Kushal Tirumala, Daniel Simig, Armen Aghajanyan et al.

2023 NIPS

Joint Prompt Optimization of Stacked LLMs using Variational Inference

Alessandro Sordoni, Eric Yuan, Marc-Alexandre Côté et al.

2023 NIPS

Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evaluations

Lifan Yuan, Yangyi Chen, Ganqu Cui et al.

2023 NIPS

To Repeat or Not To Repeat: Insights from Scaling LLM under Token-Crisis

Fuzhao Xue, Yao Fu, Wangchunshu Zhou et al.

2023 NIPS

The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data Only

Guilherme Penedo, Quentin Malartic, Daniel Hesslow et al.

2023 NIPS

Jailbroken: How Does LLM Safety Training Fail?

Alexander Wei, Nika Haghtalab, Jacob Steinhardt

2023 NIPS

Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs

Xuan Zhang, Chao Du, Tianyu Pang et al.

2024 NIPS

AutoManual: Constructing Instruction Manuals by LLM Agents via Interactive Environmental Learning

Minghao Chen, Yihang Li, Yanting Yang et al.

2024 NIPS

Open LLMs are Necessary for Current Private Adaptations and Outperform their Closed Alternatives

Vincent Hanke, Tom Blanchard, Franziska Boenisch et al.

2024 NIPS

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Coleman Hooper, Sehoon Kim, Hiva Mohammadzadeh et al.

2024 NIPS

Efficient Adversarial Training in LLMs with Continuous Attacks

Sophie Xhonneux, Alessandro Sordoni, Stephan Günnemann et al.

2024 NIPS

D-LLM: A Token Adaptive Computing Resource Allocation Strategy for Large Language Models

Yikun Jiang, Huanyu Wang, Lei Xie et al.

2024 NIPS

SocialGPT: Prompting LLMs for Social Relation Reasoning via Greedy Segment Optimization

Wanhua Li, Zibin Meng, Jiawei Zhou et al.

2024 NIPS

ReMoDetect: Reward Models Recognize Aligned LLM's Generations

Hyunseok Lee, Jihoon Tack, Jinwoo Shin

2024 NIPS

QBB: Quantization with Binary Bases for LLMs

Adrian Bulat, Yassine Ouali, Georgios Tzimiropoulos

2024 NIPS

Star-Agents: Automatic Data Optimization with LLM Agents for Instruction Tuning

Hang Zhou, Yehui Tang, Haochen Qin et al.

2024 NIPS

Building on Efficient Foundations: Effective Training of LLMs with Structured Feedforward Layers

Xiuying Wei, Skander Moalla, Razvan Pascanu et al.

2024 NIPS

IRCAN: Mitigating Knowledge Conflicts in LLM Generation via Identifying and Reweighting Context-Aware Neurons

Dan Shi, Renren Jin, Tianhao Shen et al.

2024 NIPS

PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression

Vladimir Malinovskii, Denis Mazur, Ivan Ilin et al.

2024 NIPS

AGILE: A Novel Reinforcement Learning Framework of LLM Agents

Peiyuan Feng, Yichen He, Guanhua Huang et al.

2024 NIPS

WildGuard: Open One-stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Seungju Han, Kavel Rao, Allyson Ettinger et al.

2024 NIPS

SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training

Jinda Jia, Cong Xie, Hanlin Lu et al.

2024 NIPS

Kernel Language Entropy: Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities

Alexander Nikitin, Jannik Kossen, Yarin Gal et al.

2024 NIPS

Papers