Papers
CLAVE: An Adaptive Framework for Evaluating Values of LLM Generated Responses
Jing Yao, Xiaoyuan Yi, Xing Xie
Efficient LLM Scheduling by Learning to Rank
Yichao Fu, Siqi Zhu, Runlong Su et al.
S$^{2}$FT: Efficient, Scalable and Generalizable LLM Fine-tuning by Structured Sparsity
Xinyu Yang, Jixuan Leng, Geyang Guo et al.
Tree of Attacks: Jailbreaking Black-Box LLMs Automatically
Anay Mehrotra, Manolis Zampetakis, Paul Kassianik et al.
Make Your LLM Fully Utilize the Context
Shengnan An, Zexiong Ma, Zeqi Lin et al.
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs
Rui Yang, Ruomeng Ding, Yong Lin et al.
The ALCHEmist: Automated Labeling 500x CHEaper than LLM Data Annotators
Tzu-Heng Huang, Catherine Cao, Vaishnavi Bhargava et al.
Efficient Multi-task LLM Quantization and Serving for Multiple LoRA Adapters
Yifei Xia, Fangcheng Fu, Wentao Zhang et al.
Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs
Rudolf Laine, Bilal Chughtai, Jan Betley et al.
Memory-Efficient LLM Training with Online Subspace Descent
Kaizhao Liang, Bo Liu, Lizhang Chen et al.
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search
Dan Zhang, Sining Zhoubian, Ziniu Hu et al.
GraphVis: Boosting LLMs with Visual Knowledge Graph Integration
Yihe Deng, Chenchen Ye, Zijie Huang et al.
Do LLMs dream of elephants (when told not to)? Latent concept association and associative memory in transformers
Yibo Jiang, Goutham Rajendran, Pradeep Ravikumar et al.
LLM Evaluators Recognize and Favor Their Own Generations
Arjun Panickssery, Samuel R. Bowman, Shi Feng
WorldCoder, a Model-Based LLM Agent: Building World Models by Writing Code and Interacting with the Environment
Hao Tang, Darren Key, Kevin Ellis
Can LLMs Learn by Teaching for Better Reasoning? A Preliminary Study
Xuefei Ning, Zifu Wang, Shiyao Li et al.
Trace is the Next AutoDiff: Generative Optimization with Rich Feedback, Execution Traces, and LLMs
Ching-An Cheng, Allen Nie, Adith Swaminathan
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length
Xuezhe Ma, Xiaomeng Yang, Wenhan Xiong et al.
DISP-LLM: Dimension-Independent Structural Pruning for Large Language Models
Shangqian Gao, Chi-Heng Lin, Ting Hua et al.
Enhancing Reasoning Capabilities of LLMs via Principled Synthetic Logic Corpus
Terufumi Morishita, Gaku Morio, Atsuki Yamaguchi et al.
AgentBoard: An Analytical Evaluation Board of Multi-turn LLM Agents
Chang Ma, Junlei Zhang, Zhihao Zhu et al.
Panacea: Pareto Alignment via Preference Adaptation for LLMs
Yifan Zhong, Chengdong Ma, Xiaoyuan Zhang et al.
Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight
Ziyuan Huang, Kaixiang Ji, Biao Gong et al.
BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages
Junho Myung, Nayeon Lee, Yi Zhou et al.
MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making
Yubin Kim, Chanwoo Park, Hyewon Jeong et al.