Papers
RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs
Yue Yu, Wei Ping, Zihan Liu et al.
SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types
Yutao Mou, Shikun Zhang, Wei Ye
LLM Dataset Inference: Did you train on my dataset?
Pratyush Maini, Hengrui Jia, Nicolas Papernot et al.
Crafting Interpretable Embeddings for Language Neuroscience by Asking LLMs Questions
Vinamra Benara, Chandan Singh, John X. Morris et al.
Getting More Juice Out of the SFT Data: Reward Learning from Human Demonstration Improves SFT for LLM Alignment
Jiaxiang Li, Siliang Zeng, Hoi-To Wai et al.
Large Language Models as Urban Residents: An LLM Agent Framework for Personal Mobility Generation
Jiawei Wang, Renhe Jiang, Chuang Yang et al.
EHRNoteQA: An LLM Benchmark for Real-World Clinical Practice Using Discharge Summaries
Sunjun Kweon, Jiyoun Kim, Heeyoung Kwak et al.
Self-playing Adversarial Language Game Enhances LLM Reasoning
Pengyu Cheng, Yong Dai, Tianhao Hu et al.
STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases
Shirley Wu, Shiyu Zhao, Michihiro Yasunaga et al.
GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing
Zhenyu Wang, Aoxue Li, Zhenguo Li et al.
AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases
Zhaorun Chen, Zhen Xiang, Chaowei Xiao et al.
Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs
Mustafa Shukor, Matthieu Cord
GREATS: Online Selection of High-Quality Data for LLM Training in Every Iteration
Jiachen T. Wang, Tong Wu, Dawn Song et al.
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts
Jiachen Li, Xinyao Wang, Sijie Zhu et al.
LLMDFA: Analyzing Dataflow in Code with Large Language Models
Chengpeng Wang, Wuqi Zhang, Zian Su et al.
Compositional 3D-aware Video Generation with LLM Director
Hanxin Zhu, Tianyu He, Anni Tang et al.
Enhancing LLM’s Cognition via Structurization
Kai Liu, Zhihang Fu, Chao Chen et al.
Detecting Bugs with Substantial Monetary Consequences by LLM and Rule-based Reasoning
Brian Zhang, Zhuo Zhang
Can LLMs Solve Molecule Puzzles? A Multimodal Benchmark for Molecular Structure Elucidation
Kehan Guo, Bozhao Nan, Yujun Zhou et al.
Aligning LLM Agents by Learning Latent Preference from User Edits
Ge Gao, Alexey Taymanov, Eduardo Salinas et al.
FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making
Yangyang Yu, Zhiyuan Yao, Haohang Li et al.
Truth is Universal: Robust Detection of Lies in LLMs
Lennart Bürger, Fred A. Hamprecht, Boaz Nadler
No Free Lunch in LLM Watermarking: Trade-offs in Watermarking Design Choices
Qi Pang, Shengyuan Hu, Wenting Zheng et al.
Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data
Johannes Treutlein, Dami Choi, Jan Betley et al.
CLUES: Collaborative Private-domain High-quality Data Selection for LLMs via Training Dynamics
Wanru Zhao, Hongxiang Fan, Shell Xu Hu et al.