Papers
AgentPro: Enhancing LLM Agents with Automated Process Supervision
Yuchen Deng, Shichen Fan, Naibo Wang et al.
Learn and Unlearn: Addressing Misinformation in Multilingual LLMs
TaiMing Lu, Philipp Koehn
PRISM: Efficient Long-Range Reasoning With Short-Context LLMs
Dulhan Jayalath, James Bradley Wendt, Nicholas Monath et al.
Primus: A Pioneering Collection of Open-Source Datasets for Cybersecurity LLM Training
Yao-Ching Yu, Tsun-Han Chiang, Cheng-Wei Tsai et al.
Bit-Flip Error Resilience in LLMs: A Comprehensive Analysis and Defense Framework
Yuhang Chen, Zhen Tan, Ajay Kumar Jaiswal et al.
Calibrating LLM Confidence by Probing Perturbed Representation Stability
Reza Khanmohammadi, Erfan Miahi, Mehrsa Mardikoraem et al.
CIFLEX: Contextual Instruction Flow for Sub-task Execution in Multi-Turn Interactions with a Single On-Device LLM
Juntae Lee, Jihwan Bang, Seunghan Yang et al.
Latent Inter-User Difference Modeling for LLM Personalization
Yilun Qiu, Tianhao Shi, Xiaoyan Zhao et al.
SelfRACG: Enabling LLMs to Self-Express and Retrieve for Code Generation
Qian Dong, Jia Chen, Qingyao Ai et al.
AdamS: Momentum Itself Can Be A Normalizer for LLM Pretraining and Post-training
Huishuai Zhang, Bohan Wang, Luoxin Chen
Demystifying Synthetic Data in LLM Pre-training: A Systematic Study of Scaling Laws, Benefits, and Pitfalls
Feiyang Kang, Newsha Ardalani, Michael Kuchnik et al.
From Scores to Steps: Diagnosing and Improving LLM Performance in Evidence-Based Medical Calculations
Benlu Wang, Iris Xia, Yifan Zhang et al.
Bridging External and Parametric Knowledge: Mitigating Hallucination of LLMs with Shared-Private Semantic Synergy in Dual-Stream Knowledge
Yi Sui, Chaozhuo Li, Chen Zhang et al.
Identifying Unlearned Data in LLMs via Membership Inference Attacks
Advit Deepak, Megan Mou, Jing Huang et al.
LLMs cannot spot math errors, even when allowed to peek into the solution
Kv Aditya Srivatsa, Kaushal Kumar Maurya, Ekaterina Kochmar
Can LLMs be Good Graph Judge for Knowledge Graph Construction?
Haoyu Huang, Chong Chen, Zeang Sheng et al.
NileChat: Towards Linguistically Diverse and Culturally Aware LLMs for Local Communities
Abdellah El Mekki, Houdaifa Atou, Omer Nacar et al.
Collaborative Beam Search: Enhancing LLM Reasoning via Collective Consensus
Yangyifan Xu, Shuo Ren, Jiajun Zhang
Stimulate the Critical Thinking of LLMs via Debiasing Discussion
Ruiyu Xiao, Lei Wu, Yuanxing Liu et al.
Polysemantic Dropout: Conformal OOD Detection for Specialized LLMs
Ayush Gupta, Ramneet Kaur, Anirban Roy et al.
Facilitating Cognitive Accessibility with LLMs: A Multi-Task Approach to Easy-to-Read Text Generation
François Ledoyen, Gaël Dias, Jeremie Pantin et al.
Toxicity Red-Teaming: Benchmarking LLM Safety in Singapore’s Low-Resource Languages
Yujia Hu, Ming Shan Hee, Preslav Nakov et al.
Self-Augmented Preference Alignment for Sycophancy Reduction in LLMs
Chien Hung Chen, Hen-Hsen Huang, Hsin-Hsi Chen
Towards Advanced Mathematical Reasoning for LLMs via First-Order Logic Theorem Proving
Chuxue Cao, Mengze Li, Juntao Dai et al.
CityEQA: A Hierarchical LLM Agent on Embodied Question Answering Benchmark in City Space
Yong Zhao, Kai Xu, Zhengqiu Zhu et al.