Papers
246 papers found
Aligning LLMs with Individual Preferences via Interaction
Shujin Wu, Yi R. Fung, Cheng Qian et al.
Co-Evolving LLMs and Embedding Models via Density-Guided Preference Optimization for Text Clustering
Zetong Li, Qinliang Su, Minhua Huang et al.
Training LLMs for Optimization Modeling via Iterative Data Synthesis and Structured Validation
Yang Wu, Yifan Zhang, Yurong Wu et al.
Probe Pruning: Accelerating LLMs through Dynamic Pruning via Model-Probing
Qi Le, Enmao Diao, Ziyan Wang et al.
Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning
Yuheng Zhang, Dian Yu, Baolin Peng et al.
Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts
Guorui Zheng, Xidong Wang, Juhao Liang et al.
From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions
Changle Qu, Sunhao Dai, Xiaochi Wei et al.
Forewarned is Forearmed: Harnessing LLMs for Data Synthesis via Failure-induced Exploration
Qintong Li, Jiahui Gao, Sheng Wang et al.
It Helps to Take a Second Opinion: Teaching Smaller LLMs To Deliberate Mutually via Selective Rationale Optimisation
Sohan Patnaik, Milan Aggarwal, Sumit Bhatia et al.
Auto-Cypher: Improving LLMs on Cypher generation via LLM-supervised generation-verification framework
Aman Tiwari, Shiva Krishna Reddy Malay, Vikas Yadav et al.
Distill-C: Enhanced NL2SQL via Distilled Customization with LLMs
Cong Duy Vu Hoang, Gioacchino Tangari, Clemence Lanfranchi et al.
Skill Discovery for Software Scripting Automation via Offline Simulations with LLMs
Paiheng Xu, Gang Wu, Xiang Chen et al.
AD-FM: Multimodal LLMs for Anomaly Detection via Multi-Stage Reasoning and Fine-Grained Reward Optimization
Jingyi Liao, Yongyi Su, Rong-Cheng Tu et al.
Light-IF: Endowing LLMs with Generalizable Reasoning via Preview and Self-Checking for Complex Instruction Following
Chenyang Wang, Liang Wen, Shousheng Jia et al.
SocialGPT: Prompting LLMs for Social Relation Reasoning via Greedy Segment Optimization
Wanhua Li, Zibin Meng, Jiawei Zhou et al.
ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based Evaluation
Jingnan Zheng, Han Wang, An Zhang et al.
CALF: Aligning LLMs for Time Series Forecasting via Cross-modal Fine-Tuning
Peiyuan Liu, Hang Guo, Tao Dai et al.
Putting People in LLMs’ Shoes: Generating Better Answers via Question Rewriter
Junhao Chen, Bowen Wang, Zhouqiang Jiang et al.
JailPO: A Novel Black-Box Jailbreak Framework via Preference Optimization Against Aligned LLMs
Hongyi Li, Jiawei Ye, Jie Wu et al.
LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression
Huiqiang Jiang, Qianhui Wu, Xufang Luo et al.
LLMs Can Simulate Standardized Patients via Agent Coevolution
Zhuoyun Du, Lujie Zheng, Renjun Hu et al.
IOPO: Empowering LLMs with Complex Instruction Following via Input-Output Preference Optimization
Xinghua Zhang, Haiyang Yu, Cheng Fu et al.
Intent-Aware and Hate-Mitigating Counterspeech Generation via Dual-Discriminator Guided LLMs
Haiyang Wang, Zhiliang Tian, Xin Song et al.
Teaching LLMs to Abstain across Languages via Multilingual Feedback
Shangbin Feng, Weijia Shi, Yike Wang et al.