Papers
5,479 papers found
Assessing LLMs for Serendipity Discovery in Knowledge Graphs: A Case for Drug Repurposing
Mengying Wang, Chenhui Ma, Ao Jiao et al.
MSR-Rec: Multi-Step Reasoning-Enhanced LLM for Sequential Recommendation
Tuo Wang, Meng Jian, Ge Shi et al.
DMGIN: How Multimodal LLMs Enhance Large Recommendation Models for Lifelong User Post-click Behaviors
Zhuoxing Wei, Qingchen Xie, Qi Liu et al.
ICAD-LLM: One-for-All Anomaly Detection via In-Context Learning with Large Language Models
Zhongyuan Wu, Jingyuan Wang, Zexuan Cheng et al.
DiMA: Distinguishing Resident and Tourist Preferences via Multi-Modal LLM Alignment for Out-of-Town Cross-Domain Recommendation
Fan Zhang, Jinpeng Chen, Tao Wang et al.
Cross-Scale Collaboration between LLMs and Lightweight Sequential Recommenders with Domain-Specific Latent Reasoning
Yipeng Zhang, Xin Wang, Hong Chen et al.
Pricing Online LLM Services with Data-Calibrated Stackelberg Routing Game
Zhendong Guo, Wenchao Bai, Jiahui Jin
Hide and Seek with LLMs: An Adversarial Game for Sneaky Error Generation and Self-Improving Diagnosis
Rui Zou, Mengqi Wei, Yutao Zhu et al.
Reconstruction Attack-Resistant Inference Paradigm for LLM Cloud Services
Zipeng Ye, Wenjian Luo, Qi Zhou et al.
ELSPR: Evaluator LLM Training Data Self-Purification on Non-Transitive Preferences via Tournament Graph Reconstruction
Yan Yu, Yilun Liu, Minggui He et al.
Do We Truly Need So Many Samples? Multi-LLM Repeated Sampling Efficiently Scales Test-Time Compute
Jianhao Chen, Zishuo Xun, Bocheng Zhou et al.
OTARo: Once Tuning for All Precisions Toward Robust On-Device LLMs
Shaoyuan Chen, Zhixuan Chen, Dawei Yang et al.
Prune&Comp: Free Lunch for Layer-Pruned LLMs via Iterative Pruning with Magnitude Compensation
Xinrui Chen, Hongxing Zhang, Fanyi Zeng et al.
Combining LLM Semantic Reasoning with GNN Structural Modeling for Multi-View Multi-Label Feature Selection
Zhiqi Chen, Yuzhou Liu, Jiarui Liu et al.
MemoryART: Enhancing LLMs via Multi-Memory Models with Adaptive Resonance Theory for Healthcare Agents
Renke Dai, Hebin Hu, Jiahui Zhang et al.
Sliding-Window Merging for Compacting Patch-Redundant Layers in LLMs
Xuan Ding, Rui Sun, Yunjian Zhang et al.
Accelerating LLM Inference Throughput via Asynchronous KV Cache Prefetching
Yanhao Dong, Yubo Miao, Weinan Li et al.
TFRank: Think-Free Reasoning Enables Practical Pointwise LLM Ranking
Yongqi Fan, Xiaoyang Chen, Dezhi Ye et al.
The Semantic Architect: How FEAML Bridges Structured Data and LLMs for Multi-Label Tasks
Wanfu Gao, Zebin He, Jun Gao
FLRQ: Faster LLM Quantization with Flexible Low-Rank Matrix Sketching
Hongyaoxing Gu, Lijuan Hu, Shuzi Niu et al.
From Diagnosis to Generalization: A Cognitive Approach to Data Selection for Educational LLMs
Yuxiang Guo, Yan Zhuang, Qi Liu et al.
HALO: Hardware-Aware Quantization with Low Critical-Path-Delay Weights for LLM Acceleration
Rohan Juneja, Shivam Aggarwal, Safeen Huda et al.
FedP²EFT: Federated Learning to Personalize PEFT for Multilingual LLMs
Royson Lee, Minyoung Kim, Fady Rezk et al.
Sub-MoE: Efficient Mixture-of-Expert LLMs Compression via Subspace Expert Merging
Lujun Li, Qiyuan Zhu, Jiacheng Wang et al.
LLMC+: Benchmarking Vision-Language Model Compression with a plug-and-play Toolkit
Chengtao Lv, Bilang Zhang, Yang Yong et al.