Papers
16,749 papers found
Awes, Laws, and Flaws From Today’s LLM Research
Adrian de Wynter
AXIS: Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents
Junting Lu, Zhiyang Zhang, Fangkai Yang et al.
BabelEdits: A Benchmark and a Modular Approach for Robust Cross-lingual Knowledge Editing of Large Language Models
Tommaso Green, Félix Gaschi, Fabian David Schmidt et al.
BabyLM’s First Words: Word Segmentation as a Phonological Probing Task
Zébulon Goriely, Paula Buttery
BadWindtunnel: Defending Backdoor in High-noise Simulated Training with Confidence Variance
Ruyi Zhang, Songlei Jian, Yusong Tan et al.
Balancing Diversity and Risk in LLM Sampling: How to Select Your Method and Parameter for Open-Ended Text Generation
Yuxuan Zhou, Margret Keuper, Mario Fritz
Balancing the Budget: Understanding Trade-offs Between Supervised and Preference-Based Finetuning
Mohit Raghavendra, Junmo Kang, Alan Ritter
Bandit-Based Prompt Design Strategy Selection Improves Prompt Optimizers
Rin Ashizawa, Yoichi Hirose, Nozomu Yoshinari et al.
BanStereoSet: A Dataset to Measure Stereotypical Social Biases in LLMs for Bangla
Mahammed Kamruzzaman, Abdullah Al Monsur, Shrabon Kumar Das et al.
BAR: A Backward Reasoning based Agent for Complex Minecraft Tasks
Weihong Du, Wenrui Liao, Binyu Yan et al.
BARTABSA++: Revisiting BARTABSA with Decoder LLMs
Jan Pfister, Tom Völker, Anton Vlasjuk et al.
Basic Reading Distillation
Zhi Zhou, Sirui Miao, Xiangyu Duan et al.
Batayan: A Filipino NLP benchmark for evaluating Large Language Models
Jann Railey Montalan, Jimson Paulo Layacan, David Demitri Africa et al.
Battling against Tough Resister: Strategy Planning with Adversarial Game for Non-collaborative Dialogues
Haiyang Wang, Zhiliang Tian, Yuchen Pan et al.
Bayesian Optimization for Controlled Image Editing via LLMs
Chengkun Cai, Haoliang Liu, Xu Zhao et al.
BayesKD: Bayesian Knowledge Distillation for Compact LLMs in Constrained Fine-tuning Scenarios
Wei Li, Lujun Li, Mark G. Lee et al.
bbStar at SemEval-2025 Task 10: Improving Narrative Classification and Explanation via Fine Tuned Language Models
Rishit Tyagi, Rahul Bouri, Mohit Gupta
BD at BEA 2025 Shared Task: MPNet Ensembles for Pedagogical Mistake Identification and Localization in AI Tutor Responses
Shadman Rohan, Ishita Sur Apan, Muhtasim Ibteda Shochcho et al.
BDA-UC3M @ BioLaySumm: Efficient Lay Summarization with Small-Scale SoTA LLMs
Ilyass Ramzi, Isabel Bedmar
bea-jh at BEA 2025 Shared Task: Evaluating AI-powered Tutors through Pedagogically-Informed Reasoning
Jihyeon Roh, Jinhyun Bang
BeamLoRA: Beam-Constraint Low-Rank Adaptation
Naibin Gu, Zhenyu Zhang, Xiyu Liu et al.
BeaverTalk: Oregon State University’s IWSLT 2025 Simultaneous Speech Translation System
Matthew Raffel, Victor Agostinelli III, Lizhong Chen
Be Cautious When Merging Unfamiliar LLMs: A Phishing Model Capable of Stealing Privacy
Guo Zhenyuan, Yi Shi, Wenlong Meng et al.
BEDAA: Bayesian Enhanced DeBERTa for Uncertainty-Aware Authorship Attribution
Iqra Zahid, Youcheng Sun, Riza Batista-Navarro
Behavioral Analysis of Information Salience in Large Language Models
Jan Trienes, Jörg Schlötterer, Junyi Jessy Li et al.