Papers
Belief-State Query Policies for User-Aligned POMDPs
Daniel Bramblett, Siddharth Srivastava
Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs
Abhimanyu Hans, Yuxin Wen, Neel Jain et al.
BELM: Bidirectional Explicit Linear Multi-step Sampler for Exact Inversion in Diffusion Models
Fangyikang Wang, Hubery Yin, Yuejiang Dong et al.
Bench2Drive: Towards Multi-Ability Benchmarking of Closed-Loop End-To-End Autonomous Driving
Xiaosong Jia, Zhenjie Yang, Qifeng Li et al.
Benchmark Data Repositories for Better Benchmarking
Rachel Longjohn, Markelle Kelly, Sameer Singh et al.
Benchmarking Complex Instruction-Following with Multiple Constraints Composition
Bosi Wen, Pei Ke, Xiaotao Gu et al.
Benchmarking Counterfactual Image Generation
Thomas Melistas, Nikos Spyrou, Nefeli Gkouti et al.
Benchmarking Estimators for Natural Experiments: A Novel Dataset and a Doubly Robust Algorithm
R. Teal Witter, Christopher Musco
Benchmarking Generative Models on Computational Thinking Tests in Elementary Visual Programming
Victor-Alexandru Pădurean, Adish Singla
Benchmarking LLMs via Uncertainty Quantification
Fanghua Ye, Mingming Yang, Jianhui Pang et al.
Benchmarking Out-of-Distribution Generalization Capabilities of DNN-based Encoding Models for the Ventral Visual Cortex
Spandan Madan, Will Xiao, Mingran Cao et al.
Benchmarking PtO and PnO Methods in the Predictive Combinatorial Optimization Regime
Haoyu Geng, Hang Ruan, Runzhong Wang et al.
Benchmarking Structural Inference Methods for Interacting Dynamical Systems with Synthetic Data
Aoran Wang, Tsz Pan Tong, Andrzej Mizera et al.
Benchmarking the Attribution Quality of Vision Models
Robin Hesse, Simone Schaub-Meyer, Stefan Roth
Benchmarking Uncertainty Disentanglement: Specialized Uncertainties for Specialized Tasks
Bálint Mucsányi, Michael Kirchhof, Seong Joon Oh
BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays
Yang Zhou, Tan Li Hui Faith, Yanyu Xu et al.
BendVLM: Test-Time Debiasing of Vision-Language Embeddings
Walter Gerych, Haoran Zhang, Kimia Hamidieh et al.
Benign overfitting in leaky ReLU networks with moderate input dimension
Kedar Karhadkar, Erin George, Michael Murray et al.
BertaQA: How Much Do Language Models Know About Local Culture?
Julen Etxaniz, Gorka Azkune, Aitor Soroa et al.
BERTs are Generative In-Context Learners
David Samuel
BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices
Anka Reuel, Amelia Hardy, Chandler Smith et al.
Better by default: Strong pre-tuned MLPs and boosted trees on tabular data
David Holzmüller, Léo Grinsztajn, Ingo Steinwart
BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation
Xiang Zhang, Bingxin Ke, Hayko Riemenschneider et al.
Beware of Road Markings: A New Adversarial Patch Attack to Monocular Depth Estimation
Hangcheng Liu, Zhenhu Wu, Hao Wang et al.
Beyond Accuracy: Ensuring Correct Predictions With Correct Rationales
Tang Li, Mengmeng Ma, Xi Peng