Papers
When is Multicalibration Post-Processing Necessary?
Dutch Hansen, Siddartha Devic, Preetum Nakkiran et al.
When LLM Meets DRL: Advancing Jailbreaking Efficiency via DRL-guided Search
Xuan Chen, Yuzhou Nie, Wenbo Guo et al.
When LLMs Meet Cunning Texts: A Fallacy Understanding Benchmark for Large Language Models
Yinghui Li, Qingyu Zhou, Yuanzhen Luo et al.
When to Act and When to Ask: Policy Learning With Deferral Under Hidden Confounding
Marah Ghoummaid, Uri Shalit
When to Sense and Control? A Time-adaptive Approach for Continuous-Time RL
Lenart Treven, Bhavya Sukhija, Yarden As et al.
When Your AIs Deceive You: Challenges of Partial Observability in Reinforcement Learning from Human Feedback
Leon Lang, Davis Foote, Stuart Russell et al.
Where does In-context Learning Happen in Large Language Models?
Suzanna Sia, David Mueller, Kevin Duh
Where Do Large Learning Rates Lead Us?
Ildus Sadrtdinov, Maxim Kodryan, Eduard Pokonechny et al.
Where's Waldo: Diffusion Features For Personalized Segmentation and Retrieval
Dvir Samuel, Rami Ben-Ari, Matan Levy et al.
WhodunitBench: Evaluating Large Multimodal Agents via Murder Mystery Games
Junlin Xie, Ruifei Zhang, Zhihong Chen et al.
Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence Metrics with T2IScoreScore (TS2)
Michael Saxon, Fatima Jahara, Mahsa Khoshnoodi et al.
Who's asking? User personas and the mechanics of latent misalignment
Asma Ghandeharioun, Ann Yuan, Marius Guerard et al.
Who’s Gaming the System? A Causally-Motivated Approach for Detecting Strategic Adaptation
Trenton Chang, Lindsay Warrenburg, Sae-Hwan Park et al.
Why are Visually-Grounded Language Models Bad at Image Classification?
Yuhui Zhang, Alyssa Unell, Xiaohan Wang et al.
Why Do We Need Weight Decay in Modern Deep Learning?
Francesco D'Angelo, Maksym Andriushchenko, Aditya Varre et al.
Why Go Full? Elevating Federated Learning Through Partial Network Updates
Haolin Wang, Xuefeng Liu, Jianwei Niu et al.
Why the Metric Backbone Preserves Community Structure
Maximilien Dreveton, Charbel Chucri, Matthias Grossglauser et al.
Why Transformers Need Adam: A Hessian Perspective
Yushun Zhang, Congliang Chen, Tian Ding et al.
Why Warmup the Learning Rate? Underlying Mechanisms and Improvements
Dayal Singh Kalra, Maissam Barkeshli
Wide Two-Layer Networks can Learn from Adversarial Perturbations
Soichiro Kumano, Hiroshi Kera, Toshihiko Yamasaki
WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia
Yufang Hou, Alessandra Pascale, Javier Carnerero-Cano et al.
WikiDBs: A Large-Scale Corpus Of Relational Databases From Wikidata
Liane Vogel, Jan-Micha Bodensohn, Carsten Binnig
WikiDO: A New Benchmark Evaluating Cross-Modal Retrieval for Vision-Language Models
T Pavan Kalyan, Piyush Singh Pasi, Sahil Nilesh Dharod et al.
WildGaussians: 3D Gaussian Splatting In the Wild
Jonas Kulhanek, Songyou Peng, Zuzana Kukelova et al.
Wild-GS: Real-Time Novel View Synthesis from Unconstrained Photo Collections
Jiacong Xu, Yiqun Mei, Vishal M. Patel