Samy Bengio

49 papers · 2001–2026 · 9 conferences · across top CS/AI conferences

Achievements

+13 more ↓

🐣 Hot Topic Early Bird 🗺️ Taxonomy Completionist (14) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (8)

🌍 Conference Polyglot (8) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (14) 🌟 Keyword Trendsetter Combo (10) 🏆 Keyword Champion 👑 Triple Crown 🌱 Topic Pioneer 💎 Century Club (48) ⚡ Prolific Year (6) 📈 Trend Setter ❓ The Questioner (7) 🔥 Unstoppable (13) 🗃️ Keyword Collector (177)

Conferences

NIPS (18) ICLR (8) JMLR (8) ICML (6) CVPR (4) CONLL (2) ACL (1) EACL (1) INTERSPEECH (1)

Top co-authors

Emmanuel Abbe (6) Oriol Vinyals (5) Navdeep Jaitly (5) Aryo Lotfi (4) Yoram Singer (4) Maithra Raghu (4) Chiyuan Zhang (4) Etai Littwin (3) Hossein Mobahi (3) Omid Saremi (3)

Keywords

neural network (7) deep learning (5) image captioning (4) online learning (3) neural machine translation (3) convolutional neural network (3) curriculum learning (3) logic reasoning (2) boolean function (2) sequence-to-sequence model (2) image similarity (2) image classification (2) zero-shot learning (2) recurrent neural network (2) attention mechanism (2) natural language generation (2) sparse representation (2) machine translation (2) dimensionality reduction (2) out-of-distribution generalization (2)

Papers

Reasoning’s Razor: Reasoning Improves Accuracy but Hurts Recall at Critical Operating Points in Safety and Hallucination Detection EACL 2026 TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining ACL 2025 GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models ICLR 2025 How Far Can Transformers Reason? The Globality Barrier and Inductive Scratchpad NIPS 2024 What Algorithms can Transformers Learn? A Study in Length Generalization ICLR 2024 When can transformers reason with abstract symbols? ICLR 2024 Generalization on the Unseen, Logic Reasoning and Degree Curriculum JMLR 2024 Continuous pseudo-labeling from the start ICLR 2023 Generalization on the Unseen, Logic Reasoning and Degree Curriculum ICML 2023 Transformers learn through gradual rank increase NIPS 2023 Are All Layers Created Equal? JMLR 2022 Learning to Reason with Neural Networks: Generalization, Unseen Data and Boolean Measures NIPS 2022 Learnable Fourier Features for Multi-dimensional Spatial Positional Encoding NIPS 2021 Improving Anytime Prediction with Parallel Cascaded Networks and a Temporal-Difference Loss NIPS 2021 Identity Crisis: Memorization and Generalization Under Extreme Overparameterization ICLR 2020 Fantastic Generalization Measures and Where to Find Them ICLR 2020 Memory Based Trajectory-conditioned Policies for Learning from Sparse Rewards NIPS 2020 Rapid Learning or Feature Reuse? Towards Understanding the Effectiveness of MAML ICLR 2020 Area Attention ICML 2019 Predicting the Generalization Gap in Deep Networks with Margin Distributions ICLR 2019 You Look Twice: GaterNet for Dynamic Filter Selection in CNNs CVPR 2019 Transfusion: Understanding Transfer Learning for Medical Imaging NIPS 2019 Fast Decoding in Sequence Models Using Discrete Latent Variables ICML 2018 Large Margin Deep Networks for Classification NIPS 2018 Content preserving text generation with attribute controls NIPS 2018 Insights on representational similarity in neural networks with canonical correlation NIPS 2018 Context-Aware Captions From Context-Agnostic Supervision CVPR 2017 Tacotron: Towards End-to-End Speech Synthesis INTERSPEECH 2017 Device Placement Optimization with Reinforcement Learning ICML 2017 Sharp Minima Can Generalize For Deep Nets ICML 2017 ADIOS: Architectures Deep In Output Space ICML 2016 Reward Augmented Maximum Likelihood for Neural Structured Prediction NIPS 2016 Generating Sentences from a Continuous Space CONLL 2016 Can Active Memory Replace Attention? NIPS 2016 An Online Sequence-to-Sequence Model Using Partial Conditioning NIPS 2016 LLORMA: Local Low-Rank Matrix Approximation JMLR 2016 Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks NIPS 2015 Show and Tell: A Neural Image Caption Generator CVPR 2015 Learning Semantic Relationships for Better Action Retrieval in Images CVPR 2015 Training Highly Multiclass Classifiers JMLR 2014 DeViSE: A Deep Visual-Semantic Embedding Model NIPS 2013 Large Scale Online Learning of Image Similarity Through Ranking JMLR 2010 Label Embedding Trees for Large Multi-Class Tasks NIPS 2010 Why Does Unsupervised Pre-training Help Deep Learning? JMLR 2010 Group Sparse Coding NIPS 2009 An Online Algorithm for Large Scale Image Similarity Learning NIPS 2009 The Need for Open Source Software in Machine Learning JMLR 2007 Investigating Lexical Substitution Scoring for Subtitle Generation CONLL 2006 SVMTorch: Support Vector Machines for Large-Scale Regression Problems JMLR 2001