Samy Bengio
49 papers · 2001–2026 · 9 conferences · across top CS/AI conferences
Achievements
Hot Topic Early Bird
Taxonomy Completionist (14)
Keyword Pioneer
Interdisciplinary Bridge
Conference Polyglot (8)
Keyword Trendsetter Combo (10)
Keyword Champion
Triple Crown
Topic Pioneer
Century Club (48)
Prolific Year (6)
Trend Setter
The Questioner (7)
Unstoppable (13)
Keyword Collector (177)
Conferences
NIPS (18)
ICLR (8)
JMLR (8)
ICML (6)
CVPR (4)
CONLL (2)
ACL (1)
EACL (1)
INTERSPEECH (1)
Keywords
neural network (7)
deep learning (5)
image captioning (4)
online learning (3)
neural machine translation (3)
convolutional neural network (3)
curriculum learning (3)
logic reasoning (2)
boolean function (2)
sequence-to-sequence model (2)
image similarity (2)
image classification (2)
zero-shot learning (2)
recurrent neural network (2)
attention mechanism (2)
natural language generation (2)
sparse representation (2)
machine translation (2)
dimensionality reduction (2)
out-of-distribution generalization (2)
Papers
Reasoning's Razor: Reasoning Improves Accuracy but Hurts Recall at Critical Operating Points in Safety and Hallucination Detection
EACL 2026
TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining
ACL 2025
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models
ICLR 2025
How Far Can Transformers Reason? The Globality Barrier and Inductive Scratchpad
NIPS 2024
What Algorithms can Transformers Learn? A Study in Length Generalization
ICLR 2024
When can transformers reason with abstract symbols?
ICLR 2024
Generalization on the Unseen, Logic Reasoning and Degree Curriculum
JMLR 2024
Continuous pseudo-labeling from the start
ICLR 2023
Generalization on the Unseen, Logic Reasoning and Degree Curriculum
ICML 2023
Transformers learn through gradual rank increase
NIPS 2023
Are All Layers Created Equal?
JMLR 2022
Learning to Reason with Neural Networks: Generalization, Unseen Data and Boolean Measures
NIPS 2022
Learnable Fourier Features for Multi-dimensional Spatial Positional Encoding
NIPS 2021
Improving Anytime Prediction with Parallel Cascaded Networks and a Temporal-Difference Loss
NIPS 2021
Identity Crisis: Memorization and Generalization Under Extreme Overparameterization
ICLR 2020
Fantastic Generalization Measures and Where to Find Them
ICLR 2020
Memory Based Trajectory-conditioned Policies for Learning from Sparse Rewards
NIPS 2020
Rapid Learning or Feature Reuse? Towards Understanding the Effectiveness of MAML
ICLR 2020
Area Attention
ICML 2019
Predicting the Generalization Gap in Deep Networks with Margin Distributions
ICLR 2019
You Look Twice: GaterNet for Dynamic Filter Selection in CNNs
CVPR 2019
Transfusion: Understanding Transfer Learning for Medical Imaging
NIPS 2019
Fast Decoding in Sequence Models Using Discrete Latent Variables
ICML 2018
Large Margin Deep Networks for Classification
NIPS 2018
Content preserving text generation with attribute controls
NIPS 2018
Insights on representational similarity in neural networks with canonical correlation
NIPS 2018
Context-Aware Captions From Context-Agnostic Supervision
CVPR 2017
Tacotron: Towards End-to-End Speech Synthesis
INTERSPEECH 2017
Device Placement Optimization with Reinforcement Learning
ICML 2017
Sharp Minima Can Generalize For Deep Nets
ICML 2017
ADIOS: Architectures Deep In Output Space
ICML 2016
Reward Augmented Maximum Likelihood for Neural Structured Prediction
NIPS 2016
Generating Sentences from a Continuous Space
CONLL 2016
Can Active Memory Replace Attention?
NIPS 2016
An Online Sequence-to-Sequence Model Using Partial Conditioning
NIPS 2016
LLORMA: Local Low-Rank Matrix Approximation
JMLR 2016
Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks
NIPS 2015
Show and Tell: A Neural Image Caption Generator
CVPR 2015
Learning Semantic Relationships for Better Action Retrieval in Images
CVPR 2015
Training Highly Multiclass Classifiers
JMLR 2014
DeViSE: A Deep Visual-Semantic Embedding Model
NIPS 2013
Large Scale Online Learning of Image Similarity Through Ranking
JMLR 2010
Label Embedding Trees for Large Multi-Class Tasks
NIPS 2010
Why Does Unsupervised Pre-training Help Deep Learning?
JMLR 2010
Group Sparse Coding
NIPS 2009
An Online Algorithm for Large Scale Image Similarity Learning
NIPS 2009
The Need for Open Source Software in Machine Learning
JMLR 2007
Investigating Lexical Substitution Scoring for Subtitle Generation
CONLL 2006
SVMTorch: Support Vector Machines for Large-Scale Regression Problems
JMLR 2001