Florian Strub
20 papers · 2017–2024 · 12 top CS/AI conferences
Achievements
Conference Polyglot (12)
Academic Marathon (7)
Keyword Pioneer
Interdisciplinary Bridge
Cross-Pollinator (10)
Renaissance Researcher (6)
Dynamic Duo (13)
Triple Crown
Topic Evolution
Keyword Champion (2)
Trend Setter
Keyword Collector (86)
Conference Pioneer
The Questioner
Unstoppable (5)
Century Club (20)
Conferences
NIPS (4)
ICLR (3)
EMNLP (2)
ICML (2)
IJCAI (2)
ACL (1)
AISTATS (1)
CVPR (1)
ECCV (1)
ICCV (1)
INTERSPEECH (1)
NAACL (1)
Keywords
reinforcement learning (6)
self-supervised learning (3)
language model (2)
policy gradient (2)
dialogue system (2)
model alignment (2)
language drift (2)
exponential moving average (2)
contrastive learning (2)
visual question answering (2)
seeded iterated learning (2)
large language model (2)
representation learning (2)
preference optimization (1)
kl divergence (1)
sequential decision making (1)
temporal modeling (1)
question generation (1)
visual grounding (1)
policy learning (1)
Papers
Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion
EMNLP 2024
Countering Reward Over-Optimization in LLM with Demonstration-Guided Reinforcement Learning
ACL 2024
The Edge of Orthogonality: A Simple View of What Makes BYOL Tick
ICML 2023
Language Model Alignment with Elastic Reset
NIPS 2023
SemPPL: Predicting Pseudo-Labels for Better Contrastive Representations
ICLR 2023
Emergent Communication: Generalization and Overfitting in Lewis Games
NIPS 2022
Learning Natural Language Generation with Truncated Reinforcement Learning
NAACL 2022
Emergent Communication at Scale
ICLR 2022
On the role of population heterogeneity in emergent communication
ICLR 2022
Broaden Your Views for Self-Supervised Video Learning
ICCV 2021
Don’t Do What Doesn’t Matter: Intrinsic Motivation with Action Usefulness
IJCAI 2021
Countering Language Drift with Seeded Iterated Learning
ICML 2020
Bootstrap Your Own Latent - A New Approach to Self-Supervised Learning
NIPS 2020
A Machine of Few Words: Interactive Speaker Recognition with Reinforcement Learning
INTERSPEECH 2020
Supervised Seeded Iterated Learning for Interactive Language Learning
EMNLP 2020
Visual Reasoning with Multi-hop Feature Modulation
ECCV 2018
Modulating early visual processing by language
NIPS 2017
GuessWhat?! Visual Object Discovery Through Multi-Modal Dialogue
CVPR 2017
Learning Nash Equilibrium for General-Sum Markov Games from Batch Data
AISTATS 2017
End-to-end optimization of goal-driven and visually grounded dialogue systems
IJCAI 2017