Florian Strub

20 papers · 2017–2024 · 12 top CS/AI conferences

Achievements

🌍 Conference Polyglot (12) 🏃 Academic Marathon (7) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (10)

🌈 Renaissance Researcher (6) 🤝 Dynamic Duo (13) 👑 Triple Crown 🧬 Topic Evolution 🏆 Keyword Champion (2) 📈 Trend Setter 🗃️ Keyword Collector (86) 🚀 Conference Pioneer ❓ The Questioner 🔥 Unstoppable (5) 💎 Century Club (20)

Conferences

NIPS (4) ICLR (3) EMNLP (2) ICML (2) IJCAI (2) ACL (1) AISTATS (1) CVPR (1) ECCV (1) ICCV (1) INTERSPEECH (1) NAACL (1)

Top co-authors

Olivier Pietquin (13) Bilal Piot (5) Harm de Vries (4) Corentin Tallec (4) Jean-Bastien Grill (4) Aaron Courville (4) Mathieu Rita (3) Florent Altché (3) Jeremie Mary (3) Emmanuel Dupoux (3)

Keywords

reinforcement learning (6) self-supervised learning (3) language model (2) policy gradient (2) dialogue system (2) model alignment (2) language drift (2) exponential moving average (2) contrastive learning (2) visual question answering (2) seeded iterated learning (2) large language model (2) representation learning (2) preference optimization (1) kl divergence (1) sequential decision making (1) temporal modeling (1) question generation (1) visual grounding (1) policy learning (1)

Papers

Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion · EMNLP 2024
Countering Reward Over-Optimization in LLM with Demonstration-Guided Reinforcement Learning · ACL 2024
The Edge of Orthogonality: A Simple View of What Makes BYOL Tick · ICML 2023
Language Model Alignment with Elastic Reset · NIPS 2023
SemPPL: Predicting Pseudo-Labels for Better Contrastive Representations · ICLR 2023
Emergent Communication: Generalization and Overfitting in Lewis Games · NIPS 2022
Learning Natural Language Generation with Truncated Reinforcement Learning · NAACL 2022
Emergent Communication at Scale · ICLR 2022
On the role of population heterogeneity in emergent communication · ICLR 2022
Broaden Your Views for Self-Supervised Video Learning · ICCV 2021
Don’t Do What Doesn’t Matter: Intrinsic Motivation with Action Usefulness · IJCAI 2021
Countering Language Drift with Seeded Iterated Learning · ICML 2020
Bootstrap Your Own Latent - A New Approach to Self-Supervised Learning · NIPS 2020
A Machine of Few Words: Interactive Speaker Recognition with Reinforcement Learning · INTERSPEECH 2020
Supervised Seeded Iterated Learning for Interactive Language Learning · EMNLP 2020
Visual Reasoning with Multi-hop Feature Modulation · ECCV 2018
Modulating early visual processing by language · NIPS 2017
GuessWhat?! Visual Object Discovery Through Multi-Modal Dialogue · CVPR 2017
Learning Nash Equilibrium for General-Sum Markov Games from Batch Data · AISTATS 2017
End-to-end optimization of goal-driven and visually grounded dialogue systems · IJCAI 2017