Silviu Pitis
10 papers · 2019–2024 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+7 more ↓ Show less ↑
🏃 Academic Marathon (5) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (4) 🧭 Keyword Pioneer 🐝 Cross-Pollinator (5)
🌉
Interdisciplinary Bridge
🧭
Keyword Pioneer
🌍
Conference Polyglot
(4)
🏆
Grand Slam
👑
Triple Crown
🚀
Conference Pioneer
💎
Century Club
(10)
Conferences
NIPS (4)
ICLR (3)
AAAI (2)
ICML (1)
Top co-authors
Keywords
sequential decision making
(2)
counterfactual data augmentation
(2)
preference modeling
(2)
sample efficiency
(2)
preference learning
(2)
function approximation
(1)
markov decision process
(1)
decision theory
(1)
out-of-distribution generalization
(1)
language model alignment
(1)
offline reinforcement learning
(1)
discount factor
(1)
value function
(1)
model-based reinforcement learning
(1)
off-policy reinforcement learning
(1)
off-policy learning
(1)
sequential decision
(1)
temporal difference
(1)
multi-objective optimization
(1)
reward modeling
(1)
Papers
Improving Context-Aware Preference Modeling for Language Models
NIPS 2024
Identifying the Risks of LM Agents with an LM-Emulated Sandbox
ICLR 2024
Consistent Aggregation of Objectives with Diverse Time Preferences Requires Non-Markovian Rewards
NIPS 2023
Large Language Models are Human-Level Prompt Engineers
ICLR 2023
MoCoDA: Model-based Counterfactual Data Augmentation
NIPS 2022
Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning
ICML 2020
Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning
AAAI 2020
An Inductive Bias for Distances: Neural Nets that Respect the Triangle Inequality
ICLR 2020
Counterfactual Data Augmentation using Locally Factored Dynamics
NIPS 2020
Rethinking the Discount Factor in Reinforcement Learning: A Decision Theoretic Approach
AAAI 2019