Hiteshi Sharma
5 papers · 2019–2024 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+4 more ↓ Show less ↑
π£ Hot Topic Early Bird π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (5) π Academic Marathon (5)
π
Cross-Pollinator
(13)
π
Renaissance Researcher
(6)
πΊοΈ
Taxonomy Completionist
(17)
π
Conference Pioneer
Conferences
EMNLP (1)
ICML (1)
NAACL (1)
NIPS (1)
UAI (1)
Top co-authors
Keywords
large language model
(2)
reinforcement learning
(2)
function approximation
(1)
logical reasoning
(1)
question answering
(1)
trajectory prediction
(1)
label smoothing
(1)
instruction tuning
(1)
reinforcement learning from human feedback
(1)
markov decision process
(1)
model alignment
(1)
value iteration
(1)
kernel density estimation
(1)
continuous state space
(1)
human feedback
(1)
regret bound
(1)
cognitive map
(1)
language model
(1)
average reward
(1)
average reward mdp
(1)
Papers
Enhancing Language Model Alignment: A Confidence-Based Approach to Label Smoothing
EMNLP 2024
Language Models can be Deductive Solvers
NAACL 2024
Evaluating Cognitive Maps and Planning in Large Language Models with CogEval
NIPS 2023
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
ICML 2020
Approximate Relative Value Learning for Average-reward Continuous State MDPs
UAI 2019