conftrace_

Hiteshi Sharma

5 papers · 2019–2024 · 5 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+4 more ↓

🐣 Hot Topic Early Bird 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (5) 🏃 Academic Marathon (5)

🐝 Cross-Pollinator (13) 🌈 Renaissance Researcher (6) 🗺️ Taxonomy Completionist (17) 🚀 Conference Pioneer

Conferences

EMNLP (1) ICML (1) NAACL (1) NIPS (1) UAI (1)

Top co-authors

Rahul Jain (2) Weizhu Chen (1) Dongyan Zhao (1) Junheng Hao (1) Yelong Shen (1) Jiazhan Feng (1) Chen-Yu Wei (1) Jonathan Larson (1) Ida Momennejad (1) Yi Mao (1)

Keywords

large language model (2) reinforcement learning (2) function approximation (1) logical reasoning (1) question answering (1) trajectory prediction (1) label smoothing (1) instruction tuning (1) reinforcement learning from human feedback (1) markov decision process (1) model alignment (1) value iteration (1) kernel density estimation (1) continuous state space (1) human feedback (1) regret bound (1) cognitive map (1) language model (1) average reward (1) average reward mdp (1)

Papers

Enhancing Language Model Alignment: A Confidence-Based Approach to Label Smoothing EMNLP 2024 Language Models can be Deductive Solvers NAACL 2024 Evaluating Cognitive Maps and Planning in Large Language Models with CogEval NIPS 2023 Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes ICML 2020 Approximate Relative Value Learning for Average-reward Continuous State MDPs UAI 2019