Yuheng Zhang
14 papers · 2020–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+6 more ↓ Show less ↑
π Interdisciplinary Bridge π Cross-Pollinator (14) π Conference Polyglot (6) π Academic Marathon (5) π Renaissance Researcher (6)
πΊοΈ
Taxonomy Completionist
(26)
π
Cross-Pollinator
(14)
π
Grand Slam
ποΈ
Keyword Collector
(50)
π
Century Club
(13)
π₯
Unstoppable
(6)
Conferences
NIPS (5)
AAAI (2)
ICLR (2)
ICML (2)
ACL (1)
ALT (1)
CVPR (1)
Top co-authors
Keywords
regret bound
(3)
neural network
(2)
model inversion attack
(2)
multi-agent reinforcement learning
(2)
feedback graph
(2)
offline reinforcement learning
(1)
off-policy evaluation
(1)
representation learning
(1)
face recognition
(1)
preference learning
(1)
policy learning
(1)
automatic speech recognition
(1)
temporal grounding
(1)
minimax optimization
(1)
reinforcement learning
(1)
reward function
(1)
mutual information
(1)
function approximation
(1)
speaker diarization
(1)
nash equilibrium
(1)
Papers
TagSpeech: End-to-End Multi-Speaker ASR and Diarization with Fine-Grained Temporal Grounding
ACL 2026
Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning
ICLR 2025
Statistical Tractability of Off-policy Evaluation of History-dependent Policies in POMDPs
ICLR 2025
Efficient Contextual Bandits with Uninformed Feedback Graphs
ICML 2024
Provably Efficient Interactive-Grounded Learning with Personalized Reward
NIPS 2024
Online Iterative Reinforcement Learning from Human Feedback with General Preference Model
NIPS 2024
On the Curses of Future and History in Future-dependent Value Functions for Off-policy Evaluation
NIPS 2024
Improved High-Probability Regret for Adversarial Bandits with Time-Varying Feedback Graphs
ALT 2023
Practical Contextual Bandits with Feedback Graphs
NIPS 2023
Offline Learning in Markov Games with General Function Approximation
ICML 2023
Improved Algorithms for Neural Active Learning
NIPS 2022
Batch Active Learning with Graph Neural Networks via Multi-Agent Deep Reinforcement Learning
AAAI 2022
Improving Robustness to Model Inversion Attacks via Mutual Information Regularization
AAAI 2021
The Secret Revealer: Generative Model-Inversion Attacks Against Deep Neural Networks
CVPR 2020