Long Yang
17 papers · 2017–2025 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+6 more ↓ Show less ↑
🐝 Cross-Pollinator (10) 🌍 Conference Polyglot (8) 🏃 Academic Marathon (8) 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (6)
🏃
Academic Marathon
(8)
🐝
Cross-Pollinator
(10)
🌈
Renaissance Researcher
(6)
💎
Century Club
(17)
🔥
Unstoppable
(5)
🗃️
Keyword Collector
(68)
Conferences
AAAI (4)
IJCAI (3)
NIPS (3)
CVPR (2)
ICML (2)
COLT (1)
CORL (1)
MICCAI (1)
Top co-authors
Keywords
reinforcement learning
(4)
policy optimization
(4)
safe reinforcement learning
(3)
policy gradient
(3)
constraint satisfaction
(3)
offline reinforcement learning
(1)
game theory
(1)
policy evaluation
(1)
convex optimization
(1)
variational inference
(1)
temporal difference learning
(1)
minimax optimization
(1)
function approximation
(1)
sample complexity
(1)
distributed optimization
(1)
structure from motion
(1)
risk management
(1)
markov decision process
(1)
lyapunov function
(1)
sample efficiency
(1)
Papers
Low-Dimension-to-High-Dimension Generalization and Its Implications for Length Generalization
ICML 2025
UniTac2Pose: A Unified Approach Learned in Simulation for Category-level Visuotactile In-hand Pose Estimation
CORL 2025
A Semi-Supervised Knowledge Distillation Framework for Left Ventricle Segmentation and Landmark Detection in Echocardiograms
MICCAI 2025
Optimizing over Multiple Distributions under Generalized Quasar-Convexity Condition
NIPS 2024
Langevin Policy for Safe Reinforcement Learning
ICML 2024
FlagVNE: A Flexible and Generalizable Reinforcement Learning Framework for Network Resource Allocation
IJCAI 2024
Zeroth-order Optimization with Weak Dimension Dependency
COLT 2023
VOCE: Variational Optimization with Conservative Estimation for Offline Safe Reinforcement Learning
NIPS 2023
Augmented Proximal Policy Optimization for Safe Reinforcement Learning
AAAI 2023
Penalized Proximal Policy Optimization for Safe Reinforcement Learning
IJCAI 2022
Constrained Update Projection Approach to Safe Policy Optimization
NIPS 2022
Policy Optimization with Stochastic Mirror Descent
AAAI 2022
Sample Complexity of Policy Gradient Finding Second-Order Stationary Points
AAAI 2021
On Convergence of Gradient Expected Sarsa(λ)
AAAI 2021
A Unified Approach for Multi-step Temporal-Difference Learning with Eligibility Traces in Reinforcement Learning
IJCAI 2018
Texture Mapping for 3D Reconstruction With RGB-D Sensor
CVPR 2018
Distinguishing the Indistinguishable: Exploring Structural Ambiguities via Geodesic Context
CVPR 2017