Bin Yu
59 papers · 2006–2026 · 13 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (25) π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (6) π Conference Polyglot (12)
π
Interdisciplinary Bridge
π£
Hot Topic Early Bird
πΊοΈ
Taxonomy Completionist
(25)
π
Keyword Trendsetter Combo
(5)
π€
Dynamic Duo
(10)
π
Triple Crown
π
Keyword Champion
(2)
π
Grand Slam
π±
Topic Pioneer
π¬
Deep Specialist
(14)
π
Trend Setter
π
Conference Pioneer
β‘
Prolific Year
(8)
π
Century Club
(56)
ποΈ
Keyword Collector
(94)
π₯
Unstoppable
(8)
Conferences
JMLR (18)
ICML (10)
NIPS (9)
ICLR (6)
ACL (3)
AISTATS (3)
COLING (2)
COLT (2)
CVPR (2)
AAAI (1)
ICCV (1)
INTERSPEECH (1)
WACV (1)
Top co-authors
Research topics
Keywords
kernel methods
(5)
mixing time
(4)
model selection
(4)
feature selection
(4)
gradient descent
(4)
sparse regression
(3)
random forest
(3)
markov chain monte carlo
(3)
convergence rate
(3)
nonparametric regression
(3)
visual tracking
(2)
reproducing kernel hilbert space
(2)
importance sampling
(2)
high-dimensional statistics
(2)
feature importance
(2)
neural network optimization
(2)
sparse representation
(2)
dictionary learning
(2)
ridge regression
(2)
sparse learning
(2)
Papers
Formally Specifying the Intended Behavior of the Program: LLM-Driven Neuro-Symbolic Program Specification Synthesis
ACL 2026
T4NMTD: Transition-Centric Reinforcement Learning for Non-Markovian Task Decomposition
AAAI 2026
Bridging Kernel Drivers and Virtual Device Models with LLM-Powered Automation
ACL 2026
Instability, Computational Efficiency and Statistical Accuracy
JMLR 2025
The Effect of SGD Batch Size on Autoencoder Learning: Sparsity, Sharpness, and Feature Learning
JMLR 2025
Efficient Automated Circuit Discovery in Transformers using Contextual Decomposition
ICLR 2025
Benefits of Early Stopping in Gradient Descent for Overparameterized Logistic Regression
ICML 2025
MITIGATING OVER-EXPLORATION IN LATENT SPACE OPTIMIZATION USING LES
ICML 2025
SPEX: Scaling Feature Interaction Explanations for LLMs
ICML 2025
Prominent Roles of Conditionally Invariant Components in Domain Adaptation: Theory and Algorithms
JMLR 2025
Towards Consistent Natural-Language Explanations via Explanation-Consistency Finetuning
COLING 2025
Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs
ICLR 2024
The Impact of Initialization on LoRA Finetuning Dynamics
NIPS 2024
ED-Copilot: Reduce Emergency Department Wait Time with Language Model Diagnostic Assistance
ICML 2024
Minimum-Norm Interpolation Under Covariate Shift
ICML 2024
Improving Prototypical Visual Explanations with Reward Reweighing, Reselection, and Retraining
ICML 2024
LoRA+: Efficient Low Rank Adaptation of Large Models
ICML 2024
Large Stepsize Gradient Descent for Logistic Loss: Non-Monotonicity of the Loss Improves Optimization Efficiency
COLT 2024
Diagnosing Transformers: Illuminating Feature Spaces for Clinical Decision-Making
ICLR 2024
Bridging Discrete and Backpropagation: Straight-Through and Beyond
NIPS 2023
Revisiting minimum description length complexity in overparameterized models
JMLR 2023
Improve Speech Enhancement using Perception-High-Related Time-Frequency Loss
INTERSPEECH 2022
C2AM Loss: Chasing a Better Decision Boundary for Long-Tail Object Detection
CVPR 2022
The Three Stages of Learning Dynamics in High-dimensional Kernel Methods
ICLR 2022
Hierarchical Shrinkage: Improving the accuracy and interpretability of tree-based models.
ICML 2022
A cautionary tale on fitting decision trees to data from additive models: generalization lower bounds
AISTATS 2022
High-Performance Discriminative Tracking With Transformers
ICCV 2021
Adaptive wavelet distillation from neural networks through interpretations
NIPS 2021
Fast Kernelized Correlation Filter Without Boundary Effect
WACV 2021
Interpretations are Useful: Penalizing Explanations to Align Neural Networks with Prior Knowledge
ICML 2020
Unique Sharp Local Minimum in L1-minimization Complete Dictionary Learning
JMLR 2020
Sharp Analysis of Expectation-Maximization for Weakly Identifiable Models
AISTATS 2020
Fast mixing of Metropolized Hamiltonian Monte Carlo: Benefits of multi-step gradients
JMLR 2020
A Debiased MDI Feature Importance Measure for Random Forests
NIPS 2019
Log-concave sampling: Metropolis-Hastings algorithms are fast
JMLR 2019
Hierarchical interpretations for neural network predictions
ICLR 2019
Beyond Word Importance: Contextual Decomposition to Extract Interactions from LSTMs
ICLR 2018
High-Speed Tracking With Multi-Kernel Correlation Filters
CVPR 2018
Log-concave sampling: Metropolis-Hastings algorithms are fast!
COLT 2018
Local Identifiability of $\ell_1$-minimization Dictionary Learning: a Sufficient and Almost Necessary Condition
JMLR 2018
Fast MCMC Sampling Algorithms on Polytopes
JMLR 2018
Supervised Neighborhoods for Distributed Nonparametric Regression
AISTATS 2016
A Statistical Perspective on Algorithmic Leveraging
JMLR 2015
Counting and Exploring Sizes of Markov Equivalence Classes of Directed Acyclic Graphs
JMLR 2015
A Statistical Perspective on Algorithmic Leveraging
ICML 2014
Early Stopping and Non-parametric Regression: An Optimal Data-dependent Stopping Rule
JMLR 2014
Supervised Feature Selection in Graphs with Path Coding Penalties and Network Flows
JMLR 2013
Minimax-Optimal Rates For Sparse Additive Models Over Kernel Classes Via Convex Programming
JMLR 2012
Restricted Eigenvalue Properties for Correlated Gaussian Designs
JMLR 2010
Predicting Execution Time of Computer Programs Using Sparse Polynomial Regression
NIPS 2010
A unified framework for high-dimensional analysis of $M$-estimators with decomposable regularizers
NIPS 2009
Lower bounds on minimax rates for nonparametric regression with additive sparsity and smoothness
NIPS 2009
Nonparametric sparse hierarchical models describe V1 fMRI responses to natural images
NIPS 2008
Model Selection in Gaussian Graphical Models: High-Dimensional Consistency of \boldmath$\ell_1$-regularized MLE
NIPS 2008
Stagewise Lasso
JMLR 2007
On Model Selection Consistency of Lasso
JMLR 2006
Approximation Lasso Methods for Language Modeling
COLING 2006
Approximation Lasso Methods for Language Modeling
ACL 2006
Sparse Boosting
JMLR 2006