Bin Yu

59 papers · 2006–2026 · 13 conferences · across top CS/AI conferences

Achievements

+16 more ↓

🗺️ Taxonomy Completionist (25) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (6) 🌍 Conference Polyglot (12)

🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird 🗺️ Taxonomy Completionist (25) 🌟 Keyword Trendsetter Combo (5) 🤝 Dynamic Duo (10) 👑 Triple Crown 🏆 Keyword Champion (2) 🏆 Grand Slam 🌱 Topic Pioneer 🔬 Deep Specialist (14) 📈 Trend Setter 🚀 Conference Pioneer ⚡ Prolific Year (8) 💎 Century Club (56) 🗃️ Keyword Collector (94) 🔥 Unstoppable (8)

Conferences

JMLR (18) ICML (10) NIPS (9) ICLR (6) ACL (3) AISTATS (3) COLING (2) COLT (2) CVPR (2) AAAI (1) ICCV (1) INTERSPEECH (1) WACV (1)

Top co-authors

Martin J. Wainwright (10) Raaz Dwivedi (7) Chandan Singh (7) Yuansi Chen (5) Jianfeng Gao (5) Garvesh Raskutti (5) Ming Tang (4) Abhineet Agarwal (4) Jinqiao Wang (4) Nikhil Ghosh (4)

Research topics

Statistics (2)

Keywords

kernel methods (5) mixing time (4) model selection (4) feature selection (4) gradient descent (4) sparse regression (3) random forest (3) markov chain monte carlo (3) convergence rate (3) nonparametric regression (3) visual tracking (2) reproducing kernel hilbert space (2) importance sampling (2) high-dimensional statistics (2) feature importance (2) neural network optimization (2) sparse representation (2) dictionary learning (2) ridge regression (2) sparse learning (2)

Papers

Formally Specifying the Intended Behavior of the Program: LLM-Driven Neuro-Symbolic Program Specification Synthesis ACL 2026 T4NMTD: Transition-Centric Reinforcement Learning for Non-Markovian Task Decomposition AAAI 2026 Bridging Kernel Drivers and Virtual Device Models with LLM-Powered Automation ACL 2026 Instability, Computational Efficiency and Statistical Accuracy JMLR 2025 The Effect of SGD Batch Size on Autoencoder Learning: Sparsity, Sharpness, and Feature Learning JMLR 2025 Efficient Automated Circuit Discovery in Transformers using Contextual Decomposition ICLR 2025 Benefits of Early Stopping in Gradient Descent for Overparameterized Logistic Regression ICML 2025 MITIGATING OVER-EXPLORATION IN LATENT SPACE OPTIMIZATION USING LES ICML 2025 SPEX: Scaling Feature Interaction Explanations for LLMs ICML 2025 Prominent Roles of Conditionally Invariant Components in Domain Adaptation: Theory and Algorithms JMLR 2025 Towards Consistent Natural-Language Explanations via Explanation-Consistency Finetuning COLING 2025 Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs ICLR 2024 The Impact of Initialization on LoRA Finetuning Dynamics NIPS 2024 ED-Copilot: Reduce Emergency Department Wait Time with Language Model Diagnostic Assistance ICML 2024 Minimum-Norm Interpolation Under Covariate Shift ICML 2024 Improving Prototypical Visual Explanations with Reward Reweighing, Reselection, and Retraining ICML 2024 LoRA+: Efficient Low Rank Adaptation of Large Models ICML 2024 Large Stepsize Gradient Descent for Logistic Loss: Non-Monotonicity of the Loss Improves Optimization Efficiency COLT 2024 Diagnosing Transformers: Illuminating Feature Spaces for Clinical Decision-Making ICLR 2024 Bridging Discrete and Backpropagation: Straight-Through and Beyond NIPS 2023 Revisiting minimum description length complexity in overparameterized models JMLR 2023 Improve Speech Enhancement using Perception-High-Related Time-Frequency Loss INTERSPEECH 2022 C2AM Loss: Chasing a Better Decision Boundary for Long-Tail Object Detection CVPR 2022 The Three Stages of Learning Dynamics in High-dimensional Kernel Methods ICLR 2022 Hierarchical Shrinkage: Improving the accuracy and interpretability of tree-based models. ICML 2022 A cautionary tale on fitting decision trees to data from additive models: generalization lower bounds AISTATS 2022 High-Performance Discriminative Tracking With Transformers ICCV 2021 Adaptive wavelet distillation from neural networks through interpretations NIPS 2021 Fast Kernelized Correlation Filter Without Boundary Effect WACV 2021 Interpretations are Useful: Penalizing Explanations to Align Neural Networks with Prior Knowledge ICML 2020 Unique Sharp Local Minimum in L1-minimization Complete Dictionary Learning JMLR 2020 Sharp Analysis of Expectation-Maximization for Weakly Identifiable Models AISTATS 2020 Fast mixing of Metropolized Hamiltonian Monte Carlo: Benefits of multi-step gradients JMLR 2020 A Debiased MDI Feature Importance Measure for Random Forests NIPS 2019 Log-concave sampling: Metropolis-Hastings algorithms are fast JMLR 2019 Hierarchical interpretations for neural network predictions ICLR 2019 Beyond Word Importance: Contextual Decomposition to Extract Interactions from LSTMs ICLR 2018 High-Speed Tracking With Multi-Kernel Correlation Filters CVPR 2018 Log-concave sampling: Metropolis-Hastings algorithms are fast! COLT 2018 Local Identifiability of $\ell_1$-minimization Dictionary Learning: a Sufficient and Almost Necessary Condition JMLR 2018 Fast MCMC Sampling Algorithms on Polytopes JMLR 2018 Supervised Neighborhoods for Distributed Nonparametric Regression AISTATS 2016 A Statistical Perspective on Algorithmic Leveraging JMLR 2015 Counting and Exploring Sizes of Markov Equivalence Classes of Directed Acyclic Graphs JMLR 2015 A Statistical Perspective on Algorithmic Leveraging ICML 2014 Early Stopping and Non-parametric Regression: An Optimal Data-dependent Stopping Rule JMLR 2014 Supervised Feature Selection in Graphs with Path Coding Penalties and Network Flows JMLR 2013 Minimax-Optimal Rates For Sparse Additive Models Over Kernel Classes Via Convex Programming JMLR 2012 Restricted Eigenvalue Properties for Correlated Gaussian Designs JMLR 2010 Predicting Execution Time of Computer Programs Using Sparse Polynomial Regression NIPS 2010 A unified framework for high-dimensional analysis of $M$-estimators with decomposable regularizers NIPS 2009 Lower bounds on minimax rates for nonparametric regression with additive sparsity and smoothness NIPS 2009 Nonparametric sparse hierarchical models describe V1 fMRI responses to natural images NIPS 2008 Model Selection in Gaussian Graphical Models: High-Dimensional Consistency of \boldmath$\ell_1$-regularized MLE NIPS 2008 Stagewise Lasso JMLR 2007 On Model Selection Consistency of Lasso JMLR 2006 Approximation Lasso Methods for Language Modeling COLING 2006 Approximation Lasso Methods for Language Modeling ACL 2006 Sparse Boosting JMLR 2006