Weinan E
16 papers · 2016–2026 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌍 Conference Polyglot (6) 🏃 Academic Marathon (9)
🏃
Academic Marathon
(9)
🐣
Hot Topic Early Bird
🐝
Cross-Pollinator
(12)
🧬
Topic Evolution
🗃️
Keyword Collector
(71)
💎
Century Club
(15)
🔥
Unstoppable
(10)
📈
Trend Setter
Conferences
NIPS (6)
JMLR (4)
ACL (2)
ICML (2)
EMNLP (1)
ICLR (1)
Top co-authors
Research topics
Keywords
stochastic gradient descent
(4)
stochastic differential equation
(3)
optimal control
(2)
stochastic modified equation
(2)
deep learning
(2)
neural network optimization
(2)
weak approximation
(2)
approximation theory
(2)
flat minima
(2)
video generation
(1)
sharpness-aware minimization
(1)
feature extraction
(1)
sequence modeling
(1)
theoretical analysis
(1)
code generation
(1)
expressive power
(1)
information retrieval
(1)
machine learning
(1)
attention mechanism
(1)
signal processing
(1)
Papers
TeachMaster: Generative Teaching via Code
ACL 2026
The Sharpness Disparity Principle in Transformers for Accelerating Language Model Pre-Training
ICML 2025
PaSa: An LLM Agent for Comprehensive Academic Paper Search
ACL 2025
Exploring Molecular Pretraining Model at Scale
NIPS 2024
Understanding the Expressive Power and Mechanisms of Transformer for Sequence Modeling
NIPS 2024
Improving Generalization and Convergence by Enhancing Implicit Regularization
NIPS 2024
An Iteratively Parallel Generation Method with the Pre-Filling Strategy for Document-level Event Extraction
EMNLP 2023
Approximation and Optimization Theory for Linear Continuous-Time Recurrent Neural Networks
JMLR 2022
On the Curse of Memory in Recurrent Neural Networks: Approximation and Optimization Analysis
ICLR 2021
Towards Theoretically Understanding Why Sgd Generalizes Better Than Adam in Deep Learning
NIPS 2020
Stochastic Modified Equations and Dynamics of Stochastic Gradient Algorithms I: Mathematical Foundations
JMLR 2019
Maximum Principle Based Algorithms for Deep Learning
JMLR 2018
How SGD Selects the Global Minima in Over-parameterized Learning: A Dynamical Stability Perspective
NIPS 2018
End-to-end Symmetry Preserving Inter-atomic Potential Energy Model for Finite and Extended Systems
NIPS 2018
Stochastic Modified Equations and Adaptive Stochastic Gradient Algorithms
ICML 2017
Multiscale Adaptive Representation of Signals: I. The Basic Framework
JMLR 2016