I work on theory and algorithms for machine learning. I am interested in
- deep learning theory
- optimization
- statistical learning
 
Find my papers on arXiv and Google Scholar.
I am a postdoctoral fellow at the Simons Institute for the Theory of Computing at UC Berkeley, hosted by Peter Bartlett and Bin Yu, and part of the NSF/Simons Collaboration on the Theoretical Foundations of Deep Learning. I received my Ph.D. in Computer Science from Johns Hopkins University, advised by Vladimir Braverman.
Contact me via email.
Jingfeng Wu is a postdoctoral fellow at the Simons Institute for the Theory of Computing at UC Berkeley. His research focuses on deep learning theory, optimization, and statistical learning. He earned his Ph.D. in Computer Science from Johns Hopkins University in 2023. Prior to that, he received a B.S. in Mathematics (2016) and an M.S. in Applied Mathematics (2019), both from Peking University. In 2023, he was recognized as a Rising Star in Data Science by the University of Chicago and UC San Diego.
Large Stepsizes Accelerate Gradient Descent for Regularized Logistic Regression
NeurIPS 2025

Minimax Optimal Convergence of Gradient Descent in Logistic Regression via Large and Adaptive Stepsizes
ICML 2025 | poster

Large Stepsize Gradient Descent for Logistic Loss: Non-Monotonicity of the Loss Improves Optimization Efficiency
COLT 2024 | poster

Implicit Bias of Gradient Descent for Logistic Regression at the Edge of Stability
NeurIPS 2023 (spotlight) | poster

Risk Comparisons in Linear Regression: Implicit Regularization Dominates Explicit Regularization
arXiv 2025

Benefits of Early Stopping in Gradient Descent for Overparameterized Logistic Regression
ICML 2025 | poster

Last Iterate Risk Bounds of SGD with Decaying Stepsize for Overparameterized Linear Regression
ICML 2022 (long presentation) | poster

Benign Overfitting of Constant-Stepsize SGD for Linear Regression
COLT 2021 (journal version in JMLR 2023)

Scaling Laws in Linear Regression: Compute, Parameters, and Data
NeurIPS 2024

How Many Pretraining Tasks Are Needed for In-Context Learning of Linear Regression?
ICLR 2024 (spotlight) | poster

The Power and Limitation of Pretraining-Finetuning for Linear Regression under Covariate Shift
NeurIPS 2022 | poster
Organizer
Conference Reviewer
ICML (2020 - 2025), NeurIPS (2020 - 2025), ICLR (2021 - 2025), SODA (2026, subreviewer), AISTATS (2021 - 2023), UAI (2023), AAAI (2021 - 2023, PC member reviewer)
Journal Reviewer
JMLR, TPAMI, TMLR, SIMODS, JAIR, Applied Probability Journals, IEEE Transactions on Information Theory, Information and Inference