[Statlist] ETH Young Data Science Researcher Seminar Zurich, Virtual Seminar by Pragya Sur, Harvard University, 26 June 2020
Maurer Letizia
letiziamaurer at ethz.ch
Tue Jun 23 18:33:18 CEST 2020
Dear all
We are glad to announce the following talk in the virtual ETH Young Data Science Researcher Seminar Zurich
"A precise high-dimensional theory for boosting"
by Pragya Sur, Harvard University
Time: Friday, 26 June 2020, 15:00-16:00
Place: Zoom at https://ethz.zoom.us/j/92367940258
Abstract: This talk will introduce a precise high-dimensional asymptotic theory for Boosting on separable data, taking both statistical and computational perspectives. We will consider the common modern setting where the number of features p and the sample size n are both large and comparable, and in particular, look at scenarios where the data is separable in an asymptotic sense. On the statistical front, we will provide an exact analysis of the generalization error of Boosting, when the algorithm interpolates the training data and maximizes an empirical L1 margin. The angle between the Boosting solution and the ground truth can be explicitly characterized. On the computational front, we will provide a sharp analysis of the stopping time when Boosting approximately maximizes the empirical L1 margin. Our theory provides several insights into properties of Boosting, for instance, we discover that the larger the margin, the smaller the proportion of active features (with zero initialization). At the heart of our theory lies a detailed study of the maximum L1 margin, using tools from stochastic convex geometry. This is based on joint work with Tengyuan Liang.
Best wishes,
M. Löffler, A. Taeb, Y. Chen
Seminar website: https://math.ethz.ch/sfs/news-and-events/young-data-science.html
More information about the Statlist
mailing list