2019-12-18 | Ying Sun: Collaborative Statistical Learning: Algorithms and Guarantees


Abstract

Advances in computation, communication, and data storage in recent decades have significantly reduced the cost of data acquisition, leading to an explosion of data generated across interconnected platforms. Beyond the computational difficulties that arise from nonconvex formulations, the sheer volume and spatial dispersion of the data also challenge traditional learning procedures, which typically require centralized training sets. Reaping the dividend offered by this data deluge calls for collaborative learning methods capable of making inferences from data distributed over a network.

 

This talk will present a novel algorithmic framework, SONATA, and its guarantees for in-network statistical learning, formulated as an empirical risk minimization (ERM) problem. By leveraging local successive convexification and network communication, our algorithm is, for the first time in the literature, able to solve fairly general nonconvex ERM problems over (time-varying directed) networks. It matches the performance of a centralized learning algorithm, in the sense that it converges linearly for strongly convex ERM problems and sublinearly for (non)convex ERM instances. Furthermore, for regularized high-dimensional ERM problems (i.e., models where the parameter dimension is larger than the sample size), SONATA enjoys linear convergence up to the statistical precision of the model, even in the absence of strong convexity. Generalizations of the algorithm to large-scale problems and to the asynchronous setting will also be discussed.
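
To make the idea of in-network ERM concrete, the sketch below runs a plain gradient-tracking scheme on a least-squares problem whose data are split across agents on a ring. It is an illustration only and is not claimed to be SONATA: it assumes a static undirected network, a strongly convex loss, and no successive convexification. The function names, the mixing weights, the step size, and the synthetic data are all hypothetical choices made for this example.

```python
import numpy as np


def ring_mixing_matrix(n):
    """Doubly stochastic weights for an undirected ring of n >= 3 agents (assumed topology)."""
    W = np.zeros((n, n))
    for i in range(n):
        for j in (i - 1, i, i + 1):
            W[i, j % n] = 1.0 / 3.0
    return W


def gradient_tracking(A_list, b_list, W, step, iters):
    """Decentralized least squares: each agent mixes with its neighbors and tracks the average gradient."""
    n = len(A_list)
    d = A_list[0].shape[1]

    def local_grad(i, x):
        # Gradient of the local loss (1 / (2 m_i)) * ||A_i x - b_i||^2.
        return A_list[i].T @ (A_list[i] @ x - b_list[i]) / len(b_list[i])

    X = np.zeros((n, d))                                  # row i: agent i's estimate
    G = np.stack([local_grad(i, X[i]) for i in range(n)])
    Y = G.copy()                                          # row i: agent i's gradient tracker
    for _ in range(iters):
        X_new = W @ X - step * Y                          # consensus step plus descent along the tracker
        G_new = np.stack([local_grad(i, X_new[i]) for i in range(n)])
        Y = W @ Y + G_new - G                             # update the running estimate of the average gradient
        X, G = X_new, G_new
    return X


# Synthetic data: 5 agents, 20 samples each, 3 parameters (all values are assumptions).
rng = np.random.default_rng(0)
n_agents, m, d = 5, 20, 3
x_true = rng.normal(size=d)
A_list = [rng.normal(size=(m, d)) for _ in range(n_agents)]
b_list = [A @ x_true + 0.1 * rng.normal(size=m) for A in A_list]

X = gradient_tracking(A_list, b_list, ring_mixing_matrix(n_agents), step=0.05, iters=2000)

# Centralized reference: least squares on the pooled data.
x_star = np.linalg.lstsq(np.vstack(A_list), np.concatenate(b_list), rcond=None)[0]
print(np.max(np.abs(X - x_star)))  # largest gap between any agent's estimate and the pooled solution
```

In this toy setting every agent's estimate should approach the pooled least-squares solution even though no agent ever sees another agent's data, which is the sense of "matching a centralized algorithm" that the abstract refers to.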

 

Time

Wednesday, December 18, 2019, 10:00-11:30

 

Speaker

Ying Sun is a post-doctoral researcher with the School of Industrial Engineering, Purdue University. She received her Ph.D. degree in Electronic and Computer Engineering from the Hong Kong University of Science and Technology in 2016. Her research focuses on computational optimization, statistical learning, and the interplay between them, with an emphasis on decentralized and collaborative inference methods.

 

Venue

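Room 308, School of Information Management and Engineering

Shanghai University of Finance and Economics (west side of Teaching Building No. 3)

100 Wudong Road, Yangpu District, Shanghai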