[Talk Announcement] Efficient Data Clustering Using Coresets

[Published: 2020-09-14]

1. Title: Efficient Data Clustering Using Coresets

2. Speaker: Prof. Shaofeng Jiang

3. Abstract

k-Clustering (e.g., k-median/k-means) is a basic task in data analysis and machine learning. However, classic clustering algorithms do not scale well to huge data sets. To address this, coresets were introduced as a powerful data reduction technique that turns a huge dataset into a tiny proxy. Moreover, coresets have been successfully applied to clustering in various settings, including streaming and distributed computing.

Coresets for k-clustering in Euclidean spaces have been studied extensively. However, very few results are known when the space is beyond Euclidean or the objective is more general than k-clustering. In this talk, I will review the classic results in Euclidean spaces and introduce a series of my recent works on coresets, including coresets for k-clustering in doubling spaces and in planar graphs, and generalized coresets for flexible and fair clustering.
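For readers new to the topic, the "tiny proxy" idea in the abstract can be illustrated with a minimal sketch. It is not one of the constructions covered in the talk: it assumes plain uniform sampling with weights n/m, which carries no worst-case guarantee (actual coreset constructions typically rely on importance sampling instead). The function names `kmedian_cost` and `uniform_sample_proxy` are hypothetical and chosen only for this illustration.

```python
import math
import random


def kmedian_cost(points, weights, centers):
    """Weighted k-median cost: sum of weight * distance to the nearest center."""
    return sum(
        w * min(math.dist(p, c) for c in centers)
        for p, w in zip(points, weights)
    )


def uniform_sample_proxy(points, m, seed=0):
    """Draw m points uniformly without replacement.

    Each sampled point gets weight n/m, so the total weight equals n.
    This is a naive baseline (an assumption of this sketch), not a
    coreset with a provable error bound.
    """
    rng = random.Random(seed)
    sample = rng.sample(points, m)
    weight = len(points) / m
    return sample, [weight] * m


if __name__ == "__main__":
    rng = random.Random(1)
    data = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(10000)]
    centers = [(0.0, 0.0), (2.0, 2.0)]  # an arbitrary candidate solution

    proxy, proxy_weights = uniform_sample_proxy(data, 200)
    full = kmedian_cost(data, [1.0] * len(data), centers)
    approx = kmedian_cost(proxy, proxy_weights, centers)
    print(f"full cost {full:.1f}, proxy cost {approx:.1f}")
```

A coreset in the sense of the abstract would additionally guarantee that the two costs agree up to a small relative error for every choice of centers, not just the one tried here.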

4. About the Speaker

Prof. Shaofeng Jiang is currently an assistant professor at Aalto University. Before joining Aalto, he was a postdoctoral researcher at the Weizmann Institute of Science, hosted by Robert Krauthgamer, from 2017 to 2020. He obtained his Ph.D. degree from the University of Hong Kong in 2017. His research interests lie in theoretical computer science, especially algorithms for massive data sets, approximation algorithms, and online algorithms. He is a recipient of an MSRA Fellowship Nomination Award and an Outstanding Achievements in Postdoctoral Research Prize at the Weizmann Institute of Science.

5. Host

Prof. Sun Yuqing (孙宇清)

6. Time

September 15, 14:00-15:00 (Tuesday, 2-3 p.m.)

7. Venue

Tencent Meeting, Meeting ID: 166 850 849

Join via link: https://meeting.tencent.com/s/Hpn9x4rs68LG

8. Organizer

School of Software, Shandong University