报告题目：Data Science for Deep Learning
In recent years, deep learning (DL) has significantly penetrated and has been widely adopted in various fields of application, including facial recognition, strategy games (AlphaGo and Texas hold'em) and question answering. However, the effectiveness of the models and efficiency of the training process strongly depend on how well the associated data is managed. It is very challenging to train an effective deep learning-based image classifier without properly labelled training data. Furthermore, training efficiency is severely affected by a large amount of training data, complex structures of the models and tones of hyper parameters. A lack of validation for result data and explanation also seriously affect the applicability of trained models. In this talk, I will discuss three important issues related to data science for AI: 1) how to prepare data for effective DL, which includes data extraction and integration as well as data labelling; 2) how to optimize DL training, including data compression and computation graph optimization; and 3) how to conduct explanation to make the model robust and transparent. Some future work will be highlighted at the end. At then enf of my talk, I will give a brief introduction of Data Science and Analytic Program in HKUST (GZ) and Postgraduate recruitment information of the DSA program.
Lei Chen has BS degree in computer science and engineering from Tianjin University, Tianjin, China, MA degree from Asian Institute of Technology, Bangkok, Thailand, and PhD in computer science from the University of Waterloo, Canada. He is a chair professor in the Department of Computer Science and Engineering, Hong Kong University of Science and Technology (HKUST). Currently, Prof. Chen serves as the head of Data Science and Analytic trust at HKUST (GZ), director of Big Data Institute at HKUST, director of HKUST MOE/MSRA Information Technology Key Laboratory. Prof. Chen’s research interests include human-powered machine learning, crowdsourcing, Blockchain, graph data analysis, probabilistic and uncertain databases and time series and multimedia databases. Prof. Chen got the SIGMOD Test-of-Time Award in 2015 and Best Research Paper Award in VLDB 2022 .The system developed by Prof. Chen’s team won the excellent demonstration award in VLDB 2014. Prof. Chen has served as VLDB 2019 PC Co-chair and Editor-in-Chief of VLDB Journal. Currently, Prof. Chen serves as Editor-in-Chief of IEEE Transaction on Data and Knowledge Engineering and PC Co-chairs of IEEE Conference on Data Engineering (ICDE 2023). He is an IEEE Fellow, ACM Distinguished Member and an executive member of the VLDB endowment.