Social Network Analysis by Compression: Not Only Space Saving, but also Insight Gaining

Abstract:
Do you know compressing a social network may provide significant insights into the structure of the network? In this talk, I will argue that compressing a large social network not only helps us to save storage space, but the compressibility of nodes and subgraphs indeed offers a meaningful measurement for social network analysis. Compressing social networks effectively and efficiently is very challenging. I will advocate a systematic framework of social network compression and analysis using multi-position linearization, and discuss the challenges from the theoretical point of view. Moreover, I will present our latest progress on lossless compression methods. On the compression side, our methods support efficient neighborhood query answering without decomposition. On the social network analysis side, our methods can help to improve community detection.
Bio:
Jian Pei is a Professor at the School of Computing Science, Simon Fraser University, Canada. He is interested in researching, developing, and deploying effective and efficient data analysis techniques for novel data intensive applications, including data mining, Web search, data warehousing and online analytic processing, database systems, and their applications in social networks and media, health-informatics, business and bioinformatics. His research has been extensively supported in part by governmental funding agencies and industry partners. He is also active in developing industry relations and collaboration, transferring technologies developed in his group to industry applications, and developing proof-of-concept prototypes. Since 2000, he has published 1 textbook, 2 monographs and over 170 research papers in refereed journals and conferences, which have been cited thousands of times. He has served in the organization committees and the program committees of over 160 international conferences and workshops. He is the associate editor-in-chief of IEEE Transactions of Knowledge and Data Engineering (TKDE), and an associate editor or editorial board member of the premier academic journals in his fields, including ACM Transactions on Knowledge Discovery from Data (TKDD), Data Mining and Knowledge Discovery, Knowledge and Information Systems, Statistical Analysis and Data Mining, Intelligent Data Analysis, and Journal of Computer Science and Technology. He is an ACM Distinguished Speaker, and a senior member of ACM and IEEE.