Community detection: effective evaluation on large social networks

Files in This Item:
File Description SizeFormat 
insight_publication.pdf2.23 MBAdobe PDFDownload
Title: Community detection: effective evaluation on large social networks
Authors: Lee, Conrad
Cunningham, Pádraig
Permanent link:
Date: 2014
Abstract: While many recently proposed methods aim to detect network communities in large datasets, such as those generated by social media and telecommunications services, most evaluation (i.e. benchmarking) of this research is based on small, hand-curated datasets. We argue that these two types of networks differ so significantly that, by evaluating algorithms solely on the smaller networks, we know little about how well they perform on the larger datasets. Recent work addresses this problem by introducing social network datasets annotated with meta-data that is believed to approximately indicate a 'ground truth' set of network communities. While such efforts are a step in the right direction, we find this meta-data problematic for two reasons. First, in practice, the groups contained in such meta-data may only be a subset of a network’s communities. Second, while it is often reasonable to assume that meta-data is related to network communities in some way, we must be cautious about assuming that these groups correspond closely to network communities. Here, we consider these difficulties and propose an evaluation scheme based on a classification task that is tailored to deal with them.
Type of material: Journal Article
Publisher: Oxford University Press
Copyright (published version): 2013 the Authors
Keywords: Machine learningStatisticsSocial networksCommunity detectionEvaluationBenchmarking
DOI: 10.1093/comnet/cnt012
Language: en
Status of Item: Peer reviewed
Appears in Collections:Computer Science Research Collection
Clique Research Collection
Insight Research Collection

Show full item record

Citations 20

Last Week
Last month
checked on Aug 17, 2018

Download(s) 50

checked on May 25, 2018

Google ScholarTM



This item is available under the Attribution-NonCommercial-NoDerivs 3.0 Ireland. No item may be reproduced for commercial purposes. For other possible restrictions on use please refer to the publisher's URL where this is made available, or to notes contained in the item itself. Other terms may apply.