from PART II - MINING MULTILAYER NETWORKS
Published online by Cambridge University Press: 05 July 2016
There was a darkness; then a dizzy, sickening sensation of sight that was not like seeing; I saw a Line that was no Line.
– The SquareIn the previous chapter, we focused on actors: how to group them into communities. In this chapter, we focus on edges.
One of the most popular data mining tasks for social networks is link (or edge) prediction: given the current status of a social network, which edges are likely to appear at a later time? This problem was originally defined for single-layer networks and is related to the topic of network evolution that we present in Chapter 8. In both cases, there is an underlying assumption about the social dynamics that lead to the appearance of new edges, for example, the fact that actors with many edges have a higher probability of receiving even more edges, or the fact that actors with many common neighbors have a higher probability of becoming neighbors themselves. However, with social network evolution models, the objective is typically the generation of synthetic social networks having some desired global properties, for example, a given clustering coefficient or degree distribution. In the case of edge prediction, the objective is to infer local edges, that is, to predict if an edge is going to appear between two given nodes, or to rank currently disconnected pairs of nodes according to the likelihood that they will become connected.We discuss the edge prediction problem in Section 7.1.
Although edge prediction for multilayer social networks can be seen as a direct extension of existing methods for single-layer networks, a specific kind of knowledge that only makes sense when we have multiple layers is the existence of layer correlations. In Chapter 2 we described different measures to detail the relationship between some predefined layers. However, when many layers exist, we may need automated methods to identify the specific combinations of layers showing significant correlations. We cover this problem in Section 7.2.
Edge Prediction
Edge prediction, as defined by Liben-Nowell and Kleinberg (2003), addresses the following question: which edges are going to be present at time t1 given the edges existing at a previous time t0? Figure 7.1 shows the work layer of our running example at two different points in time: in the figure, three new edges appear and one disappears at time t1.
To save this book to your Kindle, first ensure no-reply@cambridge.org is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
Find out more about the Kindle Personal Document Service.
To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.
To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.