Graph metrics
RELISON integrates the following graph metrics.
Average shortest path length
As its name indicates, this metric just computes the average shortest path length of a graph. In order to deal with infinite distance edges, we propose two options:
Option 1: We only average over those pairs of users at finite distance.
Option 2: We obtain the average shortest path length for each strongly connected component of the network, and then, we average the result for the different components.
Parameters
mode
: “Non infinite distances” for option 1, “Components” for option 2.
Configuration file
ASL:
type: graph
params:
mode:
type: string
values: ["Non infinite distances", "Components"]
Average reciprocal shortest path length
This metric computes the harmonic mean of the distances between pairs of users:
where \(\delta(u,v)\) represents the distance between a pair of users.
Configuration file
ARSL:
type: graph
params:
mode:
type: string
values: ["Non infinite distances", "Components"]
Clustering coefficient
The global clustering coefficient of a social network graph measures the proportion of closed triads in the network:
Parameters
uSel
: selection of the orientation for the first of the two links to consider in the triad (the (u,v) one).IN
: we take the (v,u) link.OUT
: we take the (u,v) link.UND
: we take either the (u,v) or the (v,u) link (the one that exists)MUTUAL
: we only consider that the pair (u,v) exists if both (u,v) and (v,u) links exist.
vSel
: selection of the orientation for the second of the two links in the triad (the (v,w) one).IN
: we take the (v,w) linkOUT
: we take the (w,v) link.UND
: we take either the (w,v) or the (v,w) link (the one that exists)MUTUAL
: we only consider that the pair (u,v) exists if both (u,v) and (v,u) links exist.
The natural clustering coefficient can be chosen by taking uSel = IN
and vSel = OUT
.
Configuration file
The configuration file for the original method is the following
Clustering coefficient:
type: graph
params:
uSel:
type: orientation
values: [IN/OUT/UND/MUTUAL]
vSel:
type: orientation
values: [IN/OUT/UND/MUTUAL]
Clustering coefficient complement
This metric is the complement of the clustering coefficient, as it takes measures the proportion of the open triads in the network.
Parameters
uSel
: selection of the orientation for the first of the two links to consider in the triad (the (u,v) one).IN
: we take the (v,u) link.OUT
: we take the (u,v) link.UND
: we take either the (u,v) or the (v,u) link (the one that exists)MUTUAL
: we only consider that the pair (u,v) exists if both (u,v) and (v,u) links exist.
vSel
: selection of the orientation for the second of the two links in the triad (the (v,w) one).IN
: we take the (v,w) linkOUT
: we take the (w,v) link.UND
: we take either the (w,v) or the (v,w) link (the one that exists)MUTUAL
: we only consider that the pair (u,v) exists if both (u,v) and (v,u) links exist.
The natural clustering coefficient can be chosen by taking uSel = IN
and vSel = OUT
.
Configuration file
The configuration file for the original method is the following
Clustering coefficient complement:
type: graph
params:
uSel:
type: orientation
values: [IN/OUT/UND/MUTUAL]
vSel:
type: orientation
values: [IN/OUT/UND/MUTUAL]
Degree assortativity
The degree assortativity measures to what extent users create links towards similar users in terms of their degree (i.e. if users with small degree create links towards users with small degrees and users with large degree create links towards users with large degree) or not.
In undirected networks, it is computed as:
Reference : M.E.J. Newman. Mixing patterns in networks. Physical Review E, 67 026126 (2003)
Parameters
orientation
: selection for the degree to use.IN
: we take the incoming neighbors of the users.OUT
: we take the outgoing neighbors of the users.UND
: we take the incoming and outgoing neighbors of the users.MUTUAL
: we take those neighbors who are both incoming and outgoing at the same time.
Configuration file
The configuration file for the original method is the following
Degree assortativity:
type: graph
params:
orientation:
type: orientation
values: [IN/OUT/UND/MUTUAL]
Degree Pearson correlation
The degree assortativity measures the Pearson correlation of the degrees between the origin and destination endpoints of the nodes.
Parameters
uSel
: selection of the orientation for the neighborhood of the starting node of the edges. This allows the following values:IN
: we take the incoming neighbors of the users.OUT
: we take the outgoing neighbors of the users.UND
: we take the incoming and outgoing neighbors of the users.MUTUAL
: we take those neighbors who are both incoming and outgoing at the same time.
vSel
: selection of the orientation for the neighborhood of the ending node of the edges. This allows the following values:IN
: we take the incoming neighbors of the users.OUT
: we take the outgoing neighbors of the users.UND
: we take the incoming and outgoing neighbors of the users.MUTUAL
: we take those neighbors who are both incoming and outgoing at the same time.
Configuration file
The configuration file for the original method is the following
Degree Pearson:
type: graph
params:
uSel:
type: orientation
values: [IN/OUT/UND/MUTUAL]
vSel:
type: orientation
values: [IN/OUT/UND/MUTUAL]
Degree Gini complement
The degree Gini complement indicates how balanced the degree distribution of the network is. Values close to one indicate that the degree distribution is flat, whereas values close to 0 show that a few users concentrate all the links in the network.
where \(\Gamma(u)\) is the neighborhood of user \(u\) and \(u_i\) is the i-th node in the network with the smaller degree.
Parameters
orientation
: selects the type of degree we use (only affects directed networks).IN
: we take the in-degree of the users.OUT
: we take the out-degree of the users.UND
: we take the undirected degree of the users (in-degree + out-degree)MUTUAL
: we take as the degree the number of mutual links.
Configuration file
The configuration file for the original method is the following
Degree Gini Complement:
type: graph
params:
orientation:
type: orientation
values: [IN/OUT/UND/MUTUAL]
Density
The density of a network measures the proportion of the possible number of edges between nodes which exist in the network. This metric does not consider selfloops.
Configuration file
The configuration file for the original method is the following
Density:
type: graph
Diameter
The diameter of a network measures the maximum (finite) distance between two users in the network.
It is equivalent to the maximum eccentricity of the network.
Configuration file
The configuration file for the original method is the following
Diameter:
type: graph
Edge Gini complement
The edge Gini complement computes how balanced the number of links between different pairs of user is. This metric has only sense over multigraphs, where multiple links between users are allowed. The metric formulation is similar to:
where \((u,v)_i\) is the i-th pair of users with an smaller number of links.
We differentiate three variants:
Inter edge Gini complement: This metric does not consider the selfloops between the users. It takes the previous equation (considering that \(E\) does not have selfloops).
Semi-complete edge Gini complement: This metric stores selfloops as a different category for the Gini index, i.e. we add an element to the sum, counting the total number of selfloops in the network.
Complete edge Gini complement: This metric considers selfloops. In the previous equation, we would just need to substitute \(|\mathcal{U}|(|\mathcal{U}|-1)\) by \(|\mathcal{U}|^2\) when it appears.
Configuration file
The configuration for the inter edge Gini complement is:
Inter edge Gini complement:
type: graph
For the semi-complete variant is:
Semi-complete edge Gini complement:
type: graph
and, finally, the version considering selfloops is:
Complete edge Gini complement:
type: graph
Infinite distances
This metric measures the number of node pairs which do not have a path between the first and the second in the network.
Configuration file
Infinite distances:
type: graph
Radius
If we consider the eccentricity values of all the users (i.e. the maximum finite distance between a user and the rest of the network), the radius represents its minimum value.
where \(\delta(u,v)\) represents the distance between two users.
Configuration file
Radius:
type: graph
Reciprocal average eccentricity
This metric computes the inverse value of the average eccentricity of the network.
If we consider the eccentricity values of all the users (i.e. the maximum finite distance between a user and the rest of the network), the radius represents its minimum value.
- References:
Sanz-Cruzado, S.M. Pepa, P. Castells. Structural novelty and diversity in link prediction. 9th International Workshop on Modeling Social Media (MSM 2018) at The Web Conference (WWW 2018). The Web Conference Companion, pp. 1347–1351.
Sanz-Cruzado, P. Castells. Beyond Accuracy in Link Prediction. BIAS 2020: Bias and Social Aspects in Search and Recommendation, pp 79-94.
Configuration file
Reciprocal average eccentricity:
type: graph
Reciprocal diameter
This metric computes the inverse value of the diameter (see Diameter), so, when distances are reduced among the users in the network, the value of this metric increases.
If we consider the eccentricity values of all the users (i.e. the maximum finite distance between a user and the rest of the network), the radius represents its minimum value.
where \(\delta(u,v)\) represents the distance between two users.
- References:
Sanz-Cruzado, S.M. Pepa, P. Castells. Structural novelty and diversity in link prediction. 9th International Workshop on Modeling Social Media (MSM 2018) at The Web Conference (WWW 2018). The Web Conference Companion, pp. 1347–1351.
Sanz-Cruzado, P. Castells. Beyond Accuracy in Link Prediction. BIAS 2020: Bias and Social Aspects in Search and Recommendation, pp 79-94.
Configuration file
Reciprocal diameter:
type: graph
Reciprocity rate
This metric computes the proportion of the edges in the network which are reciprocal.
where \(\delta(u,v)\) represents the distance between two users.
- References:
Sanz-Cruzado, S.M. Pepa, P. Castells. Structural novelty and diversity in link prediction. 9th International Workshop on Modeling Social Media (MSM 2018) at The Web Conference (WWW 2018). The Web Conference Companion, pp. 1347–1351.
Sanz-Cruzado, P. Castells. Beyond Accuracy in Link Prediction. BIAS 2020: Bias and Social Aspects in Search and Recommendation, pp 79-94.
Configuration file
Reciprocity:
type: graph