CLUSTERING COEFFICIENT

 

Definition

In graph theory, a clustering coefficient is a measure of the degree to which nodes in a graph tend to cluster together.

Evidence suggests that in most real-world networks, and in particular social networks, nodes tend to create tightly knit groups characterised by a relatively high density of connections; this likelihood tends to be greater than the average probability. of a tie randomly established between two nodes.

 

Types of Clustering Coefficients

There are two types of Clustering Coefficients:

Global Clustering Coefficient

The Global Clustering Coefficient is based on triplets of nodes:
A triplet is defined as an undirected three nodes connected by: either two (open triplet) or three (closed triplet)

A triangle graph includes:

Local Clustering Coefficient

Definition by Example

clustering coefficient of the 🔵 is computed as the proportion of connections among its neighbours which are actually realised compared with the number of all possible connections.

 

center: the 🔵 has 3 neighbours, hence it can have a maximum of 3 connections as indicated by the 3 blue solid lines,
hence local clustering coefficient = 1

right: the 🔵 has only 1 connection realised as indicated by the 1 blue solid lines, and & the other 2 missing connections as indicated by 2 orange dotted lines, hence local clustering coefficient =

left: none of the possible connections among the neighbours of the 🔵 are realised, as indicated by the 3 orange dotted lines, hence local clustering coefficient = 0

 

Definition

The Local Clustering Coefficient of a vertex (node) in a graph quantifies the closeness of its neighbours are to being a clique viz: a complete graph

A graph G =(V, E) is defined as a set of vertices V & a set of edges E between these vertices.

An edge, e ij connects vertex vi with vj

ki = number of vertices

|Ni|, in the neighborhood Ni, of a vertex vi

The neighbourhood, Nij for a vertex vi is defined as its immediate connected neighbors as:

N i = { vj: eij∊E ∨ eji∊E}

The Local Clustering Coefficient, Ci, in the neighbourhood, of a vertex vi :

         proportion of links between the vertices within its neighbourhood
Ci= ___________________________________________________________________
               number of links that could possibly exist between them

Undirected Graph

In an undirected graph, eij≡eji    ki(ki-1)
If a vertex vi has ki neighbors & __________ edges
                                                               2
∴ the Local Clustering Coefficient, Ci

           2|{ ejk: vj, vk∊Ni, ∨ eji∊E}|
Ci= ________________________________________
                     ki(ki-1)

 

Directed Graph

directed graph, is a distinct form, whereby: ∨ neighbourhood links, ∵ among the vertices within the neighbourhood = the number of neighbours of a vertex.

In a directed graph, eij≠eji

neighborhood Ni ∃ ki(ki -1) links vertices in that neighborhood.

∴ the Local Clustering Coefficient, Ci

           |{ ejk: vj, vk∊Ni, ∨ eji∊E}|
Ci= ______________________________________
                     kI(kI-1)

Average Clustering Coefficient

As an alternative to Global Clustering Coefficient, the overall level of clustering in a network can also be measured by the average of the Local Clustering Coefficients of all the vertices n

          1  n
C=__Ci
         n  i=1