Betwenness Centrality

Recall that, according to Freeman (1979), the key conceptual distinction between closeness and betweenness is between (pun intended) the capacity to reach others quickly (e.g., via the shortest paths) and the capacity to intermediate among those same paths. High-betweenness nodes control the flow of information in the network between other pairs of nodes in the network when this information or resource flows via shortest paths.¹

Let us begin by defining some basic concepts on which betweenness centrality measures are built.

Pairwise Dependency, One-Sided Dependency, and Betweenness

We can define the pair dependency \(\delta_{i(k)j}\) (Brandes 2001, 166) of any two nodes \(\{i, j\}\) on any third node \(k\) in the graph as follows:

\[ \delta_{i(k)j}= \frac{\sigma_{i(k)j}}{\sigma_{ij}} \tag{1}\]

Where the denominator of the fraction (\(\sigma_{ij}\)) is a count of the total number of shortest paths that start and end with nodes \(i\) and \(j\) and the numerator of the fraction (\(\sigma_{i(k)j}\)) is the subset of those paths that include node \(k\) as an inner node.²

Because Equation 1 is a ratio it can range from zero to one with everything in between. As such the pair dependency of a pair of nodes relative to a third has an intuitive interpretation as a probability, namely the probability that if you send something starting from \(i\) in order for it to get to \(j\) it has to go through \(k\). This probability is 1.0 if \(k\) stands in every shortest path between \(i\) and \(j\) and zero if \(k\) does not lie in any of the shortest paths linking \(i\) and \(j\).

We can also define the one-sided dependency of any one node \(i\) on any other node \(k\) in the graph as the sum of the pairwise dependencies on \(k\) that involve \(i\) across every other node \(j\): \[ \delta_{i|k} = \sum_j \delta_{i(k)j} \tag{2}\]

The betweenness centrality of a node \(k\) is then defined as the sum of the pair dependencies that every pair of nodes in the graph has on that node:

\[ C^{B}_k = \sum_{i,j} \frac{\sigma_{i(k)j}}{\sigma_{ij}} \tag{3}\]

Betweenness, Geodesic Distance and Shortest Paths

Let’s see an example of how to calculate betweenness centrality using real data. We first load our trusty Pulp Fiction data set from the networkdata package, which is an undirected graph of character scene co-appearances in the film:

    library(networkdata)
    library(igraph)
    library(stringr) #using stringr to change names from all caps to title case
    g <- movie_559
    V(g)$name <- str_to_title(V(g)$name)
    V(g)$name[which(V(g)$name == "Esmarelda")] <- "Esmeralda" #fixing misspelled name
    E(g)$weight <- 1 #setting edge weights to 1.0 (relevant for betweenness calculation)
    V(g)$name

 [1] "Brett"           "Buddy"           "Butch"           "Capt Koons"     
 [5] "Ed Sullivan"     "English Dave"    "Esmeralda"       "Fabienne"       
 [9] "Fourth Man"      "Gawker #2"       "Honey Bunny"     "Jimmie"         
[13] "Jody"            "Jules"           "Lance"           "Manager"        
[17] "Marsellus"       "Marvin"          "Maynard"         "Mia"            
[21] "Mother"          "Patron"          "Pedestrian"      "Preacher"       
[25] "Pumpkin"         "Raquel"          "Roger"           "Sportscaster #1"
[29] "Sportscaster #2" "The Gimp"        "The Wolf"        "Vincent"        
[33] "Waitress"        "Winston"         "Woman"           "Young Man"      
[37] "Young Woman"     "Zed"

To compute the components of the betweenness centrality defined earlier, we need two pieces of information.

First we need to know the length of the shortest path between every pair of nodes, so that we can separate the shortest paths from the other longer paths. As we saw in our discussion of closeness, this is stored in the geodesic distance matrix (\(\mathbf{G}\)) of dimensions \(N \times N\), because the length of the shortest path between a pair of nodes is also the geodesic distance between them. In igraph we can calculate the entries for \(\mathbf{G}\) using the distances function:

    G <- distances(g)
    G[1:10, 1:10]

             Brett Buddy Butch Capt Koons Ed Sullivan English Dave Esmeralda
Brett            0     2     1          2           2            2         2
Buddy            2     0     2          2           2            2         3
Butch            1     2     0          1           2            1         1
Capt Koons       2     2     1          0           2            2         2
Ed Sullivan      2     2     2          2           0            2         3
English Dave     2     2     1          2           2            0         2
Esmeralda        2     3     1          2           3            2         0
Fabienne         1     3     1          2           3            2         2
Fourth Man       2     2     2          2           2            2         3
Gawker #2        2     3     1          2           3            2         2
             Fabienne Fourth Man Gawker #2
Brett               1          2         2
Buddy               3          2         3
Butch               1          2         1
Capt Koons          2          2         2
Ed Sullivan         3          2         3
English Dave        2          2         2
Esmeralda           2          3         2
Fabienne            0          2         2
Fourth Man          2          0         3
Gawker #2           2          3         0

Second, we need to know the number of shortest paths linking every pair of nodes (as nodes can be linked by multiple distinct shortest paths). In igraph we can use the all_shortest_paths function to figure out this information.

For instance, let’s say we wanted to find the number of shortest paths between the pair of nodes “Young Man” (at the diner) and “The Pedestrian” (who appears when Butch encounters Marsellus on the street). To do that we can just type:

    ap <- all_shortest_paths(g, from = "Pedestrian", to = "Young Man")
    ap$vpaths

[[1]]
+ 5/38 vertices, named, from 9e7cc7a:
[1] Pedestrian  Butch       Jules       Honey Bunny Young Man  

[[2]]
+ 5/38 vertices, named, from 9e7cc7a:
[1] Pedestrian  Marsellus   Jules       Honey Bunny Young Man  

[[3]]
+ 5/38 vertices, named, from 9e7cc7a:
[1] Pedestrian  Butch       Vincent     Honey Bunny Young Man  

[[4]]
+ 5/38 vertices, named, from 9e7cc7a:
[1] Pedestrian  Marsellus   Vincent     Honey Bunny Young Man  

[[5]]
+ 5/38 vertices, named, from 9e7cc7a:
[1] Pedestrian Butch      Jules      Pumpkin    Young Man 

[[6]]
+ 5/38 vertices, named, from 9e7cc7a:
[1] Pedestrian Marsellus  Jules      Pumpkin    Young Man 

[[7]]
+ 5/38 vertices, named, from 9e7cc7a:
[1] Pedestrian Butch      Vincent    Pumpkin    Young Man 

[[8]]
+ 5/38 vertices, named, from 9e7cc7a:
[1] Pedestrian Marsellus  Vincent    Pumpkin    Young Man

The all_shortest_paths function has three arguments: The graph g, the from node (the starting node in the path count) and the to node (the end node in the path count).³

The function stores the results in a list object containing various sub-objects. The actual paths (containing the node ids) are in a list sub-object called vpaths, which is itself a list (with named vectors containing the indices of the nodes of each path as elements). The length of this list is the number of shortest paths between the young man at the diner and the pedestrian:

    length(ap$vpaths)

[1] 8

We can see that the list is of length eight, indicating that there are eight distinct shortest paths between the pedestrian and the young man.

If we wanted to verify the length of the shortest path between these two nodes, we could just type:

    G["Young Man", "Pedestrian"]

[1] 4

Which provides us with the relevant entry in the geodesic distance matrix, telling us that the young man is four steps away from the pedestrian in the network. This also means that each path will involve five nodes (counting the young man and the pedestrian, with three intermediary nodes), because the length of the path is the number of edges in it, and the length of a path is always \(n-1\) where \(n\) is the number of nodes in the path.

Note that we could have gotten the same information using the distances function in igraph specifying the v and to nodes:

    distances(g, v = "Young Man", to = "Pedestrian")

          Pedestrian
Young Man          4

Now, to populate the entries of the shortest paths matrix (\(\mathbf{S}\)), we can just loop through each pair of (non-adjacent) nodes in the graph and record the length of the vpaths object obtained when we run the all_shortest_paths function on that pair of nodes:

    n <- vcount(g)
    A <- as.matrix(as_adjacency_matrix(g))
    S <- matrix(1, n, n) #intializing shortest paths matrix
    diag(S) <- 0 #setting diagonals to zero (Brandes, 2008, p. 137)
    rownames(S) <- V(g)$name
    colnames(S) <- V(g)$name
    for (i in 1:n) {
        for (j in i:n) { #looping through upper triangle
            if (A[i,j] == 0) { #restrict to non-adjancent nodes 
                S[i,j] <- length(all_shortest_paths(g, from = i, to = j)$vpaths)
                }
            }
        }
    S[lower.tri(S)] <- t(S)[lower.tri(S)] #copying upper triangle into lower triangle
    S[1:10, 1:10]

             Brett Buddy Butch Capt Koons Ed Sullivan English Dave Esmeralda
Brett            1     1     1          2           1            3         1
Buddy            1     1     2          2           2            2         2
Butch            1     2     1          1           2            1         1
Capt Koons       2     2     1          1           2            3         1
Ed Sullivan      1     2     2          2           1            2         2
English Dave     3     2     1          3           2            1         1
Esmeralda        1     2     1          1           2            1         1
Fabienne         1     4     1          1           4            1         1
Fourth Man       2     1     2          1           1            1         2
Gawker #2        2     4     1          1           4            2         1
             Fabienne Fourth Man Gawker #2
Brett               1          2         2
Buddy               4          1         4
Butch               1          2         1
Capt Koons          1          1         1
Ed Sullivan         4          1         4
English Dave        1          1         2
Esmeralda           1          2         1
Fabienne            1          1         1
Fourth Man          1          1         4
Gawker #2           1          4         1

Once we have the information stored in the \(\mathbf{G}\) and \(\mathbf{S}\) matrices, we have all we need to compute the various quantities defined earlier.

Computing Pairwise Dependencies

Brandes (2001, 166) shows that we can take advantage of the fact that a node \(k\) can only lie on a shortest path between \(i\) and \(j\) if the sum of the geodesic distances between \(i\) and \(k\) and between \(k\) and \(j\) is equal to the geodesic distance between \(i\) and \(j\): \[ g_{ij} = g_{ik} + g_{kj} \tag{4}\]

If the above equality obtains, then the number of shortest paths between \(i\) and \(j\) that involve \(k\) as an inner node (the numerator of Equation 1) is simply \(s_{ik} \times s_{kj}\). This means that the pair dependency of \(i\) and \(j\) on \(k\) will be just:

\[ \delta_{i(k)j}= \frac{s_{ik} s_{kj}}{s_{ij}} \tag{5}\]

Where \(s_{ij}\) is the corresponding entry for \(i\) and \(j\) in the \(\mathbf{S}\) matrix (the total number of shortest paths between \(i\) and \(j\)).

This also means that if the above equality does not obtain and \(g_{ij} < g_{ik} + g_{kj}\), then we know for sure that the pair dependency of \(i\) and \(j\) on \(k\) is zero (\(k\) does not lie on any shortest path between \(i\) and \(j\)).

Using this approach, we can write a simple function to compute the pair dependency of any two nodes on a third node, using information stored in the \(\mathbf{G}\) and \(\mathbf{S}\) matrices:

    pair.dep <- function(a, b, c) {
        if (G[a, b] == G[a, c] + G[c, b]) {return((S[a, c]*S[c, b])/S[a, b])}
        else {return(0)}
        }

This function computes the pair dependency of the nodes entered as values of the a and b arguments on the node entered as the value of the c argument using the entries stored on the S matrix according to equation Equation 5.

We can test it out to compute the pair dependency of the young man and the pedestrian on Vincent:

    pair.dep("Pedestrian", "Young Man", "Vincent")

[1] 0.5

Which says that Vincent lies on 50% of the paths between the The Young Man and The Pedestrian (four out of eight).

Computing One-Sided Dependencies

As noted in Equation 2, the one-sided dependency of a node \(i\) on another node \(k\) is just the sum of the pairwise dependencies involving \(i\) as a source and \(k\) as an intermediary across every destination node \(j\). We can get these using a simple wrapper function over the pair.dep function:

    one.sided <- function(a, c) {
        od <- 0
        i <- which(V(g)$name == a)
        k <- which(V(g)$name == c)
        for (j in c(1:vcount(g))[-c(i, k)]) {od <- od + pair.dep(i, j, k)}
        return(od)
        }

So the one-sided dependence of the young man on Vincent is:

    one.sided("Young Man", "Vincent")

[1] 19

Which is the sum of the pair dependencies that involve the young man as a starting node and Vincent as an inner node.

Note that while the young man is highly dependent on Vincent to reach others, the reverse is not the case:

    one.sided("Vincent", "Young Man")

[1] 0

As the young man stands on none of the paths that link Vincent to other nodes in the graph.

Computing Betweenness

As noted in Equation 3, Vincent’s betweenness is just the sum of the pair dependencies of each pair of nodes in the graph on him. We could compute that quantity (not very efficiently) for any node in the graph using the following wrapper over the pair.dep function:

    bet.cent <- function(c) {
        n <- vcount(g)
        bc <- 0
        k <- which(V(g)$name == c)
        for (i in 1:n) {
            for (j in i:n) {
                if (i != k & j != k & A[i, j] == 0) {bc <- bc + pair.dep(i, j, k)}
            }
        }
    return(bc)
    }

And now to find Vincent’s betweenness:

    bet.cent("Vincent")

[1] 301

Which is the same result we would have gotten had we used the betweenness function in igraph specifying that we want the answer just for the Vincent node:

    betweenness(g, v = "Vincent")

Vincent 
    301

We can of course use the same igraph function to—efficiently, using Brandes’s (2001) algorithm—compute the betweenness centrality of each node in the graph stored in a vector of length \(N\):

    pulp.bet <- betweenness(g)

We should expect a character to have high betweenness in this network to the extent that they appear in scenes with characters who themselves don’t appear in any scenes together, thus inter-mediating between different parts of the story. Characters who only appear in one scene with some others (like The Wolf or The Gimp) are likely to be low in betweenness.

Using the information stored in the pulp.bet vector of betweenness centralities for each node, we can create a top ten table of betweenness for the Pulp Fiction network.

    library(kableExtra)
    top.10.bet <- sort(pulp.bet, decreasing = TRUE)[1:10]
    kbl(round(top.10.bet, 2), format = "pipe", align = c("l", "c"),
        col.names = c("Character", "Betweenness"),
        caption = "Top Ten Betweenness Characters in Pulp Fiction Network.") |> 
    kable_styling(bootstrap_options = c("hover", "condensed", "responsive"))

Top Ten Betweenness Characters in Pulp Fiction Network.
Character	Betweenness
Vincent	301.00
Butch	248.33
Jules	114.17
Maynard	70.00
Honey Bunny	49.50
Pumpkin	49.50
Sportscaster #1	36.00
Mia	33.50
Marsellus	26.67
Brett	2.67

Unsurprisingly, the three main characters in the story are also the highest in betweenness, with the already considered Vincent at the top of the list (that makes sense since Vincent intermediates between Butch and the rest of the story as he sadly found out in the toilet). Somewhat surprisingly, the main antagonist of the story (the pawn shop owner) is also up there. After that we see a big drop in the bottom five of the top ten.

Computing Normalized Betweenness

The betweenness function in igraph uses Freeman’s (1979) normalization for the betweenness centrality, which is equal to:

\[ C^{NB}_k = = \frac{2\left[\sum_{i,j} \frac{\sigma_{i(k)j}}{\sigma_{ij}}\right]}{(n-1)(n-2)} \tag{6}\]

Where the denominator of the fraction, \((n-1)(n-2)\), is equal to (two times) the betweenness a node would have if it was the center of a star graph of the same order as the observed network.

So for the Pulp Fiction network the normalized betweenness is equal to:

    n <- vcount(g)
    pulp.nbet <- (2*pulp.bet)/((n-1)*(n-2))
    top.10.nbet <- sort(pulp.nbet, decreasing = TRUE)[1:10]
    round(top.10.nbet, 4)

        Vincent           Butch           Jules         Maynard     Honey Bunny 
         0.4520          0.3729          0.1714          0.1051          0.0743 
        Pumpkin Sportscaster #1             Mia       Marsellus           Brett 
         0.0743          0.0541          0.0503          0.0400          0.0040

Which are the same scores we would obtain using the igraph function with the argument normalized set to TRUE:

    pulp.nbet <- betweenness(g, normalized = TRUE)
    top.10.nbet <- sort(pulp.nbet, decreasing = TRUE)[1:10]
    round(top.10.nbet, 4)

        Vincent           Butch           Jules         Maynard     Honey Bunny 
         0.4520          0.3729          0.1714          0.1051          0.0743 
        Pumpkin Sportscaster #1             Mia       Marsellus           Brett 
         0.0743          0.0541          0.0503          0.0400          0.0040

Betwenness in Directed Graphs

In contrast to closeness, there is no problem computing betweenness in the directed case. The reason is that if the graph is not strongly connected, and therefore there exists a pair of nodes \(i\) and \(j\) with no directed path from \(i\) to \(j\) (e.g., \(g_{ij}=\infty\)), then we just set \(\sigma_{ij} = 0\) in Equation 3.

Let us examine betweenness centrality in the directed young women lawyers advice network:

    g <- law_advice
    women <- which(V(g)$gender == 2) #selecting women
    wg <- subgraph(g, women)
    young <- which(V(wg)$age < 40) #selecting women under forty
    wg <- subgraph(wg, young)
    V(wg)$name <- 1:vcount(wg) #naming nodes
    w.bet <- betweenness(wg)
    round(w.bet, 3)

     1      2      3      4      5      6      7      8      9     10     11 
 0.000  3.000 16.333 11.000  7.000  0.000  0.000  5.000  0.000  0.333  1.000 
    12 
 0.333

Here we see that node 3 is the highest in betweenness, pictured in red in Figure 1. For comparison, the node with the highest closeness centrality is pictured in blue in the same figure.

Figure 1: Women lawyers advice network with highest closeness centrality node in blue and highest betweenness centrality node in red

This result makes sense. Node 3 intermediates all the connections linking the tightly knit group of nodes on the left side (6, 10, 11, 12) with the rest of the network. Also if nodes 5 and 7 need to pass something along to the rest, they have to use 3 at least half time. Node 4 also needs 3 to reach 6.

This result nicely illustrates the difference between closeness and betweenness.

Induced Betweenness

Borgatti and Everett (2020, 340–45) argue that another way of thinking about centrality of a node (or edge) is to calculate the difference that removing that node makes for some graph property in the network. They further suggest that the sum of the centrality scores of each node is just such a property, proposing that betweenness is particularly interesting in this regard. Let’s see how this works.

We will use the undirected version of the women lawyers advice network for this example. Let’s say we are interested in the difference that node 10 makes for the betweenness centralities of everyone else. In that case we would proceed as follows:

   bet <- betweenness(wg) #original centrality scores
   Sbet <- sum(bet) #sum of original centrality scores
   wg.d <- wg - vertex("10") #removing vertex 10 from the graph
   bet.d <- betweenness(wg.d) #centrality scores of node deleted subgraph
   Sbet.d <- sum(bet.d) #sum of centrality scores of node deleted subgraph
   total.c <- Sbet - Sbet.d #total centrality
   indirect.c <- total.c - bet[10] #indirect centrality
   indirect.c

      10 
5.666667

Line 1 just calculates the regular betweenness centrality vector for the graph. Line 2 sums up all of the entries of this vector. Line 3 creates a node deleted subgraph by removing node 10. This is done using the “minus” operator and the igraph function vertex, which takes a node id or name as input. Lines 4-5 just recalculate the sum of betweenness centralities in the subgraph that excludes node 10. Then in line 6 we subtract the sum of centralities of the node deleted subgraph from the sum of centralities of the original graph. If this number, which Borgatti and Everett call the “total” centrality, is large and positive then that means that node 10 makes a difference for the centrality of others.

However, part of that difference is node 10’s own “direct” centrality, so to get a more accurate sense of node 10’s impact on other people’s centrality we need to subtract node 10’s direct centrality from the total number, which we do in line 7 to get node 10’s “indirect” centrality. The result is shown in the last line, which indicates that node 10 has a pretty big impact on other people’s betweenness centralities, net of their own (which is pretty small).

Now all we need to do is do the same for each node to create a vector of indirect betweenness centralities. So we incorporate the code above into a short loop through all vertices:

   total.c <- 0 #empty vector
   indirect.c <- 0 #empty vector
   for (i in 1:vcount(wg)) {
      wg.d <- wg - vertex(i)
      bet.d <- betweenness(wg.d) #centrality scores of node deleted subgraph
      Sbet.d <- sum(bet.d) #sum of centrality scores of node deleted subgraph
      total.c[i] <- Sbet - Sbet.d #total centrality
   indirect.c[i] <- total.c[i] - bet[i] #total minus direct
   }

We can now list the total, direct, and indirect betweenness centralities for the women lawyers graph using a nice table:

   i.bet <- data.frame(n = 1:vcount(wg), total.c, round(betweenness(wg), 1), round(indirect.c, 1))
   kbl(i.bet, format = "pipe", align = c("l", "c", "c", "c"),
       col.names = c("Node", "Total", "Direct", "Indirect"), row.names = FALSE,
       caption = "Induced Betweenness Scores in the Women Lawyers Advice Network") %>% 
   kable_styling(bootstrap_options = c("hover", "condensed", "responsive"))

Induced Betweenness Scores in the Women Lawyers Advice Network
Node	Total	Direct	Indirect
1	16	0.0	16.0
2	14	3.0	11.0
3	27	16.3	10.7
4	26	11.0	15.0
5	17	7.0	10.0
6	8	0.0	8.0
7	12	0.0	12.0
8	3	5.0	-2.0
9	4	0.0	4.0
10	6	0.3	5.7
11	7	1.0	6.0
12	6	0.3	5.7

This approach to decomposing betweenness centrality provides a new way to categorize actors in a network:

On the one hand, we have actors like nodes 3 and 4 who “hog” centrality from others. Perhaps these are the prototypical high betweenness actors who monopolize the flow through the network. Their own direct centrality is high, but their indirect centrality is negative, suggesting that others become more central when they are removed from the graph as they can now become intermediaries themselves.
In contrast, we also have actors like node 5 who are high centrality themselves, but who’s removal from the network does not affect anyone else’s centrality. These actors are high betweenness but themselves don’t monopolize the flow of information in the network.
Then we have actors (like nodes 9-12) who have low centrality, but whose removal from the network makes a positive difference for other people’s centrality, which overall decreases when they are removed from the network.
Finally, we have actors line nodes 2 and 8, who are not particularly central, but who also hog centrality from others, in that removing them from the network also increases other people’s centrality (although not such an extent as the hogs).

References

Borgatti, Stephen P, and Martin G. Everett. 2020. “Three Perspectives on Centrality.” In The Oxford Handbook of Social Networks, edited by R. Light and J. Moody, 334--351. Sage Publications.

Brandes, Ulrik. 2001. “A Faster Algorithm for Betweenness Centrality.” Journal of Mathematical Sociology 25 (2): 163–77.

———. 2008. “On Variants of Shortest-Path Betweenness Centrality and Their Generic Computation.” Social Networks 30 (2): 136–45.

Freeman, Linton C. 1979. “Centrality in Social Networks Conceptual Clarification.” Social Networks 1: 215–39.

Footnotes

The lesson on random walk centrality concepts covers variations of centrality that relax the assumption that information or resources flow strictly via shortest paths.↩︎
As Brandes (2008, 137) notes, when \(i = j\), then \(\sigma_{ij} = 1\) and when \(k \in \{i, j\}\), then \(\sigma_{i(k)j} = 0\).↩︎
In an undirected graph you get the same result for a pair of nodes regardless of which one you put on the from or to slots.↩︎