Study smarter with Fiveable
Get study guides, practice questions, and cheatsheets for all your subjects. Join 500,000+ students with a 96% pass rate.
When you're analyzing networks—whether social media platforms, transportation systems, or the web itself—the fundamental question is always: which nodes actually matter? Centrality measures give you the mathematical toolkit to answer that question, but here's the catch: "importance" means different things in different contexts. A node that spreads information fastest isn't necessarily the one controlling information flow, and the most connected node might not be the most influential. You're being tested on your ability to select the right centrality measure for the right question and to understand why each measure captures a different dimension of network importance.
These concepts connect directly to core themes in networked life: information cascades, network resilience, influence dynamics, and search algorithms. When an exam question asks about identifying influential users, finding network vulnerabilities, or explaining how Google ranks pages, you need to know which centrality measure applies and what it reveals. Don't just memorize formulas—know what structural property each measure captures and when you'd choose one over another.
The simplest way to measure importance is to count how many connections a node has. These measures focus on a node's immediate neighborhood in the network.
Compare: Degree Centrality vs. Katz Centrality—both measure connectivity, but degree only counts direct neighbors while Katz incorporates the entire path structure. If an FRQ asks about "reach" or "indirect influence," Katz is your answer.
Some nodes matter not because of how many connections they have, but because of where they sit in the network structure. These measures examine a node's position relative to all other nodes.
Compare: Closeness vs. Betweenness—closeness measures how fast you can reach others, while betweenness measures how much traffic passes through you. A node in the center of a cluster has high closeness; a node connecting two clusters has high betweenness.
Being connected to important nodes makes you more important. These recursive measures define centrality in terms of the centrality of a node's neighbors.
Compare: Eigenvector Centrality vs. PageRank—both weight connections by importance, but PageRank handles directed graphs and includes the "random surfer" model. For web networks or citation analysis, PageRank is the standard; for undirected social networks, eigenvector centrality applies.
Not all important nodes play the same role. The HITS algorithm recognizes that some nodes are valuable for what they link to, while others are valuable for who links to them.
Compare: PageRank vs. HITS—PageRank assigns one importance score per node, while HITS assigns two (hub and authority). HITS better captures networks where nodes play different functional roles, like the web where some pages aggregate links and others provide original content.
| Concept | Best Examples |
|---|---|
| Direct connection counting | Degree Centrality, Katz Centrality |
| Speed of information spread | Closeness Centrality |
| Control over information flow | Betweenness Centrality |
| Influence through important connections | Eigenvector Centrality, PageRank |
| Recursive importance weighting | Eigenvector Centrality, PageRank, Katz Centrality |
| Role differentiation in networks | Hub and Authority Scores (HITS) |
| Web search and ranking | PageRank, Hub and Authority Scores |
| Network vulnerability analysis | Betweenness Centrality |
A social network analyst wants to find users who could spread a rumor to the entire network in the fewest steps. Which centrality measure should they use, and why would degree centrality be insufficient?
Compare betweenness centrality and closeness centrality: both involve shortest paths, but what different structural properties do they capture? Give an example of a node that would score high on one but low on the other.
Why might a node with relatively low degree still have high eigenvector centrality? What does this reveal about network influence that simple connection counting misses?
If you were analyzing a citation network to find the most authoritative papers, would you use PageRank or HITS? Explain the tradeoff and when you might prefer the alternative.
A network engineer needs to identify which routers, if removed, would most disrupt communication across the network. Which centrality measure directly addresses this question, and what structural role do these critical nodes play?