Watts–Strogatz model


The Watts–Strogatz model is a random graph generation model that produces graphs with small-world properties, including short average path lengths and high clustering. It was proposed by Duncan J. Watts and Steven Strogatz in their joint 1998 Nature paper. The model also became known as the beta model after Watts used to formulate it in his popular science book .

Rationale for the model

The formal study of random graphs dates back to the work of Paul Erdős and Alfréd Rényi. The graphs they considered, now known as the classical or Erdős–Rényi graphs, offer a simple and powerful model with many applications.
However the ER graphs do not have two important properties observed in many real-world networks:
  1. They do not generate local clustering and triadic closures. Instead, because they have a constant, random, and independent probability of two nodes being connected, ER graphs have a low clustering coefficient.
  2. They do not account for the formation of hubs. Formally, the degree distribution of ER graphs converges to a Poisson distribution, rather than a power law observed in many real-world, scale-free networks.
The Watts and Strogatz model was designed as the simplest possible model that addresses the first of the two limitations. It accounts for clustering while retaining the short average path lengths of the ER model. It does so by interpolating between a randomized structure close to ER graphs and a regular ring lattice. Consequently, the model is able to at least partially explain the "small-world" phenomena in a variety of networks, such as the power grid, neural network of C. elegans, networks of movie actors, or fat-metabolism communication in budding yeast.

Algorithm

Given the desired number of nodes, the mean degree , and a special parameter, satisfying and, the model constructs an undirected graph with nodes and edges in the following way:
  1. Construct a regular ring lattice, a graph with nodes each connected to neighbors, on each side. That is, if the nodes are labeled , there is an edge if and only if
  2. For every node take every edge connecting to its rightmost neighbors, that is every edge with, and rewire it with probability. Rewiring is done by replacing with where is chosen uniformly at random from all possible nodes while avoiding self-loops and link duplication.

    Properties

The underlying lattice structure of the model produces a locally clustered network, while the randomly rewired links dramatically reduce the average path lengths. The algorithm introduces about of such non-lattice edges. Varying makes it possible to interpolate between a regular lattice and a structure close to an Erdős–Rényi random graph with at. It does not approach the actual ER model since every node will be connected to at least other nodes.
The three properties of interest are the average path length, the clustering coefficient, and the degree distribution.

Average path length

For a ring lattice, the average path length is and scales linearly with the system size. In the limiting case of, the graph approaches a random graph with, while not actually converging to it. In the intermediate region, the average path length falls very rapidly with increasing, quickly approaching its limiting value.

Clustering coefficient

For the ring lattice the clustering coefficient, and so tends to as grows, independently of the system size. In the limiting case of the clustering coefficient is of the same order as the clustering coefficient for classical random graphs, and is thus inversely proportional to the system size. In the intermediate region the clustering coefficient remains quite close to its value for the regular lattice, and only falls at relatively high. This results in a region where the average path length falls rapidly, but the clustering coefficient does not, explaining the "small-world" phenomenon.

Degree distribution

The degree distribution in the case of the ring lattice is just a Dirac delta function centered at. The degree distribution for can be written as,
where is the number of edges that the node has or its degree. Here , and. The shape of the degree distribution is similar to that of a random graph and has a pronounced peak at and decays exponentially for large. The topology of the network is relatively homogeneous, meaning that all nodes are of similar degree.

Limitations

The major limitation of the model is that it produces an unrealistic degree distribution. In contrast, real networks are often scale-free networks inhomogeneous in degree, having hubs and a scale-free degree distribution. Such networks are better described in that respect by the preferential attachment family of models, such as the Barabási–Albert model.
The Watts and Strogatz model also implies a fixed number of nodes and thus cannot be used to model network growth.