The breadth-first search algorithm is commonly defined by the following process:
Initialize a queue of graph vertices, with the starting vertex of the graph as the queue's only element.
While the queue is non-empty, remove a vertex from the queue, and add to the queue all the other vertices that can be reached by an edge from that have not already been added in earlier steps.
However, rather than defining the vertex to choose at each step in an imperative way as the one produced by the dequeue operation of a queue, one can define the same sequence of vertices declaratively by the properties of these vertices. That is, a standard breadth-first search is just the result of repeatedly applying this rule:
Repeatedly output a vertex, choosing at each step a vertex that has not already been chosen and that has a predecessor as early in the output as possible.
In some cases, this ordering of vertices by the output positions of their predecessors may have ties — two different vertices have the same earliest predecessor. In this case, the order in which those two vertices are chosen may be arbitrary. The output of lexicographic breadth-first search differs from a standard breadth-first search in having a consistent rule for breaking such ties. In lexicographic breadth-first search, the output ordering is the order that would be produced by the rule:
Repeatedly output a vertex, choosing at each step a vertex that has not already been chosen and whose entire set of already-output predecessors is as small as possible in lexicographic order.
So, when two vertices and have the same earliest predecessor, earlier than any other unchosen vertices, the standard breadth-first search algorithm will order them arbitrarily. Instead, in this case, the LexBFS algorithm would choose between and by the output ordering of their second-earliest predecessors. If only one of them has a second-earliest predecessor that has already been output, that one is chosen. If both and have the same second-earliest predecessor, then the tie is broken by considering their third-earliest predecessors, and so on. Applying this rule directly by comparing vertices according to this rule would lead to an inefficient algorithm. Instead, the lexicographic breadth-first search uses a set partitioning data structure in order to produce the same ordering more efficiently, just as a standard breadth-first search uses a queue data structure to produce its ordering efficiently.
Algorithm
The lexicographic breadth-first search algorithm replaces the queue of vertices of a standard breadth-first search with an ordered sequence of sets of vertices. The sets in the sequence form a partition of the remaining vertices. At each step, a vertex v from the first set in the sequence is removed from that set, and if that removal causes the set to become empty then the set is removed from the sequence. Then, each set in the sequence is replaced by two subsets: the neighbors of v and the non-neighbors of v. The subset of neighbors is placed earlier in the sequence than the subset of non-neighbors. In pseudocode, the algorithm can be expressed as follows:
Initialize a sequence Σ of sets, to contain a single set containing all vertices.
Initialize the output sequence of vertices to be empty.
While Σ is non-empty:
*Find and remove a vertex v from the first set in Σ
*If the first set in Σ is now empty, remove it from Σ
*Add v to the end of the output sequence.
*For each edge v-w such that w still belongs to a set S in Σ:
**If the set S containing w has not yet been replaced while processing v, create a new empty replacement setT and place it prior to S in the sequence; otherwise, let T be the set prior to S.
**Move w from S to T, and if this causes S to become empty remove S from Σ.
Each vertex is processed once, each edge is examined only when its two endpoints are processed, and each iteration of the inner loop takes only constant time. Therefore, like simpler graph search algorithms such as breadth-first search and depth first search, this algorithm takes linear time. The algorithm is called lexicographic breadth-first search because the order it produces is an ordering that could also have been produced by a breadth-first search, and because if the ordering is used to index the rows and columns of an adjacency matrix of a graph then the algorithm sorts the rows and columns into lexicographical order.
Applications
Chordal graphs
A graph G is defined to be chordal if its vertices have a perfect elimination ordering, an ordering such that for any vertex v the neighbors that occur later in the ordering form a clique. In a chordal graph, the reverse of a lexicographic ordering is always a perfect elimination ordering. Therefore, one can test whether a graph is chordal in linear time by the following algorithm:
Use lexicographic breadth-first search to find a lexicographic ordering of G
For each vertex v:
*Let w be the neighbor of v occurring prior to v, as close to v in the sequence as possible
**
*If the set of earlier neighbors of v is not a subset of the set of earlier neighbors of w, the graph is not chordal
If the loop terminates without showing that the graph is not chordal, then it is chordal.
This application was the original motivation that led to develop the lexicographic breadth first search algorithm.
Graph coloring
A graph G is said to be perfectly orderable if there is a sequence of its vertices with the property that, for any induced subgraph of G, a greedy coloring algorithm that colors the vertices in the induced sequence ordering is guaranteed to produce an optimal coloring. For a chordal graph, a perfect elimination ordering is a perfect ordering: the number of the color used for any vertex is the size of the clique formed by it and its earlier neighbors, so the maximum number of colors used is equal to the size of the largest clique in the graph, and no coloring can use fewer colors. An induced subgraph of a chordal graph is chordal and the induced subsequence of its perfect elimination ordering is a perfect elimination ordering on the subgraph, so chordal graphs are perfectly orderable, and lexicographic breadth-first search can be used to optimally color them. The same property is true for a larger class of graphs, the distance-hereditary graphs: distance-hereditary graphs are perfectly orderable, with a perfect ordering given by the reverse of a lexicographic ordering, so lexicographic breadth-first search can be used in conjunction with greedy coloring algorithms to color them optimally in linear time.
Other applications
describe an extension of lexicographic breadth-first search that breaks any additional ties using the complement graph of the input graph. As they show, this can be used to recognize cographs in linear time. describe additional applications of lexicographic breadth-first search including the recognition of comparability graphs and interval graphs.
LexBFS ordering
An enumeration of the vertices of a graph is said to be a LexBFS ordering if it is the possible output of the application of LexBFS to this graph. Let be a graph with vertices. Recall that is the set of neighbors of. Let be an enumeration of the vertices of. The enumeration is a LexBFS ordering if, for all with, there exists such that .