Breadth First Search

BFS Example

Assuming that we push characters onto the frontier in alphabetical order (A before B ...), specify the order of the nodes that would be explored by BFS.

If we use the above source code
If we don't use the node_has_been_visited check

Assume that S is the initial node, while G is the goal node.

Breadth First Search is Correct

Given any finite graph, breadth first search expands every node that is reachable from the initial node.

Enqueued

Suppose that $v$ is a vertex in a graph being explored by BFS. We say that $v$ is enqueued if it has been added to the queue.

Expanded

Suppose that $v$ is a vertex in a graph being explored by BFS. We say that $v$ is expanded if all of its successors have been added to the queue.

Depth

Suppose that $v, w$ are two connected vertices in a graph. We say that the depth of $w$ with respect to $v$ is the length of the shortest path from $v$ to $w$ in the graph, and we denote it by $\operatorname{depth}(v, w)$.
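
As an illustration of this definition, here is a minimal sketch that computes the depth with a breadth first traversal. The function name `depth`, the adjacency-list dictionary, and the example graph are assumptions made for this sketch, not part of the original notes.

```python
from collections import deque

def depth(graph, v, w):
    """Return the number of edges on a shortest path from v to w, or None if w is unreachable."""
    dist = {v: 0}                 # distance from v to every vertex seen so far
    queue = deque([v])
    while queue:
        u = queue.popleft()
        if u == w:
            return dist[u]
        for nxt in graph.get(u, []):
            if nxt not in dist:   # first time we see nxt, so this is its shortest distance
                dist[nxt] = dist[u] + 1
                queue.append(nxt)
    return None

# Hypothetical example graph (directed adjacency lists).
example = {"a": ["b", "c"], "b": ["d"], "c": ["d"], "d": []}
print(depth(example, "a", "d"))  # 2
```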

Found

We say that a vertex $v$ is found if it has been expanded.
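
The three definitions above (enqueued, expanded, found) can be made concrete with a small sketch that logs the state of the search after each expansion. The function name `bfs_states` and the example graph are hypothetical, and skipping already-enqueued successors is one possible design choice, not necessarily the one used by the original source code.

```python
from collections import deque

def bfs_states(graph, start):
    """Run BFS from start; after each expansion, log (expanded vertex, enqueued so far)."""
    enqueued = [start]            # every vertex that has ever been added to the queue
    queue = deque([start])
    log = []
    while queue:
        v = queue.popleft()
        for w in graph.get(v, []):
            if w not in enqueued:
                enqueued.append(w)
                queue.append(w)
        # All successors of v are now in the queue (or were earlier): v is expanded, i.e. found.
        log.append((v, tuple(enqueued)))
    return log

# Hypothetical example graph.
example = {"s": ["a", "b"], "a": ["c"], "b": ["c"], "c": []}
for vertex, seen in bfs_states(example, "s"):
    print(vertex, seen)
```

Right after `s` is expanded, the vertices `a` and `b` are enqueued but not yet expanded, which is exactly the distinction the definitions draw.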

BFS Searches Less Deep Vertices First

Let $s$ be the initial vertex that BFS is called on, and define the sets

$$X_{k} := \{v \in V : \operatorname{depth}(s, v) = k\}.$$

Then for any $i, j \in \mathbb{N}_{0}$ with $i < j$, BFS will expand everything in $X_{i}$ before it expands anything in $X_{j}$.

BFS Expands by Depth Layer

Specifically, given the sequence of sets $X_{0}, X_{1}, X_{2}, \dots$, BFS will expand every vertex in each set before expanding any vertex in the next set.
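
The layer-by-layer behaviour can be checked empirically on a small graph. The sketch below records the expansion order together with each vertex's depth, so the depths read off in expansion order should never decrease. The function name `bfs_layers` and the graph are assumptions chosen for illustration.

```python
from collections import deque

def bfs_layers(graph, s):
    """Return (expansion order, depth of each reached vertex) for BFS started on s."""
    depth = {s: 0}
    order = []
    queue = deque([s])
    while queue:
        v = queue.popleft()
        order.append(v)                     # v is expanded now
        for w in graph.get(v, []):
            if w not in depth:
                depth[w] = depth[v] + 1     # w belongs to the next layer
                queue.append(w)
    return order, depth

# Hypothetical example graph.
graph = {"s": ["a", "b"], "a": ["c"], "b": ["c", "d"], "c": ["e"], "d": [], "e": []}
order, depth = bfs_layers(graph, "s")
depths_in_order = [depth[v] for v in order]
print(depths_in_order)  # [0, 1, 1, 2, 2, 3]
```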

BFS Expands All Enqueued Vertices

Suppose that the queue consists of some finite collection of vertices $V$. Then BFS will expand every vertex in this set.

BFS is Complete

Suppose that we have a potentially infinite graph $G = (V, E)$ with a finite maximum branching factor $b$. Given two vertices $s, g \in V$, denoted the start state and the goal state respectively, if there exists a path from $s$ to $g$, then BFS initialized on $s$ will expand $g$ in a finite number of iterations.

Let $X_{k}$ be the collection of all nodes whose shortest path from $s$ has length $k$, so that $X_{0} = \{s\}$. We will prove by induction on $k$ that BFS expands every node in $X_{k}$.

For the base case, $X_{0} = \{s\}$ is expanded on the initial iteration, so the claim holds. Now assume that the claim holds for some $k \in \mathbb{N}_{0}$; we will show that it holds for $k + 1$.

By the induction hypothesis, BFS expands every node in $X_{k}$. Any vertex of depth $k + 1$ is a neighbor of some vertex of depth $k$, so once every node in $X_{k}$ has been expanded, every vertex of $X_{k + 1}$ has been added to the queue. Since the branching factor is finite, $X_{k + 1}$ is finite, and BFS expands every enqueued vertex, so it expands every vertex in $X_{k + 1}$. In particular, $g \in X_{d}$ with $d = \operatorname{depth}(s, g)$ is expanded after finitely many iterations.

The finite branching factor is necessary. Consider a graph that has one root node with infinitely many child nodes, where every child node leads to a single goal node in the third layer. BFS gets stuck on this graph: it spends an infinite amount of time adding the children of the root node to the queue, and thus never terminates even though there is a trivial path from the root to the goal node.
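
By contrast, when the branching factor is finite, BFS reaches the goal even though the graph itself is infinite. The sketch below uses a hypothetical implicit graph on the positive integers in which each vertex `n` has the two successors `n + 1` and `2 * n`. The graph and the names `successors` and `bfs_infinite` are assumptions made purely for illustration.

```python
from collections import deque

def successors(n):
    # Hypothetical infinite graph on the positive integers, branching factor 2.
    return [n + 1, 2 * n]

def bfs_infinite(start, goal):
    """Return (depth of goal, number of expansions) for BFS on the implicit graph.

    The loop only terminates if the goal is reachable from start.
    """
    depth = {start: 0}
    queue = deque([start])
    expansions = 0
    while queue:
        v = queue.popleft()
        expansions += 1
        if v == goal:
            return depth[v], expansions
        for w in successors(v):
            if w not in depth:
                depth[w] = depth[v] + 1
                queue.append(w)

d, steps = bfs_infinite(1, 10)
print(d, steps)  # 4 10
```

The goal 10 sits at depth 4 (for example via 1, 2, 4, 5, 10), and it is expanded after finitely many iterations although the queue would keep growing forever if the search were not stopped.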

BFS Time Complexity

Suppose that BFS is run on a potentially infinite graph that has maximum branching factor $b$. Given two connected nodes $s, g$ such that $d = \operatorname{depth}(s, g)$, the number of steps taken by BFS, started on $s$, until it expands $g$ is $O(b^{d + 1})$.

Observe that there are at most $b^{k}$ nodes in $X_{k}$ (symbolically $|X_{k}| \leq b^{k}$), so expanding every vertex in $X_{k}$ while enqueuing their neighbors takes at most $b \cdot b^{k} = b^{k + 1}$ steps. Since we've proven that BFS expands by depth layer, and $g$ lies in $X_{d}$, the runtime is upper bounded by

$$b^{1} + b^{2} + \dots + b^{d + 1}.$$

During the last layer we could find the goal node immediately, if it is the first vertex we expand at depth $d$. In the worst case, however, $g$ is added at the very end of the queue while we iterate through $X_{d - 1}$, so we have to expand $|X_{d}| - 1$ vertices before we finally expand $g$. This costs at most $b \cdot (|X_{d}| - 1) \leq b (b^{d} - 1) = b^{d + 1} - b$ steps, which is why the final term $b^{d + 1}$ in the bound above is justified. Finally, for $b \geq 2$ the sum is dominated by its last term, so we can simplify inside the big-O:

$$O(b^{1} + b^{2} + \dots + b^{d + 1}) = O(b^{d + 1})$$

as needed.
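
For reference, the simplification inside the big-O can be justified with the geometric series, with the sum running over the layers $X_{0}, \dots, X_{d}$. This sketch assumes $b \geq 2$, which the statement above does not spell out; for $b = 1$ the sum is simply $d + 1$.

$$
\sum_{k = 0}^{d} b^{k + 1} = b \cdot \frac{b^{d + 1} - 1}{b - 1} \leq \frac{b}{b - 1} \, b^{d + 1} \leq 2 \, b^{d + 1}
$$

The last inequality uses $\frac{b}{b - 1} \leq 2$ for $b \geq 2$, so the whole sum is within a constant factor of its largest term.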

BFS Space Complexity

The space complexity of BFS is given by $O(b^{d + 1})$.

The space taken by BFS is determined by the size of the queue, where we can assume that every entry in the queue has constant size. In the worst case, as we iterate over the vertices in $X_{d}$, $g$ is the last one to be popped off, so the queue would contain $b \cdot (|X_{d}| - 1) \leq b (b^{d} - 1)$ vertices, which is upper bounded by $b^{d + 1}$ vertices, as needed.
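
The worst case described above can be simulated on a complete $b$-ary tree in which the goal is the last vertex of layer $X_{d}$. The sketch below is a rough experiment under that assumption. The tree encoding (tuples of child indices) and the name `peak_queue_size` are hypothetical.

```python
from collections import deque

def peak_queue_size(b, d):
    """BFS on an implicit complete b-ary tree; the goal is the last vertex at depth d.

    Returns the largest number of vertices that were in the queue at any one time.
    """
    goal = (b - 1,) * d           # always take the last child, d times
    queue = deque([()])           # the root is the empty tuple
    peak = 1
    while queue:
        v = queue.popleft()
        if v == goal:
            return peak
        for i in range(b):        # a tree has no repeated vertices, so no visited check
            queue.append(v + (i,))
        peak = max(peak, len(queue))

print(peak_queue_size(2, 3), 2 ** 4)  # 15 16
```

For $b = 2$ and $d = 3$ the queue peaks at 15 entries: the goal itself plus the $b (b^{d} - 1) = 14$ children of the other depth-3 vertices, which stays below $b^{d + 1} = 16$.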

Note that if you run BFS on a graph with edge weights, hoping that it will produce a minimum-cost path to the goal node, it will not in general: BFS minimizes the number of edges on the path, not the total weight.
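
A tiny counterexample makes this concrete. In the hypothetical weighted graph below, the direct edge from `S` to `G` is expensive, while the detour through `A` is cheap. The graph, weights, and function names are assumptions chosen for illustration.

```python
from collections import deque

# Hypothetical weighted graph: graph[u][v] is the cost of the edge u -> v.
WEIGHTED = {"S": {"A": 1, "G": 5}, "A": {"G": 1}, "G": {}}

def bfs_path(graph, start, goal):
    """Return the path BFS finds from start to goal (fewest edges), or None."""
    parent = {start: None}
    queue = deque([start])
    while queue:
        v = queue.popleft()
        if v == goal:
            path = []
            while v is not None:      # walk the parent pointers back to start
                path.append(v)
                v = parent[v]
            return path[::-1]
        for w in graph[v]:            # edge weights are ignored by BFS
            if w not in parent:
                parent[w] = v
                queue.append(w)
    return None

def path_cost(graph, path):
    """Sum the edge weights along a path given as a list of vertices."""
    return sum(graph[u][v] for u, v in zip(path, path[1:]))

path = bfs_path(WEIGHTED, "S", "G")
print(path, path_cost(WEIGHTED, path))        # ['S', 'G'] 5
print(path_cost(WEIGHTED, ["S", "A", "G"]))   # 2
```

BFS returns the one-edge path of cost 5, even though the two-edge path through `A` costs only 2.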

BFS Cost Optimal?

Suppose that there is a solution path within a finite search space. Is it true that BFS is cost-optimal if, at every level of the search tree, all step costs are greater than the step costs in the previous level?

BFS Example trace, with the node_has_been_visited check:

expansion order	frontier	about to expand
S	S	S
S	B, C	B
S-B	C, A, D, E	C
S-B-C	A, D, E, E	A
S-B-C-A	D, E, E, F	D
S-B-C-A-D	E, E, F	E
S-B-C-A-D-E	E, F, D	E
S-B-C-A-D-E	F, D	F
S-B-C-A-D-E-F	D, G	D
S-B-C-A-D-E-F	G	G
S-B-C-A-D-E-F-G

BFS Example trace, without the node_has_been_visited check:

expansion order	frontier	about to expand
S	S	S
S	B, C	B
S-B	C, A, D, E	C
S-B-C	A, D, E, E	A
S-B-C-A	D, E, E, F	D
S-B-C-A-D	E, E, F	E
S-B-C-A-D-E	E, F, D	E
S-B-C-A-D-E-E	F, D, D	F
S-B-C-A-D-E-E-F	D, D, G	D
S-B-C-A-D-E-E-F-D	D, G	D
S-B-C-A-D-E-E-F-D-D	G	G
S-B-C-A-D-E-E-F-D-D-G
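
The two traces above can be reproduced with a short sketch. The adjacency list is an assumption inferred from the frontier columns of the traces, since the graph itself is not shown here. `bfs_order` and its `check_visited` flag are hypothetical stand-ins for the original source code and its node_has_been_visited check, which is assumed to happen when a node is popped from the frontier.

```python
from collections import deque

# Assumed adjacency list, inferred from the frontier columns of the traces.
GRAPH = {"S": ["B", "C"], "B": ["A", "D", "E"], "C": ["E"], "A": ["F"],
         "D": [], "E": ["D"], "F": ["G"], "G": []}

def bfs_order(graph, start, goal, check_visited):
    """Return the expansion order of BFS; duplicates are skipped only if check_visited."""
    frontier = deque([start])
    visited = set()
    order = []
    while frontier:
        node = frontier.popleft()
        if check_visited and node in visited:
            continue                     # already expanded once, skip the duplicate
        visited.add(node)
        order.append(node)
        if node == goal:
            break
        frontier.extend(graph[node])     # successors are listed in alphabetical order
    return order

print("-".join(bfs_order(GRAPH, "S", "G", True)))   # S-B-C-A-D-E-F-G
print("-".join(bfs_order(GRAPH, "S", "G", False)))  # S-B-C-A-D-E-E-F-D-D-G
```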

🏗️ ΘρϵηΠατπ🚧 (under construction)
