I'm studying Tarjan's algorithm for strongly-connected components and the way it works is clear to me. Anyway there's a line I don't understand:
// Consider successors of v
for each (v, w) in E do
if (w.index is undefined) then
// Successor w has not yet been visited; recurse on it
strongconnect(w)
v.lowlink := min(v.lowlink, w.lowlink)
else if (w.onStack) then
// Successor w is in stack S and hence in the current SCC
v.lowlink := min(v.lowlink, w.index) // *************
end if
end for
I marked the line with asterisks. Why should we take the discovery index/time of a node when encountering a back-edge
v.lowlink := min(v.lowlink, w.index)
rather than just grabbing its component value?
v.lowlink := min(v.lowlink, w.lowlink)
I can't think of a case where this would be a problem.
Can someone enlighten me please? Edit: I suspect this is only a semantic requirement, i.e. lowlink being defined as the earliest ancestor reachable from the node with only one back-edge, but this is just a wild guess.
The correctness proof goes through if w.lowlink
is at least the lowest index reachable from w
and at most the lowest index reachable from w
using at most one back edge. Component detection just requires us to know if we can "escape" to a lower index.
Probably the reason that it's presented the way that it is is that one can imagine lowlink
only being set in post-order, and then your variation wouldn't be well defined. (The Wikipedia pseudocode initializes it in pre-order to index
.)