algorithmlinked-listdetectioncyclefloyd-cycle-finding

Cycle detection in linked list with the Hare and Tortoise approach


I understand that in order to detect a cycle in a linked list I can use the Hare and Tortoise approach, which holds 2 pointers (slow and fast ones). However, after reading in wiki and other resources, I do not understand why is it guaranteed that the two pointers will meet in O(n) time complexity.


Solution

  • Here's an attempt at an informal proof.

    The shape of the cycle doesn't matter. It can have a long tail, and a loop towards the end, or just a loop from the beginning to the end without a tail. Irrespective of the shape of the cycle, one thing is clear - that the tortoise pointer can not catch up to the hare pointer. If the two were ever to meet, the hare pointer has to catch up the to tortoise pointer from behind.

    With that established, consider the two possibilites:

    1. hare pointer is one step behind the tortoise pointer.
    2. hare pointer is two steps behind the tortoise pointer.

    All greater distances will reduce to one or two eventually.

    Assuming the tortoise pointer always moves first (can be the other way around too), then in the first case, the tortoise pointer takes one step forward. Now the distance between them is 2. When the hare pointer takes 2 steps now, they will land on the same node. Here's the same thing illustrated for easier understanding. Too much text can get in the way.

    ♛ = hare
    ♙ = tortoise
    X = both meet
    
    ..♛♙... #1 - Initially, the hare is one step behind the tortoise.
    ..♛.♙.. #2 - Now the tortoise takes one step. now hare is two steps behind.
    ....X.. #3 - Next, the hare takes two steps and they meet!
    

    Now let's consider the second case where the distance between them is 2. The slow pointer moves one step forward and the distance between them becomes 3. Next, the fast pointer moves forward two steps and the distance between them reduces to 1 which is identical to the first case in which we have already proved that they will meet in one more step. Again, illustrated for easier understanding.

    .♛.♙... #1 - Initially, the hare is two steps behind the tortoise.
    .♛..♙.. #2 - Now the tortoise takes one step and they become three steps apart.
    ...♛♙.. #3 - Next, the hare takes two steps which is identical to previous case.
    

    Now, as to why this algorithm is guaranteed to be in O(n), use what we've already established that the hare will meet the tortoise before it moves ahead. Which means that once both are inside the loop, before the tortoise completes another round, it will meet the hare since the hare moves twice as fast. In the worst case, the loop will be a circle with n nodes. While the tortoise can only complete one round in n steps, the hare can complete two rounds in that time. Even if the hare missed the tortoise in its first round (which it will), it's definitely going to catch up to the tortoise in its second round.