clojureprologquicksortlazy-sequencesreify

How to reify Prolog's backtracking state to perform the same task as "lazy seq" from Clojure?


Here is a quicksort algorithm for numbers written in Clojure. It is basically the quicksort algorithm found in "The Joy of Clojure", 2nd edition, page 133. I modified it slightly for (hopefully) better readability, because the original felt a bit too compact:

(defn qsort-inner [work]
   (lazy-seq        
      (loop [loopwork work]
         (let [[ part & partz ] loopwork ]
            (if-let [[pivot & valuez] (seq part)]
                  (let [ smaller? #(< % pivot)
                         smz      (filter smaller? valuez)
                         lgz      (remove smaller? valuez)
                         nxxt     (list* smz pivot lgz partz) ]
                        (recur nxxt))
                  (if-let [[oldpivot & rightpartz] partz]
                          (cons oldpivot (qsort-inner rightpartz))
                          []))))))

(defn qsort [ xs ]
   (qsort-inner (list xs)))

The algorithm is started by a call to qsort, which envelops a passed list of numbers into another list (thus creating a list containing a single list), then calls qsort-inner.

(qsort [10 4 5 88 7 1])     ;; (qsort-inner [[10 4 5 88 7 1]])
;; (1 4 5 7 10 88)

qsort-inner has three noteworthy points:

With the help of the lazy-seq thing, we can make the algorithm emit data one-by-one:

;; the full result is generated on printout
(qsort [10 4 5 88 7 1])
(1 4 5 7 10 88)

;; store the lazy-seq and query it
(def l (qsort [10 4 5 88 7 1]))
(first l)
;; 1
(second l)
;; 4

I was thinking about how to perform this lazy quicksort in Prolog. In fact, laziness, at least in this instance, is given for free in Prolog by backtracking! We can ask for a first result, computation halts and the next result is then obtained by backtracking.

qsort_inner(X, [[],X|_]).
qsort_inner(X, [[],_|WorkRest]) :- qsort_inner(X, WorkRest).
qsort_inner(X, [[Piv|Ns]|WorkRest]) :- 
    pick_smaller(Piv,Ns,SMs),
    pick_notsmaller(Piv,Ns,NSMs),
    qsort_inner(X,[SMs,Piv,NSMs|WorkRest]).

pick_smaller(Pivot,Ins,Outs) :- include(@>(Pivot),Ins,Outs).
pick_notsmaller(Pivot,Ins,Outs) :- exclude(@>(Pivot),Ins,Outs).

qsort(X,Lin) :- qsort_inner(X,[Lin]).

Sort a list "lazily":

qsort(X,[3,2,1]).
X = 1;
X = 2;
X = 3;
false

Gotta get them all:

qsort_fully(Lin,Lout) :- bagof(X, qsort(X, Lin), Lout).

Unfortunately, the data structure that keeps track of the computational state is not apparent: it is on the stack, it cannot be unified to a variable. Thus is can only use this kind of "laziness" when I am on Prolog's top level.

How do I capture the state of the computation and invoke it later?

Note on how the quick sort works

The tree structure does not need to be explicitly kept as it holds no information. Instead, the sequence of alternating "leaf lists" and "pivot numbers" is kept in a list. Which is why we the intial "list of a lits of numbers".

Partial example run for quicksort


Solution

  • Prolog is a very reifiable language. Just turn your code into data:

    qsort_gen(Lin, G) :- 
        % G is the initial generator state for Lin's quicksorting
        G = qsort_inner([Lin]).
    
        %    This_State                   Next_Elt      Next_State
    next( qsort_inner([[], X    | WorkRest]), X, qsort_inner(WorkRest) ).
    next( qsort_inner([[Piv|Ns] | WorkRest]), X, G ) :-
        pick_smaller(  Piv, Ns, SMs),
        pick_notsmaller(Piv, Ns, NSMs),
        next( qsort_inner([SMs, Piv, NSMs | WorkRest]), X, G).
    
    pick_smaller(  Pivot, Ins, Outs) :- include( @>(Pivot), Ins, Outs).
    pick_notsmaller(Pivot, Ins, Outs) :- exclude( @>(Pivot), Ins, Outs).
    

    That's all.

    15 ?- qsort_gen([3,2,5,1,9,4,8], G), next(G,X,G2), next(G2,X2,G3), next(G3,X3,G4).
    G = qsort_inner([[3, 2, 5, 1, 9, 4, 8]]),
    X = 1,
    G2 = qsort_inner([[], 2, [], 3, [5, 9, 4|...]]),
    X2 = 2,
    G3 = qsort_inner([[], 3, [5, 9, 4, 8]]),
    X3 = 3,
    G4 = qsort_inner([[5, 9, 4, 8]]).
    
    16 ?- qsort_gen([1,9,4,8], G), next(G,X,G2), next(G2,X2,G3), next(G3,X3,G4).
    G = qsort_inner([[1, 9, 4, 8]]),
    X = 1,
    G2 = qsort_inner([[9, 4, 8]]),
    X2 = 4,
    G3 = qsort_inner([[8], 9, []]),
    X3 = 8,
    G4 = qsort_inner([[], 9, []]).
    
    17 ?- qsort_gen([1,9,4], G), next(G,X,G2), next(G2,X2,G3), next(G3,X3,G4).
    G = qsort_inner([[1, 9, 4]]),
    X = 1,
    G2 = qsort_inner([[9, 4]]),
    X2 = 4,
    G3 = qsort_inner([[], 9, []]),
    X3 = 9,
    G4 = qsort_inner([[]]).
    

    For easier interfacing, we can use take/4:

    take( 0, Next, Z-Z, Next):- !.
    take( N, Next, [A|B]-Z, NextZ):- N>0, !, next( Next, A, Next1),
      N1 is N-1,
      take( N1, Next1, B-Z, NextZ).
    

    Then,

    19 ?- qsort_gen([3,2,5,1,9,4,8], G), take(6, G, L-[], _).
    G = qsort_inner([[3, 2, 5, 1, 9, 4, 8]]),
    L = [1, 2, 3, 4, 5, 8].
    
    20 ?- qsort_gen([3,2,5,1,9,4,8], G), take(7, G, L-[], _).
    G = qsort_inner([[3, 2, 5, 1, 9, 4, 8]]),
    L = [1, 2, 3, 4, 5, 8, 9].
    
    21 ?- qsort_gen([3,2,5,1,9,4,8], G), take(10, G, L-[], _).
    false.
    

    take/4 obviously needs tweaking to close the output list gracefully when next/3 fails. It was written with the infinite lists in mind, originally. This is left as an exercise for a keen explorer.