stringmathcontext-free-grammarproofpumping-lemma

Using Ogden’s Lemma versus regular Pumping Lemma for Context-Free Grammars


I'm learning the difference between the lemmata in the question. Every reference I can find uses the example:

{(a^i)(b^j)(c^k)(d^l) : i = 0 or j = k = l}

to show the difference between the two. I can find an example using the regular lemma to "disprove" it.

Select w = uvxyz, s.t. |vy| > 0, |vxy| <= p. Suppose w contains an equal number of b's, c's, d's.

I selected:

u,v,x = ε
y = (the string of a's)
z = (the rest of the string w)

Pumping y will just add to the number of a's, and if |b|=|c|=|d| at first, it still will now.

(Similar argument for if w has no a's. Then just pump whatever you want.)

My question is, how does Ogden's lemma change this strategy? What does "marking" do?

Thanks!


Solution

  • One important stumbling issue here is that "being able to pump" does not imply context free, rather "not being able to pump" shows it is not context free. Similarly, being grey does not imply you're an elephant, but being an elephant does imply you're grey...

    Grammar context free        => Pumping Lemma is definitely satisfied  
    Grammar not context free    => Pumping Lemma *may* be satisfied
    Pumping Lemma satisfied     => Grammar *may* be context free
    Pumping Lemma not satisfied => Grammar definitely not context free
    # (we can write exactly the same for Ogden's Lemma)
    # Here "=>" should be read as implies
    

    That is to say, in order to demonstrate that a language is not context free we must show it fails(!) to satisfy one of these lemmata. (Even if it satisfies both we haven't proved it is context free.)

    Below is a sketch proof that L = { a^i b^j c^k d^l where i = 0 or j = k = l} is not context free (although it satisfies The Pumping Lemma, it doesn't satisfy Ogden's Lemma):

    Pumping lemma for context free grammars:

    If a language L is context-free, then there exists some integer p ≥ 1 such that any string s in L with |s| ≥ p (where p is a pumping length) can be written as s = uvxyz
    with substrings u, v, x, y and z, such that:
    1. |vxy| ≤ p,
    2. |vy| ≥ 1, and
    3. u v^n x y^n z is in L for every natural number n.

    In our example:

    For any s in L (with |s|>=p):

    (So unfortunately: using the Pumping Lemma we are unable to prove anything about L!
    Note: the above was essentially the argument you gave in the question.)

    Ogden's Lemma:

    If a language L is context-free, then there exists some number p > 0 (where p may or may not be a pumping length) such that for any string w of length at least p in L and every way of "marking" p or more of the positions in w, w can be written as w = uxyzv
    with strings u, x, y, z, and v such that:
    1. xz has at least one marked position,
    2. xyz has at most p marked positions, and
    3. u x^n y z^n v is in L for every n ≥ 0.

    Note: this marking is the key part of Ogden's Lemma, it says: "not only can every element be "pumped", but it can be pumped using any p marked positions".

    In our example:

    Let w = a b^p c^p d^p and mark the positions of the bs (of which there are p, so w satisfies the requirements of Ogden's Lemma), and let u,x,y,z,v be a decomposition satisfying the conditions from Ogden's lemma (z=uxyzv).

    This leaves us with five cases to check (for i,j>0):

    in every case (by comparing the number of bs, cs and ds) we can see that u x^2 v y^2 z is not in L (and we have a contradiction (!) to the language being context-free, that is, we've proved that L is not context free).

    .

    To summarise, L is not context-free, but this cannot be demonstrated using The Pumping Lemma (but can by Ogden's Lemma) and thus we can say that:

    Ogden's lemma is a second, stronger pumping lemma for context-free languages.