grammarautomatacomputation-theorylanguage-theorycontext-sensitive-grammar

Can someone give a simple but non-toy example of a context-sensitive grammar?


I'm trying to understand context-sensitive grammars, and I understand why languages like

  1. {ww | w is a string}
  2. {an bn cn | a,b,c are symbols}

are not context free, but what I'd like to know if a language similar to the untyped lambda calculus is context sensitive. I'd like to see an example of a simple, but non-toy (I consider the above toy examples), example of a context-sensitive grammar that can, for some production rule, e.g., tell whether or not some string of symbols is in scope currently (e.g. when producing the body of a function). Are context sensitive grammars powerful enough to make undefined/undeclared/unbound variables a syntactic (rather than semantic) error?


Solution

  • Yes, context-sensitive grammars (CSG) are powerful enough to make undefined/undeclared/unbound variables check, but unfortunately we don't know any efficient algorithm to parse strings of CSG.

    A real example of a context-sensitive language is the C programming language. A feature like declare variables first and then use them later make C-language a context-sensitive language (CSL). (I don't know about untyped lambda calculus).

    And because we don't know any linear parsing algorithm for CSL (or CSG). That is the reason in compiler design, we use CFG (and its parsing algoritm only) for syntax checking since we know efficient algorithms to parse CFG (if it's in restricted form). Compilers first parse a context free feature and then later handle context-sensitive features in a problematically way (for example, checks any used variable in the symbol table if it's defined. Otherwise, it generates an error).

    Also context-sensitive grammar is used in natural-language processing (NLP). And most natural languages are examples of context-sensitive languages. (I am not sure for the Sanskrit language).

    I will try to explain it with a silly but simple example (it's just an idea, you can refine it):

    NOUN     -->  { BlueBomber, Grijesh, I, We}
    TENSE    -->  { am, was, is, were}
    VERB     -->  { going, eating, working}
    
    SENTENCE --> <NOUN> <TENSE> <VERB>
    

    Now, using this grammar, we can generate some correct statements, but some are wrong too. For example,

    SENTENCE --> <NOUN>   <TENSE>   <VERB>
                 Grijesh    is       working       [Correct statement]
    

    But

                 Grijesh    am       working       [wrong statement]
    

    Reason: the value of <TENSE> depends on value <NOUN> (for example, I <TENSE> --> I am) and hence the grammar doesn't generate correct statements in the English language.

    Actually we can't write a context-free grammar for complete English!

    You might have noticed, any natural language translator or grammar checker doesn't works correctly (try with long statements). Because this problem comes under the context-sensitive parsing algorithm.


    REFERENCE: You can watch Dr. Arun Kumar's lectures. In some lecture he explains exactly what you are interested in.