Conjunctive grammar – Wikipedia

Posted on January 27, 2014 by lordneo


Conjunctive grammars are a class of formal grammars
studied in formal language theory.
They extend the basic type of grammars,
the context-free grammars,
with a conjunction operation.
Besides explicit conjunction,
conjunctive grammars allow implicit disjunction
represented by multiple rules for a single nonterminal symbol,
which is the only logical connective expressible in context-free grammars.
Conjunction can be used, in particular,
to specify intersection of languages.
A further extension of conjunctive grammars
known as Boolean grammars
additionally allows explicit negation.


The rules of a conjunctive grammar are of the form

$A \to \alpha_1 \& \ldots \& \alpha_m$

where $A$ is a nonterminal and $\alpha_1$, …, $\alpha_m$ are strings formed of symbols in $\Sigma$ and $V$ (finite sets of terminal and nonterminal symbols respectively).
Informally, such a rule asserts that every string $w$ over $\Sigma$ that satisfies each of the syntactical conditions represented by $\alpha_1$, …, $\alpha_m$ therefore satisfies the condition defined by $A$.


Formal definition

A conjunctive grammar $G$ is defined by the 4-tuple $G = (V, \Sigma, R, S)$ where

$V$ is a finite set; each element $v \in V$ is called a nonterminal symbol or a variable.
$\Sigma$ is a finite set of terminals, disjoint from $V$, which make up the actual content of the sentence. The set of terminals is the alphabet of the language defined by the grammar $G$.
$R$ is a finite set of productions, each of the form $A \to \alpha_1 \& \ldots \& \alpha_m$, where $A \in V$ and each $\alpha_i$ is a string over $V \cup \Sigma$.
$S$ is the start variable (or start symbol), used to represent the whole sentence (or program). It must be an element of $V$ .

It is common to list all right-hand sides for the same left-hand side on the same line, using | (the pipe symbol) to separate them. Rules $A \to \alpha_1 \& \ldots \& \alpha_m$ and $A \to \beta_1 \& \ldots \& \beta_n$ can hence be written as $A \to \alpha_1 \& \ldots \& \alpha_m \mid \beta_1 \& \ldots \& \beta_n$.
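As an illustration, the 4-tuple can be held in a small data structure that also checks the side conditions stated above ($V$ and $\Sigma$ disjoint, $S \in V$). This is only a sketch: the class `ConjunctiveGrammar`, its field names, and the helper `word` are hypothetical, and the sample grammar assumes every symbol is a single character.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConjunctiveGrammar:
    """A 4-tuple (V, Sigma, R, S); all names here are illustrative."""
    nonterminals: frozenset
    terminals: frozenset
    # Each rule is (A, conjuncts). conjuncts is a tuple of strings alpha_i,
    # and each alpha_i is a tuple of symbols (the empty tuple is epsilon).
    rules: tuple
    start: str

    def __post_init__(self):
        if self.nonterminals & self.terminals:
            raise ValueError("V and Sigma must be disjoint")
        if self.start not in self.nonterminals:
            raise ValueError("the start symbol must be an element of V")
        symbols = self.nonterminals | self.terminals
        for lhs, conjuncts in self.rules:
            if lhs not in self.nonterminals:
                raise ValueError(f"left-hand side {lhs!r} is not in V")
            if not conjuncts:
                raise ValueError("a rule needs at least one conjunct")
            for alpha in conjuncts:
                if any(sym not in symbols for sym in alpha):
                    raise ValueError(f"unknown symbol in a rule for {lhs!r}")

    def show(self):
        """List the rules, grouping right-hand sides per left-hand side with '|'."""
        grouped = {}
        for lhs, conjuncts in self.rules:
            rhs = " & ".join("".join(alpha) or "ε" for alpha in conjuncts)
            grouped.setdefault(lhs, []).append(rhs)
        return [f"{lhs} → {' | '.join(alts)}" for lhs, alts in grouped.items()]


def word(s):
    """Split a string of one-character symbols into a tuple of symbols."""
    return tuple(s)


# A sample grammar chosen for illustration only.
example = ConjunctiveGrammar(
    nonterminals=frozenset("SABCD"),
    terminals=frozenset("abc"),
    rules=(
        ("S", (word("AB"), word("DC"))),
        ("A", (word("aA"),)), ("A", (word(""),)),
        ("B", (word("bBc"),)), ("B", (word(""),)),
        ("C", (word("cC"),)), ("C", (word(""),)),
        ("D", (word("aDb"),)), ("D", (word(""),)),
    ),
    start="S",
)
```

Calling `example.show()` returns one line per nonterminal in the pipe notation, with a conjunction written as `&` inside a single alternative.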

Two equivalent formal definitions
of the language specified by a conjunctive grammar exist.
One definition is based upon representing the grammar
as a system of language equations with union, intersection and concatenation
and considering its least solution.
The other definition generalizes
Chomsky’s generative definition of the context-free grammars
using rewriting of terms over conjunction and concatenation.
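For the equation-based definition, a sketch of the usual formulation is as follows. Every nonterminal $A \in V$ is treated as an unknown language, and the grammar is read as the system

$$A = \bigcup_{A \to \alpha_1 \& \ldots \& \alpha_m \,\in\, R} \;\; \bigcap_{i=1}^{m} \alpha_i \qquad (A \in V)$$

Here each $\alpha_i$ on the right-hand side is read as a concatenation of languages, with a terminal $a$ standing for $\{a\}$, a nonterminal standing for its unknown, and the empty string standing for $\{\varepsilon\}$. Under this reading, the language of the grammar is the $S$-component of the least solution of the system with respect to componentwise inclusion.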

Definition by derivation

For any strings $u, v \in (V \cup \Sigma \cup \{\text{“(”}, \text{“}\&\text{”}, \text{“)”}\})^{*}$, we say $u$ directly yields $v$, written as $u \Rightarrow v$, if

either there is a rule $A \to \alpha_1 \& \ldots \& \alpha_m \in R$ such that $u = u_1 A u_2$ and $v = u_1 (\alpha_1 \& \ldots \& \alpha_m) u_2$,
or there exists a string $w \in (V \cup \Sigma)^{*}$ such that $u = u_1 (w \& \ldots \& w) u_2$ and $v = u_1 w u_2$.

For any string $w \in \Sigma^{*}$, we say $G$ generates $w$, written as $S \stackrel{*}{\Rightarrow} w$, if $\exists k \geq 1\, \exists\, u_1, \cdots, u_k \in (V \cup \Sigma \cup \{\text{“(”}, \text{“}\&\text{”}, \text{“)”}\})^{*}$ such that $S = u_1 \Rightarrow u_2 \Rightarrow \cdots \Rightarrow u_k = w$.

The language of a grammar $G = (V, \Sigma, R, S)$ is the set of all strings it generates.

Example

The grammar $G = (\{S, A, B, C, D\}, \{a, b, c\}, R, S)$, with productions

$S \to AB \& DC$

$A \to aA \mid \epsilon$

$B \to bBc \mid \epsilon$

$C \to cC \mid \epsilon$

$D \to aDb \mid \epsilon$

is conjunctive. A typical derivation is

$S \Rightarrow (AB \& DC) \Rightarrow (aAB \& DC) \Rightarrow (aB \& DC) \Rightarrow (abBc \& DC) \Rightarrow (abc \& DC) \Rightarrow (abc \& aDbC) \Rightarrow (abc \& abC) \Rightarrow (abc \& abcC) \Rightarrow (abc \& abc) \Rightarrow abc$

It can be shown that $L(G) = \{a^{n} b^{n} c^{n} : n \geq 0\}$. This language is not context-free, which can be proved using the pumping lemma for context-free languages.
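The role of the conjunction in the start rule can be checked directly. The conjunct $AB$ describes the strings $a^{i} b^{j} c^{j}$, the conjunct $DC$ describes the strings $a^{i} b^{i} c^{k}$, and a string belongs to $L(G)$ exactly when it satisfies both. The sketch below tests the two conditions separately with regular expressions plus a length comparison; it is not a general parser, and the function names `in_AB`, `in_DC` and `in_L` are made up for this illustration.

```python
import re


def in_AB(w):
    """A -> aA | eps gives a*, and B -> bBc | eps gives b^j c^j."""
    m = re.fullmatch(r"a*(b*)(c*)", w)
    return m is not None and len(m.group(1)) == len(m.group(2))


def in_DC(w):
    """D -> aDb | eps gives a^i b^i, and C -> cC | eps gives c*."""
    m = re.fullmatch(r"(a*)(b*)c*", w)
    return m is not None and len(m.group(1)) == len(m.group(2))


def in_L(w):
    """S -> AB & DC: the string must satisfy both conjuncts at once."""
    return in_AB(w) and in_DC(w)
```

For instance, `aabc` satisfies only the first conjunct and `abcc` only the second, so neither is accepted by `in_L`, whereas `aabbcc` satisfies both.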

Parsing algorithms

Though the expressive power of conjunctive grammars
is greater than that of context-free grammars,
conjunctive grammars retain some practically useful properties of the latter.
Most importantly, there are generalizations of the main context-free parsing algorithms,
including the linear-time recursive descent,
the cubic-time generalized LR,
the cubic-time Cocke–Kasami–Younger,
as well as Valiant’s algorithm running as fast as matrix multiplication.
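A minimal sketch of the Cocke–Kasami–Younger idea for conjunctive grammars is given below. It assumes the grammar is already in a binary normal form, where every rule is either $A \to a$ or $A \to B_1 C_1 \& \ldots \& B_m C_m$; the empty string is left out for brevity. The function `recognizes`, the rule encoding, and the sample grammar for $\{a^{n} b^{n} c^{n} : n \geq 1\}$ are assumptions made for this illustration. The only change from the context-free algorithm is that a nonterminal is added to a table cell when all conjuncts of one of its rules are witnessed for that substring.

```python
def recognizes(terminal_rules, conjunctive_rules, start, w):
    """CYK-style membership test for a grammar in binary normal form.

    terminal_rules:    list of (A, a) for rules A -> a
    conjunctive_rules: list of (A, [(B1, C1), ..., (Bm, Cm)])
                       for rules A -> B1 C1 & ... & Bm Cm
    """
    n = len(w)
    if n == 0:
        return False  # epsilon is not handled in this sketch
    table = {}  # table[i, j] = nonterminals generating w[i:j]
    for i in range(n):
        table[i, i + 1] = {A for A, a in terminal_rules if a == w[i]}
    for length in range(2, n + 1):
        for i in range(n - length + 1):
            j = i + length
            # All pairs (B, C) such that w[i:j] splits into a B-part and a C-part.
            pairs = set()
            for k in range(i + 1, j):
                for B in table[i, k]:
                    for C in table[k, j]:
                        pairs.add((B, C))
            # A rule applies only if every one of its conjuncts is witnessed.
            table[i, j] = {A for A, conjuncts in conjunctive_rules
                           if all(c in pairs for c in conjuncts)}
    return start in table[0, n]


# Sample grammar (an assumption of this sketch):
# S -> AP & DC, where A gives a+, P gives b^n c^n, D gives a^n b^n, C gives c+.
TERMINAL_RULES = [("A", "a"), ("Ta", "a"), ("Tb", "b"), ("Tc", "c"), ("C", "c")]
CONJUNCTIVE_RULES = [
    ("S", [("A", "P"), ("D", "C")]),
    ("A", [("Ta", "A")]),
    ("C", [("Tc", "C")]),
    ("P", [("Tb", "Tc")]), ("P", [("Tb", "Q")]), ("Q", [("P", "Tc")]),
    ("D", [("Ta", "Tb")]), ("D", [("Ta", "E")]), ("E", [("D", "Tb")]),
]
```

For a fixed grammar, the three nested loops over `length`, `i` and `k` give the cubic running time in the length of the input.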

Theoretical properties

A property that is already undecidable for context-free languages, or for finite intersections of them, must also be undecidable for conjunctive grammars; these include
emptiness, finiteness, regularity, context-freeness,[n 1] inclusion and equivalence.[n 2]

The family of conjunctive languages is closed under union, intersection, concatenation and Kleene star, but not under string homomorphism, prefix, suffix, and substring.
Closure under complement and under ε-free string homomorphism are still open problems (as of 2001).[1] (p. 533)
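The four positive closure properties can be seen from direct constructions, sketched here under the assumption that $G_1 = (V_1, \Sigma, R_1, S_1)$ and $G_2 = (V_2, \Sigma, R_2, S_2)$ have disjoint sets of nonterminals and that $S$ is a fresh start symbol. In each case the new grammar keeps all the old rules and adds the rules shown:

$$\begin{aligned}
\text{union:} \quad & S \to S_1 \mid S_2 \\
\text{intersection:} \quad & S \to S_1 \& S_2 \\
\text{concatenation:} \quad & S \to S_1 S_2 \\
\text{Kleene star:} \quad & S \to S_1 S \mid \varepsilon
\end{aligned}$$

Only the intersection case uses a rule that is unavailable in a context-free grammar; the other three are the familiar context-free constructions.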

The expressive power of grammars over a one-letter alphabet has been researched.

This work provided a basis
for the study of language equations of a more general form.

Synchronized alternating pushdown automata

Aizikowitz and Kaminski [2] introduced a new class of pushdown automata (PDA) called synchronized alternating pushdown automata (SAPDA). They proved it to be equivalent to conjunctive grammars in the same way as nondeterministic PDAs are equivalent to context-free grammars.

[n 1] Given a conjunctive grammar, is its generated language empty / finite / regular / context-free?
[n 2] Given two conjunctive grammars, is the first’s generated language a subset of / equal to the second’s?

References

[1] Okhotin, Alexander (2001). “Conjunctive Grammars” (PDF). Journal of Automata, Languages and Combinatorics. 6 (4): 519–535.
[2] Aizikowitz, Tamar; Kaminski, Michael (2011). “LR(0) Conjunctive Grammars and Deterministic Synchronized Alternating Pushdown Automata”. Computer Science – Theory and Applications. Lecture Notes in Computer Science. Vol. 6651. pp. 345–358. doi:10.1007/978-3-642-20712-9_27. ISBN 978-3-642-20711-2. ISSN 0302-9743.
