Finite Automata

9/8/09
CS240 Language Theory and Automata Fall 2009
Finite Automata
Example
The following FA scans HTML documents, looking for a list of what could be title-author pairs, perhaps in a reading list for some literature course. It accepts whenever it nds the end of a list item. In an application, the strings that matched the title (before 'by') and author (after) might be stored in a table of title-author pairs being accumulated.
Transition Diagram
An FA can be represented by a graph:
nodes = states arc from q to p is labeled by the set of input symbols a such that (q, a) = p. No arc if no such a. Start state indicated by word "start" and an arrow. Accepting states get double circles.
9/8/09

start!
<ol>, <ul>!
<li>!
Any non-tag! space!
FA that recognizes simple identiers

start! letter!
letter or digit!
b! <li>! y!
other character! (delimeter)!
space!
</ol>, </ul>!
</li>!
Any non-tag!
FA for Newspaper Vending Machine!

quarter! quarter! dime! dime! dime! dime,! quarter!
FA for coin ipping

Four ways of arranging two coins, depending on which is heads (H) and which is tails (T)
HH, HT, TT, TH
start!
nickel! nickel! nickel! nickel! nickel,! dime,! quarter!
Two operations:
Flip the rst coin (a) Flip the second coin (b)
quarter!
Assume initially coins are laid out as HH What are all possible ways of applying the operations so that the conguration is TT?
9/8/09

Model as an FA
HH: Flip rst coin
start
Conventions
It helps if we can avoid mentioning the type of every name by following some rules:
Input symbols are a, b, etc., or digits. Strings of input symbols are u, v, . . . , z. States are q, p, etc.
HH
b b
a a b a
TH
b
HH: Flip second coin TH: Flip rst coin TH: Flip second coin HT: Flip rst coin HT: Flip second coin TT: Flip rst coin TT: Flip second coin
HT
a
TT

Final state
Formal Denition of Finite Automaton

Finite set of states, Q. Alphabet of input symbols, . One state is the start/initial state, q0. Zero or more nal/accepting states, the set is F. A transition function, . This function:
Takes a state and input symbol as arguments. Returns a state. One "rule" of would be written (q, a) = p, where q and p are states, and a is an input symbol. Intuitively: if the FA is in state q, and input a is received, then the FA goes to state p (note: q = p OK).
Example: Clamping Logic

We may think of an accepting state as representing a "1" output and non-accepting states as representing "0" output. A "clamping" circuit waits for a 1 input, and forever after makes a 1 output. However, to avoid clamping on spurious noise, we'll design an FA that waits for two 1's in a row, and "clamps" only then. In general, we may think of a state as representing a summary of the history of what has been seen on the input so far.
An FA is represented as the ve-tuple: A = (Q, , , q0 , F).
9/8/09

The states we need are:

State q0, the start state, says that the most recent input (if there was one) was not a 1, and we have never seen two 1's in a row. State q1 says we have never seen 11, but the previous input was 1. State q2 is the only accepting state, it says that we have at some time seen 11. Thus, A = ({q0, q1, q2}, {0, 1}, , q0, {q2}), where is given by:
0 1 By marking the start state with > and accepting states with *, the transition table that denes also species the entire FA
Transition Graph
0! start 1! 1!
0,1!
0!
>q0 q1 *q2
q0 q0 q2
q1 q2 q2
Extension of to Paths
Intuitively, a FA accepts a string w = a1a2 an if there is a path in the transition diagram that:
1. Begins at the start state, 2. Ends at an accepting state, and 3. Has sequence of labels a1,a2 , , an .
Formally, we extend transition function to (q,w), where w can be any string of input symbols: Basis: (q, ) = q (i.e., on no input, the FA doesn't go
anywhere). Induction: (q, wa) = ( (q, w),a), where w is a string, and a a single symbol (i.e., see where the FA goes on w, then look for the transition on the last symbol from that state).
Important fact with a straightforward, inductive proof: ^

^
really represents paths. That is, if w = a1a2 an, and (pi, ai) = pi+1 for ^
all i = 0, 1, . . . , n-1, then (p0, w) = pn
9/8/09

Acceptance of Strings
An FA A = (Q, , , q0, F) accepts string w if
Type Errors
A major source of confusion when dealing with automata (or mathematics in general) is making "type errors."
Example: Don't confuse A, a FA, i.e., a program, with L(A), which is of type "set of strings." Example: the start state q0 is of type "state," but the accepting states F is of type "set of states." Trickier example: Is a a symbol or a string of length 1?
(q0, w) F
Language of a FA
FA A accepts the language L(A) = {w | (q0, w) F}
^
Answer: it depends on the context, e.g., is it used in (q, a), ^ where it is a symbol, or (q, a), where it is a string?
Regular Languages
Union : L1 L2 Intersection : L1 L2 Concatenation : L1L2 Complementation : Star-closure : L1*
9/8/09

Closure under Union

How would you show L1L2 is regular ? We know that:
There is a nite automaton M1 that accepts L1 and a nite automaton M2 that accepts L2 How can we construct a nite automaton that accepts L1L2?
Non-deterministic Finite Automata

Allow (deterministic) FA to have a choice of 0 or more next states for each stateinput pair. Important tool for designing string processors, e.g., grep, lexical analyzers. But "imaginary," in the sense that it has to be implemented deterministically.
Example
Design an NFA to accept strings over alphabet {1, 2, 3} such that the last symbol appears previously, without any intervening higher symbol, e.g.,
11 21112 312123
1,2,3

q
1
1
1
t

p
2

Trick: use start state to mean "I guess I haven't seen the symbol that matches the ending symbol yet." Three other states represent a guess that the matching symbol has been seen, and remembers what that symbol is.
3
s
1,2

9/8/09

Formal NFA
N = (Q, , , q0, F) where all is as DFA, but:
(q, a) is a set of states, rather than a single state. Basis: (q, ) = {q}. Induction: Let:
(q, w) = {p1, p2, , pk}. (pi, a) = Si for i = 1, 2 , k. Then (q, wa) = S1 S2 Sk.
Example
Here is a DFA that accepts a language L consisting of all strings over (a,b) that begin with either aa or bb
a
0
b
2
a
b
t
5
1
a
b
4
a,b
a,b
6
1,2
3 a,b

Extension to
Language of an NFA
An NFA accepts w if any path from the start state to an accepting state is labeled w. Formally:
L(N) = {w | (q0, w) F }.
Suppose we want to make an automaton to recognize REV(L), the language of all strings that end in aa or bb Easy solution: reverse all transitions and interchange start and nal states:

a
0
b
2
a
b
1
a
b
4
But this is not a DFA! 3
1,2
a,b
More than one start state a,b
More than one 5
transition labeled a,b
with the same symbol
NFAs and DFAs

It might seem that because there is a degree of choice available in an NFA that it might be more powerful than a DFA
That is, NFAs might be able to recognize languages a DFA could not
But this is not the case!
This is an NFA
9/8/09

Equivalence of NFAs and DFAs

NFAs and DFAs recognize the same class of languages A bit surprising: NFAs seem more powerful Useful: easier to specify an NFA for a language, convert to DFA
Two machines are equivalent if they recognize the same language
Theorem
Every non-deterministic nite automaton has an equivalent deterministic nite automaton
Proof Idea
If a language is recognized by an NFA, show the existence of a DFA that also recognizes it Convert NFA to an equivalent DFA that simulates the NFA
Proof by construction
Proof
Let N = (Q,,,q0,F) be an NFA recognizing some language A Construct a DFA recognizing A
M = (Q,,,q0,F)
1. Q = the set of subsets of N 2. For R Q and a let (R,a) = {q Q | q (r,a) for some r R
If R is a state of M, it is also a set of states of N (because of 1 above). When M reads a symbol a in a state R, it shows where a takes each state in R. Because each state may go to a set of states, we take the union of all these sets. This can be written as:
Intuitively, can simulate the NFA by keeping track of all the states you can get to on a given input
q0={q0}
M starts in the state corresponding to the collection containing just the start state of N. The machine M accepts if one of the possible states that N could be in at this point is an accept state.
F = {R Q|R contains an accept state of N}.
9/8/09

Visual version
5
The DFA
({7,9},a)= ({7,9},b)= ({5,6,8},a)= ({5,6,8},b)= ({10},a)= ({10},b)= (1,a)={2,3} (1,b)={4} ({2,3},a)={7,9} ({2,3},b)={5,6,8} ({4},a)= ({4},b)={10} ({7,9},a)= ({7,9},b)= ({5,6,8},a)= ({5,6,8},b)= ({10},a)= ({10},b)= a a {1} b {4} b Start state q0={1} Final states F={{5,6,8},{10}} {2,3} b a
{5,6,8}
b a
1 2
Start state q0= {1} (1,a)= {2,3} (1,b)= {4}
{7,9} a,b
a b
3 4
b a b a b
7 8 9 10
({2,3},a)= ({2},a) ({3},a) = {7,9} = {7,9} ({2,3},b)= ({2},b) ({3},b) = {5,6} {8} = {5,6,8} Final states F= ({4},a)=
{{5,6,8},{10}} ({4},b)= {10}
a,b {} {10} a,b
Try this for the NFA constructed before: 1,2,3

q
1
1
p
2
r
2
1

NFA With -Transitions

Allow to be a label on arcs
t
Nothing else changes: acceptance of w is still the existence of a path from the start state to an accepting state with label w. But can appear on arcs, and means the empty string (i.e., no visible contribution to w) When an arc labeled is traversed, no input is consumed 1,2

3
s

9/8/09

Example
0

DFAs and NFA-s

-transitions are a convenience, but do
not increase the power of FA's. For any NFA- there is an equivalent (i.e., accepts the same language) DFA The construction is similar to the NFA-toDFA construction

0
1

001 is accepted by the path q, s, r, q, r, s, with label 0 01 = 001.
Creating a DFA from a NFA-

(or, eliminating -transitions) 1. Compute the -closure for each state (set of states reachable from that state on -transitions only). 2. Start state is -CLOSE(q0). 3. Compute for each a and each set S (each of the -CLOSEd sets) as follows: If a state p S can reach state q on input a (not ), then add a transition on input a from S to -CLOSE(q). 4. The set of nal states includes those sets that contain at least one accepting state of the NFA-.
Example
1. Compute -closure for all states: ECLOSE(q) = {q} ECLOSE(r) = {r,s} ECLOSE(s) = {r,s} q
1
0

2. Compute : ({q},0) = ECLOSE({s})={r,s} ({q},1) = ECLOSE({r})={r,s} ({r,s},0) = ECLOSE({q})={q} ({r,s},1) = ECLOSE({q})={q} 3. Final states FD ={{r,s}}
RESULTING DFA:
q 0,1
r,s
0,1
10

Finite Automata

Hochgeladen von

Dokumentinformationen

Originalbeschreibung:

Copyright

Verfügbare Formate

Dieses Dokument teilen

Dokument teilen oder einbetten

Freigabeoptionen

Stufen Sie dieses Dokument als nützlich ein?

Sind diese Inhalte unangemessen?

Copyright:

Verfügbare Formate

Finite Automata

Hochgeladen von

Copyright:

Verfügbare Formate

9/8/09

CS240 Language Theory and Automata Fall 2009

Any non-tag! space!

FA that recognizes simple identiers

FA for Newspaper Vending Machine!

FA for coin ipping

Formal Denition of Finite Automaton

Example: Clamping Logic

An FA is represented as the ve-tuple: A = (Q, , , q0 , F).

The states we need are:

Important fact with a straightforward, inductive proof: ^

Closure under Union

Non-deterministic Finite Automata

NFAs and DFAs

But this is not the case!

Equivalence of NFAs and DFAs

F = {R Q|R contains an accept state of N}.

Start state q0= {1} (1,a)= {2,3} (1,b)= {4}

a,b {} {10} a,b

Try this for the NFA constructed before: 1,2,3

NFA With -Transitions

DFAs and NFA-s 

001 is accepted by the path q, s, r, q, r, s, with label 0 01 = 001.

Creating a DFA from a NFA-

Das könnte Ihnen auch gefallen

NFA With -Transitions

DFAs and NFA-s