Beruflich Dokumente
Kultur Dokumente
A word lattice may be described using a set of nodes (with word id and time stamp) and a set of arcs
(with scores) connecting the nodes. A word lattice provides a compact representation for a set of sentences
(sequence of words). Typically, a word lattice may be generated by a recogniser and the score of the arcs
may be a combination of acoustic and language model scores.
Now, consider a lattice with the following properties:
• N : number of nodes
• L : number of arcs
α0 = 1 (3)
βN = 1 (4)
where 0 and 1 denote the start and end nodes of the lattice respectively. The arc posteriors are given by the
probability of making a transition through that arc. For an arc connecting nodes i and j, the arc posterior
is given by
αi sij βj
γij = (5)
β0
This is similar to the statistics used to estimate the transition probabilities for an HMM.
Note that when using the posteriors to compute the expected counts for N -grams with N > 2, it
is neccesary to expand the lattice such that distinct nodes are used to represent the contexts. In the
SRILM toolkit, the lattices are not expanded. Instead, context dependent forward probabilities are computed
recursively on the fly when needed.
1
1.1 Example 1
Consider the following lattice:
3
3 4
1 2 5 6
2 3
6 1
4
2
1.2 Example 2
Consider the following lattice:
4
2 2
2 7
1 1 2 1
1 5 9
2 2 2 2
3 8
1
2
6
3
1.3 Lattice expansion for high order N-grams.
Consider the word lattice with the same structure as Example 1:
eating
3
3 4
I like western food
1 2 5 6
2 3
6 1
4
cooking
1 2 6
2
6 3
4 5b
1
cooking western | cooking