Beruflich Dokumente
Kultur Dokumente
13
World Academy of Science, Engineering and Technology 5 2005
For the purpose of this study, we will narrow down the We translate sentence (1) from English to Turkish in four
scope of anaphoric phenomena and focus on a sub-problem of different forms as indicated in sentences (3), (4), (5) and (6).
anaphora resolution, namely, the resolution of 3rd person
singular pronominal anaphora to noun-phrase antecedents. (3) Mr. Smith bir şoför-üi o-nuni,k
Most algorithms in the literature resolve the pronouns ‘he’, Mr. Smith one driver-ACC s/he-GEN-3.SG
‘she’, ‘it’, ‘her’, ‘him’, ‘his’, ‘her’, ‘its’ in English whenever kamyon-u-n-da gör-dü.
they have an antecedent which is a noun phrase. The truck-POSS-3.SG-LOC see-PAST.
algorithm we offer in this study will resolve the pronouns ‘o’, ‘Mr.Smith saw a driver in his truck.’
‘onu’, ‘onun’, and ‘kendi’ in Turkish whenever they have an
antecedent which is a noun phrase. (4) Mr. Smithi bir şoför-ük Øi,k kamyon-u-n-da
Mr.Smith one driver-ACC truck-POSS-3.SG-LOC
B. The Naїve Algorithm
gör-dü.
In his 1977 paper, Hobbs presents two algorithms of
see-PAST.
pronominal anaphora resolution: - a syntax-based algorithm,
‘Mr.Smith saw a driver in (his) truck.’
known as the Naїve Algorithm, and a semantic algorithm. We
will concentrate on the Naїve Algorithm for finding (5) Mr. Smithi o-nunk kamyon-u-n-da
antecedents of pronouns here. Mr. Smith s/he-GEN-3.SG truck-POSS-3.SG-LOC
The Naїve Algorithm consists of a single resolution
procedure based on traversing full parse trees starting from the bir şoför gör-dü.
pronoun in a search for an appropriate antecedent. The one driver see-PAST.
‘Mr.Smith saw a driver in his truck.’
algorithm assumes that the data is presented in the format of
parse trees produced by a particular grammar- namely, the one
(6) Mr. Smithi Øi kamyon-u-n-da bir şoför
where an NP node dominates an N-bar node, to which Mr.Smith truck-POSS-3.SG-LOC one driver
arguments of the head noun attach. The algorithm traverses
the tree, from the pronoun up, stopping on certain S, NP and gör-dü.
VP nodes, searching left-to-right breadth-first in the subtrees see-PAST.
dominated by these nodes. ‘Mr.Smith saw a driver in (his) truck.’
It will be necessary to assume that an NP node has an Nbar In sentences (3), (4), (5) and (6) there are some ambiguous
node below it, as proposed by Chomsky [1], to which a states. Let us look at them one by one:
prepositional phrase containing an argument of the head noun In sentence (3) “onun” may be co-referential with “şoför”
may be attached. Truly adjunctive prepositional phrases are or another person in the previous sentences as the parse tree of
attached to the NP node in English. This assumption, or (3) shows in Fig. 2. The syntactic tree structures of Turkish
something equivalent to it, is necessary to distinguish between which are used in this study are based on [11, 9].
sentences (1) and (2) in English [6]. It is worth noting that Previous Sentence S1
where English has a prepositional phrase we use an NP which
has a locative case in Turkish. NP VP2
14
World Academy of Science, Engineering and Technology 5 2005
15
World Academy of Science, Engineering and Technology 5 2005
5. If the pronoun is ‘kendi’, continue to step 8. (8) a. The castle in Camelot remained the residence of the
6. If X is an NP node, traverse all branches below node king until 536 when he moved it to London[6].
X to the left of path p in a left-to-right, breadth-first
fashion. Propose as the antecedent any accusative NP b. Camelot-ta-ki kale, kral-ın o-nu
node which is immediately dominated by X or Camelot-LOC-REL castle, king-GEN it-ACC
propose as the antecedent any accusative NP node Londra-ya taşı-dı-ğı 536-ya kadar,
that is encountered which has an NP, VP or S node Londra-DAT move-PAST-ACC 536-DAT until,
between it and X.
7. If X is an VP node, traverse all the other branches o-nun rezidans-ı kal-dı.
below node X except path p. Propose as the s/he-GEN-3.SG residence-ACC remain-PAST.
antecedent any accusative NP node which is
immediately dominated by X or propose as the Beginning from node NP1 which is immediately
antecedent any accusative or genitive NP node that is dominating the pronoun ‘onu’, step 3 rises to node NP2. Step
encountered which has an NP, VP or S node between 4 does not apply, because the pronoun is not ‘kendi’. It’s
it and X. passed from step 3 to step 5. Step 5 searches the left portion of
8. From node X go up the tree to the first NP, VP or S NP2’s tree but finds no eligible NP node. Step 6 does not
node encountered. Call this new node X, and the path apply. Step 7 rises to node NP3. It’s passed from step 7 to step
traversed to reach it p. If X is an NP node or a VP 4. Step 5 searches the left portion of NP3’s tree but finds no
node, continue to step 5. If X is an S node, continue to eligible NP node. Step 6 does not apply. Step 7 rises to node
step 9. VP1 and it’s passed from step 7 to step 4. Step 5 does not
9. If the pronoun is “kendi”, the antecedent is a apply, it’s passed to step 6. Step 6 searches the all branches
nominative or genitive case-marked NP preceding it. below node VP1 except path p and proposes NP4 as
If the pronoun is not “kendi”, continue to step 10. antecedent. NP4 correctly determines ‘rezidansı’ as the
10. If node X is the highest S node in the sentence, antecedent of ‘onu’, as shown in Fig. 6.
traverse the surface parse trees of previous sentences S
proposed as the antecedent. If X is not the highest S Nbar Nbar NP3_dat post NP4_acc VP
node in the sentence, continue to step 11. noun noun NP2_nomV NP_dat
kadar
pronoun NP5_acc verb
11. From node X, go up the tree to the first NP, VP or S Camelot'taki kale onun kaldı
NP_gen NP1_nomV Nbar Nbar
node encountered. Call this new node X, and call the
path traversed to reach it p. Nbar pronoun NP_nomV noun noun
onu 536'ya rezidansı
12. If X is an NP node and if the path p to X did not pass noun Anaphora NP_dat NP_nomV Antecedent
through the Nbar node that X immediately dominates, kralın
Nbar Nbar
propose X as the antecedent.
noun noun
13. If X is an NP node and if the path p passed through Londra'ya taşıdığı
the N-bar node that X immediately dominates,
traverse all branches below node X to the left of path Fig. 6. The illustration of the parse tree of sentence (8b), the
p in a left-to-right, breadth-first manner. Propose any algorithm working on it and the determination of the
NP node encountered as the antecedent. antecedent of anaphora ‘onu’.
14. If X is a VP or S node, traverse all branches of node X
to the right of path p in left-to-right, breadth-first If we search for the antecedent of ‘onun’, beginning from
manner, but do not go below any NP or VP or S node node NP1 immediately dominating the pronoun ‘onun’, step 3
encountered. Propose any NP node encountered as the rises to node VP1. Step 4 does not apply, because the pronoun
antecedent. is not ‘kendi’. Step 5 does not apply and it’s passed from step
15. Go to step 10. 3 to step 6. Step 6 searches the all branches below node VP1
except path p. Firstly it’s proposed NP2 as antecedent in step
As [6] points out, a breadth-first search of a tree is one in 6. Thus, ‘536-ya’ is recommended as the antecedent of ‘onun’.
which every node of depth n is visited before any node of The algorithm can be improved somewhat by applying
depth n+1. simple selectional constraints, such as; Dates and places and
Figures 2, 3, 4 and 5 illustrate the algorithm working on the large fixed objects can’t move [6].
sentences (3), (4), (5) and (6). Figures 6 and 7 illustrate the After NP2 is rejected, it’s proposed NP3 as antecedent in
algorithm working on the sentence (8b) which is the step 6. And finally ligting upon NP3 ‘kralın’ as the antecedent
translation of the sentence (8a) from English to Turkish for of ‘onun’ in step 6 as shown in Fig. 7.
determining the antecedents of each anaphora.
16
World Academy of Science, Engineering and Technology 5 2005
REFERENCES
[1] Chomsky, N., “Remarks on nominalization.” In: R.Jacobs and
P.Rosenbaum(eds.), Readings in transformational grammar, 184-221.
Waltham, Mass.:Blaisdell, 1970.
[2] Denber M., “Automatic Resolution of Anaphora in English”, Technical
Report, Eastman Kodak Co. Imaging Science Division, June 30,1998;
http://www.wlv.ac.uk/~le1825/anaphora_resolution_papers/denber.ps
[3] Erguvanlı-Taylan E., “Pronominal versus Zero Representation of
Anaphora in Turkish”, Studies in Turkish Linguistics, 1986.
[4] Hobbs J.R., “Pronoun Resolution,”.Research Report 76-1, Department
of Computer Sciences, City College, City University of New York,
August 1976.
[5] Hobbs J.R., “38 Examples of Elusive Antecedents from Published
Texts,” Research Report 77-2, Department of Computer Sciences, City
College, City University of New York, August 1977.
[6] Hobbs J.R., “Resolving Pronoun References,” Lingua, Vol. 44, pp. 311-
338. Also in Readings in Natural Language Processing, B. Grosz, K.
Sparck-Jones, and B. Webber, editors, pp. 339-352, Morgan Kaufmann
Publishers, Los Altos, California, 1978.
[7] Huang Y., “Anaphora: A Cross-linguistic Approach,” New York:
Oxford University Press, 2000.
[8] Kennedy, C. and Boguraev, B., “Anaphora for everyone: Pronominal
Anaphora Resolution without a Parser.”, COLING 96 pages 113-118
(89), 1996.
[9] Kennelly, S.D., “Nonspecific External Arguments in Turkish”, Dilbilim
Araştırmaları 7, İstanbul, p.58-75, 1997.
[10] Kılıçaslan, Y., “Information packaging in Turkish.” Unpublished MSc.
Thesis, University of Edinburg, Edinburg, 1994.
17