Beruflich Dokumente
Kultur Dokumente
Trends in Linguistics
Studies and Monographs 22
Editor
Werner Winter
Mouton Publishers
Berlin New York Amsterdam
Contrastive Linguistics
Prospects and Problems
edited by
Jacek Fisiak
Mouton Publishers
Berlin New York Amsterdam
Professor Jacek Fisiak
Institute of English
Adam Mickiewicz University
Poznan, Poland
Contrastive linguistics.
Copyright 1984 by Walter de Gruyter & Co., Berlin. All rights reserved, including
those of translation into foreign languages. No part of this book may be reproduced in
any form - by photoprint, microfilm, or any other means - nor transmitted nor trans-
lated into a machine language without written permission from Mouton Publishers,
Division of Walter de Gruyter & Co., Berlin.
Typesetting: Mercedes-Druck, Berlin. - Printing: Druckerei Hildebrand, Berlin. -
Binding: Lderitz & Bauer Buchgewerbe GmbH, Berlin. Printed in Germany
Preface
The present volume contains a selection of papers prepared for the 4th Inter-
national Conference of Contrastive Projects held at Charzykowy (Poland)
from 3 - 6 December 1980. (Two papers have been accepted for publication
(i.e. Filipovic and Gussmann) although they were not presented at the con-
ference).
Unlike the previous conferences of contrastive projects in Zagreb (1970),
Bucharest (1975) and Trier (1978), the one at Charzykowy had no papers
summarizing the progress of particular contrastive projects. The organizers
assumed that the ideas and trends developed within different projects can be
most profitably presented by papers addressing themselves to substantive
issues in the field of contrastive linguistics. Therefore it was considered that
most interesting results would be achieved if some scholars were invited to
speak on specific topics and others to discuss problems of special interest to
them. It seemed to us that this would help to identify additional problems
for further research and would point to further uses of contrastive research
apart from the ones widely discussed over the last few years.
Those who expect to find definite answers to numerous, often contro-
versial questions posed by contrastive research will be disappointed. What the
conference has demonstrated is firstly the value of contrastive research for
finding solutions to theoretical linguistic issues, e.g. papers by Borsley, Grze-
gorek, Horn, Gussmann, Jaworska or Zabrocki. Secondly, the conference wit-
nessed a very productive broadening of the scope of contrastive research by
going beyond the more traditional sentence-bound orientation into the fields
of contrastive pragmatics, text linguistics and discourse analysis (e.g. Enkvist,
Fillmore, Faerch and Kasper, Oleksy and Szwedek). Thirdly, the conference
has proved the necessity for further refinement of some fundamental concepts
of contrastive linguistics (e.g. Krzeszowski and Filipovic). Fourthly, the con-
ference has made it clear that in spite of scepticism voiced by some scholars as
regards the application of contrastive research (cf. Khlwein), the present
research in the area of second language acquisition and psycholinguistics (e.g.
Sajavaara and Sharwood Smith), reinvestigation of error analysis (e.g. Mukat-
vi Preface
tash) and research in the field of translation leave no doubt that the uses of
contrastive linguistics are quite numerous and of a different calibre than ex-
pected at the outset of pedagogically oriented contrastive work.
In general it can be easily noticed that the volume contains more theoretic-
ally oriented linguistic works (for my distinction between theoretical and
applied contrastive analysis made earlier in 1968, 1970, 1973, 1975 etc. see
several references in this volume, e.g. papers by Khlwein and Sajavaara) al-
though the applied side has not been totally neglected. It seems that this
reflects the situation in contrastive linguistics in general if one looks both at
the history of the discipline and the amount of theoretical works produced
in both hemispheres nowadays.
The editor of this volume hopes, therefore, that the essays presented here
will be of interest both to theoretical linguists as well as those readers who
will look for some insight into the field of application of contrastive analysis.
The 4th International Conference on Contrastive Projects was sponsored
by Adam Mickiewicz University in Poznan and organized by its Institute of
English. On this occassion words of thanks are due here to Professor Stefan
Kozarski, Pro-vice-Chancellor of Adam Mickiewicz University, for substantial
financial support in the days of budgetary cuts and economic stringency.
Our thanks also go to Ms. Barbara Piocinska, M.A., for a skillful and
devoted handling of the administrative arrangements of the conference in all
stages of its organization.
Finally our gratitude is due to all the conference members whose active
participation has greatly contributed to the final shape of the present volume.
Preface
List of conference participants ix
ROBERT D. BORSLEY
Free relatives in Polish and English 1
STEFAN DYtA
A note on inversion/conjunct ascension constructions in Polish and
English 19
NILS ERIK ENKVIST
Contrastive linguistics and text linguistics 45
CLAUS F/ERCH and GABRIELE KASPER
Ja und? - og hva s? a contrastive discourse analysis of gambits in
German and Danish 69
RUDOLF FILIPOVIC
What are the primary data for contrastive analysis? 107
CHARLES J. FILLMORE
Remarks on contrastive pragmatics 119
MARIA GRZEGOREK
English sentences with introductory there and their Polish counter-
parts 143
EDMUND GUSSMANN
Abstract phonology and contrastive analysis 171
RAYMOND HICKEY
Towards a contrastive syntax of Irish and English 187
GEORGE M. HORN
Constraints on transformations: evidence from contrastive analysis . 205
viii Contents
EWA JAWORSKA
On the structure of adverbial subordinate constructions in English
and Polish 251
ANDRZEJ KOPCZYNSKI
Problems of quality in conference interpreting 283
TOMASZ P. KRZESZOWSKI
Tertium comparationis 301
WOLFGANG KHLWEIN
Pedagogical limitations of contrastive linguistics 313
LEWIS MUKATTASH
Contrastive analysis, error analysis and learning difficulty 333
WIESLAW OLEKSY
Towards pragmatic contrastive analysis 349
JERZY RUBACH
Rule ordering in phonological interference 365
KARI SAJAVAARA
Psycholinguistic models, second language acquisition, and con-
trastive analysis 379
MICHAEL SHARWOOD SMITH
Learnability and second language acquisition 409
ALEKSANDER SZWEDEK
Some problems of contrastive analysis and text linguistics 419
TADEUSZ ZABROCKI
On the nature of movement rules in English and Polish 431
Index of names 445
List of conference participants
Director
Participants
Docent Wieslaw Awedyk Adam Mickiewicz University, Poznan
Professor John L. Behling University of Sofia
Dr. Robert Borsley University College, London
Dr. Paul van Buren University of Utrecht
Dr. Donna Christian Adam Mickiewicz University, Poznan
Docent Liudvika Drazdauskiene Vilnius State University
Dr. Stefan Dyta Adam Mickiewicz University, Poznan
Dr. Emilia Dykczak Adam Mickiewicz University, Poznan
Professor Nils Erik Enkvist bo Akademi
Dr. Claus Faerch University of Copenhagen
Professor Charles J. Fillmore University of California, Berkely
Dr. Jadwiga Fisiakowa Adam Mickiewicz University, Poznan
Professor Rosemarie Glser University of Leipzig
Professor Claus Gnutzmann University of Hannover
Ms. Elzbieta Gorska Adam Mickiewicz University, Poznan
Dr. Maria Grzegorek Adam Mickiewicz University, Poznari
Dr. Raymond Hickey University of Bonn
Dr. Gabriele Hohenwart University of Salzburg
Dr. Helga Hhlein Technical University, Aachen
Professor George Horn University of Newcastle, Australia
Dr. Aleksandra Horn University of Newcastle, Australia
Mr. Gerhard Jakob University of Mannheim
X List of conference participants
Bresnan and Grimshaw (1978) reject the traditional assumption that English
free relatives are headless. Instead, they argue that the wh-phrases are heads,
and that they involve not w/i-movement but a rule of controlled-pro-deletion.
They assume, for example, that the free relative in (1) has the underlying
structure represented in (2).
They also discuss free relatives in certain other languages. They appear to be
suggesting that their analysis is universally applicable. They suggest, for
example, that the form of free relatives in Tok Pisin, an English-based pidgin
spoken in Papua New Guinea, "strikingly confirms our hypothesis" (1978:
377). It is hard to see how this could be relevant unless the analysis is assumed
to be universal. (Woolford 1978, which Bresnan and Grimshaw draw on, ex-
plicitly assumes that free relatives are essentially similar in all languages.) They
go on, however, to deny that the analysis is universally applicable by suggest-
ing that free relatives are headless in Moroccan Arabic. In this paper, I will
look in some detail at Polish free relatives. I will show that there is strong
evidence that they are headless relatives involving w/z-movement and thus that
Bresnan and Grimshaw's analysis is certainly not universally applicable. Then,
I will argue that it is not at all obvious that their analysis is the correct one
for English.
An important weakness of Bresnan and Grimshaw's discussion is their failure
to recognize that there are at least two distinct headless analyses that might
be proposed for free relatives. (This point is clearly recognized in Woolford
1978.) On the one hand, there is an analysis in which they are isomorphic
2 Robert D. Borsley
(3)
COMP
(that) COMP
The above data suggest that Polish free relatives are quite like English free
relatives. In some respects, this is the case. There is strong evidence, however,
against a Bresnan-Grimshaw analysis. I will consider this shortly. First, how-
ever, I want to argue that Polish free relatives are not isomorphic with em-
bedded questions.
The simplest argument against such an analysis involves number agree-
ment. Bresnan and Grimshaw point out that when a free relative appears in
subject position, the form of the verb depends on the number of the wh-
phrase. If the w/z-phrase is plural, the plural form of the verb is necessary.
(19) illustrates this.
This is not the case, however, with an embedded question in subject position,
as (20) illustrates.
The situation is just the same in Polish. We have data like the following:
f/esf (sg.)| , .
(22) Jakie ksiqzki ma Maria j ^
I *sq (pi.) J Jemnicel'
what books has Mary is/are secret
'What books Mary has a secret.'
These contrasts are quite inexplicable if free relatives are isomorphic with
embedded questions. They are quite natural on a Bresnan-Grimshaw analysis.
They are also quite natural, however, on a superficially headless analysis. On
such an analysis, the wh -phrase will agree in number with the deleted or phono-
logically empty head, and the latter will determine the form of the verb.
We can turn now to the evidence against a Bresnan-Grimshaw analysis.
Some initial evidence comes from the fact that for every free relative intro-
duced by a simple -word, there is an identical demonstrative-headed rela-
tive clause. Thus, parallel to (5) - (9), we have (23) - (27).
(26) Wtedy, kiedy Piotr jest nami, jest nam bardzo przyjemnie.
then when Peter is with us is (to) us very pleasant
'It's very pleasant to us when Peter is with us.'
(27) Tak, jak to zrobisz, na pewno bdzie dobrze.
so how it (you) will do certainly will be good
'The way you do it will certainly be good.'
If the w/z-words in Polish free relatives are heads, this fact will be quite
accidental. If we assume a superficially headless analysis, however, it will be
quite natural. Such an analysis will either involve a rule deleting a demon-
strative head under certain circumstances or will allow a phonologically empty
head under certain circumstances. In either case, it is only to be expected that
there will be many cases where free relatives are identical to demonstrative-
headed relative clauses.
6 Robert D. Borsley
Some further evidence against Bresnan and Grimshaw comes from the fact
that there is no independent evidence for a general rule of controlled-pro-
deletion. In English, relative clauses introduced by that provide independent
motivation for such a rule. There is good evidence that that is a complemen-
tizer when it introduces a relative clause. (See, for example, Morgan 1972.) It
is natural, therefore, to assume that relative clauses introduced by that in-
volve deletion.4 In Polish, the situation is rather different. Relative clauses
normally involve w/i-movement. There are, however, relative clauses that
appear to involve deletion. Consider, for example, (28).
Thus, the free relatives in (43) - (46) pose serious problems for a Bresnan-
Grimshaw analysis. Such free relatives pose no problems, however, for a head-
less analysis. A central fact about wh-movement in Polish is that it can violate
Ross's (1967) left branch condition. The following illustrate.
Thus, if Polish free relatives are headless relatives involving w/z-movement, the
free relatives in (43) - (46) are only to be expected.
Although the facts surveyed so far provide evidence against a Bresnan-
Grimshaw analysis, they are compatible with an analysis of the kind that
Hirschbiihler proposes. There are facts, however, that argue against both kinds
of analysis.
Some important evidence comes from the fact that there are no cases else-
where in Polish of embedded finite clauses not introduced by some element
in COMP. Ordinary relative clauses are always introduced by some element in
COMP (usually a wh-word). Thus, only the a. examples in the following are
grammatical.
(51) a. Ten cztowiek, ktorego Maria kocha, jest gtupi.
the man who mary loves is foolish
'The man who Mary loves is a fool.'
b. *Ten cztowiek, Maria kocha, jest gtupi.
(52) a. Ksiqzki, ktore mi dates, sq bardzo ciekawe.
books which me (you) gave are very intersting
'The books which you gave me are very interesting.'
b. *Ksiqzki, mi dates, sq bardzo ciekawe.
10 Robert D. Borsley
Here, we have a free relative with swoj within the w/i-phrase and the main
clause subject as the antecedent. The fact that such sentences are ungram-
matical suggests strongly that the wh-phrases in free relatives are in COMP and
are not heads.
The distribution of swoj in fact provides another argument against a Bresnan-
Grimshaw analysis. Both (62) and (64) are grammatical if the deleted com-
plement subject is the antecedent of swoich. On either a Bresnan-Grimshaw
analysis or a Hirschbhler analysis, (64) will on this interpretation involve a
form of swoj with an antecedent in a lower clause. In general, this is impos-
sible. (65) illustrates.
(65) *Jan zaspiewa kazdq ze swoichf piosenek, jakq tyi wy-
bierzesz.
John will sing each from your songs which you will choose
'John will sing each of your songs that you choose.'
(69) The head of a relative clause and the wA-phrase have the
same categorial specification.
Thus, the matching effect argues against an analysis of free relatives in which
they are isomorphic with embedded questions but is quite compatible with a
superficially headless analysis.
The situation is similar with certain facts relating to what Bresnan and
Grimshaw term the Internal NP-over-S Constraint. This constraint, which has
its origins in the work of Ross (1967) and Kuno (1973), stipulates that sen-
tences exhaustively dominated by NP cannot appear internal to phrases of
which they are immediate constituents. It accounts for the dubious character
of sentences like the following.
Clearly, this contrast argues that free relatives are not isomorphic with em-
bedded questions. It poses no real problems, however, for a superficially
headless analysis. If the constraint is a surface filter, it will provide evidence
against a superficially headless analysis involving a deleted head. It will pose
no problems, however, for a superficially headless analysis involving a phono-
logically empty head. Again, then, the facts Bresnan and Grimshaw cite pro-
vide evidence against an analysis of free relatives in which they are isomorphic
with embedded questions, but they are quite compatible with a superficially
headless analysis.
It is clear, then, that three of Bresnan and Grimshaw's arguments only pro-
vide evidence against an analysis of free relatives in which they are isomorphic
with embedded questions. There are three other arguments that might seem
to provide some evidence against a superficially headless analysis. I will show,
however, that doubts can be raised about all of them.
The first argument involves certain wh -constructions where it is fairly clear
that the -words are not ordinary wh-words in COMP. The following illu-
strate.
(74) She wrote whenever possible.
(75) Eat what you please.
Bresnan and Grimshaw show that it is not plausible to derive the w/i-construc-
tions here from ordinary free relatives and hence that the wh -words cannot be
ordinary wh-words in COMP. They seem then to suggest that considerations
of simplicity support the view that the wh -words in free relatives are not
ordinary wh -words in COMP either.
Polish provides an objection to this argument. In Polish, -words with the
suffix -kolwiek occur quite widely in contexts where they are clearly not
ordinary -words in COMP. Roughly, they occur in all the contexts in which
English any forms occur except negative environments, where negated forms
appear. The following illustrate.
It is clear, then, that the appearance of w/j-words in contexts where they can-
not be ordinary w/i-words in COMP does not in any way necessitate the con-
clusion that they are not ordinary wft-words in COMP in free relatives.
The second argument involves pied piping. Unlike Polish free relatives,
English free relatives never involve pied piped prepositions. A superficially
headless analysis leads one to expect pied piped prepositions in free relatives.
It looks, then, as if we have a real argument against such an analysis here. It
is not clear that this is the case, however. As we have seen, free relatives with
pied piped prepositions are sometimes acceptable in Polish. Often, however,
they are ungrammatical.None of the following,for example, are grammatical. 6
Notes
1. I am grateful to Ewa Jaworska for placing her Polish intuitions at my disposal and
providing much other assistance, and to Frank Heny, George Horn, Dick Hudson and
Neil Smith for a number of helpful comments.
2. A further headed analysis is advanced in Jackendoff 1977 (chapter 9). This involves
a rule moving a wA-word from S directly to head position. The facts that argue against
a Hirschbhler analysis also argue against an analysis of this kind.
3. Another interesting class of free relatives involve an idiomatic use of the negative
particle nie. Free relatives of this kind seem to be similar to free relatives involving
kolwiek. The following illustrate.
(i) Co nie zrobi$, zawsze jest tie.
what not (I) do always is bad
'Whatever I do is always wrong.'
(ii) Kogo nie zapytasz, wskaze ci drogq.
who not (you) will ask (he) will show you way
'Whoever you ask will show you the way.'
4. Chomsky, of course, would assume that such relative clauses involve wA-movement
and deletion in COMP. I would reject this view. For criticisms of Chomsky's
views on wA-movement and deletion, see Bresnan (1976), Grimshaw (1974), Allen
(1980) and Borsley (1981).
5. I have glossed jak here as 'how' because it appears elsewhere as an adverbial wA-word.
There is strong evidence, however, that it is not a wA-word here and that such exam-
ples involve deletion. See Borsley (1981).
6. A plausible suggestion, I think, is that pied piped prepositions are only possible in
free relatives that have an adverbial function. It seems reasonable to say that the free
relative in (66) has an adverbial function. The free relative in (67) appears to be a
subject. It is possible, however, that the sentence involves a deleted subject pronoun
and thus that the free relative has the same role as in (66). It is interesting to note in
this context that Old English had a distinctive class of free relatives used in an ad'
verbial role. See Allen (1980).
References
Allen, C.
1980 "Movement and deletion in Old English", Linguistic inquiry 11:261-323.
Borsley, R. D.
1981 "WA-movement and unbounded deletion in Polish equatives", Journal of
linguistics 17:270-288.
Bresnan, J. W.
1976 "Evidence for a theory of unbounded transformations", Linguistic analysis
2:353-393.
Bresnan, J. W. - J. Grimshaw
1978 "The syntax of free relatives in English", Linguistic inquiry 9:331-391.
Farkas, D. - W. M. Jacobsen - K. W. Todrys (eds)
1978 Papers from the Fourteenth Regional Meeting of the Chicago Linguistic
Society (Chicago, Illinois: University of Chicago).
Ford, A. - J. Reighard - R. Singh (eds)
1976 Papers from the Sixth Annual Meeting of the North Eastern Linguistic Soci-
ety (Montreal: McGill University).
18 Robert D. Borsley
Grimshaw, J.
1974 "Evidence for relativization by deletion in Chaucerian Middle English", in:
Kaisse - Hankamer (eds) 1974:216-224.
Hirschbiihler, P.
1976 "Two analyses of free relatives in French", in: Ford et al. (eds) 1976.
Jackendoff, R. S.
1977 X syntax: A study of phrase structure (= Linguistic inquiry monograph, 2)
Cambridge, Mass.: M.I.T.Press).
Kaisse, E. - J. Hankamer (eds)
1974 Papers from the Fifth Annual Meeting of the North Eastern Linguistic Soci-
ety (Cambridge, Mass.: Harvard University).
Kuno, S.
1973 "Constraints on internal clauses and sentential subjects", Linguistic inquiry
4:363-385.
Morgan, J. L.
1972 "Some aspects of relative clauses in English and Albanian", in: Peranteau
et al. (eds) 1 9 7 2 : 6 3 - 7 2 .
Peranteau, P. M. - J. Levi - G. Phares (eds)
1972 The Chicago which hunt (Chicago, Illinois: Chicago Linguistic Society).
Ross, J. R.
1967 Constraints on variables in syntax (Ph. D. dissertation, M.I.T.).
Woolford, E.
1978 "Free relatives and other base generated WH constructions", in: Farkas et al.
(eds) 1978:482-490.
STEFAN DYLA
1 Introduction
(1) Figure 1
(2) Figure 2
Inversion/conjunct ascension in Polish and English 21
1.2 Evidence for inversion and conjunct ascension in Polish and English
Surprisingly, 3 arcs which are not local successors of 1 arcs cannot do the
unhooking, as is evidenced by the ill-formedness of the following sentence
under a reading where it was me who returned home.
22 Stefan Dyta
One might claim that (15) involves conjunct ascension because it is truth-
conditionally equivalent to (14), in which the conjunction Marek iJola is the
head of a 1 arc at both the initial and final stratum. This, however, is a very
weak argument. A much more telling one is provided by the fact that the
following sentence, which is also truth-conditionally equivalent to (14), is
perfectly well-formed.
Despite the fact that it is Marek and not Marek Jolq which is the head of
a final 1 arc, the verb appears in plural. This means that what we have to do
here is an instance of agreement triggered by an initial 1 arc. If Marek Jolq
were not a conjunction at any stratum, then the plural agreement on the verb
in (16) would be unaccountable. The conclusion follows that sentences like
(15) and (16) involve an immigrant 1 arc whose foreign predecessor is a Con
arc and whose 1 arc local sponsor is overrun.
(19) Figure 4
(20) Figure 5
Stefan Dyia
(22) Figure 7
Inversion/conjunct ascension in Polish and English 27
(23) Figure 8
Note also that the head of a Gen arc in (3435) is semantically possessor.
Since possessors are semantically paired with possessions, it will not be
counterintuitive to claim that every self-sponsoring Gen arc must have a
neighbor at the initial stratum which is not a Gen arc. Thus, one may con-
clude that the Gen arcs headed by Tom's and Tomka in (34) and (35), re-
spectively, have neighbors at the initial stratum which are foreign erased by
final arcs headed by apartment and samochod, respectively.
30 Stefan Dyia
It follows from the above data that the genitive case marking is assigned to
heads of final absolutive arcs in action nominalizations. Surprisingly, what
seems to be the head of a final ergative arc in the examples below is likewise
marked with the genitive case.
(45) ogolenie sif Tomka
(having-shaved REFL GEN)
'Tomek's having shaved himself
3.1 Type I
Consider the following sentences:
(48) Sue is similar to Pat.
(49) Rysiek jest podobny do Leszka.
(NOM is similar to GEN)
'Rysiek is similar to Leszek.'
32 Stefan Dyia
(50) Figure 9
3.2 Type II
Consider the following English sentence:
(51) The street parallels the railway.
It appears that this type of construction is almost identical to Type I except
that it additionally involves a 2 arc local successor of a 3 arc, which defines it
as a 32 advancement construction. Thus, it is reconstructive in terms of
APG PNs as a PN of the form shown in Figure 10.
Inversion/conjunct ascension in Polish and English 33
3.4 Type IV
The fact that Sam's immediately follows the verb indicates that the nominal
it modifies is the head of a final 2 arc in (56). Therefore, it is quite safe to
claim that this type of construction differs from the previous one only in
that it is also a 3 - 2 advancement construction. In other words, sentences
like (56) involve 2 arc local successors of 3 arcs. In APG, such sentences are
analyzed as involving PNs of the form presented in Figure 12.
Other predicates which can occur in this type of construction:
a. English: to parallel
b. Polish: the construction is unattested.
3.5 Type V
(57) Figurel2.
3.6 Type VI
Consider the following example:
(63) He equals you in strength.
Like the previous type, this type of construction involves a pair of immigrant
Con arcs whose foreign predecessors are Gen arcs and whose local sponsors
are Con arcs which are supports of their foreign predecessors. But it also in-
volves a 2 arc locally sponsored by a 3 arc. Thus, in terms of PNs, the struc-
ture of sentences like (63) is reconstructed as a PN of the form shown in
Figure 14.
Other predicates which can occur in this type of construction:
a. English: parallel
b. Polish: the construction is unattested.
Inversion/conjunct ascension in Polish and English 37
o f
C3C4C5
3.9 Type IX
Consider the following sentence from Polish:
(71) Maria rozni si? od Ewy zachowaniem.
(NOM differs REFL from GEN behavior/INSTR)
'Maria differs from Ewa in the way she behaves.'
Sentences like the above have something in common with Type V, as they
are genitive ascension constructions too, and with Type VII, as they involve
antipassivization. Like Types VII and VIII, they can be analyzed in two ways:
either in the way shown in Figure 19 or in the way shown in Figure 20.
Other predicates which can occur in this type of construction:
a. English: the construction is unattested
b. Polish: unknown.
3.10 TypeX
Hitherto, I have been dealing with inversion/CA constructions in which an im-
migrant, 1 arc whose foreign predecessor is a Con arc and a 3 arc have the
Inversion/conjunct ascension in Polish and English 41
CH(AC 3 C 4 C5
\t
Slf
(72) Figure 1 9 .
C 2 C3C4C5
Yes
(73) Figure 2 0 .
42 Stefan Dyta
same 1 arc local sponsor. The number of predicates which can occur in such
constructions is rather limited, in some cases, I could list only one such pre-
dicate. In this section, I will present a construction in which a 3 arc is locally
sponsored by an immigrant 1 arc foreign sponsored by a Con arc. Moreover,
as will become clearer later, this type of construction is fully productive, as
any predicate which can take a conjunction as a head of the final 1 arc can
occur in this type of construction. This type of construction is illustrated by
the following pair of examples:
(74) Markowi dobrze sif Krzysztofem pracuje.
(DAT well REFL with INSTR work/IMPERS)
'Marek feels that his joint work with Krzysztof goes well.'
(75) Marii nie mieszka sif EWQ zbyt wygodnie.
(DAT NEG live/IMPERS REFL with INSTR too comfort-
ably)
'Maria does not feel comfortable sharing a room with Ewa.'
Characteristically, sentences like the above obligatorily take Manner arcs.
Their structure can be reconstructed as a PN of the form shown in Figure 21.
The reason why I argue here that in sentences like ( 7 4 - 7 5 ) the final 3 arc
is locally sponsored by the immigrant 1 arc and not vice versa is that neither
Polish nor English CA constructions allow immigrants whose foreign pre-
decessors are Con arcs to be locally sponsored by 3 arcs. That this is true can
be seen in the fact that sentences like ( 7 7 ) and ( 7 8 ) can only be interpreted in
the way suggested in ( 7 9 ) and (80), respectively.
Notes
1. I would like to express here my gratitude to Dr. Robert Borsley for insightful com-
ments.
2. For a discussion of Inversion in as diverse languages as Russian and Kannada, cf. Perl-
mutter (to appear). For a discussion of conjunct ascension, cf. Pullum 1975, 1977.
3. This approach rules out the distinction between a syntactic rule and a lexical redun-
dancy rule, as a grammar of a particular language does not contain rules like inversion
or conjunct ascension. Of course, restrictions on particular lexical items which can or
cannot occur in a given type of construction are idiosyncratic and should be handled
in the lexicon.
4. In the glosses, the following abbreviations will be used: NOM = nominative, ACC =
accusative, GEN = genitive, INSTR = instrumental, LOC = locative, NEG = negation,
IMPERS = the impersonal form of the verb, PI. = plural, REFL = reflexive, Sg. =
singular.
5. Beheading should not be confused with uncontrolled beheading discussed by Borkin
(1972).
6. In TG, the relation between sentences like the (a)-forms and the (b)-forms of ( 3 8 -
39) was handled by a rule called Promotion (Ross 1973). The following predicates
can occur in this type of the genitive ascension construction:
a. English: to annoy, to amuse, to astonish, to amaze, to bother, to horrify, to ir-
ritate, to scare, to frighten, to terrify, to worry, etc.
b. Polish: denerwowac 'to make nervous', dziwic 'to amaze', rmrtwic 'to worry',
bawic 'to amuse', irytowac 'to irritate', przerazic 'to scare', rozimieszac
'to make laugh', etc.
44 Stefan Dyta
7. Postal (1977) noted that Reflexive, Reciprocal and Unspecified Object (UO) Clauses
behave as if they were finally intransitive with respect to Clause Union, and to ac-
count for this strange fact he postulated that French has a rule of antipassive.
8. Note that the following Polish sentences is ill-formed:
i. *Waga Janka jest rowna Tomka.
weight/NOM GEN is equal GEN
'Janek's weight is equal to Tomek's.'
One might claim that the genitive case marking is not sufficient to code the 3-hood of
the final arc headed by the beheaded nominal, but how to explain then the well-
formedness of sentences like (35) above?
References
Borkin, A.
1972 "Conference and beheaded NP's", PIL 5:28-45.
Johnson, D. . - P. M. Postal
1980 Arc pair grammar (Princeton: Princeton University Press).
Perlmutter, D. M.
to appear "Evidence for inversion in Russian and Kannada", Studies in relational
grammar, edited by D. M. Perlmutter.
Postal, P. M.
1977 "Antipassive in French",NELS 7:273-313.
Pullum, G. K.
1975 "Squibs on relational grammar", unpublished papers nos. 1 - 6 , University
College, London.
1977 "Word order universals and grammatical relations", SynS 8:107-111.
Ross, J. R.
1973 "Nouniness", Three dimensions of linguistic theory, edited by Osamu
Fujimura (Tokyo: TEC corp.), 137-257.
NILS ERIK ENKVIST
encompass various aspects of cognition and interaction. But here I shall only
describe current models, without filling in the full bibliographical data or
viewing the material in a historical perspective. Nor shall I try to relate my
descriptive classification to any linguistic or syntactic theory.
Allowing for some overlap, then, text models can be descriptively grouped
into four major categories:
(a) Sentence-based text models. These models regard texts as strings of
sentences which are given as an input for analysis and description. Sentence-
based models add textual features to the traditional concepts of sentence
grammars. They can explain many phenomena of intersentential co-reference,
for instance in terms of pronominal reference across sentence borders, and
cross-reference (a good example is Halliday Hasan 1976). They can also
show how sentences are given their textual fit with the aid of lexical and syn-
tactic devices, theme-rheme-focus (or topic-comment-focus) structures, and
overall theme-dynamic patterns of the kind described in DaneS 1974, Enkvist
1974, and a number of other papers. Sentence-based models are thus capable
of describing intersentential cohesion such as it appears in the given sentences
of a given text. But sentence-based models must accept the original sentence
division of the text, which itself is part of the input and cannot be manipu-
lated. If we want to show how such an input might be related to other texts
involving a different sentence division of a similar text, we must go on to
devise
(b) Predication-based text models. These models start out, not from a text
consisting of ready-made sentences but from a set of predications and inter-
predicational semantic relations (temporal, causal, etc.). These predications
must be given in some canonical form based on logic or semantics, or directly
in some natural language (cf. Chomsky's kernel sentences of 1957). The model
then shows how these predications can be textualized with the aid of grouping
which involves conjunction and embedding. Such operations are governed
by a text strategy. Thus different strategies will result in different textualiza-
tions of the same input predications, as illustrated for instance in Sigurd 1977
and Kllgren 1979. One might compare a predication to the physicist's atom
and call it a text atom. Just as the same set of atoms can combine into differ-
ent molecules, the same set of text atoms (predications) can be combined in-
to different texts, each of which reflects a text strategy of its own.
Predication-based models can thus explain relations between different tex-
tualizations of the same input predications. The model can of course also be
used analytically: we then take a text and split it up into its underlying pre-
dications. If we want to compare the given text with other potential textuali-
zations of the same predications we can go on to retextualize them with the
aid of different strategies. This makes possible a comparison of a given text
Contrastive linguistics and text linguistics 47
with the other potential texts that can be built out of the same predications
- an operation that is of particular interest to students of style. A pioneering
experiment along these lines was Ohmann 1964.
What a predication-based model cannot do, however, is to explain the
origin of predications, or the reasons why a given speaker or writer has, in a
given situation, chosen to textualize a certain set of predications instead of
another such set. Meeting this challenge takes us on to
(c) Cognitive text models. In a cognitive text model we do not start out
from ready-made predications. We start out from a body of experience and
knowledge out of which predications can be drawn. The usual way of model-
ling this predication-producing process is based on associative networks (Find-
ler 1979). The concepts themselves are usually placed in nodes of the net-
work, their relations appearing as paths between the nodes. A text strategy
then becomes definable in terms of a set of points of entry into, and paths
chosen through, the network.
It may be easy enough to model certain limited areas of knowledge, and
perhaps cognition, in terms of networks. Trying to capture the full range of
generic and episodic experience and knowledge of a normal adult is a daunt-
ing and probably hopeless task. Today different investigators discuss cognitive
models with various degrees of optimism and pessimism; one way of resolving
such debates is to view a given network in terms of a definite task: what can
the network do, and where does it fail?
Even if a cognitive model can show how predications arise and how they
are connected, it cannot explain a speaker's or writer's motives and intentions.
It cannot explain why a given speaker in a given situation has chosen to ex-
tract a particular set of predications out of storage. To do this we shall need
text models of a fourth type, namely
(d) Interactional text models. Interactional models are concerned with the
interactional behaviour patterns of the people communicating. They reckon
with a speaker/writer's intentions, for instance to distinguish between direct
and indirect speech acts or between literal meanings and irony. They are also
concerned with felicity conditions, conversational maxims, face, and the ways
in which turn-taking affects discourses and texts. Under various labels, includ-
ing 'pragmatics', these questions have been discussed in a flood of publications
over the past several years. Samples of interactional approaches, often linked
with cognitive ones, can be consulted for instance in Clark Clark 1977,
Duncan - Fiske 1977, Freedle 1977-80, Givon 1979, Goody 1978, Just -
Carpenter 1977, Osgood - Bock 1977, Rosenberg 1977, Schank - Abelson
1977, Schenkein 1978, and a host of other recent publications in psycholin-
guistics, sociolinguistics and cognitive science.
48 Ns Erik Enkvist
In a wider context we might note that what compels us to enlarge our field
of vision from sentence-based models to predication-based, cognitive, and
interactional ones is usually the question why? rather than the question what?.
Any proper grammar ought to be able to tell us what the sentences of a text
look like in grammatical terms. But if we go on to ask, why does this text
make use of this particular sentence structure?, we shall have to go from sen-
tence-based models at least to predication-based ones. If we ask, why has the
speaker/writer chosen these particular predications?, we get involved with
cognitive and interactional considerations. But once we start worrying about
causal explanations, we shall find no convenient place to stop until we have
reached the ultimate problems of human cognition and interaction.
To this expansionist challenge, different linguists have responded in differ-
ent ways. Some want to circumscribe their discipline and stay in their own
walled garden. Others follow their problems boldly into strange and alien ter-
ritories. The debate between scholars of the two temperaments, the restric-
tionists and the expansionists, goes on and is perhaps at the heart of a great
deal of the disagreement that prevails in today's linguistics. And in this dis-
agreement, text linguistics and discourse studies provide a lot of ground for
experiments and also for contention. Even the restrictionists have been com-
pelled to recognize the need for admitting certain types of discourse rules into
their grammars. Their usual method is to bring in textual features into the
description of single sentences under various devices such as performative
superstructures, presuppositions, old and new information, and the like. The
expansionists want to face cognitive and interactional problems more directly
(and, as they themselves would say, more frankly). Some of them, particular-
ly those approaching language from outside traditional linguistics, are even
prepared to subordinate linguistic concepts to cognitive and interactional
ones. In the models of Schank, for instance, syntactic parsing is an aid to
cognitive operations. This is an outlook fundamentally different from that of
the syntactician who brings in semantic and pragmatic contexts only when he
must, for instance to disambiguate structurally ambiguous sentences in con-
text.
2 After this presentation of four model types in text linguistics we can ask
ourselves what types of contrastive problems can be studied with the aid of
each type of model.
Sentence-based types are adequate for the study of those text-linguistic
problems that can be defined within the single sentence. Prominent among
these problems is the interplay between the syntactic structure and the infor-
mation structure of the clause and sentence, that is, the way in which the
syntactic structure is brought into harmony with the desired distribution of
Contrastive linguistics and text linguistics 49
old and new information in the sentence and the text. Here we have an area
where textual macrostructures govern, and interact with, syntactic micro-
structures. As the syntactic microstructures of different languages differ, and
often differ more than textual macrostructures, the syntactic exponence of
information structure offers very central topics for contrastive study.
Predication-based models must be brought in as soon as our contrastive
studies compel us to manipulate borders between clauses and sentences. In
translations for instance a translator is often compelled to change the clause
and sentence division of the source text. When this happens we should no
longer compare a sentence of the source text with a sentence of the trans-
lation. We should rather regard the translation as a re-textualization of the
text atoms in the source text. Thus in contrastive studies involving translation
we are likely to profit from predication-based models.
Cognitive models will be needed when we suspect that cognition is struc-
tured differently by speakers of the languages we wish to contrast. Among
other things, cognition involves the structural categorization of the world. If
Benjamin Lee Whorf was right in postulating that speakers of different lan-
guages may live in cognitively different worlds, we can no longer compare
cognitive units as if they meant the same. We must reckon with differences in
the meanings of lexemes, which means that the nodal structures of the
cognitive networks illustrating different languages need not necessarily be iso-
morphic. The associative links between the concepts, illustrated by the paths
that join the nodes in a network, will also be different, as are the culturally
determined schemata or frames or scripts which they reflect. To deal with
such contrasts we need cognitive models. To what extent we may wish to for-
malize them, into networks for instance, is another matter. The choice of de-
grees and levels of formalization depends on what we are trying to do.
Finally, as soon as linguistic contrasts correlate with contrasts in inter-
actional patterns, we shall need interactional models of some kind. Inter-
actional considerations must be brought in when we get involved with dia-
logue, turn-taking, speech acts, style, registers of politeness, considerations of
face, and so on. Experience, particularly in anthropological linguistics, tells us
that interactional considerations increase in importance when the languages
we are contrasting are distant in culture, which often implies distance in space
or in time. But even in closely related languages and in different styles and
sociolects within the same language, interactional patterns may vary greatly.
A good example are conventions of politeness: literal translations of texts
from one sociolect or language into another may lead to absolutely disastrous
results unless the translator modifies the text according to complex rules of
decorum. In a wider sense, then, interactional considerations include all social
traditions in language such as matters of politeness, appropriateness and ling-
50 Nils Erik Enkvist
uistic manners. They also include conformity to social, narrative and literary
traditions where such conformity is indicated. In this sense all of literary
theory and history enters into the interactional study of textual macrostruc-
tures. And the other way round: some grasp of the interactional effects of
textual macrostructures is necessary for all translators who aim, not only at
cognitive equivalence but also at equivalent effect.
3 In the past several years we have learned that the discussion of word order
must be based on an analysis of textual parameters, and perhaps situational
ones as well. But word-order patterns have also been studied from other points
of view. Today we need an approach integrating a number of theories and
models of word order which have often been dealt with one by one.
Important among these theories to list them merely in the form of brief
reminders are: first, syntactic word-order typology; secondly, information
structure and information dynamics in terms of the distribution of old and
new information; thirdly, those aspects of linearity that fall under what I have
called experiential iconicity, as well as the related concept of salience; and, in
the fourth place, the stylistic variation of word-order patterns. Each of these
problems would of course be a subject for a volume of its own. What I shall
say is in the nature of brief reminders and references to principles well known
to us all. I shall, however, use some space to discuss some conceptual, methodo-
logical and terminological problems which often play havoc with meaningful
discussion. I shall indicate such asides and digressions by using small type.
Like all descriptions of language, those of word order too must be filtered
through descriptive models. And all such models lend their own colour to the
description, the colour springing from the theoretical base with its descriptive
limitations. The models of word order that grammarians have used fall into
three major categories.
First c o m e the word-order descriptions of traditional grammarians which w e might
label as relative. Relative word-order descriptions state the relative positions o f constit-
uents: the subject goes before or after the verb, the modifier goes in front of or behind
its head, and so forth. The ordinary, unmarked order is usually defined as basic, and de-
partures from such a basic order are called inversions. Secondly, structuralist grammarians
of the behaviourist persuasion were fond of slot-and-filler models. They defined posi-
tions in terms of fields or slots and then explained what types of elements could fill the
slots. Finally, generative grammarians arrived at certain word-order patterns as a result of
their phrase-structure rules and those transformations whose primary aim was other than
the adjustment o f linear patterning of constituents. If the basic order was not the same
as the desired surface order, special movement transformations had to be added to the
generative system.
ly reflections of our own grammatical theories and models and of our own
principles of descriptive economy? Or are they based on diachronic principles
supported by historical data? Or do they reflect some kind of psychological
reality or psychological markedness, or are they founded on statistical counts
which identifies 'basic' with 'most frequent'?
A fundamental query about word-order typologies relates to variation. If a
language always arranges its elements in the same order, well and good. But in
fact languages use numerous alternative word-order patterns. To begin with,
even languages whose word-order mechanisms serve largely syntactic ends
('analytic' languages such as Modern English) allow more variation than we
often realize. Such variation occurs for instance in impromptu dialogue which
is rich in what syntacticians like to call 'deviance' (and which is nevertheless
a wholly normal phenomenon). And there is a lot of variation that might be
labelled as stylistic, whatever that ultimately means. The question is, once
there is variation in word-order patterns, how can such variation be accom-
modated into a word-order typology which by definition implies the domi-
nance of one pattern over the others?
The only road to a satisfactory answer involves a study of the factors moti-
vating the variation. We must identify the forces that cause modifications in
the basic pattern. We must thus make statements such as "language X is SVO
except when forces p, q and are strong enough to motivate the use of other
patterns such as OVS".
As such discussions are often expressed in terms of markedness, another terminological
note may be in order.
One way of speaking about 'markedness* involves a contrast between a form which
has a marker and another form that has not: b is marked and unmarked because b has
a marker, namely voice, which does not have. (Markers can be manipulated, however:
we might also say that is marked and b unmarked because has aspiration and b has
not: some principle of naturalness would be necessary to constrain such manipulation.)
A second way is to look at distribution: -s is the unmarked plural in English and -n as in
oxen one of the marked ones because -s has a wider distribution and is applied to a larger
number of substantives. A third approach is statistical frequency: the most common
forms in a corpus are unmarked, rarer forms marked. A fourth possible approach is based
on informant responses. What native speakers regard as natural, common, ordinary and
'unremarkable' is therefore unmarked, and what they regard as strange or strained is
marked.
Yet another difficulty is that markedness may be both paradigmatic and syntagmatic.
A form such as The apple John ate is marked when seen out of context (at least marked
in some of the senses given above). But in a context such as The apple John ate but the
oranges he only sniffed at, and when pronounced with proper focussing, it may be en-
tirely natural. Thus such a sentence may be paradigmatically marked but syntagmatically
(textually, contextually) unmarked (and there are parallels to this in phonology and
lexis as well).
Thus 'markedness' can mean a number of different things. In discussions we ought to
make clear which type of markedness we are talking about - or, alternatively, use differ-
ent terms for what I have here called different types of markedness.
Contrastive linguistics and text linguistics 53
4. Definitions based on syntactic form. The theme/rheme contrast can be marked mor-
phologically (by articles, special thematic markers etc.) and by word order (in some
languages one could try defining themes as sentence- or clause-initial elements).
These definitions need not necessarily be mutually exclusive. Indeed one might try the
hypothesis that the 'least marked' thematization occurs when all definitions agree: in
English a 'least marked' theme would then be a sentence-initial subject or adverbial ex-
pressing old information, intuitively understood to signal what the sentence is really
about, having maximal scope (when measurable in spite of the fuzziness of natural lan-
guages), and functioning as logical subject in its canonical predication. This is a problem
I cannot pursue here.
As with 'markedness' we should always make clear in what sense we are using theme
and rheme or topic and comment, or, alternatively, abandon these terms and speak about
logical subjects and predicates, maximally and minimally scopeful elements, psycho-
logical subjects, old and new information, initial and non-initial constituents, etc. An ex-
treme Occamist might even argue that theme and rheme are comparable to the ether of
now-outdated physics: we can do without them if we know what we are really speaking
about.
In this light, then, the basic word-order typology discussing the relative
order of subject, verb and object (and sometimes the "X" which means 'other
constituents') should be integrated with theme-rheme typology so that the
thematically motivated modifications in basic word order could be made
maximally clear.
Those who set out to clarify these matters will find a number of interesting
problems in their path. One fundamental principle which on the whole re-
stricts and constrains thematic modification of basic word-order patterns is
what I shall call the principle of valency: elements that are intimately bound
to each other by strong valency bonds should also stay close to each other in
the linear sequence. There have been many attempts at stating such movement
restrictions: Ross's constraints, cross-over principles, anaphoric islands and
other such devices have been presented in this very context. But many of
these word-order phenomena in fact also have stylistic correlates. There are,
in other words, styles in which certain word order patterns are permitted
which would be awkward in other styles.
Let me take just one example. Adverbials which are tied to their verbs by
semantic specification are usually hard to move, for instance to topicalize.
Thus the verb to put implies both object and place (you must put something
somewhere). Hence examples such as
(2) a *Put!
b *Put it! *Put your hat!
c Put it down! Put your hat under your chair!
d ??Under the chair he put it. He put it under the chair.
56 Nils Erik Enkvist
(c) and (d) are not experientially iconic because the order of predications in
the text is not identical with the order of events. Generally, whenever we con-
join predications which can be interpreted as having a temporal or causal rela-
tionship, we understand them to be experientially iconic as in (3 a) and (3 b).
'If they are not, the departure from experiential iconicism must be marked, as
in (3c) and (3d), in which after carries the onus of marking the departure.
I have elsewhere (forthcoming in Text, volume 1 number 1) discussed ex-
periential iconicity at some more length and pointed out that it can be of
three major types: temporal (as in (3)), spatial, and social (as in ladies and
gentlemen! rather than gentlemen and ladies!). What is of particular interest
here is the fact that corpus studies show the importance of experiential
iconicism in texts that have to order experience for the benefit of the receptor.
Such texts are instructions, for instance guidebooks and cookery-books. In
cookery-books we find examples in which strongly valency-bound adverbials
are fronted, as in
The motivation for such fronting, which would seem very strongly 'marked'
in the sense of 'rare' in isolation, is found in text strategy. The writer's aim is
to indicate the order of operations: by (4) she is implying 'take a champagne
glass and put into it a lump of sugar' and by (5) 'take a saucepan and put into
it sufficient water. ..'. Here, then, the text strategy proves strong enough to
override the syntactic inertia and to stretch the valency bonds between put
and its adverbials. In textual corpora such fronting of place adverbials with
put is rare. My pupil Ann Westerlund tells me that she has found some two
examples out of some four hundred potential instances in the Brown Corpus
(while adding that such counts always build on arguable principles of classifi-
cation).
The discussion of the connection between a certain genre or text type,
such as a cookery-book, and sentence structure leads us into stylistics. In
terms of what I have said so far, different styles can be said to offer different
degrees of resistance to the use of different types of devices that regulate
textual fit. In each style and substyle each thematic operation has a threshold
of a certain definite height. In spoken, informal, impromptu dialogue, there is
a low threshold for dislocations of the type John he is a nice chap and She is a
doctor, Susie is. But they do not occur in laws or statutes or legal contracts:
58 Nils Erik Enkvist
the threshold is too high to let them in. Similarly, the threshold against front-
ing place adverbials with put is very high, except in styles containing operating
instructions, such as a cookery-book style. Here the threshold ist lowered by
the need for experiential iconicity.
This is of course only a metaphorical way of stating what is common and
what is rare in different functional varieties of language. But the images of a
tug-of-war, of a dualistic struggle between different forces, of the forces
stretching valency bonds and of the syntactic inertia being overcome by re-
quirements of textual fit, or of different styles having thresholds of different
heights against different syntactic structures all have one thing in common.
They link up with a view of language as a process rather than simply as a set
of structures. In these terms, the ultimate syntactic form of a sentence, in-
cluding its thematic arrangement, is the outcome of a process which is a result-
ant of different, often opposing, forces. In different situations these forces
acquire different relative weights. One such force is the syntactic inertia of
the sentence. The inertia helps to maintain a canonical, typologically funda-
mental, and in that sense least-marked word-order pattern. But another set of
forces strives to adjust the sentence to its textual and situational surroundings
and to give it an optimal textual fit. What is optimal in textual terms depends
on the speaker/writer's intentions and motives, on the text type and on the
text strategy. These are exposed through textual characteristics that have been
studied, among other things, under the heading of 'style'.
With these few examples and many metaphors I have tried to suggest an
avenue towards resolving the tensions between canonical forms such as those
of language typologies, and the variation that we actually find in texts. In
brief: canonical forms are subjected to various textual and situational stresses
and strains, and in different situations they offer different degrees of resist-
ance. Many of the forces straining the syntax of the sentence by compelling it
into more strongly marked patterns are textual and discoursal. They cannot
be found and understood unless we operate with a textual apparatus. In some
text types and styles the canonical forms tend to prevail more strongly, and
in others they are more freely modified. To extract the maximum value out
of linguistic typologies we should study and learn to define the forces that
compel departures from typologically basic patterns. Once these departures
are reduced to rule, our typologies can be set up with more confidence.
At the same time the concept of a dualistic struggle between canonical
forms and modifying forces helps us to understand what happens in im-
promptu speech and in syntactically deviant literary and poetic texts. The
task of a text is not necessarily to present a receptor with a set of perfect syn-
tactic and lexical patterns. What a text must do is to trigger off a process of
interpretation. In principle any text which succeeds in triggering off a success-
Contrastive linguistics and text linguistics 59
4 If these views are correct, the form of a sentence and therefore also the
form of a text can be seen as results of a set of text-productive processes. In
discussing word order, for instance, one should not only describe structures
as such, but also view them as an output of a set of processes which have been
triggered off by certain definable forces. Similarly the interpretation of a text
is a result of an interpretive process.
Such arguments are becoming so prominent in today's linguistics that one
is tempted to ask: are we in fact moving from a predominantly structural lin-
guistics to a predominantly processual 2 linguistics? Like Humboldt, many
linguists today like to see language as energeia and not only as ergon, as pro-
cess and not only as structure.
I do not want to insist that we have, overnight and through a dramatic
paradigm shift, moved out of structuralism into processualism. Indeed the
history of linguistics shows us many movements which have combined struc-
ture and process. Even in the more extreme variants of behaviorist-structural
linguistics, units such as phonemes and morphemes could be defined pro-
cessually: a phoneme or a morpheme was what one arrived at if one segmented
a string according to certain closely defined processes. And in generative-trans-
formational grammar too, re-writing operations and transformations are pro-
cesses. They explain how one state (or, in linguistics, structure) changes into
another state (or structure). And such a change is precisely what we mean by
a process.
60 Nils Erik Enkvist
The other kind of reality, beside the psychological, to which we can appeal in
the quest for real processes beyond natural language is the world of the com-
puter. If we base process models of language on psychology we are pattern-
ing them on what we know about human intelligence; if we base them on
computer science we are writing grammars patterned on artificial intelligence.
In saying this I am of course begging the question, what precisely is the rela-
tion between human intelligence and computer processes? What can man do
that computers cannot? And what can computers do that man cannot com-
fortably imitate? These are questions easy to put and hard to answer.
A question more relevant to our present context is, what can process lin-
guistics and processual thinking contribute to contrastive linguistics?
Structural linguistics uses structural concepts which vary with different
models. Typical examples of structural units are distinctive features, pho-
Contrastive linguistics and text linguistics 61
evaluated in terms of a new strategy, which in turn triggers off a new plan,
leading to a new process.
A process, as I said above, is the change of one state (in linguistics, struc-
ture) into another. To identify the trigger of a process we need a strategy. Both
strategies and processes form hierarchies: communicative macrostrategies lead
on to microstrategies, processes lead on to subprocesses. In describing strate-
gies and processes we must decide what is the optimal delicacy level for our
descriptions: we can either trace them in major outline, or we can try to work
them out in delicate detail through a large number of intermediate stages. In
practice, our descriptions of strategies and processes can be oriented either
towards message production or towards message interpretation. But we should
not forget that production strategies too are usually receptor-oriented. People
speak and write to be understood, and text strategies are always founded on
estimates of what the receptor already knows or does not know, and how
much information the receptor can process (information density). The Gricean
postulates, theme-rheme studies, discussions of implication and presupposition
and entailment and the like are all concerned with the sender's estimates of
the knowledge and of the message-processing capacity of the receptor. Such
estimates are supremely important in determining the content and shape of a
message.
A concrete example may be in order to show what I mean by weighting of
strategic parameters. Let us assume that we have a choice between three ex-
pressions: it's getting light, the blank day breaks on the bald street, and on
the bald street breaks the blank day. Which should we prefer? If our goal is to
produce ordinary informal conversation over morning coffee, it's getting light
would presumably be the most appropriate alternative. The blank day breaks.. .
would reveal the speaker's poetic vein but maintain the canonical SVO order.
But when composing/ Memoriam Tennyson preferred the third alternative:
strength suffices to overcome the syntactic inertia which tries to maintain the
canonical and unmarked SVO pattern. We might add in passing that one cate-
gory of people who have used an approach evaluating text-strategic parameters
are the literary critics. When criticising a literary text a critic tacitly compares
that text with other alternative texts which he, the critic, has either experi-
enced or conjured forth out of his own imagination. It is such a comparison
that provides the foundation of critical judgments.
I have here dressed up my discussion of Tennyson in terms of production
strategies, but of course analyses of strategies and processes can be focussed
on reception as well. One of the areas of linguistics where one can profitably
look for concrete examples of processual views of text reception is parsing, a
process carried out by the receptor according to conventions and strategies
anticipated by the sender. To add another small morsel of flesh to my pro-
grammatic skeleton I shall give an example of a processual approach to one
detail in the contrastive study of English, Swedish and Finnish.
In Swedish there is an optional element which has sometimes been called
'resumptive s' which can be inserted between an adverbial clause and a fol-
lowing main clause and roughly rendered by English then, as in
(7) Om Kalle kommer (s) gr jag till farfar.
'If Charlie comes, (then) go I to grandfather.'
In Finnish the corresponding element is optional resumptive niin, as in
(8) Jos Kalle tulee (niin) min menen kotiin.
'If Charlie comes, (then) I go home.'
Resumptive sd and niin are both more frequent in spoken and in informal
written styles than in formal writing (cf. Platzack 1973: 58ff.).
A structural description would presumably simply note that sd and niin
can occur in the juncture from adverbial to main clause, particularly in spoken
and informal written styles. In the light of parsing strategies and processes,
however, their function appears in a clearer light. One of the main problems
of parsing is, of course, to know at what point a phrase or clause ends and
another phrase or clause begins. This is a problem students of parsing have
discussed in terms of a Principle of Early Closure (according to which the
parser should close a phrase unless the following elements can be construed
as parts of that phrase) or a Principle of Late Closure (whenever possible the
parser should attach an element to the phrase or clause currently being con-
strued) (e.g. Frazier 1979: 33 and 43). A speaker or writer who wants to
maximize the clarity and ease of parsing of his message would obviously do
well in marking closure points such as junctures between clauses. In English
this strategy can be carried out largely through processes involving intonation
64 Nils Erik Enkvist
and perhaps rhythm and pause patterning. In Swedish and in Finnish the
strategy can be realized through the process of sa or niin insertion. These ele-
ments are parsing cues triggering off a certain type of segmentation and struc-
turing process. In English too we can sometimes use then in a similar function;
in Old English pa had a comparable function (particularly in pa . . . pa ...).
The marking of such closure points is of course especially useful in speech,
where the receptor cannot parse and re-parse the message at leisure. This ex-
plains why s and niin are particularly frequent in spoken discourse and in-
formal writing. That they seem more common than English then in this func-
tion may be connected with the fact that English generally makes more use of
intonation than Finnish or Finland-Swedish (I am not sufficiently familiar
with the Swedish of Sweden to venture an opinion at this point).
Another way of illustrating the functions of resumptive s and niin is to
show how they avoid garden-path phenomena by marking junctures between
clauses in instances such as
To sum up: while marking closure of the adverbial clause, s and niin simul-
taneously also mark the continuation of the sentence through the beginning
of the main clause. Their use can be explicated with the aid of a predication-
based text model.
Thus the full structural and contrastive significance of resumptive s and
niin best appears when we see them as cues to parsing and as processing sig-
nals. In this light they no longer seem redundant and in that sense marginally
functional. On the contrary they are part of the receptor-oriented strategies
of the person producing the text.
It should now be clear what use observations of these kinds can be in con-
trastive linguistics. They enable us to compare parsing strategies and to teach
our pupils to observe, and to use, parsing cues in a more systematic way. For
Swedes and Finns who learn English the closure markers of their native lan-
Contrastive linguistics and text linguistics 65
guages signal places where they should learn to use tone-unit boundaries and
an intonational rise in English. And conversely those learning Swedish or Fin-
nish will be helped in their parsing by the junction marking with s or niin.
Such junction markers at the same time increase the redundancy of the mes-
sage and thus make comprehension easier. Further, once we start drawing
attention to parsing cues such as sa and niin we get involved with all the other
devices that Swedish and Finnish use to make clear where are the junctures
between phrases and clauses. We are learning to pose new questions which
unroll the entire field of parsing strategies and processes. And in so doing we
are conforming to the basic principle of process linguistics: we are using
structures and structural cues to illustrate a process, namely parsing. To trans-
lators too, processual thinking is useful. For instance those who wish to trans-
late Tennyson's In Memoriam will have to analyze the original text strategy
and decide how it should be adapted, and how its parameters ought to be
weighted, and perhaps re-weighted, to produce an optimal rendering in the
target language.
It is, then, text linguistics, together with psycholinguistics and cognitive
science, that has helped to bring process linguistics to the fore. Of course the
basic concepts of strategies and processes are also immediately applicable to
another area of applied linguistics, namely language learning. But that is an-
other subject.
Notes
1. To some people the terms 'text linguistics* and 'discourse linguistics' are synonymous.
To others, text linguistics is the linguistic study of texts as such, whereas discourse lin-
guistics also reckons with the situational context: it is the situational envelope that
turns a text into discourse.
2. The term 'procedural' is also in use. I prefer 'processual' or 'process' linguistics because
so many other currents in linguistics have emphasized procedures and been in that
sense 'procedural'. For example the behaviorist structuralists emphasized discovery
procedures and adopted procedural definitions of linguistic units: a certain unit was
what one got if one segmented a string according to a certain procedure.
References
Beaugrande, Robert, de
1980 Text, discourse and process. Toward a multidisciplinary science of texts
(Norwood, New Jersey: Ablex).
Chesterman, Andrew
1980 "Contrastive generative grammar and the psycholinguistic fallacy", Papers
and studies in contrastive linguistics 11:17-24.
66 Nils Erik Enkvist
Kllgren, Gunnel
1979 Innehlt i text (Lund: Studentlitteratur).
Kohonen, Viljo
1978 On the development of English word order in religious prose around 1000
and 1200 A.D. (Abo: Publications of the Research Institute of the Abo
Akademi Foundation, 38).
Li, Charles N., ed.
1975 Word order and word order change (Austin: University of Texas Press).
1976 Subject and topic (New York: Academic Press).
Longacre, Robert E.
1980 "Discourse typology in relation to language typology", Tentative publication
draft in Preprints, Nobel Symposium on Text Processing, Stockholm.
Ohmann, Richard
1964 "Generative grammars and the concept of literary style", Word 2 0 : 4 2 3 - 4 3 9 .
Reprinted in various readers on stylistics.
Osgood, Charles E. - J. Kathryn Bock
1977 "Salience and sentencing", Sentence production, edited by S. Rosenberg
(Hillsdale, N.J.: L. Erlbaum), 8 0 - 1 4 0 .
Platzack, Christer
1974 Sprket och lsbarheten (Lund: Gleerup).
Rosenberg, Sheldon, ed.
1977 Sentence production (Hillsdale, New Jersey: Lawrence Erlbaum).
Schank, Roger - Robert Abelson
1977 Scripts plans goals and understanding (Hillsdale, N.J.: Lawrence Erlbaum).
Schenkein, Jim, ed.
1978 Studies in the organization of conversational interaction (New York: Aca-
demic Press).
Sigurd, Bengt
1977 "Att presentera Buffalo Bill", Tre experiment med text, edited by B. Sigurd
and G. Kllgren (Stockholm: Akademilitteratur), 2 9 - 7 2 .
CLAUS F.ERCH AND GABRIELE KASPER
1 Introduction
2 Gambits defined
4 A typology of gambits
Receipt, a signal by which its user indicates that he has taken in what his
interlocutor is or has been saying. It can either have the function of a neutral
attention signal (mm, uhu), or it can express understanding of (I see what you
mean) or even agreement to the interlocutor's speech act ((you're) right,
72 Claus Faerch and Gabriele Kasper
I agree, that's true). The first two functions are sometimes realized by a repre-
sent, i.e. an utterance in which the speaker repeats to himself a part or the
whole of what has just been communicated to him (SI: "these cuts in the
university budget are a catastrophe"/S2: "catastrophe ... yes . . . yes I sup-
pose that's an adequate way of putting it").
Go-on: A special case of a neutral receipt whereby the present hearer signals
to the present speaker that he is listening and that he wants the present speaker
to keep the turn (oh I see, tyes, tmm).
Exclaim : a special case of a receipt which is marked for an expressive func-
tion, i.e. it indicates its user's reaction to his interlocutor's utterance or a part
thereof. Although the discourse functions of an exclaim are identical to those
of a neutral receipt (see below), we shall treat them as two categories because
in both German and Danish they are realized by different classes of tokens,
i.e. either by a receipt carrying an emphatic marker (\oh, \no), or by elements
which function regularly as "interjections" (good grief, wow).
Starter: as a preliminary to a following utterance, it indicates that its user is
going to say something (well, oh, erm).
Underscorer: serves to focus the interlocutor's attention to a point being made
by the present speaker (the point/thing is).
Aside: functions to inform the interlocutor about what the present speaker is
doing while he is not addressing himself to the present hearer (SI: "can I see
you about those references tomorrow"/S2: "tomorrow ... let me see . . .
I've got appointments all morning . . . how about 2 o'clock").
Appealer : functions to elicit a response from the hearer to what the speaker
has just said (tag questions, uh, okay).
Marker: signals boundaries in discourse in either indicating the completion of
an ongoing discourse topic or phase (okay, good) or the opening of a next dis-
course topic or phase (well now, by the way).
We shall also include into the analysis a number of linguistic items whose
primary function is not that of a gambit as defined above, but which serve as
gambits in a given context, i.e. their 'derived' function is that of a gambit.
Thus hesitators which occur at a possible completion point and have thereby
a floor-holding function will be classified as gambits, whereas hesitation
phenomena which occur at non-completion points within a turn will not be
included. Similarly, cajolers (you know, you see, I mean) will only be classi-
fied as gambits if they are used as turn-keepers; if they function on an inter-
personal level exclusively, they will not be included into the present analysis.
Conjunctions obviously have the primary function of creating cohesion in
texts by expressing how what follows the conjunction relates to what pre-
cedes it. However, by virtue of the fact that sentence conjunctions are nor-
Gambits in German and Danish: a discourse CA 73
go-on
Uptaking receipt SH
exclaim
starter
receipt
Turntaking
exclaim SH
(conjunction)
underscorer
aside
Turnkeeping
(cajoler) S
(hesitator)
(conjunction)
Turngiving appealer SH
We should like to mention three problem areas which the analyst is faced with
in setting up a gambit typology and applying it to data:
(1) how many different gambit classes should be established
(2) if and how the individual gambit classes should be subclassified
(3) how to distinguish a gambit from a speech act.
As for (1), our analysis of 'interruptors' may serve as an illustration. It
would be possible to draw a distinction between the discourse function of
'interrupting' and the discourse function of 'turntaking', and consequently to
establish one class of 'interrupting gambits' and one class of turntaking gam-
bits. The reason why we have not done so are (1) that the two functions seem
to be realized by the same types of gambit; (2) that the difference between
the two functions is not so much a difference in the function itself as in the
context in which the function is used: interruptors can be characterized as
turntaking gambits placed 'inappropriately' at non-completion points in the
interlocutor's turn, their turntaking function becomes particularly clear if the
interlocutor yields the floor and the 'interruptor' is followed by a move. We
would therefore say that turntaking gambits have the additional function of
interrupting if placed at a non-completion point; and that turntaking gambits
'misfire' if the speaker does not succeed in winning a turn.
Problem area (2), the subclassification of gambit classes (cf. the general
discussion of "diminishing returns" in Lyofis 1968:152f), will be illustrated
by the way in which we have classified the gambit classes receipt and exclaim
(cf. section 4). If one chooses as a subclassifying criterion the aspect of the
present speaker's utterance which the present hearer focusses on, one can
distinguish between at least four aspects, which can be paraphrased as
"I'm listening" (mm, uhu)
"I understand what you're saying" (/ see)
"I agree to what you're saying" (that's right/true, sure, I agree)
"I react with anger, surprise, joy, . . . to what you're saying" (really, oh no,
how nice).
The first aspect refers to the communicative channel being open, the second
to the present speaker's message being taken in by the present hearer, and the
third and the fourth to the hearer's attitude and reaction, and to the interloc-
utor's utterance, respectively. As can be seen from the typology of gambits
which we have adopted for the present analysis, we have decided to treat the
last-mentioned subtype as a class of gambits in its own right ('exclaims'),
whereas the first three subtypes are all subsumed under the class of 'receipts'.
From a functional point of view there are no compelling reasons for classifying
in this particular way, our reasons for doing so are simply as stated above
Gambits in German and Danish: a discourse CA 75
that in the languages we have been concerned with, it seems to be the case that
the tokens used to express reactions to the interlocutor's utterance differ sys-
tematically from the tokens used to realize the other three aspects of 'receipts'.
The third aspect of receipts mentioned above ("I agree to what you're say-
ing") provides a link to problem area (3): On what theoretical and empirical
grounds can gambits be distinguished from speech acts, i.e., acts which carry
illocutionary force? For example, if acts which express agreement can be
classified as gambits, doesn't this hold true for acts which express disagree-
ment as well?
The answer to this can be found in our definition of gambits are constitut-
ing a subclass of acts which do not develop the ongoing discourse. They are
thereby functionally distinct from speech acts which, by virtue of their carry-
ing illocutionary force, do contribute to the progression of the discourse, and
can thus function as moves. Expressing unelicited agreement to the inter-
locutor's utterance clearly does not develop the discourse, as the present
speaker can simply ignore it without violating conversational rules. The ex-
pression of disagreement, on the other hand, is likely to elicit a reaction from
the interlocutor and can potentially steer the discourse in a new direction. In
the empirical analysis, the distinction between, e.g., a receipt and the speech
act agree is often not clear-cut, in particular if the same token can be used for
the realization of both functions (e.g. yes/yah/yeah). One criterion for inter-
preting such a token one or the other way is whether or not it has been elic-
ited: if it has, it is likely to function as a speech act, if it has not, it probably
functions as a gambit.
6 Analysis
As data for our analysis of gambits in German and Danish we have used role
plays, recorded at Ruhr-Universitt Bochum within the "Projekt Kommunika-
tive Kompetenz als realisierbares Lernziel" (cf. Edmondson House Kasper
- Stemmer 1982) and at the University of Copenhagen in connection with a
seminar on discourse analysis.3 The German subjects were university students
at the Ruhr-Universitt Bochum, coming from the Ruhr district. The Danish
subjects were university students at the University of Copenhagen, coming
from various areas in Denmark.
The analysis is based on the German and the Danish data in the sense that
the phenomena described are those contained in the data. However, as is well-
known to everybody working with linguistic corpora, these can rarely provide
all the information needed in order to produce a linguistic description. We
have relied on our native speaker competence in German and Danish, respec-
76 Claus Faerch and Gabriele Kasper
tively, whenever this was necessary in order to fill in a 'gap' in the data or in
order to establish more precise rules for the occurrence of specific gambit
tokens. The data quoted in the following is authentic unless expressly marked
as 'fabricated data'.
We shall postpone our tentative formulation of the rules governing the distri-
bution of the receipts ja and nein to section 6.2.1, as our data provide more
instances of these receipts in a turntaking than in an uptaking function.
The lexicalized uptaking receipts are typically realized with a falling in-
tonation, whereas the non-lexicalized receiving noise hmm is most often used
with a neutral intonation contour. Uptaking receipts with a rise always func-
tion as go-ons, as the rise in combination with the present hearer's non-claim
of the turn produces an inherently eliciting effect, encouraging the present
speaker to continue his speech activity.
Uptakers can combine in various ways:
the same token is repeated, which sometimes conveys exasperation, com-
pare the repeated go-on in (6):
(6) x: h darf ich dann mal um Ihren "Namen bifiten oder be-
ziehungsweise um Ihren er so' 'nalaus fweis
y: \ja \klar \hier mein Name is Wilfried "Steinfberg steht
ja auf meiner Ent"leihkarfte liegt da bei [Ihnen "tvor]
[x: /ja /ja] und die A"dresse steht tdrauf
different tokens belonging to the same gambit type combine; compare the
two receipts in (7):
(7) x: ja ich meine wenn du nun mal - so'n - guten Sound
"haben willst tn und - ah da is ne Platte - vonm ziem-
lich durchschnittlichen Schallplattenspieler "abgespielt
tworden dann "kriste den eben nich mehr so \hin das is
[y: Vja ja "siVcher] das hngt immer mit der "Nadel
zu\sammen
different classes of uptakers combine; compare the sequence receipt - go-
on in (8):
(8) x: tjah tweite das is'n bichen "schwie\rig - ja ich meine
h ich fahr die immer "na \ab tn und die mssen dann
auch na "abgefahren \werden und [y: hah \so /ja] das
is nich so - ganz "einfach
As is the case with the German uptakers ja and nein, the Danish up takers ja,
jo, nej and n are best discussed in connection with a description of their turn-
taking function.
Some of the tokens which can be used as receipts can also have the func-
tion of a go-on, and as in German, the difference between the two types of
gambits is expressed by their intonation: receipts are marked by a level or
falling intonation, go-ons by a rising intonation.
The same types of combinations of uptakers as in German are found in
Danish: combinations of the same tokens, combinations of different tokens
belonging to the same gambit type, and combinations of tokens belonging to
different gambit types (cf. (12)(14.
While Danish can have og hvad /s [and what then] as a go-on, the conjunc-
tion og cannot on its own be used in this function. A tentative explanation
for this might be that in contradistinction to German und, whose phono-
logical structure (vowel+nasal+dental) provides the necessary basis for carry-
ing rising intonation, Danish og [o] lacks such phonological conditions, being
composed of a single vowel. The supplementation of og by elements such as
hvad s is thus obligatory in Danish, whereas the supplementation of German
und by e.g. (was) dann is optional.
(16) x: oh \je wer ist das denn heute \abend (opens door)
y: ja guten Abend Frau Bergmann
(17) x: es ist ja nicht /"so da du die [Schallplatten] jetzt die-
sem Typ da \gibst den du berhaupt nicht "kennst
oder \so
y: Yja das "stimmt \schon \aber...
(18) x: . . . nee \du ich "/glaub dat "klappt nich mehr mit mei-
nem '\Bruder [y: ooh] dat wird 'nich mehr "lau\fen
y: Vnee da mten wir ja noch nach "Essen \fahren
(19) x: ich schaff das so meistens zwischen vier und fnf "Stun-
den
y: ah \so - wie raffst du das denn mit Lingu "fistik also
mir fllt das unheimlich "\schwer
(20) x: Gott ja das kommt eben "\vor - wiege'sagt "Calf dum
y: kommt vor kommt vor - und wenn Patienten sich das
nicht "durchlesen...
(21) x: ich mein ich hob da volles Verstndnis fr deine Situa-
tion ich wr da wahrscheinlich - war da wahrscheinlich
"auch ganz froh wenn ich 'n Referat am andern Morgen
zu "schreiben htte wenn ich dann meine "Ruhe htte
also
y: \Mensch wie "soll ich das nur scha\ffen
In both their uptaking (cf. 6.1.1 above) and their turn taking functions, the
two receipts ja und nein (or nee), cf. (17), (18), seem to be distributed in the
following way in German. After a turn which contains no negation, only ja
can be used as a receipt; if nein is used, it is not a receipt but an exclaim (i. e.
it is emotionally marked). After a turn which contains a negation both ja and
nein can be used, as illustrated by (17) and (18). Even in the latter case, there
may be a tendency to use ja rather than nein, but this cannot be confirmed
on the basis of our corpus, as the occurrence of these items is too limited to
allow for quantitative analyses.
Various combinations of these tokens are possible, both within the same
class (ja also) and across the classes (Gott ja). One frequently occurring com-
bination is receipt + starter, cf. the following example:
(22) x: . . . wenn das bei Ihnen nicht "\klappt dann knnen Sie
mich ja mal "an\rufen dann wrd ich die "fra\gen
y: /ja - Vja ich mte mal "\schauen ...
Gambits in German and Danish: a discourse CA 81
The tokens n, ja, jo, nej are all used as receipts, both with an uptaking and
with a turn taking function. Of these, the occurrence of ja, jo, and nej is re-
stricted by the proposition^ content of both the preceding turn and - in the
case of the receipts being used with a turntaking function - the move they
initiate, whereas n is neutral and signals "I've heard what you say". This ac-
counts for the fact that n, when used in a situation in which the interlocutor
expects agreement, can be felt as less polite than ja, jo, nej (cf. (29)). 7
As can be seen from this example, n can combine with the receipts ja, jo, nej,
in which way a following disagreement is more strongly downtoned. Such
combinations of n with another receipt are very frequent in the data.
That AW is a more neutral receipt than ja, jo, nej does not mean that there
is more or less a free choice between n and ja, jo, nej: there are situations in
which n, and not ja, jo, nej can be used, and vice versa. Before we can de-
scribe these restrictions, however, we have to introduce the distinction be-
tween l X-propositions', -propositions', and 'X-Y-propositions': 'X-propo-
sitions' are propositions which X (i.e. the speaker of the first turn) only can
have knowledge about, -propositions' are propositions which Y (i.e. the
addressee of X and the speaker of the second turn) only can have knowledge
about, and 'X-Y-propositions' are propositions which both X and Y can have
knowledge about (cf. the "speaker knows best principle", Foreman 1974).
It is possible to formulate the rule for Danish receipts that ja, jo and nej
can only be used if either the preceding turn or the subsequent move contains
a Y or an X-Y-proposition: if both turns contain X-propositions, only n can
be used (cf. (31)):
Gambits in German and Danish: a discourse CA 83
turn y turn
positive proposition ja
positive proposition . ,. .
+ negative appealer
negative proposition
+ positive appealer
Finally a few words about the conjunction men 'but'. As can be seen from
example (28), it can function as a completely neutral 'starter' without any ad-
versative semantic meaning. Examples of this in the data are rare, however
what one finds more often is men used with some of its original semantic
meaning preserved, and used in combination with gambits proper: men alts,
ja men, na men. Particularly common is the combination receipt + men,
which often has the pragmatic function of downtoning disagreement, contrast,
opposition or the like: "I don't think I can do it!" - "no but don't you think
you could at least try?". Of these combinations, ja + men is frequently used
as a single gambit {jamen/jam), which we have classified as a starter proper
(cf. (24) above).
ia - ja ja (agree)
. } (disagree)
nej nein nej nein J v '
(2) Unlike ja, the token ah/ach so is not ambiguous between 'knowing' and
'understanding', but expresses the latter exclusively: it is used as a receipt
following X-propositions only and thus clearly corresponds to n in this func-
tion, cf. (39).
(39) x: aber ich kenn mich hier ' bichen besser "/aus da - ich
hier "wohfne [y: ah \so] und h auch jetzt wohn ich
hier bei meinen "Eltern und fahr dann tglich "hier\hin
The distribution of Danish ja and n vs German ja and ah/ach so can be
schematized as in table 4.
Danish n n n
ja ja
German ja ja ja
ah/ach so
function is to signal to the hearer that the speaker intends to continue past a
possible completion point. Turnkeepers typically occur turn-internally;how-
ever, they are occasionally also used turn-initially, following a turntaker (cf.
(44) below). The relevant gambit classes are: underscores, asides, cajolers and
hesitators, the latter two classes having a derived turnkeeping function (see
section 4 above).
(42) x: (filling in a French form for y): und du hast immer sehr
ordentliche "Leistungen getzeigt und ich glaub das das
kannst du "auch \schaffen \ja so dann wollen we mal
das heutige "Datum auch ein/tragen signature "/date
heute ham we den "/Via den wievielten "ham we denn
\heute
y: zweiter Au'%gust
(43) x: ja Gott der "E\ggert der war heute schlechter als "\sonst
aber ich "meine - der Lippens is "\gut der Lippens "is
\gut und wenn man den nur "foulen kann dann "foult
man den \eben ich meine der Eggert hat getan was er
"konn\te
(44) x: . . . das ist dann "sch\ner als wenn man vom Anfang
gleich so iso "liert und da rum\geht ich
y: \ja also ich mein - ich hab da'n bichen "mehrErfah-
rung . ..
88 Claus Faerch and Gabriele Kasper
(45) x: ich meine ich hab auch nicht allzu "groe Ahnung \da-
von h aber es mu eben daran \liegen - am besten wir
rufen sofort "\an
Sequences of turnkeeping gambits occur frequently in the German data, com-
prising both elements of the same (cf. (46)) and of different classes (cf. (47)).
(46) x: ["stt dann einer Mran und so] dann wird'n "Bierglas
bern Apparat gekippt
y: ja "\komm ich \mein es gibt ja die Appa " Trte wo man
da 'n "Deckel drber \macht tn
(47) x: ja "tnun aber ich /mein ku\mmal Vhm ich mein wir
"wohnen ja jetzt hier wohl doch falle zwar mehr oder
weniger zu"\sammen aber "immer\hin tn ich mein du
kennst mich ja doch jetzt eigentlich schon etwas \lnger
tn
Moreover, receipts and starters are sometimes used turn-internally, thereby as-
suming a 'secondary' turnkeeping function (cf. (48)). A special case of a
turn-internal receipt is the 'self-receipt' as illustrated in (49), whereby the
present speaker 'takes up' her own preceding speech activity. Thus, discourse
functions which are normally realized in a dialogue structure (interlocutor
taking up the speech activity of interlocutor A) are transformed into a mono-
logue, by means of which the present speaker makes sure that he keeps his turn.
(48) x: na ja is \klar - na - ich mcht ja auch nich eure "Fite
deswegen - aufn Hund kommen \lassen aber - nun na
Yja also hm - ganz alte knnt ich dir "mittgeben aber
dann mt ich wirklich auch selbst "mit\kommen tn
(49) x: . . . und "langweilt man sich zu \Tode dann geht man
schon gar nicht mehr in die Vorlesung "\hin
y: ja ja "\eben das ich "mei\ne das is mir schon zweimal
pa"\ssiert aber bei dem wollte ichs "nicht grad \machen
das isn bichen "eigenartig mmh Vja - he ihr trinkt hier
"\Bier kannste mir "auch 'n Bier tgeben
what', ved du hvad *you know what', h<j>r lige 'just listen'
(cf. (50)).
asides: cf. (51)
cajolers: du ved you know', jeg mener mean' (cf. (52), (53))
hesitators: cf. (54)
(50) x: det var nok mulig men hva' er - - jeg syns der er
sdan lidt - kedeligt her syns du ikke [y: jo] til festen...
[: that was possible I suppose but what er I find it a
little bit boring here don't you think [y: yes] at the
party . . . ]
(51) x: n og s er det fru Hansen - lad mig lige se [looks into
her file] - ja De var her sidst. ..
[x: well so now it's your turn Mrs. H. let me just see [ . . . ]
ok you were here l a s t . . . ] (fabricated data)
(52) x: . . . jeg har fet ny ven s du ved jeg vil jo jeg vil jo
osse gerne med ham ik'
[x: . . . I've got a new friend so - you know I also want to
go out with him of course]
(53) x: hvad tror du for (frvrigt mine chancer er for at eer det
overhovedet gr igennem alts jeg mener det er jo for
sent ik'
[x: by the way how good a chance do I stand do you think
that er it will work at all well I mean it is too late isn't
it-]
(54) x: mm - har du snakket med nogen andre alts erm - og
hva'de havde syns om det
[x: mm - did you talk to some of the others - erm to
hear how they like it ]
The Danish data happen not to contain sequences of either different tokens
belonging to the same gambit class or tokens belonging to different gambit
classes (cf. the description of sequences in the German data above, section
6.3.1). This is no doubt completely accidental, and the following (fabricated)
example would be a perfectly possible turn in Danish:
(55) x: . . . men erm alts sdan en bog der alts ved du hva' -
det vil jeg sige dig - den kan du alts ikke aflevere
[x: . . . but erm such a book - well you know what - I can
assure you you can't possibly return it] (fabricated
data)
90 Claus Faerch and Gabriele Kasper
(55) can also be used to illustrate the occurrence of a starter (altsa) turn-inter-
nally, with a secondary turnkeeping function (cf. also (53), (54)). Examples
of 'self-receipts' (cf. section 6.3.1) are frequent in the Danish data:
Appealers are primarily used with declarative sentences, as these are not inher-
ently turn-giving: the appealer adds an eliciting function to an utterance which
does not in itself invite the interlocutor to react. With interrogative and
imperative sentence types, the need for using appealers is obviously very
limited as the utterance of these sentence types themselves has a direct elicit-
ing function if an appealer is added in such cases (which, incidentally, is
found in neither the German nor the Danish data), the function is to reinforce
the inherently eliciting element in the utterance ("kommst du mit - /ja").
Gambits in German and Danish: a discourse CA 91
the rules for the distribution of ne/m/nich, ja and oder can roughly be stated
as follows:
ne/m/nich can be used in all three appealing functions, following both a posi-
tive or a negative proposition. Thus
- appeal for information:
"du hast den Film schon gesehen tne"
"du hast den Film noch nicht gesehen tne'''
- appeal for agreement:
"das ist 'ne ziemlich langweilige Vorlesung hier tne'''
"das ist keine besonders spannende Vorlesung hier tne"
- appeal for acknowledgement:
"derSearle verwechselt eben Sprechakte und illokutive Verben \ne"
"der Searle unterscheidet eben nicht konsequent zwischen Sprechakten
und illokutiven Verben \ne"
In contradistinction to the 'universal' applicability of the negative appeal-
ers, the use of ja seems to be restricted to
- appeals for information, following a positive proposition:
"du hast den Film schon gesehen tja"
- appeals for acknowledgement, following a positive proposition:
"der Searle verwechselt eben Sprechakte und illokutive Verben \ja"
Whereas in these cases the low acceptability of the sequence negative pro-
position + positive appealer might be due to grammtical co-occurrence restric-
tions holding in German, the restriction on the realization of an appeal for
agreement by the token ja is pragmatically determined: the use of ja as an
appealer carries a strong bias on the part of the speaker that the state of af-
fairs he refers to in the proposition is true, and that his interlocutor's response
wl be in accordance with this presupposition. If one appeals for agreement,
however, such a bias is interactionally weird, as the speaker then expresses
himself what he in fact had intended to elicit from his interlocutor. It seems
to be precisely this bias or presupposition in the use of ja which makes our
subjects prefer ne over ja where both tokens are possible, as ne is neutral in
terms of the speaker's presuppositions.
The token oder can be used interchangeably with ne in both positive and
negative appeals for information and agreement. It cannot, however, be used
as an appeal for acknowledgement, as it is open for a potential hearer's re-
sponse which expresses disagreement.
From what has been said above, one can tentatively suggest that there is a
cline in terms of the speaker's presupposition holding for the proposition of
his utterance, and the expected hearer response: oder has the lowest speaker
bias and leaves most room for the hearer's disagreement, ne/m/nich is neutral
Gambits in German and Danish: a discourse CA 93
in both these respects, and ja implies that the hearer shares the speaker's pre-
suppositions and reacts accordingly.
Finally, the restrictions holding for the use of okay should briefly be
stated: okay presupposes that the speaker and hearer have reached a common
decision, opinion or agreement on a state of affairs; cf. (61).
(61) x: ja pa "Xauf ich geb dir dann die "drei vonn Beatles
und "eine vonn Stones \mit o/kay
y: fgut herzlichen \Dank
The appealers ikke/vel clearly belong to one and the same group, the only dif-
ference between them being that ikke follows a positive proposition, vel a
negative proposition (as can be seen from examples (62), (64)).
The distribution of ikke/vel and of hva' is only partially identical, there be-
ing situations in which either of the two subcategories is unacceptable.
With appeals for information (cf. the discussion in section 6.4.1 above),
ikke/vel only seem to be possible (cf. (62)):
94 Claus Faerch and Gabriele Kaspei
That hva' is not possible in these examples should not be taken to mean that
it cannot co-occur with the type of sentence exemplified above but that the
reading of the preceding sentence is different if hva', and not ikke/vel is
added. With ikke/vel added, the utterance is used in a situation in which the
speaker is not sure whether Peter is coming and where he presupposes that the
hearer will know hence the appeal is an appeal for information. With hva'
added, the utterance implies that the speaker has reasons to believe that Peter
is coming and that he wants the hearer to agree to this. Hence hva' is used as
an appeal for agreement, something which is clearly brought out by (64): the
proposition contained in X's turn is a Y-proposition (cf. section 6.2.2 above),
the speaker assumes that Y is hungry and requests Y to confirm this assump-
tion. In such cases - in which the speaker could not have direct knowledge
about the state of affairs which his utterance refers to - only hva' can be
used as an appealer. If the appeal for agreement relates to a propositional con-
tent which could logically be a X-Y-proposition, both hva' and ikke/vel are
possible (with the difference that hva' more clearly than ikke/vel expresses
that the proposition is tentative, based on inferring from something else).
The final type of appeal appeal for acknowledgement is inevitably ex-
pressed by ikke/vel in Danish, as shown in example (65). In this case the speaker
states the proposition as something he knows/believes, and simply appeals for
the interlocutor to communicate that he has taken in this information. One
could perhaps very tentatively say that ikke/vel are primarily used in connec-
tion with factual information, either information which the speaker requests
the hearer to provide (appeal for information) or information which the
speaker already possesses and which he wants the hearer to agree to or to
acknowledge the existence of (appeal for agreement and appeal for acknow-
ledgement, respectively), whereas hva' is predominantly used in connection
with non-factual information, in particular in situations in which the speaker
can only make assumptions about the state of affairs and appeals to the inter-
locutor for confirmation (appeal for agreement). We can summarize this as
presented in table 5.
Gambits in German and Danish: a discourse CA 95
speaker pre- + +
+ - + - + -
supposition
("+, , " refer to the strength of the speaker presupposition as discussed in 6.4.1)
positive
ikke ikke hva' ikke
proposition
negative
vel vel hva' vel
proposition
speaker pre-
supposition + - + + - + -
("+, , " refer to the strength of the speaker presupposition as discussed in 6.4.1)
6.5 Markers
Markers have the discourse function of signalling boundaries in discourse be-
tween discourse units. They may occur both turn-initially, turn-medially, and
turn-finally, as can be seen from examples (65), (67), and (72).
Whereas many of the gambit types discussed so far functioned as coherence
establishing devices (cf. table 1 above), markers serve to create cohesion be-
tween various phases of a discourse (cf. our distinction between coherence
and cohesion in section 3 above). In referring back to a preceding discourse
phase and marking its termination, they provide an anaphoric link, and in
marking the opening of a new discourse phase, they function as cataphoric
linking devices.
(65) : na hrt sich ja alles ganz schn und gut "\an ich mein -
"\gut - wenn ich Ihnen also so weit "glauben Marfwas
machense denn film "Vorschlag wo kann man denn
"hin\gehn
(67) x: ja ich will diesmal noch drber hin "weg\sehn wenn das
in "Ordnunggebracht \wird tja
y: das das "Bade\zimmer
x: ja das "Bade\zimmer
y: ja das Badezimmer soll nich mehr "vorkommen "\nein
x: \gut dann is es in "Ord\nung
y: \gut - dann ja - schn dann freu ich mich jetzt auf mei-
nen "Ka\ffee
(68) x: wir ham heute "a\bend wolln wir wohl auf so'ne "Fete
\gehn von som Freund vom "\Hans - ich wei "auch
\nicht ich kenn den "auch nicht \unbedingt tn [y:
/hmm] soll wohl ganz "nett tsein soll ganz "gut Ver-
den - ach "bVrigens Yja - fllt mir nmlich "auch \ein
h (.. .) du hast sollst doch auch irgendwie unheimlich
klasse "Pla\tten so "weit schon
As can be seen from the examples, both co-occurrence of different tokens (cf.
(67) and combination of markers with other gambits, e.g. starters (cf. (68)),
are possible. There seem to be no selectional restrictions operating on the ana-
phoric markers: they can be used in order to terminate the opening phase of
a discourse, various topics within the core phase or the core phase as a whole,
i.e. before the opening of the closing phase.
With cataphoric markers, the situation is slightly more complicated: starters
in a marking function and the token so can be used as initiators to the core
phase and new topics within the core phase as well as to the closing phase.
Moreover, they can even be used at the beginning of the opening phase, thus
marking the transition from a non-contact to a contact situation as in (69):
98 Claus Faerch and Gabriele Kasper
Co-occurrence with opening signals (greetings) is also possible, cf. (70), (71):
7 Conclusion
In the preceding section of the article we described the use of gambits with
uptaking, turntaking, turnkeeping, turngiving and marking functions in
German and Danish. Although this description is necessarily limited in various
respects (see further below), it should provide useful information about an
area of discourse which, with one notable exception, namely a CA of gambits
in German and English carried out by House (1980), has not previously been
100 Claus Faerch and Gabriele Kasper
Danish German
over- under-
error representation representation
jo doch
Receipts nej nein
n na
ja
German Danish
over- under-
error representation representation
ja ja ja
jo
Receipts
nej
n
jamen
Starters ja ja men
First, on the basis of our two corpora we have not been able to describe
frequency differences in the use of various gambit types and tokens. Similarly,
we have only given rather sketchy rules of distribution for individual gambit
elements. Without an analysis of the frequency with which various gambits
are actually used, and without a more detailed specification of the pragmatic
constraints on their applicability, our predictions about potential over- and
underrepresentation of gambit types and tokens in German-Danish or Danish-
German interlanguage are necessarily very tentative. Second, it is difficult to
determine to what extent our results are representative for a majority of
Danish and German native speakers with comparable social features, inter-
acting in similar situations, and to what extent they are due to idiosyncracies
in our subjects' use of gambits. It seems to be the case that these linguistic
phenomena are more subject to individual variation, and more directly related
to a speaker's socio-psychological characteristics, than many other aspects of
language. What one would like to know, for descriptive as well as for pedagog-
ical purposes, is which gambits are in fact used and accepted by most native
speakers under given pragmatic conditions, which are less frequently used,
but widely accepted, and which gambits are so strongly idiosyncratically
marked that they create communication problems (irritation and/or misunder-
standing) for most interlocutors. Here we touch upon the problem of linguistic
norms and tolerance margins, which is of course another highly relevant issue
both for the description and evaluation of learner language and for teaching
purposes. Third, we feel that we have failed to find a satisfying procedure in
establishing a tertium comparationis for our contrastive analysis. Rather, we
have started by using a classification system which was developed on the basis
of English, and which we modified for the present analysis. Especially with
the various subfunctions within a particular class of gambits, we have the
suspicion that the lack of a more general frame of reference which is neutral
between the languages under investigation obscures functional equivalences
which are more intricate and complex than the ones which we have established
in a rather ad-hoc manner. However, the issues of universality and specificity
in discourse have hardly begun to be investigated, and their formulation into
a model for contrastive discourse analysis has not been attacked at all. We
would like the reader to understand this closing remark both as an excuse for
the shortcomings of the present article, and as a suggestion for future research.
Gambits in German and Danish: a discourse CA 103
Notes
* We would like to thank the participants at the 4th International Conference on Con-
trastive Projects, Charzykowy, for a long and illuminating discussion of a preliminary
version of the present article.
1. Cf. Faerch - Kasper 1980: 92 for a brief overview of learners' discourse problems as
discussed in the literature. A detailed study of pragmatic and discourse problems of
German learners of English has been presented by Kasper (1981).
2. Other terms used in the literature for reference to the same or similar linguistic func-
tions are: "speaker/hearer signals" (Duncan 1974; Schwitalla 1976), "attention sig-
nals", "linking signals" (Leech - Svartvik 1975), "connectives" (Crystal - Davy 1975),
"greasers" (Fillmore 1983), "fumbles" (Edmondson 1981).
3. We would like to thank the students who participated in the seminar for their coopera-
tion in collecting and transcribing the data.
4. As receipts, exclaims and represents can also be used as turntaking gambits, the occur-
rence of one of these at a possible completion point would most likely be interpreted
by the present speaker as a turntaking gambit.
5. The following conventions are used in the transcriptions of the data:
" primary stress
1
secondary stress
\ fall
t rise
V fall - rise
(These symbols precede the syllable concerned.)
- pause relative to speaker's speed of delivery
I . . . ] simultaneous speech activity of speaker who has not got the turn
( . . . ) extraverbal activity
6. For a more detailed analysis of the German data, see House 1980.
7. Thus Danish children are sometimes told not to use but ja or nej as a receipt as we
often conveys the connotation of "I don't care".
8. The restrictions we have described here hold true for receipts with both an uptaking
and a turntaking function, but not for n and ja when used as go-ons. In this case, ja
can be used as an uptaker even if the hearer does not know the content of the speak-
er's S-H proposition: "x: og s m^dte jeg Lone [y: tja\ og hun fortalte mig . . ."
x: ' and then I met Lone [y: tyes] and she told me . . . ' .
References
Beneke, Jrgen
1975 "Verstehen und Miverstehen im Englischunterricht", Praxis des neu sprach-
lichen Unterrichts 2 2 : 3 5 1 - 3 6 2 .
Coulthard, Malcolm
1977 An introduction to discourse analysis (London: Longman).
Crystal, David - Derek Davy
1975 Advanced conversational English (London: Longman).
104 Claus Faerch and Gabriele Kasper
Schwitalla, Johannes
1976 "Dialogsteuerung. Vorschlge zur Untersuchung", Projekt Dialogstrukturen,
edited by F. J. Berens, K.-H. Jger, G. Schnk and J. Schwitalla (Mnchen:
Hueber), 7 3 - 1 0 4 .
Sinclair, John - Malcolm Coulthard
1973 Towards an analysis of discourse. The English used by teachers and pupils
(London: Oxford University Press).
Strevens, Peter D.
1972 British and American English (London: Collier-Macmillan).
Widdowson, Henry
1978 Teaching language as communication (London: Oxford University Press).
RUDOLF FILIPOVIC
3 Learners' errors. During the first phase of the YSCECP we came to the
conclusion that the decision made at the very inception of our project, to
108 Rudolf Filipovic
organize two teams of researchers, one working on CA and the other on error
analysis (EA), was of great importance for our future work on the project.
First we never faced the dilemma, which vexed some other contrastive pro-
jects, which of these two analyses (CA or EA) should be given preference,
which is more useful, and which can give better and more lasting results.
Secondly, through a very careful and exhaustive error analysis we identi-
fied all the topics that deserved thorough treatment in both projects. For the
YSCECP, error analysis was carried out within the framework of a project
called "Morphological and syntactic errors in the speech of students of English
in Serbo-Croatian speaking areas" (Andrassy 1971; Bilimic 1971; Kranjfievic
1971). This work helped us to ascertain several types of deviation and to
identify various causes for these deviations. For this purpose we collected a
great number of learners' errors typical for students of English in the Serbo-
Croatian speaking areas in Yugoslavia.
For the ZESCCP, error analysis was not carried out on such a large scale.
However, the choice of topics that were examined and analysed contrastively,
was done through error analysis based on spoken and written material col-
lected at the S-C courses for English speaking students from Britain and the
U.S.A. (Nakic 1975a-d; Bauer 1978;Mazalin 1978).
The second stage of error analysis was carried out in the form of tests; all
the results of CA achieved in the YSCECP were checked with the aid of spe-
cial tests in experiment of 500 learners (Filipovic 1973-1974b). In this way
additional material about the errors made by Serbo-Croatian speaking learners
was supplied. Through the results achieved in A we got acquainted with
learners' errors in both directions and this was of great use in the first phase
of CA in the YSCECP and in the ZESCCP.
4 A corpus in Li and L2. For both projects we decided to apply the same
method: the translation method. This decision led us logically to a corpus
since this method requires work on language material, i.e. a corpus. A com-
plete CA based on the translation method requires a detailed corpus in both
languages with translations into the other language. Only a corpus constructed
in this way enables us to carry out a complete investigation of some phenom-
enon in both languages.
Even in my first paper outlining our programme of work (Filipovic 1967)
I emphasized our intention of developing a bidirectional corpus of 100,000
English sentences translated into S-C and 100,000 S-C sentences translated in-
to English. Soon after that I laid down specific principles for the construction
of these two corpora. We intended to include both British and American
authors, non-fiction and fiction, along with S-C translations. We also planned
to record some spoken language and have it translated into S-C. Similarly, the
The primary data for CA 109
8 The Polish-English Contrastive Project also used the T.G. model and did
not envision the use of a corpus. This project did in its beginning stages begin
to collect a corpus; but this corpus was considered to be only an aid in the
work. The director of the project, Professor J. Fisiak, confirmed later that the
Polish-English Contrastive Project considers "the corpus to be of help only in
some cases and unnecessary in other cases, because the corpus is not in itself
our goal [. . .]. If the corpus provides only a few examples of the unit being
investigated, then one must look for examples outside the corpus." The main
reason for this view towards the corpus was that the Polish project was using
the T. G. model which does not envision the use of a corpus, but only the use
of native speakers as a source of information about the language.
11 All the projects mentioned above, regardless of the linguistic model ac-
cording to which they were working, had some kind of corpus, but none of
The primary data for CA 111
them had a standard corpus. For us, 'standard' means that the corpus can be
used in comparative studies in which it is necessary to have the same collec-
tion of data for each language. Standard can mean that the corpus is suitable
for further work preparing and presenting other data in English and other
languages (Filipovic 1969:49).
12 The fact is that each contrastive project has a corpus: a) those projects
based on the T.G. model, according to which one uses intuitions of native
speakers and does not need a corpus, use a corpus anyway generally refer-
red to as a 'helping' corpus; b) those projects which are based on some other
model have a corpus, more systematic than the 'helping' corpus used by the
first group, but still not 'standard'; and c) YSCECP is the only project to
work with a standard and representative corpus. This shows that contrastive
analysis, regardless of the linguistic model on which it is based, regardless of
whether or not it relies on native speakers, must use some sort of corpus, pre-
ferably one which is standard and representative.
14 Work on individual grammatical units from the syntax for example the
analysis of English possessive pronouns and their equivalents in S-C (Spalatin
1970) - has shown that without a standard corpus the investigator cannot
complete his analysis. This analysis would begin with an examination of the
uses of these pronouns in basic literature and the author's own knowledge
and experience with the language. However, as soon as the investigator began
to seek formal semantic correspondences in the second language, he would
immediately feel the lack of a corpus. As at that time computer processing of
our corpus was not yet complete, the investigator would compile a small pilot
corpus, and from this he would work out a table showing the equivalents. All
these data remained provisional and statistically incomplete as long as the
analysis did not proceed from the computer processing of the corpus and so
long as these data were not supported by contrastive concordances. Further-
more, so long as the investigator based his analysis on a pilot corpus, he could
112 Rudolf Filipovic
qualify his conclusions only with phrases such as Very little', 'almost always',
Very rarely', etc. By using the whole corpus, he would be able to provide
relevant statistical data and reach more precise conclusions.
whom investigations within the T-G model depend. Ilse Lehiste (1971:79)
maintains that today we can rely less and less on grammaticality judgements
of native speakers because of an increasing amount of variation among speakers
and an increasing amount of similarity between native and non-native speakers.
can give data of only one individual, and these data often reflect the bias of
the investigator.
Distributional information obtained from grammars is not reliable. The
grammarian usually seeks examples that illustrate a theory; consequently,
there is the danger that he has used only those examples that support his
theory and ignored all the rest. In the systematic use of the corpus, this is not
possible.
23 In conclusion, we can state that there are several primary data that the
contrastive analysis of the two language systems can be based on. We have
discussed the following: a) grammatical descriptions of Ll and L2 which are
very suitable for the initial stage of CA; b) learners' errors which help to de-
velop further the analysis and its scope; c) a corpus of Li and L 2 , i.e. a bi-
directional corpus, which can ideally satisfy the requirements for CA; d) a
unidirectional corpus which does not satisfy the requirements for CA as per-
fectly as a bidirectional one. However, we have shown that even a unidirec-
tional corpus has an important place among the primary data for CA, since:
a) a valid contrastive project cannot be considered complete before its results
have been verified and completed with the help of some representative corpus;
b) only a corpus can verify certain cases of doubtful grammaticality; c) fre-
quency and distribution can be established only on the basis of a corpus;
d) without a corpus we could not analyse the stylistic value, i.e. stylistic levels
and registers, of certain forms; e) the corpus is necessary for the component
of 'use', which with the taxonomic and generative components constitutes
the 'contrastive mix' (Ivir 1971:167), without which it would be difficult to
imagine the successful completion of contrastive analysis; f) without a corpus
it would be impossible to obtain a more or less complete list of all units which
belong to some part of speech; such a list is important for contrastive analysis
and its practical applications.
The primary data for CA 115
References
Andrassy, V.
1971 "Errors in the parts of speech in the English of learners from the Serbo-
Croatian-speaking area", YSCECP, Pedagogical materials 1, edited by
R. Filipovic (Zagreb: Institute of Linguistics), 7 - 3 2 .
Bauer, I.
1978 "Verbal aspect: error analysis", ZESCCP, Contrastive analysis of English
and Serbo-Croatian, Vol. II, edited by R. Filipovic (Zagreb: Institute of
Linguistics), 2 7 4 - 3 2 5 .
Bilinic, J.
1971 "Errors in the morphology and syntax of the verb in the speech of learn-
ers of English in the Serbo-Croatian-speaking area", YSCECP, Pedagog-
ical materials 1, edited by R. Filipovic (Zagreb: Institute of Linguistics),
32-60.
Bujas, Z.
1975 "Computers in the Yugoslav Serbo-Croatian-English contrastive project",
Bulletin of the Institute of Linguistics (Zagreb) 1 : 4 4 - 5 6 .
Carstensen, B.
1972 "Contrastive syntax and semantics of English and German", Active meth-
ods and modern aids in the teaching of foreign languages, edited by
R. Filipovic (London: OUP), 2 0 6 - 2 1 6 .
Chitoran, D.
1976 "Report on the Romanian-English Contrastive Analysis Project", 2nd
International Conference on English Contrastive Projects, edited by
D. Chitoran (Bucharest: Bucharest University Press), 1 1 - 3 4 .
Desz, Laszlo
1971 "Contrastive linguistics project on English and Hungarian in Hungary",
Zagreb Conference on English Contrastive Projects, edited by R. Fili-
povic (Zagreb: Institute of Linguistics), 1 2 4 - 1 2 9 .
Desz, Lszl - Eva Stephanides
1976 "Report on the English-Hungarian Contrastive Linguistics Project", 2nd
International Conference on English Contrastive Projects, edited by
D. Chitoran (Bucharest: Bucharest University Press), 5358.
Filipovic, R.
1967 "Contrastive analysis of English and Serbo-Croatian - Theory and prac-
tice", Studio Romanica et Anglica Zagrebiensia 2 3 : 5 - 2 3 .
1968 "Uloga kontrastivne analize u lingvistiSkom istraiivanju" [The role of
contrastive analysis in linguistic research], Filoloiki pregled (Beograd)
3-4: 1-10.
1969 a "Poietne faze rada na projektu 'Kontrastivna analiza hrvatskosrpskog i
engleskog jezika* " [The initial phase of the project 'Contrastive analysis
of Serbo-Croatian and English'], Prilozi i gradja, edited by R. Filipovic
(Zagreb: Institute of Linguistics), 3 - 2 5 .
1969b- YSCECP, Reports, 1 - 1 0 , edited by R. Filipovic (Zagreb: Institute of
1976 Linguistics).
1969c- YSCECP, Studies, 1 - 6 , edited by R. Filipovic (Zagreb: Institute of
1975 Linguistics).
1969d "The choice of the corpus for the contrastive analysis of Serbo-Croatian
and English", YSCECP, Studies 1, edited by R. Filipovic (Zagreb: Insti-
tute of Linguistics), 3 7 - 4 6 .
1970-1971 "Problems of contrastive w o r k " , S / M Z 2 9 - 3 2 : 1 9 - 5 4 .
116 Rudolf Filipovic
Hegeds, J .
1971 "Two questions of Hungarian-English contrastive studies", Zagreb Con-
ference on English Contrastive Projects, edited by R. Filipovic (Zagreb:
Institute of Linguistics), 1 0 1 - 1 2 1 .
Ivii, V.
1971 "Generative and taxonomic procedures in contrastive analysis", Zagreb
Conference on English Contrastive Projects, edited by R. Filipovid
(Zagreb: Institute of Linguistics), 1 5 6 - 1 6 7 .
Knig, .
1971 "Transformational grammar and contrastive analysis", Zagreb Conference
on English Contrastive Projects, edited by R. Filipovic (Zagreb: Institute
of Linguistics), 1 2 9 - 1 4 6 .
The primary data for CA 117
KranjSevic, S.
1971 "Errors in the syntax of the sentence in the speech of learners of English
in the Serbo-Croatian-speaking area", YSCECP, Pedagogical materials 1,
edited by R. Filipovic (Zagreb: Institute of Linguistics), 6 0 - 8 1 .
Lehiste, I.
1971 "Grammatical variability and the difference between native and non-
native speakers", Papers in contrastive linguistics, edited by G. Nickel
(Cambridge: Cambridge University Press), 6 9 - 7 4 .
MaCek, D.
1970 "Relative pronouns in English and Serbo-Croatian", YSCECP, Reports 3,
edited by R. Filipovic (Zagreb: Institute of Linguistics), 105127.
1975 "Relatives in English and their Serbo-Croatian equivalents", YSCECP,
Studies 6, edited by R. Filipovic (Zagreb: Institute of Linguistics), 2 7 - 6 2 .
Mazalin, D.
1978 "Word order: error analysis", ZESCCP, Contrastive analysis of English
and Serbo-Croatian, Vol. II, edited by R. Filipovic (Zagreb: Institute of
Linguistics), 2 7 4 - 3 2 5 .
Nakic, A.
1975a "Word order (error analysis)", ZESCCP, Contrastive analysis of English
and Serbo-Croatian, Vol. I, edited by R. Filipovic (Zagreb: Institute of
Linguistics), 9 7 - 1 0 4 .
1975b "Complement clauses (error analysis)", ZESCCP, Contrastive analysis of
English and Serbo-Croatian, Vol. I, edited by R. Filipovic (Zagreb: Insti-
tute of Linguistics) 146-148.
1975c "Relative clauses (error analysis)", ZESCCP, Contrastive analysis of
English and Serbo-Croatian, Vol. I, edited by R. Filipovic (Zagreb: Insti-
tute of Linguistics), 1 4 8 - 1 5 4 .
1975d "Lexical errors (error analysis)", ZESCCP, Contrastive Analysis of English
and Serbo-Croatian, Vol.1, edited by R. Filipovic (Zagreb: Institute of
Linguistics), 1 5 5 - 1 6 6 .
Nickel, G.
1972 "Project on applied contrastive linguistics: a report", Active methods and
modern aids in the teaching of foreign languages, edited by R. Filipovic
(London: Oxford University Press), 2 1 7 - 2 2 6 .
Slama-Cazacu, T.
1971 "The Romanian-English language project", Zagreb Conference on Eng-
lish Contrastive Projects, edited by R. Filipovic (Zagreb: Institute of
Linguistics), 2 2 6 - 2 3 4 .
Spalatin, L.
1970 "The English possessive adjectives my, your, his, her, its, our, their and
their Serbo-Croatian equivalents", YSCECP, Reports 2, edited by R. Fili-
povic (Zagreb: Institute of Linguistics), 9 4 - 1 0 2 .
CHARLES J. FILLMORE
1 Introduction
My purposes in this paper are: first, to sketch out what I take to be a work-
able notion of linguistic pragmatics, and to formulate in its terms a distinction
between 'large' (or systematic) and 'small' (or item-by-item) differences in the
pragmatic practices of different linguistic communities; and second, to illus-
trate methods of discovering and describing some of the item-by-item kinds
of pragmatic knowledge, in particular those related to formulaic expressions
in English and pragmatic particles in German. In both cases what I have to say
is merely programmatic and illustrative. The material on formulaic expressions
is part of a collaborative effort with Lily Wong Fillmore, a work that will not
be finished soon. The remarks on German modal particles are nothing more
than an illustration of the difficulties one faces in doing work in linguistic
pragmatics, responding to a number of insights by Jan-Ola stman, Knud
Lambrecht and Dorothea Franck.
ular time, their pay scale, etc. (2) A second intended characteristic of the ana-
logy is that it forces a certain way of talking about texts. In particular, it does
not allow us an easy way of viewing a text as a complex structure having
phrases and sentences as its parts. This is so because we cannot easily think of
a text as an assembly of tools (Fillmore 1983).
We do, of course, need to be able to talk about texts, especially if we are
agreed that linguistic pragmatics can most usefully be formulated as know-
ledge about how to associate texts with their potential or actual contexts of
use. Following Halliday, we can distinguish 'text as process' from 'text as pro-
duct' (Halliday 1983). Within our analogy, it is easy enougji to think about
text-as-process: the process is an activity in which somebody is engaged
(telling a story, insulting a neighbor, proving a theorem), the tools of language
(words, phrases, sentences) being the means for carrying out that activity.The
novelty of the analogy is that it requires us to think of text-as-product as
something like a record of the particular tools used in carrying out a partic-
ular activity, and the sequence in which those tools were used.
The critical characteristic of the analogy shows up when we think of what
it is to understand a text. If a text is a record of the tools used in carrying out
an activity, then the process of understanding that text is best understood in
terms of the steps one goes through in figuring out, from the list of tools, just
what that activity was. An interpreter achieves comprehension of a text by
knowing the 'semantics' of the tools, by knowing conventional ways of using
tools in the local workplace, by appealing to common sense assumptions about
what people ordinarily do (and why they do what they do), and so on.
To make my point more concrete,I offer a short 'text' of non-linguistic tools:
hot water
shaving brush and shaving cream
razor
hot water
washcloth and towel
tincture of iodine
bandage.
I have suggested that it should be possible to interpret such a 'text' by know-
ing what the various tools are for. (You wl recall my asking you to be gener-
ous about the category 'tool': not everybody thinks of water or bandages as
tools.) In trying to figure out what chain of activities underlies this record, we
use both our knowledge of typical uses of tools and typical activities of
people. Once we know what such a list stands for, we will readily find ourselves
constructing 'envisionments' of the activity revealed by our list, probably at
more than one 'level', these levels corresponding to the degree of certainty we
124 Charles Fillmore
feel about the details of the envisionment. At the lowest level (the level at
which we feel the most secure about our hypotheses) we are likely to infer
that somebody was shaving, that while shaving that person drew blood, and
that he or she later patched up the wound with a bandage. If our envision-
ment goes further, we might find ourselves imagining events the record does
not clearly warrant, for example, that the shaver was male, that the area of
the body that got shaved was the face, that this activity took place in the
morning as a part of a more general activity sequence (a man's morning toilet);
and, embellishing further, we might even imagine that feelings of exaspera-
tion or utterings of unsavory pieces of language make up parts of the total
scene.
For any given tool record, of course, the conclusion we draw could be
wrong. With language, however, there is something else we can count on: the
knowledge of communication as an activity with its own methods and pur-
poses. I have made the present record more like a linguistic text than an
ordinary record of tool use might be, since I have deliberately constructed
this list in order to communicate an idea, and you know that that is what I
was doing. Since my argument would not work at all if I were being unco-
operative, you can be sure that the scene I intended you to construct was very
much like the one I described,perhaps without the later embellishments. You
can be sure, too, that if I had decided to surprise you by describing a thorough-
ly bizarre situation that happened by sheer coincidence to call for the tools
I listed here, it would have been because I intended you to be surprised. You
would still know that I intended you to create the interpretation I spelled out
at first, because if I had not, there would have been no point to the surprising
revelation at the end.
Any ongoing interpretation of a text of this kind or the more familiar
kind can be marked with guesses and uncertainties. Such uncertainties, un-
fulfilled expectations, briefly maintained hypotheses, etc., are an important
part of the interpretation process, because they give significance to new infor-
mation as it comes along. In our tool text for example, there was the assump-
tion that the shaving was part of a morning ritual.That assumption would have
been quickly and easily supported if we later found out that our text was an
excerpt from a larger text in which what immediately followed the items in
our passage were a dressing gown, a new ap er, and a coffee pot.
The importance of the tool record analogy is that it provides a clear case
of activities (shaving), natural cause-effect relations (contact with a blade,
getting cut), the functions of artifacts (razors, shaving brushes), standard uses
of natural objects (hot water for washing and moistening), etc., and for which
it is absolutely essential to find coherence in the record by assuming that it re-
presents the traces of some activity and by figuring out what that activity was.
Contrastive pragmatics 125
Let us now consider, from the same point of view, a simple linguistic text.
It is a conversational text, which means that in our representation of it, we
need to indicate that different tools were used by different workers, that is,
that different sentences were uttered by different speakers. Here is the text:
(knowledge of what kinds of things people use tools to build), and knowledge
of possible communicative intentions, all contribute simultaneously to the
'picture of the world' we get when we interpret a text. How much of that
knowledge, we need to ask, can be thought of as pragmatic knowledge? A rea-
son we need to concern ourselves with this question is that we need to know
if there could in principle be a 'contrastive pragmatics'. A notion of 'con-
trastive pragmatics' makes sense if for different languages either distinct con-
ventions exist for 'reading between the lines' (that is, for discovering the mes-
sage in the pairing of an utterance and a context), or if special and distinct
pragmatic purposes have linguistic means dedicated to them in different lan-
guages.
Pragmatic competence, briefly put, consists in being able to make judg-
ments on questions of the following form: In such-and-such a setting, what
could a speaker say which would produce such-and-such an effect? Pragmatics
comprises judgments on the fittingness of particular expression types (words,
grammatical forms, intonations, etc.) to particular situations. These situations
can be roughly (but not mutually exclusively) grouped into those in which
what is predominant is (1) the speaker's feelings or attitudes, (2) the character
of the activity in which the utterance plays a role, and the degree of speciali-
zation of the utterance form to the activity, (3) the nature of the social and
spatial relationships between the participants in the communication event,
and (4) the development of the speaker's activities (and the interpreter's ex-
perience) through time, i.e., from the beginning of the text to the end. Within
the field of pragmatics we find such topics as: expressive language, illocution-
ary force indicating devices, routine formulas, activity-bound speech acts,
politeness levels, registral varieties, style markers, forms of poetic diction,
genre types, evidentiality, presupposition, indirectness, topic marking, devices
for emphasis or downplaying, markers of cohesive links, word order variation,
etc., etc. Whether such a miscellany can be seriously regarded as a single field
is unclear, but that this list incorporates a very great amount of what is in-
volved in knowing how to use a language is beyond doubting. From the reality
that all of the items in this list do indeed vary across languages, they obvious-
ly make up an important part of what needs to be considered in contrastive
linguistics.
3 "Applied pragmatics"
There have been several good reasons for the relative rarity of pragmatic
studies in contrastive linguistics in the past. First, this kind of knowledge has
looked like knowledge that could freely cross linguistic boundaries and that
could vary arbitrarily among speakers of the same language. Hence it was
Contrastive pragmatics 127
thought of as having more to do with what people are like, or what cultures
are like, than with what languages are like. Second, this kind of knowledge
appeared to be easier to learn 'in the world' than in the classroom. If it is dif-
ficult to teach something in the classroom but easy to learn it in the world,
that can be taken as a pretty good reason for not devoting a great deal of time
to it in a language teaching program. Third, pragmatic knowledge seemed to
presuppose mastery of the lexical and grammatical resources of a language, so
grammar and lexicon clearly had to take first place. And fourth, it has proved
difficult to provide, at least for many areas of pragmatics, not just an ade-
quate formalism, but even sufficiently satisfying informal subjective descrip-
tions of just what the facts are. Fifth, it has been in general hard to find out
what the components of pragmatic abilities are. The data for pragmatics are
the linking of sentences with the contexts of use in which 'performances' of
them are welcome and fitting, and the evaluation of the nature of the fit be-
tween the sentence and its contexts. Precisely because the notion 'context'
includes so much, attempts to correlate particular linguistic choices with spe-
cific aspects of context are so difficult.
polite ("Gohan moo sukosi itadakemasen ka?" 'Can't we receive some more
rice?'), but their English translations sound sarcastic and rude. Maybe the
phenomena I have pointed out here reflect, in the context of their languages,
really 'large' questions, but it seems nevertheless that they can be treated here
as 'small' questions. They appear to be among the things that, from a lan-
guage learner's point of view, are best learned one piece at a time.
A Japanese acquaintance of mine, rejecting (or not knowing) the English
formula "I am pleased to meet y o u " used instead an English translation of the
standard Japanese formula spoken when meeting someone for the first time.
It came out in English as, "This is the first time I have seen you". While the
remark is unexpectedly flattering to people who assume themselves to be well
known, it strikes others as being very mysterious.
One of the most telling signs of German-speaking background for some-
body speaking English is the uncommonly high frequency of use of the word
already. This comes, of course, from translating German schon as English al-
ready. I have some German friends who are so self-consciously aware of the
mismatch between these two words that they avoid using the word already al-
together. Many people would have reason to be grateful if teachable insights
into the subtle pragmatic differences between these two words could be dis-
covered and publicized.
We are dealing here with linguistic practices that do not 'translate' well;
put differently, we are dealing here with sources of 'nonphonological accent'.
As an illustration, take the expression YOU should talk, with YOU stressed
and the other two words destressed. The setting for using this utterance is one
in which A (the speaker of our formula) and (A's interlocutor) are discus-
sing C. The antecedent event is that has just said something unflattering
about C. The speaker's attitude revealed in A's uttering this formula is that
B's criticism of C is unfair, since the same undesirable qualities can be found
in B. The function of the utterance is to scold for remarking on C's faults
while ignoring B's own identical faults. In the usage notes we might add that
the expression is common, at least in the United States. As a prototype ex-
ample we might bring the following conversation.
B: C never has a kind word to say about anybody.
A: You should talk.
(A collection of formulaic expressions annotated in this style is included in
the appendix to this paper.)
We were fortunate in this class to have groups of speakers of several other
languages present, especially Japanese and German, and we were therefore
able to make a number of item-specific as well as systematic comparisons
among several languages. We learned, for example, that Japanese has elab-
orate speech routines in areas calling for apologies, which both German and
English lacked; that English (at least American English) has a large repertory
of non-serious formulas that played an important role as conversational
'greasers', particularly important in a society in which people become super-
ficially intimate very quickly, whereas both Japanese and German seemed to
be 'impoverished' in that regard; and we learned that Yiddish has whole genres
of speech formulas which English, German and Japanese lack, in the realm of
curses, wishes, blessings, and the like, as described in Matisoff's recent book
(Matisoff 1979).
We became aware of three important characteristics of formulaic language.
The first is that it comprises expressions whose pragmatic functions are rela-
tively easy to bring to consciousness. When we asked native speakers to con-
struct situations that called for the use of particular formulaic expressions we
tended to get recognizably similar reports each time and found that most of
what we learned from the informant interviews or questionnaires was relevant
to the pragmatic description of these forms. They tend to be expressions de-
dicated to particular pragmatic purposes. The second is that the 'effect' of
their use can be very subtle, is seldom identical from one language to another,
and can often be the occasion of cross-cultural misunderstandings. We heard
many anecdotes of American travellers offending their foreign hosts in com-
pletely innocent ways. Two separate anecdotes involve the formula I thought
you'd never ask It's a fairly innocent teasing expression in American English,
130 Charles Fillmore
but it could easily be taken as insulting by people who did not know its spe-
cial status as a routine formula. In one case a European man asked an American
woman to join him in dance, and she, being playful, said, "I thought you'd
never ask". Her potential dancing partner withdrew his invitation in irritation.
In another case a European hostess offered an American guest something to
drink, when he, unilaterally assuming a teasing relationship, said, "I thought
you'd never ask". He was asked to leave the party for having insulted his host.
The implications for contrastive pragmatics are obvious.
These observations lead naturally to the third important characteristic of
formulaic language: since it almost always involves a kind of secondary con-
ventionalization of language, its presence in a text does not stand out. Gen-
erally, if you have misunderstood something because of its 'formulaic' con-
tent you will have no reason to believe that there was anything in the text
that you missed. If somebody uses a noun or a verb that you don't know, you
can know that you've missed something, and you can ask questions to have
the thing you missed explained. In the cases that I have in mind, however,
what you heard can be given an interpretation. It's just that the people who
know conventional formulaic ways of saying things understand aspects of texts
that the more innocent interpreter has no reason to suspect are even there.
318 contained one or more pragmatic particles. In those 318 jokes were
115 instances of doch, 95 of denn, 86 of mal, 49 of ja, 27 of noch, 15 of
wohl, 15 of eigentlich, 12 of eben, 11 of nmlich, and then a smaller number
of instances of blo, einmal, etwa, halt, zwar, jetzt, allerdings, as well as a few
others of whose status I was unsure.
There was too much material, of course, so I concentrated on ja, doch and
nmlich. Here is one of the jokes from p. 54 of Die besten Witze der Zeit
(Kortmann 1977):
Very many of the particles in my joke books remain opaque to me, and the
collection contains probably as many instances of doch and ja which do not
fit my generalizations as those which do. The pragmatic points to be made
here are (1) that the German particles correspond (however imperfectly) to
fairly subtle parenthetical formulaic expressions in colloquial English, and
(2) that these expressions, if used in English as often as their counterparts
are used in German, would produce very mannered speech. A 'large' prag-
matic fact about German is that the colloquial language welcomes (one might
almost say 'requires') pragmatic particles that reflect choices among the
numerous ways in which individual utterances can be situated in their dis-
course context. English has a number of such expressions, too, but an im-
portant difference is that the corresponding forms in English cannot be gram-
matically incorporated into surface clauses. This difference seems to exist by
134 Charles Fillmore
virtue of the fact that the German forms are 'particles'while the English forms
are 'formulas'.
3.4 Conclusion
In conclusion it seems that with the 'small' pragmatic principles and practices
that language learners need to know, the elements of their description require
one to pay attention to essentially all of the ingredients of that mysterious
box in the diagram called context: speaker's attitude, speaker/hearer relation-
ships, discourse purposes, institutional setting, events in the surrounding
world, position in an ongoing dicourse, mutual knowledge assumptions, and
all the rest. Even when the 'small' facts are described carefully and accurately
one at a time for two languages, the contrastive aspect of the total description
remains seriously incomplete. These 'small' facts are too often instances of
'larger' practices by which the two languages differ more seriously. The de-
scription of these 'larger' patterns of use appears to me to be about as diffi-
cult to come by as a description of the two cultures. It was not for want of
space that I chose not to deal with such problems in this paper.
Contrastive pragmatics 135
4 Appendix
Antecedent event A has come upon and has noticed that has ob-
served A's situation
Prototype example A's colleague C has invited A into C's hotel room to
examine a manuscript; C's beautiful sister has been
in the room visiting her brother; A and C's sister
leave the room together; A notices B, a close friend
of A's wife; A says, "It's not what you think".
"Creative" departure
Miscellaneous observations
136 Charles Fillmore
Antecedent event -
Function of utterance A would say this just while, or just before, perform-
ing the act which will be painful to
Similar formulas
Miscellaneous observations
Contrastive pragmatics 137
Miscellaneous observations
138 Charles Fillmore
Antecedent event
Similar formulas
Miscellaneous observations
Contrastive pragmatics 139
Antecedent event -
Miscellaneous observations
"I'll tell you what" could be used by either party in a bargaining setting;
"I'll tell you what I'm gonna do" would be used only by the salesperson. I
associate these expressions only with males.
140 Charles Fillmore
Prototype example says, "C never has a kind word to say about any-
one." A says, " Y o u should talk."
Miscellaneous observations
Use of this formula seems to include A's acknowledgment of the aptness of
B's description of C as X. "It takes one to know one" does not include that.
Contrastive pragmatics 141
References
Downing, Pamela
1977 "On the creation and use of English compound nouns", Language 53.4:
810-842.
Fillmore, Charles J.
1973 "May we come in?", Semiotica 9:98-115.
1979 "On fluency", Individual differences in language ability and language be-
havior, edited by C. J. Fillmore et al. (New York: Academic Press), 8 5 - 1 0 2 .
1982 "Ideal readers and real readers", Georgetown University Roundtable on Lan-
guages and Linguistics 1981, edited by Deborah Tannen (Georgetown:
Georgetown University Press), 248-270.
1983 "Discussion of Halliday's 'How is a text like a clause?'", Nobel Symposium
on Text Processing, edited by Sture Allen (Stockholm: Almqvist & Wiksell
International) , 2 4 9 - 2 5 8 .
Halliday, . A. K.
1974 "Modes of meaning and modes of expression", Functions and context in
linguistic analysis, edited by D. J. Allerton et al. (Cambridge: Cambridge Uni-
versity Press), 5 7 - 5 9 .
1983 "How is a text like a clause?", Nobel Symposium on Text Processing, edited
by Sture Allen (Stockholm: Almqvist & Wiksell International), 2 0 9 - 2 4 8 .
Kortmann, Erhard
1977 Die besten Witze der Zeit (Hamburg: Hoffmann und Campe).
Lakoff, George P.
1974 "Syntactic amalgams", Papers from the Tenth Regional Meeting of the
Chicago Linguistic Society, 224-244.
Matisoff, James A.
1979 Blessings, curses, hopes and fears: psycho-ostensive expressions in Yiddish
(Philadelphia: Institute for the Study of Human Issues).
Van Valin, Robert D., Jr. - William A. Foley
1980 "Role and reference grammar", Syntax and semantics 13: Current approaches
to syntax, edited by Edith Moravcsik and Jessica Werth (New York: Aca-
demic Press), 329-351.
Zimmer, Karl E.
1972 "Appropriateness conditions for nominal compounds", Working papers on
language universal 8 (Stanford: Stanford University), 3 - 2 0 .
MARIA GRZEGOREK
Some examples:
(1) There are no ghosts.
Nie ma duchow.
(2) There was a dog in the garden.
Wogrodzie byl pies.
(3) There developed a serious crisis.
Rozwinqi sif powazny kryzys.
(4) There entered a strange looking man.
Wszedl dziwny czlowiek.
(5) There were beautiful flowers all over the place.
Wszdzie byiy pigkne kwiaty.
(6) There was a rumour that the president was killed.
Byty pogtoski, ze prezydent zostal zabity.
(7) On the platform there were many women who waited for
their husbands.
Na peronie byto wiele kobiet, ktore czekaiy na swoich
m$z0w.
(8) There were several people sick after the party.
? Po przyjeciu kilku ludzi byto chorych.
Po przyj^ciu kilku ludzi zachorowalo.
(9) There was a girl dancing on the roof of one of the cars.
Na dachu jednego samochodw tanczyla (jakas)
dziewczyna.
(10) There were fifty people injured during the riots.
Podczas rozruchow zraniono pi$cdziesiqt osob.
Podczas rozruchow zostalo zranionych pigcdziesiqt osob.
144 Maria Grzegorek
0 An introductory note
The second problem with this solution concerns the so-called sourceless
existential sentences such as 18 and 19.
Another reason why this solution is not the best one is that it suggests that
the change of order in existential sentences is caused by the same type of
rules as other pragmatic reorderings such as stylistically motivated word order
or emphatic thematization. As is well known, existential f/2 ere-constructions
are stylistically and otherwise communicatively unmarked. In fact, it is their
alleged sources which are marked.
3. The third solution is to allow the element there to be the subject in the
underlying sentence, i.e. there could be introduced by the PS rules (as in
Jenkins 1975). Apart from theoretical objections (cf. Milsark 1974), some
other objections are that this approach eliminates the difference between
existential and presentational there-sentences and that it hides the fact that
the verb in presentational sentences is put in the position preceding the sub-
ject for strictly pragmatic reasons.
Summing up, we choose the first solution as the best one and claim that
the order in existential sentences V NP is due to basic linearization rules of
English.
there-sentences in English and Polish 147
Syntactic classification
77jere-constructions have been classified in several ways by various linguists
(cf. Ross 1974, Milsark 1974, van Oosten 1978). Here we will only mention
the main criteria which have to be taken into account in any classification of
these sentences.
1. Type of the main verb
a. be3
b. verbs other than be but of related meaning, e.g. appear, emerge, occur,
develop, remain4
c. some stative verbs describing location (hang, lie, stand) and some verbs
of directional movement (step out, burst in, walk into).
2. The structure of the NP following the verb
a. without a codicil, e.g. unmodified NP as in examples 1 and 2; NP with a
prenominal modifier (examples 3, 4, 10); NP with a complement (16).
b. with a codicil. Codicil can be of several types:
- prepositional phrase s (examples 12,13)
- relative clause (7)
- adjective6 (8)
148 Maria Grzegoiek
Semantic classification
The only f/jere-constructions which are considered to be unquestionably
existential are the so-called "bare existential sentences", i.e. sentences in
which the subject NP is not followed by any locative or time adverbial, and
also sentences with locative or time adverbial in the thematic or final position.
But even among sentences which look like bare existential sentences there are
those which do not assert existence as such. For instance, as noticed by van
Oosten (1978) sentences such as 24:
(24) There is even a garden.
do not assert that a garden exists somewhere in the universe. By uttering sen-
tence 24 the speaker wants to draw the listener's attention to the existence of
a garden in the frame of reference of their current discourse. In a similar way,
sentence 25 is interpreted:
(25) There's plenty of time.
Either one of the two functions or both (because they are related) underlies
the use of f/zere-constructions:
1. to assert the existence of the referent of the displaced subject NP,
2. to raise the referent of the subject to the addressee's consciousness.
True existential sentences assert existence (plus also location or time). The
non-existential f/iere-constructions, such as presentational sentences or lists,
perform primarily the second of these two functions.
If we extend the notion of 'asserting existence' to cover also cases of bring-
ing into existence in the listener's mind, we can (following the idea of van
Oosten 1978) establish a-scale of existential sentences according to the de-
gree to which the existential assertion is expressed. 7 If we start from the
"pure" existential we have the following order:
rAere-sentences in English and Polish 149
1. existential sentences
there - be indef. NP codicil
2. locative-existential sentences
there - be - indef. NP - codicil locative
3. existential sentences with verbs of existence other than be
there verb indef. NP codicil locative
4. presentation /iere-constructions with stative verbs
there - verb - | ^ ^ j NP - locative
or
there - verb - locative - NP
5. list r/iere-constructions
Bare existential sentences, i.e. those in which the subject NP is not followed
by a locative phrase, are always themeless. The speaker has no other choice,
except to choose a different verb:
(26) There are no ghosts.
(27) *No ghosts are.
*Ghosts are not.
(28) Ghosts do not exist.
26 and 28 are not textually equivalent. In 28 the noun phrase ghosts is ana-
phoric, probably mentioned or implied in the context or co-situation. Accord-
ingly, ghosts in 28 carries a lower degree of communicative dynamism than
the verb following it.
Generally bare existential sentences assert the existence or non-existence of
some object with some property described in the coda or prenominal modi-
fier. Locative-existential sentences assert the existence of a given object in
some place, i.e. they assert both existence and location. The locative phrase
may belong to the focus, in which case intonation pattern (a) is used, or it
may be contextually bound, in which case the intonation pattern (b) is used.
150 Maria Grzegorek
the referent of the subject is identifiable. Notice that the first sentence in ex-
amples 3436 is unacceptable, wheras the (b) sentences are good English sen-
tences because in /zere-constructions the requirement that the referent of the
subject be identifiable by the listener does not hold (you do not have to iden-
tify the object if your purpose is merely to assert its existence).
(34) a. *A book is probably on the table.
b. There is probably a book on the table.
(35) a. *Lions are in Africa.
b. There are lions in Africa.
(36) a. *A famous cathedral is in Guildford.
b. There is a famous cathedral in Guildford.
(examples from Breivik 1975)
41 c, 42b, 42c) the locative phrase is presented as the given information, but
in each case the speaker has a different view of the state of affairs. In 4 2 b the
jacket is viewed more as a location and in 42c more as an object with some
property.
The relationship of sentences with preposed locative phrases with there to
sentences with preposed locative phrases without there is not clear. Breivik
(1975) noticed that there cannot be omitted under the same conditions
under which the 'paraphrase' of existential sentences with thematic indefinite
NP cannot be used. Hence sentences 4 3 - 4 5 are unacceptable (just like sen-
tences 34a, 35a and 36a are).
There can be omitted if location is specific enough. Consider for instance this
example from Breivik (1975):
(46) The old rocks which make up Wales stretch eastward into
the English counties of Shropshire and Herefordshire. . . .
In the North are the rugged mountains (several exceeding
3,000 feet in height) of Snowdonia; in the centre and South
are extensive moorlands.
Breivik (1975) treats sentences with omitted there as variants of those with
there, i. e. sentences 47a and 4 7 b express the same proposition. 11
Locative there - be NP
(On the table there are two books)
are thematic variants of the same existential proposition (providing on the
table is outside the scope of the focus NP, because only then is it informa-
tionally equivalent to the preposed locative phrase).
constructions. The same explanation can be extended to stative verbs (cf. van
Oosten 1978).
A little girl is the last element of the picture perceived by the speaker. It could
not be put in the thematic position because it was not 'available' when the
speaker started his message.
Even without a more detailed analysis two things can be said about the struc-
ture of Polish counterparts of English there-sentences, especially existential
sentences:
1. There is no empty subject in Polish in any counterpart of any type of
English i/iere-constructions,
2. In both languages the relative order of the verb and the notional subject is the
same, i.e. verb - subject and follows the principle of arranging lexical items
according to the increasing degree of C.D. (communicative dynamism).
The occurrence of empty subject in English existential sentences and its non-
occurrence in the Polish counterparts of these sentences are instances of more
general rules of English and Polish grammars. The scope of these rules is
broader than the domain of existential sentences. These rules can be inform-
ally stated in the following way:
English. The basic word order of English declarative sentences is SV(O).
Any re-arragements of lexical items (motivated by pragmatic considerations
such as context, co-situation, speaker's intentions, choice of discourse topic,
etc.) tend to re-establish that order, e.g. Passive I, Passive II, Pseudopassive,
to some extent also Subject-to-subject Raising and Tough-movement.
The statement that no English declarative sentence may have the order
verb - subject would be too strong because this order is found in the so-
called "root permutations" to which also presentational r/iere-sentences be-
156 Maria Grzegorek
The relative order of subject and verb in existential sentences is the same in
English and Polish (and in some other languages). Although this order dis-
agrees with the basic word order in declarative sentences in both languages, it
is still the unmarked order of existential sentences in English and Polish, i.e.
it is not motivated by contextual factors but rather by inherent semantic
features of verbs of being which are such that these verbs always have a lower
degree of C.D. than any noun accompanying them.
Since in polish word order always tends to reflect the degree of C.D., the
fact that the order of existential sentences is verb - subject is nothing ex-
ceptional in this language. The only thing which seems worth stressing is the
fact that in existential sentences this particular order is lexically rather than
contextually governed. But if in a particular context the verb bye 'be' be-
comes the information focus, it is put in the position after the subject (i.e. ac-
cording to the degree of C.D.) in Polish whereas in English such change is not
possible:
Since, however, without specific contexts such as this, the order subject
verb is unacceptable in Polish existential sentences, the same linearization rule
as for English should be postulated : S -> V NP (if V = verb of being).
English: There - V ex - NP
Polish: Vex -NP
or
lexical
function 1 2
Examples:
(1) There are cannibals.
S%/istniej$ ludozercy.
English: There - be - NP - { ^ f * j
158 Maria Grzegorek
locative
- there - be- NP
time
Locative I
Polish: - bye - NP
time J
or
lexical
function
Examples:
The differences between English and Polish existential sentences concern two
problems:
1. the position of the adverbial,
2. the lexical value of the verb.
b. verbs which characterize the typical action or state of the referent of the
subject NP (examples 1620):
gwiazdy - swiecq
'stars - shine'
orkiestra - gra
'band - plays'
trawa - rosnie
'grass - grows'
deszcz - pada
'rain drops - fall'
strumien - piynie
'stream - flows'.
c. verbs of perception in impersonal form.
The verb corresponds to the senses by means of which we perceive the referent
of the subject (existence is asserted indirectly via sensory effects examples
21,5,6):
plamy - widac
'patches - one can see'
muzyka - slychac
'music one can hear'
mroz - czuc
'frost - one can feel'.
d. verbs which are idiomatically associated with the subject noun, i.e. in
collocation with a given noun the verb functions as a synonym of bye:
cisza - zapasc
'silence - fall'
strzal - pasc
'shot - fall'.
the fact that the sentence is existential, the choice of the lexical verb is pre-
dictable. From the point of view of their communicative function these verbs
have a lower degree of C.D. than the subject NP and are thus put in the posi-
tion preceding the subject, like the verb bye.
The English Mere-constructions with participial forms can have two readings:
the assertion of the occurrence of the event (in nonthematic form) and the
assertion of the existence of the referent of the indefinite NP. 1 7
The possibility of two interpretations is due to the structural ambiguity of
these constructions. The present participle may be treated as part of the pro-
gressive form (the event reading) or as a postnominal modifier of the inde-
finite NP (the existential reading). Similarly, the past participle may be ana-
lyzed as part of the passive form of the verb (the event reading) or as a post-
nominal modifer of the indefinite NP (the existential reading). All English
Mere-constructions with participal forms have thematic counterparts, with
the indefinite NP functioning as theme (but only for the event reading), e.g.:
Sentences 29 and 33 are different. The word order is different than in the rest
of sentences with participial forms because in these sentences the requirement
of an independent principle, i.e. the 'heavy end' principle, motivates the shift
of the subject NP to the position following the participle. It seems that such
sentences allow only the event interpretation.
77iere-sentences with participial forms are translated into Polish like other
English sentences with the progressive aspect or passive voice, respectively.
Thus sentences with present participle are translated in Polish as sentences
with verbs in imperfective form, whereas sentences with past participles are
164 Maria Grzegorek
translated as sentences with the verb in passive form with the auxiliary zostac
(rarely with the auxiliary bye).
The order of lexical elements in Polish counterparts of English there-con-
structions with participial forms signals their relative degree of C. D. plus the
fact that the subject NP is nonanaphoric (hence generally is in postverbal po-
sition, corresponding to the postcopular position in English). The only thing
worth mentioning is that the Polish equivalents do not allow any existential
interpretation because the verb bye does not occur in them.
If in the Polish translation of an English there-sentence with past participle
the passive auxiliary bye is used, the existential interpretation is possible. It
seems, however, that bye rather than zostac is used if the past participle re-
sembles adjectives, i.e. if there is no przez-phrase {by-phrase). Moreover, the
restrictions here are similar to the restrictions on the occurrence of post-
nominal adjectives in sentences with expletive there and their Polish counter-
parts, i.e. only those past participles are allowed which are followed by a
complement:
(34) There were many children left at home.
Pozostawiono wiele dzieci w domu.
? By to wiele dzieci pozostawionych w domu.
(35) There were many children left without food and water there.
By I tarn wiele dzieci pozostawionych bez jedzenia i wody.
The order of the verb and the subject in English presentational sentences
and in their Polish equivalents can be easily explained by the principle of put-
ting lexical items representing a higher degree ofC.D. after those with a lower
degree of C.D. In these examples verbs of appearance on the scene have a
lower degree of C. D. than the subject NPs denoting those entities that appear
on the scene. Similarly, stative verbs describing the manner of spatial existence
are less communicatively relevant than the NPs following them.
The only important difference between English presentational sentences
and their Polish counterparts can be noticed if we compare these sentences
with other types of sentences in both languages. Thus English presentational
sentences are stylistically marked because they violate the basic word order
subject - verb and the principle that locative adverbials are usually in clause-
peripheral positions, for instance in thematic position as in these sentences
without there:
In English the order subject - verb can be changed to follow the degree of
C.D. only for a restricted class of verbs. In Polish, on the other hand, the
communicative order is obligatory and unmarked. The verb can precede the
subject not only in the case of verbs of being and verbs of appearance on the
scene but also in the case of other verbs:
3 Conclusion
3.1 Existential sentences in English and Polish
a. the order of subject and verb
In both languages the obligatory (contextually unmotivated) change of the
basic order subject - verb to verb - subject distinguishes existential sentences
from other types of sentences in these languages.
b. theme
Bare existential sentences are in both languages themeless. The initial bye in
Polish cannot be treated as theme because being a verb it cannot be used re-
ferentially, and having such general meaning it cannot be contextually bound.
Locative-existential sentences have different themes in English and Polish: in
English they are basically themeless, and only optionally the locative phrase
can be shifted to the thematic position. In Polish, on the other hand, locative
phrases are obligatorily thematic in these sentences. The change of the position
of the locative adverbial in Polish results in a stylistically marked utterance.
Notes
1. The definition of theme and various types of themes are discussed in some detail in
my paper "Thematization in English" to appear in Studia Anglica Posnaniensia 17.
2. This observation is due to Rando and Napoli (1978). They replace the notion of
cardinality words introduced by Milsark (1977) with the notion of anaphoricity. Ac-
cording to Milsark only NPs with cardinality determiners (as opposed to quantifica-
tion determiners) can occur in existential sentences. Milsark's notion does not ac-
count for definite NPs and generics in fAere-constructions, whereas Rando and
Napoli's notion of anaphoricity does. By anaphoric NP they mean such that the
fAere-sentences in English and Polish 167
speaker can safely assume that the hearer can associate not only a unique referent
with this NP but also the proper unique referent.
3. It is important to remember that only the first occurrence of be in the clause wheth-
er 'existential' or passive auxiliary or progressive auxiliary) can trigger fAere-insertion
if the NP is indefinite. The only exceptions are the occurrences of be in modal ex-
pressions such as be about to and be going to (cf. Milsark 1974).
4. Which verbs other than be can occure in existential sentences cannot be determined
on a purely semantic basis. The matter remains a mystery. For instance, there is no
explanation why begin can occur in fAere-constructions and its synonym start is less
acceptable:
{ ??*started }
5. Prepositional phrases of location which are preposable are excluded here and are
treated separately.
6. In fact, very few adjectives can appear in codicil position. According to Milsark
(1974, 1977) only adjectives denoting temporary states can occur in this position.
Gee (1978) stresses the relevance of the opposition between adjectives with and
without a complement because of the following contrasts in acceptability:
T h e r e are some people arrogant.
There are some people arrogant to the point of obnoxiousness.
7. Van Oosten (1978) points out that this scale corresponds to a scale in their syntactic
behaviour, such as embeddability. True existential fAere-constructions are freely em-
beddable whereas presentational sentences are not. This fact is explained in terms of
word order markedness - in true existential sentences the word order is unmarked,
whereas presentationals have a marked word order. Clauses with marked word order,
i.e. clauses in which syntactic changes have been made in order to meet the com-
municative requirements are always restricted in embeddability.
8. In fact both sentences:
a. A book is on the table.
b. The book is on the table.
are locative (cf. DuSkova 1971). The difference is in the way the identity of the refer-
ent of the subject NP is established. In (a) it can be established only with reference
to the predicate, whereas in (b) it has been previously established.
9. Cf. Breivik (1975). Kimball (1973), from whom the example 37 is taken, explains
the difference in acceptability between 37 (a) and 37 (b) in terms of inalienable pos-
session: existential sentences with the inalienably possessed NPs do not have non-
existential counterparts.
10. The possibility of replacing a locative fAere-construction with subject - have - ob-
ject structure differs for different types of locative phrases. Also, there are differences
in the behaviour of pronominal copy which in some cases has to be retained, in others
it cannot be retained, and sometimes it is optional (cf. Erdmann 1978), e.g.:
a. There are footsteps on the stairs.
b. The stairs have footsteps (on them).
a. There are many good theatres in London.
b. London has many good theatres.
c. *London has many good theatres in it.
a. There is snow on the ground.
b. *The ground has snow.
c. The ground has snow on it.
168 Maria Grzegorek
11. Breivik (1975) argues that analogy can account for occurrences of there after the
clause initial locative.
12. The term "lexical function" is used here in the sense in which it is used by such
authors as I. A. Mel'iuk and J. D. Apresjan.
13. Polish makes a distinction between assertion of existence and assertion of nonexist-
ence not only in terms of negation but also by means of changing in present tense the
copula bye to the copula miec 'have'.
14. The term is taken from Firbas 1975.
15. Sentences with bye are also acceptable but less natural.
16. Babby uses the notion of lexical function to explain common features of all verbs
which can replace the verb bye in existential sentences, and he opposes this notion
to the concept of a natural semantic class which is of little use here.
17. The second (existential) reading is marginal, and sometimes even impossible. For in-
stance,' Milsark (1974) gives the following example in which existential interpretation
leads to absurd conclusions:
There are peasants constantly being murdered.
The existential interpretation, according to which constantly being murdered is a de-
scription, claims that the peasants mentioned here can die repeatedly.
References
Aissen, J.
1975 "Presentational f/iere-insertion: a cyclic root transformation", CLS 1 1 : 1 - 1 4 .
Allan, K.
1971 "A note on the source of there in existential sentences", FL 7:1 - 1 8 .
Apresjan, J. D.
1980 Semantyka leksykalna (Wroclaw: Zaklad Narodowy im. Ossolinskich).
Babby, L.
1978 "Lexical functions and syntactic constructions: Russian existential sentences",
Papers from the Parasession on Lexicon, edited by D. Farkas, W. M. Jacob-
sen, and K. Todrys (Chicago: CLS), 2 6 - 3 4 .
Breivik, L. E.
1975 "The use and non-use of 'existential there'' in present-day English", Forum
Linguisticum. Contributions to applied linguistics 2 : 5 8 - 1 0 3 .
1978 "Existential sentences revisited", Papers from the 4th Scandinavian Con-
ference of Linguistics, edited by K. Gregersen (Austin: University of Texas),
235-240.
Cole, P. - J. Sadock, eds.
1977 Syntax and semantics 8: Grammatical relations (New York: Academic Press).
DuSkova, L.
1977 "A note on 'there is' in present-day Enghsh", Philologica Pragensia 5 9 : 9 7 - 1 0 5 .
Erdmann, P.
1978 "rfcere-constructions in English and German", IRAL 1 6 : 1 8 7 - 2 1 1 .
Firbas, J.
1975 "On the thematic and the non-thematic section of the sentence", Style and
text: Studies presented to Nils Erik Enkvist, edited by H. Ringbom (Stock-
holm: Sprkforlaget Skriptor AB), 3 1 7 - 3 3 4 .
Gee, James Paul
1978 "Adjective preposing and there-insertion: a point about syntactic rules and
semantic processes", Studies in language 2 : 1 0 3 - 1 1 1 .
i/zere-sentences in English and Polish 169
Green, Georgia M.
1977 "Do inversions in English change grammatical relations?", Studies in the lin-
guistic sciences 7 : 1 5 7 - 1 8 1 .
Halliday, . A. K.
1967 "Notes on transitivity and theme in English", JL 3 : 3 7 - 8 1 .
Jenkins, Lyle
1975 The English existential (= Linguistische Arbeiten 12) (Tbingen: Niemeyer).
Kimball, John P.
1973 "The grammar of existence", PCLS 9 : 2 6 2 - 2 7 0 .
Kirkwood, H. W.
1977 "Discontinuous noun phrases in existential sentences in English and German",
JL 1 3 : 5 3 - 6 6 .
Milsark, G. L.
1974 Existential sentences in English. Repr. by the Indiana University Linguistics
Club in 1976.
1977 "Toward an explanation of certain peculiarities of the existential construc-
tion in English", Linguistic analysis 3:1 - 2 9 .
Oosten, J. van
1978 "Expletive f/iere-sentences", unpublished MS.
Pullum, G.
1977 "Word order universale and grammatical relations", Syntax and semantics 8:
Grammatical relations, edited by P. Cole and J. Sadock (New York: Academic
Press), 2 4 9 - 2 7 7 .
Rando, Emily - Donna Jo Napoli
1978 "Definiteness in r/iere-sentences", Language 5 4 : 3 0 0 - 3 1 3 .
Ross, J.
1974 "There, there, (there, (there, (there . ..)))", CLS 1 0 : 5 6 9 - 5 8 7 .
Searle, John R.
1971 "What is a speech act?", The philosophy of language, edited by J. R. Searle
(London: Oxford University Press), 3 9 - 5 3 .
EDMUND GUSSMANN
selves and assign them to appropriate classes. This is, to all intents and pur-
poses, a marginal improvement over a purely quantitative evaluation for two
major reasons. An inventory of phonetically specified segments, even if ac-
companied by a set of morpheme structure constraints, which we bypass here,
remains nothing more than a list; usually no claims are made as to the internal
structure of the inventory and the segments could be arranged in any way
with no consequences following from a particular arrangement (see the list
of segments in Chomsky Halle 1968:177; for a dissenting voice, see Lass
1976:84). More importantly, however, phonological rules modify the seg-
ments in various ways including the complete elimination of some of them as
well as the introduction of units which do not appear in underlying represen-
tations. In this way the static juxtaposition of phonological segments offers
little more than a glimpse into the phonology of the languages. If our com-
parison were to be restricted to inventories, the result would not only be in-
complete but could also be downright misleading or false. By isolating what
we call shared segments we would be making an implicit and illicit claim that
they are the same (identical) segments. In actual fact, however, a group of
segments may be the same in terms of the features which define them but
they need not be identical in terms of the phonological behaviour. To take a
concrete example both for Polish and for English a group of underlying
dental obstruents /s, z, t, d/ has to be recognized 3 where the segments can
be viewed as the same. In terms of the modifications they undergo in various
phonological contexts, however, they are very different, and thus the initial
comparison of the segments in isolation has to be supplemented by a survey
of the rules that affect them.
( 2 ) [s - 5] expre[s] - expre[5]ion
face facial
[z - ] sei[z]e sei[Z]ure
revise revision
[t - c] ri[t]e ri[c]ual
part departure
[d J] procee[d] proce[j]ure
grade gradual
The rule also affects derived segments, i.e. those that result from the applica-
tion of other (earlier) rules, in particular velar softening, spirantization, voic-
ing and devoicing. These additional processes (specified in parentheses) com-
bine with palatalization to produce the following sets of surface alternations:
The rule of palatalization is ordered relatively late among the rules of English
phonology (Chomsky - Halle 1 9 6 8 : 2 4 4 ) , which agrees both with the exist-
ence of a certain amount of variation in its application (e.g. gradual with or
without a palatal reflex) and with the extension of the rule's applicability
across word-boundaries, particularly in rapid speech (e.g.: the[i]e young,
ni\s] e young etc.). Rules which follow palatalization include specifically
deletion of the glide /j/, i.e. of the segment which is crucially involved in the
triggering off of the rule (Chomsky - Halle 1 9 6 8 : 2 3 1 , rule ( 1 2 2 ) ) , as well as
the reduction of unstressed lax vowels. Although details of the rule could, no
doubt, be improved on, the basic pattern is clear: dental obstruents are turned
Abstract phonology and CA 177
Notes
References
Chomsky, . - M. Halle
1968 The sound pattern of English (New York: Harper and Row).
Dressler, W. U.
1977 Grundfragen der Morphonologie (Wien: sterreichische Akademie der Wis-
senschaft).
Abstract phonology and CA 185
Eliasson, S.
1978 "Theoretical problems in Scandinavian contrastive phonology", The Nordic
languages and modern linguistics 3, edited by J. Weinstock (Austin, Texas:
The University of Texas Press), 217-243.
1982 "Transfer as evidence for phonological solutions", SAP 14. 185-196.
Fischer-J<rgensen, E.
1975 "Perspectives in phonology", Annual report of the Institute of Phonetics,
University of Copenhagen 9:215-236.
Fisiak, J.
1973 "The Polish-English contrastive project", Papers and studies in contrastive
linguistics 1:5-13.
1975 "The contrastive analysis of phonological systems", Kwartalnik Neofilo-
logiczny 23:119-124.
1976 "Generative phonology and contrastive studies", The Canadian journal of
linguistics 21:171-179.
Goyvaerts, D. L.
1978 Aspects of post-SPE phonology (Ghent: E. Story-Scientia).
Gussmann, .
1975 "How do phonological rules compare?", Papers and studies in contrastive
linguistics 3:113 -124.
1978a Explorations in abstract phonology (Lublin: Uniwersytet im. . Curie-
Skiodowskiej).
1978b Contrastive Polish-English consonantal phonology (Warszawa: Panstwowe
Wydawnictwo Naukowe).
1979 "Abstract phonology and psychological reality", Proceedings of the Ninth
International Congress of phonetic sciences, vol. II, edited by E. Fischer-
J^rgensen, J. Rischel and N. Thorsen (Ktfbenhavn: University of Copen-
hagen), 101-107.
to appear "Contrastive analysis, substantive evidence and the abstractness issue",
Theoretical issues in contrastive phonology, edited by S. Eliasson (Heidel-
berg: Julius Groos Verlag).
Hooper, J. B.
1976 >1 introduction to natural generative phonology (New York: Academic
Press).
Kenstowicz, M. - Ch. Kisseberth
1977 Topics in phonological theory (New York: Academic Press).
1979 Generative phonology (New York: Academic Press).
Kiparsky, P.
1968 How abstract is phonology? (Bloomington, Indiana: Indiana University Lin-
guistics Club).
1973 "Abstractness, opacity and global rules", Three dimensions of linguistic
theory, edited by O. Fujimura (Tokyo: TEC), 5 7 - 8 6 .
Krzeszowski, T.
1971 "Equivalence, congruence and deep structure", Papers in contrastive lin-
guistics, edited by G. Nickel (Cambridge: Cambridge University Press), 3 7 - 4 8 .
1974 Contrastive generative grammar. Theoretical foundations (Lodz: Uniwersytet
Lodzki).
Laskowski, R.
1975 Studia nad morfonologiq wspotczesnego fczyka polskiego [Studies in the
morphonology of modern Polish] (Wroclaw: Ossolineum).
Lass, R.
1976 English phonology and phonological theory (Cambridge: Cambridge Uni-
versity Press).
186 Edmund Gussmann
Leben, W.
1977 "On the interpretive function of phonological rules", Phonologica 1976,
edited by W. U. Dressier and . E. Pfeiffer (Innsbruck: Innsbrucker Beitrge
zur Sprachwissenschaft), 2 1 - 2 7 .
Marton, W.
1968 "Equivalence and congruence in transformational contrastive studies",Studio
Anglica Posnaniensia 1 : 5 3 - 6 2 .
Rubach, J.
1977 Changes of consonants in English and Polish. A generative account (Wroclaw:
Ossolineum).
Zonneveld, W.
1978 A formal theory of exceptions in generative phonology (Lisse: The Peter de
Ridder Press).
Zwicky, A.
1975 "The strategy of generative phonology", Phonologica 1972, edited by
W. U. Dressier and F. V. Mares (Mnchen: W. Fink Verlag), 151 - 1 6 8 .
RAYMOND HICKEY
0 Introduction
In contrasting the syntax of two languages which are typologically as far re-
moved from each other as are Irish and English one is presented with an abun-
dance of conflicting structures. The purpose of the study, of which this article
is a preliminary stage, is to classify these differing structures and also to in-
vestigate the question of interference for the non-native user of either language.
Not all structural differences lead to interference forms, however, and in
many of the cases illustrated below adaption to the varying structures occurs
without the hampering effect of transfer of native language structures.
In the case of this particular study the contrast to be made is non-direc-
tional, assessing what transfer forms can be registered in Irish on the part of
native speakers of English while also viewing the attested interference forms
from Irish in Hiberno-English which have lead to its deviation from Standard
English.
The question of the tertium comparationis arises when comparing struc-
tures of both languages which are suspected of being semantically equivalent. 1
Here the decision procedure used avails primarily of the intuition of the in-
vestigator. Thus with a pair of sentences such as the following
Irish and English lie first and foremost in the realm of surface structure it
seems to me to be most fruitful to consider the syntactic realizations with a
view to establishing differences between them. In this study I shall leave aside
the question of positive transfer 4 as it is unproblematical in practice.
Although I have chosen no specific grammatical model for my investigation
it should not be imagined that I will confine myself to a registration of dif-
ferent syntactic constructions in Irish and English. Such diverging syntagmas
are of limited interest when merely given as particular lexical realizations (al-
though this serves the purpose of illustration) but need to be abstracted so
that one can see them in terms of various combinations of syntactic units
which conflict with each other when considered from the standpoint of the
opposing language, but which each serve the purpose of representing a distinct
semantic complex common to both languages.
In contrasting the structures of English and Irish there are some which are
different to those of English and which have within them a unit or units
which form part of a relatively large paradigm. In this paper when dealing
with such structures I will opt on the grounds of concision for illustrating the
particular structure with a single example. A case in point here would be the
following sentence:
(3) Td fdm dul amdrach.
is under-me go tomorrow
intend going tomorrow.'
In each of the many possible cases of such constructions we have the defective
existential verb td followed by a form of one of the many prepositional pro-
nouns and verbal noun. The construction can be abstracted as follows:
V(ftQ + PREPRO + VN
This represents the structure minimally: it may be varied by the paradigmatic
elements of PREPRO and expanded by, for example, the addition of adver-
bials or other complements.
The sections found below each deal with a particular area of the syntax of
Irish and English where divergences are to be encountered. They represent a
selection of such areas only, contain single examples as a rule and do not deal
with detail of lexical realization.
Apart from the most obvious fact of Irish being a VSO language as opposed
to the SVO character of English5 there are many further instances where Irish
Towards a contrastive syntax of Irish and English 189
shows a word order which deviates from English. Variation of word order
within Irish itself is greater that in English due to the increased degree of
morphological marking. This allows fronting when emphasis is required. The
first element of Irish word order to be treated, however, is that which obtains
in neutral statement sentences.
In the eventuality of further complements these are then added at the end:
(4) a. Chonaic mi Sedn ag an stdisidn .
saw I John at the station yesterday
saw John at the station yesterday
Should it be required to emphasize, say, the direct object then this can be
done as follows:
The complex direct object can always be placed at the end of the sentence
thus giving a structure which in contrast to (5a) is allowed in English:
Common to both Irish and English is the restriction that only a complex
direct object, that is one consisting of a noun with a correlating relative clause,
can be removed from the sentence framework. Again in both languages the
space left by the extracted object must be filled by a personal pronoun. Thus
is unacceptable.
Irish goes a step further in allowing fronting of elements qualified by the
subject. A case in point is afforded by the extraction of a genitive attribute
from its normal position after its determiner:
Fronting of this kind is not possible in English as the element in the genitive
has a correlating relative clause and so cannot precede its qualifier:
(6) b. * The woman who spoke to you's son is dead.8
Towards a contrastive syntax of Irish and English 191
When it is a case of the subject itself being fronted then the pattern given
above whereby the place of a functional element in the sentence framework
must be occupied can also be used:
(7) An bheirt bhan atd anseo anois bfonn siad anseo gach bliaiti.
the two woman who-are here now are they here every year
'The two women who are here now are here every year.'
Were it not for siad in (7) it would have the same word order as its English
semantic equivalent; its presence,however, shows that the essential VSO order
of Irish is maintained and furthermore that (7) is a case of rhematic fronting.
The constructions to be considered under this rubric are those which are only
ambiguous from the point of view of one of the languages being considered,
so that here we have more than one equivalent each of which is semantically
different. In almost all cases of equivalence we can offer more than one con-
struction on either side which reflects accurately the meaning of the original
construction. This depends on our definition of equivalence. The cases here
are such that a particular distinction is not made which is necessary for a single
interpretation of the construction involved. Consider the following:
The ambiguity of both (8) and (9), however, is due to a constraint in the mor-
phonemics of Irish which does not allow the mutation of liquids. Should the
elements after ar and gur in (8) and (9) respectively be either vocalic or a
lenitable consonant then the ambiguity is resolved, as a distinction is then
made between past and present. Consider
(8) c. An buachaill arb uaidh [aeisb wan] an rothar
the boy which-is from-him the bicycle
'The boy who wants the bicycle'
(8) d. An buachaill arbh uaidh [aeaav wsei] an rothar
the boy which-was from-him the bicycle
'The boy who wanted the bicycle'
(9) C. Ddirt gur mr [GAJ mo: J] an trua 4.
said he that-is great the pity it
'He said that it is a great pity'
(9) d. Ddirt sdgur mhr [gaj w o n ] an trua 4.
said he that-was great the pity it
'He said that it was a great pity.'
While (9d) is no longer ambiguous with regard to the present indicative it
nonetheless remains so with regard to mood so that it can also be interpreted
as 'He said it would be a great pity.'
There are further instances of unilateral syntactical ambiguity which are
not so easily resolvable. Such an instance is provided by the lack of a pluper-
fect with the verb td in Irish which is noticeable when translating from English.
Thus the following Irish sentence has two interpretations:
(10) Nt raibh ann ach an gaoth.
(10) a. It was only the wind.
(10) b. It had only been the wind.
It is nonetheless possible in Irish to express the pluperfect with other verbs
which then have a finite form of the verb td plus past participle:
(10) c. Bhisd imithe nuair a thdinig a bhean.
was he gone when came his woman
'He had gone when his wife came.'
Quite distinct from tense uncertainty (that is in a contextfree situation) is the
ambiguity with regard to agent which also occurs in relative clauses in Irish.
Here the relative clause connector a does not indicate sentence function
among the units in both main and relative clauses. This is also the case in
English (if we choose not to make the distinction between who and whom)
Towards a contrastive syntax of Irish and English 193
but it is not critical as the word order in the relative clause indicates the func-
tion of the sentence elements. Compare
(11) Sin an fear a mhol an sagart.
that it the man who praised the priest
with its two possible equivalents
(11) a. This is the man who praised the priest.
(11) b. This is the man who the priest praised,9
The word order of the relative clause in Irish is fixed though there is the pos-
sibility of disambiguizing (11) by substituting ar for a thus obtaining the
meaning of ( l i b ) . Should (11) be negated then it is no longer possible to dis-
ambiguize it and so (1 lc) has the negated meanings of both (1 la) and (1 lb):
(11) c. Sin an fear ndr mhol an sagart.
that is the man who-not praised the priest
By alternative is meant here that two or more syntagmas can express the same
semantic content and that both do not stand in the relationship of paraphrase
to one another. Such syntagmas exist in both English and Irish for which the
following may serve as examples:
(12) An obair atd sibh a dhSanamh
the work which-is you do 1 0
(12) a. An obair atd dh6anamh agaibh
the work which-is do at-you
(12) b. The work which you (pi.) are doing
Strictly speaking we have here a case of bilateral alternative syntagmas as
(12b) also exists as a sentence without which. However, I choose here not to
regard those syntagmas as true alternatives of which one is merely a reduced
version of the other. Neither do I regard as such those sentences which con-
tain the same elements but in positional variations.
As alternative syntagmas in English we can offer the two translations of
the following Irish sentence:
(13) An obair atthar a dhianamh
the work which-0-are 11 do
(13) a. The work which is being done
(13) b. The work which they are doing
194 Raymond Hickey
The above sentences lead to another area of syntax where Irish and English
usage diverge. The English passive has no formal equivalent in Irish. I stress
formal as there are a variety of means by which one can convey the notion of
passivity in Irish. These involve constructions which are peculiar to Irish. Thus
if one begins with a sentence such as
(14) It is said that there is only a poor chance of it.
(where the passive can also be replaced by a finite verb with indefinite agent
They say that .. .) one can most appropriately render this in Irish by using
the autonomous form (the term used traditionally in Irish grammars). This is
a finite verb form which is not subject-marked. Thus the translation of (14) is:
It is possible to express the agent while using the autonomous form. It then
appears in the form of a prepositional pronoun as in the sentence:
5 Concatenative verbs
There are very definite differences in the area of verbal concatenation be-
tween Irish and English which arise out of the dissimilarity of non-finite verb
forms in both languages. In English a concatenated verb can assume one of
three forms:
Infinitive with to : INF+
Infinitive without to : INF
Present participle : PART
as in the sentences
(19) He was told to leave.
(20) She must stay.
(21) They considered emigrating.
In Irish the non-finite forms in (19) (21) are all rendered by the verbal
noun, a verb form which is marked for neither person or number and is used,
among other situations, in that of verb concatenation:
Although Irish has only one non-finite verb form in such constructions it has
three possible syntagmas with this depending on whether there is (i) single
196 Raymond Hickey
In those cases where a noun is used instead of the present participle in English
the bare verbal noun is used again, this being the equivalent of either a gerund
or as in (25) of a deverbative nominalization:
(24) We heard them talking.
(24) a. Chualamar iad ag caint.
heard-we them talking
(25) We heard the talk of the girls.
(25) a. Chualamar caint na gcailtni.
heard-we talk of the girls
6 Prepositional usage
Among the structural differences between Irish and English which lead to in-
terference that of prepositional usage occupies a prominent place. In most
cases we have prepositional verbs of English with literal translations of the
English prepositions into Irish:
(26) The time is up.
(26) a. * an t-am suas.
is the time up
(26) b. an t-am istigh.
is the time in
But in the case of (26 b) we have a figurative use of istigh which does not exist
in English, the equivalent here being up. The overgeneralization of the range
of up thus gives rise to (26a). The use of istigh is not confined to this particu-
lar phrase but can be used productively to express the notion of completion
or pastness, for example in
7 Polyfunctional morphemes
Among the elements of Irish which present difficulty to the learner are those
which can assume more than one function. This is particularly the case when
Towards a contrastive syntax of Irish and English 199
The syntactic correlation of the non-specific clause connector and with the
non-finite verb forms which correspond to the verbal nouns of the Irish sen-
tences is quite accurate. Here the causal relation implied in (39) is left unspe-
cified in (39 a) thus neglecting the temporal-causal connectors while/when
which could be used (in (40a) in a purely temporal function).
I would like to make a further subdivision in (iii). The cases which I have
just discussed are those of specific neglective interference. General neglective
interference is a term which could be applied to the non-use of syntactical
possibilities present in L2 though not in L t . This gives us a designation for the
phenomenon whereby a limited number of the syntactic possibilities of L 2
are used. I would assert that neglective and compensatory interference go
hand in hand for it is precisely the ignorance of, or unfamiliarity with a large
range of syntagmas which leads to one falling back on those of and trans-
ferring them into L 2 where a certain proportion of them will of necessity not
be permissible.
Notes
10. It is difficult to gloss a and in English. For our purposes here we can say that a is
the form used before a verbal noun when a pronominal subject is present with its
connected finite verb, while d is that used when a prepositional pronoun subject is
present. This at least gives the distribution conditions for the two particles in these
syntagmas.
11. The zero symbol here refers to the fact that this relative, finite verb form is not sub-
ject-marked.
12. The copula is is formally a separate verb from the existential verb td. Its syntagmas
differ considerably as well.
13. This sentence is strictly speaking
Smaoinigh siad ar dul thar saile.
but in spoken Irish ar is dropped.
14. The abbreviations L, 0 and stand for (i) lenition (fricativization of the following
consonant), (ii) placing of h before vowels (but no further changes) and (iii) placing
of a homoorganic nasal or equivalent voiced consonant before following consonant.
In each case the added consonant is the only one pronounced, so that, for example,
a dteach, 'their house' [3<Jae:x].
15. That is those elements which perform similar functions. Thus we do not have con-
fusion of Irish a with English a.
16. This is by no means intended to be an original treatment of interference typology
but merely a set of remarks on the subject which are particularly relevant to the at-
tested interference forms, especially those present in Hiberno-English.
References
Ahlqvist, A.
1976 "On the position of pronouns in Irish", iigse 1 6 : 1 7 1 - 1 7 6 .
Carroll, J. B.
1968 "Contrastive linguistics and interference theory", Report of the 19th. annual
round table meeting on linguistics and language studies, edited by J. E. Alatis
(Washington, D.C.: Georgetown University Press), 1 1 3 - 1 2 2 .
Krzeszowski, T. P.
1971 "Equivalence, congruence and deep structure", Papers in contrastive lingu-
istics, edited by G. Nickel (Cambridge: Cambridge University Press), 3 7 - 4 8 .
McCawley, J. D.
1970 "English as a VSO language", Language 4 6 : 2 8 6 - 2 9 9 .
MacMathuna, L.
1980 "I dtreo anailis chodarsnach Gaeilge-Bearla", Teagasc na Gaeilge 1 : 7 2 - 9 1 .
Wigger, A.
1972 "Grammatik und Sprachverwendung in der Satzordnung des Neuirischen",
Festschrift fr Wilhelm Giese, edited by . Haarmann and . Studemund
(Hamburg: Buske), 251 - 2 8 2 .
GEORGE . HORN
1 The proposal
The rules of the syntactic component produce surface structures. The surface
structure level is the only level relevant for interpretation.
The lexical entries of verbs and verblike constituents contain, among other
information, functional structures, in which NP terms are identified according
to their grammatical relations and are assigned semantic relations. A typical
entry is that for the verb see, shown below:
(2) SEE
+V
(3) a. S[(NW]
b. V p[ V(g) ]
Grammatical relations are defined on syntactic surface structures. In languages
like English, they are defined configurationally, in terms of tree structures, as
shown above. In highly inflected languages like Polish, they are defined mor-
phologically, in terms of casemarking. The particular definitions of gram-
matical relations can vary from language to language. The function of gram-
matical relations is to identify NPs in surface structures in order that their in-
dices can be inserted into the correct positions in functional structures. All
languages must have some mechanism for accomplishing this task, and there
need not be (and probably is not) a universal set of grammatical relations.
Returning to the functional structure of see, the terms and represent
semantic relations. Semantic relations are distinct from grammatical relations,
and are assigned by verbs and verblike constituents to the NP terms that appear
in their functional structures. There is not a one to one correlation between
semantic relations and grammatical relations, and the particular semantic rela-
tion that a particular syntactically or morphologically defined term bears de-
pends upon the individual verb. It therefore must be listed in the lexical entry
as shown in (2).
The organizational innovation in the model is the functional component.
For any given syntactic surface structure, a (complex) functional structure is
produced by combining the functional structures of the verbs and verblike
constituents that appear in that syntactic structure. Various operations on
functional structures apply to produce the (complex) functional structure in
its final form. Complex functional structures must conform to two well-
formedness conditions, only one of which is relevant to the present discussion.
It is formulated as follows:
formed functional structure can be derived from a given surface structure, the
sentence that that surface structure underlies is ungrammatical.
Two sample derivations, one from English and one from Polish, are shown
below. Consider the following examples:
(5) a. John saw Bill
b. Jan widziat Marka
John (nom) saw Mark (acc)
(6) ^ S
NP
The indices of John and Bill, the NP terms in the syntactic structure, are in-
serted into the proper positions in (7) to produce the following:
At this point, semantic relations are assigned to the indexed NP terms to pro-
duce (9):
The structure in (9) is the functional structure of (5 a), in its final form, and
as such, it provides the basis for the interpretation of this example.
Now consider the Polish example in (5 b). Its surface structure is shown
below:
(10) /S^
NP
1
V nom Vr NP
1
V acc
(I have not included a VP node in this structure. Whether or not Polish sur-
face structures contain VP nodes is not relevant to the present discussion.)
In (10), Jan is identified as NP n o m and can be indexed as i, and Marka is
identified as NP acc and can be indexed as/. The lexical entry for widziec 'see'
is shown in (11):
(11) WIDZIEC
+v
WIDZIEC NP n NP a
(Note that widziec and see have essentially the same type of functional struc-
ture. The difference between the two languages at this level lies in the way
that grammatical relations are defined in each, as mentioned above. As we
will see below, there are important consequences of this distinction.)
The functional structure of example (5 b) is derived from the functional
structure of widziec, beginning with the structure shown in (12):
(12) WIDZIEC N P n o m NP a c c
The indices of Jan and Marka are inserted into the proper positions in (12) to
produce (13), below:
about WHAT
The [e]-anaphora rule applies to bind the empty node in each structure to the
constituent in COMP. This rule is formulated as shown in (17):
(17) [e]-anaphora
w j i ] w 2 [e]
1 2 => 1 w 2 [Xi]
The only condition that must be placed on this rule is that Wt and W2 must
be of the same category.
In the structures shown in (16), assume that the terms are indexed as fol-
lows: in a, WHO is indexed as i, John (NP,) is indexed as /'; in b, WHO is in-
dexed as /', John is indexed as / , and Bill (NPX of see) is indexed as k \ in c,
WHAT is indexed as i, about WHAT is indexed a s / , and John is indexed as k.
CA evidence for constraints on transformations 211
Thus, the functional structures of the examples in (15) are derived from the
following:
(20) a. (NP[WH]) [SEENPi NP 2 ]
b. ( n p [WH]) [BELIEVE NP t [SEE NPJ NP 2 ]]
c. (p p [. .WH..]) [READ NPj PP]
212 George . Horn
(In c, y represents the semantic relation that the preposition about assigns to
its object.)
These structures are well-formed since all of the NP terms are indexed and
assigned semantic relations. The NP constituents prefixed to structures a and b
are assigned semantic relations by virtue of the fact that they bind NP terms
in argument position which themselves bear semantic relations. Pairs of the
form NPj/xj are, in fact, functional level units. In structure c, the PP about
what, /, binds a term in argument position as required by the WFC.2
There is no need to constrain whconstituent insertion into COMP. To see
this, first consider the examples in (22):
(22) a. *Who did John see Bill
b. *Who did John believe that Bill saw Fred
c. *About what did John read about politics
Examples like (22) have syntactic structures like the following which contain
no empty node:
(23) a.
COMP
WHO
CA evidence for constraints on transformations 213
syntactic structures shown in (23), and this accounts for the ungrammaticality
of the examples in (22). There is no need for a condition on lexical insertion
to guarantee, for example, that there must be an empty node in any structure
that contains a -form in COMP.
Examples like the following are also ungrammatical:
(25) a. *Which city did John go
b. *Which closet did John put the clothes
c. * About what did John see
These examples have surface structures like the following:
(26) a. ^S
about what Jo
Each of these structures contains a constituent in COMP that does not occur
in the VP with the relevant verb. In these structures, suppose that the various
CA evidence for constraints on transformations 215
terms are indexed as follows: in a, which city is indexed as i and John is in-
dexed as /; in b, which closet is indexed as i, John is indexed as /, and the
clothes is indexed as k\ and in c, what is indexed as /', about what is indexed
as /, and John is indexed as k. The [e]-anaphora rule cannot apply in any of
these structures. In structure a, there is no empty node, and in structures b
and c, the empty nodes are not of the same syntactic category as the con-
stituents in COMP.
The functional structures of these examples must be derived from the fol-
lowing:
(27) a. (Wh-NP) [GO NP t ]
b. (Wh-NP) [PUT NP t NP2 PP]
c. (PP) [SEE NPX NP 2 ]
Index insertion and assignment of semantic relations apply to produce the fol-
lowing from (27):
(28) a. ( N P . [ w h , . . ] ) [ G O N P l [ j ] ]
b. ( N P .[wh-.. .]) [ P U T N P l [ j ] N p 2 [ k ] ]
c. (ppjfABOUT Np[whatj]]) [SEE N P j [k]]
7
None of the functional structures in (28) are well-formed. The ones in a and b
contain indexed NP terms that are not assigned a semantic relation, and the
one in c contains a prefixed PP term that does not bind a term in argument
position. Thus none of the three structures satisfies WFC I. 3 So we see that
there is no need of a condition on the insertion of /-words, or phrases con-
taining them, into COMP position to guarantee, for example, that these con-
stituents can only be inserted into the COMP of a structure that contains a
verb with the proper subcategorization features. In fact, no conditions need
be placed on either the rule that inserts whwords (and constituents contain-
ing them) into COMP or on the optionality of lexical insertion.
Furthermore, the [e]-anaphora rule itself is optional and unbounded, and
no command or precedence conditions need be placed on it. If this rule had
not applied to any of the syntactic structures in (16), for example, the func-
tional structures derived from these (on the basis of the structures in (20))
would be the following:
29. a. ( N p[wh 0 i ]) [SEE N P l ] ]
a
b. ( N p[wh 0 i ]) [BELIEVE N P l [ j ] [SEE N P l [ k ] ] ]
a a
c. ( p p [ABOUT Np[whati]]) [ R E A D N P l [ k ] ]
' y oc
216 George . Horn
These structures are not well-formed. All three contain a prefixed term that
does not bind a term in argument position, and a and b contain an indexed
NP term that is not assigned a semantic relation. These structures thus do not
form the basis of a possible interpretation of the examples in (15). However,
this presents no problem for the analysis since the well-formed functional
structures in (21), above, can be derived for these examples. The rule, then,
need not be obligatory.
It is easy to see that the [e]-anaphora rule need not be bounded. The only
information required for interpretation is the surface structure position of the
wA-word (in COMP) and the position of the single empty node which is bound
to it. Consider the following example:
(30) Who did John believe that Bill thought that Sally told Frank
that Fred hit
The following functional structure can be derived for this example:
(31) (Npfwho,]) [BELIEVE NPl[j] [THINK Npjk]
It is well-formed and provides the basis for the proper interpretation of ex-
ample (30).
Turning to the third point, consider the following structures, in which a
-word has been inserted into the COMP of an embedded S.
NP VP
V NP S
COMP s
John told [e] who Frank hit Bill
CA evidence for constraints on transformations 217
COMP S
S V NP
COMP S
In structure a, the wh-word neither commands nor precedes the empty node,
and in structure b, it does not command the empty node. Now, let us suppose
that the [e]-anaphora rule, in the absence of any further conditions, can apply
to bind the empty node to the w/i-word in both cases, ignoring for the mo-
ment the fact that in (17), above, it is formulated as a unidirectional rule in
which NPj must precede [e], The functional structures derived from these
syntactic structures will then be the following:
(34) a. T E L L N p [ i ] N P [Xj] [( N P [who,]) [HIT N P l [ k ] N p 2 [m]]]
a a
i = John; k = Frank; m = Bill
b. ANNOY NpJCNptwhOi])
[KISS N P [j] N p 2 [k]]] Np 2 [ X i ]
i = John; k = Mary
The [e]-anaphora rule is the only major operation that applies to syntactic
structures, and because it is unbounded and unfettered by command/preced-
ence conditions, there is no need for a syntactic cycle. (On this point, see
Horn 1979:125-144 and Horn 1983, sections 2.1 and 2.2.).
Because the [e]-anaphora rule in the proposed framework is the analog of
w/z-movement in earlier analyses, we can conclude that 'w/i-movement' has
the properties noted at the beginning of this section: it is an unbounded, non-
cyclic process.
The [e]-anaphora analysis extends, with no major modifications, to data
involving topicalization and AP deletion, to provide account of all unbounded
syntactic processes. This is discussed in Horn 1983, section 2.2, and here I will
only offer three examples as illustrations. These are shown in (36):
NP VP
V NP ADV
The [e]-anaphora rule applies to all three structures to bind the empty node
to John. Suppose that in structure a, John is indexed as /', we is indexed as / ;
in structure b, John is indexed as i, we is indexed as / ; and in structure c, John
is indexed as /'. The complex functional structures derived from these syntac-
tic structures are shown in (38):
The following constraints on the [e]-anaphora rule will account for the un-
grammaticality of the examples in (39):
(40) a. The Noun Phrase Constraint: The [e]-anaphora rule can-
not apply if either term is properly contained in an NP.
The Noun Phrase Constraint (NPC) was first proposed by Horn 1974:1143.
It is discussed in Horn 1975:348, Bach - Horn 1976:279-284,Horn 1979:
9 5 - 1 0 1 , and Horn 1983, section 3.1. The Single Gap Constraint (SGC) has
appeared in various formulations, and the NP/Gap Order Condition (NPGOC)
is discussed by Bach, who attributes it to Arlene Berman, and by Chomsky,
who credits it to Bordelois. The formulation of the NPGOC in (40c) differs
slightly from Bach's formulation.
I will not repeat the arguments justifying this set of constraints for English
and supporting it over other possible sets. Rather, I will demonstrate how the
constraints predict the ungrammatically of examples like (39).
Consider examples (3 9 a - f ) . The surface structures of these all contain empty
nodes which are properly contained in an NP. The structure of (39 d) is shown
below:
(41)
COMP
Suppose that who is indexed as i, John as /, and a book about [e] as k. The
NPC prevents the [e]-anaphora rule from applying to this structure.
The following functional structure is derived from (41), by the application
of index insertion and semantic relations assignment:
(43) ^
COMP S
V S
The SGC blocks the application of the [e]-anaphora rule to this structure. As-
sume that the leftmost who is indexed as i, you is indexed as /, and the right-
most who is indexed as k. The following functional structure is derived from
this surface structure:
Such examples contrast with examples like the following, which are structural-
ly identical and differ only in the surface order of the NPs that occur in them:
(45) Which violin are the sonatas easy to play on
The surface structures of these examples are shown in (46 a) and (b) respec-
tively:
(46) a.
b. X
COMP
NP NP VP
In these structures, there is a clause S which contains two gaps. However, only
one of the antecedent NPs (which sonatas in structure a and which violin in
structure b) is located outside of that S. Therefore the [e]-anaphora rule can
apply in both structures to bind the empty nodes to the NPs which sonatas
and violin in structure a and which violin and sonatas in structure b. In ac-
cordance with the NPGOC, the leftmost [e] in structure a is bound to the
rightmost NP, violin, and the leftmost [e] in structure b is bound to the right-
CA evidence for constraints on transformations 223
most NP, sonatas. Then the rightmost [e] in both structures is bound to the
leftmost NP.
Assuming that the NPs are indexed as shown, the application of the [e]-
anaphora rule to the structures in (46) produces (47):
The following functional structures are derived from these surface structures:
b. ( N p[i])[[BE[EASY[PLAYNP, 2[;][
Np[ x i]]]]] NP[ Lj]]
7
These structures are well-formed and thus provide the basis for interpretations
of examples (39j) and (45) respectively. However, the interpretation based on
the structure in (48a) is nonsensical. That violin is interpreted as the object of
play and which sonatas is interpreted as the object of the preposition on. Ex-
224 George . Horn
ample (39j) is unacceptable for the same reason that the following examples
are unacceptable:
(49) a. *That violin is easy to play on the sonatas
b. *It is easy to play that violin on the sonatas
In contrast, the interpretation based on the functional structure in (48b), for
example (45), is acceptable. Which violin is interpreted as the object of the
preposition on and the sonatas is interpreted as the object of the verb play.
This interpretation is analogous to the interpretation of the following, accept-
able, examples:
The NPC, SGC, and NPGOC suffice to constrain the [e]-anaphora rule and
provide an account of island phenomena within the context of the proposed
framework. No notion of cyclicity is required to account for these facts, and
the cycle can thus be eliminated in this framework.
The true test of any analysis, of course, lies in how easily it can be extend-
ed to account for additional relevant data. The obvious question to ask at this
point is whether the proposal outlined in this section will account for island
phenomena in other languages. In the next section, I will look at an analogous
range of data in Polish, which superficially pattern quite differently from the
English data, and I will show that the same generalizations apply to both
languages.
2 Extension to Polish
(52) a.
COMP
NP
1>r
acc NPnom
1,1
COMP
PPj NP,
b. (PPJONP^JIC]]) [P/^CNpnom[j]pp[Xi]]
7 Of
k = kim
These functional structures are well-formed and thus provide the basis for the
interpretations of these examples.
As in English, if the [e]-anaphora rule is blocked for some reason, or does
not apply, to a given syntactic structure like the above, then the functional
structure ultimately derived from it will not be well-formed. To see this, con-
sider the following:
(57) a.
COMP s
NPacc NP,nom
b. S
COMP s
PP NP,nom V NP
c. S.
NP
(58) a. ( N P [ + W H ] ) [WIDZIEC N P N O M N P A C C ]
b. (pp[. ,+WH. .]) [WIDZIEC HPnom NP a c c ]
c. ( N P [ + W H ] ) [WIDZIECHPnomnom N P A C C ]
( 5 9 ) a. ( N p a c c [ i ] ) [ W I D Z I E C N P n o m ] NP a c c [k]]
b. r D D.ro WD m n \wiDziEc mo [k]]
(62)
COMP
NP
I
1- acg
(I assume that the pronominal subject ty 'you' is present and is later deleted.)
Suppose that jaki numer, NP acc in COMP, is indexed as i. The [e]-anaphora
rule can apply as above to bind the empty node to this constituent. The fol-
lowing well-formed functional structure can be derived for example (60a):
In the same way, functional structures can be derived for the other examples
in (60).
The examples in (61) are also grammatical, and are variants of (60). Their
surface structures, however, contain a constituent in COMP that must be re-
lated to an empty node which is properly contained in an NP elsewhere in the
structure. The syntactic structure of (61a), shown below, illustrates this:
(64)
COMP
I
ADJ
If the adjectival form jaki in COMP does not bind the empty node in
structure (65), the functional structure derived from it will not be well-formed.
Such a functional structure is shown in (66):
(66) ( Aacc [i]) [WYKR^CIC N p n o m [j] NPacc[k]]
i = jaki', j = ty \ k = [e] numer
In (66), the prefixed term A [i] does not bind a term in argument position,
and this structure does not meet the requirements of the Well-formedness
Condition as formulated in (4).
We might conclude from these data that the NPC is not a universal con-
straint, and does not apply to block the application of the [e]-anaphora rule
in Polish. If this were the case, it would suggest that the explanatory power of
the proposed set of constraints is not great.
It is not true, however, that the [e]-anaphora rule can apply freely into
NPs in Polish structures. The following examples, like their English counter-
parts, are ungrammatical:6
(67) a. *0 kim on zniszczyt ksiqzk
About whom (he) destroyed book (acc)
'About whom did he destroy a book'
b. *0 kim Jan podari artykul
About whom John tore up article (acc)
'About whom did John tear up an article'
c. *0 czym Jan przegapil film
About what John missed film (acc)
'About what did John miss a film'
d. *Z jakim dzieckiem Jan spotkal kobietf
With which child John met woman (acc)
'With which child did John meet the woman'
e. *Z jakiego miasta lubil ludzi
From which town (he) liked people (acc)
'From which town did he like the people'
f. *W ktorym pokoju podobaly mu si meble
In which room pleased him (dat) REFLEX furniture
(nom)
'In which room did he like the furniture'
The corresponding statements, in which the PPs are located in the NP with
the head noun, are, of course, grammatical:
CA evidence for constraints on transformations 231
In such structures, the NPC correctly blocks the [e]-anaphora rule, preventing
it from binding the PP [e] term to the PP in COMP, and the functional struc-
tures that can be derived from these surface structures, shown below, are not
well-formed because they contain a prefixed term that does not bind a term
in argument position:
(70) a. ( P P [ i ] ) [ZNISZCZYC N p n o m [j] N P a c c [ k ] ]
i = kim; j = Jan\ k = ksiqzk$ [e]
232 George . Horn
PP: NP,nom V PP
(73) ( P P i [ 0 N P [ j ] ] ) [ ^ / M d N p ^ f k J p p I X i ] ]
y a
Consequently, examples like (71) are grammatical.
Examples like the following, which contain an NP in the highest COMP,
show that it is not possible for the [e]-anaphora rule to apply to bind an empty
node within a (complex) NP to a term outside of that NP:
(74) *Jaki plaszcz weszia jakas dziewczyna ktora miala na sobie
Which coat came in a girl who wore
'Which coat did a girl who wore come in'
CA evidence for constraints on transformations 233
The PP in (77) is identical in form to the PP in (78). Similarly, NPs are not
marked according to whether they occur in a relative clause, or in some em-
bedded S that is not an NP complement. The above data can be accounted for
by assuming that the NPC is valid in Polish, but can be overridden just in case
there are morphological markers present in the surface structure that can
function to associate a constituent in COMP with an empty node elsewhere in
the structure.
In (61a), for example, the form of jaki indicates that it is the accusative
singular masculine form, and it can thus be associated with numer, which is
an accusative, singular, masculine noun. In (61b), the form of ktorych indi-
cates that it is the accusative plural masculine animate form, and it can thus
be associated with studentow, which is a masculine animate noun marked for
accusative case (plural).
234 George . Horn
In contrast, the PP kirn in (67 a) is not marked to agree with the noun
ksiqzk, and no morphological marker is present in the surface structure that
can serve to associate the two constituents. The situation is the same for the
other examples in (67).
This proposal is further supported by the following data. Consider example
(79), below:
COMP S
NP,gen
COMP S
rule from binding the empty node in the NP to the adverb bardzo in COMP.
These contrast with the acceptable c and d examples, to whose surface struc-
tures the [e]-anaphora rule can apply to bind the empty node to the adverb
bardzo:
(88) a.
COMP
Adv NP n o m V
b. S.
COMP
Adv NP n o m V
c. S.
d. s
COMP S
0. ( A dv[i]) [^WAdv[Xi]k]] N P n m]
d. (AdvU)
VAOVL'J/ [CZUC
N P n o m [ uj
i>ir j ] Advp[Adv[Xi]
rtu k] ]
nom
i = bardzo', j =on\ k = niepewnie
(96) a. s[c[kogokto]s[[e]powiedziats[zes[ranekpobit[(;]]]]]
b s[dkogo}s[Janekwies[c[kto]s[[Q]spotkat [e]]]]]
( 9 7 ) a. s [ c [ X Y ] s [ - -[e]...[e]...]]
c. s [ c [ X Y ] s [ - s[COMPs[...[e]...[e]...]]]]
d. s[c[X]s[.-.s[c[Y]s[-..[e]...[e],..]]]]
stituents such as head nouns and their modifiers, as we have seen, are con-
figurationally indicated in English and morphologically indicated in Polish
(while configuration plays only a marginal role.). As a consequence of this
distinction, surface word order in Polish is relatively freer than it is in English.
The overall mechanics of w/2-movement is the same in both English and
Polish. In both languages, 'w/i'-words (and constituents containing them) oc-
cur in COMP (in S-initial position), and the [e]-anaphora rule operates to bind
empty nodes to these ''-words (and constituents). In both languages, this
rule is unbounded and applies non-cyclically, and it is constrained by the
same island conditions: the Noun Phrase Constraint (NPC) and the Single Gap
Constraint (SGC). 8
The quite different pattern of grammaticality observed in the Polish data is
another consequence of the 'morphological/configurational' distinction be-
tween the two languages. In those situations in which constituent associations
are not indicated by a surface structure morpheme, the constraints cannot be
violated. Thus the NPC prevents PPs in COMP, which are not marked in a spe-
cial way when they are related to the head nouns of NPs, from binding empty
PP nodes located in NPs, as illustrated by the ungrammaticality of examples
like (67). In complex sentences like (93 d), (94d) and (95), casemarking is not
sufficient to associate the constituents in COMP with empty nodes elsewhere
in the structure, so the SGC prevents the empty nodes from being bound to
the constituents in COMP. Examples like these are ungrammatical in both
English and Polish. However, because English lacks casemarking, the condi-
tions necessary for overriding the NPC and SGC (to relate adjectives in COMP
to head nouns, for example) never occur.
Another way in which Polish and English differ is that in Polish surface
structures more than one constituent can occur in COMP, while this is not
possible in English surface structures. These surface differences tend to obscure
underlying, and more fundamental, similarities between the two languages.
In the proposed framework, the effect of casemarking on the applicability
of the [e]-anaphora rule is to be expected. The [e]-anaphora rule functions to
relate a constituent A to some constituent in constructions of the form
shown in (100):
Ii.:",: ':::!!
The constituent A can be any of several constituent types, as we have seen,
such as NP, PP, Adj, or Adv, and can be a verb (PRED) in structures like
(101 a) and (b) below, or the head noun of an NP in structures like (101 c), for
example:
242 George . Horn
3 Chomsky's analysis
I do not intend to argue against Chomsky's framework in this paper, and will
not discuss it in detail. For a more extensive treatment of it as it applies to
English, see the Chomsky papers listed in the bibliography. Rather, I wish to
point out that, in its present version, his analysis is unable to account for the
Polish data discussed in the preceding section.
Chomsky attempts to account for island phenomena by introducing the
notion of 'bounding' node and formulating the Subjacency Condition in terms
of bounding nodes as shown below:
(102) The Subjacency Condition: No transformation can apply to
move a constituent over more than one bounding node.
CA evidence for constraints on transformations 243
(His other constraints, the Nominative Island Constraint and the Opacity Con-
straint, also play a role in accounting for island phenomena, but I will not dis-
cuss them here.)
His rule of w/i-movement applies to move a wh-word, or constituent con-
taining a w/i-word from its original position in the structure to the COMP of
the S in which it is located, and then from COMP to COMP until it arrives at
its surface position (usually in the COMP of the highest S in the structure).
Because of the nature of the contraints, w/z-movement must apply cyclically.
In surface structures, traces, or empty nodes from which constituents have
been moved, must be bound by these constituents. An additional condition
on binding is that the antecedent constituent must c-command its trace.
The following example illustrates the mechanics of w/j-movement in this
framework:
However, if this is done, then the Subjacency Condition will not prevent ex-
traction from the structure shown in (109) to block the derivation of the un-
grammatical example in (110):
Here, I assume that kto is first moved to COMP, and that komu is subsequent-
ly moved and adjoined to the COMP node containing kto. In this structure, as
in analogous structures for English examples, kto does not c-command its
trace in S. Thus, in this alternative, the c-command requirement cannot be
maintained for Polish.
A final alternative is the one briefly mentioned in section 2, in which only
the first 'w/i'-word is located in COMP and the remaining ones are merely
scrambled (obligatorily) to the initial position of the clause, S. Under this al-
ternative, the c-command condition can be retained.
None of these alternatives is desirable. Under the first and third, a universal
rule of w/?-movement cannot be maintained with the concomitant loss of any
explanation for the observed similarities between Polish and English (and many
other languages). Under the second alternative, the universality of the c-com-
mand condition on anaphoric binding cannot be maintained, and yet this is
precisely the sort of condition that, if required by one language, would be ex-
pected to be universally required.
Furthermore, there is no way in Chomsky's analysis to distinguish ungram-
matical examples like (93 d), (94d), and (95) from grammatical examples like
(93 b and c) and (94b and c). Examples (93d), (94d), and (95) are repeated
below, along with their syntactic structures:
(93) d. *Jaki prezent komu Bill chciat zeby dano
s tc \jaki prezent komu] s[Bill chciat [c[zebv] s[t dano t]]]]
To see this more clearly, suppose first that either S or S is a bounding node,
but not both. In (93 d), on the lower cycle, both jaki prezent and komu can
be moved to the rightmost COMP, and then on the highest cycle, both can be
moved to the leftmost COMP. The same movement can apply to the structure
of (94d). In the structure of (95a), on the first cycle, kogo can be moved to
the rightmost COMP. On the next cycle, both kogo and kto can be moved to
the leftmost COMP. In the structure of (95b), kto and kogo can be moved to
the rightmost COMP on the first cycle, and on the next cycle, kogo can be
moved to the leftmost COMP. In no case is the Subjacency Condition violated,
and, as we have seen in the previous case with examples like (90), more than
one 'wh'-word can occur in a single COMP. Thus, ungrammatical examples like
these can be derived under these assumptions.
If both S and S are bounding nodes, the derivation of these examples is
correctly blocked by the Subjacency Condition, and at the same time, exam-
ples like (90) can be derived (if we make the undesirable assumptions concern-
ing c-command or the \-movement rule that were discussed above). How-
ever, in this case, w/j-movement will be blocked in the structures of examples
like (93b) and (c), and (94b) and (c), which are repeated below:
(93) b. Jaki prezent Bill chciai zeby dano Janowi
s [ c \Jakiprezent] s[Bill chcial s [ c t f f i y j s[dano Janowi jt]]]]]
Notes
1. It is important to note that, because grammatical relations are defined in terms of case-
marking, syntactic configuration, and hence, word order, are not relevant in identify-
ing NPs so that their indices can be inserted into the correct positions in functional
structures. The N P n o m and N P a c c terms can be identified as Jan and Marka in the fol-
lowing, which are variants of (5 b):
i. a. Widziat Jan Marka
b. Marka Jan widziat
c. Jan Marka widziat
and so forth.
Assuming, as above, that Jan is indexed as i and Marka as /, the functional structure
shown in (14), repeated below, can be derived for these examples:
(14) W/DZ/ifC'NP n o m [] NP a c c l
Consequently, word order in Polish is relatively freer than it is in English.
2. Example (15c) and the following are synonymous:
i. What did John read about
The functional structure of i is shown below:
ii. ( N P [ i ] [READ NPj 1 pp[ABOUT N P l x i l l l
248 George . Horn
This structure and (21 c) are equivalent because they contain the same predicate, read,
and the same NP terms, John and what, which are assigned the same semantic relations,
and 7, respectively.
3. If the [e]-anaphora rule binds the empty node in structure (26c) to Np(what] in
COMP, the prefixed PP term in the functional structure derived from this syntactic
structure will still not bind a constituent in argument position, and the functional
structure will not be well-formed.
4. The [e]-anaphora rule can apply to bind empty nodes to antecedents that are not lo-
cated in COMP, in structures like (37 c). However, if the rule applies to bind the empty
node to we in (37a), or to either we or it in (37 b), then the functional structures de-
rived from the resulting surface structures, shown below, will not be well-formed:
L (NPI'1) [SEE NPj [j] NP2I x jll
ii. a. [BE[( N P [i])[SEE N P l [j]NP 2 [Xjl]]]
i = John; j = we
b. [ B E [ ( N p [ i ] ) [ S E E N P l [ j ] N p 2 [ x k ] ] ] ]
i = John; / = we; k = it
All three functional structures contain a prefixed NP term that is not assigned a se-
mantic relation and does not bind a term in argument position.
5. If the empty node in structure (57b) is bound to the NP, kim, in COMP, the func-
tional structure produced will still not be well-formed, as it will contain the prefixed
PP term, kim, i, which does not bind a term in argument position.
A more complete explanation for the ungrammatically of example (56 c) is, per-
haps, called for. The application of the [e]-anaphora rule to structure (57 c) produces
the following:
-S.
nom
References
Bach, Emmon
1977 "Comments on the paper by Chomsky", Formal syntax, edited by Peter
Culicover, Thomas Wasow and Adrian Akmajian (New York: Academic
Press), 133-155.
Bach, Emmon - George M. Horn
1976 "Remarks on conditions on transformations", Linguistic inquiry 7:265:299.
Bresnan, Joan
1978 "A realistic transformational grammar", Linguistic theory and psychological
reality, edited by Morris Halle, Joan Bresnan and George Miller (Cambridge,
Mass.: M.I.T. Press), 1 - 5 9 .
Chomsky, Noam
1973 "Conditions on transformations", A Festschrift for Morris Halle, edited by
Stephen Anderson and Paul Kiparsky (New York: Holt, Rinehart and
Winston), 2 3 2 - 2 8 6 .
1976 "Conditions on rules of grammar", Linguistic analysis 2:202-352.
1977 "On wft-movement", Formal syntax, edited by Peter Culicover, Thomas
Wasow and Adrian Akmajian (New York: Academic Press), 7 7 - 1 3 2 .
1978 On binding (Cambridge, Mass.: M.I.T. Press).
Culicover, Peter - Thomas Wasow - Adrian Akmajian, eds.
1977 Formal syntax (New York: Academic Press).
250 George . Horn
In this paper I will be concerned with a suggestion made within X-bar theory,
notably by_ Jackendoff (1977) and Emonds (1976), that PP (P') 1 has the
string S as one of its expansions. This structure, they claim, can be as-
signed to adverbial subordinate clauses, with the traditionally understood sub-
ordinating conjunctions generated under the node and the clauses under the
S node.
In what is to follow, I will first survey the analysis at issue, as presented in
Jackendoff 1977 and Emonds 1976. Then, I will discuss a criticism of the ana-
lysis made by Hendrick (1976). Next, I will re-examine some data and tenta-
tively suggest that an alternative approach to English adverbial subordinate
clauses, proposed in passing by Hendrick, should be assumed. Finally, I will
consider some adverbial subordinate clauses in Polish and show that the
Jackendoff-Emonds proposal is not applicable there. I will tentatively con-
clude that a generalization made within X-bar theory, involving the expansion
of PP mentioned above, is questionable as far as English is concerned and, as
far as the Polish facts show, it cannot be claimed to be universal.
1 Theoretical preliminaries
Jackendoff (1973) argued that prepositional phrases have a richer syntax than
had been traditionally assumed. His most recent suggestion (Jackendoff 1977:
81, 82) is that (1.1) is an appropriate rule for expanding PP (P').
(N"')-(P"')
(i.i)P'-P-, (S)
The rule for P' in (1.1) is rather similar to the rule for V' in (1.7):
Comparing (1.1) and (1.7), it can be noticed that prepositions and verbs may
be followed by the same constituents in the same relative order. These paral-
lels in the expansions of P' and V' lead Jackendoff to the assumption within
the feature analysis of X-bar theory that both verbs and prepositions have the
anc
same basic set of syntactic features j ^ Q ^ p J * that their phrase structure
rules can be collapsed into one general rule. Thus, in place of (1.1) and (1.7)
Jackendorff (1977:82) introduces (1.8):
(1.8) X
+ Obj X-(N'")I-(P",)a-(S)3
. + Comp.
Condition: If X = P, not (3 and (1 or 2)).
Such cross-categorial generalizations provide evidence for the X-bar theory as-
sumption that syntactic categories should be analyzed as feature complexes. 2
The crucial point about the generalization involving (1.1) and (1.7) that I
will be concerned with here is, as mentioned in the introduction, the occur-
rence of S in the expansion of P' as well as V'. While it is quite uncontroversial
that an S can be a V' complement, the claim that it can also be a P' comple-
ment is a novel development in the treatment of prepositional phrases.
As Jackendoff points out, the idea that he develops goes back to Klima
(1965) who "claims that subordinating conjunctions can also be analyzed as
prepositions which take an S complement [ . . .] This provides the simplest
(1.9) a. John has been living in Canada since the end of the war.
b. John has been living in Canada since the war ended.
(1.10) a. I will stay here until the beginning of the film,
b. I will stay here until the film begins.
(1.12) a. John will come because you have asked him to.
b. * John will come because you.
(1.13) a. John will come if you ask him to.
b. *John will come if you.
(1.14) a. John will come unless you ask him not to.
b. *John will come unless you.
(1.15) a. John will come although you haven't asked him.
b. *John will come although you.
This is not the case with the other items. Clearly, then, there are at least three
distinct classes of traditionally understood subordinating conjunctions: those
which can be immediately followed by a noun phrase (e.g. before) and thus
function also as prepositions, those which can be followed by a noun phrase
with the preposition of intervening (e.g. because) and those which can never
be immediately followed by a noun phrase (e.g. i f ) . It is the first class of sub-
ordinating conjunctions that has been the main motivation for Klima and,
consequently, for Jackendoff to assign them to the category P. Emonds(1976:
175) explicitly assigns the subordinating conjunctions of the two other classes
to this category too. Within the assumptions presented here, this seems to be
a reasonable thing to do and perhaps Jackendoff would agree on the matter.
If he did not, the claim would not be a very interesting one as it would con-
cern only a small group of traditionally understood subordinating conjunc-
tions which would be analyzed as prepositions and the remainder would be
analyzed as something else. In this way, a possible generalization about sub-
ordinating conjunctions would be missed. It is preferable for a theory to ana-
lyze all subordinating conjunctions in a uniform way. Whether this is a simple
task to perform is not clear. For the purpose of my discussion, I will concen-
trate on the first two classes of subordinating conjunctions mentioned at the
beginning of this paragraph.
While Jackendoff (1977) gives only one reason for regarding adverbial sub-
ordinate constructions as PP's, namely that certain subordinating conjunc-
tions appear elsewhere as ordinary prepositions followed by a noun phrase,
Emonds (1976:173-175) provides a number of arguments for such an ana-
lysis. The first observation he makes is that adverbial subordinate clauses, like
ordinary PP's, can appear in focus position in cleft sentences:
V' PP
John V PP S NP
Assuming that adverbial subordinate clauses are also PP's, the relation between
the two sentences in (1.24) can be explained by claiming that the PP lowering
rule has moved the adverbial subordinate construction, a PP, into V' and thus
changed the order of the last two constituents.
(1.24) a. John believed that he was right until the lecture ended,
b. John believed until the lecture ended that he was right.
These three pieces of evidence, then, argue quite strongly that the tradition-
ally understood subordinating conjunctions are heads of prepositional phrases.
It does not, of course, follow that the underlying structure of adverbial sub-
ordinate constructions is S, as Emonds (1976:172) recognizes.
There are, then, two distinct questions to be asked. Firstly: are subordinat-
ing conjunctions in fact prepositions? Secondly: if they are prepositions, are
they followed by S or some other constituent in underlying structure?
256 Ewa Jaworska
(2.3) a
talked
HTj-movement applies to (2.3)b giving the question in (2.2). The lowering rule
can operate on (2.3)a because there is an available PP node in V'.
Hendrick concludes that adverbial subordinate constructions cannot be
PP's but are S's, sisters of V'. A close look at the data, however, suggests that
the situation is not as straightforward as Hendrick would like it to be.
Let us first consider sentences containing three ordinary prepositional
phrases following the verb. On Hendrick's assumptions, it should be impos-
sible to extract from the third PP. Such a PP will always be a sister of V' as
there are only two PP nodes generated in V'. The sentences in (2.4) and (2.5)
are Hendrick's (1976:117). He marks the questions in b as ungrammatical.
This is somewhat surprising since these questions, as well as the other two, are
grammatical for all my English informants.
(2.4) a. John sent a package to New York by registered mail
for his friend.
b. (*)Who did John send a package to New York by re-
gistered mail for?
258 EwaJaworska
(2.10) a. What tone of voice did John say that he had met a
spy in?
b. ?What tone of voice did John say in that he had met
a spy?
(2.11) a. Which day did John hear that Martin owns a Fiat on?
b. ?Which day did John hear on that Martin owns a Fiat?
(2.12) a. Which party did John announce that he was married at?
b. ?Which party did John announce at that he was married?
The questions in (2.10)a and (2.1 l)a are repeated examples (2.8) and (2.9).
We have just seen that these sentences are good, contrary to the predictions
of Hendrick's assumptions. His assumptions also predict that the b examples
in (2.10) - (2.12), involving extraction from a lowered PP, should be gram-
matical. Clearly, this prediction does not hold either. It should be stressed
that while it is difficult to extract from a PP which has been lowered into V',
extraction from a PP which originates in V' as a strictly subcategorized argu-
ment is perfectly natural. (2.13) (2.15) illustrate:
(2.13) Who did John hear from that Mary lives in Sweden?
(2.14) Who did Bill argue with that Carter was insane?
(2.15) Who did Sally shout at that she wasn't going to do the
washing-up?
It seems, then, that Hendrick's first argument against the PP analysis of ad-
verbial subordinate constructions is unacceptable. The assumptions it is based
on and the predictions they lead to are disconfirmed by the facts about ex-
traction from PP's illustrated in (2.4) - (2.15).
It should be noted finally that the formation of questions like the one in
(2.2) above does not necessarily involve the derivation argued for by Hendrick
and presented in (2.3) since the crucial PP does not precede a strictly subcate-
gorized argument.
Hendrick's second argument against the position that adverbial subordinate
constructions are PP's of the structure - S is that there is a parallel between
complementizers and subordinating conjunctions with respect to a rule mov-
ing sentence adverbs. He refers here to a rule called Sentence Adverb Fronting
(SAF), discussed by Wexler and Culicover(1976). This rule is responsible for
relating the following sentences:
Hendrick notices that an adverb can occur between that and the following
clause as well as between a subordinating conjunction and the following clause.
(2.17) John will be in Paris next year because, hopefully, he'll get
a Fulbright.
He claims that an adverb cannot immediately precede an S if the complemen-
tizer that is not present on the surface, as illustrated below:
(2.18) *John said, hopefully, he would arrive tomorrow.
Thus, he concludes, if we assume that subordinating conjunctions are com-
plementizers, the position of sentence adverbs in sentences like (2.16)b and
(2.17), and their impossibility in sentences like (2.18) can be explained in a
simple way by saying that they always have to follow an overt complementizer
in a complement clause. Under the S analysis of adverbial subordinate
constructions this statement would have to involve two categories, a comple-
mentizer and a preposition. Therefore, since the former description is less
complex than the latter one, the COMP - S analysis of adverbial subordinate
constructions should be preferred.
It is not clear from what Hendrick says, however, what kind of constraint
he is assuming. One possibility is to say that SAF is an ordinary movement
rule and that it is a constraint on the rule stipulating that it can only apply in
an embedded clause when there is a complementizer. This, however, would
not block the derivation of (2.18). Following Chomsky and Lasnik (1977),
Hendrick would probably assume that movement rules precede deletion rules
and thus that (2.18) had an underlying complementizer which was deleted
after the application of SAF. Hence, the condition stated on the rule itself
would fail to prevent the generation of sentences like (2.18).
Another possibility, then, is to say that the condition is a surface filter.
For Chomsky and Lasnik (1977), surface filters operate on the output of
deletion. A simple filter like (2.19) would rule out sentences like (2.18), as
Hendrick's grammaticality judgements require.
(2.19) *[coMpe] Adv
Yet another possibility is to say that SAF is not an ordinary syntactic but a
stylistic rule. For Chomsky and Lasnik (1977), stylistic rules operate after
deletion and filters. The condition could then be stated on the application of
the rule.
Whichever possibility Hendrick would adopt, the constraint on SAF within
his COMP S analysis would be simpler than within Jackendoff's S ana-
lysis. For Jackendoff, if the constraint had the form of a surface filter, an ad-
tional condition indicating that a sentence adverb may follow either a com-
Adverbial subordinate constructions 261
Secondly, one might claim that prepositions are followed by a bare S, not
an S. Bresnan (1979) proposes that this is true of certain verbs. Assuming that
prepositions often behave like verbs, subordinating conjunctions, regarded as
prepositions within X-bar theory, could be said to be followed by a bare S in
the underlying structure and thus a complementizer would never appear in
such structures.
It is clear, then, that the absence of the complementizer that in adverbial
subordinate constructions does not constitute a serious problem for the as-
sumption that subordinating conjunctions are prepositions followed by sen-
tences. Furthermore, as Hendrick himself notices, there are some cases where
subordinating conjunctions, which can also function as ordinary prepositions,
are followed by a /-clause:
In traditional grammar, despite that and in that are regarded as complex sub-
ordinating conjunctions(cf. Quirk et al. 1972). Clearly, they do not constitute
a problem for the Jackendoff analysis. In fact, he does not discuss such cases.
If he did, he would probably modify the claim about obligatory deletion of
that by saying that it is obligatory with some prepositions, like before, and
blocked with others, like despite. Again, this would make the class of pre-
positions look more similar to the class of verbs. Certain verbs, like want,
normally require deletion of a complementizer:
crosses three main boundaries. At least two of them must be bounding nodes
in order for subjacency to account for (3.1). Baltin (1978) argues that PP, NP
and S are all bounding nodes. Clearly, the wh-word in (3.2) crosses two of
them, S and PP on the second cycle. This, then, is how the islandhood of ad-
verbial subordinate clauses might be explained. However, this combination of
bounding nodes is problematic. Although it can correctly block sentences like
(3.3)a in the way presented schematically in (3.3)b, it cannot block sentences
like (3.4)a, noticed by George Horn (personal communication), as illustrated
in (3.4)b.
While in (3.3) the w/j-word crosses two bounding nodes, NP and PP, on its
way to the front of the sentence, the wh -phrase in (3.4) crosses only one
bounding node, NP. To block (3.4)a, S and NP must be bounding nodes.
However, if PP is also a bounding node, as it is for Baltin, (3.5)a is blocked, as
illustrated in (3.5)b.
If PP were not a bounding node and S and NP were, sentences like (3.3)a,
(3.4)a and (3.5)a would be correctly predicted by the subjacency condition
on w/i-movement. With S and NP as bounding nodes, however, subjacency
would not be able to explain the islandhood of adverbial subordinate clauses
exemplified in (3.1) and (3.2).
The above considerations suggest that subjacency cannot account for the
islandhood of adverbial subordinate clauses. 9
In fact, it is worth noting that Hendrick cannot invoke subjacency either.
Consider the following:
(3.6)a represents the structure which Hendrick assumes for adverbial subordi-
nate constructions. Unless S and S are both bounding nodes, there is no way
to prevent the derivation of (3.6)b. 1 0 Obviously, these two nodes cannot be
both bounding nodes, as it would never be possible to generate sentences like
(3.7)a.
Adverbial subordinate constructions 265
(3.8) ^ ^ P P ^ ^
NP
COMP S
that
For Hendrick, this structure is relevant only in the cases of adverbial subordi-
nate clauses where that is overtly present. I would like to propose, rather ten-
tatively, that all English adverbial subordinate clauses involve a complex NP
in the underlying structure. This proposal would treat all adverbial subordi-
nate constructions as PP's and would thus be compatible with the three argu-
ments given by Emonds. It would also, however, affect Jackendoff's generali-
zation concerning verbs and prepositions in that it would be only verbs but
not prepositions which can be followed by sentential complements. Thus, sen-
tences like (3.9)a and (3.10)a would, respectively, have structures presented
schematically in (3.9)b and (3.10)b.
raining]]]
(3.10) a. John went out before it started to rain.
b. John went out[ P P [pbefore][ N p[ N pe][gitstarted to
rain]]]
The advantage of assuming this analysis of adverbial subordinate constructions
rests in the fact that extraction from these clauses can be regarded as a viola-
tion of the complex NP constraint first formulated by Ross (1967:70). The
266 Ewa Jaworska
exact nature of the constraint is not important here: it can be regarded either
as a consequence of Chomsky's subjacency condition or of Horn's (1974,
1979) NP constraint. What is important here is that complex NP's are syntac-
tic islands and thus that the extraction facts can be explained if adverbial sub-
ordinate constructions involve a complex NP.
As mentioned earlier, Hendrick sees no real motivation for the empty head
of the complex NP in English. It seems, however, that some motivation can be
provided. Consider the following pairs of sentences:
(3.11) a. John arrived on time despite that the train was late,
b. John arrived on time despite the fact that the train was
late.
(3.12) a. John didn't come because his car had broken down,
b. John didn't come because of the fact that his car had
broken down.
As we have seen, the initial motivation for regarding the English subordinating
conjunctions as prepositions is the fact that they can be followed by a sen-
tence as well as by a noun phrase:
Jan przyjechat
Therefore, we can say that mimo is quite like the English despite in that it is
optionally followed by an NP preceding an embedded clause. In the case of
mimo, the NP is a form of a demonstrative pronoun; in the case of despite it
is the fact. The non-occurrence of the demonstrative in Polish, like of the fact
in English, can be explained in two ways: either the demonstrative head of
the complex NP is present in the underlying structure and then optionally
deleted or the head is generated as an empty node in some circumstances.
Which analysis is correct is not important here. The important thing is that
with prepositions like przed, po and mimo followed by adverbial subordinate
clauses the underlying structure is clearly - NP and not - S.
Another set of Polish adverbial subordinate constructions is presented in
(4.12M4.14).
(4.12) Jan bdzie w Paryzu w przyszlym roku dlatego, ze
dostanie stypendium.
John will be in Paris in next year for-this that (he) will get
grant.
'John will be in Paris next year because hell get a grant.'
(4.13) Piotr poznat Ann przedtem, jak kupil samochod.
Peter met Ann before-this how (he) bought car.
'Peter met Ann before he bought the car.'
(4.14) Piotr poznat Ann potem, jak kupit samochod.
Peter met Ann after-this how (he)bought car
'Peter met Ann after he bought the car.'
Extending Jackendoff s analysis, dlatego, przedtem and potem might be as-
signed to the category P, followed by ze- and /afc-clauses within a PP. How-
ever, dlatego, przedtem and potem can never be followed by a noun phrase.
Therefore, treating them as prepositions in the above sentences would be
rather dubious.
Following Hendrick's proposal of the COMP - S analysis of adverbial sub-
ordinate constructions, one might claim that dlatego ze, przedtem jak and
potem jak are complex complementizers. We have seen, however, that the idea
of a complex complementizer is not very plausible for English. In Polish,
there are two more plausible analyses for the adverbial subordinate construc-
tions in (4.12)-(4.14).
The first possible analysis is based on the observation that dlatego, przedtem
and potem are each composed of two elements: a preposition, dla 'for', przed
'before' and po 'after', and a form of the demonstrative pronoun ten. In the
case of dlatego, ten appears in its genitive form tego. This is not surprising
since dla is normally followed by a noun phrase in the genitive case:
270 Ewa Jaworska
The jak following przedtem and potem in (4.13) and (4.14) is also a comple-
mentizer (cf. note 14).
Thus, using evidence from case marking of ten, we could say that dlatego
ze S, przedtem jak S and potem jak S derive from a PP containing the prepo-
sition dla, przed and po, respectively, followed by a complex NP with ten as
the head noun. This analysis is essentially similar to the one proposed for sen-
tences like (4.6). (4.17) illustrates the underlying structure of (4.12).
(4.17) S
NP V"
V' PP PP
dla S
ten COMP S
ze Jan dostanie
stypendium
After ten in (4.17) has been assigned the genitive case marking, the preceding
preposition is adjoined to it, which results in dlatego. This readjustment opera-
tion is independently motivated by w/i-questions like (4.18).
Adverbial subordinate constructions 271
Maria rozmawiaia
ktrym m^zczyznq
(4.20)
Maria rozmawiaia
Det
Det mqzczyznq
ktrym
Thus, to claim that dlatego derives from dla and tego as well as przedtem and
pot em from przed and tym, and po and tym, respectively, does not seem un-
reasonable.
272 Ewa Jaworska
dlatego COMP S
ze Jan dostanie
stypendium
Shortly, I will argue that this is preferable to the one illustrated in (4.17).
A rather different type of adverbial subordinate clause is illustrated in the
following:
(4.22) Jan nie przyjdzie, bo ztamai nog.
John not will come because (he)broke leg
'John won't come because he's broken a leg.'
(4.23) Maria by la tu, zanim poznala Paw la.
Mary was here before (she)met Paul
'Mary was here before she met Paul.'
(4.24) Przetlumacz ten tekst, odkqd zaznaczylem.
(you)translate this text from-where (I)marked
'Translate this text from where I've made a mark.'
(4.25) Przeczytaj ten list, dokqd cipokazalem.
(you)read this letter to-where you (I)showed
'Read this letter up to where I've shown you.'
(4.26) Piotr zadzwoni, skqd bgdzie mgt.
Peter will phone from-where (he)will be can
'Peter will phone from where he'll be able to.'
Bo 'because', zanim 'before', odkqd 'from where', dokqd 'to where' and skqd
'from where' function here as subordinating conjunctions. They can never,
however, be followed by an NP like ordinary prepositions and therefore it
Adverbial subordinate constructions 273
Conclusion
In the above discussion I have been concerned with adverbial subordinate con-
structions. According to Jackendoff (1977) and Emonds (1976), the subor-
dinating conjunction belongs to the category and is followed by an S in the
underlying structure. This assumption makes the base rule of a PP parallel to
the base rule of a VP, which provides some motivation for X-bar theory: verbs
and prepositions can be referred to in terms of the same basic features [+Obj]
and [+Comp], and this kind of generalization across various syntactic cate-
gories is exactly what the theory is supposed to capture. If an S were not a
possible complement of a PP, the generalization about the two categories and
the subsequent simplification would be precluded. Consequently, a broader
generalization within which all the four major syntactic categories, NP, VP,
PP and AP, can take a final S in their expansions, would not be possible. The
initial motivation for treating subordinating conjunctions as prepositions was
that some of them, like before and after, are ordinary prepositions elsewhere.
Hendrick (1976) tries to show that, for various reasons, an adverbial sub-
ordinate construction cannot be treated as a preposition followed by an S.
His main alternative to the S analysis is that the subordinating conjunc-
tion is a complementizer, followed by an S. As I have shown, his arguments
against Jackendoff (and Emonds) and for his own position are untenable. He
makes an important observation, however, that extraction from adverbial sub-
ordinate clauses, unlike from other types of subordinate clauses, is impossible.
As we have seen, there is no obvious way to account for this with either a
- S or COMP - S analysis. In this respect, the COMP S analysis is as de-
fective as the - S analysis.
A re-examination of the English data within yet another analysis, men-
tioned but not explored by Hendrick, has lead us to an explanation of the
islandhood of certain adverbial subordinate clauses. This analysis treats an
adverbial subordinate construction as a PP consisting of a preposition (the
subordinating conjunction) followed by a complex NP. As is generally ac-
cepted, complex NP's are islands. A tentative claim is, then, that adverbial sub-
ordinate constructions are, indeed, prepositional phrases, which is consistent
with Emonds' arguments and Jackendoff s assumptions, but that their internal
structure is - NP, which is inconsistent with the X-bar theory claim. Thus,
the facts about extraction seem to argue against the cross-categorial generali-
zation and remove one piece of motivation for X-bar theory.
Certainly, more data have to be considered, especially sentences containing
subordinating conjunctions like i f , although, etc., which I have neglected in
the present paper. They seem to constitute a problem for the analysis advanced
here. Regarded as prepositions, such conjunctions could not be subcategorized
278 Ewa Jaworska
just for an NP because of the impossibility of, for example, *although John.
They would have to be subcategorized for a specific type of an NP, namely,
a complex NP with an empty head. Obviously, this is not a desirable solution.
It may be the case, then, that they should, indeed, be regarded as comple-
mentizers. Evidence more plausible than that presented by Hendrick would
have to be provided. It also remains to be seen whether the argument against
the - S analysis of English adverbial subordinate constructions, favouring
the - NP analysis, is the only one available and whether it is strong enough
to constitute a serious problem for X-bar theory where it makes a generaliza-
tion concerning verbs and prepositions.
Polish adverbial subordinate constructions do not provide any motivation
at all for the analysis advocated by Jackendoff. No ordinary preposition in
Polish can be immediately followed by a clause. Prepositions in Polish always
have to be followed by an NP, either simple or complex. Unlike irx English,
then, in Polish, the traditionally understood subordinating conjunctions and
prepositions are two distinct classes of words, not overlapping with each other.
As far as Polish subordinating conjunctions are concerned, good reasons have
been given above for regarding them as complementizers in an S embedded in
a complex NP or AdvP. Jackendoff s analysis of adverbial subordinate struc-
tures, even if it is adequate for English, is not applicable in Polish and cannot
be claimed to be universal. A further examination of Polish data should reveal
if it is true of all Polish subordinating conjunctions that they are comple-
mentizers.
Finally, it must be noted that other cross-categorial generalizations should
be subjected to similar scrutiny. The current state of X-bar theory requires
that more elementary research is done before the most basic assumptions can
be regarded as securely established. Along with English, other languages should
be investigated to secure the viability of X-bar theory generalizations.
Notes
* I would like to express my sincere thanks to Bob Borsley, who encouraged me to take
up the topic and was patient enough to give me a lot of guidance and enlightening com-
ments as the work on the paper progressed. He has also provided advice on the English.
He is not responsible for the faults of this paper. I am also grateful to my informants
for the English data, especially Karelia Trabold-Szkoda and Dick Weist.
1 . 1 will use the X-bar notation where it is necessary for the clarity of the presentation
of the material. Otherwise, I will use the traditional notation.
2. The apparently simple condition in (1.8) handles the fact that, unlike with verbs,
with prepositions, there cannot be any constituent beween the head and the com-
plement sentence. Jackendoff (1977:82) admits that in the cases of collapsing other
categories, where their individual expansions differ to a larger extent, the conditions
stated in the negative form may become so elaborate that the generalizations intended
to be captured by the phrase structure rules may look implausible.
Adverbial subordinate constructions 279
7, Chomsky and Lasnik (1977) would say that the that is deleted by a rule of free
deletion in COMP. This rule has been criticized by Pullum and Postal (1979).
8, The arguments outlined here owe much to discussion with Bob Borsley.
9, Subjacency is a rather dubious constraint anyway. For critical discussion see Bresnan
1976, Maling 1978, Allen 1980, Borsley 1981 and Horn - Borsley (1981).
10 Actually, the derivation of (3.6)b might be blocked by the opacity condition of
Chomsky (1980). The condition says that if a is in the domain of the subject of
( = S or NP) then cannot be free in . In (3.6)b, the trace in the original position
is in the domain of the subject of the embedded clause and, thus, should not be free
within the lower S. It is, however, since the trace in COMP does not c-command it.
It seems dubious, on the other hand, whether the opacity condition is a viable means
of blocking (3.6)b since, as Bob Borsley pointed out to me, it should also block good
sentences like (3.7)a.
11. (3.8) is a slightly modified version of the diagram given by Hendrick (1976:119).
12, The deletion of the head NP in (3.1 l)b and (3.12)b would trigger the deletion of the
complementizer. It is not quite clear how exactly to handle the of in (3.12).
Geis (1970) suggests that sentences like (3.13)b constitute an intermediate struc-
ture from which (3.13)a is derived through deletion of the time at which.
13. The instrumental and locative forms of ten are both tym:
(i) Jan rozmawiat tym ioinierzem.
John talked with this soldier
(ii) Jan rozmawiat tym zotnierzu.
John talked about this soldier
14. See Borsley 1981 for arguments that in certain circumstances jak should be regarded
as a complementizer rather than as a vWi-word.
15, Mimo takes also noun phrases in the accusative case:
(i) a. mimo wszystko
despite everything(acc)
b. *mimo wszystkiego(gen)
The demonstrative pronoun ten following mimo as the head of a complex NP may
appear either in the accusative or in the genitive form (cf., for example, Urbaiiczak
1966:274 -275 and Pisarek 1978:56).
16, For discussion of Polish free relatives and evidence that the -words are ordinary
wft-words in COMP see Borsley (this volume).
17. Speakers vary in the realization of this phenomenon with jak: jakzei and jakei are
the alternatives. Jaks has been chosen here for the sake of simplicity.
280 Ewa Jaworska
References
Allen, C.
1980 "Movement and deletion in Old English", Linguistic inquiry 11:261-323.
Baltin, .
1978 " as a bounding node", Proceedings from the Eighth Annual Meeting of
the North Eastern Linguistic Society, edited by M. J. Stein (Amherst, Mass.:
University of Massachusetts), 3 3 - 4 0 .
Borsley, R. D.
this volume "Free relatives in Polish and English".
1981 "WA-movement and unbounded deletion in Polish equatives", Journal of
linguistics 1 7 : 2 7 1 - 2 8 8 .
Bresnan, J. W.
1976 "Evidence for a theory of unbounded transformations", Linguistic analysis
2:353-393.
1979 Theory of complementation in English syntax (New York: Garland).
Chomsky, N.
1980 "On binding", Linguistic inquiry 1 1 : 1 - 4 6 .
Chomsky, . - H. Lasnik
1977 "Filters and control", Linguistic inquiry 8 : 4 2 5 - 5 0 4 .
Emonds, J. E.
1976 A transformational approach to English syntax (New York: Academic Press).
Geis, . L.
1970 "Time prepositions as underlying verbs", Papers from the Sixth Regional
Meeting of the Chicago Linguistic Society, edited by M.A.Campbell,
J. Lindholm, A. Davison et al. (Chicago, Illinois: University of Chicago),
235-249.
Hendrick, R.
1976 "Prepositions and the X' theory", UCLA papers in syntax 7 : 9 5 - 1 2 2 .
Horn, G. M.
1974 The noun phrase constraint (Ph. D. dissertation, University of Massachusetts,
Amherst). Reproduced by the Indiana University Linguistics Club, 1977.
1979 A lexical interpretive approach to some problems in syntax (Bloomington:
Indiana University Linguistics Club).
Horn, G. M. - R. D. Borsley
1981 "On O n binding'", Linguistics 1 9 - 1 1 / 1 2 : 1133-1164.
Jackendoff, R. S.
1973 "The base rules for prepositional phrases", A Festschrift for Morris Halle,
edited by S. Anderson and P. Kiparsky (New York: Holt, Rinehart and
Winston), 3 4 5 - 3 5 6 .
1977 X syntax: A study of phrase structure (= Linguistic inquiry monograph, 2)
(Cambridge, Mass.: M.I.T. Press).
Klima, . S.
1965 Studies in diachronic syntax (Ph. D. dissertation, Harvard University).
Maling, J.
1978 "An asymmetry with respect to w/i-islands", Linguistic inquiry 9 : 7 5 - 8 9 .
Pisarek, W.
1978 Stownik fezyka niby-polskiego, czyli bt^dy fezykowe w prasie [A dictionary
of semi-Polish or linguistic errors in the press] (Wroclaw: Zak tad Narodowy
im. Ossoliriskich).
Pullum, G. . - P. M. Postal
1979 "On an inadequate defense of 'trace t h e o r y ' " , Linguistic inquiry 10:689
706.
Adverbial subordinate constructions 281
The choice of the channel, either written or spoken, imposes specific con-
straints on the situation of the act of translation and, consequently, on the
requirements on the translator as well as the form of the message.
Here are some of the situational differences in interpreting as opposed to
written translation: the same context of situation for the speaker, interpreter
and the hearers; transient character of the message; transcoding spoken message.
As a result of these differences there are specific requirements on the inter-
preter, which can be summarized as follows: perfect command of the spoken
mode of Lj and L 2 ; ease of spoken expressions; good memory and/or skill of
note-taking (CI); special ability to divide attention between listening and
speaking (SI); resistance to psychological pressure resulting from the transient
character of the message and the presence of the speaker and the hearers;
ability of quick response to necessary adjustments in interpreting.
From our discussion of translational competence it follows that an ideal
interpreter/translator has an ability to transcode TVs into equivalent T 2 's,
which also means that he can unequivocally tell which texts or sentences in
L j and L 2 are equivalent. It seems that the problem of equivalence is of
crucial importance when we discuss quality in interpretation and in analyzing
the interpreter's strategies.
I submit that four concepts are important when we talk about interlan-
guage synonimity: equivalence, congruence, correspondence and communica-
tive appropriateness. Let us discuss each of them briefly,
- equivalent are pairs of messages in L t and L 2 when they have the same
semantic representations,
- congruent are pairs of messages in L j and L 2 when they have the same
semantic representations and consist of the same number of equivalent for-
matives arranged in the same order (Krzeszowski 1971, Marton 1968).
It follows from the definition that a pair of congruent constructions in ad-
dition to the same deep structures must meet the requirement of the sameness
of their surface structures. This claim on congruence is a very strong one espe-
cially the sameness of the number and order of formatives, and is suited for
very delicate contrastive analyses of restricted subsystems of grammar of re-
lated languages. It severely limits the number of congruent constructions.
Consider:
(1) a. Marek zaprosit Susan do Polski.
'Mark invited Susan to Poland.'
b. Mark invited Susan to Poland.
correspondent are pairs of messages in L t and L 2 when they have the same
semantic representations and are word-for-word translations of each other
inasmuch as the rules of grammar of either language permit.
Correspondent are both those cases where word-for-word translation is pos-
sible and those where such translation is impossible because of structural or
cultural differences between languages provided they are the closest possible
translations. Thus,
Only (12) and (14) are correspondent translations. (11) and (13) are not
because literal translations are possible but the translator, for some reason,
chose to use paraphrases. The reason for such a choice cannot be explained in
purely linguistic terms and one must take into consideration a larger context.
Hence the need for a concept of communicative appropriateness, which can
be defined as follows:
communicatively appropriate are pairs of messages in L t and L 2 when they
perform the same (= similar) communicative functions in terms of
the speaker, his status and the status of his receptors,
the speaker's intention in performing the speech act,
the speaker's stance toward the speech act and the receptors,
the receptors' stance toward the speech act and the speaker,
the illocutionary force of the speech act,
the verbal, paralinguistic and non-verbal form of the speech act,
the existing norms of interaction and interpretation,
the setting.
Here are some examples of translations in which some element of communi-
cative appropriateness has been missed.
In translations (15) and (16) exponents of the speaker's stance toward the
proposition have been missed I believe, might. In (17) the non-verbal signal
of agreement was replaced by its verbalized counterpart dobrze O . K . ' All
of the non-verbal signals must be missed in SI because the interpreter is not
visible. They can be easily expressed in CI.
Quality in conference interpreting 289
(19) After the delegation held talks with the Prime Minister, it
toured the city.
Delegacja odbyla rozmowy Premierem. Nast^pnie
zwiedziia miasto.
'The delegation held talks with the Prime Minister. Then it
toured the city.'
a) Errors of translation is the most general category overriding the other three
in the sense that an error of translation may be caused by any of the remain-
292 Andrzej Kopczynski
Errors of competence
Syntactic and phrasal errors of competence invariably result in a grammatical-
ly deviant form in L 2 . They are therefore easy to identify by the hearer. They
Quality in conference interpreting 293
create noise in the channel and can block communication altogether. Here are
some examples:
It is characteristic that all of the word order errors were made in SI. It is a
type of error that reflects a strategy of the interpreter: he keeps too close to
the speaker and follows the Polish word order.
(23) agreement
. . . tak narod chinskijak amerykanski cechuje. ..
'. . . the Chinese and American nations are characterized . . .
the Chinese and American nations is characterized
Among the phrasal errors the most common are those affecting the noun
phrase, especially those in the use of the article and prepositions, e.g.
(24) a. . . . Przewodniczacy Mao Tse-Tung. ..
' . . . Chairman Mao Tse-Tung . . . '
- the President Mao Tse-Tung
b. . . . przy tej okazji
. .on this occasion'
- in this occasion
They are committed with great regularity, which is one of the indications
that they are true errors of competence.
As opposed to grammatical errors lexical ones are not easily identifiable.
Some of the types found were: broadening of Lx meaning, narrowing of L j
meaning, use of a wrong meaning of a polysemous item, use of an L 2 item
similar to L t item, use of an L 2 item similar to another L 2 item, etc., e.g.
Errors of performance
Among errors of performance we have included all those identifiable disturb-
ances which deviate from its fluent rendition by an ideal speaker-interpreter,
e.g.
(26) hesitation:
a. exchanging . . . eh . . . of views on issues of . . . eh
interest
b. - chcialbym . .. mm . . . ziozyc ho id
'I'd like . . . eh . . . to pay homage . . . '
(27) stuttering:
- The Chinese peo . . . peo . . . people is a great peo . . .
peo . . . people
(28) repetition:
the . . . the . . . Chinese government
In (30) the words the and its are omitted in Polish because of differences in
structures and in (31) Szanowna 'Distinguished' is a necessary addition arising
from a difference in cultural conventions.
By contrast:
In the translation the word other was missed without any linguistic or cultural
motivation. It's an example of an optional omission.
In our material there is a definitive pattern among omissions and additions
which points to a strategy in tackling Tj by the interpreter. The overwhelming
majority of omissions were those of modifying elements of different kind:
from an omission of a determiner, an adjective, an adverbial to that of relative
phrases and clauses and adverbial clauses. The strategy is apparently to focus
attention in the first instance on the constitutive part of T x the main clause,
the main verb, the head noun. The modifying elements are evidently viewed
as redundant and frequently dismissed.
The causes of those errors cannot be unequivocally determined; some of
them are probably errors of performance (memory lapses etc.), others are er-
rors of receptive competence.
Although we found a small number of additions in the translated texts,
they followed the same pattern - most characteristic additions were those of
modifying elements.
296 Andrzej Kopczynski
Errors of appropriateness
In our corpus they involved inappropriate usage in terms of register and in-
appropriate expression of illocutionary force.
Errors of translation
As mentioned above errors of translation are those that block communication
and are caused by errors of competence, performance or appropriateness, or a
combination thereof.
Theoretically speaking T 2 in relation to Tj can be:
Non-equivalent T 2 's are those that block communication. From the above
typology it follows that only cases (vii)(xi) are errors of translation, (vii) be-
ing the 'purest' case of such error.
There is finally one more factor that should be taken into consideration
when we talk about errors in conference interpreting (and written translation
as well): the problem of identiflability of the error. Along this scale we can
have a continuum of texts. The two extremes would be:
1. an apparently flawless text in terms of L 2 grammar and appropriateness
but with numerous errors of competence, performance and appropriateness
that are not identifiable, e.g. case (vii) above.
2. a text which contains identifiable errors of performance and/or competence
and appropriateness but which is still practically equivalent (cases iii-vi).
If along this scale we find texts of type (1) and (2) which both are non-equi-
valent the type (1) text is especially damaging to communication because in
such a case the hearers are not alerted to the imprecision of translation. In the
type (2) text the hearers are at least aware that something is wrong.
Errors of various kinds, especially errors of performance, are part and
parcel of interpreting. Therefore the criteria for evaluating oral translation
should be less stringent than for written translation and the level of expecta-
tion lower. But the minimum limit must be set and that is practically equi-
valent translation, i.e., one that may not be flawless but which carries over the
referential content of the original.
We might say that successful conference interpreting is successful not be-
cause it is free of errors but in spite of errors.
References
Barik, H. A.
1969 A study of simultaneous interpretation, unpublished PhD diss. (University of
North Carolina).
1973 "Simultaneous interpretation: temporal and quantitative data", Language
and speech 1 6 : 2 3 7 - 2 7 0 .
1975 "Simultaneous interpretation: qualitative and linguistic data", Language and
speech 1 8 : 2 7 2 - 2 9 7 .
Casagrande, J. B.
1954 "The ends of translation", IRAL 2 0 . 4 : 3 3 5 - 3 4 0 .
Chernov, G. V.
1978 Teoria i praktika sinkhronnogo perevoda [Theory and practice of simultane-
ous interpretation) (Moskva: Mezhdunarodnoye Otnoshenya).
Corder, S. P.
1967 "The significance of learners' errors", IRAL 5 . 6 : 1 6 1 - 1 7 0 .
1971 "Idiosyncratic dialects and error analysis", IRAL 9 . 2 : 1 4 7 - 1 6 0 .
300 Andrzej Kopczynski
Enkvist, . E.
1973 "Should we count errors or measure success", Errata, edited by Jan Svartvik
(Lund: CWK Gleerup), 1 6 - 2 3 .
Gerver, D.
1969 "The effects of source language presentation rate on the performance of
simultaneous conference interpreters", Proceedings of the 2nd Louisville
Conference on Rate and/or Frequency Controlled Speech, edited by E. Foulke
(Louisville: University of Louisville), 1 6 2 - 1 8 4 .
1971 Simultaneous interpretation and human information processing, unpublished
PhD diss. (Oxford University).
1976 "Empirical studies of simultaneous interpretation: a review and a model",
Translation: application and research, edited by R. W. Brislin (New York:
Gardner Press), 1 6 5 - 2 0 7 .
Goldman-Eisler, F.
1968 Psycholinguistics: experiments in spontaneous speech (London: Academic
Press).
Kopczynski, A.
1980 Conference interpreting: some linguistic and communicative problems
(Poznan: UAM).
Krzeszowski, T. P.
1971 "Equivalence, congruence and deep structure", Papers in contrastive lin-
guistics, edited by G. Nickel (Cambridge: University Press), 3 7 - 4 8 .
Marton, W.
1968 "Equivalence and congruence in transformational contrastive studies",
Studia Anglica Posnaniensia 1 : 5 3 - 6 2 .
Ure, J. L.
1964 "Types of translation and translatability", flaie/ 10.1: 5 - 1 1 .
Van Hoof, M.
1962 Theorie et pratique de ('interpretation (Munich: Max Hueber).
Widdowson, H.
1973 "Directions in the teaching of discourse", Theoretical linguistic models in
applied linguistics, edited by S. P. Corder and E. Roulet (Brussels: Didier),
65-76.
TOMASZ P. KRZESZOWSKI
Tertium comparationis
One of the reasons why contrastive analysis (CA) continues to perform the
role of Cinderella of Linguistics is the fact that its most fundamental concept
remains as hazy as ever. Although such notions as semantic equivalence, trans-
lation equivalence, congruence, etc., are the standard jargon of the trade and
have received considerable attention from CA-theorists, tertium comparationis
(TC), the concept that lies at the heart of any comparison (eo ipso at the heart
of CA), remains remarkably neglected.
The existing CAs implicitly involve various platforms of interlinguistic refer-
ence, determined by specific linguistic models which they employ and specific
levels of analysis which they embrace. Thus different TCs are used for com-
parisons in lexicology, in phonology and in syntax. With the exception of the
omnipresent initial assumption concerning the semantic identity or at least
similarity of the compared phenomena, in few of these studies explicit men-
tion of any TC is made, leave alone any attempts at justifying a particular
choice. In the existing literature two types of TC have been employed and dis-
cussed: formal correspondence and semantic equivalence. Due to the fact
that neither concept is very sharp a peculiar schizophrenic situation emerged:
while, on one hand, it was fortunate that neither TC has been consistently
used in CA, it was, on the other hand, unfortunate that no satisfactory ana-
lysis of the relevance of these two kinds of TC in CA has been attempted.
Even a cursory glance at the wealth of the existing CAs suffices to notice
that the two types of TC are not the only ones that are used in practice, even
if theoretical discussions tend to be limited to just those two (e.g. Lado 1957,
Spalatin 1969, Ivir 1969,1970). 1 Moreover, neither of the two types of TC is
based on a homogenous set of assumptions determining practical and theoret-
ical status of CA.
302 Tomasz P. Krzeszowski
The present paper will outline three related topics: 1. the inherent am-
biguity of the concepts 'formal' and 'semantic' TC, 2. other possible TCs rele-
vant to CA; 3. interdependencies between various types of TC.
Formal and semantic likeness have been both used in contrasting various
aspects of languages. The two kinds of resemblance have also received some
attention in theoretical discussions. It has been pointed out, for instance, that
formal likeness alone cannot serve as TC without support from semantic equi-
valence (Liston 1970, Lipinska 1977). At best a comparison based on formal
criteria alone is incomplete, at worst it cannot be performed at all, and in
many cases it is misleading (see also Spalatin 1969).
If, for example, one compares Polish and English personal pronouns, a
formal analysis will ascertain the equivalence between the English you and
the Polish ty/wy and will be accurate as far as it goes. Such an analysis is in-
complete as it leaves out such forms as PanjPani and other possible equivalents
of you. These equivalents can only be established if criteria other than formal
are employed (for details see Krzeszowski 1976). English articles cannot be
compared with anything in those languages in which there are no articles, if
only formal criteria are considered. Finally, in the case of such phenomena as
the present perfect tense in English and passe compose in French the formal
analysis is misleading since the formal similarity is not matched by semantic
similarity, a kind of situation which often causes considerable learning pro-
blems (see Politzer 1968). Therefore, it is generally recognized that a con-
trastive analysis based on purely formal criteria falls short of both theoretical
and practical expectations.
Somewhat less obviously a contrastive analysis based on semantic similarity
alone can also be inadequate and misleading. In the contrastive practice se-
mantic equivalence is often erroneously identified with translation equivalence:
To establish that these (systems of deictics, supplied by me] are comparable, we first
need to show their contextual equivalence; this can be done most simply by reference
to translation (Halliday et al. 1964:115).
One can easily show that translation equivalents are often semantically non-
equivalent (for discussion and examples see Krzeszowski 1974, 1979). It ap-
pears that semantic equivalence must be constrained formally, while trans-
lation equivalence does not have to be so constrained. When one translates
one departs from semantic equivalence due to three types of reasons: 1. errors
in translation (which will be disregarded here); 2. formal properties of respec-
tive languages; 3. what is generally loosely termed 'stylistic' reasons. These
three types of reasons lead to situations in which actual translation practice,
with the exception of legal texts, seldom involves translations which are
semantically equivalent. This means that only some translations can be used
Tertium comparationis 303
as data for systematic CAs (cf. Ivir 1969), while translation as a method of
contrasting must be regarded with caution.
Translation may be viewed amorphously as the rendition of a text from one language to
another. This is translation from the standpoint of la parole: the text, the act of speech
or writing, is the thing. Or it may be viewed as a systematic comparison of two Lan-
guages: this is translation from the standpoint of la langue. (Bolinger 1966:130)
In the subsequent remarks I would like to say a little more about the inade-
quacy of the approaches limited to the alternative 'formal and/or semantic'
resemblance as the only possible TC. To begin with, as has been said earlier
on in this paper, the very meaning of the two concepts is richer than is com-
monly recognized in CA. For instance, 'formal' can be extended to cover the
entire plane of expression (in Hjelmslevian terminology). In most American
studies 'formal' is restricted to word order, function words, inflections, af-
fixation and suprasegmentale. In the broader perspective 'formal' would also
embrace such aspects of expression as alliteration, rhymes, rhythm, to men-
tion just a few phenomena of clearly 'formal' nature but extending beyond
the limits of traditional interpretations of 'formal'. Many of these 'formal'
properties would find their place in the study of functionally equivalent texts.
The notion 'semantic' is often also extended to cover matters of pragmatics,
especially by those authors who identify semantic equivalence with translation
equivalence:
Our experience is that languages can be effectively contrasted only on a semantic basis,
specifically, on the basis of translation equivalence (Spalatin 1969:34).
In reality, many authors have shown that semantic equivalence is not a neces-
sary prerequisite of a good translation (cf. Riilker 1973, Krzeszowski 1974,
Kopczynski 1980). What is expected of a correct translation is pragmatic or
functional equivalence. It cannot be denied that pragmatic equivalence can
serve as TC for contrastive analyses of such matters as the structure of dis-
course, stylistic properties and quantitative aspects of texts. But syntactic
CAs, the primary concern of earlier CAs, must be conducted within the limits
of the semantic component of the language, or more specifically that part of
the semantic component which can be systematically and predictably corre-
lated with the grammatical structure of sentences.
This restricted sense of 'semantic' still embraces some aspects of meaning
which are traditionally relegated to 'pragmatics' or "interpersonal function"
of sentences (Halliday 1970). According to Halliday the systems of mood and
304 Tomasz P. Kizeszowski
modality are precisely those systems which relate sentences to their inter-
personal functions. I suggest that the notion 'sentence-semantics' should cover
those elements of 'pragmatics' which can be correlated with the structure of
sentences, even if consistency in this area is definitely out of the question: de-
clarative, interrogative and imperative sentences do not necessarily perform
the functions of statements, questions and commands, respectively. In so far
as some correlation between form and function does exist those 'functional'
aspects of sentence structure constitute the border area between pragmatics
and semantics and should be included in any semantic-syntactic CAs (for de-
tails see Krzeszowski 1974). Therefore, James (1980) suggests that for the
purposes of CA translation equivalents should be limited to those which are
both semantically and pragmatically equivalent. However, this proposal also
raises doubts. Presumably what James means by 'semantic' refers to Halliday's
'ideational' function of sentences, while Halliday's 'interpersonal' and possi-
bly 'textual' functions fall under 'pragmatic'. Under James' proposal many sen-
tences across languages would exhibit both 'ideational' (semantic) and 'inter-
personal' (pragmatic) equivalence, yet one would hardly wish to use them as
data for syntactic CAs. For example:
are equivalent, both ideationally (agent, transitive, verb patient) and interper-
sonally (general question); yet they falsely suggest a relationship between syn-
tactic types represented by 1 and 2 in English and Polish. A systematic syn-
tactic correspondence would have to be ascertained between 1 and 3 rather
than between 1 and 2:
Both English and Spanish have two sets of determiners, commonly referred to as detinue
and indefinite articles. In many respects they are comparable [my emphasis]; in others
they are different.
type of CAs will have its own type of TC thus providing grounds for a
taxonomy. Within each type it should be possible to distinguish more specific
subtypes, unique within each type. Each type of TC will determine a different
kind of equivalence, i.e. the relation obtaining between the compared items.
It will be seen, therefore, that semantic equivalence and translation equivalence,
discussed so far, are but two of seven types of equivalence that I would like
to distinguish. Thus in addition to statistical equivalence and translation equi-
valence, providing TC for text-bound CAs, we shall distinguish several types
of systematic CAs.
In CAs constrained by semanto-syntactic equivalence it is possible to com-
pare a) constructions, b) systems and c) rules. Of these three types only CAs
of constructions are directly based on semanto-syntactic equivalence, con-
straining 2-texts for the purposes of such CAs. The comparison of systems re-
quires dependence on syntagmatic considerations in the same way in which
any paradigmatic analysis is linked to syntagmatic considerations. For ex-
ample, isolating a particular system in a particular language requires an exam-
ination of syntagmatic arrangements of elements, i.e. constructions in which
those elements appear. In CAs, likewise, no equivalence of systems can be
ascertained without assessing the equivalence of constructions in which ele-
ments of those systems appear. Investigators usually compare equivalent sys-
tems across languages, basing on intuitions their decisions to juxtapose the
relevant words, thus employing the concept of system equivalence as TC.
It takes little reflection to realize that system equivalence can be made ex-
plicit only through the examination of constructions in which elements of the
compared systems appear, i.e. via the notion of semanto-syntactic equivalence
relating the relevant 2-texts as primary data.
Likewise, any comparison of rules cannot be divorced from an implicit
comparison of constructions on which these rules operate. Therefore, the
semanto-syntactic equivalence of constructions is the central concept in syn-
tactic CAs, whether or not they are extended to comparisons of systems and
rules (for more discussion see Krzeszowski 1976).
Phonological and lexical CAs cannot be based on semanto-syntactic equi-
valence. Such studies use yet a different type of TC, which can be called sub-
stantial in so far as it is connected with the material substance outside lan-
guage but associated with language through phonological and semantic inter-
face of language (Hjelmslev 1963).
In the case of phonological CAs acoustic, articulatory and in principle also
auditory phenomena provide this substantial TC. Most phonological CAs
make reference to articulatory parameters, less frequently to acoustic ones.
I know of no CA based on auditory parameters alone.
308 Tomasz P. Kizeszowski
In the case of lexical CAs, the external reality, or strictly speaking its
psychic image in the minds of language users, provides the substantial TC as
items across languages are compared with respect to differences and similar-
ities concerning their reference to various elements of the reality in the world
at large as it is reflected in the minds of language users.
Both phonological and lexical CAs are mainly paradigmatic, even if, espe-
cially in various kinds of generative frameworks, syntagmatic arrangements
are also comparable. However, in the case of phonological and lexical CAs we
deal with a situation which is a mirror image of semanto-syntactic studies:
paradigmatic studies are now central while syntagmatic studies are secondary
and impossible to conduct without the ultimate reference to substantial TC.
Much less rigorous contraints apply to 2-texts which are to be compared
with regard to their styles or registers. Any translation will do here as long as
it is functionally (pragmatically) equivalent. Naturally enough some idealiza-
tion of the data is necessary. For example, obviously erroneous translations
must be disregarded as well as translations which fail to convey adequately
some relevant pragmatic functions, especially if alternative, more adequate
translations are available. Although constraints on the suitability of 2-texts
for this sort of CA are different from those required of 2-texts as data for
semanto-syntactic CAs, they are equally important and must be stated clearly
and unequivocally, lest the CAs will fail to grasp the relevant generalizations
concerning pragmatic or stylistic aspects of the compared texts.
Pragmatic or function equivalence as TC for stylistic CAs is therefore a
relation that holds between the constituents of 2-texts selected in such a way
as to retain certain invariables guaranteeing that such 2-texts are +equivalent
functionally:
Since, however, contrasting does presuppose, apart from agreements and differences
under observation, also a certain common foundation, one proceeds from the fact that
the fundamental differentiation of styles holds in roughly the same way for all cultured
languages and that it makes itself felt in roughly the same tendencies even though not
always realized by the same means of expression. In other words: French scientific style
will be characterized by analogous stylistic tendencies as its counterpart in Czech, in the
same way as the basic features distinguishing the belles-lettres narrative style from
descriptive style will be the same in English as, say, in Italian. This is due to the impact
of social, i.e. extralinguistic communicative needs which e.g. in languages of the Euro-
pean cultural sphere in the given period bear on the whole an analogous character. That
is why we engage in comparing discourses of belles-lettres prose in various languages, or
of scientific style in various languages and so on. In this manner a certain common
foundation is gained upon which contrastive analyses of national styles can be built
(Beka 1978:131-132).
tinue to flourish (for some examples see Fillmore and Oleksy in the present
volume) a detailed synthetic discussion will hopefully become possible.
The problem of generalizations in CAs requires some comments. Stating
any equivalence relation is itself an act of generalization. Such generalizations
are of little practical value unless they are supported by quantitative data. For
instance, in the area of system comparisons both syntagmatic and paradigmatic
criteria lead to establishing systematic equivalence between Polish and English
personal pronouns: / - ja, you - ty/wy, they - oni/one, etc. Correct, as far
as it goes, this generalization is extremely inadequate as it does not say any-
thing about the relatively low frequency of occurrence of personal pronouns
in Polish texts in contrast with a relatively high frequency of occurrence of
personal pronouns in English texts.
Even more obviously systematic generalizations about equivalence prove
inadequate in those cases when nearly synonymous equivalents are available
in the compared languages, i.e. in the majority of cases. Systematic CAs reveal
structural contrasts and similarities but give no cues as to the actual occur-
rences of the compared phenomena in 2-texts. For instance, without quanti-
tative data there is no way of determing the relative frequencies of such con-
structions as extraposed vs. non-extraposed sentences, psych-moved vs. psych-
non-moved, particle moved vs. particle-non-moved, to mention just a few.
Intuitive judgements and preliminary research clearly show that the relative
frequencies of various types of constructions characterize native vs. non-native
performance in a language. Graczyk (MS) demonstrates the inclination of
Polish learners to overuse extraposed constructions. The relative frequency of
such constructions in the compositions of Polish learners was found to be
significantly higher than in the compositions of native speakers, who used
synonymous constructions, avoiding extraposition.
Various instances of congruence or near congruence with very low prob-
ability of occurrence in actual texts suggest that the significance of results
obtained from systematic CAs is not in any way related to the significance of
results obtained in quantitative text-bound CAs. Therefore, at least from the
pedagogical point of view, quantitative CAs cannot be neglected as they pro-
vide the necessary complementary information which cannot be ignored in
preparing teaching materials, working out remedial courses, and constructing
drills and exercises.
The various kinds of CAs associated with the respective types of equivalence
are presented in the following diagram:
310 Tomasz P. Krzeszowski
CAs
text-bound systematic
(2-texts equivalent given) (2-texts + equivalent available)
/
- translationally + translationally
structurally functionally
equivalent texts equivalent texts
constrained constrained
quantitative
corpus-restricted projective
substantial
equivalence
(7)
Note
References
Bolinger, D.
1966 "Transformulation: structural translation", Acta linguistica hafniensia 9:
130-144.
Beika, J. V.
1978 "Application of quantitative methods in contrastive stylistics", Prague studies
in mathematical linguistics 6 : 1 2 9 - 1 4 7 .
DuSkova, L.
1978 "The simple sentence in Czech and English", Prague studies in mathematical
linguistics 6 : 8 3 - 9 2 .
Fillmore, C.
this volume "Remarks on contrastive pragmatics".
Graczyk, I.
no date "On writing in English as a second language: verb phrase in advanced learners'
written English", part 1, unpublished MS.
Halliday, . A. K.
1970 "Language structure and language function", New horizons in linguistics,
edited by John Lyons (Harmondsworth: Penguin Books Ltd.), 1 4 0 - 1 6 5 .
Halliday, . A. K. - A. Mcintosh - P. D. Strevens
1964 The linguistic sciences and language teaching (London: Longmans).
Hjelmslev, L.
1943 Prolegomena to a theory of language [English version 1961], translated by
F. J. Whitfield (Mardison: The University of Wisconsin Press).
Ivir, V.
1969 "Contrasting via translation", YSCECP, B. Studies 1 : 1 3 - 2 5 .
1970 "Remarks on contrastive analysis and translation", YSCECP, B. Studies 2:
14-25.
James, C.
1980 Contrastive analysis (Harlow, Essex: Longman).
Kopczynski, A.
1980 Conference interpreting: some linguistic and communicative problems
(Poznan: Adam Mickiewicz University).
Krzeszowski, T. P.
1974 Contrastive generative grammar: theoretical foundations (Lodz: Uniwersytet
Lodzki, reprinted in 1979, Tbingen: Gunther Narr Verlag).
1976 "On some linguistic limitations of classical contrastive analyses",Papers and
studies in contrastive linguistics 4 : 8 8 - 9 5 .
1981a "The problem of equivalence revisited", IRAL 19.2:113-128.
1981b "Quantitative contrastive analysis", Studia Linguistica 3 5 : 1 - 2 , 1 0 2 - 1 1 3 .
312 Tomasz P. Kizeszowski
Lado, R.
1957 Linguistics across cultures (Ann Arbor: University of Michigan Press).
Lipinska-Grzegorek, M.
1977 Some problems of contrastive analysis: sentences with nouns and verbs of
sensual perception in English and Polish (Edmonton, Canada: Linguistic
Research, Inc.).
Liston, J. L.
1970 "Formal and semantic considerations in contrastive analysis", YSCECP,
B. Studies 2: 2 7 - 4 9 .
Oleksy, W.
this volume "Towards pragmatic contrastive analysis".
Politzer, R. L.
1968 "An experiment in the presentation of parallel and contrasting structures",
Language learning 1 8 : 3 5 - 4 3 .
Riilker, K.
1973 "Zur pragmatischen Invarianz bei der Translation", Neue Beitrge zu Grund-
fragen der bersetzungswissenschaft, edited by . Neubert and . Kade
(Leipzig: Athenum Verlag), 2 9 - 3 5 .
Rusiecki, J.
1976 "The development of contrastive linguistics", Interlanguage studies bulletin
1:12-44.
Spalatin, L.
1969 "Approach to contrastive analysis", YSCECP, . Studies 1:2635.
Stockwell, R. P. - S. D. Bowen - J. W. Martin
1965 The grammatical structures of English and Spanish (= Contrastive structure
series) (Chicago and London: The University of Chicago Press).
WOLFGANG KHLWEIN
1 General considerations
Ever since languages have been described and compared in the course of the
history of language studies some underlying applied motives can be traced.
Philosophical interests were behind the dispute between analogists and
anomalists in ancient Greece: if language could be proved to be ordered ac-
cording to a systematic plan, this insight could be taken as evidence for equal-
ly systematic principles underlying reality in its entirety - language being part
of reality, its investigation was supposed to serve as a key to the whole struc-
ture (or non-structure) of the world as cosmos vs chaos. Central issues of this
philosophically oriented kind of language study like the relationship between
form and meaning, topic and comment, syntactic structure etc. have made
their way into 20th century descriptive linguistics. It seems remarkable, how-
ever, that the gap between this kind of philosophical application and the
rhetorical and pedagogical applications underlying language studies of the
Alexandrian school had never been properly bridged. Theoretical application
as opposed to pedagogical applications of language study?
Similarly Panini's mathematical and statistical observations in phonology
and morphology had remained rather normative and descriptive than pedagog-
ical in spite of their ultimate applied aim of enabling his contemporaries to
acquire the practical skills to cope with the older Veda texts.
More recently the Neogrammarians' aim of yielding access to older stages
of European languages resulted in comparable descriptions of these stages,
lacking, however, systematic strategies of linguistic comparison in general, and
pedagogical strategies concerning e.g. reading-comprehension skills in particular.
Likewise, the comparative potential of the 17th century attempts at esta-
blishing universal grids as a mentalist basis for the descriptions of language(s)
had never been fully exploited methodically from a contrastive point of view
nor pedagogically from an applied perspective.
Obviously the gaps between theoretical study of language comparison and
language pedagogy, so frequently complained at in our days, are not that new
314 Wolfgang Khlwein
Starting out from the same observation, found after investigating syntactic
errors made by foreign language learners and comparing them to hierarchies of
difficulty established on the basis of contrastive linguistics Whitmann and Jack-
son came to the harsh conclusion: "Contrastive analysis [ . . . ] is inadequate
theoretically and practically, to predict the interference problems of a lan-
guage learner" because "Interference, or native-to-target language transfer,
plays such a small role in language learning performance that no contrastive
analysis, no matter how well conceived, could correlate highly with perform-
ance data, at least on the level of syntax" (1972:40).
Also in 1972 Richards provided further evidence for this rather pessimistic
outlook on the state of pedagogical limitations of contrastive linguistics of
that epoch when he compared some essential factors which set apart the
learning of e.g. English as a foreign language from the study of immigrant
varieties of English, indigenous-minority varieties of English, pidginization,
creolization, and local varieties. As within the latter varieties of English limi-
tations are supposed to be rather socially imposed than individually, what
may be regarded as errors from the point of view of a foreign language learn-
ing situation should rather be seen as deviance within the latter social contexts.
Thus the frequently claimed possibility of transferring pedagogical contrastive
insight from the foreign language learning situation to these other sociolin-
guistic situations of linguistic variety should be handled with care. If at all, ac-
cording to this study contrastive linguistics can have a bearing on the 'normal'
foreign language teaching situation only, where interlingual processes do not
have a specific social role but are considered as indicators of partial learning,
in other words, where more is depending on the construction of teaching
materials: "The study of interlingual phenomena in language learning thus
leads to a focussing on the central processes of second language acquisition,
and to the study of the circumstances which give these processes significance"
(Richards 1972:185).
Despite the role conceded to contrastive linguistics in the foreign language
teaching situation, the emphasis on the part played by non-linguistic 'circum-
stances' such as individual differences in perseverance, motivation, aptitude
etc. becomes obvious.
The respective differences which result from the settings of different learn-
ing situations also bear on the pedagogical use of the interlanguage hypothesis
which is frequently claimed to be the hinge of pedagogical contrastive gram-
mars. As recently as in 1980 John strongly advocates the recognition of
f i x e d 'approximative languages' in relation to learning situations, of
Though this statement makes clear that an applied contrastive grammar is not
yet per se a pedagogical grammar it nevertheless follows from Engels that a
pedagogical grammar must necessarily have an applied contrastive component:
Where then are its deficiencies, its limitations? This would certainly be the
wrong place to give a specific review of all existing contrastive projects,
scrutinizing their individual pedagogical potentials. Nor can an encompassing
discussion of contrastive linguistics within the framework theoretical vs ap-
plied linguistics be expected here. Nor will we be heading towards a (further)
idealized model to determine the place value of contrastive analysis within the
pedagogical grammar. What could, however, be attempted, is the pinpointing
of some crucial relationships between (foreign) language learning and teaching
and contrastive analysis as viewed from some current linguistic trends, and as
they might already have become obvious from the preceding survey.
2 Specific considerations
2.0 The following four sections will concentrate on the four areas outlined
in Fig. 1.
Foreign language
learning and Linguistics
Contrastive analysis
Linguistics
Contrastive analysis
Fig. 1
Pedagogical limitations of contrastive linguistics 319
Transcultural
transcultural
deep structure
country A country
culture culture y
L2
Fig. 2
But is this transcultural deep structure not just another kind of tertium com-
parationis - which we do not happen to have ( - yet)?
2.2 It is obvious that these theoretical problems concerning the overlap lin-
guistics/contrastive analysis bear most heavily on language teachers' potential
use of contrastive studies. Pedagogical applications can never make up for
theoretical deficits. Within the respective scale of estimation of contrastive
linguistics ranging from 'pedagogical panacea' to 'pedagogical irrelevance'
Helbig (1976:15) adopts a fairly realistic position:
a reverse procedure: setting out from a fairly large corpus of errors, linguistic
error analysis will set apart psychologically, motivationally etc. determined
errors from linguistically determined ones; among the latter ones contrastive
analysis serves as a tool to separate intralingual ones from those which are or
at least could be accounted for by interlingual factors. Practical experience
indicates that in many cases it is hard, if not impossible, to draw the distinc-
tion between 'are' and 'could be' (cf. Khlwein 1972). Thus, all contrastive
linguistics can do is to indicate where interlingually determined errors might
occur. Consequently pedagogical grammars on a contrastive basis should not
be taken as the core of course design, but will rather sharpen the teacher's eye
for diagnosis. After all as early as in 1972 Marton had gathered the experi-
mental evidence against Lado's early assumption 'similar = easy' (1972:122;
cf. also Kruppa 1975:98), as it had been reinforced e.g. by Upshur:
It can hardly be explained with contrastive linguistic means that many Dutch
pupils commit more errors in those cases where Dutch hebben and zijn cor-
respond to French avoir and 2tre than in those cases where they don't.
Within the limited use of contrastive analysis for diagnostic purposes con-
cerning errors which have potential interlingual origins, the use of contrastive
linguistic procedures can, however, be extended into the didactic area lying
between diagnosis and therapy by rendering clear devices for the framing and
patterning of exercises and drills. The role which can be played by contrastive
linguistics in the final stage, in therapy, seems to be more controversial. As a
consequence, stage-models (like Vilke's proceeding from mechanist to cogni-
tive drills; cf. Filipovic 1974:11 f.) or spiral models (like Lee's gradual widen-
ing of the range of circumstances of the use of the item to be learnt; cf. also
Lee 1969:58f.; cf. also Dirven et al. 1976:138ff.) usually are preferred to
all-or-none procedures. - Whether, or rather, to which extent contrastive
linguistics should serve as a presentation technique (Darstellungsmethode;
Helbig 1976:10) in foreign language teaching methodology mainly depends on
non-linguistic factors, as mentioned above, including motivation, age,attitude,
aptitude of the learners. More than for many other applied linguistic disci-
plines Pike's statement is valid for contrastive linguistics: linguistics is not so
much concerned with what is being served by the waiter but with what is go-
ing on in the kitchen! Parallel to the extent, however, to which the pendulum
in language pedagogy is swinging towards more cognitive procedures, the im-
mediate use made of contrastive linguistics in foreign language teaching
324 Wolfgang Khlwein
2.3 The more linguistic theory has opened towards contact and context fea-
tures determining language use, the more emphatically psycholinguists point
out that most contrastive linguistic studies are still being carried out without
a corresponding contact with the reality of language use and language acquisi-
tion, that contrastive linguistic research is carried out in vacuo, in abstracto
(cf. Slama-Cazacu 1971:190). Basically this objection seems to boil down to
the problem of psychological reality of the linguistic theories used by con-
trastive linguistics. Thus it is rather directed against linguistics in general than
against contrastive linguistics in particular. Whether, however, any study of
language should be carried out within an encompassing psycholinguistic frame-
work can neither be generally discussed let alone be solved here. It is true, the
more performance oriented linguistics is becoming, and the more performance
models underlie contrastive studies, the better it accounts for use and acquisi-
tion of language. At the same time the deplorable gap between linguistic
theqry and its application(s), though frequently quoted by language teachers,
shrinks. Instead of linguistic theory psychological (and/or behavioural) theories
yield the basis for language acquisition research, within which linguistic the-
ory has to grope for its proper problem oriented place. The distinction
between theoretical and applied linguistics dwindles, and correspondingly that
between theoretical contrastive linguistics and applied (e.g. pedagogical) con-
trastive linguistics. Such a view, close as it is to the requirements of social
reality, sounds desirable. It will certainly not dismiss applied contrastive
achievements as being wrong, e.g. the interlanguage hypothesis respectively
Pedagogical limitations of contrastive linguistics 325
References
Barrera-Vidal, A. - W. Khlwein
1975 Angewandte Linguistik fr den fremdsprachlichen Unterricht (Dortmund:
Lensing).
Catford, J. C.
1965 A linguistic theory of translation (London: Oxford University Press).
Chesterman, A.
1980 "Contrastive generative grammar and the psycholinguistic fallacy", Papers
and studies in contrastive linguistics 1 1 : 1 7 - 2 4 .
Clyne, M.
1979 "Communicative competences in contact", ITL 4 3 : 1 7 - 3 8 .
Corder, S. P.
1974 "Pedagogical grammars or the pedagogy of grammars?", Linguistic insights in
applied linguistics, edited by S. P. Corder and E. Roulet (Brussels, Paris:
AIMAV), 1 6 7 - 1 7 3 .
Coste, D., et al.
1976 Un niveau seuil (Strasbourg: Council of Europe).
Dimitrijevic, N. R.
1977 "Problems and implications of contrastive analysis of vocabulary and culture",
Papers and studies in contrastive linguistics 7 : 1 3 3 - 1 4 4 .
Di Pietro, R. J.
1971 Language structures in contrast (Rowley, Mass.: Newbury House).
1976 "Contrasting patterns of language use: a conversational approach", Canadian
modern language review 3 3 : 4 9 - 6 1 .
Dirven, R.
1976 "A redefinition of contrastive linguistics", IRAL 1 4 : 1 - 1 3 .
Dirven, R. - W. Hnig - W. Khlwein - G. Radden - J. Strau
1976 Die Leistung der Linguistik fr den Englischunterricht (Tbingen: Niemeyer).
Ebneter, T.
1976 Angewandte Linguistik 2 (Mnchen: Fink).
Pedagogical limitations of contrastive linguistics 329
Engels, L. K.
1979 "Pedagogicalgrammars", Glottodidactica 1 2 : 1 3 - 3 4 .
Esser, J.
1980 "Contrastive analysis at the crossroads of linguistics and foreign language
teaching", IRAL 1 8 : 1 8 1 - 1 9 2 .
Faerch, C.
1979 Research in foreign language pedagogy - the PIFProject (Copenhagen: Uni-
versity of Copenhagen).
Filipovic, R.
1972-1973 "The use of a corpus in contrastive linguistics", Studio Romanica et
Anglica Zagrebiensia 3 3 - 3 6 : 4 8 9 - 5 0 0 .
1974 "The use of contrastive and error analysis to practising teachers", YSCECP-
pedagogical materials 2, edited by R. Filipovic (Zagreb: University of Zagreb),
3-17.
Fink, S.
1981 "Some thoughts on international standardization of linguistic terminology in
contrastive studies", Festschrift in honor ofR. DiPietro, edited by E. Danesi
(Newark: University of Delaware Press).
Fisiak, J.
1973 "The Polish-English contrastive project", Papers and studies in contrastive
linguistics 1 : 7 - 1 0 .
Ghring, .
1980 "Deutsch als Fremdsprache und interkulturelle Kommunikation", Fremd-
sprache Deutsch 1, edited by A. Wierlacher (Heidelberg: Groos), 7 0 - 9 0 .
Halliday, . . K.
1978 Language as social semiotic. The social interpretation of language and mean-
ing (London: Arnold).
Hartmann, R. R. K.
1977 "Contrastive textology in descriptive and applied linguistics" (Ms.).
1981 "Contrastive textology and translation", Kontrastive Linguistik und berset-
zungswissenschaft. Akten des Internationalen Kolloquiums Trier/Saarbrcken
Sept. 2 5 - 3 0 , 1978, edited by W. Khlwein, W. Wilss and G. Thome (Mn-
chen: Fink), 2 0 0 - 2 0 8 .
Heibig, G.
1976 "Zur Rolle des kontrastiven Sprachvergleichs fr den Fremdsprachenunter-
richt (Mglichkeiten, Voraussetzungen, Grenzen)", Deutsch als Fremdsprache
13:9-16.
Hllen, W.
1969 "Zwanzig englische Kernstze und ihre deutschen quivalente. Versuch einer
kontrastiven Analyse", Die Neueren Sprachen 1 8 : 5 9 0 - 6 0 0 .
1973 Linguistik und Englischunterricht. Didaktische Analysen (Heidelberg: Quelle
und Meyer).
Jackson, H.
1976 "Contrastive linguistics - What is it?", ITL 3 2 : 1 - 3 2 .
Jger, G.
1973 "Kommunikative und funktionelle quivalenz", Linguistische Arbeitsberich-
te 7 : 6 0 - 7 4 .
James, C.
1971 "The exculpation of contrastive linguistics", Papers in contrastive linguistics,
edited by G. Nickel (Cambridge: Cambridge University Press), 5 3 - 6 8 .
Janicki, K.
1979 "Contrastive sociolinguistics - Some methodological considerations", Papers
and studies in contrastive linguistics 1 0 : 3 3 - 4 0 .
330 Wolfgang Khlwein
John, A. P.
1980 '"Approximative languages' and language learning situations", IRAL 18:
205-216.
Karpf, A. - B. Kettemann - W. Viereck
1980 "Phonology of dialect interference in second language learning", IRAL 18:
193-208.
Kruppa, U.
1975 "Kontrastive Analyse von Interferenzerscheinungen im deutsch-englischen
Bereich", Neusprachliche Mitteilungen 2 8 : 9 2 - 9 9 .
Krzeszowski, T.
1974 Contrastive generative grammar: theoretical foundations (Lodz: University
of Lodz).
Khlwein, W.
1972 "Fehleranalyse im Bereich des englischen Vokalismus", Fehlerkunde, edited
by G. Nickel (Berlin: Cornelsen, Velhagen und Klasing), 5 1 - 6 6 .
1975 "Grundsatzfragen der kontrastiven Linguistik", Neusprachliche Mitteilungen
28:80-92.
1980 "Angewandte Linguistik", Lexikon der Germanistischen Linguistik, 2nd
completely revised and enlarged edition, edited by H.-P. Althaus et al. (T-
bingen: Niemeyer), 7 6 1 - 7 6 8 .
1981 "Soziolinguistische Aspekte der Fremdsprachvermittlung und -aneignung",
Kongrebericht der VI. Internationalen Deutschlehrertagung vom 4.-8. Au-
gust 1980 in Nrnberg,edited by . Brckner (Berlin: Langenscheidt),472-475.
Khlwein, W. - W. Wilss
1981 Kontrastive Linguistik und bersetzungswissenschaft", Kontrastive Lingui-
stik und bersetzungswissenschaft. Akten des Internationalen Kolloquiums
Trier/Saarbrcken Sept. 2 5 - 3 0 , 1978, edited by W. Khlwein, W. Wilss and
G. Thome (Mnchen: Fink), 7 - 1 7 .
Lee, W. R.
1969 "How can contrastive linguistic studies help foreign-language teaching?",
YSCECP-Studies 1, edited by R. Filipovic (Zagreb: University of Zagreb),
57-66.
1971 "Types of interference and contrasting - the kinds of research needed",
PAKS: 1 3 - 2 6 .
Lieb, H.-H.
1980 "Integrational linguistics as a basis for contrastive studies",Papers and studies
in contrastive linguistics 1 2 : 5 - 3 6 .
Markkanen, R.
1979 Tense and aspect in English and Finnish. A contrastive study (Jyvskyl:
University of Jyvskyl).
Marton, W.
1968 "Equivalence and congruence in transformational contrastive studies",
Studia Anglica Posnaniensia 1 : 5 3 - 6 2 .
1972 "Pedagogical implications of contrastive studies", Studia Anglica Posnaniensia
4:115-126.
Matteis, M. de
1981 "Zum gesprochenen Italienisch schulpflichtiger Migrantenkinder", Kongre-
bericht der VI. Internationalen Deutschlehrertagung vom 4.-8. August 1980
in Nrnberg, edited by . Brckner (Berlin: Langenscheidt), 4 8 8 - 4 9 1 .
Richards, J. C.
1972 "Social factors, interlanguage, and language learning", Language learning 22:
158-188.
Roos, .
1975 "Kontrastive Lexikologie", Contributions to applied linguistics (1), edited by
C. Gutknecht (Bern, Frankfurt/Main: Lang), 8 9 - 1 0 3 .
Pedagogical limitations of contrastive linguistics 331
Schwarze, C.
1975 "Empirische Probleme des Sprachvergleichs", Linguistische Berichte 35:
10-24.
Slama-Cazacu, T.
1971 "Psycholinguistics and contrastive studies", Zagreb Conference on English
Contrastive Projects, 7-9 December, 1970, edited by R. Filipovic (Zagreb:
University of Zagreb), 1 8 8 - 2 0 6 .
Spillner, .
1972 "Kontrastive Analysen auf der Grundlage von Comic Strips", Papers from
the International Symposium on Applied Contrastive Linguistics, Stuttgart,
Oct. 11-13, 1971, edited by G. Nickel (Bielefeld: Cornelsen, Velhagen und
Klasing), 2 7 - 4 1 .
1978 "Methoden der kontrastiven Linguistik in der Frankreichkunde", Landes-
kunde und Fremdsprachenunterricht, edited by . Arndt and F. R. Weller
(Frankfurt: Diesterweg), 1 5 1 - 1 7 8 .
Upshur, J. A.
1962 "Language proficiency testing and the contrastive analysis dilemma", Lan-
guage learning 1 2 : 1 2 3 - 1 2 7 .
Vater, .
1973 "Probleme der kontrastiven Grammatik", reproduced by L.A.U.T., Series ,
No. 12 (Trier: Linguistic Agency University of Trier).
Whitman, R. L. - K. L. Jackson
1972 "The unpredictability of contrastive analysis", Language learning 2 2 : 2 9 - 4 2 .
Zabrocki, T.
1976 "On the so-called 'Theoretical Contrastive Studies'", Papers and studies in
contrastive linguistics 4 : 9 7 - 1 0 9 .
LEWIS MUKATTASH
1 Introductory
This study reports on the results of a test in English grammar that was given
to 4,835 Arab learners of English. Furthermore, it seeks to establish patterns
of difficulty in foreign language learning (English in this case), and to show
that neither contrastive analysis (CA) nor the text-bound type of error ana-
lysis (EA) can predict with any reasonable degree of adequacy the relative
degree of difficulty which foreign learners of a particular NL will encounter
in the process of learning and producing a given TL, there being a multitude
of psychological, social and educational factors that interact simultaneously
and thus affect the overall linguistic achievement of the learner.
In this study 'difficulty' is assumed to be reflected in 'error'. However, an
attempt will be made to relate difficulty to both 'linguistic difference' be-
tween NL and TL and to 'formal marking' in TL. For other interpretations of
the term 'difficulty' and for objections to equating difficulty with error, 1 see
Kellerman(1977; 1979) and the references he cites; see also Schachter(1974);
James (1977); Eckman (1977).
The present study does not purport to raise any theoretical issues and
makes no novel claims as to the psychology of foreign language learning, for
it is our contention that as Schouten (1979:4) points out, "what is wrong
with second language learning research is the fact that its mainstream is based
on theories that are supported by no facts . . . " . The only commonplace as-
sumptions that are made here are the following:
1. In learning a certain TL, some linguistic structures will be more difficult
than others for speakers of a particular NL.
2. Difficulties in foreign language learning are primarily of two types: first,
those which are typical of speakers of a given NL, and secondly, those which
are shared by all learners of a given TL regardless of their language backgrounds.
334 Lewis Mukattash
2 Test
The test upon which this study is based was one of several test-types used to
discover and describe the prevalent grammatical errors in Jordanian English. It
consisted of 150 multiple-choice items covering eleven areas of English gram-
mar, and was based on the English language syllabus used in public secondary
schools in Jordan. All attempts were made to minimize the influence of vocab-
ulary unfamiliarity by using only words having a relatively high frequency in
the textbooks. For details about the construction of the test, and many other
aspects (e.g. pretesting, validity, reliability, etc.), the reader is referred to
Mukattash (forthcoming-b).
Before presenting the results of the test, a few points should be mentioned
in connection with the test itself:
(i) As pointed out above, the test was originally designed (together with
other test-types) to discover, describe and justify prevalent grammatical mis-
takes in Jordanian English (see Mukattash 1977, 1978, 1979, 1980, forth-
coming-a). In other words, the test was not intended to establish a hierarchy
of difficulty for Jordanian or Arab learners of English.
(ii) The limitations and restrictions imposed by the exclusive use of one test-
type should be obvious to any researcher in the field of foreign language
learning. Indeed, the conclusions and generalizations drawn from the results
of one test-type should be treated with caution.
(iii) The test does not measure the learners' production in as much as it mea-
sures their 'comprehension' and shows their ability to choose among alter-
natives, and thus it might be said to reflect their judgement about accept-
ability or grammaticalness. However, we argue below that there seems to be
a high correlation between the learner's receptive and productive competence.
To quote Chomsky(1971:37), " . . . the grammar that represents the speaker's
competence is, of course, involved in both his speaking and interpreting of
speech. . . " .
(iv) The choice of the distractors in each item was informed by the results of
previous studies in error analysis (Mukattash 1977, 1978, 1980, forthcom-
CA, error analysis and learning difficulty 335
ing-a), and thus it could be argued that the distractors biased the results, fcr
had there been different distractors, other choices would have been possible.
In spite of these shortcomings, the test has some positive aspects that
merit comment. First, the test was given to a very large number of students
and it has a very high reliability and validity, and on these grounds only it can
be expected to be informative to a very large extent. Secondly, there is a high
correlation between error-types gathered from this test and those gathered
from other test-types. Thirdly, as we shall see below, the difference in per-
formance amongst the various groups of learners who took the test is almost
consistent within each of the eleven grammatical areas that constitute it (see
Table 1, p. 337), a further indication of its validity. Finally, the problem of
avoidance (see Schachter, 1974) does not arise here, for students were forced
to choose an alternative even if they were not sure of the correct choice. As
such, we might add that information about learners' 'hunches' (see Kellerman,
1977:86) entered into the data.
3 Subjects
As pointed out above, the test was given to 4,835 students representing vari-
ous groups of learners of English in Jordan. Only five groups of learners are
relevant to this study:
I. First Group (GI): consisted of 56 fourth-year students in the Department
of English at the University of Jordan (J.U.). They had had eight years of in-
struction in English at school (56 hours a week), and four years of instruc-
tion in English (viz. practical English, language and linguistics, and literature)
at the university level.
II. Second Group (Gil): consisted of 114 final year students from 'foreign
private' secondary schools. They had begun learning English at the age of five
or six and at the time of the administration of the test they had had 1 2 - 1 3
years of instruction in English.
III. Third Group (GUI): consisted of 194 second-year English majors from
five Teachers' Training Institutes (T.T.I's). They had had eight years of in-
struction in English at school, and two years at their respective institute
(courses in practical English, linguistics and methodology). The test was ad-
ministered to them a few weeks before they graduated.
IV. Fourth Group (GIV): consisted of 1095 final year students from public
secondary schools in the city of Amman. They had had eight years of instruc-
tion in English.
V. Fifth Group (GV): consisted of 1841 final year students from public schools
in the various districts in Jordan (excluding Amman city). They had had the
same years of instruction in English and the same syllabus as GIV above.
336 Lewis Mukattash
The rationale behind choosing these five groups is as follows. First, GI and
Gil are considered to be the best in terms of language proficiency (see Mukat-
tash, forthcoming-b), and the two groups are comparable in terms of the
amount of instruction in English they had had. Secondly, GUI is meant to
show the linguistic difficulties which the English language teachers themselves
have. Thirdly, GIV and GV represent the majority of English language learners
in Jordan, and the two groups are comparable in terms of the amount of in-
struction they had had in English (8 years) and in terms of syllabus and method
of teaching.
The testees are all native speakers of Arabic, and though they speak dif-
ferent regional dialects, basically Jordanian and Palestinian Arabic, the differ-
ences between the two dialects are not sufficient on the syntax level as to
warrant differentiation between the two groups. At the time of the admin-
nistration of the test, all the testees had had a minimum of eight years of in-
struction in English.
4 Facility indices
Table 1 shows the average facility index (I.F.) of each of the eleven grammat-
ical areas for the entire population of the testees (first column) as well as for
each of the five groups listed in the preceding section. The grammatical areas
are arranged in descending order of difficulty (with respect to the 4,835 tes-
tees). 2 The table further shows the number of the items tested under each
grammatical area. (G stands for the entire population of testees, i.e. 4,835).
Before discussing the results of the test in detail, it is convenient to make a
general observation about the overall results listed in Table 1 (see the first
column). First, it will be seen that there is no appreciable difference between
the facility indices of any two adjacent grammatical areas 3 listed in Table 1:
the lowest being 0.37 and the highest 0.56. Thus if we discard a difference of
0.05 in the facility index, we can group the eleven grammatical areas into
three main groups: (i) quantifiers, adverbs, and verbals; (ii) sentence connec-
tors, nominals, prepositions, modals and relatives; (iii) adjectivals, articles and
pronouns. This seems to suggest that it is not feasible to talk about the relative
difficulty of an entire TL grammatical area (e.g. prepositions) for learners of
a particular NL. This statement does not hold true if the entire TL grammat-
ical category has no corresponding categories in L! (viz. the absence of articles
in some languages).
Returning to the 'scale' of difficulty suggested by the figures in Table 1,
one would assume that if difficulty is to be equated with linguistic difference 4
(Lado 1957), it is to be expected that this scale should be the same for the
CA, error analysis and learning difficulty 337
Table 1
average I.F.
areas and
no. of items
G GI Gil GUI GIV GV
various groups of testees since they share the same NL. In fact, this is not the
case. A brief look at the figures in Table 1 will convince us that different
scales of difficulty 5 emerge for the different groups of learners, for only in
the case of quantifiers and pronouns do we find agreement amongst the five
groups of learners with respect to the relative degree of difficulty. For instance,
verbals rank third on the suggested scale of difficulty in the case of three
groups, and second for one group, but sixth for another group. This suggests
that it does not seem to be possible to talk of 'scales of difficulty' for learners
of a particular TL even if they have the same linguistic background, the same
years of instruction in TL and the same syllabus (e.g. GIV and GV). This
statement is further reinforced by the fact that the scale suggested in Table 1
338 Lewis Mukattash
differs in various respects from other scales that were arrived at through the
use of other testing techniques; difficulty being equated with error in all cases
(see Mukattash 1977; 1978). This variation amongst the various groups of
learners with respect to difficulty is a function of a multitude of psycholog-
ical, social and educational factors (see Nickel 1971; 1980; Mukattash forth-
coming-!)).
Linguistic difference is just one factor that contributes to difficulty in for-
eign language learning. Amongst the factors that emerge in this study is the
educational background. For example, members of Group IV and Group V
followed exactly the same syllabus and had exactly the same years of instruc-
tion in English. Furthermore, the two groups used the same textbooks and
experienced exactly the same method of teaching. Yet, the overall perform-
ance of Group IV on the test is significantly higher than that of Group V.
Furthermore, the results of the two groups with respect to the relative degree
of difficulty are distinctly different. For instance, nominale rank fourth on
the suggested scale of difficulty (see Table 1) for Group IV and seventh for
Group V. Similarly, whereas adjectivals rank sixth for Group IV, they rank
ninth for Group V. The only explanation for this discrepancy between the
results of the two groups seems to reside in the quality of English language
instruction which the two groups had had. Schools in the Amman area (i.e.
the Capital) attract the best teachers, at least as far as command of English is
concerned. 6 To further explain the point under discussion, let us consider the
figures in the fourth column in Table 1, i.e. average I.F. of second-year Eng-
lish majors from five T.T.I's. As pointed out above, the testees were given the
test a few weeks before they graduated to become teachers of English in
primary schools, especially in the outlying districts in Jordan. If the teachers
themselves commit all types of errors, it follows that they will inculcate in
the learners such linguistic errors and shortcomings as they themselves have
fossilized.
The nature and type of instruction in foreign language also plays a notice-
able role with respect to difficulty. Members of Group I (i.e. English majors
at J.U.) and Group II (i.e. final year students from foreign private schools)
had had almost the same amount of instruction in English at the time of the
administration of the test. Their results, nonetheless, indicate that they differ
with respect to difficulty. For instance, whereas modals rank third on the
scale of difficulty for GI, they rank sixth in the case of Gil. Furthermore, we
notice that members of GI have less difficulty in all the grammatical areas
tested. The only explanation that can be offered is that at the time of admi-
nistration of the test, university students (i.e. Group I) had had two courses
in English grammar (mainly traditional) at least, and thus they might be said
to have had more awareness of the facts of English grammar. Maturity might
CA, error analysis and learning difficulty 339
5 Linguistic difference
has the present perfect progressive. The basic senses expressed by the English
present progressive (viz. temporariness, duration, futurity, etc.) are expressed
by means of the verb yaktubu, which corresponds formally to the English
simple present. As the English present perfect has no corresponding verbal
form in Arabic, the senses expressed by it are partially expressed by the Arabic
past form of the verb: viz. kataba. The senses expressed by the present per-
fect progressives are, however, partially expressed by the verb yaktubu, which
corresponds formally to the simple present and semantically (i.e. use) to both
the simple present and the present progressive.
This information about the correspondences between English and Arabic
present tenses 8 is represented diagramatically in Table 2, where FC stands for
formal correspondence; SC for semantic correspondence or strictly speaking
'use'(see Widdowson 1978:121); and for partial semantic correspondence.
Table 2
In contradistinction to the present verbal form, the past verbal form in Arabic
combines with the morpheme kan ('be' in the past) to express anteriority in
the past, and thus might be said to correspond formally and semantically to
the English past perfect: viz. kana taraka 'he had left'. Similarly, the English
past progressive has a corresponding verbal form in Arabic which consists of
the sequence kana + V (inf) (e.g. kana yaktubu 'he was writing'). The English
past perfect progressive does not, however, have any corresponding verbal
Table 3
form in Arabic. The basic sense associated with it is partially expressed by the
use of the Arabic past progressive (viz. kanayaktubu 'he was writing'). These
correspondences are diagrammatically represented in Table 3.
If linguistic difference is taken to be the basic source of difficulty in foreign
language learning and production, then we would expect the following diffi-
culties for Arab learners of English with respect to facts of the verbal system
discussed in the preceding two paragraphs:
1. The past perfect progressive, the present perfect progressive, the present
perfect and the present progressive, lacking formal correspondence in NL, will
prove to be the most difficult verbal forms for Arab learners.
2. The simple present having both formal and semantic correspondence will
be easier than the present perfect, the present progressive and the present per-
fect progressive.
3. The simple past, the past perfect and the past progressive, having both
formal and semantic correspondence in NL, will not prove to be difficult.
Table 4 shows the average I.F. of each of the verbal forms included in the
test. The first column shows the average I.F. for all the 4,835 testees, whereas
Table 4
average I.F.
category
G GI Gil
the second and third columns show the average I.F. of these forms for GI (stu-
dents from J.U.) and Gil (students from foreign private schools) respectively.
A careful study of the figures in Table 4 shows us that many of the predic-
tions made by CA are in fact true. First, the present perfect progressive and
the past perfect progressive prove to be the most difficult categories (I.F. 0.37
and 0.31 respectively). Secondly, the present perfect is more difficult than
both the simple present and the present progressive (I.F. 0.46,0.63 and 0.56
respectively). Thirdly, the simple present is easier than the present perfect,
the present progressive and the present perfect progressive.
The following cases cannot, however, be accounted for in terms of CA pre-
dictions, and constitute empirical confirmation that not all predictions made
by CA are valid:
1. The past perfect, which has both formal and semantic correspondence,
turned out to be more difficult than the present perfect, which has no formal
correspondence in NL (I.F. 0.38 and 0.46 respectively).
2. The past progressive, which has both formal and semantic correspondence
in NL, turned out to be more difficult than the present progressive, which
lacks formal correspondence (I.F. 0.47 and 0.56 respectively).
3. The simple past turned out to be more difficult than the past progressive
(I.F. 0.42 and 0.47 respectively) when both have formal and semantic cor-
respondences in NL.
Like the figures in Table 1, the figures in Table 4 clearly demonstrate that
the relative degree of difficulty of TL structures varies from one group to an-
other. For instance, while the past perfect progressive proves to be the most
difficult for the entire population (i.e. the 4,835 testees), the present perfect
progressive ranks second on the suggested scale of difficulty for the entire
population, but it ranks fourth in the case of GI (English majors at J.U.) and
Gil (students from foreign private schools). Similarly, while the present per-
fect ranks seventh in the case of GI, it ranks fifth in the case of Gil. In fact if
we examine the figures in the second and third columns (i.e. GI and Gil), we
notice that the facility indices of all the verbal categories are consistently
higher in the case of GI. The difference apparent in Table 4 (between GI and
Gil) was confirmed by a ' t ' test carried out on the means of the two groups;
the value o f ' t ' came out at 3.138, significant at 0.05. The only explanation
that one can think of is that members of GI, as pointed out above, had had at
least two courses in English grammar at J.U. prior to the administration of
the test. A major component of one of these grammar courses deals exclusive-
ly with the English verbal system.
It is quite obvious from the data presented in this section that not all the
predictions made by CA are valid. This is not by any means a new finding, for
it has been repeatedly demonstrated that CA predicts difficulties that do not
CA, error analysis and learning difficulty 343
occur and fails to account for all of the learning problems that occur in the
classroom (see Lee 1968; Whitman et al. 1972; see also Schachter (1974:206)
and the references she cites).
6 Formal marking
I argue elsewhere (Mukattash 1980) that the more formal marking 9 a verbal
form incorporates, the more difficult it will be for Arab learners of English
regardless of the formal and semantic correspondences between NL and TL.
I also argue that this is almost always the case except possibly in the case of
the present progressive, which is introduced at the very beginning of the Eng-
lish language syllabus in elementary schools and which has a reasonably high
frequency of occurrence in the textbooks, particularly in the initial stage.
A comparison of the figures in Table 4 will testify to this assumption. The
only exception in this study is the past progressive, which turned out to be
easier than the simple past (I.F. 0.47 and 0.41 respectively).
The principle of formal marking also accounts for the students' avoidance
of the marked verbal forms in both speech and writing, as exemplified by
their preference of the simple past and the simple present. Table 5 shows the
frequency of occurrence of the various verbal forms (appropriate and errone-
ous) which were encountered in fifty essays written by first-year students at
the University of Jordan (see Mukattash 1978:255).
Table 5
The figures in Table 5 are self-explanatory. The only point that is worth men-
tioning in this respect is that error counts based on the study of free compo-
sitions are misleading if the correlation between the frequency of use and the
frequency of errors is not calculated. This is one of the serious objections to
EA, which often, being text-bound, fails to account for the avoidance phe-
nomenon (see Schachter 1974). Indeed, the figures in Table 5 show that there
is not a single occurrence of the past perfect progressive in the fifty essays
and that there is only one occurrence of the present perfect progressive,
when - in fact - the present study as well as other test-types show us that
these two verbal forms are the most difficult for Arabs.
The principle of formal marking also accounts for the students' preference
of the active form of the verb, and their tendency to avoid the passive in both
speech and writing. Apart from two counter-examples (the simple past and
the past perfect), the figures in Table 6, which are extracted from the present
study, show that the average I.F. of the active verbal form is higher than its
corresponding passive form (for the entire population of testees).
Table 6
average I.F.
category
active passive
(iii) In 196 cases (39.2%) students used the active form of the verb in those
sentences that incorporated an explicit agentive phrase: viz. The windows
must clean by the workers daily.
(iv) In 124 cases (24.8%), and in an attempt to avoid the use of passive sen-
tences, students shifted the agentive phrase from sentence final position to
initial position, thus converting the passive sentence into an active one. 11
What is interesting to note in this respect is the fact that when the agent was
not explicitly expressed in the Arabic text, some students reconstructed a
generic agent and rendered the passive sentences in Arabic as active sentences
in English: viz. Samira gets angry when she is asked to do her duty (literal
translation), rendered as: Samira gets angry when someone asks her to do her
duty.
The data gathered from this translation test clearly demonstrate that Arabs
have a tendency to use active sentences even in contexts that require the use
of the passive.
It ought to be mentioned, however, that as in the case of linguistic differ-
ence, the principle of formal marking does not account for the whole range of
difficulties, there being variations amongst the various groups of learners. The
figures in Table 4 show us that in the case of GI, the past progressive is easier
than the simple past: I.F. 0.93 and 0.86 respectively. Notice further that in
the case of GI, the average I.F. of the present perfect progressive is higher
than that of the simple past (I.F. 0.90 and 0.86 respectively).
7 Concluding remarks
Notes
* I would like to thank Carl James for reading and commenting o n an earlier draft of
this paper. I alone am responsible for any errors in the text.
1. The following quotation from Kellerman (1979:43) shows the extent of the contro-
versy :
. . we have no satisfactory definition of 'difficulty'. It is not clear whether linguistic
difference = difficulty, and whether the degree of difference = the degree of diffi-
culty, whether error = difficulty or whether markedness hierarchies can predict diffi-
culty. Nor is it clear whether difficulty is something experienced by the learner or
may be something hypothetically attributed to him when he makes mistakes or
avoids."
2. The average facility indices of the various grammatical areas for the entire population
(i.e. 4,835 testees) vary in various respects from those for the five groups listed in
Table 1. First, in the case of the five groups, modals turn out to be more difficult
than prepositions (I.F. 0.61 and 0.62 respectively). Secondly, nominals and connec-
tors have the same I.F. (0.60), and so do adjectivals and relatives (0.65). This is also
true of articles and pronouns (I.F. 0.66). See, however, footnote 3.
3. Admittedly, it is not satisfying to simply state the rank of each grammatical area on
a scale of difficulty. In fact this may be misleading, particularly in the case where
CA, error analysis and learning difficulty 347
there is no noticeable difference between the average facility indices of two or more
grammatical areas. To illustrate this point, let us compare the average I.F. of sentence
connectors with that of nominals (viz. 0.44 and 0.45 respectively). This should not
be interpreted to mean that sentence connectors are more difficult than nominals.
This observation is based on the fact that the average I.F. of one of the items testing
the use of connectors was found to be 0.19; whereas the lowest I.F. of the items test-
ing the use of nominals was 0.34. This means that the one item whose I.F. was 0.19
lowered the average I.F. of connectors, thus making it lower than that of nominals.
This difference is, in fact, more apparent than real.
4. Orthodox CA (see Lado 1957; Stockwell et al. 1965) views learning difficulty and
linguistic difference as being directly and proportionally related. The view of equating
difficulty with linguistic difference has been repeatedly discredited and criticized on
theoretical, methodological and empirical grounds. For a summary of the basic ob-
jections to CA, see Tran-Thi-Chau (1975:124); see also Lee (1968), and Whitman et
al. (1972).
5. The most well-known and explicit hierarchy of difficulty based on CA is that pro-
posed by Stockwell, Bowen and Martin (1965) in The grammatical structures of Eng-
lish and Spanish. For a criticism of the hierarchy see: Nickel (1971), Whitman et al.
(1972), and Tran-Thi-Chau (1975).
6. Being the Capital, Amman also provides students with more opportunities for mixing
with native speakers of English than any other place.
7. In colloquial Jordanian and Palestinian Arabic, the English present progressive has a
corresponding verbal form that consists of the sequence lam+V. The morpheme
1am does not occur in Standard Arabic, and its linguistic status is still unkown. Mit-
chell (1978:238) refers to it as 'particle'. This might be another reason why the pre-
sent progressive does not constitute a difficulty for Jordanian learners of English.
8. All the verbal inflections in Arabic which relate to number, gender and person will be
ignored here.
9. It could be the case that what is involved here is what Tran-Thi-Chau (1975:133)
refers to as the 'systematic complexity' of the English verb system.
10. Eckman (1977:321) wrongly points out that Arabic does not have passive sentences
with expressed agents.
11. As in the case of English, agentive phrases in Arabic occur in the final position of
passive sentences.
12. Eckman (1977:320) defines markedness in the following way: "A phenomenon A in
some language is more marked than if the presence of A in a language implies the
presence of B; but the presence of does not imply the presence of A." His basic
assumption is that areas of TL which differ from NL will be difficult if they are more
marked than NL, but they will not be difficult if they are not more marked.
References
Chomsky, Noam
1971 "Formal discussion of Wick Miller and Susan Ervin's: The development of
grammar in child language". The acquisition of language, edited by Ursula
Bellugi and Roger Brown (Chicago: The University of Chicago Press), 3 5 - 3 9 .
Eckman, Fred R.
1977 "Markedness and the contrastive analysis hypothesis", Language learning
27.2:315-330.
James, Carl
1977 "The ignorance hypothesis in interlanguage studies", Interlanguage studies
bulletin 2 . 2 : 1 5 2 - 1 6 5 .
348 Lewis Mukattash
Kellerman, Eric
1977 "Towards a characterisation of the strategy of transfer in second language
learning", Interlanguage studies bulletin 2 . 1 : 5 8 - 1 4 5 .
1979 "The problem with difficulty", Interlanguage studies bulletin 4 . 1 : 2 7 - 4 7 .
Lado, Robert
1957 Linguistics across cultures (Ann Arbor, Michigan: The University of Chicago
Press).
Lee, W. R.
1968 "Thoughts on contrastive linguistics in the context of language teaching",
Report on the nineteenth annual round table meeting on linguistics and lan-
guage studies, edited by James E. Alatis (Washington, Georgetown University
Press), 1 8 5 - 1 9 4 .
Mitchell, T. F.
1978 "Educated spoken Arabic in Egypt and the Levant, with special reference
to participle and tense", Journal of linguistics 1 4 : 2 2 7 - 2 5 8 .
Mukattash, Lewis
1977 Problematic areas in English syntax for Jordanian students (Amman: The
University of Jordan).
1978 "A pilot project in common grammatical errors in Jordanian English", Inter-
language studies bulletin 3.2:250-291.
1979 Further studies in Jordanian English (Amman: The University of Jordan).
1980 "Yes/No questions and the contrastive analysis hypothesis", ELTJ. 34.2:
133-145.
forth-a " ^ - q u e s t i o n s in English: a problem for Arab students", to appear i n I R A L .
forth-b "English language proficiency in Jordan: educational, socio-economic and
other factors", to appear in Dirasat (The University of Jordan).
Nickel, Gerhard
1971 "Variables in a hierarchy of difficulty", Working papers in linguistics 3.4:
185-194.
1980 "Some pedagogical implications of error analysis and contrastive linguistics",
Tijdschrift van de VUB 2 1 : 6 0 - 7 0 .
Palmer, F. R.
1974 The English verb (2nd edition) (London: Longman).
Schachter, Jacquelyn
1974 "An error in error analysis", Language learning 2 4 . 2 : 2 0 5 - 2 1 4 .
Schouten, . . H.
1979 "The missing data in second language research", Interlanguage studies bulletin
4.1:3-14.
Stockwell, R. P. - J. D. Bowen - J. W. Martin
1965 The grammatical structures, of English and Spanish (Chicago: The University
of Chicago Press).
Tran-Thi-Chau
1975 "Error analysis, contrastive analysis, and students' perception: a study of
difficulty in second-language learning", IRAL 13.2:119-143.
Whitman, Randal L. - Kenneth L. Jackson
1972 "The unpredictability of contrastive analysis", Language learning 22.1:29-41.
Widdowson, H. G.
1978 Teaching language as communication (Oxford: Oxford University Press).
WIES LAW OLEKSY
the respective theories, i.e., the basic units of analysis. The form-centered
theories have been sentence-bound and syntactic considerations have been
prevailing in the research of the representatives of this linguistic orientation.
The above is also true of the transformational tradition, even though it re-
presents more interest in idealizations: rules are said to account for the native
speaker's competence.
There is a general agreement as to the validity of the study of language in
its sociocultural setting, both among the theoretical linguists and philosophers
of language as well as those concerned with language learning/teaching. How-
ever, little has been written on the nature of processes involved in the study
of social setting of language from the contrastive point of view.
In this paper I wish to propose that for CA to be able to cope with lan-
guage use in a sociocultural setting it must be based on or, at least, related to
linguistic pragmatics understood very broadly as the science of language use
(Haberland - Mey 1977). In other words, I wish to propose a model of Prag-
matic Contrastive Analysis (henceforth PCA). Because of the preliminary
nature of the enterprise, the limitations of this paper, and the lack of empir-
ical research in the field, this paper should be treated as a handful of sugges-
tions pointing to some possibilities rather than offering proofs or definite ana-
lyses. The model attempted here is programmatic rather than normative.
1 A model of PCA
1.1 LC comprizes what Bach and Harnish (1979:3) call 'Utterance Act'and
'Locutionary Act'. I shall not devote much attention to LC for reasons of
scope but it is worth noting that anyone who wishes to account for the locu-
tionary content of an expression used by a speaker S on a given occasion will
have to solve the problem of the choice of the apparatus that adequately ac-
counts for the expression in L 1 and, in addition, allows for a contrastive treat-
ment of the equivalent expression in L 2 . That is to say, the analyst will have
to decide on a particular type of contrastive grammar.
It can be added in passing that LC should account for both the syntactic
(including phonological) and the semantic characterization of expressions in
L! and L 2 . Typically, the term "expression in Lj and L 2 " will refer to a sen-
tence. Needless to say, most existing CAs have been limited to what I call
here LC. Furthermore, the existing CAs based on generative grammars have
been constructed so as to account for ideal objects, i.e., well-formed sentences.
This "virtue" of CA inherited from generative grammars makes the former
and the latter a rather inadequate mechanism to deal with linguistic com-
munication which, as is well known, allows for incorrectness. For example,
(1) and (2) below can both express the speaker's intention:
1.2 The Illocutionary Component. The basic unit of analysis in Ill.C is the
communicative act. Generally speaking, communicative acts can be realized
352 Wiesiaw Oleksy
usually coined after the predicate verb which occurs in the performative
clause.
b. Performative clause does not occur overtly but there are other linguistic
means of signalling IF, e.g., the occurrence of please can be said to signal
requests (cf. Sadock 1974:120). Intonation can also be used as a device to
signal the type of SA being performed, (a) and (b) above can be cases of
the literal use of an expression by S, unless (a) is the case of an embedded
performative or a hedged performative. If the latter is the case then S's ut-
terance can not be interpreted literally and the assignment of IF must be
done as is indicated in (c) below.
c. Performative clause does not occur overtly in S's utterance and there are
no linguistic signals present in S's utterance indicating IF. If (c) is the case
then S's utterance can be interpreted literally or nonliterally. In the latter
case, it is claimed, the identification of SA which S has performed will be
possible on the basis of FC and SC, and/or on the basis of what Grice called
implicature.
Felicity conditions (FC) are conditions which guarantee that an SA per-
formed by S in communicative context (CC) is effective, i.e., it expresses S's
intention to the addressee A who, in turn, decodes S's intention. Therefore,
the essential constituents which are present in defining FC are the following:
a. S's assumptions about himself and about A.
b. S's assumptions about the empirical setting in which the performance of
SA takes place.
Now I will propose FCs for the speech act Question (FCQ) which are based
on Oleksy 1979a and 1979 b.
As was said earlier in this paper, FC's are what the speaker assumes about
the addressee, about the content of what is being communicated, and about
the empirical setting in which an SA occurs. Judgments concerning felicity of
an SA rely heavily on general principles of conversation as well as on the
unique conditions for the SA being performed by the speaker at a given time
and in a given context. All this amounts to saying that SAs are not performed
in a vacuum and that the communicative context is an integral part of verbal
exchange. In the case of Question the communicative context in which this
SA typically occurs is the so-called erotetic context. (I coined the term after
Belnap's (1969) erotetic situation or, as he put it, the asking questions-in-
circumstances.) A typical erotetic context (EC) will have the following con-
stituents:
(3) S = the speaker
A = the addressee
An = the answer
354 Wiesiaw Oleksy
the participants by social norms accepted in the given speech community and
culture at the given time. Thus, sociocultural restrictions will have to account
for several things pertaining to the explanation of how and to what extent
such sociolinguistic parameters as, e.g., sex, age, status, etc., of the participants
on the one hand, and the cultural (institutional) and situational contexts on
the other interact with FCs.
Closely connected with SC seems to be the role relationship that exists be-
tween the speaker and the addressee. It is worth adding that from the point
of view of the speech act analyst it is crucial how the participants contribute
to the ongoing verbal exchange, i.e., what role in regard to the speaker/ad-
dressee distinction each of them assumes at the time of the speech act. Such
roles as, e.g., father-son, student-teacher, boss-secretary, friend-friend, etc.,
are relevant because they have a bearing on the felicitous use of certain speech
acts. For example, a 10-year-old boy cannot be said to have performed a (fe-
licitous) command if he uttered (5) to his father with an intention to issue a
command.
(5) Father, get me a bike at once.
One of the conditions for the felicitous use of commands is that the speaker
(who is issuing a command) is in a superior position in relation to the addres-
see. Since this is not the case in the situation dealt with above (5) is not effec-
tive: the speaker has failed to produce a felicitous command. However, on an-
other occasion the same boy may have performed a felicitous command, for
instance upon uttering (6) to his younger brother:
(6) Tom, get me a bike at once.
In the analysis of SAs it is crucial to remember that role relationship should
be defined for the given speaker and the given addressee relative to the spe-
cific communicative context in which SA is performed. It seems thus best to
consider role relationship as a set consisting of two members: one for the
speaker and one for the addressee. If any of the members in the set changes
membership, e.g., the addressee becomes the speaker, etc., or a new partici-
pant becomes involved in the verbal exchange or the communicative context
in which the participants are performing changes, a change in the role relation-
ship may be brought about. Kando (1977:246) points out to the fact that
sometimes two roles, although performed at different times, may result in
creating a role conflict in the individual. This is the case, for instance, of the
teaching assistant in a university who has the difficult task of reconciling
allegiance to the student body with identification with the faculty. For our
purposes it is enough to note that role relationship changes with the changing
conditions in which the participants are performing. Of course, it remains to
356 Wies taw Oleksy
be explained on the basis of empirical data what the elements of the changing
conditions are and how they contribute to the ongoing verbal exchange.
Perhaps what is most interesting and relevant to the study of SAs from the
point of view of SC is the choice of strategies for the performance of SAs by
the speakers on the one hand and the appropriateness of strategies in certain
contexts on the other. This brings me to the last part of the section dealing
with the Illocutionary Component:
Strategies for the performance of speech acts. Above I have tried to deal
with two things: one was connected with how it was possible to define a
felicitous SA and another one was connected with how the functioning of
SAs in socioculturally restricted contexts could be approached. Now I wish
to address the following question: How does a speaker perform the intended
act? The answer to this question involves whatFraser(1978:7) called strategies
for the performance of speech acts. He defined strategy in the following way:
" . . . the term strategy [ . . . ] refers to the particular choice of sentential form
and meaning which the speaker employs in order to perform the intended act".
The choice of a particular strategy may vary from speaker to speaker and
is related to the linguistic repertoire of the speaker on the one hand and the
speaker's assessment of the communicative context in which a speech act is
performed on the other. It seems obvious that the output of the sociocultural
analysis of speech acts can throw some light on the choice of strategies. Of
particular relevance will be the role relationship that exists between the
speaker and the addressee at the time of the speech act, the degree of famil-
iarity between the speaker and the addressee, the participants' age and sex, and
the institutional and situational context in which a speech act is performed.
All the abovementioned factors are assumed to be pertinent to the choice of
strategies for the performance of a particular speech act. It has recently been
suggested (Walters 1979:279) that closely related to the choice of strategies
for requests is the politeness of request forms.
The problems dealt with above can be summed up under one general label
which I call pragmatic competence. In my understanding of the term, prag-
matic competence is the ability of an individual to use an SA in the given com-
municative context with a particular strategy in order to obtain maximum
communicative and social goals. This treatment of pragmatic competence is
reminiscent of Edmondson's(1980:2) social competence.
In what follows I shall demonstrate a few strategies for the performance
of a direct question. One obvious strategy for any speech act is to announce
the intended force of the utterance with which a speech act is performed.
This can be done if IF (and hence the intended meaning) of the speech act
being performed occurs overtly in the sentential form which represents a given
strategy. (7), (8), and (9) are examples of such overt manifestations of the
Towards pragmatic CA 357
strategy for the performance of the following speech acts: a question, a pro-
mise, and a warning; respectively (7), (8), and (9).
(7) I want you to tell me "x".
(8) I promise I'll do "x".
(9) I warn you not to do "x".
But more than often, the speakers prefer not to announce their intentions in
performing a speech act and choose some less direct means of conveying to
the addressee the force of the speech act. (1023) are the strategies for the
performance of a direct question.
(10) What's your name?
rely on empirical data in this matter. The elicitation of data in this type of re-
search is of primary importance. It will be suggested that at the beginning
stage of data collecting the following procedure may prove useful:
a. elicitation of strategies: either retrospectively or experimentally
b. establishing hierarchies of strategies on the basis of the frequency of use as
reflected in the experimental data
c. differentiation among strategies relative to communicative context
d. differentiation among strategies relative to other pragmatic factors, such as,
e.g., politeness, mitigation, etc.
The elicitation of strategies can be based on pre-arranged situations, e.g., role
playing, in which the interactants would assume the roles of the speaker and
the addressee. These pre-arranged situations can be constructed in such a way
that they contain the following factors:
a. setting the communicative context in which S A will be performed is pro-
vided;
b. participants the role relationship between the speaker and the addressee
is indicated;
c. goal - the type of SA to be performed by the speaker is specified.
By way of illustration an example is provided. In the pre-arranged situation
demonstrated below the speaker is expected to produce a strategy for the per-
formance of the SA of Question. Presumably, the data obtained in this ex-
periment would be related to strategies retrospectively proposed in (10-23)
above.
a. setting A friend of yours has invited you to a party in his appartment.
b. participants At the party you've noticed someone you think you'd met
before but you can't recall her name.
c. goal You want to know her name. What would you say to her?
To finish this section it is worth pointing out that despite the fact that some
pioneering work on matters alluded to above has already been done (cf.
Fraser 1978, House Kasper 1979, Walters 1979) the research methodology
pertaining to the elicitation of data for the analysis of pragmatic competence
is waiting to be developed.
guistic expressions can be put are essentially the same across languages. The
above claim has, in fact, been recently put forth by Fraser, Rintell and Walters
(1979) as reported in Walters 1979:278. I repeat this claim (and two other
claims) here for convenience.
i. Every language makes available to the user the same basic set of speech
acts such as requesting, apologizing, declaring, promising, and the like,
with the exception of certain culture-specific ritualized acts such as baptiz-
ing, doubling at bridge, excommunicating and the like.
ii. Every language makes available basically the same set of strategies - se-
mantic formulae for performing a given speech act.
iii. Languages will differ significantly with respect to both when a particular
speech act ought to be, ought not to be, or may be performed, and with
what strategy.
From the contrastive point of view it is thus necessary to state conditions
which must be met for an SA to be equivalent in L t and L 2 . In order to do
this it will be suggested that the mechanisms proposed in the section above
for the identification of SA can be employed. Accordingly, an SA can be said
to be equivalent across languages if it is characterized by the same IF and the
same set of FCs. This is expressed in (24).
(24) SA,L! = SA,L2 if a. I F S A > L ] =IFSA,L2
b- FC S A j L i = F C s a
When the equivalence of an SAin Lj and L2 has been established what remains
to be done is to find out how a given SA is realized in Lx and L 2 . That is to
say, it is necessary to identify linguistic expressions in L t and L 2 which are
used by speakers of these languages for performing a particular SA. At this
stage, the procedures for the elicitation of strategies for the performance of
SAs outlined in the section above can be conveniently applied.
If the above suggested procedure is applied the equivalence of expressions
used by speakers in Li and L2 will be related to the fact that they are realiza-
tions of the same SA. In other words, if an SA in L { is equivalent to an SA in
L2 then the linguistic expressions which have been used in L 1 for performing
that SA must be equivalent to the linguistic expressions used for performing
the same SA in L 2 , for it would be absurd to hold that an SA in question is
equivalent in L t and L 2 and its realizations are not.
The above discussion can be summed up by saying that the comparability
of SAs across languages is guaranteed by the fact that SAs, as communicative
categories, are equivalent ex definitione: the speech act of requesting in Eng-
lish is, for example, equivalent to the speech act of requesting in Polish.
Similarly, IF and FCs which are used for the identification of a particular SA
are assumed to be language universal.
360 Wiesiaw Oleksy
On the other hand, (27) in English and (30) in German are FEq. and PEq., but
it was discovered experimentally by House and Kasper that in the data that
they investigated the frequency of use showed more correlation between (27)
and (28) though they are not FEq. than between (27) and (30) which are FEq.
The above discussion seems to support the claim about a necessity of the
introduction of the distinction between pragmatic equivalence and formal
equivalence into CA.
In 1.2, the section dealing with the Illocutionary Component (Ill.C), I have
distinguished the following criteria which characterize SA: (a) IF, (b) FC,
(c) SC. Also, I have indicated that (d) role relationship holding between the
speaker and the addressee and (e) strategies for the performance of SA are of
primary importance. IF and FCs have been closely connected with the identi-
fication of SA as an object of linguistic analysis. Furthermore, IF and FCs
have been proposed as indicators of equivalent SAs across languages.
In the discussion that follows some aspects of (c), (d) and (e) above will be
touched upon. While (a) and (b) seem to be useful in the characterization of
SAs irrespectively of the language and culture of the speaker, (c), (d) and (e).
are language and culture specific. Following the claims put forth by Fraser,
Rintell and Walters (1979) it will be proposed that a contrastive analysis of
SAs should incorporate (c), (d) and (e) above. By doing so it will be possible
to provide answers to questions about the culturally, institutionally and
situationally restricted contexts in which a particular SA ought to be, ought
not to be, or may be used.
A pragmatic contrastive analysis (PCA) of SA should thus demonstrate
how an equivalent SA functions in the cultures to which speakers of L, and
L 2 belong. It is believed that an adequate contrastive analysis of equivalent
SAs cannot be performed without the incorporation of the sociocultural com-
ponent, if the analysis is to demonstrate how SA under analysis functions in
culturally distinct societies.
The functioning of Eq.SA in culturally distinct societies can be carried out
successfully if strategies for the performance of a given SA are related to such
factors as role relationship holding between the speaker and the addressee and
other pragmatic factors such as politeness, mitigation, level of directness (cf.
House - Kasper 1979:3-12). It must be stressed at this point that the identi-
362 Wiesiaw Oleksy
fication of pragmatic factors and their interaction with linguistic and socio-
cultural factors must also become a matter of interest for a contrastive linguist.
Although research on the use of SAs across languages is scarce it is justi-
fied to claim, on the basis of the research alluded to in the first part of this
paper, that pragmatic factors together with sociocultural restrictions are es-
sential and underly speakers' decisions as to the choice of a particular strategy
for performing an SA. The aim of PCA is to isolate and explain these factors.
It goes without saying that the impact of pragmatic and sociocultural fac-
tors on speakers' decisions concerning the choice of a strategy can be per-
ceived differently across languages and cultures. Accordingly, the following
claim is put forth:
(31) The choice of strategy for the performance of SA is related
to pragmatic factors and sociocultural restrictions on the
use of a given SA, which is language specific.
It is believed that this claim should augment the claims put forth by Frser,
Rintell and Walters (1979).
It can be noticed that the choice of strategies is also related to what has
been previously called pragmatic competence. If the notion of pragmatic
competence is applied to foreign language learning it becomes obvious that
learners of a foreign language will display different degrees of familiarity with
sociocultural restrictions and pragmatic factors governing the use of a parti-
cular strategy for the performance of SAs. In other words, learners will en-
counter difficulties with regard to the acquisition of pragmatic competence:
their verbal behaviour in a foreign language will then display pragmatic errors
(cf. House Kasper 1979:2). It seems thus sensible to suggest that the proper
place where pragmatic errors can be studied is pragmatic error analysis. This
suggestion will not be elaborated any longer in this paper.
Summing up the last part of this paper it can be said that PCA should focus
on the identification and differentiation of linguistic expressions used by
speakers of Lj and L 2 for the performance of SAs and, in particular, PCA
should explain the motivation which underlies decisions made by the speaker
as to the choice of a particular linguistic expression (i.e., a strategy) to per-
form a given SA. As has been repeatedly pointed out in this paper research
methodology for PCA should be confronted with experimental data.
The present paper should be treated as an outline of problems to be solved
and areas to be investigated. However, it is believed that what has been sug-
gested here will prove useful in handling matters pertaining to the understand-
ing of how speakers across languages manipulate linguistic expressions to per-
form different societal tasks.
Towards pragmatic CA 363
References
Sadock, Jerrold
1974 Towards a linguistic theory of speech acts (New York: Academic Press).
Sajavaara, Kari
1977 "Contrastive linguistics past and present and a communicative approach",
Contrastive papers, edited by Kari Sajavaara and Jaakko Lehtonen(Jyvskyl:
University Press), 9 - 3 1 .
Stalnaker, Robert
1972 "Pragmatics", Semantics for natural language, edited by Donald Davidson
and Gilbert Harman (Dordrecht: D. Reidel), 3 8 0 - 3 9 8 .
1980 "Review of Gerald Gazdar, Pragmatics", Language 5 6 : 9 0 2 - 9 0 5 .
Szwedek, Aleksander
1975 Word order, sentence stress and reference in English and Polish (Edmonton:
Linguistic Research, Inc.).
Wachtel, Tom
1979 "A question of imperatives", PSiCL 1 0 : 5 - 3 3 .
Walters, Joel
1979 "Strategies for requesting in Spanish and English structural similarities and
pragmatic difference", Language learning 2 9 : 2 7 7 - 2 9 5 .
JERZY RUBACH
1 Introduction
2 Unordered rules
Let us look at two rules in Polish: Surface Palatalization and Regressive De-
voicing. They are both low phonetic and exceptionless. Furthermore, they
both cause interference.
Surface Palatalization applies before /i j/ and it affects consonants both in-
side words and across word boundaries:
(1) [ + c o s ] ^ [ ^ c h k ] / - ( I - s e g ] ) |+3jh]
In the case of velars /k g x/ rule (1) effects a change in the place of articula-
tion since velars are fronted to the prevelar (postpalatal) position. We thus
have [k' g' x'] in:
(2) kilometr 'kilometre', gimnastyka 'gymnastics', histeria
'hystery', obtoki 'clouds', drogi 'roads', podstuchiwac
'listen in', krzak jezyny 'blackberry bush', dach i ofcna 'roof
and windows'.
In all other cases the place of articulation is not affected by Surface Palatali-
zation. Consonants become palatalized but they retain their original place and
manner of articulation. For example, /1/ changes to /t'/ but it is still a dental
stop. Rule (1) should therefore be kept distinct from the well knownmorpho-
phonemic process of Anterior Palatalization (cf. Gussmann 1978 a, Rubach
1981). The latter changes dentals into palatals, for example, /t/ becomes an
alveolo-palatal affricate [tp] as in kat 'hangman' kaci (pi.).
In (3) below we list some examples showing how Surface Palatalization af-
fects labial and coronal consonants:
(3) / p b m f v / ^ [ p ' b ' m ' f ' v ' ] :
pisk 'scream',pierwszy 'first', biuro 'office', mieszkac live',
fiotek 'violet', witio 'wine';
/t d sz/ [ t ' d ' s'z']:
tiara 'tiara', diabet 'devil', sinus 'sine', zjawisko 'phenom-
enon', brat Ireny 'Irene's brother';
/I r n/ - [ ']:
litosc 'mercy', rizotto 'risotto', dzban jagod 'jar of black-
berries';
/ / . rv' / / jvf-i
/s c dz/->[s c dz J:
Chirac (French name), imazinizm 'imaginism', Chile 'Chile',
dzinsy 'jeans', pokaz je 'show them'.
Rule ordering in phonological interference 367
3 Ordered rules
Rule (7) refers to the dialects of central and northern Poland and it expresses
the generalization that obstruents are voiced before voiced obstruents both
inside words and across word boundaries:
(8) liczyc 'count' - liczba 'number', prosic 'to request'
prosba 'request', skonczze 'DO finish', tap go 'catch him',
gtos dziecka 'child's voice', sprzgt wojskowy 'military
equipment', etc.
Thus, for example, the // of liczyc 'to count' is replaced by /d2/ in liczba
'number' due to the voicing environment of /b/.
At the same time Polish has a rule of Progressive Devoicing:
Both Voicing and Progressive Devoicing are late and entirely exceptionless
rules. Voicing is clearly responsible for mistakes such as this *[z] value, Miss
*[z] Brown, a bit *[d] better and help *[b] David. Interference occurs also
internally: *[z] is pronounced in misbehave, misguided, etc. where the cor-
rect form is [s]. Observe that Voicing and Progressive Devoicing have partially
overlapping environments. The examples quoted so far all refer to the portion
of Voicing which could not be in conflict with Progressive Devoicing. The latter
cannot apply across word boundaries and it cannot affect noncontinuants.
Consequently, it could not produce *[f] for [v] in this value nor *[p] for [b]
in misbehave. However, all instances of word internal clusters of fricatives or
stops followed by fricatives are potential inputs to both rules. Thus English
svelte may, in theory, be mispronounced in two ways:
(i) as [zv] via Voicing
(ii)or as [sf] via Progressive Devoicing.
If our conclusion from section 2 that in phonological interference rules are
transfered at random orders is true, then both of these mispronunciations
should be current with Polish learners of English. Alternatively, only [zv]
should be attested if for some reason Progressive Devoicing could not cause
interference. Unfortunately for our theory none of these alternatives is true.
Progressive Devoicing does cause interference and what is more there is no pos-
sibility of mispronouncing svelte as *[zvelt]. Learners end up saying *[sfelt]
with perfect consistency, i.e. we have interference from Progressive Devoicing.
In English there are very few words that have clusters of obstruents of which
the second member is a fricative and which would disagree in the value for
voicing. There are, however, many relevant examples in German and Russian:
The word liczba 'number' is not affected by Progressive Devoicing since the
second member of the cluster is a stop. Neither can Progressive Devoicing ap-
ply if there is a word boundary. Compare Pol. gtos walki 'sound of battle'
with English svelte and this value:
(14) gtos walki svelte this value
II-S# # v - sv- -S# #v-//
*sf - Progr. Devoic. (9)
z##v - *z##v Voicing (7)
Summing up, our conclusion from section 2 is not true. In phonological inter-
ference rules are transfered along with their ordering. Random ordering of
Surface Palatalization and Regressive Devoicing shown by the derivation in
(6) is simply a consequence of the fact that also in Polish phonology the two
rules do not interact in any crucial way.
4 Partial ordering
The voiced postalveolar affricate /di/ derived from //g// via 1 s t Velar is further
changed to /2/ by Spirantization:
(18) waga 'scale' - wazyc 'weigh', ubogi 'poor' - ubozec 'be-
come poor', rg Tiorn' - rozek (dimin.), rtoga 'leg' -
nozysko (aug.) - nozyna (dimin.), snieg 'snow' sniezysty
(adj.) - sniezyca 'snow-storm', wilgotny 'moist' - zwilzyc
'moisten', skarga 'complaint' - skarzyc 'complain'.
Now let us see how 1 s t Velar is ordered with respect to Surface Palatalization,
rule (1), discussed in section 2. There is plenty of very compelling evidence
to show that 1 s t Velar must precede Surface Palatalization. The arguments
come from various interactions with a number of other rules in Polish. In this
paper we have introduced briefly only one of the relevant rules, Retraction,
so let us limit our discussion to the interaction between 1 s t Velar, Retraction
and Surface Palatalization 8 .
Recall that Retraction changes /i/ to [i] after nonhigh coronal consonants.
It enters therefore into a feeding relationship with 1 s t Velar which produces
/ df 5/ from //k g x//, i.e. noncoronals are changed into nonhigh coronals and
1 s t Velar feeds Retraction. At the same time Retraction bleeds Surface Palat-
alization since it destroys the environment for the latter by deriving / i / . Un-
like Surface Palatalization, Retraction is constrained to apply only inside
words, i.e. it is blocked by word boundaries. Let us look at the derivation for
kroczyc 'to step', the verb derived from krok 'step' by adding the verbalizing
suffix I/i/l and the infinitive ending //tp//. This derivation is contrasted with
that for palacz i 'smoker and', where I III I is the underlying segment and
forms part of the suffix -acz added to the verbal stem of palic 'to smoke':
Clearly the rules must be ordered as in (21). Otherwise we would not be able
to derive [i] in kroczyc 'to step'. Had Surface Palatalization applied before
Retraction, the /5/ of kroczyc 'to step' would have been made [+high] and
Retraction would not have applied: *[kro'it(5], In sum, 1 s t Velar precedes
Surface Palatalization.
Let us now return to the facts of phonological interference. Recall that in
section 3 we arrived at the following generalization: in the process of inter-
ference, rules are transfered along with their ordering and the rules which are
ordered early in the native system apply as early in phonological interference.
If this generalization is true, then words such as taking, leaking, cracking, etc.
should have a tendency to be mispronounced by Poles as *[tejii]]) *[ticiij],
*[kreCiij ], i.e. the interference should come from 1 s t Velar. Unfortunately
Rule ordering in phonological interference 373
for our theory such mistakes are not on record. Worse, they are impossible.
Poles do have interference in taking, leaking, cracking, etc. but it is the inter-
ference from Surface Palatalization: [k] is replaced by [k'] in the environment
of /i/. Thus the true generalization about rule ordering and phonological inter-
ference must be different. We have missed something of real significance in
our theory. As has been the case before, the answer should be sought in the
structure of Polish phonology and phonological theory.
It has been recently proposed (cf. Mascaro 1976, Halle 1978, Kiparsky,
forthcoming, and Rubach 1981) that all phonological rules fall into two
classes: cyclic rules (in the new sense, see below) and postcyclic rules. Cyclic
rules as a bloc precede postcyclic rules so that in a derivation we cannot have
any intermixing of these types of rules. The details and the mechanics of
cyclic phonology need not concern us here (for discussion see the references
above). However, the basic idea is necessary.
A rule which is designated as cyclic applies in accordance with the prin-
ciple of Strict Cyclicity (for a formal statement see Halle 1978). Most funda-
mentally, Strict Cyclicity incorporates the notion of 'derived environment'
(in the sense of Kiparsky 1973), i.e. cyclic rules may apply only if either of
the following two conditions is met:
(i) there is a morpheme boundary
(ii) or a segment, which is crucial to the application of a given rule, has been
derived by another rule applying earlier on the same cycle.
Thus, if condition (ii) is not met, a cyclic rule cannot apply morpheme inter-
nally.
First Velar and Spirantization are cyclic rules. Their cyclicity follows from
the principle that cyclic and postcyclic rules form separate blocks. In Rubach
(1981) I have shown that both of these rules must apply before rules which
have been found to be cyclic (e.g. Anterior Palatalization, Lower), con-
sequently, 1 s t Velar and Spirantization must be cyclic themselves. The data
bear out his claim. First Velar is entirely exceptionless and 100% productive
at morpheme boundaries (the examples quoted earlier in (16), (18) and (20)).
However, it does not apply morpheme-internally. Thus borrowings undergo
1 s t Velar if there is a morpheme boundary but never if velars are followed by
front vowels inside the same morpheme:
5 Conclusion
Notes
1. In Rubach 1974 I argue that the two should not be collapsed. For the purposes of
this paper the point is irrelevant hence we shall by-pass it in our discussion.
2 . 1 use double slashes for underlying representation, single slashes for intermediate
stages and square brackets for phonetic representation.
3. Actually, the change from //r// to // passes through an intermediate stage of palata-
lized r. Derivational details need not concern us here, for discussion see Rubach 1981.
4. As a matter of fact, [v] in these words as well as in English svelte is partially or even
completely voiceless. This, however, is irrelevant to the point that we are making: the
only important fact is that the in question is not identified with f in English and
German. The interference mistakes lie precisely in such an identification.
5. Incidentally, observe that language interference facts may also serve as an excellent
testing ground for various linguistic theories. The data quoted here show, for example,
that rule ordering is not an artifact of generative phonology. It is an absolutely neces-
sary concept in any phonological theory.
6. The term "palatalized" is used here in the phonological sense. Phonetically speaking,
/ df 5/ are nonhigh postalveolar consonants.
7. Also before /j/, see Gussmann (1978a) and Rubach (1981).
8. Other interactions are discussed in detail in Rubach (1981). Let us also point out that
all investigators have always assumed that 1 s t Velar is an early rule (cf. for example,
Gussmann 1978a).
9. This is true in cases of no contextual overlap between the rules of the native and the
target language (cf. Rubach, in press).
References
Elisasson, Stig
1978 "Theoretical problems in Scandinavian contrastive phonology", The Nordic
languages and modern linguistics 3, edited by J. Weinstock (Austin: The Uni-
versity of Texas), 2 1 7 - 2 4 3 .
Fisiak, Jacek
1976 "Generative phonology and contrastive studies", The Canadian journal of
linguistics 2 1 : 1 7 1 - 1 7 9 .
Gussmann, Edmund
1975 "How do phonological rules apply?", Papers and studies in contrastive lin-
guistics 3, edited by J. Fisiak (Poznan: A. Mickiewicz University Press), 113
-124.
1978a Contrastive Polish-English consonantal phonology (Warszawa: PWN).
1978b Explorations in abstract phonology (Lublin: M. Curie-Sktodowska Uni-
versity).
Halle, Morris
1978 "Formal vs. functional considerations in phonology" (Indiana University
Linguistics Club).
Kiparsky, Paul
1973 "Phonological representations", Three dimensions of linguistic theory .edited
by O. Fujimura (Tokyo: TEC Company), 3 - 1 3 6 .
(forthcoming) Lexical morphology and phonology.
Mascar, Joan
1976 Catalan phonology and the phonological theory (MIT Ph. D. diss, available
from Indiana University Linguistics Club).
Rule ordering in phonological interference 377
Rubach, Jerzy
1974 Variability of consonants in English and Polish (University of Warsaw Ph. D.
diss.).
1981 Cyclic phonology and palatalization in Polish and English (Warszawa: Uni-
versity of Warsaw).
(in press) "Rule typology and phonological interference", Theoretical issues in con-
trastive phonology, edited by S. Eliasson (Stuttgart: Julius Groos Verlag).
KARI SAJAVAARA
Maybe the things you want are like cards. You don't want them
for themselves, really, though you think you do. You don't want
a card because you want the card, but because in a perfectly
arbitrary system of rules and values and in a special combination
of which you already hold a part the card has meaning. But sup-
pose you aren't sitting in a game. Then, even if you know the
rules, a card doesn't mean a thing. They all look alike.
Robert Penn Warren, All the King's Men, Harcourt, 1946.
1 General considerations
Jacek Fisiak (1980:10) points out rightly that doubt concerning the validity
and usefulness of contrastive studies "results from a number of misinterpre-
tations and misunderstandings created by such factors as the peculiar metho-
dological status of CS, the lack of a clearcut distinction in the past between
theoretical and applied CS [. . .] and the lack of a precise formulation of the
different aims of theoretical CS and applied CS as well as the confusion of the
relationship between CS, the psycholinguistic theory of interference and er-
rors, and the theory of second language learning" (see also Sajavaara 1977).
It seems therefore justified to pay more attention than heretofore to the inter-
relationship between contrastive studies and psycholinguistic processes.
The link between foreign language teaching methodology and theoretical
linguistics has always been very close. Major shifts in linguistic theory have
been regularly reflected in language teaching methodology, which implies
that linguistic models have been associated with predominant psychological
conceptualizations. Audiolingual language teaching methodology was spon-
sored by structuralism through its link with behavioural psychology, and
cognitive-code approaches became popular in language teaching simultaneous-
ly with the heyday of generative theory. In the last few years, various ap-
380 Kari Sajavaara
This quotation may repeat what is a commonplace among linguists and ap-
plied linguists but its full relevance has seldom been recognized. It relates to
Lyons's distinction between language-system and language-behaviour (Lyons
1977:26ff.). In contrastive analysis which aims at applied goals, research
based on sentences (the way most of the practice has been) may be totally
misleading, since we should actually direct our attention to utterances, which
are expected to reflect what there is in a language-system. Grammars are ab-
stract descriptions of system sentences, and we do not have grammars based
on utterances of natural language use. Therefore it may be difficult to work
out details of language contrasts for applied purposes.
Speech reception and production can only be approached through refer-
ence to interaction; these processes should be related to the exchange 'game'
which is going on between two or more interlocutors. All models that neglect
382 Kari Sajavaara
items, words in a very broad sense, are identified directly on the basis of de-
tected cue information which constitutes the Gestalt of the items. The pro-
cesses are illustrated by Figure 1 (from Lehtonen Sajavaara 1980). If, how-
ever, the word is difficult, i.e. it cannot be immediately recognized from the
acoustic cue information, phonological categorization is needed for identifi-
cation as an auxiliary strategy; it is always available whenever the cue pattern
of the input signal finds no matching items in memory. Phonological analysis
is also needed when it is necessary to make use of the information available
in the affixes of complex derivative forms. Both channels are open all the
time, but the budding phonological process simply fades away, if it is not
needed when a new input chunk arrives. Thus, the identification of the phono-
logical segment string is possible, but not obligatory. Some experimental evi-
dence for the existence of parallel phonological processes can be found, for
instance, in recent work by Foss and his associates (Foss Blank 1980, Foss
et al., 1980). Lamminmki's study is also very revealing as regards the dual
nature of the interpretation process. It was normal for his subjects to identify
['LINGUISTIC COMPETENCE'
I OR MONITORING OF GRAMMAR,
Figure 1.
A partial model of speech perception (from Lehtonen - Sajavaara 1980a). Meaningful
items, 'words' in a broad sense, are identified directly on the basis of cue features con-
stituting the Gestalt of the items. An optional process of phonological analysis is also fed
by the same cue pattern. The two processes are assumed to be simultaneous.
Psycholinguistic models 387
the test items on the basis of features other than those which belonged to the
phonological code; whatever the speech signal, the subjects 'heard' lexical
items that were familiar to them (Lamminmki 1979).
guistic description are relevant units in the production and reception of mes-
sages must be considered unfounded (Lehtonen 1980b).
Lehtonen (1980b) points out that since grammatical rules are formulations
of structural regularities, or of the speaker's knowledge of such regularities, in
the sentences of the language, "an adequate description of a grammar is im-
portant for psycholinguistics, too, because in order to be able to approach the
productive mechanism we first have to learn all about the structure of its pro-
ducts". What we have to forget in this context is the hierarchical constitution
of grammar. Lehtonen concludes:
The interrelationship between grammar and lexicon is one of the most inter-
esting and problematic points in an interactive model of message processing.
The above discussion of phonological processing implied that lexical elements
bear an important function in such processing. It could be hypothesized ac-
cordingly that information is embedded in memory in the form of lexical
items, a kind of 'words'. In accordance with Marslen-Wilson and Welsh (1978:
58), such lexical memory items can thus be defined as intersections of various
procedures operating over a range of cognitive dimensions (cf. Morton 1979,
Ellis 1979). Such a 'word' implies a pretheoretical label for a processing unit;
it contains information concerning its use in utterances and its relationships
to other words within and outside utterances. A 'word' activates various pro-
cessing procedures through this information; such procedures reach from pre-
fabricated phrases and frequent word combinations over to grammatical con-
straints and semantic fields. What comes immediately to mind when this kind
of processing model is suggested is a left-to-right probabilistic model, which
was criticized as early as 1951 by Lashley (1951), and the Markov process,
a language 'automaton' which was given up by Chomsky as a finite state
grammar (Chomsky 1957). A grammatical view of sentence processing ob-
viously makes such models unacceptable; in a dynamic psycholinguistic mod-
el, in which 'words' are given a primary status and not understood in a narrow
Psycholinguistic m o d e l s 389
Figure 2.
A model of message processing (from Sajavaara Lehtonen 1980a).
and definitions (the borderline is never strict, which means that the whole is
to be seen as a continuum).
The 'buffer', which includes short-term memory, and the interface take
care of the dosage of the incoming flow of material into chunks which can be
processed by the system (see Ellis 1979). The interface is responsible for the
reduction of the incoming data (from sound waves and letter symbols) to the
extent that the buffer can handle it. The interface must carry this function
out, at least partially, on the basis of certain physical features of the incoming
stimulus as was pointed out above. A certain number of the cues embedded
in this stimulus are language-specific, others are universal (e.g. the informa-
tion that a stimulus is human speech). The interface and the buffer are neces-
sary both in the production and in the reception of speech. The processes
that are executed here are the reception and actual transmission of linguistic
data and the control of the articulatory organs and their motor programmes
and the hearing organs and processes.
All the information available in the data bank is of equal value and can be
used simultaneously by the cognitive mechanism materializing in the Over-
lord. The syntactic, morphotactic, and pragmatic rules as well as various
semantic networks which have been activated through the identification in
the incoming signal of a certain 'key element' together with the earlier results
of the progressing analysis are available at each stage of the process as con-
straints which open up or restrict potential choices in the construction of
message content or in the production of speech. The data-driven analysis is
necessarily an on-line process which proceeds word by word but, through the
retroactive reworking effect, any interpretation may be changed later in the
light of subsequent context. The Gestalts which are an important element in
the detection mechanism are used to make predictions as concerns the nature
of forthcoming data (see Lehtonen 1980a, Sajavaara 1980).
The dual nature of the processing system together with the general func-
tioning principle of the Overlord implies that most of the assumptions con-
cerning interference in various contrastive theorizings have been based on too
straightforward an idea of what takes place in the human brain. Yet the same
model also indicates that in some respects the contrastivist conjectures have
been too complicated in assuming that all the hierarchical 'linguistic' steps are
activated all the time in language processing. The partial model in Figure 1
above predicts interference on three levels of processing in cross-language
communication: detection of cue patterns, lexical identification, and recon-
struction of phonological target structure. Contrastive implications of the
model will be taken up later.
The production process is somewhat more complicated to reconstruct than
the reception process (here it is not necessary to take any stand as concerns
392 Kari Sajavaara
MESSAGE PARLOUR
VERBAL
GOAL PLANNING ex-
'EVENT' : l or, ecution covert
.
IN THE ; i-U speaker s :ommuni-
WORLD will motor rative be
and plans shaviou&/
intention p l a n s for i n t e n -
tional non-ver- ]
bal gestures
and paralanguage
t h e speakers
affective & : ' non-
physiologi- inten-
cal state tional
.feature
L e h t o n e n & S a j a v a a r a 1980
Figure 3.
In this model of the progression of communicative events from the speaker's viewpoint
(developed by the Finnish-English Cross-Language Project, see Lehtonen 1980 b), the
communicative event is seen to be based on the speaker's overt communicative behaviour,
both verbal and non-verbal, on the one hand, and on various non-intentional features to
be detected in the speaker, on the other. In addition, the 'event' in the world also may
add its own elements to the communicative event.
Psycholinguistic models 393
cessing, which results from acquisition, the synthetic Gestalt mode is the
regular one. Language learning emphasizes an analytical approach and gives
preference to processes which are dependent on phonological mediation or
similar procedures. This analytical mode of processing is often reinforced in
the available teaching methodologies. If the foreign language is approached
through detailed acoustic, physiological, and other similar phenomena, the
student may be misled to paying attention to factors which are of relatively
small importance rather than to more comprehensive units of speech and
communication. The processing of individual phonological items hardly causes
any serious problems if we find ways of teaching the learners to 'hear' the
foreign language input correctly. In most cases it is not necessary to pay spe-
cial attention to phonology.
In language contact, intake is regulated by a 'socioaffective filter' (see
Dulay Burt 1977), which is controlled by various social and personality
factors and knowledge of the world (Sajavaara Lehtonen 1980a). The filter
can be opened by integrating the learner psychologically or socially into the
target group or by adding to his knowledge of the world. Increased operation-
ality of the new language also lowers the filter. Conscious or subconscious
perception of what makes part of the world and the language is governed by
previous experience. In this way a great deal of what is important in commu-
nication can be transferred from first-language skills.
The teacher's main task in the classroom is the opening of the filter and
the provision of a sufficient amount of language input. It is important that
the language code is not an end in itself; it should be regarded as a product of
processes whose goals are non-linguistic. When the code was the main target
of teaching, it was not considered necessary to tell pupils anything about why
it was necessary for them to carry out certain procedures in the classroom. It
was sufficient that the teacher knew. For the pupil to know about the pur-
poses of the tasks was considered hazardous or even destructive for learning
(seen as habit-formation - the basic idea was obviously derived from behav-
iouristic psychology). Now it is understood that a learner's performance may
decisively be enhanced by his own view of his own chances of success, and
therefore it may be of some importance to tell the learner what is expected
of him. He may even be taught to learn. An integral part of this work in-
volves bringing language courses closer to reality as it is experienced by the
learner, closer to the world he lives in, and the environment he feels to be his
own. This means that it is not necessary for the language teacher to operate in
a vacuum without any link with what the learners are and what they do or are
planning to do outside the classroom.
Several attempts have been made during the past decade to break away
from the grammatical syllabus and to approach language teaching problems
Psycholinguistic models 397
fact that what is called the passive in Finnish grammars is not really a passive
in the same sense as the term is applied in the grammars of Indo-European
languages. A wrong equation between the two codes results in an overgener-
alization process in the target language.
Learning a new language means not only the acquisition of new categories
but also the reorganization of the criteria which are applied as cues for the
identification process of those new categories. The consequences which the
model of message processing sketched above has for a description of data pro-
cessing problems which the language learner encounters in FL communication
can be outlined only tentatively (cf. Lehtonen Sajavaara 1980).
The model allows for the following cross-language observations:
(1) Processing is based on the learner's previous linguistic knowledge, i.e.
the idiomacy of his Lj pronunciation and the corresponding receptive idio-
macy. Successful detection of L 2 phenomena is possible only if the speaker
has had a sufficient amount of contact with spoken L 2 . Because of the
universal nature of many features in speech, the cueing system seldom
fails totally in the case of a foreign-language learner, but if the system
resorts to cues which are irrelevant in the target-language system, incorrect
identification alternatives may hamper the functioning of the entire
mechanism.
(2) Interference in various paradigms and corresponding errors in identifi-
cation and production have been the object of most contrastive studies. Inter-
pretation of these data is still problematic and it is possible that the significance
of this interference has been exaggerated.
(3) Even if appropriate cue patterns are available for the word-identifica-
tion mechanism, the Overlord may fail to recognize the item because there is
a gap in the lexicon. Thus no lexical memory item can be activated on the
basis of the incoming cue pattern. It is also possible that the lexical item is
stored in a different form (as a graphemic representation without any cor-
responding phonetic shape or as a disguised morphological stem which does
not open all the necessary derivatives).
According to Krashen (1978), speech performance is always initiated by
means of the acquired system, while the learned system is available as a
Monitor to edit the output. In acquisition-poor environments, a speaker ob-
viously relies on his Lj competence as performance initiator. The string
is then 'translated' into an L 2 string, whose grammaticality and acceptability
depend on the availability of 'rules' and on the nature of the constraints pre-
sent. Under optimal circumstances, the L 2 string is initiated and processed
entirely on the basis of the acquired L 2 system without the Monitor being
activated. Most L 2 speakers are obviously located somewhere between the
two extremes: at least occasionally they have to rely on Li systems for speech
Psycholinguistic models 401
processing. This is the case when an L 2 unit has not been acquired when
there is a gap and the Monitor fails to give the right answer. L t influence
on surface strings may be due to the fact that (1) the string has been initiated
by the acquired Li system and the Monitor has not been able to correct the
string, (2) the Monitor lacks the correct 'rule' and an L j rule is used as a repair,
or (3) strings originally initiated by the correct L 2 system are mutilated by
the learned system (Sajavaara 1978b). The L j and L 2 acquired and learned
systems are closely interlinked in the light of the model presented above,
and both systems are referred to several times during speech production,
which may be one of the reasons for the variable performance by the same
speakers in different situations.
Second language acquirers may experience problems for several reasons
(see Sajavaara 1980): (1) A rule cannot be retrieved. (2) A rule is inaccurate
or wrong. (3) The speaker refuses to apply the rule entirely or in some situa-
tions (Krashen's (1978:1011) Monitor under-users are characterized by not
applying the rules even if they know them, and Faerch and Kasper's (1980)
strategy of formal reduction relates to instances where the insufficiency of the
linguistic system makes it necessary for the speaker to make do with "less of
their interlanguage repertoire than is in fact at their disposal"), or he wants to
avoid errors or facilitate speech production to reach higher levels of fluency
without distorting the message. (4) The rule has not been acquired at all (the
learner has not reached the stage where the rule is acquired, the rule has not
been taught, the rule cannot be taught because its exact formulation is not
possible or is so complicated that it is functionally unwieldy, or the learner is
unable to internalize explicitly formulated rules). (5) The language user exag-
gerates the importance of monitoring, i.e. the control of the processing
mechanism (Krashen's (1978:10-11) Monitor over-users), and makes an at-
tempt to apply explicit rules all the time. In L 2 , t h e problems are the greater
the further away we move from Lj-type language acquisition in naturalistic
situations, because the system can be expected to be more and more defective
and based on more explicit formal properties. It also becomes more delicate
and more liable to malfunction due to external factors, since more perform-
ance potential is normally required for second language processing.
In early contrastivist hypotheses, the distance between languages was seen
to be indicative of the degree of learning difficulty. The distance is important
in an indirect way: a learner's reactions to problems in L 2 processing may re-
flect his mental image of the potential correlation between Li and L 2 struc-
tures. This results in the speaker's judgments on what is transferable or not
(see Ringbom 1979, Sharwood Smith 1979:350).
In contrastive analysis aiming at applied goals, too much attention may
have been paid to the transfer of individual structures. According to Felix
402 Kari Sajavaara
7 Conclusion
One of the most important observations based on what has been said above
concerns the high degree of task dependency of speech production and recep-
tion. There are few factors that remain constant under all circumstances,from
speaker to speaker, and with the same speaker. The whole system aims at ef-
ficiency, which does not however mean the economy of surface features (cf.
Sajavaara 1978a:57-59).
At the onset of the second language learning process, the learner's cue
detection mechanism is tuned to the phenomena and processes of his first lan-
guage. He tends to hear the target-language utterances in terms of categories
and structures of his native language, and it is not surprising at all that he also
substitutes elements of his first language for the target structures. Unless
various plans and programmes are 'acquired', to use Krashen's dichotomy, the
speech processing mechanism has to resort to 'learned' items. This obviously
requires more processing capacity. Insufficient acquisition can be compen-
sated by increased capacity or low task demands. Other simultaneous tasks,
emotional states, and high loads on problem-solving capacity may also affect
the processing system.
One of the most important parts of the foreign-language learning process
involves a continuous reorganization and completion of the cueing mechanism,
a 'new deal' of the cards of the communication game. When acquiring the cor-
rect Gestalts of target-language items, the learner also tunes the perceptual
mechanism to pick up such Gestalts. The whole process is subconscious for
the most part, and it is impossible to see how it could be directly interfered
with. It might seem that the only way to this reorganization is through the
right kind of input, by exposing the learner to the kind of L 2 material which
brings about the shift as a gradual process. At the present moment we lack
Psycholinguistic models 403
Notes
1 . 1 wish to acknowledge my debt to Jaakko Lehtonen, who has participated in the pro-
duction of the model. The model should be considered as a representation of the
problem points in language processing and an attempt to correlate linguistic elements
to other elements. It is not meant to be a representation of the flow of data in the
processing mechanisms: thus the arrows indicate relationships and channels for feeding
in data (see Lehtonen 1980a).
2. No attempt will be made here to deal with the rich literature on second language learn-
ing strategies (see, e.g., Faerch - Kasper 1980).
References
d'Anglejan, A.
1978 "Language learning in and out of classrooms", Understanding second and
foreign language learning, edited by J. C. Richards (Rowley, Mass.: Newbury
House), 2 1 8 - 2 3 6 .
Bialystok, E.
1978 "A theoretical model of second language learning", Language learning 28:
69-83.
Buckingham, . - H. Hollien
1978 "A neural model for language and speech", Journal of phonetics 6 : 2 8 3 - 2 9 7 .
Canale, . - M. Swain
1980 "Theoretical bases of communicative approaches to second language teaching
and testing", Applied linguistics 1 : 1 - 4 7 .
404 Kari Sajavaara
Carroll, J. . - . G. Bever
1976 "Sentence comprehension", Language and speech (= Handbook of percep-
tion 7), edited by E. C. Carterette and M. P. Friedman (New York: Academic
Press), 3 0 0 - 3 4 4 .
Carroll, J. . - . K. Tanenhaus - T. G. Bever
1978 "The perception of relations: the interaction of structural, functional, and
contextual factors in the segmentation of sentences", Studies in the percep-
tion of language, edited by W. J. M. Levelt and G. B. Flores d'Arcais (New
York: Wiley), 1 8 7 - 2 1 8 .
Chomsky, N.
1957 Syntactic structures (The Hague: Mouton).
Clark, . . - . V. Clark
1977 Psychology and language (New York: Harcourt).
Cummins, J.
1979 "Linguistic interdependence and the educational development of bilingual
children", Review of educational research 49:222 - 2 5 1 .
Davis, S. M.
1978 "Audition and speech perception", Bases of language intervention, edited by
R. L. Schiefelbusch (Baltimore, Md.: University Park Press), 4 3 - 6 6 .
Dittmar, N.
1978 "Ordering adult learners according to language abilities", Papers from the
First Scandinavian-German Symposium on the Language of Immigrant Work-
ers and their Children, edited by N. Dittmar, H. Haberland, T. Skuttnabb-
Kangas, and U. Teleman (Roskilde: Universitetscenter), 1 1 9 - 1 4 7 .
Dulay, . - M. Burt
1977 "Remarks on creativity in language acquisition", Viewpoints on English as
a second language, edited by M. Burt, H. Dulay, and M, Finocchiaro (New
York: Regents), 9 5 - 1 2 6 .
van Ek, J.
1980 Threshold level English (Oxford: Pergamon).
Ellis, A. W.
1979 "Speech production and short-term memory", Structures and processes (=
Psycholinguistic series 2), edited by J. Morton and J. C. Marshall (London:
Paul Elek), 1 5 7 - 1 8 7 .
Faerch, C.
1979 Research in foreign language pedagogy: the PIF project (= Anglica and
Americana 7) (Copenhagen: University of Copenhagen).
Faerch, C. - G. Kasper
1980 "Processes and strategies in foreign language learning and communication",
ISB 5 : 4 7 - 1 1 8 .
Felix, S. W.
1978 Linguistische Untersuchungen zum natrlichen Zweitsprachenerwerb (Mn-
chen: Fink).
Fillmore, L. W.
1976 The second time around: cognitive and social strategies in second language
acquisition (Ph. D. dissertation, Stanford University).
Fisiak, J.
1980 "Some notes concerning contrastive linguistics", AILA bulletin 2 7 . 1 : 1 - 1 7 .
Flores d'Arcais, G. B.
1978 "The perception of complex sentences", Studies in the perception of lan-
guage, edited by W. J. M. Levelt and G. B.Flores d'Arcais (New York: Wiley),
155-186.
Foss, D. J. - M. A. Blank
1980 "Identifying the speech codes", Cognitive psychology 1 2 : 1 - 3 1 .
Psycholinguistic models 405
1978 "How can the theory and methods of speech sciences contribute to contras-
tive analysis?" (Paper read at the 16th International Conference on Polish-
English Contrastive Linguistics, Boszkowo, Poland, December 1978). Forth-
coming.
1980 a "Psykolingvistiska aspekter p en finnes frmaga att frst skandinaviska
sprk" (Paper read at the symposium Internordisk sprkfrstelse, Rungsted-
gaard, Denmark, March 1980). Forthcoming.
1980b "The communicative approaches to speech and language and the study of
grammar" (Comment paper read at the International Workshop on Psycho-
linguistic Models of Speech Production, University of Kassel, July 1980).
Forthcoming.
Lehtonen, J. - K. Sajavaara
1980 "Phonology and speech processing in cross-language communication", Theo-
retical issues in contrastive phonology, edited by S. Eliasson (Heidelberg:
Julius Groos). Forthcoming.
Levelt, W. J. M. - G. B. Flores d'Arcais (eds)
1978 Studies in the perception of language (New York: Wiley).
Lindsay, P. H. - D. A. Norman
1977 Human information processing (New York: Academic Press).
Lyons, J.
1977 Semantics 1 - 2 (Cambridge: University Press).
Marslen-Wilson, W. D. - A. Welsh
1978 "Processing interaction and lexical access during word recognition in con-
tinuous speech", Cognitive psychology 1 0 : 2 9 - 6 3 .
Marslen-Wilson, W. D. - L. K. Tyler
1980 "The temporal structure of spoken language understanding", Cognition 8:
1-71.
Marslen-Wilson, W. D. - L. K. Tyler - M. Seidenberg
1978 "Sentence processing and the clause boundary", Studies in the perception of
language, edited by W. J. M. Levelt and G. B. Flores d'Arcais (New York:
Wiley), 2 1 9 - 2 4 6 .
Meisel, J.
1980 "Strategies of second language acquisition", Wuppertaler Arbeitspapiere zur
Sprachwissenschaft 3 : 1 - 5 3 .
Morton, J.
1979 "Word recognition", Structures and processes (= Psycholinguistic series 2),
edited by J. Morton and J. C. Marshall (London: Paul Elek), 1 0 7 - 1 5 6 .
Munby, J.
1978 Communicative syllabus design (Cambridge: University Press).
Ringbom, H.
1979 "The English of Finns, Swedes, and Swedish Finns: some concluding re-
marks", The perception and production of English: papers on interlanguage
(= AFTIL 6), edited by R. Palmberg (Abo: Abo Akademi), 7 7 - 8 5 .
1980 "On the distinction between second language acquisition and foreign language
learning", AFinLA Yearbook 1980: Papers in language learning and language
acquisition (= Publications of the Finnish Association of Applied Linguistics
AFinLA 28), edited by K. Sajavaara, A. Rsnen, and T. Hirvonen (Jyvskyl:
AFinLA), 3 7 - 4 4 .
Ruder, K. F. - A. Finch
1980 "Toward a cognitive-based model of language production" (Paper read at the
International Workshop on Psycholinguistic Models of Speech Production,
University of Kassel, July 1980). Forthcoming.
Psycholinguistic models 407
Sajavaara, K.
1977 "Contrastive linguistics past and present and a communicative approach",
Contrastive papers (= Jyvskyl contrastive studies 4), edited by K. Sajavaara
and J. Lehtonen (Jyvskyl: Department of English, University of Jyvskyl),
9-30.
1978a "The Monitor Model and monitoring in foreign-language speech communica-
tion", Second language acquisition and foreign-language teaching, edited by
R. C. Gingras (Arlington, Va.: Center for Applied Linguistics), 5 1 - 6 7 .
1978b "The Monitor Model and contrastive analysis" (Paper read at the 16th Inter-
national Conference on Polish-English Contrastive Linguistics, Boszkowo,
Poland, December 1978). Forthcoming.
1980 "Second language speech production: factors affecting fluency" (Paper read
at the International Workshop on Psycholinguistic Models of Speech Produc-
tion, University of Kassel, July 1980). Forthcoming.
Sajavaara, K. - J. Lehtonen
1980a "Language teaching and acquisition of communication", AFinLA Yearbook
1980: Papers in language learning and language acquisition (= Publications of
the Finnish Association of Applied Linguistics 28), edited by K. Sajavaara,
A. Rsnen, and T. Hirvonen (Jyvskyl: AFinLA), 2 5 - 3 5 .
1980b "Prisoners of code-centred privacy: reflections on contrastive linguistics and
related disciplines", Papers in discourse and contrastive discourse analysis (=
Jyvskyl contrastive studies 5), edited by K. Sajavaara and J . Lehtonen
(Jyvskyl: Department of English, University of Jyvskyl), 7 - 2 6 .
Schumann, J.
1978 "The acculturation model for second language acquisition", Second language
acquisition and foreign-language teaching, edited by R. C. Gingras (Arling-
ton, Va.: Center for Applied Linguistics), 2 5 - 5 0 .
1979 "Three theoretical perspectives on second language acquisition" (Paper read
at the First Nordic Symposium on Interlanguage, Hanasaari, Finland, August
1980).
Seliger, . W.
1977 "Does practice make perfect?: a study of interaction patterns and L2 com-
petence", Language learning 2 7 : 2 6 3 - 2 7 8 .
1980 "Strategy and tactic in second language acquisition" (manuscript).
Selinker, L. - J. Lamendella
1978 "Two perspectives on fossilization in interlanguage learning",ISB 3 : 1 4 3 - 1 9 1 .
Shannon, C. - W. Weaver
1949 The mathematical theory of communication (Urbana: Illinois UP).
Sharwood Smith, M.
1979 "Strategies, language transfer and the simulation of the second language
learner's mental operations", Language learning 2 9 : 3 4 5 - 3 6 1 .
Slobin, D. I.
1979 Psycholinguistics. 2nd edition(Glenview, 111.: Scott,Foresman,and Company).
Smith, N. - D. Wilson
1979 Modern linguistics: the results of Chomsky's revolution (Harmondsworth:
Penguin).
Snow, C. - C. A. Ferguson (eds)
1977 Talking to children (Cambridge: University Press).
Tarone, E.
1974 "Speech perception in second language acquisition: a suggested model",
Language learning 2 4 : 2 2 3 - 2 3 3 .
Widdowson, . G.
1978 Teaching language as communication (London: Oxford University Press).
1979 Explorations in applied linguistics (London: Oxford University Press).
Wilkins, D. A.
1976 Notional syllabuses (London: Oxford University Press).
Wode, H.
1979 Studies in second language acquisition (= Occasional papers 11) (Singapore:
SEAMEO Regional Language Centre).
1980 "Language acquisitional universals - L j , L2, pidgins, and foreign language
teaching", AFinLA Yearbook 1980: Papers in language learning and language
acquisition (= Publications of the Finnish Association of Applied Linguistics
AFinLA 28), edited by K. Sajavaara, A. Rsnen, and T. Hirvonen (Jyvskyl:
AFinLA), 5 - 1 5 .
MICHAEL SHARWOOD SMITH
gave rise to such notions as the idea that children begin with deep structures
and then gradually acquire the appropriate transformations (see McNeill 1970
and White 1980 for discussion); a similar line of attack has been taken in
sociolinguistics where, for example, simple pidgins are compared with com-
plex Creoles. Again hypotheses were developed about language processing
whereby the transformational processes in the grammar were claimed to mir-
ror the routines carried out to produce and comprehend speech. Apart from
the fact that experimental evidence was not able to satisfactorily confirm this
idea of 'derivational complexity', the theoretical backing from linguistics has
been radically reduced since the revised versions of EST assume a much
simpler transformational component which can in principle generate a large
number of unacceptable surface structures. More general constraints are for-
mulated in various parts of the grammar in order to contain its power, such as
restrictions on phrase structure, like X-theory, proper binding in logical form,
and various mechanisms, like filters, for ruling out ungrammatical surface
structures (see White 1980:1 Iff.). Universal grammar is in fact a set of general
constraints on possible grammars rather than a kind of very deep base from
which all specific grammars may be derived. The term 'grammar' is therefore
somewhat misleading when applied to UG. These general constraints are what
the child must have a priori in order to arrive at an adequate grammar for the
data it is exposed to in such a short time, so runs the argument, and consider-
ing how degenerate that data are (see Hornstein - Lightfoot 1980). A child
must be able to rule out a large number of possible accounts, and linguists
seeking to account for linguistic data must make learnability a central criterion
in choosing between possible analyses. This criterion has been characterized
by Steven Pinker as extremely stringent; he defines a number of relevant con-
ditions that a theory would have to meet, namely the learnability condition,
the equipotentiality condition and the cognitive condition. These require that
the theory explain (see Pinker 1979:215):
(5) how learning can take place in accordance with empirical find-
ings in the study of child language (DEVELOPMENTAL)
(6) how learning takes place given the child's cognitive and per-
ceptual immaturity (COGNITIVE)
Chomsky has recently stressed that even the notion of acquisition is mis-
leading as a characterization of what the child achieves and he prefers the
term "attainment" which places more emphasis on the amount of a priori
knowledge that a child has (Chomsky 1980:9) and establishes more clearly
the role of primary data (exposure/experience) as a 'triggering' one. It is im-
portant to note that the principles of UG are not generally treated as 'impera-
tives': the child is not conceived as one who (unconsciously or intuitively, of
course) goes around looking for ways of applying them. Rather, once the con-
ditions for their application are part of the child's current grammar, the prin-
ciples simply emerge to take care of the data. This almost (ironically) has a
Piagetian ring to it, that is that there are apparently certain developmental
stages where some important restructuring of the current system has to take
place: the child has developed to such an extent that it ceases simply to assi-
milate everything but accomodates itself (adapts its internal representations)
to cope with what it experiences. 2 The actual means for doing this are given:
they are part of man's biological endowment. They lie dormant until the
right moment. Chomsky has of course disassociated himself from Piagetian
thought; there does seem to be some kind of parallel nevertheless. On the
other hand, some claims seem to imply that the child also has 'working as-
sumptions' about the nature of the specific language it is acquiring (or 'attain-
ing'). Take for example the notion of filters (see Chomsky Lasnik 1977).
These are designed to rule out certain overgeneralizations that might be
thrown up by the operation of core rules. In one account (discussed in White
1980:103 ff.) filters are represented as working assumptions which hold unless
positive evidence turns up to show that the filters are violated, in which case
the child may relax the constraint by dropping the filter. The filter in itself is
a complication or addition in UG but as a mechanism that is claimed to be a
typical feature of human language, it will simplify the task for the majority of
learners. This bears directly on notions of markedness. In this account a given
filter is assumed to be the unmarked case: this is the orientation that UG gives
the child in tackling the task of attainment. White (ibid.) also discusses Carol
Chomsky's work (Chomsky 1969) on the acquisition of verbs with regard to
to-complements like promise and ask.3 The former is marked in that it always
assigns control to subject: the subject of the complement verb is always the
subject of promise. The latter is marked in that it sometimes behaves like
promise (when it has the meaning of 'question') and sometimes like the more
usual (unmarked) type of verb (when it has the meaning of 'request') (White
1980:104), for example the verb tell which follows core grammar and assigns
control to the nearest NP (the NP in the complement of the matrix verb).
C. Chomsky's findings seem to support the idea that not only were unmarked
verbs mastered first but that the unambiguously marked verb, promise, was
Learnability and L2 acquisition 413
mastered before the partially marked verb,ask. White has this to say: "There
does seem to be a correlation between grammatical complexity and difficulty
of acquisition. [ . . . ] It would be a pity if degrees of complexity could not be
captured in some way by the concept of markedness" (1980:106).
This general observation concerning the difficulty of learning the more
particular rule holds, of course, even for theories of learning where little or no
a priori knowledge is assumed. Here we are to understand it in relation to the
notion of core grammar and indeed universal grammar as well.
The recreationist trend in second language acquisiton 4 studies centres
around the assumption that the acquisition mechanisms operating in primary
(first) language acquisition and touched on above do not atrophy but may in
principle still operate for non-primary languages as well. This thesis began in
one sense as a denial of the importance of mother tongue interference (see
Dulay - Burt 1975) and in another sense as an attempt to carry out the type
of morpheme studies pionieered by Roger Brown, and de Villiers and de Vil-
liers (see Brown 1973, de Villiers - de Villiers 1974) in the field of second
language acquisition. This has been done much more extensively now in the
latter field than in the former and a much quoted study by Bailey, Madden
and Krashen (1974) suggests a universal developmental order for young and
adult language learners (and Broca's aphasics) irrespective of the mother
tongues involved or the instructional backgrounds, as far as these grammatical
morphemes were concerned. Failure by second language learners to achieve
native-like grammars of the target language is no longer regularly attributed to
the supposed atrophying of the acquisition mechanisms that take care of first
language attainment. Various cognitive and affective factors are posited as
typical obstacles to achieving native speaker level.
White's concluding observations in her thesis where she considers the
interaction between acquisitional theory and the theory of grammar touch
upon limitations that must exist as far as what predictions can be derived from
grammatical theory. These limitations are indeed factors such as cognitive and
perceptual immaturity, limitations on memory and the like (White 1980:113).
Interestingly, Gass and Ard have recently taken a similar line to argue that
second language acquisition provides better opportunities for the investigation
into principles of UG than does first language acquisition 5 (see Gass Ard,
forthcoming). They claim that it is precisely the lack of the typical obstacles
mentioned by White and others (see for example Pinker 1979 and Wexler
Culicover 1980) that makes non-primary language acquisition, by mature
learners (presumably), a more attractive proposition. To back up this claim
Gass carried out an investigation into the acquisition of rules of relativization,
basing her study on first language work by Amy Sheldon (see Sheldon 1972,
1978) and the typological claims of Keenan and Comrie(see Keenan Comrie
414. Michael Sharwood Smith
(the first item in the pair characterizes the Head NP in the matrix sentence;
the second refers to the NP rel in the subordinate clause). If the accessibility
hierarchy had been adhered to, so to speak, the SS and OS types should have
been easiest; in fact, the rank order was: SS 0 0 OS SO (roughly: 50%, 49%,
27% and 17%, respectively). Gass' study with second language learners who
were adults and came from typologically diverse language backgrounds, and
which dealt with twelve types of relativization, was in agreement with the ac-
cessibility hierarchy (see Gass 1979). Gass and Ard (forthcoming) formulate
their conclusions as follows: "Second language acquisition is a fertile ground
for testing theories about language acquisition and particular proposed uni-
versals".
It is clear that contrasting two or more languages is provided with a very
specific orientation if the learnability issue is accepted as a central concern.
It is also clear that investigation into non-primary language learning is going
to benefit directly from this research if learnability claims are linked up with
recreationist claims: this is particularly striking where notions of markedness
are the focus of interest. Gass and Ard's claim is a very important one in this
connection if indeed the enormous cognitive and perceptual obstacles that are
supposed to obscure first language data (spontaneous and experimental) prove
to be a serious problem in child language research carried out with learnability
in mind. On the other hand, one is allowed to wonder whether factors specific
to second language acquisition might also prove exceedingly problematic as
well if learnability was made the major concern in this second branch of in-
quiry. Wexler and Culicover (1980:7ff.) point to the limitations the first lan-
guage acquirer has and include the idea that the child has to handle limited
data at any one time (cf. Nemser's "blinding flash fallacy" in Nemser 1971).
The fact that the second language acquirer has already got a fully developed
Learnability and L2 acquisition 415
grammar (the L j ) has surely to be taken into consideration as well: this 'new'
a priori knowledge may play an important role as mediated by the learner's
perception of likely relationships between L j and L 2 (see Sharwood Smith
1979, Kellerman 1980). Where the target language is perceived as distant, the
universal processes may well operate. Where a close native-target relationship
is perceived, language transfer may predominate in such a way as to make
learners adopt marked structures in agreement with their own native language
(see, however, Eckman 1977 and Kellerman 1979) before correct unmarked
structures. This is an empirical question and one cannot rule out in advance
the possibility that learnability and recreationist claims wl contribute in
some measure to the advancement of knowledge in this area. Some will no
doubt prefer to hang on to the idea that contrastive analysis is first and fore-
most a useful, heuristic tool in descriptive linguistics and as a presentation
device in teaching as well as an aid in error analysis. At this stage in the game
it is difficult to know where to put one's money unless one option can be
ruled out a priori as logically absurd or simply unrealizable. There are certain-
ly indications that more thorough investigations are needed into the links be-
tween different kinds of acquisition on the one hand and linguistic theory on
the other. The evidence put forward to date is interesting and suggestive but
hardly adequate at the moment to draw any hard and fast conclusions. For
people in second language acquisition studies it should be interesting to adapt
and use Pinker's six conditions as a framework for comparing first and second
language acquisition in the context of (relevant) current work on transforma-
tional generative grammar. For theoretical linguists, it should be interesting to
see if cross-language observations and claims may have empirical support from
second-language acquisition studies. The interdisciplinary character of con-
trastive studies, particularly as carried out in Poland, should provide an ex-
cellent forum for these purposes. In other words, it should now be possible to
have what has not happened before, that is, a principled dialogue between
pure, theoretical linguists and people investigating second language learning:
problems of learning difficulty, of cross-language analysis and general typo-
logy are provided with a common, unifying framework.
Notes
1. Thanks are due to Paul van Buren, Eric Kellerman, Rob van Oirsouw and James Pank-
hurst for supplying comments to an earlier draft of this paper at rather short notice.
2. Chomsky's position still remains distinguishable from Piaget's of course in that the
former argues for linguistic universale and the latter preferred to see things in terms of
general cognitive universals: this at least was Piaget's established position.
416 Michael Sharwood Smith
References
Matthei, E.
1981 "Children's interpretation of sentences containing reciprocals", Language
acquisition and linguistic theory, edited by S. L. Tavakolian (Cambridge,
Mass.: M.I.T. Press), 9 7 - 1 1 5 .
McNeill, D.
1970 The acquisition of language: the study of developmental psycholinguistics
(New York: Harper and Row).
Nemser, W.
1971 "Approximative systems of foreign language learners", IRAL 9 : 1 1 5 - 1 2 3 .
Pinker, S.
1979 "Formal models of language learning", Cognition 7 : 2 1 4 - 2 8 3 .
Sharwood Smith, M.
1980 "Strategies, language transfer and the simulation of the second language
learner's operations", Languge learning 2 9 : 3 4 5 - 3 6 1 . Reprint of same in
ISB 4 (1979):66-83.
Sheldon, A.
1972 The acquisition of relative clauses in English, unpublished Ph. D. dissertation
(Austin: University of Texas).
1978 Assumptions, methods and goals in language acquisition research (Blooming-
ton: Indiana University Linguistics Club).
de Villiers, J. - P. de Villiers
1973 "A cross-selectional analysis of grammatical morphemes in child speech",
Journal of psycholinguistic research 2 : 2 6 7 - 2 7 8 .
Wexler, K. - P. W. Culicover
1980 Formal principles of language acquisition (Cambridge, Mass.: M.I.T. Press).
White, L.
1980 Grammatical theory and language acquisition (Bloomington: Indiana Uni-
versity Linguistics Club).
ALEKSANDER SZWEDEK
I want to begin this paper with a superficial statement that it has been admit-
ted for some time now that sentence grammars are not able to cope with cer-
tain language phenomena. There are linguistic facts within and between sen-
tences (such as pronominalization, ellipsis, thematic structure) that can be
accounted for only in the framework of a larger context. As in other kinds of
linguistic comparison there are two problems in connection with text analysis:
the problem of equivalence and semantic representation on the one hand, and
comparison of certain surface phenomena on the other hand. This paper will
discuss some difficulties connected with those two problems.
It is obvious, that a) and b) are variants of the same material distinction be-
tween structures that do not surface and function as presuppositions, and
structures that surface and function as sentences. Since presuppositions seem
to play a crucial role in determining equivalence it is necessary to define their
status in the semantic representation more precisely.
crucial for our understanding of text formation, have been inconsistent and
vague.
The ambiguity can be traced back to Mathesius' (1939) description of the
structure of an utterance in terms of actual sentence partition or contextual
organization. Applying this description to Czech, Mathesius stated that word
order in non-emotive utterances directly reflects the actual partition. Every
sentence consists of a starting point which precedes a nucleus or core. The
starting point is defined as "that which is known or at least obvious in the
given situation and from which the speaker proceeds", while the nucleus is
"what the speaker states about, or in regard to, the starting point of the utter-
ance". At the same time, however, Mathesius described the theme ('zaklad') as
something "that is being spoken about in the sentence", and the core ('jadro') as
"what the speaker says about this theme". Thus in the contextual organiza-
tion of the sentence Mathesius identifies the starting point with known infor-
mation. As, however, has been pointed out time and again, the two need not
coincide, particularly in languages with grammatically determined word order.
As a matter of fact Mathesius seems to have described the starting point in
three ways, as a) known information, making no distinction between known
from the text and known from the situation, b) the first element of the utter-
ance from which the speaker proceeds, c) what is spoken about.
It must be remembered, however, that while identifying "the beginning of
the sentence" is a purely mechanical procedure, telling us nothing about its
role in the text, the phrase "what is spoken about" is vague and in some cases
may not be easy to determine. It is not clear that the notion "the first element
of the sentence" is necessary at all. As is well known the beginning of the sen-
tence in Slavic languages is associated with given information, while in lan-
guages like English with the subject. If we ignore the given information con-
cept in Slavic languages and the concept of the subject in English, what is left
is that the beginning of the sentence is the beginning. On the other hand, in
non-emotive utterances "what is spoken about" in Slavic languages is the given
information and "what is spoken about" in English is the subject. A question
arises whether the topic of a non-emotive utterance is of the same nature as in
the so-called topicalized structures. Or, if we hold that what is spoken about
in Slavic languages is not the first element (or given information) then what is
spoken about would be the subject. In that case "what is spoken about"
would be the subject in both English and Slavic languages. On the other hand,
there is no doubt that we need the concept 'topic' in the description of topi-
calized structures. It seems that in topicalized structures the aim of proposing
is to emphasize or contrast an element (in some cases though, as Enkvist
(1976:5) points out, "a constituent has to be moved simply to get it out of
the way because that final position is reserved for other [ . . . ] more important
CA and text linguistics: some problems 423
(with the interpretation in which HIM = John) we have to recognize the fol-
lowing relations:
a) HIM is definitely context bound,
b) HIM is stressed, i.e. has the highest degree of CD (if that
means anything), i.e. is the rheme.
426 Aleksander Szwedek
Thus, what we have here is a context bound rheme which contradicts Firbas's
(1972:82) description of the relation between the two features: "non-thematic
elements are always contextually independent".
Example (6) shows that one of the main problems that badly needs clarifi-
cation is what is new and what is given information in the sentence. Chafe
(1976:32) vaguely says that "givenness is a status decided on by the speaker"
and that "it is fundamentally a matter of the speaker's belief that the item is
in the addressee's consciousness, not that it is recoverable (Halliday)". Similar-
ly vague is Firbas's (1975) description of the "narrow scene" and context-
dependent elements. Elements are context-dependent, according to him, when
he piece of information they convey is derivable (or recoverable) from the
preceding verbal context and/or refers to some elements of the immediate
situational context. Also Dahl (1976:41) distinguishes between onstage con-
cepts and off-stage concepts which, he says, are similar to contextually bound
and contextually non-bound elements. On the other hand he makes a distinc-
tion between on-stage elements and definite elements when he says: ,.known
concepts whether they are on-stage or off-stage are referred to with the
help of definite noun phrases". If, however, we accept the distinction between
on-stage and known (= definite), then Dahl's description of on-stage concepts
as already present in the addressee's consciousness, and off-stage concepts as
those that must be activated, i.e. retrieved from some deeper place in his
mind (such as his long-term memory), has also a high degree of vagueness of
which Dahl accuses Chafe.
Th? problem of the context and its relation to theme/rheme seems to be
even more complex as the discussion of Akmajian and Jackendoff's (1970) ex-
ample will show. On the other hand it is hoped that the discussion will indicate
ways of solving the problem of criteria for given/new distinction. Akmajian
and Jackendoff s examples are pretty straightforward. In the sentence
(7) John hit Bill and then George hit him
if him is unstressed it refers to Bill, if it is stressed it may refer to John or
some outsider. In any case it does not, when stressed, refer to Bill. With the
standard definition of 'given', however, we would not be able to get the inter-
pretation where him refers to John, because him referring to John could not
be treated as new information and thus stressed. Therefore, in such a case we
have to look for 'new' information in other features of the relation between
John and him. It should be noticed that John is subject and agent, while him
is object and patient, which means that the relations between him and John
are different than between him and Bill, the latter pair exhibiting agreement
in grammatical function and semantic role. Similarly in a version of the above
example:
CA and text linguistics: some problems 427
References
1 NP Movement
What I want to claim is that there is no NP Movement in Polish. Passives and
other constructions whose transformational derivation would involve a
change of the syntactic function of an NP, reflected in its case ending, are
base derived. The natural consequence of this is that all potential arguments
for NP Movement applicable in English should fail in Polish.
Admittedly, not many such arguments exist. The old ones, based on vari-
ous co-occurrence facts, are no longer valid in a grammar which accounts for
selectional restrictions with the help of semantic well-formedness conditions
and which employs such devices as lexical redundancy rules to account for
the relationship between sentence types.
I am not going to present any new arguments for NP Movement. Instead,
I will mention two interesting facts about English passives noticed in recent
years by other people. It seems that these facts pose problems for an analysis
which derives all passives by base rules.
The first of them concerns peculiar constraints on adjectival passives (non-
transformational, according to the Lexicalist Hypothesis). These constraints
are not shared by verbal passives.
Wasow (1977) observes that (5)-(6) are ill-formed:
(4) *John is unknown to be rich.
(5) *John looks given a book.
What is interesting is that there are no idioms which take the form of a passive
sentence and whose active counterparts could be interpreted only literally.
A possible way to explain this would be to assume:
a) that there is a passive transformation and
b) that idiom interpretation rules apply at the deep structure level.
Under these assumptions, idioms will be listed in the lexicon as active
(base derived) structures, while the proper interpretation of the hypothetical
"exclusively passive idioms" would require a reference to their surface pas-
sive form.
No such explanation for the absence of the "exclusively passive idioms"
appears to be available within an approach which derives all passives by base
rules. The structural asymmetry between passive and active sentences, which
follows from transformational analysis, is destroyed here.
Let us examine the situation in Polish.
First of all, Polish has no structural counterparts of those English passives
which Wasow claims cannot be base derived. (11)(13), which correspond to
English ( 7 ) - ( 9 ) respectively, are all ill-formed:
2 RTJ-Movement
Resnic
To rule out constructions like (29) below, Chomsky (1979) proposes the fol-
lowing principle called Resnic.
(29) *Who1 does John believe that t, will come?
Resnic:
1 > r fei cannot be free in S
nom l 1
where "free" means not co-indexed with another element
and NP nom [e] is a trace of the subject in a tensed clause
There are ways in which this apparent violation of Resnic can be accomo-
dated within a trace-theoretic approach (PRO drop parameter). They involve
considerable theoretical subtlety and I will not discuss them here. What is
important is that the problem can be solved in a straightforwards way if we
assume that the moved w/z-word has not left a trace.
Nesting Hypothesis
Kster (1978) argues that Universal Grammar contains a condition which he
calls Nesting Hypothesis.
Nesting Hypothesis:
A marked empty node m e [not dominated immediately by VP,S
or S - T.Z] is unacceptable when it is nested within a domain of
type V* S (main projection), where V' S* are minimal
NP/GOC:
The following sequence will always be interpreted by relating the
leftmost NP to the rightmost trace and the rightmost NP to the
leftmost trace:
NP . . . NP . . . t . . . t
If wh-movement leaves a trace, the following sentence is a clear counterexam-
ple to the condition (Polish allows more than one w/z-word in Comp):
(34) KtOi kogo2 [ti zaprosit t 2 ]
who whom invited
'Who invited whom?'
Another apparent violation of the condition is shown in (35) where w/?-move-
ment interacts with clitic movement.
(35) Komu jej2 kazales ti pomoc t 2
whom her you-ordered to-help
'Whom did you order to help her? '
Consider, however, sentences (36) and (37) where both /-movement and
clitic movement extract elements from within NP's.
(36) Czyjeji memu2 kazales [t 2 bratu]panbc [t 2 zonie]
whose my you-ordered brother to help wife
'Whose wife did you order my brother to help? '
(37) ? *Czyjemui mej2 kanzales [tjbratu] pomoc [t 2 zom'e]?
whose my you-ordered brother to help wife
'Whose brother did you order to help my wife? '
(36) is clearly more acceptable than (37). 7 It seems that NP/GOC is obeyed in
these cases.
The situation changes radically when one of the two interacting movements
affects a whole NP instead of a part of it.
(38) [Czyjemu bratu]l m0c2radzilem t t pomoc t 2 zonie
whose brother my I-advised to-help wife
Similarly, the grammaticality of (34) could be contrasted with the ill-formed-
ness of (39).
(39) *Ktryi ktrq2 [t t chtopiec] zaprosii [t 2 dziewczyne]
which which boy invited girl
This seemingly puzzling configuration of data (in one language all traces obey
NP/GOC, in the other only the NP internal ones) can be explained rather
naturally on three assumptions:
Movement rules in English and Polish 439
It seems, and I believe that a more careful analysis will support this claim,
that the explanation of the fact that wfc-movement obeys both the w/z-Island
and the Complex Noun Phrase Constraint requires reference to principles
other than subjacency.
At this point, I can only give a very tentative answer to the second crucial
question. Of the syntactic processes described by generative grammarians,
Polish w/z-movement seems to resemble most a 'stylistic' rearrangement rule.
Thus, it will be related to the rule or rules responsible for 'scrambling' of con-
stituents in the Polish sentences in (44).
(44) Janek dal ksiqik? Mary si.
John gave book to-Mary
Janek dal Marysi ksiqzkg.
Janek Marysi dal ksiqzkf.
Marysi Janek dal ksiqzkq.
Ksiqzk$ dal Janek Marysi.
etc.
who I n 0 t q * a ^ o u t your
Of course, there are problems with this proposal. Contrary to the model of
grammar proposed in Chomsky - Lasnik 1977, 'stylistic' rules will have to
apply in Polish before semantic interpretation rules (and filters). Independent
Movement rules in English and Polish 441
3 Conclusions
In a number of cases, Polish counterparts of the transformationally derived
English constructions have been shown to possess properties which create
problems for a transformational analysis.
It is, of course, possible that in each particular case the problem can be
reduced to the way the parameters of Universal Grammar are fixed in the
two languages compared. Unfortunately, it is difficult to imagine a solution
of this sort which could account for the full range of facts considered and
which would not consist of a number of fragmentary, unrelated explanations.
If we accept the hypothesis as to non-transformational character of Polish,
such a unified account becomes possible.
What has been presented here is naturally only a tentative proposal. To
advance it beyond this initial stage would require, first of all, a theory of non-
transformational "stylistic" movements, within which we should be able to
account for the various properties of Polish w/2-fronting. Unfortunately, little
work has been done in this area, particularly important for the "free word
order" languages, within the EST framework.
Movement rules in English and Polish 443
Notes
1. The paper is a report on a work in progress. Some of the arguments merely signalled
here are developed in Zabrocki (1981) and in Zabrocki (in preparation).
2. The asymmetry of idioms argument is a reinterpretation of the one given in Chomsky
1980.
3. The weakness of the argument as presented here is that it fails to explain why there
are not (at least I could not find any) exclusively passive idioms among the adjectival,
base derived passives (cf. Wasow 1977).
4 .Zapomniec can take an accusative NP for an object but the meaning is different in
such a case:
zapomniec list forget a letter
zapomniec liicie - forget about a letter
The passive in 22a can have either of the two meanings.
5. The sentence is grammatical but with a reading different from that of 22b. It corre-
sponds rather to
zapomniatem twojq zaslugq
which means "I forgot what your merit was". The question has no ambiguity of the
passive in 22a, mentioned in note 4.
6. Cf. Fodor 1978 for a different formulation and an extensive discussion.
7. For numerous native speakers of Polish, the example in (36) sounds ill-formed as well.
As I see it, the reason for it is that the extraction from within NPs, although possible,
is clearly a "marked" process in the sense of Kster 1978. Consequently, the struc-
tures created by the rules of this sort are very unstable. Any increase in complexity
leads to processing difficulties and lowers their acceptability. It should not surprise
thus that (36), which involves two such movements of which one crosses a clause
boundary, sounds awkward. The point is that it is better than corresponding (37).
8. Of course, such a conclusion crucially depends on the assumption of the correctness
of trace theory. Notice that our analysis not only does not undermine trace theory
but indirectly supports it. The principles of the theory do not fail when applied to
Polish but are simply inapplicable in a predictable way.
9. Cf. Zabrocki (in preparation) for discussion.
References
Chomsky, N.
1977 "On wA-movement",Formal syntax, edited by P. Culicover et al. (New York:
Academic Press), 7 1 - 1 3 2 .
1979 "Lectures on core grammar and markedness", unpublished manuscript.
1980 Rules and Representations. Oxford: Basil Blackwell.
Chomsky, . - H. Lasnik
1979 "Filters and control",Linguistic inquiry 7 : 4 2 5 - 5 0 4 .
Culicover, P. - T. Wasow - A. Akmajian, eds
1977 Formal syntax (New York: Academic Press).
1978 "Parsing strategies and constraints on transformations", Linguistic inquiry
9:427-493.
444 Tadeusz Zabrocki
Horn, G. .
1978 lexical interpretive approach to some problems in syntax (Bloomington:
Indiana University Linguistics Club).
Koster, J.
1978 "Conditions, empty nodes and markedness", Linguistic inquiry 9:551 - 5 9 3 .
Wasow, T.
1977 "Transformations and the lexicon",Formal syntax, edited by P. Culicover et
al. (New York: Academic Press), 3 2 7 - 3 6 0 .
Zabrocki, T.
1981 Lexical rules of semantic interpretation; Control and NP movement in Eng-
lish and Polish (Poznan: A. Mickiewicz University),
in preparation "On vWi-movement in Polish".
Index of names
Davis, S. . 3 8 3 , 4 0 4 Foley, W. . 1 2 0 , 1 4 1
Davison, A. 280 Ford, . 17
Davy, D. 70, 103 Foreman, D. 82, 104
Dezs<5, L. 1 1 0 , 1 1 5 Foss, D . J . 3 8 6 , 4 0 4 , 4 0 5
van Dijk, T. 6 6 , 4 2 1 , 4 2 9 Foulke, E. 300
Dimitrijevic, N. 327, 328 Franck, D. 119
Dingwall, W. 3 4 9 , 3 6 3 Frser, B. 356, 358, 359, 361, 362, 363
Di Pietro, R. 320, 327, 328, 349, 363 Frazier, L. 63, 66
Dirven, R. 3 2 3 , 3 2 5 , 3 2 8 Freedle, R . O . 4 7 , 6 6
Dittmar, N. 3 9 4 , 4 0 4 Freihoff, R. 3 9 7 , 4 0 5
Downing, P. 1 2 1 , 1 4 1 Fried, V. 4 2 9
Dressler, W. U. 1 7 4 , 1 8 4 , 1 8 6 Friedman, . P. 4 0 4
Dulay, H. 3 9 6 , 3 9 9 , 4 0 4 , 4 1 3 , 4 1 6 Fujimura, O. 1 8 5 , 3 7 6
Duncan, S. 47, 66, 103, 104
Duskova, L. 1 6 7 , 1 6 8 , 3 0 6 , 3 1 1 Gass, S. 4 1 3 , 4 1 4 , 4 1 6
Dyta, S. 1 9 - 4 4 Gazdar, G. 363
Gee, J . P . 1 6 7 , 1 6 8
Ebneter, R. 320, 328 Geis, M. L. 279, 280
Eckman, F . R . 3 3 3 , 3 4 6 , 3 4 7 , 4 1 5 , 4 1 6 Gerver, D. 284, 300
E d m o n d s o n , W. 69, 70, 71, 75, 103, 104, Gingras, R. C. 3 9 5 , 4 0 5 , 4 0 7
356, 363 Givon, T. 47, 66
v a n E k , J. 3 9 7 , 4 0 4 Ghring, H. 3 2 6 , 3 2 7 , 3 2 9
Eliasson, S. 172, 173, 174, 185, 375, 376, Goldman-Eisler, F. 284, 300
377,406 Goody, . . 4 7 , 6 6
Ellis, A. W. 3 8 8 , 3 9 1 , 4 0 4 Goyvaerts, D. L. 1 7 4 , 1 8 5
Emonds, J . E . 2 5 1 , 2 5 4 , 2 5 5 , 2 5 8 , 2 6 3 , Graczyk, I. 309, 311
265, 277, 279, 280 Green, G. 1 5 6 , 1 6 9
Engels, L.K. 3 1 7 , 3 2 5 , 3 2 9 Greenbaum, S. 66, 281
Enkvist, . . 4 5 - 6 7 , 291, 3 0 0 , 4 2 2 , 428, Greenberg, J. H. 5 1 , 6 6
429 Gregersen, . 168
E r d m a n n , P. 1 5 0 , 1 6 7 , 1 6 8 Grice, . P. 4 2 0 , 4 2 9
Esser, J. 1 1 6 , 3 2 0 , 3 2 9 Grimes, J. 66
Grimshaw, J. 1 , 2 , 4 , 5 , 6 , 8 , 9 , 1 2 , 1 3 ,
Faerch, C. 6 9 - 1 0 5 , 3 2 7 , 3 2 9 , 3 9 4 , 4 0 1 , 14, 16, 17, 18
404 Grzegorek, M. 1 4 3 - 6 9 , 2 5 0 , 3 0 2 , 3 1 2
Farkas, D. 17, 18, 168 Gussmann, E. 1 7 1 - 1 8 6 , 3 6 6 , 3 7 1 , 3 7 5 ,
Felix, S.W. 4 0 1 , 4 0 4 376
Ferguson, C. A. 3 9 4 , 4 0 7 Gutknecht, C. 330
Filipovic, R. 1 0 7 - 1 1 7 , 3 2 0 , 3 2 3 , 3 2 5 ,
329, 330 Haarmann, . 203
Fillmore, C. 103, 104, 119 41, 309, 311, Haberland, . 3 5 0 , 3 6 3 , 4 0 4
429 Hkkinen, . 405
Fillmore, L. W. 119, 394, 404 Hakuta, . 3 9 5 , 4 0 5
Finch, A. 3 9 2 , 4 0 6 Halle, . 174, 175, 176, 177, 184, 249,
Findler, N . V . 4 7 , 6 6 373,376
Fink, S. 320, 329 Halliday, . . . 4 6 , 6 6 , 119, 123, 141,
Finocchiaro, M. 4 0 4 144, 169, 302, 303, 304, 311, 326,
Firbas, J. 5 3 , 1 6 8 , 4 2 3 , 4 2 4 , 4 2 5 , 4 2 6 329,426
Fischer-J0rgensen, E. 1 7 3 , 1 7 4 , 1 8 5 Hammerberg, . 66
Fisiak, J. 110, 116, 173, 175, 1 8 5 , 2 5 0 , Hankamer, J. 18
314, 329, 349, 350, 363, 375, 376, Harman, G. 3 6 4 , 4 2 9
379,404 Harnish, R . M . 3 5 1 , 3 6 3
Fiske, D.W. 4 7 , 6 6 Hartmann, R. R. . 3 1 9 , 3 2 4 , 3 2 9
Flores d'Arcais, G. B. 383, 384, 404, 406 Harwood, D . A . 405
Index of names 447