Bibliography

Cambridge International Dictionary of Idioms. Cambridge University Press,
1998.

Collins Cobuild Dictionary of Idioms. Harper Collins, 2002.

Anne Abeillé. The Flexibility of French Idioms: a representation with
lexicalized tree adjoining grammar. In Martin Everaert, Erik-Jan van der
Linden, Andre Schenk, and Rob Schreuder, editors, Idioms: Structural
& Psychological Perspectives. Lawrence Erlbaum Associates, New Jersey,
1995.

Alan Agresti. Categorical Data Analysis. John Wiley and Sons, New York,
2002.

Mark Arehart. Identifying multiword tokens using POS tagging and bigram
statistics, June 2003. Handout given at ACH/ALLC 2003, University of
Georgia, Athens, Georgia.

T. Baldwin, C. Bannard, T. Tanaka, and D. Widdows. An Empirical Model
of Multiword Expressions Decomposability. In Proc. of the ACL-2003
Workshop on Multiword Expressions: Analysis, Acquisition and Treatment,
pages 89–96, Sapporo, Japan, 2003.

T. Baldwin, J. Beavers, L. van der Beek, F. Bond, D. Flickinger, and I.A. Sag.
In search of a systematic treatment of Determinerless PPs. Computational
Linguistics Dimensions of Syntax and Semantics of Prepositions. Kluwer
Academic, to appear.

Timothy Baldwin and Aline Villavicencio. Extracting the unextractable: A
case study on verb-particles. In Proceedings of CoNLL-2002, pages 98–104.
Taipei, Taiwan, 2002.

S. Banerjee and T. Pedersen. The design, implementation, and use of the
Ngram Statistics Package. In Proceedings of the Fourth International
Conference on Intelligent Text Processing and Computational Linguistics,
Mexico City, 2003.

M. Benson, E. Benson, and R. Ilson. The BBI combinatory dictionary of
English: A guide to word combinations. John Benjamins, Philadelphia,
1986.

J. Berghmans. Wotan, een automatische grammatikale tagger voor het
Nederlands, 1994. Master’s thesis, Dept. of Language and Speech.

Don Blaheta and Mark Johnson. Unsupervised learning of multi-word verbs.
In 39th Annual Meeting and 10th Conference of the European Chapter
of the Association for Computational Linguistics (ACL39), pages 54–60,
Toulouse, France, 2001. CNRS.

Gosse Bouma and Geert Kloosterman. Querying dependency treebanks in
XML. In Proceedings of the 3rd International Conference on Language
Resources and Evaluation (LREC 2002), volume V, pages 1686–1691, Las
Palmas de Gran Canaria, Spain, 2002.

Gosse Bouma, Gertjan van Noord, and Robert Malouf. Alpino: Wide-coverage
computational analysis of Dutch. In Computational Linguistics
in The Netherlands 2000. 2001.

Gosse Bouma and Begoña Villada. Corpus-based acquisition of collocational
prepositional phrases. In M. Theune, A. Nijholt, and H. Hondorp, editors,
Computational Linguistics in the Netherlands 2001. Selected Papers from
the Twelfth CLIN Meeting, pages 23–37, Amsterdam, 2002. Rodopi.

Elisabeth Breidt, Frederique Segond, and Giuseppe Valetto. Local grammars
for the description of multi-word lexemes and their automatic
recognition in texts. In COMPLEX96, Budapest, 1996.

Ted Briscoe and John Carroll. Automatic extraction of subcategorization
from corpora. In Proceedings of the 5th ACL Conference on Applied Natural
Language Processing, pages 356–363, Washington, D.C., 1997.

Kenneth Ward Church and Patrick Hanks. Word association norms, mutual
information & lexicography. Computational Linguistics, 16(1):22–29, 1990.

Steffan Corley, Martin Corley, Frank Keller, Matthew W. Crocker, and Shari
Trewin. Finding syntactic structure in unparsed corpora. Computers and
the Humanities, 35(2):81–94, 2001.

Beatrice Daille. Study and implementation of combined techniques for
automatic extraction of terminology. In P. Resnik and J. Klavans, editors,
The Balancing Act: Combining Symbolic and Statistical Approaches to
Language, pages 49–66. MIT Press, 1996.

Fred J. Damerau. Generating and evaluating domain-oriented multi-word
terms from texts. Information Processing and Management, 29(4):433–447,
1993.

H. De Groot, editor. Van Dale Idioom Woordenboek: Verklaring en herkomst
van uitdrukkingen en gezegden. Van Dale Lexicografie, Utrecht, Antwerp,
1999.

A. M. Di Sciullo and E. Williams. On the definition of Word. MIT Press,
Cambridge, MA, 1987.

G. Dias, J.G. Pereira Lopes, and S. Guilloré. Mutual expectation: a measure
for multiword lexical unit extraction. In Proceedings of VEXTAL’99,
Venezia, 1999.

Ricarda Dormeyer, Ingrid Fischer, and Martina Keil. A database for verbal
idioms. In Thierry Fontenelle, Philippe Hiligsmann, Archibald Michiels,
Andre Moulin, and Siegfried Theissen, editors, EURALEX’98 Proceedings,
pages 99–109, Belgium, 1998. Université de Liège.

Erwin W. Drenth. Using a hybrid approach towards Dutch part-of-speech
tagging. Master’s thesis, Alfa-Informatica, University of Groningen, 1997.

Ted Dunning. Accurate methods for the statistics of surprise and coincidence.
Computational Linguistics, 19(1):61–74, 1993.

Gregor Erbach. Head-driven lexical representation of idioms in HPSG.
In Martin Everaert, Erik-Jan van der Linden, Andre Schenk, and Rob
Schreuder, editors, Proceedings of Idioms. International conference on
Idioms, volume 1, pages 11–24. Tilburg, The Netherlands, 1992.

Thomas Ernst. Grist for the Linguistic Mill: Idioms and “Extra” Adjectives.
Journal of Linguistic Research, I(3):51–68, 1982.

Martin Everaert. Vaste verbindingen (in woordenboeken). Spektator, 3:3–27,
1993.

Martin Everaert and Koenraad Kuiper. Theory and data in idiom research.
In L. McNair, K. Singer, L. M. Dobrin, and M. Aucoin, editors, Papers
from the Parasession on Theory and Data in Linguistics, volume CLS 32,
pages 43–58. Chicago Linguistic Society, 1996.

Stefan Evert. Mathematical properties of association measures. Special topic
session given at Workshop ‘Computational Approaches to Collocations’,
Vienna, July 2002.

Stefan Evert. The Statistics of Word Cooccurrences: Word Pairs and
Collocations. PhD thesis, University of Stuttgart, 2004.

Stefan Evert and Hannah Kermes. Experiments on candidate data for
collocation extraction. In Proceedings of the 10th Conference of the European
Chapter of the Association for Computational Linguistics. EACL 2003,
pages 83–87, Budapest, Hungary, 2003.

Stefan Evert and Brigitte Krenn. Methods for the qualitative evaluation of
lexical association measures. In Proceedings of the 39th Annual Meeting of
the Association for Computational Linguistics, Toulouse, France, 2001.

Christiane Fellbaum. The determiner in English Idioms. In C. Cacciari and
P. Tabossi, editors, IDIOMS: Processing, Structure, and Interpretation,
chapter 12, pages 271–295. Lawrence Erlbaum Associates, New Jersey,
1993.

Chitra Fernando and Roger Flavell. On idiom. Critical views and
perspectives, volume 5 of Exeter Linguistic Studies. University of Exeter, 1981.

John Rupert Firth. A synopsis of linguistic theory, 1930–55. Studies in
linguistic analysis, pages 1–32, 1957. Special volume of the Philological
Society.

Thierry Fontenelle. Collocation acquisition from a corpus or from a
dictionary: a comparison. In Proceedings of 5th EURALEX International
Congress, volume I, pages 221–228, Tampere, Finland, 1992.

Gerald Gazdar, Ewan Klein, Geoffrey Pullum, and Ivan Sag. Generalized
Phrase Structure Grammar. Basil Blackwell, London, England, 1985.

G. Geerts and H. Heestermans, editors. Van Dale Groot Woordenboek der
Nederlandse Taal. Van Dale Lexicografie, Utrecht, Antwerp, 1992.

W. Haeseryn, K. Romijn, G. Geerts, J. de Rooij, and M.C. van den Toorn.
Algemene Nederlandse Spraakkunst. Wolters-Noordhoff, Groningen, 1997.

Jack Hoeksema. Nonverbal Material in the Dutch Verb Cluster: a Word
Order Pattern in Decline. Talk given at TABU-dag, June 2004.

Bart Hollebrandse. Dutch light verb constructions. Master’s thesis, Tilburg
University, the Netherlands, 1993.

T.M.V. Janssen. Compositionality. In J. van Benthem and A. ter Meulen,
editors, Handbook of Logic and Language. Elsevier, Amsterdam, 1997.

H-J. Kaalep and K. Muischnek. Inconsistent selectional criteria in
semi-automatic multi-word unit extraction. In F. Kiefer and J. Pajzs, editors,
COMPLEX 2003, 7th Conference on Computational Lexicography and
Corpus Research, pages 27–36, Budapest, 2003. Research Institute for
Linguistics, Hungarian Academy of Sciences.

Jerrold J. Katz. Compositionality, idiomaticity & lexical substitution. In
Stephen R. Anderson and Paul Kiparsky, editors, A Festschrift for Morris
Halle, pages 357–376. Holt, Rinehart and Winston, Inc., 1973.

Frank Keller and Mirella Lapata. Using the Web to obtain frequencies for
unseen bigrams. Computational Linguistics, 29(3):459–484, 2003.

Hannah Kermes and Ulrich Heid. Using chunked corpora for the acquisition
of collocations and idiomatic expressions. In F. Kiefer and J. Pajzs,
editors, COMPLEX 2003, 7th Conference on Computational Lexicography
and Corpus Research, Budapest, 2003. Research Institute for Linguistics,
Hungarian Academy of Sciences.

Adam Kilgarriff and David Tugwell. Word sketch: Extraction & display of
significant collocations for lexicography. In Proceedings of the 39th ACL &
10th EACL workshop ‘Collocation: Computational Extraction, Analysis
and Exploitation’, pages 32–38, Toulouse, 2001.

Balázs Kis, Begoña Villada, Gosse Bouma, Gábor Ugray, Tamás Bíró, Gábor
Pohl, and John Nerbonne. A New Approach to the Corpus-based Statistical
Investigation of Hungarian Multi-word Lexemes. In Proceedings of 4th
International Conference on Language Resources and Evaluation (LREC)
2004, volume V, pages 1677–1681, Lisbon, Portugal, 2004. European
Language Resources Association.

Brigitte Krenn. Empirical implications on lexical association measures. In
Proceedings of the Ninth EURALEX International Congress, Stuttgart,
Germany, 2000a.

Brigitte Krenn. The Usual Suspects: Data Oriented Models for the
Identification and Representation of Lexical Collocations. PhD thesis, DFKI &
Universität des Saarlandes, 2000b.

Brigitte Krenn and Gregor Erbach. Idioms and support verb constructions.
In John Nerbonne, Klaus Netter, and Carl Pollard, editors, German in
Head-Driven Phrase Structure Grammar, pages 365–395. CSLI, 1994.

Brigitte Krenn and Stefan Evert. Can we do better than frequency? A
case study on extracting PP-verb collocations. In Proceedings of the ACL
workshop on Collocations, Toulouse, 2001.

Jonas Kuhn. The treatment of support verb constructions in HPSG-based
Machine Translation - a summary, 1994. URL
citeseer.nj.nec.com/kuhn94treatment.html.

Dekang Lin. Extracting collocations from text corpora. In First workshop
on computational terminology, Montreal, Canada, 1998.

Dekang Lin. Automatic identification of non-compositional phrases. In
Proceedings of ACL-99, pages 317–324. University of Maryland, 1999.

Leonardus Johannes Maria Loonen. Stante pede gaande van dichtbij langs
AF bestemming @. PhD thesis, Universiteit Utrecht, June 2003.

A. Manaster Ramer and W. Zadrozny. Open idioms and constructions.
In Martin Everaert, Erik-Jan van der Linden, Andre Schenk, and Rob
Schreuder, editors, Proceedings of Idioms. International conference on
Idioms, volume 1, pages 65–75. Tilburg, The Netherlands, 1992.

Christopher D. Manning and Hinrich Schütze. Foundations of Statistical
Natural Language Processing. The MIT Press, Cambridge, Massachusetts,
1999.

Diana McCarthy, Bill Keller, and John Carroll. Detecting a Continuum of
Compositionality in Phrasal Verbs. In Proc. of the ACL-2003 Workshop
on Multiword Expressions: Analysis, Acquisition and Treatment, Sapporo,
Japan, 2003.

Rosamund Moon. Fixed expressions and Idioms in English. A corpus-based
approach. Clarendon Press, Oxford, 1998.

Michael Moortgat, Ineke Schuurman, and Ton van der Wouden. CGN
syntactische annotatie, March 2001. Internal report Corpus Gesproken
Nederlands.

Tim Nicolas. Semantics of idiom modification. In Martin Everaert, Erik-Jan
van der Linden, Andre Schenk, and Rob Schreuder, editors, Idioms:
Structural & Psychological Perspectives. Lawrence Erlbaum Associates, New
Jersey, 1995.

Geoffrey Nunberg, Ivan A. Sag, and Thomas Wasow. Idioms. Language,
70(3):491–539, 1994.

Michael P. Oakes. Statistics for Corpus Linguistics. Edinburgh Textbooks
in Empirical Linguistics. Edinburgh University Press, 1998.

R.J.F. Ordelman. Twente Nieuws Corpus (TwNC), August 2002. Parlevink
Language Technology Group. University of Twente.

P.C. Paardekooper. Voorzetsel-uitdrukkingen. Nieuwe Taalgids, 55:3–9,
1962.

P.C. Paardekooper. Grensproblemen bij v-z-uitdrukkingen. Nieuwe Taalgids,
66:137–145, 1973.

Darren Pearce. A comparative evaluation of collocation extraction
techniques. In Proceedings of the Third International Conference on Language
Resources and Evaluation, 2002.

Ted Pedersen. Fishing for exactness. In Proceedings of the South Central
SAS User’s Group (SCSUG-96) Conference, Austin, TX, 1996.

Ted Pedersen, M. Kayaalp, and R. Bruce. Significant lexical relationships.
In Proceedings of the 13th National Conference on Artificial Intelligence,
Portland, OR, 1996.

Carl Pollard and Ivan A. Sag. Head-Driven Phrase Structure Grammar. The
University of Chicago Press, CSLI: Stanford, 1994.

Robbert Prins and Gertjan van Noord. Reinforcing parser preferences
through tagging. Traitement Automatique des Langues, 2004. Accepted
for Special Issue on Evolutions of Parsing.

Eric Reuland. Relating morphological and syntactic structure. In Martin
Everaert, A. Evers, R. Huybregts, and M. Trommelen, editors, Morphology
and Modularity. In honour of Henk Schultink, pages 303–338. Foris
Publications, Dordrecht, 1988.

F. Richter and M. Sailer. Cranberry Words in Formal Grammar. In Claire
Beyssade, Olivier Bonami, Patricia Cabredo Hofherr, and Francis Corblin,
editors, Empirical Issues in Formal Syntax and Semantics, volume 4, pages
155–171. Presses de l’Université de Paris-Sorbonne, 2003.

Susanne Riehemann. A constructional approach to idioms and word
formation. PhD thesis, Stanford University, 2001.

Ivan Sag, T. Baldwin, F. Bond, A. Copestake, and D. Flickinger. Multiword
expressions: a pain in the neck for NLP. LinGO Working Paper
No. 2001-03, 2001.

Manfred Sailer. Combinatorial Semantics & Idiomatic Expressions in
Head-Driven Phrase Structure Grammar. PhD thesis, University of Tuebingen,
2000.

Manfred Sailer and Frank Richter. Not for love or money: Collocations! In
G. Jäger, P. Monachesi, G. Penn, and S. Wintner, editors, Proceedings of
Formal Grammar 2002, 2002.

André Schenk. The syntactic behavior of idioms. In Martin Everaert,
Erik-Jan van der Linden, Andre Schenk, and Rob Schreuder, editors,
Proceedings of Idioms. International conference on Idioms, volume 1,
pages 97–110. Tilburg, The Netherlands, 1992.

Andre Schenk. Idioms & collocations in compositional grammars. PhD thesis,
Universiteit Utrecht, 1994.

P. Schone and D. Jurafsky. Is knowledge-free induction of multiword unit
dictionary headwords a solved problem? In Proc. of the 2001 Conference on
Empirical Methods in Natural Language Processing, pages 100–108, Hong
Kong, 2001.

Sabine Schulte im Walde. A collocation database for German verbs and
nouns. In F. Kiefer and J. Pajzs, editors, COMPLEX 2003, 7th Conference
on Computational Lexicography and Corpus Research, Budapest,
2003. Research Institute for Linguistics, Hungarian Academy of Sciences.

J.McH. Sinclair. Beginning the study of lexis. In C.E. Bazell, J.C. Catford,
M.A.K. Halliday, and R.H. Robins, editors, In memory of J.R. Firth, pages
410–430. Longmans, 1966.

Frank Smadja. Retrieving collocations from text: Xtract. Computational
Linguistics, 19(1):143–177, 1993.

Kristina Spranger. Beyond subcategorization acquisition. Multi-parameter
extraction from German text corpora. In Proceedings of the 11th
EURALEX International Congress, volume I, pages 171–177, Lorient,
France, 2004.

Gregor Thurmair. Multilingual content processing. In Proceedings of 4th
International Conference on Language Resources and Evaluation (LREC)
2004, Lisbon, Portugal, 2004. ELRA.

Beata Trawinski. Licensing Complex Prepositions via Lexical Constraints.
In Proceedings of the ACL-2003 Workshop on Multiword Expressions:
Analysis, Acquisition and Treatment, pages 97–104, Sapporo, Japan, 2003.

Van Dale. Groot Woordenboek Nederlands-Engels, 1986.

L. van der Beek, G. Bouma, R. Malouf, and G. van Noord. The Alpino
Dependency Treebank. In Computational Linguistics in The Netherlands
CLIN 2001, 2002a.

Leonoor van der Beek, Gosse Bouma, Jan Daciuk, Tanja Gaustad, Robert
Malouf, Gertjan van Noord, Robbert Prins, and Begoña Villada.
Algorithms for Linguistic Processing NWO PIONIER Progress Report.
Available electronically at http://odur.let.rug.nl/~vannoord/alp.,
Groningen, 2002b.

Leonoor van der Beek, Gosse Bouma, and Gertjan van Noord. Een brede
computationele grammatica voor het Nederlands. Nederlandse Taalkunde,
2002c.

Ton van der Wouden. Negative contexts. PhD thesis, University of Groningen,
The Netherlands, 1994.

Hans van Halteren, Jakub Zavrel, and Walter Daelemans. Improving
accuracy in word class tagging through the combination of machine learning
systems. Computational Linguistics, 27(2):199–230, 2001.

Linda Verstraten. Idioms in Dutch Dictionaries. In Martin Everaert and
Erik-Jan van der Linden, editors, Proceedings of the First Workshop on
Idioms, pages 171–186. Tilburg, The Netherlands, 1989.

M. Begoña Villada Moirón. Discarding noise in an automatically acquired
lexicon of support verb constructions. In Proceedings of 4th International
Conference on Language Resources and Evaluation (LREC) 2004,
volume V, pages 1859–1862, Lisbon, Portugal, 2004a. European Language
Resources Association.

M. Begoña Villada Moirón. Distinguishing prepositional complements from
fixed arguments. In Proceedings of the 11th EURALEX International
Congress, volume III, pages 935–942, Lorient, France, 2004b.

M. Begoña Villada Moirón and Gosse Bouma. A corpus-based approach to
the acquisition of collocational prepositional phrases. In Proceedings of the
10th EURALEX International Congress, volume I, pages 153–158,
Copenhagen, Denmark, 2002. Center for Sprogteknologi.

Martin Volk. Combining supervised and unsupervised methods for PP
attachment disambiguation. In Proceedings of COLING 2002, Taipei, 2002.

Thomas Wasow, Ivan A. Sag, and Geoffrey Nunberg. Idioms: An interim
report. In S. Hattori and I. Kazuko, editors, Proceedings of the XIIIth
international congress of linguists, pages 102–115, Tokyo, 1982.

M. Weeber, R. Vos, and Harald Baayen. Extracting the lowest-frequency
words: Pitfalls and possibilities. Computational Linguistics,
26(3):301–317, 2000.

I.H. Witten, A. Moffat, and T. C. Bell. Managing Gigabytes: Compressing
and Indexing Documents and Images. Morgan Kaufmann Publishers, 1999.

Wlodek Zadrozny. On compositional semantics. In Proceedings of COLING
1992, Nantes, France, 1992.

Heike Zinsmeister and Ulrich Heid. Collocations of complex words:
Implications for the acquisition with a stochastic grammar, July 2002. Talk given
at the Workshop on ‘Computational approaches to Collocations’. Vienna.

Heike Zinsmeister and Ulrich Heid. Significant triples: Adjective + noun
+ verb combinations. In F. Kiefer and J. Pajzs, editors, Proceedings of
COMPLEX 2003. 7th Conference on Computational Lexicography and Text
Research, pages 92–101, Budapest, 2003. Hungarian Academy of Sciences.
