Sie sind auf Seite 1von 15

RANGEandFREQUENCY

ProgramsforWindowsbasedPCs

20 Nov 2002

RunningRANGE
FollowthesestepstorunRANGE.Roughlythesamestepscanbefollowedtorun
FREQUENCY.
1

SavethetextortextsthatyouwanttoruntheprogramonasASCIIorText
(.txt)files.Therearesomepracticeonesaccompanyingtheprogramcalled
INDO1.TXT,INDO2.TXTetc.

DoubleclickontheRANGEiconinWindowsexplorer

OpentheFilemenuinRANGEandchoosetheheadingOpen.

Selectthefileorfilesyouwanttoruntheprogramover.Rememberthese
mustallbetextfiles.Youwillhavetogototheappropriatedirectorytofind
thefilesandyoumayneedtochangetheentryintheboxFilesoftypetoText
filesorAllfiles.

Afteryouhaveselectedthefiles,gototheFilemenuagainandchooseSave.
Typethenameofthefilethatyouwanttosavetheresultsto.

LookatthelistofoptionsatthebottomoftheRANGEwindow.Youcan
changetheseoptionsorleavetheoptionsastheyare.

ClickthebuttonProcessFileswhichisbelowthefilelistintheRANGE
window.

LookattheresultsfileusingawordprocessorlikeMSWord.Theresultsfilewill
bethenameyouchoseplus_range.txt,forexampleresults_range.txt.Thetable
andlistslookbetteriftheCOURIER8or9pointfontisused.

Ifthingsgowrong,readtheinstructionsespeciallythenotesonTroubleshootingnearthe
endoftheseinstructions.
RANGE,andFREQUENCYareavailableathttp://www.vuw.ac.nz/lals/staff/Paul_Nation.
ItispossibletorunRANGEwithoutthebasewordlists.Thereissomesampleoutputfrom
RANGEattheendoftheseinstructions.
RANGE

RANGEisusedtocomparethevocabularyofupto32differenttextsatthesametime.For
eachwordinthetexts,itprovidesarangeordistributionfigure(howmanytextstheword
occursin),aheadwordfrequencyfigure(thetotalnumberoftimestheactualheadwordtype
appearsinallthetexts),afamilyfrequencyfigure(thetotalnumberoftimesthewordandits
familymembersoccurinallthetexts),andafrequencyfigureforeachofthetextstheword
occursin.Itcanbeusedtofindthecoverageofatextbycertainwordlists,createwordlists
basedonfrequencyandrange,andtodiscoversharedanduniquevocabularyinseveral
piecesofwriting.
Thesampleinputandoutputattheendofthissetofinstructionsshowsatypicalresultontwo
shorttexts.
RANGEcanbeusedtocompareatextagainstvocabularyliststoseewhatwordsinthetext
areandarenotinthelists,andtoseewhatpercentageoftheitemsinthetextarecoveredby
thelists.Itcanalsobeusedtocomparethevocabularyoftwotextstoseehowmuchofthe
samevocabularytheyuseandwheretheirvocabularydiffers.
Itisusefulforexampleforseeingwhatlowfrequencywordsareinanexamquestionpaper,a
technicalinformationnoteoratextaimedatforeignreaders.Itmayalsobeusedtocheckthe
vocabularyofsimplifiedreadingtextsorlanguagecoursebookstoseehowmanyofthe
wordsinthetextsareamongthehighfrequencywordsofEnglish.Itmayalsobeusedtosee
howmuchlearningthevocabularyofonetexthelpswithdealingwiththewordsinadifferent
text.
Incombinationwiththethreebaseliststhatareavailablewithit,ithasbeenusedtoanswer
thefollowingquestions.
Whatcommonvocabularyisfoundinallthesetexts?
Howlargeavocabularyisneededtoreadthistext?
Ifalearnerhasavocabularyof2,000words,howmuchofthevocabularyinthetext
willbefamiliartothelearner?
Whatarethewordsinthetextwhichthelearnerisnotlikelytoknow?
Howwelldoesthecoursebookpreparelearnersforthevocabularyinnewspapers?
Howrichavocabularydosecondlanguagelearnersuseintheirfreewriting?
Seetheapplicationssectionoftheseinstructionsforresearchcompletedusingthisprogram.
RANGEprovidesatablewhichshowshowmuchcoverageofatexteachofthethreebase
listsprovides.

WORDLIST

TOKENS/%

TYPES/%

FAMILIES

one
two
three
notinthelists

54/72.0
2/2.7
14/18.7
5/6.7

34/69.4
2/4.1
9/18.4
4/8.2

33
2
9
?????

Total

75

49

44

Thisshowsthat54oftherunningwordsinthetextareinbaselistoneandthese54words
makeup72%ofthetotalrunningwordsinthetext.Inthewordlistcolumn,one,two,three
refertoeachofthebaselists.
WhatisneededtorunRANGE?
ThisprogramisdesignedforPCs.Toruntheprogramyouneed
1
2
3

theprogramRange.exe,
thethreebasewordlists(BASEWRD1.txt,BASEWRD2.txt,BASEWRD3.txt),
textfilesinASCII(DOS)format.

Hereisanexampleofwhatshouldbeinthedirectory.
range.exe
basewrd1.txt
basewrd2.txt
basewrd3.txt
indo1.txt
indo2.txt
indo3.txt
indo4.txt
indo5.txt
indo6.txt
range.txt
function.txt

(theprogram)
(thebasewordlists)
(thefilestobeprocessed)

(ifyouneedtouselettersnotinEnglish)
(ifyouwantRANGEtonotcountfunctionwords)

RANGEcanrunwithupto10basewordlists.Theymustbecalledbasewrd1.txt,
basewrd2.txtandsoon.
FREQUENCY
FREQUENCYisanotherprogramthatrunsonanASCIItexttomakeafrequencylistofall
thewordsinasingletext.Itcanonlyrunonetextatatime.Theoutputisanalphabeticallist,
orafrequencyorderedlist.Itgivestherankorderofthewords,theirrawfrequencyandthe
cumulativepercentagefrequency.HereissomesampleoutputfromFREQUENCY.
3

Word Type
THE
OF
A
IN
TO
GROUP

Rank
1
2
3
4
5
6

Frequency
271
134
108
101
98
88

Cumulative Percent
7.55
11.28
14.29
17.10
19.83
22.28

Intheexample,thewordtypeaisthethirdmostfrequentword.Itoccurs108timesinthe
text,andalongwiththeandofcovers14.29%ofthetext.Onitsownitcovers3.01%(14.29
minus11.28)ofthetext.Seethebeginningofthissetofinstructionstoseehowtorun
FREQUENCY.
RunningRANGE
Followtheinstructionsatthebeginningofthisdocument.Youcantypeanynameforthe
output,suchasresults.Theoptionsavailableinclude
1

choosingtousenone,anyorallofthreebasewordlists,orupto10ofyourownlists,

sortingtheoutputbyfrequencyofoccurrence,thenumberofdifferenttextstheword
occurredin(range),oralphabetically,

listingothersi.e.wordsthatdidnotoccurinanyofthebasewordlists,

providingtherangeandfrequencynumbers(choosingnottoshowtheseisusefulif
youwanttousetheoutputtomakeotherbasewordlists)

havingalistofwordtypesaswellaswordfamiliesbytickingtheFormsbox

recordingtheoccurrencesofwordsinthebasewordliststhemselvesbyusingthe
UpdateBaseWordsandifnecessarytheZeroBaseWordsoptions

havingthewordsineachtextmarkedaccordingtowhatbasewordlisttheyoccurred
in.

RANGEisaverypowerfulprogramandcanprocessseveralverylargetextsatonceofovera
millionrunningwordseach.
Markingtheinputtexts
IftheMarktextsoptionischosen,eachwordintheinputtextsismarkedaccordingtowhat
basewordlistitoccurredin.Foreachtext,thisisrecordedinaseparatefilethathasthename
oftheinputfileandthesuffix.mrk,forexampleIndo1.mrk
Hereisanexampleofpartofamarkedtext.
4

Unmarked words are in Basewrd1.txt


Words marked with <2> are in Basewrd2.txt
Words marked with <3> are in Basewrd3.txt
Words marked with <!> are not in any of the lists
Group Work and Language Learning
Like all learning activities, group work is more likely to go well if it is properly
planned. Planning requires<3> an understanding of the principle<3> which lies behind
successful group work
The principle<3> of group work
Several factors<3> work together to result in group work where everyone involved<3>
is interested, active and thoughtful. If these factors<3> agree with each other then
group work is likely to be successful. If they are not in agreement, group work is likely
to be unsuccessful. The five factors<3> are the learning goals<3> of group work, the
task<3>, the way information<2> is distributed<3>, the seating arrangement<2> of the
members of the group, and the social relationships between the members of the group.
Thewordsnotmarked,forexampleLikealllearningareinbasewordlist1(thefirst1000
words).Thewordswith<2>afterthem,forexampleinformation<2>,areinbasewordlist2
(thesecond1000words).Thewordsmarkedwith<3>areinbasewordlist3(theAcademic
WordList).Thewordsmarkedwith<!>arenotinanyofthelists.
Usingthebasewordlists
RANGEcanbeusedwithuptotenwordlists.Theseallowittoclassifysomeofthewords
intheinputfilesintowordfamilies.Theprogramwillgivedifferentfiguresdependingon
whetherthebasewordlistsareusedornot.Ifthebasewordlistsareused,thefigureswill
representamixtureoffamiliesandtypes.Allthewordsinthebasewordlistsarecountedas
familiesandtheremainderarecountedastypes.Ifthebasewordlistsarenotused,thenall
thewordsarecountedastypes,becauseitisthebasewordliststhatareusedtomake
families.Youcanuseanyofthebasewordlistssimplybycheckingintheappropriateboxat
thebottomoftheRANGEdialoguebox.
Ifyouwanttofindoutwhatwordsinthebasewordlistsdidnotoccurintheinputtextsthe
CopyBasewordsoptionintheFilemenuallowsyoutodothis.
ThewordlistsavailableforRANGE
Threereadymadebaselistsareavailable.Thefirst(BASEWRD1.txt)includesthemost
frequent1000wordsofEnglish.Thesecond(BASEWRD2.txt)includesthe2nd1000most
frequentwords,andthethird(BASEWRD3.txt)includeswordsnotinthefirst2000wordsof
Englishbutwhicharefrequentinuppersecondaryschoolanduniversitytextsfromawide
rangeofsubjects.Allofthesebaselistsincludethebaseformsofwordsandderivedforms.
Thefirst1000wordsthusconsistsofaround4000formsortypes.Thesourcesoftheselists
5

areAGeneralServiceListofEnglishWordsbyMichaelWest(Longman,London1953)for
thefirst2000words,andTheAcademicWordListbyCoxhead(1998,2000)containing570
wordfamilies.ThefirstthousandwordsofAGeneralServiceListofEnglishWordsare
usuallythoseinthelistwithafrequencyhigherthan332occurrencesper5millionwords,
plusmonths,daysoftheweek,numbers,titles(Mr,Mrs,Miss,Ms,Mister),andfrequent
greetings(Hello,Hietc).
ThelistsincludebothAmericanandBritishspellings.Apostrophesaretreatedasspaces,so
I'veiscountedastwoitems,asisJane's.
Thewordformsinthebaselistsaregroupedintowordfamiliesunderaheadword.For
example,theheadwordAIDhasthefollowingfamilymembersAIDED,AIDING,AIDS,and
UNAIDED.InthebaseliststhefamilymembershaveaTabinfrontofthem.Theheadword
occursjustbeforethefamilymembersandhasnoTab.Forinformationonwordfamiliessee
Bauer,L.andNation,I.S.P."Wordfamilies"InternationalJournalofLexicography6,3
(1993)127.
Stoplist
IfyouwantRANGEnottocountsomewordsandtoexcludethemfromalltotalsyoujust
needtomakealistofthesewords,liketheonebelow,andsaveitasatextfile.Youthen
needtoclicktheUsestoplistboxandthenchoosethefileyouwantasyourstoplist.Thefile
calledfunction.txtisalistofallthefunctionwordsofEnglishwhichcanbeusedasastop
file.
a
about
above
across
after
against
albeit
all
along
although
am
Runningseveralfilesoneaftertheother
Ifyouwanttoprocessseveralfilesoneaftertheotherandgetseparatedataforeachfile,you
needtoclicktheBatchFilesbox.ThenyouchoosethefilesbygoingtotheFilemenuand
choosingOpen.Youdonothavetochoosetheresultsfiles.Theprogramwillcreateresults
filesbyadding_range.txttothenameoftheinputfile,forexample,ifoneoftheinputfilesis
Dracula.txttheresultswillbeputinafilecalledDracula_range.txt.Thebatchfileoptionis
veryusefulforexampleifyouwanttomeasurethevocabularyprofilesofalargenumberof
studentcompositions.
6

Preparingyourownbaselists
Youdonotneedtousethebaseliststhatareprovidedwiththeprogram.If,forexample,you
wishtoexaminethevocabularyofgradedreadersortolookattheoverlapbetweentwo
texts,youcanturnoneofthetextsintoawordlistbyrunningtheprogramFREQUENCY,or
RANGE,editittomakewordfamilies,andgiveitthenameBASEWRD1.txt,sothatit
becomesabaselistthatRANGEwilluse.Youcanmaketwootherbaselistsnamed
BASEWRD2.txtandBASEWRD3.txtandsoon.Thesamewordshouldnotoccurmorethan
onceinthesamelistorindifferentlists.Theprogramwillgiveyouanerrormessagewhen
yourunitifyouputthesamewordinmorethanonce.
Inordertopreparethethreebaselists,youcaneithercreatenewwordlistsoradaptexisting
baselists.Typethewordsintothefileinthefollowingway.Theindentedfamilymembers
musthaveaTaborfivespacesinfrontofthem.Notethattheheadwordisconsideredasa
familymemberanddoesnothavetobetypedagain.
A

AN

ABLE

ABLER
ABLEST
ABLY
ABOUT
ABOVE
Afteryouhavetypedalist,saveitasanASCIIfile.InMSWord,useSaveasMSDOSText
withLayout(*.asc)ifyouhaveusedTabsorsaveitasatextonlyfile.Inbothcasesrename
thefileafterwardssothatitonlyendswith.txtandiscalledBasewrd1.txtorBasewrd2.txtor
Basewrd3.txtwithnoadditionalfileextensionsuchas.txtor.asc.Theprogramwillprompt
youifyoutrytorunafilewiththewrongfileextension,andwilloffertocorrectitforyou.If
youareworkingonyourbaselistsinWordPerfectforWindows,saveitasASCII(DOS)
GenericWordProcessorifyouareusingTabs.UsingtheLayoutorGenericoptionwill
preservetheTabswhichwouldotherwisebeconvertedtospaces,orsaveitasASCIIDOS
text.Thelistshouldlooklikethis.
A0
AN0
ABLE0
ABLER0
Theremustbeonespacebetweentheheadwordandthezero,andonespacebetweenthe
familymemberandthezero.Commentscanbeputinthebaseliststoremindyouoftheir
contentbytyping#infrontofeachlineofcomment.Forexample,
#Thisisthefirst1000wordsoftheGSL.
7

Ifyouwanttomakebasewordlistsfromtheexistinglists,putatickintheoptionsUpdate
BaseWordsandZeroBaseWordsandruntheRANGEprogramoverthelistofheadwords
thatyouwantinyourlist.ThenchoosetheoptionCopyBasewordsfromtheFilemenuwhich
willthenallowyoucopywholefamilieswhichhaveanumbernexttothemtoanewlist.You
cankeepaddingfamiliestothenewbasewordlistbyusingtheCopyBasewordsoptionwith
severalexistingbasewordfiles.If you ask to copy families greater than 0 then the default is
to copy a family that has either a headword or any member that has a count of greater than
zero. If you ask to copy families equal to 0 then the default is to copy a family that has either
a headword or any member that has a count of equal to zero.
If you want to copy families that have zero in both the headwords and the members, click the
option, Use Family Total, which uses the total frequency of the family. Thus if you select this
option and ask for families that are equal to zero, you will only get families in which both the
headwords and the members are all zero. If you ask for families equal to one, you'll only get
families where the member and headword counts add up to one and so on.
Makingalistofheadwords
IfyouwanttomakealistofonlyheadwordsfromaBasewordlist,choosetheMakeHead
WordListoptionfromtheFilemenu.
UsingRANGEorFREQUENCYwithlanguagesotherthanEnglish
Thefilerange.txtallowsyoutospecifywhatlettersofthealphabetyouwantRANGEto
recognise.FREQUENCYhasasimilarfilecalledfrequency.txt.
Troubleshooting
Thefollowingdescriptionlistserrormessages,causes,waysofcheckingandremedies.
ERRORSavefilenotset,cannotrun.
Cause:Youhavenotprovidedanamefortheoutputfilewhichwillcontainthe
resultsfromrunningtheprogram.
Remedy:GototheFilemenu,selectSave,andintheFilenameboxtypeanamefor
theoutputfile.
ERRORBASEWRD1.txtnotfound,cannotrun.
Cause:Oneormoreofthebaselistsisnotpresent,orisnotinthesamedirectoryas
theRANGEprogramandtheinputtext,orareincorrectlynamed.Theerrormessage
willtellyouwhichofthethreelistsismissing.

Check:ListthefilesinthedirectorytoseeifBASEWRD1.txt,BASEWRD2.txt,and
BASEWRD3.txtarepresent.RANGEwillonlyusebaselistsnamed
BASEWRD1.txt,BASEWRD2.txt,BASEWRD3.txtandsoon.
Remedy:Renamethebaselistfiles,createbaselistfiles,ormovethebaselistfilesto
thesamedirectory.Youmayneedtomakesurethatthereisnotanadditionalsuffix
like.ascafter.txt.
ERRORlinetoolong
Cause:TheinputfileisnotinDOS(ASCII)formatwithlinebreaks.Ithasnotbeen
convertedfromaWORDfiletoaTextfilewithlinebreaks(*.txt).
Check:UseatexteditorlikeNotepadtolookattheinputfiletoseeifitcontains
unusualcharacters.
Solution:ConvertthefiletoatextfilebychoosingSaveasfromtheFilemenuand
choosingTextOnlywithLineBreaks(*.txt)fromtheSaveasTypebox.
SKIP

ERRORLineinthebasewordfilehasnoentry!Skippingline:3
Cause:Thereisablanklineinthebasewordfile.Theprogramgivesthelinenumber
tohelpyoufindthis.
Solution:Deletetheunwantedhardreturn.
Reading:basewrd1.txt
ERROREntry:Widihasnocountvalue,usingzero
Cause:Thereisanerrorinthebasewordlistsinthatoneormoreofthewords(Widi
intheexample)doesnothaveaspaceanda0afterit.
Check:Lookatthelisttoseeifthewordisfollowedbyspacesandanumber,for
example0.TheReading:basewrd1.txtmessagehelpsyoufindwhatbasewordlistthe
errorisin.
Solution:Editthewordinthebasewordlistbyaddingthespacepluszero.
ERRORWrd:JONIenteredinbasewordlists!line:5
Cause:Awordoccursmorethanonceinabasewordlistorinmorethanonebase
wordlist.Forexample,awordfamilyortypeoccursinbothBASEWRD1.txtand
BASEWRD2.txt.
9

Solution:Makeanoteofthewordsmentionedintheerrormessage(JONIinthe
example)andeditthelistssothatthesewordsonlyoccurinonelistoronlyonceina
list.
Applications
Thefollowingpiecesofpublishedresearchandthesesuseddataobtainedbyusingversions
ofRANGEandFREQUENCY.RANGEhaspreviouslybeencalledFvordsand
VocabProfile.FREQUENCYwaspreviouslycalledWORDS.
Articles
Coxhead,A.(2000)Anewacademicwordlist.TESOLQuarterly34,2:213238.
Hirsh,D.andNation,P.(1992)Whatvocabularysizeisneededtoreadunsimplifiedtextsfor
pleasure?ReadinginaForeignLanguage8,2:689696.
HwangKyonghoandNation,P.(1989)Reducingthevocabularyloadandencouraging
vocabularylearningthroughreadingnewspapers.ReadinginaForeignLanguage6,
1:323335.
Klinmanee,N.andSopprasong,L.(1997)Bridgingthevocabularygapbetweensecondary
schoolanduniversity:aThaicasestudy.Guidelines19,1:110.
Laufer,B.andNation,P.(1995)LexicalrichnessinL2writtenproduction:Canitbe
measured?AppliedLinguistics16,3:307322.
Laufer,B.(1994)Thelexicalprofileofsecondlanguagewriting.Doesitchangeovertime?
RELCJournal25,2:2133.
Laufer,B.andParibakht,T.S.(1998)Therelationshipbetweenpassiveandactive
vocabularies:effectsoflanguagelearningcontext.LanguageLearning48,3:365391.
Nation,PandDeWeerdt,(2002)Adefenceofsimplification.Prospect.
Nation,P.andHwang,K.(1995)Whereshouldgeneralservicevocabularystopandspecial
purposesvocabularybegin?System23,1:3541.
Nation,P.andWang,K.(1999)Graded readers and vocabulary. Reading in a Foreign
Language 12, 2: 355-380.
Sutarsyah,C.,Nation,P.andKennedy,G.(1994)HowusefulisEAPvocabularyforESP?A
corpusbasedcasestudy.RELCJournal25,2:3450.
Wodinsky,M.andNation,P.(1988)Learningfromgradedreaders.ReadinginaForeign
Language.5,1:155161.
Worthington,D.andNation,P.(1996)Usingtextstosequencetheintroductionofnew
vocabularyinanEAPcourse.RELCJournal27,2:111.
Theses
Coxhead,A.(1998)ThedevelopmentandevaluationofanacademicwordlistMAThesis,
VictoriaUniversityofWellington,NewZealand.
Hirsh,D.(1993)Thevocabularydemandsandvocabularylearningopportunitiesinshort
novelsMAThesis,VictoriaUniversityofWellington,NewZealand.
HwangKyongho(1989)Readingnewspapersfortheimprovementofvocabularyandreading
skills.MAThesis,VictoriaUniversityofWellington,NewZealand.
10

Jenkins,S.(1993)Thevocabularyburdenofcontrolledanduncontrolledreadingmaterials
usedwithbeginningESLreaders.MAThesis,VictoriaUniversityofWellington,
NewZealand.
Muhammad,R.(1990)Thevocabularydemandsofreadinginformationpamphlets.MA
Thesis,VictoriaUniversityofWellington,NewZealand.
Sutarsyah,C.(1993)ThevocabularyofeconomicsandacademicEnglish.MAThesis,
VictoriaUniversityofWellington,NewZealand.
DevelopmentanddistributionofRANGEandFREQUENCY
RANGEandFREQUENCYwereprogrammedbyAlexHeatley.TheyweredesignedbyPaul
NationandAverilCoxheadoftheSchoolofLinguisticsandAppliedLanguageStudies,
VictoriaUniversity,P.O.Box600,Wellington,NewZealand.Theprogramiswrittenin
Delphi5.Theuncompiledversionisavailableifyouwishtoadapttheprogram.Pleasekeep
PaulNationinformedofproblemswiththeprogramandbaselists,andanyimprovements
andapplicationsyouhavemade.Youcandistributetheprogram,baselistsandinstructions
tootherusers.Itmustnotbesoldforprofit.Thereisnoneedtoseekpermissionforitsuseor
foritsfreedistributiontoothers.
Email:Paul.Nation@vuw.ac.nz
Fax:6444635604Attention:PaulNation
Torefertoitinalistofreferences,youcandosousingthefollowinginformation.
Heatley,A.,Nation,I.S.P.andCoxhead,A.2002.RANGEandFREQUENCYprograms.
http://www.vuw.ac.nz/lals/staff/Paul_Nation
SampleoutputfromRANGE
Thissampleusesthetwofollowingshorttextsasinput.RANGEwasrunwiththethreebase
listschosenandwiththeFormsboxticked.
Text1
Everybody laughed. Joni's face became red and hot. Joni, that is not the correct answer.
Please listen to the lesson, said Miss Suhardi. Then the bell rang. It was the end of the
English lesson and it was the end of school for that day. Miss Suhardi gave the class some
homework and left the room. The students began to talk and laugh. Joni's face was still hot,
but the others had already forgotten his mistake.
Text 2
Widi picked up her books and went too. Joni walked near her. He was happy. He wanted to sit
near her. Priyadi stood up and spoke to all the students. "Next Thursday," he said, "we are
going to visit the students of a school in Jakarta. We are going to stay with them for three
11

days. We will visit some places in Jakarta too. It will cost 2,500 rupiahs. Please pay the
money soon. Tell me your names and pay the money to me.
Theoutput
Processing file: C:\WINNT\Profiles\nationp\Personal\range\widi.txt
Number of lines: 6
Number of words: 78
Processing file: C:\WINNT\Profiles\nationp\Personal\range\joni.txt
Number of lines: 6
Number of words: 83
Reading: basewrd1.txt
Reading: basewrd2.txt
Reading: basewrd3.txt
WORD LIST

TOKENS/%

one
two
three
not in the lists

143/88.8
6/ 3.7
0/ 0.0
12/ 7.5

81/87.1
5/ 5.4
0/ 0.0
7/ 7.5

Total

161

93

Number of BASEWRD1.txt types: 3126


Number of BASEWRD2.txt types: 2721
Number of BASEWRD3.txt types: 2540

TYPES/%

FAMILIES
74
5
0
?????
79

Number of BASEWRD1.txt families: 999


Number of BASEWRD2.txt families: 986
Number of BASEWRD3.txt families: 570

Table of Ranges: Types


160 Words appear in
26 Words appear in

1 input files
2 input files

Table of Ranges: Families


71 Words appear in
15 Words appear in

1 input files
2 input files

Types Found In Base List One


TYPE
AND
FOR
IT
OF
PLEASE
SAID
SCHOOL
SOME
STUDENTS
THE
TO
WAS
A
ALL
ALREADY
ANSWER
ARE
BECAME
BEGAN

RANGE
2
2
2
2
2
2
2
2
2
2
2
2
1
1
1
1
1
1
1

FREQ
7
2
3
3
2
2
2
2
3
14
7
4
1
1
1
1
2
1
1

12

F1
4
1
2
2
1
1
1
1
1
10
2
3
0
0
1
1
0
1
1

F2
3
1
1
1
1
1
1
1
2
4
5
1
1
1
0
0
2
0
0

BOOKS
BUT
CLASS
COST
DAY
DAYS
END
ENGLISH
EVERYBODY
FACE
FORGOTTEN
GAVE
GOING
HAD
HAPPY
HE
HER
HIS
HOT
IN
IS
LAUGH
LAUGHED
LEFT
LISTEN
ME
MISS
MONEY
NAMES
NEAR
NEXT
NOT
OTHERS
PAY
PLACES
RANG
RED
ROOM
S
SIT
SOON
SPOKE
STAY
STILL
STOOD
TALK
TELL
THAT
THEM
THEN
THREE
THURSDAY
TOO
UP
VISIT
WALKED
WANTED
WE
WENT
WILL
WITH
YOUR

1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1

1
1
1
1
1
1
2
1
1
2
1
1
2
1
1
3
3
1
2
2
1
1
1
1
1
2
2
2
1
2
1
1
1
2
1
1
1
1
2
1
1
1
1
1
1
1
1
2
1
1
1
1
2
2
2
1
1
3
1
2
1
1

0
1
1
0
1
0
2
1
1
2
1
1
0
1
0
0
0
1
2
0
1
1
1
1
1
0
2
0
0
0
0
1
1
0
0
1
1
1
2
0
0
0
0
1
0
1
0
2
0
1
0
0
0
0
0
0
0
0
0
0
0
0

1
0
0
1
0
1
0
0
0
0
0
0
2
0
1
3
3
0
0
2
0
0
0
0
0
2
0
2
1
2
1
0
0
2
1
0
0
0
0
1
1
1
1
0
1
0
1
0
1
0
1
1
2
2
2
1
1
3
1
2
1
1

RANGE
1
1
1
1
1

FREQ
1
1
2
1
1

F1
1
1
2
1
0

F2
0
0
0
0
1

Types Found In Base List Two


TYPE
BELL
CORRECT
LESSON
MISTAKE
PICKED
Types Found In Base List Three

13

TYPE

RANGE

FREQ

F1

F2

RANGE TYFREQ FAFREQ


2
7
7
2
0
9
2
1
2
2
2
2
2
3
4
2
3
3
2
3
3
2
2
2
2
0
2
2
2
2
2
2
2
2
0
3
2
14
14
2
7
7
1
1
1
1
1
1
1
1
1
1
1
1
1
0
1
1
0
1
1
0
1
1
1
1
1
1
1
1
1
1
1
2
2
1
1
1
1
0
1
1
2
2
1
0
1
1
0
1
1
0
3
1
1
1
1
0
1
1
2
2
1
0
2
1
2
2
1
1
2
1
1
1
1
1
1
1
2
2
1
2
2
1
0
1
1
2
2
1
1
1
1
1
1
1
0
1
1
2
2
1
0
1
1
1
1
1
0
1
1
1
1
1
0
3
1
1
1
1
1
1
1
0
1
1
0
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
0
1
1
0
2
1
1
1
1
1
1
1
2
2

F1
4
6
1
1
1
2
2
1
1
1
1
1
10
2
0
0
1
1
1
1
0
1
1
0
2
1
1
2
1
1
0
0
1
2
0
0
2
1
1
2
0
0
0
0
1
1
0
0
1
1
1
0
0
0
0
0
0
1
1
0
1
0
2
0
0
0

LIST OF FAMILY GROUPS


BASE ONE FAMILIES
AND
BE
DAY
FOR
HE
IT
OF
PLEASE
SAY
SCHOOL
SOME
STUDENT
THE
TO
A
ALL
ALREADY
ANSWER
BECOME
BEGIN
BOOK
BUT
CLASS
COST
END
ENGLISH
EVERY
FACE
FORGET
GIVE
GO
HAPPY
HAVE
HOT
I
IN
LAUGH
LEFT
LISTEN
MISS
MONEY
NAME
NEAR
NEXT
NOT
OTHER
PAY
PLACE
RED
RING
ROOM
SHE
SIT
SOON
SPEAK
STAND
STAY
STILL
TALK
TELL
THEN
THEY
THIS
THREE
THURSDAY
TOO

14

F2
3
3
1
1
3
1
1
1
1
1
1
2
4
5
1
1
0
0
0
0
1
0
0
1
0
0
0
0
0
0
3
1
0
0
2
2
0
0
0
0
2
1
2
1
0
0
2
1
0
0
0
3
1
1
1
1
1
0
0
1
0
1
0
1
1
2

UP
VISIT
WALK
WANT
WE
WILL
WITH
YOU

1
1
1
1
1
1
1
1

2
2
0
0
3
2
1
0

2
2
1
1
3
2
1
1

0
0
0
0
0
0
0
0

2
2
1
1
3
2
1
1

BASE TWO FAMILIES


BELL
CORRECT
LESSON
MISTAKE
PICK

RANGE TYFREQ FAFREQ


1
1
1
1
1
1
1
2
2
1
1
1
1
0
1

F1
1
1
2
1
0

F2
0
0
0
0
1

BASE THREE FAMILIES

RANGE TYFREQ FAFREQ

F1

F2

RANGE
2
1
1
1
1
1
1

F2
1
0
2
1
1
0
1

Types Not Found In Any List


TYPE
JONI
HOMEWORK
JAKARTA
PRIYADI
RUPIAHS
SUHARDI
WIDI

FREQ
4
1
2
1
1
2
1

time taken was : 1 Seconds


Number of Nodes Read:
0
Number of Cache Nodes Read:
31175
Number of Nodes Written:
0
Number of Cache Nodes Written:
2045
Number of nodes per second,
0
Number of words per second,
161
Number of unique words in tree,
93
Number of unique words per second,
93
...Finished
/var/www/apps/conversion/tmp/scratch_1/253970036.doc

15

F1
3
1
0
0
0
2
0

Das könnte Ihnen auch gefallen