Sie sind auf Seite 1von 304

Quick Reference

Chapter
1: STATISTICA: General Overview 1
2: Step-by-Step Examples 9
Analytics 11
Data Management 72
Enterprise Installations 98
3: User Interface 125
4: Output from Analyses 145
5: STATISTICA Documents 167
6: Graphs 187
7: Customizing STATISTICA 211
8: STATISTICA Visual Basic 219
9: STATISTICA Query 241
10: STATISTICA and .NET 247
Appendixes
A: Getting More Help 255
B: STATISTICA Enterprise Server 261
C: STATISTICA Family of Products 273
QuickReference

Copyright StatSoft, 2011


iiSTATISTICAQuickReference
QuickReference:Contents

Copyright StatSoft, 2011


STATISTICAQuickReferenceiii
STATISTICA Quick Reference
Contents
1. STATISTICA: A GENERAL OVERVIEW OF FEATURES ....................................... 1
2. STEP-BY-STEP EXAMPLES ................................................................................. 9
Analytics..............................................................................................................11
Example1:Correlations................................................................................11
Example2:ANOVA........................................................................................34
Example3:VariableBundles.........................................................................40
Example4:ByGroupAnalyses.....................................................................43
Example5:SummaryResultsPanels(Quality,Process,GageSixpacks).....51
Example6:STATISTICADataMiner..............................................................57
DataManagement..............................................................................................72
Example1:SpreadsheetFormulasandBatchFormulas..............................72
Example2:InputDataDirectlyfromExcel...................................................77
Example3:AccessingDataDirectlyfromaSQLServerDatabase................79
Example4:DataPreparationCleaningandFiltering.................................84
Example5:UsingSTATISTICAETL(Extract,Transform,andLoad)...............93
EnterpriseInstallations.......................................................................................98
Example1:STATISTICAEnterpriseServerDownload/Offload
Analysesfrom/toServers........................................................................98
Example2:UsingSTATISTICAinRegulatedEnvironments.........................102
Example3:STATISTICAEnterprise..............................................................109
TheSTATISTICAEnterpriseServerOption...................................................124
OtherExamples
STATISTICAVisualBasic:RecordinganAnalysis.........................................230
STATISTICAQuery:AccessingExternalDatabases......................................244
3. USER INTERFACE .......................................................................................... 125
GeneralFeatures...............................................................................................127
CustomizedOperation................................................................................127
AlternativeAccesstotheSameFacilitiesCustomStylesofWork...........128
MultipleAnalysisSupport.................................................................................128
QuickReference:Contents

Copyright StatSoft, 2011


ivSTATISTICAQuickReference
InteractiveUserInterface.................................................................................130
Overview.....................................................................................................130
TheFlowofInteractiveAnalysis.................................................................131
FeaturesofAnalyses...................................................................................134
DocumentTypes.........................................................................................137
STATISTICAVisualBasicandControllingSTATISTICAfrom
OtherApplications.......................................................................................140
WebBrowserBasedUserInterface:STATISTICAEnterpriseServer.................141
MicrosoftOfficeintegration.............................................................................142
4. SIX CHANNELS FOR OUTPUT FROM ANALYSES ........................................ 145
Overview...........................................................................................................147
1.STATISTICAWorkbooks.................................................................................148
2.StandAloneWindows...................................................................................150
3.Reports..........................................................................................................151
STATISTICAReports.....................................................................................151
ReportsfromWorkbooks............................................................................152
RTF(RichTextFormat)Reports..................................................................152
Acrobat(PDF)Reports................................................................................153
HTMLReports.............................................................................................154
4.MicrosoftWord.............................................................................................154
5.OutputtotheWeb........................................................................................155
KnowledgePortal........................................................................................155
PublishingContentfromSTATISTICAEnterpriseServer.............................157
PublishingContentfromSTATISTICADesktopApplications.......................160
6.SharePointorSTATISTICADocumenTManagementSystem(SDMS)...........163
SharePoint...................................................................................................163
STATISTICADocumentManagementSystem(SDMS)................................165
5. STATISTICA DOCUMENTS ............................................................................. 167
Workbooks........................................................................................................169
NavigatingtheWorkbookTree...................................................................171
Spreadsheets(MultimediaTables)...................................................................173
Inputvs.OutputSpreadsheets...................................................................177
STATISTICASpreadsheetOLEDBProvider..................................................178
Reports..............................................................................................................180
NavigatingtheReportTree.........................................................................181
QuickReference:Contents

Copyright StatSoft, 2011


STATISTICAQuickReferencev
Graphs...............................................................................................................182
Macros(STATISTICAVisualBasicPrograms).....................................................183
STATISTICAProjects..........................................................................................184
6. GRAPHS ......................................................................................................... 187
Overview...........................................................................................................189
CustomizationofGraphs..................................................................................190
GeneralCategoriesofGraphs...........................................................................198
GraphsofInputData.........................................................................................199
GraphsofBlockData.........................................................................................202
GraphsMenuGraphs........................................................................................204
GraphBrushingandCaseStates.......................................................................205
OtherSpecializedGraphs..................................................................................208
CreatingGraphsviaSTATISTICAVisualBasic....................................................209
7. CUSTOMIZING STATISTICA .......................................................................... 211
CustomizationoftheInteractiveUserInterface..............................................213
CustomizationofDocuments...........................................................................214
Localvs.PermanentCustomizations................................................................215
GeneralDefaults...............................................................................................215
GraphCustomization........................................................................................217
MaintainingDifferentConfigurationsofSTATISTICA.......................................218
CustomizedConfigurationsforIndividualUsersonaNetwork.......................218
8. STATISTICA VISUAL BASIC ........................................................................... 219
RecordingSTATISTICAVisualBasic(SVB)Macros(Programs)..........................224
AnalysisMacros,Master(Log)Macros,andKeyboardMacros.................224
Example:RecordinganAnalysis.......................................................................230
ActiveXObjectsandDocuments(ATechnicalNote)........................................238
9. STATISTICA QUERY ....................................................................................... 241
Overview...........................................................................................................243
STATISTICAQuery:QuickStepbyStepInstructions........................................244
InPlaceProcessingofDataonRemoteServers(theIDP
TechnologyOption).....................................................................................245
OLAPCUBES......................................................................................................246
LargeDatabaseFiles.........................................................................................246
QuickReference:Contents

Copyright StatSoft, 2011


viSTATISTICAQuickReference
10. PROGRAMMING STATISTICA FROM .NET ................................................. 247
AddingtheSTATISTICAObjectLibraryintoYour.NETProject...................249
ManuallyCreatingtheCOMInteropLibrary..............................................251
SupportingMultipleVersionsofSTATISTICA..............................................251
InstantiatingSTATISTICA.............................................................................252
TheLibraryVersionofSTATISTICA..............................................................252
APPENDIXES
A.GettingMoreHelp........................................................................................255
B.STATISTICAEnterpriseServer........................................................................261
C.STATISTICAFamilyofProducts.....................................................................273
INDEX ................................................................................................................. 289

STATISTICA:
A GENERAL OVERVIEW
OF FEATURES

1
1

CHAPTER

Copyright StatSoft, 2011


2STATISTICAQuickReference

Copyright StatSoft, 2011


STATISTICAQuickReference3
STATISTICA:
A GENERAL OVERVIEW
OF FEATURES
STATISTICAisacomprehensiveanalytic,research,andbusinessintelligencetool.It
isanintegrateddatamanagement,analysis,mining,visualization,andcustom
applicationdevelopmentsystemfeaturingawideselectionofbasicandadvanced
analyticproceduresforbusiness,datamining,science,andengineering
applications.
Analytic Facilities
STATISTICAincludesnotonlygeneralpurposeanalytic,graphical,anddatabase
managementprocedures,butalsocomprehensiveimplementationsofspecialized
methodsfordataanalysis(e.g.,predictivedatamining;business,socialsciences,
andbiomedicalresearch;orengineeringapplications).Allanalytictoolsofferedin
theSTATISTICAlineofsoftwareareavailableaspartofanintegratedpackage.
Thesetoolscanbecontrolledthroughaselectionofalternativeuserinterfaces
including:
ahighlyoptimizedinteractiveuserinterface(withoptionstoexecute
STATISTICAfromwithinMicrosoftOfficeandotherapplications),
acompletethinclient,browserbaseduserinterface(inSTATISTICA
EnterpriseServer)thatenablesyoutooffloadtimeconsumingtaskstothe
serverandworkcollaboratively,and
CHAPTER
1
1

Chapter1: OverviewofFeatures

Copyright StatSoft, 2011


4STATISTICAQuickReference
acomprehensive,industrystandard,.NETcompatibleprogramming
interface(includingthebuiltin,.NETcompatibleVisualBasic),offering
accesstomorethan14,000externallycallablefunctions.
Interactiveuserinterfacescanbeeasilyautomatedviamacrosandcustomized
usingavarietyofmethods,andtheyarerecordableintheformofindustry
standardVBscripts.Thebuiltindevelopmentenvironmentcanbeusedto
interfaceSTATISTICAwithotherapplicationsandenterprisewideinfrastructures
ortobuildcustomextensionsofanycomplexity,fromsimpleshortcutsto
advanced,largescaledevelopmentprojects.
Unique Features
SomeoftheuniquefeaturesoftheSTATISTICAlineofsoftwareinclude:
thebreadthofselectionandcomprehensivenessofimplementationof
analyticalprocedures,
theunparalleledselection,quality,andcustomizabilityofgraphics
integratedseamlesslywitheverycomputationalprocedure,
aselectionofefficientanduserfriendlyuserinterfaces,
theeaseofcustomizabilityusingthetrulyopenarchitecturecompatible
withvirtuallyallenterpriseanddevelopmentenvironments(including
.NET),thatexposesSTATISTICAsmorethan14,000functions,
awideselectionofadvancedsoftwaretechnologies(seeSoftware
Technology,page6)thatisresponsibleforSTATISTICAspractically
unlimitedcapacity,performance(speed,responsiveness),andapplication
customizationoptions,
nativeRscriptscanberundirectlywithinSTATISTICAandRoutputcanbe
retrievedasnativeSTATISTICASpreadsheetsandGraphs.
OneofthemostuniqueandimportantfeaturesoftheSTATISTICAfamilyof
applicationsisthatthesetechnologiesenableeveninexperienceduserstotailor
STATISTICAtotheirspecificpreferences.Youcancustomizepracticallyevery
aspectofSTATISTICA,includingeventhelowlevelproceduresofitsuserinterface.
ThesameversionofSTATISTICAcanbeused:
Chapter1:OverviewofFeatures

Copyright StatSoft, 2011


STATISTICAQuickReference5
BynovicestoperformroutinetasksusingthedefaultanalysisStartup
dialogQuicktab(containingjustafew,selfexplanatorybuttons),oreven
byaccessingSTATISTICAwiththeirWebbrowsers(andahighlysimplified
frontend),and
Byexperiencedanalysts,professionalstatisticians,andadvanced
applicationdeveloperswhocanintegrateanyofSTATISTICAshighly
optimizedprocedures(morethan14,000functions)intocustom
applicationsorcomputingenvironments,usinganyofthecuttingedge
.NETandWebcompatibletechnologies.
The General Philosophy of the
STATISTICA Approach
STATISTICAsdefaultconfiguration(itsgeneraluserinterfaceandsystemoptions)
isaresultofyearsoflisteningcarefullytoourusers.
Wehavereceivedfeedbackfromtensofthousandsofourusers,representing
hundredsofthousandsofourusersfromallcontinentsand,practicallyspeaking,
allwalksoflife.Oneofthemostimportantfactsthatwehavelearnedfrom
theseusersishowdifferenttheirneedsandpreferencesare(bothacross
individualsandprojectsorapplications).Inordertomeetthosedifferentiated
needs,STATISTICAisdesignedtoofferperhapsoneofthemostflexibleandeasily
customizableuserinterfacesofanycontemporaryapplication.
AlthoughSTATISTICAprovidesaccesstoapowerfularsenalofadvancedsoftware
technologies(seeSoftwareTechnology,page6),youdonotevenneedtoknow
aboutthem,becausetheyaredesignedtoworkautomaticallyandintuitively.A
noviceusermayneverseemorethanafewselfexplanatorybuttons.Advanced
options,however,areonlyonetabormouseclickaway.Practicallyeveryaspectof
STATISTICA(fromthestartupconfiguration,tothewaytheoutputisgenerated
andmanagedbythesystem,tohowSTATISTICApromptsyoutochooseyournext
step)canbechangedwithamouseclick.Moreover,STATISTICAremembersyour
selectionsuntilyouchangeyourmind.Practicallyalldialogsusedtoselectan
analysisorperformaroutineoperationcanbeeasilyreplaced(e.g.,simplified,
enhanced,orcombinedwithcustom,userdesignedprocedures).STATISTICAwill
alwayslookandworkthewayyouwant.
Chapter1: OverviewofFeatures

Copyright StatSoft, 2011


6STATISTICAQuickReference
Software Technology
(A Technical Note)
Theperformance,customizability,andwideselectionofoptionsthatcanbe
tailoredtoyourneedsmentionedintheprevioussectionwouldnotbepossibleif
STATISTICAdidnotfeaturetheadvancedtechnologiesthatdriveallfunctionsof
theapplication.
STATISTICAusesand/orsupportsvirtuallyalltherelevantleadingedgesoftware
technologiesavailabletoday.Everyoneofthemorethan14,000STATISTICA
functionsisaccessibletoexternalapplications.Practicallynolimitationsare
imposedintermsofeithertheamountorcomplexityofdatathatcanbestored
andaccessed.STATISTICAalsoisoptimizedforWebandmultimediaapplications.
Computationalandgraphicsproceduresaredrivenbycountlessproprietary
optimizationssuchas,forexample,thequadrupleprecisioncomputational
technologythatenablesustoovercomethelimitationsoftheIEEEfloatingpoint
storagestandardsanddeliverscomputationalaccuracynormallyfoundonlyin
designatedmathapplications(thatfeaturearbitraryprecisionoptions)butnotin
highvolumedataprocessingapplicationssuchasstatisticalordatamining
programs.
Asaresult,STATISTICAoffersunmatchedspeed,numericalprecision,and
responsiveness,whichisaidedbymultithreading(andtheadvanced
supercomputerlikedistributed/parallelprocessingarchitectureofferedin
theClientServerversion,i.e.,STATISTICAEnterpriseServer).
DataaccessisbasedonaflexiblestreamingtechnologythatenablesSTATISTICAto
workeffortlesslywithboththesimpleinputdatafilesstoredonthelocaldriveand
queriesofmultidimensionaldatabasescontainingterabytesofdataandstoredin
remotedatawarehousesandprocessedinplace(i.e.,withouthavingtoimport
themtoalocalstorage;thisfeatureisavailableinenterpriseversionsof
STATISTICA).
Forexample,youcansimultaneouslyrunmultipleinstancesofSTATISTICA[inany
combinationoflocal,network,andClientServer(Webbased)environments],each
runningmultipleanalysesofdatafrommultipleandsimultaneouslyopeninput
datafilesandqueries,andtheresultscanbeorganizedintoseparateprojects.
STATISTICAsinputandoutputdatafilesandgraphscanbeofpracticallyunlimited
Chapter1:OverviewofFeatures

Copyright StatSoft, 2011


STATISTICAQuickReference7
size,comprisinghierarchiesofdocumentsofvarioustypes.Theoutputcanbe
directedtoamultitudeofoutputchannelssuchasmultimediatables,high
performanceworkbooks,reports(including.pdffilesandMicrosoftOffice
documents),andtheInternet,aswellastheoptionalSTATISTICADocument
ManagementSystem,whichcanbeseamlesslyintegratedwithanySTATISTICA
application.
Web Enablement
Oneoftheuniquefeaturesofthe STATISTICAfamilyofapplicationsisthatitisfully
Webenabled,andifSTATISTICAEnterpriseServerisinstalled,youcannotonly
offloadtimeconsumingtaskstotheserver,butalsoaccessthecomprehensive
functionalityoftheSTATISTICAsystemusingathinclient(browser)interface.This
includestheoptiontoexecutepreparedscriptsandaplethoraofinteractive
functionality,includingsuchoperationsasinteractivelybuildingpredictivedata
miningmodelsbydraggingarrowsintheinteractiveworkspaceofSTATISTICAData
Miner(usingonlythebrowser,withoutanyclientsoftwareinstalled).Formore
information,pleaserefertoAppendixBSTATISTICAEnterpriseServer,page263.
NotethatmostfeaturesdescribedinthismanualareavailableinallSTATISTICA
products,althoughsomesectionsofthemanualreferonlytospecificproducts
suchastheSTATISTICAEnterpriseServerfacilitiesortheSTATISTICADataMiner
lineofproducts.
Record of Recognition
Wearepleasedtoreportthat,asofthisprinting,STATISTICAhasreceivedthe
highestratingineverypublishedindependentcomparativereviewinwhichithas
beenfeatured.Inthehistoryofthesoftwareindustry,veryfewproductshaveever
achievedsucharecord.
FormoreinformationaboutStatSoftandSTATISTICAsrecordofrecognition,
pleasevisitourWebsiteatwww.StatSoft.com.
Chapter1: OverviewofFeatures

Copyright StatSoft, 2011


8STATISTICAQuickReference

Copyright StatSoft, 2011


STATISTICAQuickReference9
STEP-BY-STEP EXAMPLES
ANALYTICS
Example 1: Correlations ..................................................................... 11
Example 2: ANOVA .............................................................................. 34
Example 3: Variable Bundles ............................................................. 40
Example 4: By-Group Analyses .......................................................... 43
Example 5: Summary Results Panels
(Quality, Process, Gage-Sixpacks) ............................................... 51
Example 6: STATISTICA Data Miner .................................................. 57
DATA MANAGEMENT
Example 1: Spreadsheet Formulas and Batch Formulas ............... 72
Example 2: Input Data Directly from Excel ...................................... 77
continued



CHAPTER
2
2

CHAPTER2: ENTERPRISE EXAMPLES

Copyright StatSoft, 2011


10STATISTICAQuickReference
Example 3: Accessing Data Directly from a SQL
Server Database ........................................................................... 79
Example 4: Data Preparation Cleaning and Filtering ................... 84
Example 5: Using STATISTICA ETL (Extract,Transform,
and Load) ....................................................................................... 93
ENTERPRISE INSTALLATIONS
Example 1: STATISTICA Enterprise Server
Download/Offload Analyses from/to Servers ............................ 98
Example 2: Using STATISTICA in Regulated Environments .......... 102
Example 3: STATISTICA Enterprise ................................................. 109
The STATISTICA Enterprise Server Option ...................................... 124
OTHER EXAMPLES
STATISTICA Visual Basic: Recording an Analysis .......................... 230
STATISTICA Query: Accessing External Databases ....................... 244


Copyright StatSoft, 2011


STATISTICAQuickReference11

STEP-BY-STEP EXAMPLES
ANALYTICS
Example 1: Correlations
Starting STATISTICA.AfterinstallingSTATISTICA,youcanstarttheprogramby
selectingSTATISTICAfromtheWindowsStartAllProgramssubmenu.

YoucanalsodoubleclickoneitherSTATIST.exeinWindowsExplorerortheiconof
anySTATISTICAfile,e.g.,aspreadsheet,tostarttheprogram.
WhenyoustartSTATISTICAforthefirsttime,theUserInterfacedialogisdisplayed,
whereyoucanchoosetousetheribbonbarortheclassicdropdownmenus.All
examplesinthismanualusetheribbonbar.
CHAPTER
2
2

Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


12STATISTICAQuickReference

Notethatitiseasytoswitchbetweentheribbonbarandtheclassicmenusatany
time.Whentheribbonbarisdisplayed,clickthemenuicon ontheQuickAccess
toolbar(locatedintheupperleftcorneroftheribbonbar)todisplaytheclassic
menus.Whentheclassicmenusaredisplayed,selectRibbonBarfromtheView
menutodisplaytheribbonbar.
Tocreatemorespaceintheapplicationwindow,youcanminimizetheribbonbar.
Eitherdoubleclickontheselectedtabheader,orrightclickontherightsideofthe
rowoftabsandfromtheshortcutmenu,selectMinimizetheRibbon.
AfteryouclickOKintheUserInterfacedialog,theWelcometo
STATISTICAdialogisdisplayed,whichcontainsoptionsthatare
usefultoaccesscommonfunctionsinSTATISTICA.
Ifyouprefer,youcanselecttheDontshowthisdialogagain
checkboxlocatednearthebottomofthedialog,andthisdialog
willnotbedisplayedwhenyoustartSTATISTICA.Dependingon
theversionofSTATISTICAyouhave,theremaybeotherdialogs
displayedaswell.
Customization of STATISTICA.Practicallyallaspectsofthe
behaviorandappearanceofSTATISTICA(evenmanyelementary
featuresillustratedinthisexample,suchaswhereoutputis
directed)canbepermanentlycustomizedtomatchyour
preferences.Forexample,eventhefirststep(openingSTATISTICA)canbe
customized;youcanchangethedefaultfullscreenopeningmode,theappearance
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference13
ofthedataspreadsheet,andmanyotheraspectsofSTATISTICA,whichwillbe
illustratedthroughoutthismanual.
Selecting a data file.Forthisexample,openAdstudy.sta:ontheHometabinthe
Filegroup,clicktheOpenarrow.Fromthedropdownmenu,selectOpen
ExamplestodisplaytheOpenaSTATISTICADataFiledialog.Doubleclickonthe
Datasetsfolder,anddoubleclickonAdstudy.Youcanalsoopendatafilesby
1)selectingOpenDocumentfromtheOpendropdownmenutodisplaytheOpen
dialogwhereyoucanbrowsetotheappropriatelocation,2)clickingthe
buttonlocatedoneachStartupPanel(thefirstdialogdisplayedwhenstarting
analysisorgraphspecifications),or3)clickingthefoldericonaboveOpenonthe
Hometab.
Data spreadsheets (multimedia tables).STATISTICAdatafilesaredisplayedina
spreadsheet(i.e.,onespreadsheetisonedatafile).AllSTATISTICASpreadsheets
aredisplayedusingStatSoftspowerfulmultimediatabletechnology,andtheycan
containnotonlypracticallyunlimitedamountsofdata,butalsosound,video,
embeddeddocuments,automationscripts,andcustomuserinterfaces.
Itispossibletohavemorethanonedataspreadsheetopenatatime(witheach
spreadsheetconnectedtoadifferentanalysis).
DatamanagementfacilitiesareavailableontheDatatab,whichisdisplayed
wheneveraspreadsheetisopen.Commandsonthetabsareorganizedinlogical
groups;e.g.,theDatatabcontainstheTransformations,Cases,Variables,
Manage,andModegroups.

AllthecommandsontheribbonbarandclassicmenusaredescribedinSTATISTICA
Help;pointto(highlight)acommand,andpressF1onyourkeyboardtodisplaythe
respectiveHelptopic.
Variable specifications.Thevariable(column)headersinthespreadsheet
containthevariablenames.DoubleclickonthefirstvariableheaderGENDER
todisplayitsVariablespecificationsdialog.
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


14STATISTICAQuickReference

Spreadsheet formulas.Usingtheoptionsinthisdialog,youcanchangethe
variablenameand/orformat,enteraformulatorecalculatethevaluesofthe
variable,etc.IftheentryintheLongname(labelorformulawithFunctions)box
startswithanequalsign(=),STATISTICAinterpretsitasaformula[acommentcan
followafterasemicolon(;)].Forexample,ifyouenterintotheLongnamebox
(ofvariableone)=(v2+v3+v4)/3or=mean(v2:v4),thecurrentvaluesofthat
variablewillbereplacedbytheaverageofvariablestwothroughfour,separately
foreachcase(row)ofthespreadsheet.
Specificationsofallvariablescanalsobereviewedandeditedtogetherina
combinedVariableSpecificationsEditordialog,accessedbyclickingtheAll
SpecsbuttonintheVariablespecificationsdialog.

Shortcut menus accessed from spreadsheets.Ausefulfeatureofthe


spreadsheetisthelistofcommandsavailablefromitsshortcutmenus.Shortcut
menusaredynamicmenusthataredisplayedbyrightclickingonanitem(e.g.,a
cellinthespreadsheet,asshownintheillustrationbelow).Thespreadsheet
shortcutmenusincludeaselectionofspecificdatamanagementoperationsand
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference15
otheroptionsrelatedtothecurrentlyselectedvariable(column),case(row),block
ofcells,orotheritem.

Six ways of handling output.Youcancustomizethewayoutputismanagedin


STATISTICA(seeFiveChannelsforOutputfromAnalyses,page147).Youcandirect
alloutputtofivebasicchannels:
Workbooks,seepage148,
Standalonewindows,seepage150,
Reports,seepage151,
MicrosoftWord,seepage154,
TheWeb,seepage155,and
SharePointorSTATISTICADocumentManagementSystem(SDMS),see
page163
Thefirstfouroutputchannelslistedabovearecontrolledbytheoptionsinthe
OutputManageroptionspaneoftheOptionsdialog[accessiblebyselectingthe
ToolstabandclickingOptions;intheOptionsdialog,selectOutputManagerin
thetreeview(theleftpane)toviewrelatedspecificationsintheoptionspane(the
rightpane)].SharePointoptionsarelocatedontheHometabintheSharePoint
group.STATISTICADocumentManagementSystem(SDMS),acompletesolution
formanagingdocuments,isavailablefromStatSoft.SeeAppendixCSTATISTICA
FamilyofProductsformoreinformation.
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


16STATISTICAQuickReference

ThereareanumberofwaystooutputtotheWeb,dependingontheversionof
STATISTICAyouhave.Thesemeansforoutputcanbeusedinmanycombinations
(e.g.,aworkbookandreportsimultaneously),andeachoutputchannelcanbe
customizedinavarietyofways.Also,alloutputobjects(spreadsheetsandgraphs)
cancontainotherembeddedandlinkedobjectsanddocuments,soSTATISTICA
outputcanbehierarchicallyorganizedinavarietyofways.
Calculating a correlation matrix.Now,letscomputeacorrelationmatrixforthe
variablesintheAdstudy.stadatafile.TodisplaytheBasicStatisticsandTables
StartupPanel,selecttheStatisticstab,andintheBasegroup,clickBasicStatistics,

Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference17
orselectStatisticsBasicStatistics/TablesfromtheSTATISTICAStartmenu

in
thelowerleftcornerofthescreen.

Atthispoint,ensurethatablock(agroupofselectedcells)isnotselectedinthe
spreadsheet.Todeselectablock,clickinanycellinthespreadsheet.Ifablockis
selected,STATISTICAassumesthatthevariablescorrespondingtotheblockare
intentionallypreselectedfortheanalysis,andwhenyoulaterclicktheOKor
Summarybuttontoproducetheanalysisresults,insteadofpromptingyouto
selectvariables,STATISTICAwillautomaticallyproducethecorrelationsforthe
selectedblockvariables.
IntheBasicStatisticsandTablesStartupPanel(showninthenextillustration),

selectCorrelationmatricesandclicktheOKbutton(ordoubleclickCorrelation
matrices)todisplaytheProductMomentandPartialCorrelationsdialog.
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


18STATISTICAQuickReference

Quick vs. advanced analyses.Aswithmostanalysisspecificationdialogs(and


othertypesofSTATISTICAdialogs),theProductMomentandPartialCorrelations
dialogisorganizedbytabsaccordingtothetypeofoptionsavailable.Typically,at
leasttwocategoriesofoptionsareavailable.
TheQuicktabofadialogcontainsthemostcommonlyusedoptions,enablingyou
toquicklyspecifyabasicanalysiswithouthavingtosearchthroughnumerous
options.

TheAdvancedtabtypicallycontainsthesameoptionsavailableontheQuicktab
aswellasavarietyoflesscommonlyusedoptions(e.g.,inthiscase,optionsto
savematrices,producelesscommonlyrequestedstatistics,andcreateavarietyof
plots).Additionaltabsareoftenavailableaswell,dependingonthetypeof
analysisbeingspecified.

Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference19
Notethatinsomecases,onlyaQuicktabisavailable.Aswithalldialogsin
STATISTICA,youcanpressF1onyourkeyboardorclickthe buttonintheupper
rightcornertodisplayaHelptopiccontaininginformationabouttheoptions
availableonthecurrentlyselectedtab.
The self-prompting nature of STATISTICA dialogs.AlldialogsinSTATISTICA
followtheselfpromptingdialogconvention,whichmeansthatwheneveryou
arenotsurewhattoselectnext,simplyclicktheOKbuttonortheSummary
buttonandSTATISTICAwillproceedtothenextlogicalstep,promptingyouforthe
specificinputneeded(e.g.,variablestobeanalyzed).
Variables button.EveryanalysisspecificationdialoginSTATISTICAcontainsoneor
moreVariablesbuttonsusedtodisplaythevariableselectiondialogtospecify
variablestobeanalyzed.
Variable selection dialog.Forthisexample,clicktheOnevariablelistbutton(or
pressALT+Vonyourkeyboard)todisplaytheSelectthevariablesfortheanalysis
dialog.Notethatthevariableselectiondialogisalsodisplayedifyouclickthe
Summarybuttonbeforevariablesareselected.(Asmentionedpreviously,ifa
blockofvariablesisselectedinthedatafile,thosevariableswillbespecified
automaticallyfortheanalysis,andwhenyouclicktheSummarybutton,a
correlationmatrixwillbeproducedforthevariablesselectedintheblock,notall
variablesinthedatafile.)

Thevariableselectiondialogsupportsvariouswaysofselectingvariables(including
thestandardWindowsSHIFT+clickandCTRL+clickconventionstoselectrangesand
discontinuouslistsofvariables).
Youcanalsousevariousshortcutsandoptionsinthevariableselectiondialogto
reviewthecontentsofthedatafile.Forexample,youcanspreadthevariablelist
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


20STATISTICAQuickReference
toreviewthevariableslongnamesorformulas(clicktheSpreadbutton),oryou
canzoominonavariable(clicktheZoombutton)toreviewasortedlistofall
valuesanddescriptivestatisticsfortheselectedvariable(seethenextillustration).

Forthisexample,selectvariables1through10inthevariableselectiondialog.

ClicktheOKbutton.Amessagewillbedisplayedinformingyouthattherearetext
variablesselected.ClicktheContinuewithcurrentselectionbuttontoreturnto
theProductMomentandPartialCorrelationsdialog.Next,clicktheSummary
buttontogenerateacorrelationmatrixfortheselectedvariables.

NotethatinsteadofclickingtheSummarybutton,youcouldhaveclickedthe
Summary:CorrelationsbuttonontheQuicktaborontheAdvancedtabwiththe
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference21
sameresults.Also,dependingonthedefaultsyouhavespecifiedforhandling
output(intheOutputManageroptionspaneoftheOptionsdialog),the
Correlationsspreadsheetcanbedisplayedinareportorastandalonewindowor
senttoaWorddocument,ratherthaninaworkbookasshownabove.
Summary graphs.STATISTICAprovidesextremelyflexibletoolsandmethodsfor
summarizingkeyresultsingraphsand/ortables.Forexample,resumetheanalysis
byclickingtheProductMomentandbuttonontheAnalysisbarinthelowerleft
cornerofthescreenorbypressingCTRL+Ronyourkeyboard,andclickthe
buttontodisplaysummarygraphsforeachpairofvariablesinthecorrelation
matrix.

Thesegraphsnotonlyshowthescatterplotofpointsforeachcorrelation,butalso
thedistributions(histograms)foreachvariable,aswellastherespective
correlationcoefficientandregressionequation.
STATISTICAincorporatesmanysuchdisplaystosummarizebasicdescriptive
statistics,correlations,theresultsofGageorProcesscapabilitystudies,orother
typesofdataanalyses.
Results spreadsheets (multimedia tables).Inadditiontostoringdata,
spreadsheetsareusedinSTATISTICAtodisplaymostofthenumericoutput.Note
thatspreadsheetsoffermanydisplayfeaturesandoptions,andinthisexample,
significantcorrelationsaremarkedwithadifferentformattohelpdistinguish
them;bydefault,thecolorisred(intheCorrelationsspreadsheet,seethecell
adjacenttoMEASURE07underGENDER).Spreadsheetscanholdanywherefroma
shortlinetogigabytesofoutput,andtheyofferavarietyofoptionstofacilitate
reviewingtheresultsandvisualizingtheminpredefinedandcustomdefined
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


22STATISTICAQuickReference
graphs,aswillbeseenlaterinthisexample.Also,asmentionedpreviously,
STATISTICASpreadsheetsaremanagedusingStatSoftspowerfulmultimediatable
technology.Theycanhandlenotonlyvirtuallyunlimitedamountsofdata,butalso
video,sound,customuserinterfaces,andautoexecutingscripts,aswellasoffer
virtuallyunlimitedcustomizationoptions(seepage173forfurtherdetailson
spreadsheets).
Spreadsheet options.Mostspreadsheetfacilitiesareaccessibleviaoptionson
theDatatabandtheshortcutmenus(displayedbyrightclickinginthe
spreadsheet).Youcantrytheseoptionstoseehowtheywork,oryoucanreview
theirdescriptionsbypressingtheHelpkey(F1).Youcanchangeallaspectsofthe
displayformatsforeachspreadsheetcolumn,edittheoutput,orappendblank
casesandvariablestomakeroomfornotesoroutputpastedfromothersources.
Spreadsheetscanbeprintedinavarietyofways(bydefault,inpresentation
qualitytableswithgridlines).Also,sincespreadsheetsareusedforinput,youcan
easilyspecifyananalysisusingtheresultsfromapreviousanalysis(forexample,
youcouldusethiscorrelationmatrixtospecifyamultidimensionalscaling
analysis).Tousearesultsspreadsheetasaninputspreadsheet,selecttheInput
checkbox(locatedontheDatatabintheModegroup)whenthatspreadsheetis
active.
Analysis workbooks and other output options.Allresultscanbedisplayed(and
stored)instandalonewindows,reports,Worddocuments,orworkbooks,which
representthedefault(andperhapsthemostversatile)wayofhandlingoutput
fromanalyses(seepage148andpage169forfurtherdetailsonworkbooks).
DependingonyourselectionsintheOutputManager(accessiblebyselectingthe
HometabandclickingOptionsintheToolsgroup,andthenselectingOutput
Manager,locatedunderAnalyses/Graphs),resultscanbeputinasingle
workbookthatholdstheresultsfromallanalyses,aseparateanalysisworkbook
thatholdstheresults(spreadsheetsandgraphs)fromasingleanalysis,the
workbookthatcontainstheoriginaldatafile,orapreexistingworkbook.
Additionally,youcanchoosetohavetheresultssenttoaworkbookautomatically,
oryoucansendthemtotheworkbookyourselfbyclickingAddtoWorkbookon
theHometabintheOutputgrouptosendselectedstandalonespreadsheetsor
graphstoaworkbook.
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference23
Output Manager.Whichtypeofworkbookyouchoose,orwhetheryouchooseto
useaworkbook,dependsentirelyonhowyouprefertostoreyourdataand
results.Tochangetheoutputdestinationfortheresultsofaparticularanalysis
only,clickthe buttononanyanalysisorgraphspecificationdialog,and
selectOutputtodisplaytheAnalysis/GraphOutputManagerdialog.

Tochangeoutputoptionsforallanalyses,usethe(global)OutputManager(the
OutputManageroptionspaneoftheOptionsdialog,accessiblebyselectingthe
HometabandclickingOptionsintheToolsgroup),orselecttheUseglobalOutput
settings(changesherewillaffecttheglobalsettings)optionbuttoninthe
Analysis/GraphOutputManagerdialog.
Aswithallworkbooks,individualdocuments(e.g.,spreadsheetsorgraphs)or
groupsofdocumentscanbeprinted,extracted,copied,anddeletedfroman
analysisworkbook.SeetheoverviewofWorkbooksonpage169formoredetails;
seealsotheElectronicManual(STATISTICAHelp).
Copy vs. Copy with Headers.Contentsofspreadsheetscanbecopiedtothe
ClipboardbypressingCTRL+C(whichcopiesonlythecontentsoftheselectedblock).
Tocopytheblockalongwithitsrespectivevariableandcasenames,selecttheEdit
tab,andintheClipboard/Datagroup,clicktheCopyarrowandselectCopywith
Headersfromthedropdownmenu.Whenspreadsheetsarepastedintoaword
processordocument,theywillbeactive(inplaceeditable)STATISTICAobjects,
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


24STATISTICAQuickReference
standardRTFformattedtables,unformattedtext,pictures,orHTML(depending
onyourchoiceinthePasteSpecialdialogofthewordprocessor).
Printing spreadsheets.Toproduceahardcopyofanoutputspreadsheet,select
theHometab,andintheFilegroup,clickPrint(orpressCTRL+P)todisplaythePrint
Spreadsheetdialog,inwhichyouspecifyprintingoptions.Youcanalsousethe
shortcutmethodofclickingtheprintericon locatedontheQuickAccesstoolbar
intheupperleftcorneroftheribbonbar.Thisshortcutmethoddoesnotdisplay
thePrintSpreadsheetdialog,butprintstheentirecurrentdocument.Ifyouwant
toprintadocumentfromwithinaworkbook,ensurethatthedocumentisselected
intheworkbook,andselecttheSelectionoptionbuttoninthePrintSpreadsheet
dialog.Youcanalsoextractacopyofthedocumentfromtheworkbook(dragit
fromthetreepane,orselectthedocumentandclickMoveontheWorkbooktab
intheExtractgroup)andthenprintit.
Optional reports of all output.Workbooksofferperhapsthemostflexible
optionstomanageyouroutput(seepages148and169).Insomecircumstances,
however,itmaybeusefultoautomaticallyproducealogofallresults(contentsof
allspreadsheetsand/orgraphs)inatraditionalwordprocessorstylereportformat
wherecommentsandannotationscanbeinsertedinarbitrarylocations,objects
canbeplacedsidebyside,etc.(seepage151andpage180forfurtherdetails
onreports).

Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference25
UsetheoptionsintheOutputManagertocreatesuchareport.Todisplaythe
OutputManager,selecttheToolstab,clickOptions,andintheOptionsdialog,
selectOutputManagerlocatedunderAnalyses/Graphs(forglobalchanges).To
displaytheAnalysis/GraphOutputManagerdialog,clickthe

button

in
anyanalysisorgraphspecificationdialog,andselectOutput(forlocalchanges).
IntheOutputManageroptionspaneoftheOptionsdialogorinthe
Analysis/GraphOutputManagerdialog,clicktheReportOutputarrow.Fromthe
dropdownmenu,selecteitherSendtoMultipleReports(oneforeach
Analysis/Graph),SingleReport(commonforallAnalyses/graphs),or[SelectFile]
(whichwilldisplaytheOpendialogwhereyoucanselectanalreadyestablished
report).
IntheOutputManager,youcanalsospecifytheamountofsupplementary
informationtobeincludedwiththespreadsheetresults.UsetheSupplementary
detailoptiontospecifyeitherBrief(includesonlytheselectedspreadsheetsand
graphs),Medium(includestheselectedspreadsheetsandgraphsaswellasthe
currentdatafilename,informationoncaseselectionconditionsandcaseweights
ifanywerespecified,alistofallvariablesselectedforeachanalysis,andthe
missingdatavaluesforeachvariable),Long[includesallinformationfromthe
Mediumformatandthelongvariablelabels(e.g.,formulas),reservingonelineof
output(ormore)foreachvariable],orComprehensive(includesallinformation
includedintheLongreportformataswellasacompletelistofallofthetextlabels
foreachselectedvariable).
Interpretation of the results STATISTICA Electronic Manual (Help) and the
Electronic Statistics Textbook.Nowletsreturntotheexampleandthe
correlationmatrixthathasbeenproduced.

Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


26STATISTICAQuickReference
Eachofthecellsofthecorrelationmatrixrepresentsavalue(intherangeof1.00
to+1.00)thatreflectstherelationbetweenthevariables(seetherespective
variableandcaseheaders).Thehighertheabsolutevalueofthecorrelation
coefficient,theclosertherelation;ifthevalueispositive,therelationispositive
(highvaluesofonevariablecorrespondtohighvaluesoftheothervariable;
likewise,lowvaluesofonevariablecorrespondtolowvaluesoftheother
variable).Ifthevalueisnegative,theoppositeistrue(lowvaluesofonevariable
correspondtohighvaluesoftheothervariable).
Tolearnmoreabouthowtointerpretvaluesofcorrelations,youcanreviewa
comprehensive,illustrateddiscussionofthetopicintheElectronicManual
(STATISTICAHelp),whichfeaturesthecompletecontentsoftheStatSoftElectronic
StatisticsTextbook.TodisplaytheElectronicManual,selecttheHelptab,andin
theHelpgroup,clickHelp.OntheSearchtaboftheElectronicManual,enterthe
respectiveterm(e.g.,Correlations)intotheTypeintheword(s)tosearchforbox,
clicktheListTopicsbutton,andthenselectthedesiredtopicintheSelecttopic
box(inthiscase,CorrelationsIntroductoryOverview):

Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference27
AnothervaluablereferencetoolistheStatSoftElectronicStatisticsTextbook(an
awardwining,Webbasedgeneralresourceonstatisticsthathasbeen
recommendedbyEncyclopediaBritannicaforitsQuality,Accuracy,Presentation,
andUsability).

Toopenthetextbook,selecttheHelptab,andintheHelpgroup,clickElectronic
StatsTextbook.
Also,manytopicsinSTATISTICAHelpcontainalinktothetextbook.

Clickthelinkintheupperrightcornerofthetopictodisplaytherespectivepagein
theElectronicTextbook.
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


28STATISTICAQuickReference

Producing graphs from spreadsheets.Oneoftheimportant(andoften


overlooked)issuesdiscussedintheElectronicManualistheimportanceof
scatterplotsinexaminingcorrelations.Forexample,evenverylargeandhighly
statisticallysignificantcorrelationcoefficientscanbeentirelyduetooneunusual
datapoint(outlier),andifthatisthecase,thenthecorrelationcoefficient(even
ifstatisticallysignificant)wouldhavenovaluetous(i.e.,itwouldhaveno
predictivevalidity).Followingthisconcern,andtheadviceoftheElectronic
Manual,letsexamineascatterplotthatwillvisualizearelationbetweenthe
variablesand,thus,visualizeaparticularcorrelationcoefficientfromthetable.
Whileexaminingthespreadsheet,youcanviewthecorrelationsgraphically,for
example,tovisualizethecorrelationbetweenvariablesMeasure06and
Measure04.Toproduceascatterplotforthesetwovariables,rightclickonthe
respectivecorrelationcoefficient(0.162269).Intheresultingshortcutmenu,
selectGraphsofInputDataScatterplotbyMEASURE06Regression,95%conf.,
asshowninthenextimage.
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference29

Thespecifiedgraphwillbedisplayed.

Aswecanlearnfromthegraph,therearenounusualpatternsofdata,thus,there
isnoreasontobeconcernedaboutoutliers(seetheshortdiscussionofoutlierson
page28;seealsothetopiconoutliersintheElectronicManual).
Graph customization.Notethatnow,whenthefocusisonthegraphwindow,the
Edittabcontainsdifferentoptionsthanitdidforthespreadsheets.

Itcontainsavarietyofgraphcustomizationanddrawingtools.Manyofthese
optionsarealsoavailablefromshortcutmenusaccessedbyrightclickingon
specificpartsofthegraph.Notethattheoptionsonshortcutmenusare
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


30STATISTICAQuickReference
hierarchical,meaningthatthefirstoneortwooptionsapplyspecificallytothe
graphelementyouhaveselected,whileloweroptionswilldisplaydialogsthat
offermoreoptionsonagreatervarietyofgraphelementsrelatedtotheelement
youhaveselected.Ifyourightclickanywhereinthespaceoutsidethegraphaxes,
amenuofglobaloptionsisdisplayed(asshowninthenextimage).

Formoreinformationongraphcustomization,seepage190andtheElectronic
Manual.
Nowletsreturntothespreadsheet.
Split scrolling in spreadsheets.Spreadsheetscanbesplitintouptofoursections
(panes)bydraggingthesplitbox(thesmallrectangleatthetopofthevertical
scrollbarortotheleftofthehorizontalscrollbar).Thisisusefulifyouhavealarge
amountofinformationandyouwanttoreviewresultsfromdifferentpartsofthe
spreadsheet.Whenyoumovethemousepointertothesplitbox,themouse
pointerchangesto or .Now,topositionthesplit,dragittothedesired
position.

Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference31
Youcanchangethepositionofthesplitbydraggingthesplitbox(nowlocated
betweenpanes)toanewposition.

Notethatverticallysplitpanesscrolltogetherwhenyouscrollhorizontally;
horizontallysplitpanesscrolltogetherwhenyouscrollvertically.Forinformation
abouthighlightingblocksofdataacrosssplitpanesandaboutvariablespeed
highlightingofblocksofdata,seeHowcanIexpandablockinthespreadsheet
outsidethecurrentscreen?intheElectronicManual.
Drag-and-drop.STATISTICAsupportsthecompletesetofstandardspreadsheet
(MicrosoftExcelstyle)draganddropfacilities.Forexample,inordertomovea
block,pointtotheborderoftheselection(themousepointerchangestoan
arrow)anddragittothenewlocation.

Tocopyablockofdata,pointtotheborderoftheselection(themousepointer
changestoanarrow),anddragtheselectiontoanewlocationwhilepressingthe
CTRLkey.Notethatwhenyouaredraggingtheselection,aplussign(+)isdisplayed
nexttothemousepointertoindicateyouarecopyingthetextratherthanmoving
it(seethenextimage).
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


32STATISTICAQuickReference

Toinsertablockbetweencolumnsorrows,pointtotheborderoftheselection
(themousepointerchangestoanarrow)andthendragtheselectionwhile
pressingtheSHIFTkey.
Ifyoupointbetweenrows,aninsertionbarisdisplayedbetweentherows,and
whenyoureleasethemousebutton,theblockisinsertedbetweenthosetworows
[creatingnewcase(s)].Ifyoupointbetweencolumns,aninsertionbarisdisplayed
betweenthecolumns,andwhenyoureleasethemousebutton,theblockis
insertedbetweenthosetwocolumns[creatingnewvariable(s)].
NotethatifyoualsopresstheCTRLkeywhileyouaredraggingtheselection,the
blockwillbecopiedandinsertedinsteadofmovedandinserted;apluswillappear
nexttothemousepointer(asshowninthenextillustration).

Additionally,aseriesofvalueswithinablockcanbeextrapolated(AutoFilled)by
draggingtheFillHandle(thesmall,solidsquarelocatedonthelowerrightcorner
oftheblockborder).

Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference33
Electronic Manual.STATISTICAprovidesanElectronicManualwith
comprehensivedocumentationonallprogramproceduresandalloptions,
availableinacontextsensitivemanner(thereisatotalofmorethan100
megabytesofcompresseddocumentationincluded).Toaccessthemanual,select
theHelptabandclickHelpintheHelpgroup,orclickthe iconintheupperright
corneroftheribbonbar.Youcanalsopointto(highlight)amenucommandor
selectatabinadialogforwhichyouwantinformation,andpressF1onyour
keyboardtodisplaytherespectiveHelptopic,orclicktheHelpbutton thatison
thecaptionbarofalldialogs.
Duetoitsdynamichypertextorganization,organizationaltabs(Contents,Index,
Search,andFavorites),andvariousfacilitiesusedtocustomizetheHelpsystem,it
isfastertousetheElectronicManualthantolookforinformationinthetraditional
manuals.
Also,ToolTipsdisplayshortexplanationsofthecommandswhenthemouse
pointerhoversoverthem.
Statistical Advisor.AStatisticalAdvisorfacilityisbuiltintotheSTATISTICA
ElectronicManual.OntheHelptabintheHelpgroup,clickStatisticalAdvisorto
displayasetofsimplequestionsaboutthenatureoftheresearchproblemandthe
typeofyourdata.Clickontheappropriatelinkstoanswerthequestions,and
suggestionsforthestatisticalproceduresthatappearmostrelevantwillbe
displayed,containinglinkstoguideyoutothespecificproceduresinthe
STATISTICAsystem.

Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


34STATISTICAQuickReference
Directjumps(hypertextlinks)intheStatisticalAdvisortopicsguideyouto
correspondingIntroductoryOverviews,whichdiscussindetailtherespective
statisticalmethodsandprocedures.
Example 2: ANOVA
Calling the ANOVA module.Forthisexampleofa2x2(between)x3(repeated
measures)design,opentheAdstudy.stadatafile.Then,tostartthe
ANOVA/MANOVAanalysis,selecttheStatisticstab,andintheBasegroup,click
ANOVAtodisplaytheGeneralANOVA/MANOVAStartupPanel.

Thisdialogisusedtospecifyverysimpleanalyses(e.g.,viaOnewayANOVA
designswithonlyonebetweengroupfactor)andmorecomplexanalyses(e.g.,via
RepeatedmeasuresANOVAdesignswithbetweengroupfactorsandawithin
subjectfactor).
Design.SelectRepeatedmeasuresANOVAastheTypeofanalysisandQuick
specsdialogastheSpecificationmethod,andthenclicktheOKbuttoninthe
GeneralANOVA/MANOVAStartupPaneltodisplaytheANOVA/MANOVA
RepeatedMeasuresANOVAdialog.

Specifying the design (variables).Thefirst(betweengroup)factorisGender


(with2levels:MaleandFemale).Thesecond(betweengroup)factorisAdvert
(with2levels:PepsiandCoke).Thetwofactorsarecrossed,whichmeansthat
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference35
therearebothMaleandFemalesubjectsinthePepsiandCokegroups.Eachof
thosesubjectsrespondedtothreequestions(thisrepeatedmeasurefactorwillbe
calledResponse;ithasthreelevelsrepresentedbyvariablesMeasure01,
Measure02,andMeasure03).
ClicktheVariablesbutton(intheANOVA/MANOVARepeatedMeasuresANOVA
dialog)todisplaythevariableselectiondialog.SelectMeasure01through
Measure03asdependentvariables(fromtheDependentvariablelistfield)and
GenderandAdvertasfactors[fromtheCategoricalpredictors(factors)field].

ThenclicktheOKbuttontoreturntotheANOVA/MANOVARepeatedMeasures
ANOVAdialog.
The repeated measures design.Thedesignoftheexperimentthatwearegoing
toanalyzecanbesummarizedasfollows:

Between-Group Between-Group Repeated Measure Factor: Response
Factor #1:
Gender
Factor #2:
Advert
Level #1:
Measure01
Level #2:
Measure02
Level #3:
Measure03
Subject 1 Male Pepsi 9 1 6
Subject 2 Male Coke 6 7 1
Subject 3 Female Coke 9 8 2
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Specifying a repeated measures factor.Theminimumnecessaryselectionsare
nowcomplete,and,ifwedidnotwanttoselecttherepeatedmeasuresfactor,we
wouldbereadytoclicktheOKbuttonandseetheresultsoftheanalysis.However,
forourexample,weneedtospecifythatthethreedependentvariableswehave
selectedbeinterpretedasthreelevelsofarepeatedmeasures(withinsubject)
factor.Unlesswedoso,STATISTICAassumesthatthosearethreedifferent
dependentvariablesandrunsaMANOVA(i.e.,MultivariateANOVA).
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


36STATISTICAQuickReference
Inordertodefinethedesiredrepeatedmeasuresfactor,clicktheWithineffects
buttonontheQuicktabtodisplaytheSpecifywithinsubjectsfactordialog.

NotethatSTATISTICAhassuggestedtheselectionofonerepeatedmeasuresfactor
with3levels(defaultnameR1).Youcanspecifyonlyonewithinsubject(repeated
measures)factorviathisdialog.Tospecifymultiplewithinsubjectfactors,usethe
GeneralLinearModelsmodule(availableintheoptionalAdvanced
Linear/NonlinearModelspackage).PresstheF1keyonyourkeyboardwhilethe
Specifywithinsubjectsfactordialogisdisplayed(orclickthe

buttoninthe
upperrightcornerofthedialog)todisplaytheElectronicManualtopicthat
describesalloptionsinthisdialogandcontainslinkstocomprehensivediscussions
ofrepeatedmeasuresandexamplesofdesigns.
Forthisexample,editthenameforthefactor:intheFactorNamebox,changethe
defaultR1toRESPONSE,andclicktheOKbuttontoexitthedialog.
Codes (defining the levels) for between-group factors.Youdonotneedto
manuallyspecifycodesforbetweengroupfactors[i.e.,thereisnoneedtoinstruct
STATISTICAthatvariableGenderhastwolevels:1and2(orMaleandFemale)]
unlessyouwanttopreventSTATISTICAfromusing,bydefault,allcodes
encounteredintheselectedgroupingvariablesinthedatafile.Toentersuch
customcodeselection,clicktheFactorcodesbuttontoaccesstheSelectcodesfor
indep.vars(factors)dialog.

Beforeyoumakeyourselections,youcanusetheoptionsinthisdialogtoreview
valuesofindividualvariablesbyclickingtheZoombutton,scanthefile,andfillin
thecodesfields(e.g.,GenderandAdvert)foranindividualvariableorallvariables,
etc.Fornow,clicktheOKbuttonintheSelectcodesforindep.vars(factors)
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference37
dialog;STATISTICAautomaticallyfillsinthecodesfieldswithalldistinctivevalues
encounteredintheselectedvariables,

andclosesthedialog.
Performing the analysis.ClicktheOKbuttonintheANOVA/MANOVARepeated
MeasuresANOVAdialog.TheanalysisisperformedandtheANOVAResultsdialog
isdisplayed,whichcontainsvariousoutputspreadsheetsandgraphsoptions.

Thisdialogcontainsseveraltabsthatenableyoutoquicklylocatethedesired
resultsoptions.Forexample,ifyouwanttoperformplannedcomparisons,select
theCompstab.Toviewresidualstatistics,selecttheResidstab.Forthisexample,
wewillonlyusetheresultsoptionsavailableontheQuicktab.
Reviewing ANOVA results.LetsstartbylookingattheANOVAsummaryofall
effectstablebyclickingtheAlleffectsbutton(theonewiththeSUMMicon ).

Theonlyeffect(ignoringtheIntercept)inthisanalysisthatisstatisticallysignificant
(p=.007)istheRESPONSEeffect.Thisresultmaybecausedbymanypossible
patternsofmeansoftheRESPONSEeffect(formoreinformation,consultthe
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


38STATISTICAQuickReference
ANOVAIntroductoryOverviewintheElectronicManual).Wewillnowlookatthe
marginalmeansforthiseffectgraphicallytoseewhatitmeans.
TodisplaytheANOVAResultsdialogagain(thatis,resumetheanalysis),press
CTRL+RorclicktheANOVAResultsbuttonontheanalysisbar.Then,clicktheAll
effects/GraphsbuttontodisplaytheTableofAllEffectsdialogtoreviewthe
meansforindividualeffects.

Thisdialogcontainsasummarytableofalleffects(withmostoftheinformation
youhaveseeninthealleffectsspreadsheet)andisusedtoreviewindividual
effectsfromthattableintheformoftheplotsoftherespectivemeans(or,
optionally,spreadsheetsoftherespectivemeanvalues).
Plot of means for a main effect.IntheTableofAllEffectsdialog,doubleclickon
thesignificantmaineffectRESPONSE(theonemarkedwithanasteriskinthep
column)toproducetherespectiveplot.

Thegraphindicatesthatthereisacleardecreasingtrend;themeansforthe
consecutivethreequestionsaregraduallylower.Eventhoughthereareno
significantinteractionsinthisdesign(seethediscussionoftheTableofalleffects,
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference39
page37),wewilllookatthehighestorderinteractiontoexaminetheconsistency
ofthisstrongdecreasingtrendacrossthebetweengroupfactors.
Plot of means for a three-way interaction.Toseetheplotofthehighestorder
interaction,intheTableofAllEffectsdialog,doubleclickontherowmarked
RESPONSE*GENDER*ADVERT,representingtheinteractionbetweenfactors1
(Gender),2(Advert),and3(Response).Anintermediatedialog,Specifythe
arrangementofthefactorsintheplot,isdisplayed,whichisusedtocustomize
thedefaultarrangementoffactorsinthegraph(notethat,unlikethepreviousplot
ofasimplefactor,thecurrenteffectcanbevisualizedinavarietyof ways).

ClicktheOKbuttontoacceptthedefaultarrangementandproducetheplotof
means.

Asyoucansee,thispatternofmeans(splitbythelevelsofthebetweengroup
factors)doesnotindicateanysalientdeviationsfromtheoverallpatternrevealed
inthefirstplot(forthemaineffect,RESPONSE).Nowyoucancontinueto
interactivelyexamineothereffectsrunposthoccomparisons,planned
comparisons,extendeddiagnostics,etc.tofurtherexploretheresults.
Interactive data analysis in STATISTICA.Thisexampleillustratesthewayin
whichSTATISTICAsupportsinteractivedataanalysis.Youarenotforcedtospecify
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


40STATISTICAQuickReference
alloutputtobegeneratedbeforeseeinganyresults.Evensimpleanalysisdesigns
canproducelargeamountsofoutputandcountlessgraphs,butusuallyyoucannot
knowwhatwillbeofinterestuntilyouhaveachancetoreviewthebasicoutput.
WithSTATISTICA,youcanselectspecifictypesofoutput,interactivelyconduct
followuptests,andrunsupplementarywhatifanalysesafterthedataare
processedandbasicoutputreviewed.STATISTICAsflexiblecomputational
proceduresandwideselectionofoptionsusedtovisualizeanycombination
ofvaluesfromnumericaloutputoffercountlessmethodstoexploreyourdataand
verifyhypotheses.
Automating analyses (macros and STATISTICA Visual Basic).Anyselections
thatyoumakeinthecourseoftheinteractivedataanalysis(includingboth
specifyingthedesignsandchoosingtheoutputoptions)areautomatically
recordedintheindustrystandardVisualBasiccode.Youcansavesuchmacrosfor
repeateduse(youcanalsoassignthemtotoolbarbuttons,modifyoreditthem,
combinethemwithotherprograms,etc.).Formoreinformation,seeChapter8
STATISTICAVisualBasiconpage219ortheSTATISTICAVisualBasicPrimer.
Example 3: Variable Bundles
STATISTICAoffersauniqueoptionvariablebundlestolocateasubsetofdata
quicklyandeasilyinalargedatafile.Bundlescanbecreatedtoorganizelargesets
ofvariablesandtofacilitatetherepeatedselectionofthesamesetofvariables.
OpenEnginePerformance.sta.Thisdatasetdescribestheperformanceoflarge
enginesandcontainsvariousprocessparametersrecordedduringtheir
manufacture.Itincludes128engines;theirEfficiency,FuelEconomy,andPoweras
measuredduringtesting;and74processparameterscollectedduringthe
manufactureofeachengine.
Forthisexample,wewillproceedwiththepremisethatweoftenneedtogenerate
analysesinwhichthesamesetofvariablesisrepeatedlyused.
SelecttheDatatab,andintheVariablesgroup,clickBundlestodisplaythe
VariableBundleManagerdialog.
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference41

ClicktheNewbuttontodisplaytheNewBundledialog,

enterthenameProductionintheBundlenamefield,andclicktheOKbutton.The
Selectvariablesforbundledialogisdisplayed,whichcontainsallthevariablesin
theEnginePerformance.stadataset.

Forouranalyses,weneedtoselectthevariablesInput01Input05,Input20,
Input30Input35,andInput70.Youcanselectthesevariablesusingthestandard
WindowsSHIFT+clickandCTRL+clickconventionstoselectrangesanddiscontinuous
listsofitems,respectively.
ClicktheOKbuttontoclosetheSelectvariablesforbundledialogandreturnto
theVariableBundleManager.
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


42STATISTICAQuickReference

Theleftpaneofthisdialogdisplaysthenamesofallbundlesthathavebeen
definedforthisspreadsheet(youcancreatenumerousbundlesineach
spreadsheetifneeded).Therightpanedisplaysthecontentsofthebundlethatis
currentlyselectedintheleftpane.Ifbothofthesepanesareempty,nobundles
havebeencreatedforthisspreadsheet.
YoucanmakechangestoabundlebyclickingtheEditbutton,discardabundleby
clickingtheDeletebutton,changethetitleofabundlebyclickingtheRename
button,andproduceaspreadsheetcontaininginformationaboutthebundlesfor
theactivedataspreadsheetbyclickingtheOutputtoSpreadsheetbutton.
Forthisexample,clicktheOKbuttontoacceptthebundlewecreatedandclose
theVariableBundleManagerdialog.
Then,selecttheStatisticstab,andintheBasegroup,clickMultipleRegressionto
displaytheMultipleLinearRegressionStartupPanel.OntheQuicktab,clickthe
Variablesbuttontodisplaythevariablespecificationdialog.
Bundlesaredisplayedinbracketsandlisted(inalphabeticalorder)atthetopof
thevariablelist.IntheIndependentvariablelist,selecttheProductionbundleto
specifywithoneclickofthemousebuttonInput01Input05,Input20,Input30
Input35,andInput70astheindependentvariablesfortheanalysis.

Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference43
Ifyouarentsurewhatvariablesareincludedinabundle,movethemousepointer
overthebundlenameinthevariableselectiondialog,andaToolTipwilldisplay
thevariablenumbers.

Additionally,youcanviewthelistofvariables(byname)byclickingthe[Bundles]
buttoninthevariablespecificationdialog.ThisdisplaystheVariableBundles
Manager.
Notethatbundlesaredefinedforasinglespreadsheet,andtheyareonlyusedfor
variableselection.Hence,theyareneverlistedinreportsorotheroutput.
Asyoucanseewiththisexample,youwillsaveconsiderabletimebyselectinga
bundleratherthanlookingforthecorrectvariablestochooseinalargedataset.
Example 4: By-Group Analyses
STATISTICAoffersapowerfuloptiontoturneverystatisticalorgraphicsanalysis
intoananalysisbygroup.Whenreviewingresultsintheresultsdialogofpractically
anyanalysis,orusingthegraphsoptions,youcanselectoneormoregrouping
variables,andthencreateresults1)forallcasesinthedatacombined,and/or
2)brokendownbyeachcombinationofuniquevaluesinthegroupingvariables.
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


44STATISTICAQuickReference

Thisisaverypowerfultoolforinteractiveandexploratorydataanalysis,allowing
youtoreviewquicklywhetheranypatternsorspecificresultsholdinall
subgroups,samples,orstratainyourdata.
Forexample,youmaybeperformingamultipleregressionanalysisanddecideto
review,withoutexitingthecurrentdialog,theresultsbrokendownbyGenderand
anothergroupingvariableinyourdata.Afterselecting(enabling)thisoption(by
clickingthe ByGroupbutton),everytimeyouclickanyoftheresultsbuttons
(e.g.,tocreateasummaryresultsspreadsheetorgraph),allresultsarecomputed
notonlyforallgroups(optionally),butalsoforeachuniquecombinationof
groupingvariablesthatwerespecified(e.g.,byGenderandanothergrouping
variable).
TheresultsoftheByGroupanalysiscanbeplacedeitherinthedefaultresults
workbookintotheirownfolder,labeledwiththerespectivebygroupcondition
(e.g.,Gender=Female;Time=After1),orintothesamefolderwithallotherresults.
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference45

Forexample,youcouldcreatemultiplelineplotstodescribeamultivariatebatch
process,creatingaseparategraph(trajectories)foreachbatch.
Exploring Experimental Data Using
the By Group Option
ThisexampleisbasedonthedatafileTomatoes.sta,whichisoneoftheexample
datafilesdescribedingreaterdetailintheExperimentalDesignsectionofthe
STATISTICAElectronicManual(seetheexampleDesigningandAnalyzinga2
3
3
2

Experiment).ConnorandYoung(inMcLeanandAnderson,1984)reportan
experiment(takenfromYoudenandZimmerman,1936)onvariousmethodsof
producingtomatoplantseedlingspriortotransplantinginthefield.
StartbyopeningtheexampleTomatoes.stadataset.SelecttheHometab.Inthe
Filegroup,clicktheOpenarrowandselectOpenExamplesfromthedropdown
menutodisplaytheOpenaSTATISTICADataFiledialog.Doubleclickonthe
Datasetsfolder,andthenselectandopentheSTATISTICAdatasetTomatoes.sta.
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


46STATISTICAQuickReference

Shownhereareafewrows(cases)ofthatdatafile.Youcanrefertothe
ExperimentalDesignElectronicHelpexampletopicforacompleteanalysisofthese
data.
Exploring Patterns by Variety
Thisexampleillustratesatypicalworkflowasitoftenappliestotheanalysisof
discreteorbatchmanufacturingdata,i.e.,thegoaloftheanalysisistoverify
(graphicallyoranalytically)thatsomepatternsordistributionsequallyapplytoall
samples,parts,orbatches.
WewillexploretheeffectofProductionMethod,SoilCondition,andPotsizeon
yield(Pounds),andevaluatewhetheranypatternsholdforeachVarietyinthe
study.Insteadofperformingacompleteanalysisofvariance(asisdescribedinthe
ExperimentalDesignexampleoftheElectronicHelp),wewillusemostlygraphical
methodsandvisualinspection.
Specifying variability plots.SelecttheGraphstab.IntheMoregroup,click2D,
andfromthedropdownmenu,selectVariabilityPlotstodisplaytheVariability
Plotdialog.ClicktheVariablesbutton,andintheSelectVariablesforVariability
Plotdialog,selectPOUNDSastheDependentvariable,andSOILCONDITION,
POTSIZE,andPRODUCTIONMETHODfromtheGroupingvariablelist.

Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference47
Furtheronintheexample,wewillcreatethegraphbyVARIETYtoillustratetheBy
Groupfeatures.Now,clicktheOKbuttoninthevariableselectiondialog.
Reordering variables for variability plot.Forthemostinformativeplot,lets
reorderthevariablessothatPRODUCTIONMETHODwillbethefirstfactorinthe
listofFactors.ClickonthatvariableintheFactorslist,andthen,whilepressingthe
leftmousebutton,dragittothetopofthelist.

Finally,alsointheVariabilityPlotdialog,ensurethatPRODUCTIONMETHODis
selectedintheFactorslist,andselectthePutboxesaroundgroupscheckbox.
Specifying by grouping.WewanttocreatethevariabilityplotforPRODUCTION
METHOD,SOILCONDITION,andPOTSIZEforallvarietiesoftomatoescombined,
andbrokendownbyVARIETY(onegraphperVARIETY).ClicktheByGroupbutton
todisplaytheByGroupdialog.

ClicktheGroupingVariable(s)buttontodisplaytheSelectByVariablesdialog,
andspecifyVARIETYastheByGroupvariable.
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


48STATISTICAQuickReference

NotethatyoucanspecifymorethanoneByGroupvariable,inwhichcaseall
subsequentanalyseswillbeperformedbrokendownbyeachuniquecombination
ofvaluesfoundintheByGroupvariables.
Reviewing the variability plots.NowclickOKtoclosetheSelectByVariables
dialog,andclickOKtoclosetheByGroupdialog.IntheVariabilityPlotdialog,
clickOKtocreatethegraphs.

NoticehowtheVariabilityPlotiscreated1)forAllGroups,and2)foreachVariety
(BonnyandMarglobe).
Ifyoureviewthesegraphscarefully,youwillseethattheProductionMethod
appearstomakelittledifference(intheobservedvaluesforPounds)for
Variety=Bonny,whileforVariety=Marglobe,theFibrePlmethodshowstheleast
variabilityinvalues,whicharegenerallyatthehigherendofthedistributionofall
valuesforvariablePounds.
Descriptive Statistics By Group
Letsnextusethedescriptivestatisticsoptionstofurtherexplorethis.Selectthe
Statisticstab.IntheBasegroup,clickBasicStatisticstodisplaytheBasicStatistics
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference49
andTablesStartupPanel.SelectBreakdown&onewayANOVA,andclicktheOK
buttontodisplaytheStatisticsbyGroups(Breakdown)dialog.ClicktheVariables
button,andintheSelectthedependentvariablesandgroupingvariablesdialog,
specifyPoundsastheDependentvariableandProductionMethodastheGrouping
variable.ThenclickOKtoclosethevariableselectiondialog,andclickOKinthe
StatisticsbyGroups(Breakdown)dialogtodisplaytheStatisticsbyGroups
Resultsdialog.
WewanttocomputeStatisticsbyGroups,brokendownfurtherbytomatoVariety.
So,clicktheByGroupbutton,andintheByGroupdialog,clicktheGrouping
Variable(s)button.IntheSelectByVariablesdialog,selectVarietyastheBy
Groupvariable.

Now,clickOKinthisdialogandclickOKintheByGroupdialog.IntheStatisticsby
GroupsResultsdialog,clickinsequence,1)theSummarybutton,2)theAnalysis
ofVariancebutton,and3)theInteractionplotsbutton.
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


50STATISTICAQuickReference

Allresultsareplacedintotherespectivefolder,eithertheAllGroupsfolderorthe
Variety=BonnyorVariety=Marglobefolders.
Youcannowreviewtheseresultsforallgroupscombinedandbrokendownby
Variety;asyouwillsee,indeed,ProductionMethodappearstohaveaneffecton
yield(Pounds)forVariety=Marglobe,whilethereisnoindicationofsuchaneffect
forVariety=Bonny.
Summary
WithSTATISTICA,youcanperformadhocbygroupanalysesfromvirtuallyany
resultsdialog,reviewingresultsforallgroupscombinedorbrokendownbyoneor
moregroupingvariable.Thisverypowerfulfeatureforexploratorydataanalysis
canbeusedtocomparegroupsandverifyconsistencyofresultsacrossgroupsfor
anyanalysis.
Beforeconcludingthistopic,afewcommentsaboutthetechnicaldetailsregarding
theimplementationofthisfeaturemaybeuseful.Whenperformingbygroup
analyses,asillustratedinthisexample,theprogramwillactuallyrerunthe
analysesforeachgroup(andallgroups),leveragingtheSTATISTICAVisualBasic
macrocodethatisrecordedautomaticallyduringtheinteractiveanalyses,and
whichcanbesavedasmacrosasdescribedelsewhereinthismanual(seeChapter
8STATISTICAVisualBasic).Whenanalyzingverylargedataproblems(e.g.,very
largeunbalancedexperimentaldesignsorcomplexanalysesthatrequireiterated
computationsbeforeresultscanbedisplayed),theindividualanalysesmaytakeup
significantamountsofcomputingtime,inparticularwhentherearemanyunique
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference51
groupsidentifiedinthedata(e.g.,imagineacomplexgeneralizedlinearmodel
estimatedforeachof100groups).
Therefore,itisgenerallyagoodideatobegineachexploratoryanalysisby
computingsimpledescriptivestatistics,frequencytables,andgraphsto
understandthestructureofthedataandidentifythenumberofuniquegroups
(combinationofvaluesinthegroupingvariables)inthedata.
Example 5: Summary Results Panels
(Quality, Process, GageSixpacks)
SeveralanalysesinSTATISTICAsupportsummarygraphsandreportsarrangedinto
asingle(graphics)document.InSixSigmaandmanufacturingapplications,these
typesofdisplaysaresometimesreferredtoasQualitySixpacksbecausethey
summarizethequalityofasinglevariablewithsix(orfewer)individualgraphs
andtables.

STATISTICAincorporatesmanysuchdisplaystosummarizebasicdescriptive
statistics,correlations,theresultsofgageorprocesscapabilitystudies,orother
typesofdataanalyses,asshowninthefollowingillustration.
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


52STATISTICAQuickReference

Process Capability Analysis Consistent


with DIN 55319 and ISO 21747
Inrecentyears,European(andotherinternational)manufacturershavedeveloped
standardsforthecomputationofprocesscapabilityindicesthatwillexplicitly
accountforsystematicandrandomprocessvariationovertime,aswellasnon
normaldistributions.Theseindiceshave,forexample,beenadoptedthroughout
theautomanufacturingindustryandtheirsuppliers,andSTATISTICAfullysupports
thesestandards.
Processcapabilityindicesmeasurethenumberoftimesthattheobserved
(normal)distributionofvaluescanfitinsidethespecificationlimitsforthe
respectivepartunderconsideration.Thus,theseindicessummarizethequalityofa
processtoproduceproductsorpartsthatareconsistentwithdesignspecifications.
Inshort,DIN(DeutscheIndustrieNorm)55319andISO21747describetherulesto
applywhenchoosingamongvariousdistributionmodelsandhowtoaccountfor
timedependentvariationintheprocess.
Forexample,evenifadistributionofdatapointswithineachsampleisNormal,if
thereissystematicorrandomvariationthatoccursovertimeassuccessive
samplesaretaken,theresultantdistributionofvalueswillnotbeNormal.
Therefore,inmanycasesthenormaldistributionbasedprocesscapability
computationswillnotbeapplicable.Also,itisusuallyofinteresttoidentifyany
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference53
timedependentvariabilityortrendsbecausetheycanindicatemachinewearor
otherprocessproblems.
Thefollowingexampleillustratesstepbystephowtocomputeprocesscapability
indicesconsistentwiththeseinternationalstandards,andhowtocreatean
efficientsingledocumentsummaryreport.
Select data.ThisexampleisbasedonadatasetreportedinMontgomery(1985,
page177,1991,page234).WellusethedatafilePistons.stathatislocatedin
STATISTICAsexamplesdirectory.Specifically,weareinterestedinmonitoringthe
size(diameter)ofpistonringsforautomotiveengines.Therefore,constant
samplesoffiveobservationseachhavebeentakenfromtheongoing
manufacturingprocess.Asisthecaseinmanyongoingmanufacturingprocesses,
samplesaretakenovertime,soanyvariabilityintheprocessqualityovertimewill
affecttheoverallvariability.
OntheHometab,clicktheOpenarrow,andfromthedropdownmenu,select
OpenExamplestodisplaytheOpenaSTATISTICADataFiledialog;openthe
Datasetsfolder,anddoubleclickonPistons.staorselectitandclicktheOpen
button.
Specify analysis.SelecttheStatisticstab.IntheIndustrialStatisticsgroup,click
ProcessAnalysis.IntheProcessAnalysisProceduresStartupPanel,selectProcess
CapabilityISO/DIN(Timedependentdistributionmodel).

ClicktheOKbuttonintheProcessAnalysisProceduresStartupPanel.Onthe
QuicktaboftheISO21747ProcessCapabilitySetupdialog,clicktheVariables
button.IntheSelectVariables(andoptionalgroupingvariable)dialog,select
variableSizeintheVariablesfortheanalyseslist,andSampleintheby...
(Time/Groupingvar.)list,andclickOK.
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


54STATISTICAQuickReference
IntheISO21747ProcessCapabilitySetupdialog,clicktheProcessspecsbutton
todisplaytheEnter/editspecificationlimitsdialog,whereyoucanenterthe
processspecificationlimits.Specificationordesignlimitsdefinethemaximumand
(or)minimumallowablevaluesfortherespectivepart;inthiscase,specifythe
lowerandupperspeclimits(LSL,USL)as74+/0.05(LSL=73.95,USL=74.05).Enter
74intheNominalfield,andenter0.05intheDeltafield.

ClickOKtofinalizethischoiceandreturntotheISO21747ProcessCapability
Setupdialog.

Inthisdialog,therearenumerousotheroptionsavailabletomodifytherulesthat
areappliedtoselectthemostappropriatedistributionandtimedependent
distributionmodelforthedatasothattheappropriateprocesscapabilityindices
canbecomputed.Youcanclickthe buttonintheupperrightcornerofthe
dialogorpressF1todisplaytheSTATISTICAElectronicHelptopiccontainingspecific
detailsregardingalloptionsinthisdialog.Forexample,thedetailsregardingthe
(small)differencesintheDINandISOspecificationsarediscussedthere.
NowclicktheOKbuttonintheISO21747ProcessCapabilitySetupdialogto
performtheanalysesforvariableSize.
Reviewing results.IntheISO21747ProcessCapabilityResultsdialog,clickthe
Summarybuttontoreviewtheanalysissummarydisplay.
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference55

Asyoucansee,allrelevantdetails(asrecommendedinISO21747and/orDIN
55319)aresummarizedonasinglepage(document),whichcontainsall
informationnecessarytojudgetheprocessascapableornotcapable(or
questionable).
Attribute Gage Analysis
Foranotherexampleofthistypeofsummary(compound)displaysinSTATISTICA,
wewillperformanattributegageanalysis.
Ingeneral,anymeasurementsystemusedinmanufacturingmustbevalidatedto
ensurethattherespectivegagesmeasurethequalitycharacteristicofinterestwith
sufficientaccuracyandprecision.Often,agageofparticularimportanceistheone
thatdetermineswhetheramanufacturedpartisofsufficientqualitytobe
acceptedorrejected;inotherwords,thegagemeasuresasimpleaccept/reject
attribute.
Todeterminethequalityofthegage,astudyisperiodicallyperformedwherethe
gage(accept/rejectdecision)isappliedtoreferencepartswithknowndeviations
fromthedesiredspecifications.Thisprocessisdescribedintherespectivesection
oftheSTATISTICAElectronicManual,aswellastheAIAG(AutomotiveIndustry
ActionGroup)MeasurementSystemAnalysis(MSA)manual(2000).
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


56STATISTICAQuickReference
ThisexampleillustratestheanalysisdescribedintheMSAmanualonpages8186.
Select data.OpentheAttributeGageStudy.stadatafile.Thisfilecontainsthedata,
alreadysummarizedtoacceptancedata,oftheattributegagestudydescribedin
theMSAmanual,(p.84)
Specify analysis.SelecttheStatisticstab.IntheIndustrialStatisticsgroup,click
ProcessAnalysis.IntheProcessAnalysisProceduresStartupPanel,select
Attributegagestudy(Analyticmethod),andclicktheOKbutton.
IntheAttributegagestudy(Analyticmethod)dialog,clicktheVariablesbutton.
SelectPart#inthePartnumberslist,ReferenceintheReferencevalueslist,and
AcceptanceintheAcceptance/Responselist,andthenclicktheOKbuttontoclose
thisdialogandreturntotheAttributegagestudy(Analyticmethods)dialog.In
theTolerancelimitforcalculationgroup,specify0.01astheLowerlimit,select
theDisplaytheotherlimitcheckbox,andthenspecify0.01asthatlimit.

Weareinterestedinevaluatingthegageperformanceforaprocessortypeof
manufacturedpartthatshouldbeidentifiedasunacceptable(shouldberejected),
whenitsreallowerlimitdropsbelow0.01(expressedhereasadeviationfromthe
spec).Inthedatafile,theAcceptanceprobabilitiessummarizethenumberof
referencepartsmeasurements,fromatotalof20suchpartsandmeasurements
each,thatweredeclaredasunacceptable(i.e.,thatwererejected).
Reviewing results.NowclickOKintheAttributegagestudy(Analyticmethods)
dialog.IntheResultsdialog,clicktheSummarybuttontoreviewthesummary
results.
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference57

Allimportantresultstodeterminethebiasandrepeatability(ofmeasurements)of
theattributegagearesummarizedonasinglepage.Fordetailsonthe
interpretationofthereportedstatisticsandgraphs,refertotheElectronicManual.
Example 6: STATISTICA Data Miner
STATISTICADataMiner(SDM)isacomprehensivesystemforpredictivemodeling
thatoffersawidevarietyofanalytictechniquesandmodelbuilding,validation,
andmodeldeploymentoptions.Thedefault,andperhapstheindustrystandard,
typeofuserinterfaceprovidedinSDMfollowsthegeneralinteractivedatamining
workspaceapproachthatenablesuserstobuildmodelsbydraggingicons
representingstepsofdataacquisition,datapreparation,modeling,and
deploymentandconnectthemwitharrows.Theworkspaceuserinterfaceoption
inSDMrepresentsapowerfulalternativetothetraditionalinteractivedata
analysisuserinterface,anditcanbeusednotonlyasatoolfordevelopingand
testingpredictivedataminingmodes,butalsoasapowerfulgeneraltooltobe
usedforvisualprogrammingofanalyticworkflowsformanytypesofanalyses.

Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


58STATISTICAQuickReference
Toopenanew(blank)dataminingworkspace,selecttheDataMiningtab.Inthe
Toolsgroup,clickWorkspacesandfromthemenu,selecteitherNewWorkspace
MyProceduresorNewWorkspaceAllProcedures.

Ablankdataminingworkspacewillbedisplayed.

Now,click onthetoolbartodisplaytheSelectDataSourcedialog,
usedtoselectadatafileforanalysis.Next,theSelectdependentvariablesand
predictorsdialogisdisplayed;clickthe buttontodisplaythevariable
selectiondialog,usedtospecifythedependentvariablesandpredictors.Then,
click tocreateanalyticnodes,andconnectthemwith arrows
tospecifythedesiredprojectworkflow.
ThefollowingsectionincludesastepbystepexampleofDataMinerRecipesan
innovativeuserinterfacefordataminingintroducedbyStatSoftwhichoffersa
powerfulalternativetotheworkspacebasedapproachtomodelbuilding,andcan
beusedbybothnovicesandadvancedanalysts.
Overview
ThisexamplepertainstoSTATISTICADataMinerRecipes,aStatSoftproductthat
offersawideselectionofmethodsforpredictivedatamining.
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference59
Ageneraltrendindataminingistheincreasingemphasisonsolutionsbasedon
simpleanalyticprocessesratherthanthecreationofevermoresophisticated
generalanalytictools.STATISTICADataMinerRecipes(SDMR)offersaneasyto
usealternativetothetraditionaldataminerworkspaceuserinterfaceforbuilding
predictivedataminingmodels.Thisapproachprovidesanintuitivegraphical
interfacetoenablethosewithlimiteddataminingexperiencetoexecutearecipe
likestepbystepanalyticprocess.Withtheseintuitivedialogs,youcanperform
variousdataminingtaskssuchasregression,classification,andclustering.Other
recipescanbebuiltquicklyascustomsolutions.
Completedrecipescanbesavedanddeployedasprojectfilestoscorenewdata.
TheprojectfilescanbegeneratedasC/C++languageorPMMLscript,orsentto
STATISTICAEnterprise.
TheSDMRuserinterfacecanalsobeusedbyadvancedanalyststoautomateand
storespecificdataminingalgorithms.
SDMRspanstheentiredataminingprocessfromqueryingexternaldatabasesto
thefinaldeploymentofsolutionsand,ingeneral,consistsofthefollowingsteps.
1.Identifiesthedatafromwhichtolearn
ConnectstoODBCorOLEDBcompliantdatabases
ConnectstoSTATISTICAdatafiles
2.Cleansdataandremovestheredundantpredictors
Flexibleandefficientmethodsforsamplingthedata(simple,stratified,
systematic,etc.)
Moreflexiblewaystoidentifyandrecodethemissingdata
Identificationofoutliers
Transformthedatapriortoperformingthesubsequentsteps
Identifyandeliminateredundantpredictors
3.Identifiesimportantpredictorsfromalargepoolofpredictorsthatarestrongly
relatedtothedependent(outcomeortarget)variableofinterest
Featureselectionforverylargedatasets(e.g.,thousandsofvariables)
Detectionofimportantinteractionsamongthepredictorsbyusingtree
basedmethods
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


60STATISTICAQuickReference
4.Generatesapoolofeligiblemodels
Leveragethecomprehensiveselectionofcuttingedgetechniquesfor
predictivedataminingavailableinSDMR
OffloadcomputationallyexpensivetaskstoSTATISTICAEnterpriseServer,
freeingyourlocalcomputerforothertasks
5.Performsautomaticcompetitiveevaluationofmodelstoidentifytheoptimum
modelwithrespecttoperformanceandcomplexity
6.Deploysthemodeltoscorenewdatausingtheinbuiltefficientdeployment
engine
STATISTICADataMinerRecipesprovidesthesolutionthatmapsthestepsofthe
dataminingworkflowintoaresultsorienteduserinterface.Fromdatacleaningto
modelvalidation,SDMRguidesyouranalysisfromstarttofinishsothatyoucan
getactionableresultsandanswersquickly.Atthesametime,SDMRstillapplies
themostcomprehensivecollectionofdataminingalgorithmsinasinglepackage
withoutrequiringtheusertoknowthedetailsofthosealgorithms.
STATISTICADataMinerRecipescontainsthelargestcollectionofdatamining
methodsandalgorithmsinasinglepackageorlibrary.Inmostgeneralterms,
thesealgorithmsborrowinsightsandmethodologiesfromvariousdomainssuchas
statistics,engineering,artificialintelligence,cognitivescience,etc.,tolearn
patternsfromdatathatcanbeusedtomakepredictions(aboutinsuranceorcredit
risk,processorproductquality,equipmentfailure,medicaldiagnoses,andsoon).
TheSTATISTICAElectronicManualandtheonlineElectronicStatisticsTextbook
providedetailedintroductionstothevariousmethodsandtechniquesthatare
usuallysummarilydescribedasdatamining.
Inpractice,specificdomainsandtypesofdataarebestanalyzedusingparticular
typesofmethodsandalgorithms.Forexample,thedataminingtechniquesthat
workbestformodelinginsurancelossdataaredifferentfromthosethatworkbest
forpredictingemissionsfromafurnace.However,thereisatypicalworkflow
fromthedefinitionofthedataandanalysisproblemthroughsampling,model
building,andevaluationthatisapplicabletoallpredictivedatamining.
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference61
DataMinerRecipesenablethosewithoutextensiveexperiencewithdatamining
toolstomoveveryquicklyfromthedefinitionofaproblemtotangibleand
actionableresults.

Inthisapproach,yousimplyfollowarecipelikeuserinterfacetocompletethe
necessarystepstomovetoasolution.Infact,mostofthesestepsareentirely
automatedsothattheonlyrequiredinputistodefinethedataandvariablesfor
theanalyses,whiletheprogramautomaticallydoestherestdetermineslearning
andtestingsamples,performsfeatureselection,triesvariousdatamining
algorithmsandmethods,andevaluatesresultstoselectthebestdatamining
model.Thesecomputationsandanalysescanbeperformedwitheitherthe
desktopSTATISTICADataMinersoftwareor,ifavailable,ontheSTATISTICAData
MinerServer.
Data Miner Recipes Project Files
WhenyousaveaDataMinerRecipesprojectatanystageofcompletion,two
separatefilesarecreated:
ADataMinerRecipesfilewiththefilenameextension.dmrproj
ASTATISTICAWorkbookfilebythesamename,butwiththefilename
extension.stw,containingresultsanddetailedinformationforeachstepof
therecipe
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


62STATISTICAQuickReference
Itisimportantthatbothfilesresideinthesamefiledirectory.So,ifyouwantto
copyaDataMinerRecipeprojectcalledMyDataMinerProjecttoanewfile
directory,emailittoacolleague,orcheckitintotheSTATISTICADocument
ManagementSystem,thenbothfilesMyDataMinerProject.dmrprojand
MyDataMinerProject.stwmustbecopiedtothenewdestination.
Followingareadditionaldetailsaboutthesetwofiles.
Data Miner Recipes file (.dmrproj).TheDataMinerRecipesareXML(extensible
markuplanguage)formatfilesthatcontainallinformationregardinguserschoices
(orchoicesautomaticallymadebytheprogram),including:
Datafileinformation(ordataconnectioninformation)
Variableselectionsandvariablemetadata(e.g.,definingcontinuousand
categoricalpredictorsandoutcomes)
Choicesaboutdatapreprocessingsteps(e.g.,missingdatahandling,
filteringofduplicaterecords,transformations,etc.)
Finalvariableselectionsbasedontheapplicationoffeatureselection
algorithms
Resultsfrommodelbuildingandfinalevaluationandchoicesofmodels
Allinformationnecessarytodeploypredictivemodelsandtopredictnew
cases(e.g.,toscoredatabases,computecomponentscores,inferredsensor
values,predictedriskorfailureprobabilities,etc.)
Therefore,whendeployingDataMinerRecipestotheSTATISTICAEnterprise
softwaretoautomaticallycomputepredictedvaluesinanenterpriseapplication
(automatedcreditscoring,multivariatecontrolchartingandfailureanalysis,etc.),
allinformationnecessarytocomputepredictedvalues,classifications,or
classificationprobabilities(e.g.,probabilityofdefault,loss)iscontainedinside
theseXMLformatfiles.
Data Miner Recipes Workbook file (.stw).Thesefilescontaindetailed
informationdescribingtheresultsforeachstep.
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference63

Theresultsstoredinthisworkbookprovidecompletedocumentationforthe
computationsandanalysesperformedastheDataMinerRecipewas(orisinthe
processofbeing)completed.Therefore,ifthedatamininganalysesareperformed
inaregulated(e.g.,FDA,ISO,etc.)environment,orifdataminingispartofan
organizationsmissioncriticalactivitiesperformedundertheguidanceandin
compliancewithspecificstandardoperatingprocedures(SOPs),thenitisusually
recommendedthatthisfilebestoredintheSTATISTICADocumentManagement
SystemalongwiththeDataMinerRecipeprojectfile(.dmrproj).
Using STATISTICA Data Miner
Recipes (SDMR)
Thisexampleillustrateshowquicklyandefficientlydataminingprojectscanbe
completedusingSTATISTICADataMinerRecipes,evenifthebestsolutiontothe
(prediction)problememergesonlyafter(automatically)comparingtheefficacyof
variousadvanceddataminingalgorithms.
Inthisexample,wewillexploretheuseofSDMRforcreditscoringapplications.
TheexampleisbasedonthedatafileCreditScoring.sta,whichcontains
observationson18variablesfor1,000pastapplicantsforcredit.Eachapplicant
wasratedasgoodcredit(700cases)orbadcredit(300cases).Wewantto
developacreditscoringmodelthatcanbeusedtodetermineifanewapplicantis
agoodcreditriskorabadcreditrisk,basedonthevaluesofoneormoreofthe
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


64STATISTICAQuickReference
predictorvariables.AnadditionalTrain/Testindicatorvariableisalsoincludedin
thedatafileforvalidationpurposes.
InSTATISTICA,selecttheDataMiningtab.IntheRecipesgroup,clickDataMiner
RecipestodisplaytheDataminerrecipesdialog.OntheRecipestab,clicktheNew
buttontocreateanewproject.TheStepstabwillbeselectedautomatically.

ThestepnodepanelislocatedintheupperleftareaoftheStepstab.Itcontains
fourmajornodes:Datapreparation,Dataforanalysis,Dataredundancy,and
Targetvariable.
Nodes (steps).Eachnode(orstep)canexistinoneoffourstates,dependingon
whetherallrequiredoptionshavebeenspecified.Eachstateisrepresentedbyan
icon:ared

indicatesawaitstate,meaningastepcannotbestartedbecauseitis
dependentonapreviousstepthathasnotbeencompleted;ayellow indicatesa
readystate,meaningyouarereadytostartthestepbecausepreviousstepshave
beencompleted;agreen indicatesacompletedstep.Notethatyoumustclick
theNextstepbuttontochangetheyellow (readystate)tothegreen
(completedstate).Thechangewillbemadeonlyifthestephasbeensuccessfully
completed(i.e.,allrequiredinformationhasbeenspecified).Lastly,ifyouhave
openedadatasetandselectedvariables,andyoudonotwanttoproceedstepby
stepthroughalltheoptions,youcanselecttheConfigureallstepscheckboxon
theStepstab.Thestepswillnowberepresentedbyanavy icon.Youcanselect
anyofthestepsandmodifytheoptions,oryoucanleavealloptionsattheir
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference65
defaults.Then,clicktheNextsteparrow,andfromthedropdownlist,selectRun
tocompletion.STATISTICADataMinerRecipeswillruntheanalysisandcreatethe
modelresults.
Options tab.TheOptionstabofSTATISTICADataMinerRecipesisusedtoset
globaloptionsforrecipesusingverylargedatafiles.Optionsincludespecifications
forsamplingandformaximumfilesizetosaveintheProjectWorkbook.Since
mostoftheseoptionsareappliedtotheDatapreparationstep,theyshouldbeset
priortostartingworkonanewrecipe.Modificationstothevaluesonthistab
applyonlytothecurrentrecipeunlessyouclicktheSavedefaultsbutton.
Data Preparation
Connecting data.OntheDatapreparationtab,clicktheOpen/Connectdatafile
button.IntheSelectDataSourcedialog,clicktheFilesbuttontobrowsetoand
opentheCreditScoring.stadatafile(locatedintheDatasetsfolderinstalledwith
STATISTICA).Ifthedatafileisalreadyopen,itwillbelistedintheOpen
SpreadsheetDocumentsfolder;doubleclickittoopenit,orselectitandclickthe
OKbutton.
OntheDatapreparationtab,clicktheSelectvariablesbutton,andintheSelect
variablesdialog,select:
Variable1(CreditRating)astheTarget,categoricalvariable,
Variables3,6,and14asInput,continuouspredictors
Variables2,45,713,and1518asInput,categoricalpredictors,and
Variable19TrainTestastheTestingsamplevariable.

Then,clicktheOKbutton.
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


66STATISTICAQuickReference
SelecttheAdvancedtabintheDataminerrecipesdialog,andselecttheUse
sampledatacheckbox.SelecttheStratifiedrandomsamplingoptionbuttonas
thesamplingstrategytoensurethateachclassofthedependentvariableCredit
Ratingisrepresentedwithapproximatelyequalnumbersofcasesintrainand
validationsets.ThenclicktheMoreoptionsbuttontodisplaytheStratified
samplingdialog.ClicktheStratavariablesbutton,selectCreditRatingasthe
stratavariable,andclickOKinthisdialogandintheStratifiedsamplingdialog.
ClicktheNextstepbuttonfortheDatapreparationsteptoensurethatthisstep
hasbeensuccessfullycompleted(inthestepnodepanelnexttoDatapreparation,
theyellow

changestoagreen ).
Data for Analysis
AftertheDatapreparationstepiscompleted,theDataforanalysisstepwillbe
selectedautomatically.OntheDataforanalysistab,clicktheSelecttesting
samplebutton,andintheTestingSampleSpecificationsdialog,selectthe
Variableoptionbutton.Verifythatthecategory(value)Trainisenteredinthe
CodefortrainingsamplefieldandTestisenteredintheCodefortestingsample
field.

Then,clicktheOKbutton.Themodelswillbefittedusingthetrainingsampleand
evaluatedusingtheobservationsinthetestingsample.Byusingobservationsthat
didnotparticipateinthemodelfittingcomputations,thegoodnessoffitstatistics
computedfor(predictedvaluesderivedfrom)thedifferentdataminingmodels
(algorithms)canbeusedtoevaluatethepredictivevalidityofeachmodeland,
hence,canbeusedtocomparemodelsandtochooseoneormoreoverothers.
Descriptive statistics.Thisstepwillalsocomputedescriptivestatisticsforall
variablesselectedintheanalysis.Descriptivestatsprovideusefulinformation
aboutrangesanddistributionsofthedatausedfortheproject.
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference67
ClicktheNextstepbuttontoensurethatthisstepissuccessfullycomplete.
Data Redundancy
Now,theDataredundancystepwillbeselected.Thepurposeofthisstepisto
eliminatehighlyredundantpredictors.Forexample,ifthedatasetcontainedtwo
measuresforweight,oneinkilogramtheotherinpounds,thenthosetwo
measureswouldberedundant.
OntheDataredundancytab,selecttheCorrelationcoefficientoptionbutton,and
specifytheCriterionvalueas0.8.ClicktheNextstepbuttontoeliminatethe
redundantpredictorsthatarehighlycorrelated(r0.8).Sincethereisno
redundancyinthedatasetweareusinginthisexample,amessagedialogwillbe
displayedstatingthis.

ClicktheOKbutton.Thedatacleaningandpreprocessingformodelbuildingisnow
complete.
Target Variable: Building Predictive Model
Next,weneedtobuildpredictivemodelsforthetargetinthisexample.Inthe
stepnodepanel,theTargetvariablenodehasabranchingstructurewiththe
parentnodeconnectingtofourchildnodesincludingImportantvariables,Model
building,Evaluation,andDeployment.

Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


68STATISTICAQuickReference
Dimension reduction.TheImportantvariablesnodeisselectedautomatically.In
thisstep,thegoalistoreducethedimensionalityofthepredictionproblem,i.e.,to
selectasubsetofinputsthatismostlikelyrelatedtothetargetvariable(inthis
exampleCreditRating)and,thus,ismostlikelytoyieldaccurateanduseful
predictivemodels.Thistypeofanalyticstrategyisalsosometimescalledfeature
selection.
Twostrategiesareavailable.WhentheFastpredictorscreeningoptionbuttonis
selected,theprogramwillscreenthroughthousandsofinputsandfindtheones
thatarestronglyrelatedtothedependentvariableofinterest.Whenthe
Advancedscreeningoptionbuttonisselected,treemethodsareusedtodetect
importantinteractionsamongthepredictors.
Forthisexample,selecttheAdvancedscreeningoptionbuttonasthefeature
selectionstrategy,andthenclicktheAdvancedscreeningbuttontodisplaythe
Advancedscreeningdialog.Enter12intheNumberofpredictorstoextractfield,
andselectEqualinthePriorclassprobabilitiesfield.

ClicktheOKbuttoninthisdialog,andthenclicktheNextstepbuttontocomplete
thisstep.Toreviewasummaryoftheanalysisthusfar,ontheStepstab,clickthe
Reportbutton,andfromthedropdownlist,selectSummaryreporttodisplaythe
Resultsworkbook.

Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference69
Thesepredictorswillbefurtherexaminedusingvariouscuttingedgedatamining
andmachinelearningalgorithmsavailableinSDMR.
Building models.TheDataminerrecipesdialogwasminimizedsothatyoucould
seetheResultsworkbook.ClicktheDataminerrecipesbuttonlocatedonthe
AnalysisBar(inthelowerleftcorneroftheapplication)todisplaythedialogagain.
Now,theModelbuildingnodeisselected.Inthisstep,youcanbuildavarietyof
modelsfortheselectedinputs.OntheModelbuildingtab,C&RT,Boostedtree,
andNeuralnetworkareselectedbydefaultasthemodelsoralgorithmsthatwill
automaticallybetriedagainstthedata.
Thecomputationsforbuildingpredictivemodelscanbeperformedeitherlocally
(onyourcomputer)orontheSTATISTICAEnterpriseServer.However,thelatter
optionisavailableonlyifyouhaveavalidSTATISTICAEnterpriseServeraccount
andyouareconnectedtotheserverinstallationatyoursite.Forthisexample,click
theBuildmodelbuttontoperformthecomputationslocallyonyourcomputer.
Thiswilltakeafewmoments;whenfinished,clicktheNextstepbuttonto
completethisstep.
Evaluating and selecting models.Now,theEvaluationnodeisselected.Onthe
EvaluationtabintheSelectmodel(s)field,ensurethatallmodelsareselected
(eachcheckboxisselected).ClicktheEvaluatemodelsbuttontoperformthe
competitiveevaluationofmodelsforidentifyingthebestperformingmodelin
termsofperformanceinthevalidationsample.
NoticethattheBoostingTreesmodelhastheminimumerrorrateof31.48%.In
otherwords,68.52%ofthecasesinthevalidationsamplearecorrectlypredicted
bythismodel.Notethatyourresultsmayvaryslightlybecausetheseadvanced
dataminingmethodsrandomlysplitthedataintosubsetsduringtrainingto
producereliableestimatesoftheerrorrates.
Thefollowingspreadsheetshowstheclassificationperformanceofthebestmodel
onthevalidationdataset.Thecolumnsrepresentthepredictedclassfrequencies,
aspredictedbytheBoostingTreesmodel,andtherowsrepresenttheactualor
observedclassesinthevalidationsample.
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


70STATISTICAQuickReference

Inthismatrix,youcanseethatthismodelpredicted68outof103badcredit
riskscorrectly,butmisclassified35ofthem.Thisinformationisusuallymuch
moreinformativethantheoverallmisclassificationrate,whichsimplytellsusthat
theoverallaccuracyis68.52%.
DisplaytheDataminerrecipesdialogagain,andclicktheNextstepbutton.A
messageisdisplayedwithinstructionstoselectonlyonemodelfordeployment.
ClickOK,andclearthecheckboxesadjacenttoC&RTandNeuralnetwork.Wewill
deploytheBoostingTreesmodelthatgaveusthebestpredictiveaccuracyonthe
testsample.Now,clicktheNextstepbuttonagain.
Deployment
ThefinalDeploymentstepinvolvesusingthebestmodelandapplyingittonew
datainordertopredictthegoodorbadcustomers.Thisstepalsoprovidesthe
optionforwritingbackthescoringinformation(classificationprobabilities
computedbythebestmodel,predictedclassification,etc.)totheoriginalinput
datafileordatabase.Thisisextremelyusefulfordeployingmodelsonverylarge
datasetstoscoredatabases.
OntheDeploymenttab,clicktheDatafilefordeploymentbuttonandopenthe
CreditScoring.stadatafile(locatedintheDatasetsfolderinstalledwith
STATISTICA).Fordemonstrationpurposes,weareusingthesamedatafilefor
deploymentofthebestmodel.
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference71

ClicktheNextstepbuttontoscorethisdatafileusingthebestmodel.Thescoredfile
withclassificationsandpredictionprobabilities(titledSummaryofDeployment)is
locatedintheDeploymentfolderintheprojectworkbookasshownbelow.

Summary
Thepurposeofthisexampleistodemonstratetheefficiencyofthedataminer
workflowimplementedinSTATISTICADataMinerRecipes.Withonlyafewclicks,
theprogramwilltakeyouthroughthecompleteanalyticprocessfromthe
definitionofinputdataandanalysisproblem,throughdatacleaningandpreparation
andmodelbuilding,allthewaytofinalmodelselectionanddeployment.
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


72STATISTICAQuickReference
Eventhoughmostofthecomputationalcomplexitiesofdataminingareresolved
automaticallyinSTATISTICADataMinerRecipes,whichenablesyoutomovefrom
problemdefinitiontoasolutionveryquicklyevenifyouareanovice,theprogram
willapplyandtryalargenumberofadvanceddataminingalgorithmsand
automaticallydeterminewhichapproachismostsuccessful.
Thus,theSTATISTICADataMinerRecipesmethodologyanduserinterfaceenables
youtoleveragethelargestcollectionofdataminingalgorithmsinasinglepackage
tosolveyourproblems.
DATA MANAGEMENT
Example 1: Spreadsheet Formulas
and Batch Formulas
YoucandefinenewvariablesforSTATISTICASpreadsheetsintermsofother
variables,sometimesreferredtoasvariabletransformations.Additionallyyoucan
verifydata,transformdata,andrecodedataonasinglevariable(asopposedtoa
setoftransformationformulas,i.e.,batchformulas).Thisisaccomplishedwith
spreadsheetformulas.
Toaccessspreadsheetformulas,doubleclickonavariableheaderinaSTATISTICA
SpreadsheettodisplaytheVariablespecificationdialog.Theformulaisentered
intotheLongname(labelorformulawithFunctions)field(alsocalledtheformula
editor)locatedatthebottomofthedialog.Whenyouenteralongvariablename
intheformulaeditorthatstartswithanequalsign,STATISTICArecognizesitasa
formulaandwillverifyitforformalcorrectness.
Theformulacanreferenceothervariableseitherbyname(MEASURE01,TIME),or
byabsolutevariablenumberusingtheVxsyntax,wherexistheabsolutevariable
number.Forexample,V3isvariablenumber3.V0hasspecialmeaning,andrefers
tothecurrentcasenumber.
Spreadsheetformulasareevaluatedacase(row)atatime.Foreachcaseinthe
spreadsheet,theformulaisevaluated,andreferencestotheothervariablesare
substitutedwiththeirvaluesfromthecurrentcase.
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference73
InSTATISTICA,randomaccessspreadsheetfunctionsenabletheformulatoaccess
variablevaluesfromothercases.AcommonexampleofthisistheLagfunction,
whichwillreferenceavariable,andlagitforwardorbackwardacertainnumberof
cases.
Thefollowingtablelistsseveralspreadsheetformulasandtheirresults.
Formula Result
=contains(v1,"B12C")
Returns1ifthetextB12Cisfoundinvariable1.Returns0ifno
matchisfound.
=(v1+v2+v3)/3 Computesthemeanofthefirstthreevariables.
=(v0<=10)*1+(v0>10)*2 Recodescases110as1.Theothercasesaresetto2.
=((v1=1)AND(v2=5))*5 Returnsthevalueof5ifv1=1andv2=5,otherwisesetto0.
=student(v4,15)
ReturnsprobabilitydensityvaluesoftheStudentstdistribution
basedonthevaluesofv4and15degreesoffreedom.
=cusum(v3) Performsacumulativesumofvariable3.
=v1+v2
Concatenatestwotextvariables:Ifv1='A'andv2='B',thenthe
resultis'AB'
=vnormal(rnd(1),50,3)
GeneratesrandomnumbersfromaNormaldistribution
(=50,=3)
=DTMonth(DTToday)
Returnsnumberrepresentingmonthoftheparameter,e.g.,3if
itiscurrentlyMarch
=match(v1,1,0,2,0,v1)
Comparesfirstvaluetoasetofvalue/resultpairs,returningthe
firstresultifthecorrespondingvaluematches.Ifnomatch,then
afinaldefaultresultisused.Forexample,returns0ifv1is1or
2,elsereturnsv1.
=trunc((v01)/10)
Assignsconsecutiveintegerstotheconsecutivesetsof10cases
(i.e.,casesnumber1through10willbeassigned0,cases
number1120willbeassigned1,andsoon
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


74STATISTICAQuickReference
Notethatyoucanclickthe

buttonintheVariablespecificationdialogto
displaytheFunctionBrowserdialog,whichcontainsthecompletelistofformulas
andoperators(=,+,>,and,or,etc.).
Example: Spreadsheet Formula
OpentheAdstudy.stadatafile.Wewillcreateanewvariablethatisthemeanof
variables3through25(i.e.,MEASURE01throughMEASURE23).
Doubleclickonthefirstblankvariableheader(aftervariable25).TheAddCases
and/orVariablesdialogwillbedisplayed.ClicktheOKbuttontoacceptthe
default,whichistoaddonevariable.
TheVariablespecificationdialogforthenewvariablewillbedisplayed.Inthe
Displayformatgroup,selectNumber.IntheLongnamefieldatthebottomofthe
dialog,enter:=mean(v3: v25).

ClicktheOKbutton.Adialogwillbedisplayedthatinformsyouwhetherthe
formulaisformallycorrect.ClicktheYesbuttontoproceed.Thenewvariableis
nowfilledwiththemeanofvariables3through25foreachcase.
Sinceyoucanrefertovariablesbytheirnamesortheirnumbers,theformulawe
justcreatedcouldalsobeexpressedas:=mean(MEASURE01:MEASURE23).
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference75
Example: Batch Formulas
Spreadsheetformulasareusefulfordefiningaformulaforonevariableatatime.
However,therearemanysituationsinwhichyouneedtoevaluateseveral
formulasfordifferentvariablessimultaneously.Thiscanbedonewiththebatch
formulasfacilitiesinSTATISTICA.
OpentheCharacteristics.stadatafile.Thisdatafilecontainsinformationabout
patientsinastudy.Forthisexample,wewill1)calculatepatientBodyMassIndex
(BMI)and2)convertheighttocentimeters(cm),andaddthesetwovariablesto
thedataset.
OntheDatatab,intheTransformationsgroup,clickTransformstodisplaythe
BatchTransformationFormulasdialog.

Theonlydifferencesinsyntaxbetweenthebatchtransformationformulasandthe
spreadsheetformulasisthesupportformultipleformulasinthebatchoption,and
thefactthatbecausethebatchformulasarenotattachedtoanyspecificvariable
(infacttheycanbefreelycopiedfromdatafiletodatafile),theycannotstartwith
anequalsign,butmusthaveatargetvariable(e.g.,v1=...orMeasure03=...)sothat
STATISTICAknowstowhichvariableeachformulashouldapply.Thereisalsoan
optiontodistributeallbatchformulasintotherespectivevariablesinthe
spreadsheetandsavethemwiththedatafile,effectivelyreplacingthe
spreadsheetformulas(ifthereareany).
FollowingarethecalculationsusedtocalculateBMIandtoconvertHeight(in)to
centimeters,andtheformulastoenterintheBatchTransformationdialog:
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


76STATISTICAQuickReference
Calculation Batch Transformation Dialog Entry

BMI=('weight(lb)'/'Height(in)'**2)*703

'Height(cm)'='height(in)'*2.54
IntheFormulasfield,enterthelistoftransformationformulastobeappliedtothe
activedataspreadsheet.Separateeachtransformationformulabyareturn(press
ENTERonyourkeyboard).

ClicktheOKbuttonintheBatchTransformationFormulasdialog.TheAddNew
Variables?dialogwillbedisplayed;clicktheYesbuttontoaddthetwonew
variablestotheCharacteristics.stadatafile.Amessagewillbedisplayedtoinform
youwhethertheexpressionsyouenteredintheBatchTransformationdialogare
correct.IftheyareOK,clickYestoproceed.STATISTICAcalculatestheformulas
andaddsthetwovariables,BMIandHeight(cm),tothespreadsheet.

Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference77
TheoptionsintheBatchTransformationFormulasdialogareparticularlywell
suited(optimized)fortransforminglargedatasets.Theformulaswillbeevaluated
onebyone,insequence,sothattheresultsofonetransformationinthelistcan
serveastheinputforthenext.Thus,itispossibletocreateanewvariablewith
oneformulaandthenusethatvariableinsubsequentformulas.
Clickthe buttonintheupperrightcorneroftheBatchTransformation
FormulasdialogtodisplaytheSTATISTICAElectronicManualtopicrelatedtothese
optionsandlinkstovariousothertopicscontainingexamplesofformulasand
syntaxrules.
Example 2: Input Data
Directly from Excel
InadditiontousingthetraditionalSTATISTICAspreadsheet,youcanopenExcel
filesinaSTATISTICAwindowandthenperformanalysesusingtheExcelfileasyour
datasource.
OntheSTATISTICAHometab,intheFilegroup,clicktheOpenarrowandselect
OpenExamplesfromthedropdownmenutodisplaytheOpenaSTATISTICAData
Filedialog.
FromtheFilesoftypedropdownlistatthebottomofthedialog,selectExcelFiles
(*.xls;*xlsx;*.xlsm).DoubleclicktheDatasetsfolder,andthenselecttheWeather
reportdatafile,whichisanExcelfile.ClicktheOpenbutton,andtheOpeningfile
dialogwillbedisplayed.
ClicktheOpenasanExcelWorkbookbutton,andtheExcelfilewillbedisplayed.
NotethatwhenanExcelworksheetisopenedinSTATISTICA,theExceland
STATISTICAmenusmerge,enablingyoutoaccesskeyfunctionalityforboth
applications.
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


78STATISTICAQuickReference

FromtheStatisticsmenu,selectBasicStatistics/Tables.TheSelectExcelRange
fortheAnalysisdialogwillbedisplayed.

ThisdialogisdisplayedwheneveryouselectacommandfromtheStatistics,Data
Mining,orGraphsmenuafteropeninganExcelworksheetintheSTATISTICA
application.NotethatSTATISTICAhasdeterminedthelogicalspecifications,but
theseoptionscanbechangedifnecessary.Whenvariablenamesarenotincluded
withtheExcelworksheet,STATISTICAwillassignvariablenames:Var1,Var2,Var3,
etc.AswithSTATISTICAspreadsheets,allvaluesinacolumnwillbeusedforthe
selectedanalysisunlesscaseselectionconditionsarespecified.
Forthisexample,clicktheOKbuttonintheSelectExcelRangefortheAnalysis
dialogtoacceptthedefaults;thedialogwillclose,andtheReview/EditColumn
Typesdialogwillbedisplayed.
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference79

InSTATISTICA,youcandefinethedatatypeforthespecificcolumns.Datatypes
includenumeric,text,mixednumericandtext,andmissingdata.Emptycellsinan
Excelworksheetarealwaystreatedasmissingdata,andwhenanumericcolumn
containstextvalues,thosevaluesarealsotreatedasmissingdata.STATISTICA
providesdefaultdatatypesforallcolumnsbasedonthefirstfewrowsofdata(in
fact,youcancleartheReview/Modifycolumntypesbeforeimportingcheckbox
intheSelectExcelRangefortheAnalysisdialogbeforeclickingOKinthatdialog,
andtheReview/EditColumnTypesdialogwillnotbedisplayed).However,youcan
changethedefaulttypesifneeded:selectthenameofthecolumnyouwantto
changeandclicktheEditbutton(ordoubleclickonthenameofthecolumnyou
wanttochange)todisplaytheChangeImportColumnTypedialog,whereyoucan
specifythetypeyouprefer.

Forthisexamplewewillacceptthedefaults,soclicktheCancelbuttoninthe
ChangeImportColumnTypedialog,andclicktheOKbuttonintheReview/Edit
ColumnTypesdialog.AfteryouclickOK,theStartupPanelfortheselectedanalysis
orgraphwillbedisplayed(inthisexample,theBasicStatisticsandTablesStartup
Panel),andyoucanproceedwiththeanalysisasusual.
Example 3: Accessing Data Directly
from a SQL Server Database
STATISTICAprovidesaccesstovirtuallyalldatabases(includingmanylargesystem
databasessuchasOracle,Sybase,etc.)viaSTATISTICAQuery,accessiblefrom
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


80STATISTICAQuickReference
eithertheHometab(intheFilegroup,clicktheOpenarrowtoaccesstheOpen
ExternalDatasubmenu)ortheDatatab(intheManagegroup,clickExternal
Data).ForimportingdatafromadatabasedirectlyintoaSTATISTICASpreadsheet
sothatitcanbesaved,thetooltouseisSTATISTICAQuery.
WithSTATISTICAQuery,youcaneasilyaccessdatausingOLEDBconventions.OLE
DBisadatabasearchitecture[basedontheComponentObjectModel(COM)]that
providesuniversaldataintegrationoveranenterprisesnetwork,frommainframe
todesktop,regardlessofthedatatype.
STATISTICAQuerysupportsmultipledatabasetables;specificrecords(rowsof
tables)canbeselectedbyenteringSQLstatements.STATISTICAQuery
automaticallybuildstheSQLstatementforyouasyouselectthecomponentsof
thequeryviaasimplegraphicalinterfaceand/orintuitivemenuoptionsand
dialogs.Hence,anextensiveknowledgeofSQLisnotnecessaryinorderforyouto
createadvancedandpowerfulqueriesofdatainaquickandstraightforward
manner.Multiplequeriesbasedononeormanydifferentdatabasescanalsobe
createdtoreturndatatoanindividualspreadsheet;hence,youcanmaintain
connectionstomultipleexternaldatabasessimultaneously.
Forthisexample,createanewdatabasequery:selecttheHometab,andinthe
Filegroup,clicktheOpenarrow.Fromthedropdownlist,selectOpenExternal
DataCreateQuery.STATISTICAQuerywillstart,andtheDatabaseConnection
dialogwillbedisplayed.

Fromthisdialog,youcanchooseexistingdatabaseconnectionsordefinenew
ones.Forthisexample,wellcreateanewdatabaseconnection,soclicktheNew
buttontodisplaytheDataLinkPropertiesdialog.
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference81

YoucanchooseeithertheOLEDBproviderthatwassuppliedbyyourdatabase
vendor,oroneoftheMicrosoftdefaultOLEDBprovidersthatiscompatiblewith
yourdatabasesystem.
Forthisexample,wellusetheNorthwindsampledatabaseinstalledwith
MicrosoftSQLServer,soselectMicrosoftOLEDBProviderforSQLServerandclick
theNext>>button.TheDataLinkPropertiesdialogConnectiontabwillbe
displayed.

SelectaserverfromtheSelectorenteraservernamedropdownlist.
Then,selectthelogonoptionbuttonappropriatetoyourSQLServerNorthwind
databaseinstallation.SelecteithertheUseWindowsNTIntegratedsecurity
optionbutton,orselecttheUseaspecificusernameandpasswordoptionbutton
andenteraUsernameandPasswordintherespectivefields.
Next,selectNorthwindfromtheSelectthedatabaseontheserverdropdownlist.
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


82STATISTICAQuickReference
ClicktheTestConnectionbuttontoattemptaconnectiontothespecifieddata
source.Iftheconnectionfails,ensurethatthesettingsarecorrect.Forexample,
spellingerrorsandcasesensitivitycancausefailedconnections.Iftheconnection
succeeds,clicktheOKbuttoninthemessagedialog.
ClickOKintheDataLinkPropertiesdialogtodisplaytheAddaDatabase
Connectiondialog.EnterNorthwindintheNameeditbox,andclickOK.
TheDatabaseConnectiondialogwillbedisplayedagain,withthenewNorthwind
connectiondefined.

Selectthisconnection,andclickOK.TheSTATISTICAQuerywindowwillbe
displayed,withallthedatabasetablesinthetreeviewontheleft.

RightclickontheOrderDetailstable,andfromtheshortcutmenu,selectAddto
addthetabletothetableviewpane(theupperrightpaneintheSTATISTICAQuery
window).Then,rightclickontheProductstable,andaddittothetableviewpane.
SincebothtablescontaintheProductIDfield,STATISTICAQueryautomaticallyjoins
thetwotablesonthiskey.
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference83

Toselectthefieldstoincludeinthequery,rightclickintheOrderDetailstablein
thetableviewpane,andfromtheshortcutmenu,selectSelectAllFields.Inthe
Producttable,selecttheProductNamefield.
ClickthePreviewDatatabinthelowerrightpanetodisplayapreviewofthe
query.

ClicktheSQLStatementtabtodisplaytheSQLStatementgeneratedbythequery.
ToreturnthedatatoaSTATISTICASpreadsheet,clickthegreenarrowonthe
STATISTICAQuerytoolbar.TheReturningExternalDatatoSpreadsheetdialogwill
bedisplayed,whereyoucancontrolwhetherthequerywillbeplacedintoanew
orcurrentspreadsheetandadjustotherqueryparameters.SelecttheNew
Spreadsheetoptionbutton,andclicktheRunNowbuttontorunthequery.Ifthe
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


84STATISTICAQuickReference
ConnecttoOLEDBProviderdialogisdisplayed,clicktheOKbutton.Afterafew
moments,thedataisreturnedtotheSTATISTICASpreadsheet.

NowthedatacanbeanalyzedwithanyoftheSTATISTICAtools.Notethatthe
spreadsheetretainsthedatabaseconnection,andyoucanrerunthequeryatany
time:selecttheDatatab,andintheManagegroupclickExternalData.Select
RefreshDatafromthedropdownmenu.YoucanalsopressF5onyourkeyboard
whenthespreadsheetisopen.
Example 4: Data Preparation
Cleaning and Filtering
Summary of Options for Data
Filtering/Recoding
Inpractice,mostofthetimerequiredtocompleteadataanalysisordatamining
projectisspentonthepreparationofdata.Sometimesasmuchas90%ofalltime
andeffortrequiredtocompleteaprojectisrelatedtothepropercleaningand
preparationofthedata.
Whenbuildingpredictionmodelsusingdataminingtools,orevenwhenjust
computingsimpledescriptivestatistics(averages,frequencydistributions),results
ofanalysescanbeverymisleadingif,forexample,largenumbersofduplicate
recordsareincluded(e.g.,thesamepartnumbersarerecordedmultipletimes)or
thedataincludeoutliersormiscodedvalues(outsidethevaliddataranges)or
excessivenumbersofmissing(blank)data.
OntheDatatab,intheTransformationsgroup,clickFilter/Recodetodisplaya
dropdownmenucontainingcommandstoaddresssuchdataqualityissuesquickly
andeffectivelysothatmeaningfulandvaliddataanalysesordataminingprojects
canbecompletedinlesstime.

Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference85
Filter Duplicate Cases
Usethisoptionwhenyoususpectthatyourdatafilemaycontainduplicaterecords
(e.g.,duplicate/identicalcustomerrecords).
Forexample,supposethatinananalysisofcustomerrecords,toidentifytypical
customerdemographics(profiles),youwanttocounteachcustomeronlyonce;
however,yourcustomerdatabaseisorganizedbytransactions,soeachcustomer
mayappearmultipletimes.Inthiscase,youcanusetheFilterDuplicateCases
optionstocreateadatafilefortheanalysescontainingonlyuniquerecords(i.e.,
whereeachcustomerIDislistedonlyonce).
Duplicate information example.OpentheDuplicates.stadatafile.Fromthe
Filter/Recodemenu,selectFilterDuplicateCasestodisplaytheFilterDuplicate
Casesdialog.IntheInputgroupbox,theVariablesoptionisusedtospecifythe
basisofdistinctionforduplicates.ClicktheVariablesbutton,andinthevariable
selectiondialog,selectRespondentsothatallrespondentswillbecheckedfor
duplicates.ClickOKinthevariableselectiondialogtoreturntotheFilterDuplicate
Casesdialog.

IntheInputgroupbox,clicktheCasesbuttontodisplaytheSpreadsheetCase
SelectionConditionsdialog,whichcontainsoptionstoselectonlyspecified
observationsorcasesforthededupingoperations.Inthisexample,wewillfilter
allthecases,soclicktheCancelbuttonintheSpreadsheetCaseSelection
Conditionsdialog.
TheUsecasenamescheckboxisclearedbydefault;wewillleavethisoptionasis
forthisexample.Whenthischeckboxisselected,casenamesareusedasoneof
thebasesfordistinction,i.e.,STATISTICAwilltreatasduplicatesanycasesthat
havethesamecasename(providedthecasesmatchonanyotherspecified
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


86STATISTICAQuickReference
variablesaswell).Whenthecheckboxiscleared,duplicatecasenamesare
ignored.
CleartheDataaresortedcheckbox(becausethecurrentdatafilehasnotbeen
sortedwhenyouhaveanextremelylargedatafile,itismoreefficienttosortthe
datafirst).
IntheOutputgroupbox,verifythatallvariablesareselected(ALLwillbeadjacent
totheVariablesbutton).Thisoptionisusedtoselectthevariablesintheinput
spreadsheetthatwillbeincludedintheoutput(filtered)spreadsheet;thedefault
isALL.
VerifythattheCreatenewspreadsheetcheckboxisselected(thedefault),and
selecttheCreateduplicatesspreadsheetcheckbox.Leavethelasttwooptionsat
theirdefaults:thePreserveordercheckboxiscleared[thenewspreadsheetswill
besortedbythevariable(s)thatwereselectedasthebasisofdistinction,inthis
example,Respondent],andtheCopyformattingcheckboxisselected.ClickOK.
Twonewspreadsheetswillbegenerated.Oneofthespreadsheetsis10vby51c
(10variablesby51cases)andcontainstherespondentsfromtheoriginal
spreadsheetexcludingtheduplications.Theotherspreadsheetis10vby9cand
containstheduplicaterespondentsthatwereextractedfromtheoriginal
spreadsheet.
Lookattheoriginalspreadsheet,Duplicates.sta,andnoticethatsomeofthe
variableheadersRespondent,State,andColorsareformatteddifferently.Then
lookatthetwonewspreadsheets;thevariableheadersforRespondent,State,and
Colorshavethesameformattinginallthreespreadsheets.STATISTICAusessub
settingtocreatethenewspreadsheetsandensuresthatvariablepropertiesofthe
parentspreadsheetaremaintainedinthechildspreadsheets.
Now,closethetwonewspreadsheets,butleavetheDuplicates.staspreadsheet
open.Noticethatitis10vby60c.FromtheFilter/Recodemenu,selectFilter
DuplicateCasestodisplaytheFilterDuplicateCasesdialogagain.IntheInput
groupbox,clicktheVariablesbutton,andinthevariableselectiondialog,select
RespondentandclickOK.IntheInputgroupbox,cleartheDataaresortedcheck
box.IntheOutputgroupbox,cleartheCreatenewspreadsheetcheckbox.Click
OK.Thedialogclosesand,insteadofcreatinganewspreadsheetwiththe
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference87
duplicatesexcluded,theDuplicates.staspreadsheetismodified.Allduplicatecases
areremovedfromit;itnowhas10vby51c.
Notethatthefilterduplicatecasesfunctionalitydoesnotusecasesensitivity
(uppercase,lowercaseletters)foracomparisonofuniqueness,i.e.,ifyouhave
tworespondentsC.BarrettandC.BARRETTthesecondrespondentwillbe
excluded.
Filter Sparse Data
Itisnotuncommonthatsomevariables(parameters,ordatafields)availablefor
(forexample)predictivemodelinghaveveryfewvaliddata.Forexample,ina
customerdatabaseselfreported(bycustomers)Incomemayberecorded;
however,veryfewcustomersactuallyvolunteeredtheircurrentincomes,somost
ofthedata(inthatfieldofthedatabase)isblank(ormissing).Inmanufacturing
data,adatafieldmayexisttorecordaspecificparameter,butthesensormightbe
faultyforanextendedperiodoftime,recordingmostlymissing(invalid)data.

Includingsuchsparselypopulated(withdata)variablesinananalysismayleadto
erroneousresults,orpreventyoufrombuildingpredictivemodelsaltogether
(dependingonhowthemissingdataarehandledlaterintheanalyses).Therefore,
youmaywanttoidentifysuchsparsevariablesaheadoftimeusingtheFilter
SparseDataoptions(accessiblefromtheFilter/RecodemenulocatedontheData
tabintheTransformationsgroup),andeliminatethemfromsubsequent
consideration.
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


88STATISTICAQuickReference
Process Invariant Variables
Asimilar(tothesparsedatacase)dataqualityissuethatoftenoccurs,inparticular
inindustrialmanufacturing(process)data,isthatsomevariables(parameters)that
arerecordedandincludedintheanalysesareinvariant,i.e.,allvaluesarethe
same.

Suchvariablesarenotusefulforpredictivemodeling,andtheProcessInvariant
Variablesoptions(accessiblefromtheFilter/RecodemenulocatedontheData
tabintheTransformationsgroup)enableyoutoidentifythosevariables
automatically,andexcludethemfromfurtheranalyses.
Recode Outliers
Extremedatavaluesoroutlierscangreatlyaffectvariousanalysesandcausepoor
accuracyofprediction(datamining)models.Thereisnoformaldefinitionofwhat
constitutesanoutlierorextremevalue,andSTATISTICAsgraphicaltoolsmay
providethebestwaytoreviewdatatoidentifysuchunusualobservations(e.g.,
youcouldcreateboxplotsofthekeyvariablestoidentifyextremeobservations
andbrushorflagtheminthedata).
Toautomaticallyprocesslistsofvariablestoidentifyandremoveoutliers,the
RecodeOutliersoptions(accessiblefromtheFilter/Recodemenulocatedonthe
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference89
DatatabintheTransformationsgroup)provideseveraltestsforoutliers
(approachesforidentifyingextremevalues).

Outlierscanberecodedtomissingdataortovaliddatavalues(e.g.,tothe
respectivepercentileboundaryvalues,etc.).
Process Missing Data
Missingdataorinvaliddatavaluesmustobviouslybedealtwithinamannerthatis
consistentwiththegoalsoftheanalyses.Insomecases,missingorinvaliddata
maythemselvesprovideusefulinformationaboutaprocessorvariableofinterest.
Forexample,inmarketingresearch,itiscommonthatrespondentswillrefuseto
providedetailedpersonalinformationregardingtheirhealth,financialstatus(e.g.,
savings),etc.,andsuchrefusalitselfmaybecorrelatedwithothersignificant
variablesofinterest(e.g.,refusaltoanswerquestionsrelatedtoincomemayitself
beagoodindicatorofhighincome,ifindeedwealthierindividualsinthesurvey
tendednottoanswerthosequestions).
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


90STATISTICAQuickReference

TheProcessMissingDataoptions(accessiblefromtheFilter/Recodemenu
locatedontheDatatabintheTransformationsgroup)enableyoutorecode
missingdataflexibly,definemultiplemissingdatavaluesorcodesforasingle
variable(whichcanthenberecodedtothevariablemissingdatacode),orjustto
flagvariablesthathavemorethanacertainpercentageofmissingdata.
Imputation of Missing Data
(k-Nearest Neighbor)
Itisoftennotclearhowbesttorecodemissingdata,andinfact,sometimesby
recodingmissingdataforaparticularvariabletoaspecificvalue(e.g.,themean),
thefinalresultsmaybebiased.Forexample,supposeinasurveyallrespondents
whorefusetoreporttheirincometendtobeinthehigherincomebracket.Inthat
case,assigningthemeanincometothoseindividuals(i.e.,recodingmissingdata
forvariableIncometothemeanincomeforthewholesample)mayyieldhighly
misleadingresults.
STATISTICAincludesaveryefficientmethod(applicabletoverylargedatasetsand
databases)forreplacingmissingdatawithvaliddatavaluesthatareconsistent
withtheotherobservationsinthesample.Detailsregardingtheknearest
neighbormethodandalgorithmareprovidedintheElectronicHelpforthe
MachineLearningmoduleofSTATISTICADataMiner.
Inshort,usingtheMDImputationoptions(accessiblefromtheFilter/Recode
menu),inafirstpassthroughthedata,theknearestneighboralgorithmwillselect
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference91
a(smaller)samplefromallavailabledata.Inthesecondpassthroughthedatafile,
whenmissingdataareencountered,theyarereplacedwithvalid(observed)values
foundinsimilarobservationsinthesmallersample(withrespecttoallother
variablesthatwereselected).Sotocontinuethisexample,ifindeedhigherincome
respondentsarelesslikelytoreportthisfact,butdoreportotherindicatorsof
highincome(e.g.,ownershipofaluxurycar,moresquarefootageoftheirhome,
etc.),thentheknearestneighboralgorithmwillaccuratelyassignthoseindividuals
(whofailedtoreporttheirincome)tothehighincomebracket.

Theknearestneighboralgorithmisfastandefficient,andprovidesaneffective
methodforreplacingmissingdataintheinputfilewithreasonableguesses
basedonsimilardatapointsinthesample.Thisapproachdoesnotmakeany
particularassumptionsaboutthenatureoftherelationshipsbetweenvariables
(i.e.,requirethatamodelbeestimatedforeachvariabletopredictmissingdata
values),butsimplyusestheobserveddataasthemodel.
Merge Data Files
TheSTATISTICAMergeOptionsdialogenablesyoutomergetwodatafileseither
bythevariablesorbythecasessothatyoucancentralizealloftheobservationsto
onetable.SelecttheDatatab,andintheManagegroup,clickMergetodisplay
theMergeOptionsdialog.
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


92STATISTICAQuickReference

ClicktheHelp buttonintheupperrightcornerofthedialogtoaccessHelp
topicsdescribingalltheoptionsinthisdialog.
Creating Subsets
Ifyouhavealargespreadsheet,youmaywanttocreateanewspreadsheet
containingaspecifiedsubsetofthecurrentspreadsheet.Forexample,open
Boston2.sta.Thisdatasetcontainsoverathousandcases.Wewanttoextract
housingtractswithlowmedianprices.
SelecttheDatatab,andintheManagegroup,clickSubsettodisplaytheCreatea
Subsetdialog.

ClicktheCasesbuttontodisplaytheSpreadsheetCaseSelectionConditions
dialog,whichcontainsoptionstocreateconditionstodefinetheselectionofcases
tobeconsideredforthesample.
SelecttheEnableSelectionConditionscheckboxtoactivatetheoptions,andthen
selecttheSpecific,selectedbyoptionbuttonintheIncludecasesgroupboxto
specifywhichcasestoincludeintheanalysis.Typev1=LOWintheExpressiontext
box.
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference93

ClicktheOKbuttontosettheselectionconditionsandreturntotheCreatea
Subsetdialog,andclicktheOKbuttoninthisdialogtocreatethenewspreadsheet.
Theresultantspreadsheetcontains334cases(insteadoftheoriginal1,012cases)
andall15variablesfromtheoriginalspreadsheet.ForthePRICEvariable,allcases
haveavalueofLOW.
Example 5: Using STATISTICA ETL
(Extract, Transform, and Load)
TheSTATISTICAETL(Extract,Transform,andLoad)moduleprovidesunique
capabilitiesforprocessingandmergingdata,inparticular,processdatathatare
difficulttomanageusingstandarddatabasetools.ETLautomatestheprocessof
validatingandaligningmultiplediversedatasourcesintoasinglesourcesuitable
foradhocorautomatedanalyses.
ETLofferstwooptionsforaligningdata:Timeindexed,whichaggregatesdatafrom
multipledatasourcesbasedonadate/timestampvariableandalignsdataby
minute,hour,day,week,month,quarter,oryear;andIDbased,whichaggregates
datafrommultipledatasourcesbasedonanidentifiervariableandanoptional
timevariable,andoptionallyalignsdatabyNequalintervalsorNuserspecified
intervals.
ThisexampleillustrateshowtheETLmodulehandlesstockrelateddatasetswith
differenttimeintervals.Stocksareboughtandsoldatvaryingpricesthroughout
eachday.Microsoft(tickerMSFT)andOracle(tickerORCL)aresoftwarecompanies
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


94STATISTICAQuickReference
thattradeontheNASDAQelectronicstockexchange.Inthisexample,wewill
comparedatasetscontaininghistoricalstockpriceswithdifferentdate/time
stamps.ThefirstsetcontainsdailyMicrosoftpricequotesfromNASDAQ,whilethe
secondsetcontainsweeklyOraclepricequotesfromanothersource.
OpenMicrosoftPrices.staandOraclePrices.sta:ontheHometabintheFilegroup,
clicktheOpenarrow.Fromthedropdownmenu,selectOpenExamplestodisplay
theOpenaSTATISTICADataFiledialog.DoubleclickontheDatasetsfolder,select
MicrosoftPrices.staandOraclePrices.sta,andclicktheOpenbutton.
Bothdatafilescontainthefollowingcolumns(variables):DATEthedayonwhich
atradetakesplace;OPENopeningpricefortheday,firsttradeoftheday;HIGH
thehighestpriceoftheday;LOWthelowestpriceoftheday;CLOSEclosing
pricefortheday,lasttradeoftheday;andVOLUMEthedailynumberoftraded
sharesofasecurity.
However,theyhavedifferentdateranges:Microsoft10/22/200701/04/2008;
Oracle10/18/200712/28/2007.Inordertocomparethedata,therangeswill
needtobealigned.
SelecttheDatatab.IntheManagegroup,clickExternalData,andselectTime
indexedProcessDatafromtheExtract,Transform,andLoad(ETL)submenu.The
STATISTICAExtract,Transform,andLoad(ETL):TimeindexedStartupPanel
isdisplayed.

ClicktheAdddatasourcebuttontodisplaytheSelectDataSourcesdialog.
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference95

ClicktheDocumentsbuttontodisplaytheSelectDocumentsdialog.Selectthe
OpenSpreadsheetsDocumentscheckboxtoselectbothdatafiles
(MicrosoftPrices.staandOraclePrices.sta).

ClicktheOKbuttonintheSelectDocumentsdialog,andthenclicktheOKbutton
intheSelectDataSourcesdialog.TheSTATISTICAExtract,Transform,andLoad
(ETL):TimeindexedStartupPanelwillappearasshownbelow:

SelectMicrosoftPrices.stainthefilelistatthetopofthedialog,andclickthe
VariablesbuttontodisplaytheSelectvariablesdialog.SelectDATEfromthe
Date/Timestamplist,andselectCLOSEfromtheVariableslist.

Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


96STATISTICAQuickReference
ClicktheOKbuttontoclosethisdialogandreturntotheSTATISTICAExtract,
Transform,andLoad(ETL):TimeindexedStartupPanel.
NowselectOraclePrices.stainthefilelist.ClicktheVariablesbutton,andselect
variable1fromtheDate/Timestamplistandvariable5fromtheVariableslist,
andthenclicktheOKbutton.
IntheAggregationintervalforalldatasource(s)groupbox,selecttheWeekly
optionbutton,andchangethestartfromfieldtoFriday.

Foradditionaldate/timeoptions,selecttheOptionstab.SelecttheFilterallinput
datasourcesbythefollowingDate/Timecheckbox.Tolimitthedatathatis
returnedfrombothoftheselecteddatafiles,enter11/2/2007intheStartdate
fieldand12/28/2007intheEnddatefield.Thiswillreturneightweeksofdata
(FridaytoFriday).

Now,clicktheResultsbuttontomergethedataintoaspreadsheet.
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference97

Thetwodatafilesarenowalignedweeklybydatefortherange11/2/2007to
12/28/2007.ThedailyclosingMicrosoftpricesareaggregatedasmeans,whilethe
weeklyclosingOraclepricesareunchanged.
TheResultsspreadsheetdisplaysdate/timestampsascasesnamessothatthey
canbeusedforgraphingtheaggregatedandaligneddata.
SelecttheGraphstab.IntheMoregroup,click2DandselectLinePlots(Variables)
todisplaythe2DLinePlotsVariablesdialog.
ClicktheVariablesbutton,andinthevariableselectiondialog,selectvariables2
and3.Then,clicktheOKbutton.Inthe2DLineplotsVariablesdialog,select
MultiplefortheGraphtype,andclicktheOKbutton.Thefollowingimageshows
theresultantgraphplottingMicrosoftandOracleprices.

Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


98STATISTICAQuickReference
ENTERPRISE INSTALLATIONS
Example 1: STATISTICA Enterprise
Server Download/Offload Analyses
from/to Servers
STATISTICAEnterpriseServerextendsthecapabilitiesoftheSTATISTICAplatform,
turningseveralstandaloneworkstationsintoapowerful,enterprisewide
collaborativeintelligencesystem.OneofthekeyfeaturesofSTATISTICAEnterprise
Serversclientserverarchitectureisthatitenablesyoutoutilizeserverside
resourcestorunmultiple,possiblytimeconsuming,orrepetitivestatistical
analyses(offloadtaskstotheserver)whileatthesametimefreeingthelocal
systemforothertasksthatrequireimmediateattention.Thiscanbeachieved
usingeitheraWebbrowser(thinclient)ordesktopversionofSTATISTICA
(thickclient,STATISTICAEnterpriseServerclient).Whiletheformerallowsaccess
toSTATISTICAEnterpriseServerusingonlyabrowser,thelatterrequires
STATISTICAinstallationonyourcomputer.STATISTICAEnterpriseServerstight
integrationwiththeSTATISTICAapplicationprovidescommonuserexperienceand
workflowforbothclientandserversideoperations,agenerallymorefeaturerich
andresponsiveuserinterface,andalltheadditionalcomponentsandtoolsof
desktopSTATISTICA.
Offloading an analysis (or a custom script) to STATISTICA Enterprise Server.
First,ensurethatSTATISTICAEnterpriseServerintegrationisenabled.Selectthe
Hometab,andintheToolsgroupclickOptionstodisplaytheOptionsdialog.In
thetreeview,selectServer/Web.SelecttheEnableSTATISTICAEnterpriseServer
Integrationcheckbox.TheonlyrequiredparameterisSTATISTICAEnterprise
Serversnetworkpath(andconnectionsettings,iftheyaredifferentfromthe
default).Askyournetworkadministratorforthesevalues.ItispossibletoEnable
IntegratedLoginifitissupportedandenabledontheserver;otherwiseyouwill
needtoenteryourusernameandpasswordwhenloggingintoSTATISTICA
EnterpriseServer.
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference99

Afterspecifyingtheoptionsonthistab,clicktheOKbutton.
TheServertabhasnowbeenaddedtotheribbonbar.IntheUsergroup,clickLog
In,andenteryourusernameandpasswordifrequested.Uponsuccessfully
establishingaconnection,theoptionsontheServertabwillbecomeavailable.
TheOpen,Save,andSaveAscommandsintheFilegroupareusedtouploada
currentlyopenfiletotheserverordownloadafileandopenitlocally.Thereare
alsoexplicitoptionsintheTransfergrouptoDownloadFiletoandUploadFile
fromspecificfoldersontheserverandtheclient.
Note:Asrealworldexamplesoftimeorresourceconsuminganalysesareusually
basedonlargedatasetsand/orinvolveiterativealgorithmsrepresentedby
STATISTICAcomponentsthatarenotincludedinallconfigurationsofSTATISTICA,
wearedeliberatelygoingtouseanexamplethatdoesnotrequiremuchtimeto
complete.Buteveninasituationwhereasingleanalysisisquickandnotresource
intensive,youmightneedtorunafairlycomplicated,timeconsumingsequenceof
tasks,possiblyscheduledatcertaintimeintervals.Inthiscase,theSTATISTICA
EnterpriseServerschedulingfacilitiescouldbeusedonceyouhavecreatedand
uploadedacustomscriptthatrepresentstherequiredtasks(forexample,by
combiningthemacrosrecordedduringaSTATISTICAsession).
Now,recordasampleanalysismacro;forexample,completethestepsdescribed
inExample2:ANOVA(page34).
Aftercompletingtheexample,intheANOVAResultsdialog,clicktheOptions
button(locatedatthebottomofthedialog),andfromthedropdownlist,select
CreateMacro.IntheNewMacrodialog,acceptalldefaults,andclickOK.Testthe
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


100STATISTICAQuickReference
generatedmacrobyrunningit(pressF5)toensurethatitproducesresultsas
expected.Clickonthemacrocodewindowtoensureithasthefocus.
Then,ontheServertabintheTasksgroup,clickOffloadtodisplaytheOffloada
taskdialog.

Weneedtoselectatasktooffload(ascriptoraDataMinerproject)and,
optionally,adatasetonwhichthetaskwilloperate(thedatasetcouldbean
optionalcomponentsinceDataMinerprojectsmayhavetheirdatasetsembedded
andmacrosmightexplicitlyloaddatasetsornotrequirethematall).
Sincethereisanopenactivedataset(Adstudy.sta)andanopenSTATISTICAMacro
(oursampleanalysis),thedefaultsettingsoftheoptionsintheOffloadatask
dialogspecifytousethemforoffloading.Instead,thisexamplewilldemonstrate
howtoreferenceataskandaserversidedataset.Thisoptionisusefulsinceit
givesyoutheadvantageofcentralserversidestorage,whichisespecially
beneficialinthecaseoflargedatasets(possiblydynamicallyupdated)thatare
usedbymultipleusers.
Toreferenceaserversidedataset,intheDataSourcegroupbox,selecttheSelect
datafilestoredontheserveroptionbuttontodisplaytheSTATISTICAEnterprise
ServerRepositorydialog.
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference101

ThedirectorystructureinthetreeviewofthedialogrepresentstheSTATISTICA
EnterpriseServerRepository(possiblyabridgedaccordingtoyourparticular
permissions).ClickontheDatasetsfolderintheleftpane,andselectAdstudy.sta
intherightpane(oryoucanenterthepathintheeditboxatthebottomofthe
dialog).
ClickOKintheSTATISTICAEnterpriseRepositorydialogandintheOffloadatask
dialog.STATISTICAwillsubmitthetasktotheserver,uploadingfilesifneeded.
Nowyoucanswitchtootheractivities,whileperiodicallymonitoringthestatusof
offloadedtasksbyclickingStatusintheTasksgroupontheServertabtodisplay
theTaskStatusdialog.ThefollowingillustrationshowsaTaskStatusdialog
containingseveraloffloadedtasks.

ThetaskliststatuscanbeupdatedmanuallybyclickingtheRefreshbuttonor
automaticallybyselectingtheAutomaticcheckboxinthelowerrightportionof
theTaskStatusdialog.TasksgothroughPendingandRunningstatestoeither
CompletedorScriptError.
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


102STATISTICAQuickReference
Ifyourtaskfails,doubleclickonthetaskentrytoviewadditionalinformation
aboutthefailure.Whentheerrorisfixed(e.g.,SVBscriptorDataMinerworkspace
isupdated),selectthefailedtaskandclicktheResubmitbutton.
Oncethetaskcompletessuccessfully,youcanretrievetheresults.Notethatsince
theresultsarelocatedontheserver,theyareavailablefromanySTATISTICAclient
workstationaslongasyouareloggedinunderthesamecredentials.TheResults
groupboxcontainsaTaskcheckboxandaDatacheckboxtoretrievethetask
sourceandthedataset(ifapplicable)backtotheclient.WhentheInBrowser
checkboxisselected,theresultswillbeopenedinthebrowser,switchingtoathin
client.Thisoptionisusefuliftheresultsareexpectedtobesignificantinsize;e.g.,
iftheanalysisgeneratesmanydatasetsand/orgraphs,youcansearchthrough
theminthebrowserandselectonlythespecificresultsyouwanttoretrieveto
yourdesktop.Tracereportprovidesadiagnosticreportoftaskexecution.
Tosavediskspaceontheserver,itisagoodpracticetodeletetaskresultsthatare
nolongerneeded.Amessagewillbedisplayedeverytimeresultsarerequested
askingiftheresultsshouldbedeletedafterretrieval(unlesstheDeletetaskafter
retrievalcheckboxiscleared).ClickOKtodeletetheresults.
Onceourtaskcompletes,weretrievetheresultsandclosetheTaskStatusdialog.
Resultsareequivalentwhetherrunlocallyorontheserver.

Example 2: Using STATISTICA in


Regulated Environments
Inaregulatedenvironment,analysesconductedforGxP(GoodManufacturing
Practices,GoodClinicalPractices,GoodLaboratoryPractices)applicationsareones
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference103
thatimpactconsumersafetysuchasinclinicaltrials,manufacturing,andquality
control.WhenabusinessconductsanalysesforaGxPapplication,regulatory
bodiesrecommendthatthecompanybeabletoprovethattheresultsofthe
validatedanalysissystem(e.g.,STATISTICA)areundercontrol.STATISTICA,through
itsaudittrailandspreadsheet/reportlockingfeatures,offersthetoolsyouneedto
meetthisregulatoryrequirement.
InordertomeettraceabilityrequirementsforGxPapplications,thereareatleast
threeconcerns:1)controloftheinputdatabeingsubmittedtotheanalysis(i.e.,
knowingwhomadewhatchange,atwhattime,forwhatreason;andtheold
valuesandnewvalues),2)controloftheresultstablesandgraphs(e.g.,
demonstratethattheywerenotalteredinanywayaftertheywerecreated),and
3)traceabilitybetweentheversionoftheinputspreadsheetandtheresults
outputs.STATISTICAprovidesthisinformationthroughitsSpreadsheetAuditTrails
andGxPReportsfunctionality.
SeealsoSTATISTICADocumentManagementSystemintheElectronicHelpfor
moredetailsaboutversioning/historyofSTATISTICAdocuments.
Control of Input Data
Enable Audit Trail Logging
OpenaSTATISTICASpreadsheet.SelecttheToolstab,clickAuditTrail,andselect
Settingsfromthedropdownmenu.TheSpreadsheetAuditLogSettingsdialog
willbedisplayed.SelecttheEnableaudittrailloggingcheckboxtoenableaudit
trailloggingforthecurrentspreadsheet.

Notethatwhenspreadsheetaudittrailloggingisenabled,thespreadsheetis
automaticallysettodirectmode,i.e.,changesmadetothespreadsheetwillbe
immediatelywrittentodisk.Thus,whenaudittrailloggingisenabled,changesto
thedatafilecannotbeundone.
SelecttheRequireuserstoenterreasoncommentsforeachchangecheckboxto
requireuserstoexplaineachchangemadetothespreadsheet.
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


104STATISTICAQuickReference
TheTruncatelogbuttonisavailableonlyifaudittraillogginghaspreviouslybeen
specified,andthereisacurrentSpreadsheetAuditLogViewerattachedtothe
spreadsheet.Clickingthisbuttonwilltruncatethespreadsheetloganddeleteall
existingentries.Youwillbepromptedtoconfirmthisactionbeforethecurrent
entriesaredeleted.Oncethelogistruncated,thetruncateactionwillberecorded
inthenewlytruncatedlogfile.
ClickOKintheSpreadsheetAuditLogSettingsdialog,andaudittrailloggingwill
beenabled;infact,theEnterreasonforchangedialogwillbedisplayed
immediatelyinordertoenterthereasonforenablingtheloggingfunction.Entera
comment,andclickOK.
Now,rightclickintheheaderofthelastvariableinthespreadsheet,andselect
AddVariablesfromtheshortcutmenu.IntheAddVariablesdialog,wewillaccept
alldefaults,soclickOK.TheEnterreasonforchangedialogwillbedisplayed;you
mustenteracommentandclickOKbeforethechangewillbemade.Whenaudit
trailloggingisenabled,everychangemadetothespreadsheetwillbe
documented,andwhentheRequireuserstoenterreasoncommentsforeach
changecheckboxisselected,usercommentsalsowillbestoredanddisplayedin
theSpreadsheetAuditLogViewer.
Next,ontheToolstab,clickAuditTrailandselectViewLogtodisplaythe
SpreadsheetAuditLogViewer.

Thelogviewerdisplaysagridofinformationregardingtheauditedactions
includingthesequencenumber,timeofchange,thecomputerusedtomakethe
change,userinformation,thenatureofthechange,andthereasonforthechange.
Columnwidthsintheloggridcanbeincreasedanddecreasedusingstandard
Windowstechniques.TheSpreadsheetAuditTrailsaresavedandembeddedinto
eachrespectivespreadsheet.
Password encryption vs. locking.Aspreadsheetcanbepasswordencryptedso
thatitcannotbeopenedwithoutthecorrectpassword.Onlyuserswhoknowthe
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference105
passwordcanopenthespreadsheet.Onceapasswordencryptedspreadsheetis
opened,itcanbemodified.
Alternatively,lockingaspreadsheetmakesportionsofthespreadsheetreadonly,
enablingyoutopreventchangestosomeorallaspectsofthespreadsheet.The
spreadsheetcanbeopenedbyanyone,butlockedportionscannotbealtered.
Boththepasswordencryptionoptionsandspreadsheetlockingfacilitiescanbe
usedsimultaneously.
Password Encrypt a Spreadsheet
OpenaSTATISTICASpreadsheet.ClicktheStartbutton intheupperleftcorner
oftheribbonbar,andfromthedropdownmenuselectPropertiestodisplaythe
DocumentPropertiesdialog.SelectthePasswordtab.

EnterapasswordintheDocumentPasswordfield,andclicktheOKbutton.The
Passworddialogwillbedisplayed,whereyoureenterthepasswordtoconfirmit;
passwordsarecontextsensitive.

ClicktheOKbuttoninthePassworddialog,andclosethedatafile.Adialogis
displayedwhereyoucanchoosetosavethechanges;clicktheYesbuttonsothat
thepasswordwillbeencrypted.Thenexttimeanyoneattemptstoopenthis
spreadsheet,thePassworddialogwillbedisplayed,andthecorrectpassword
mustbeenteredbeforethespreadsheetwillopen.
Lock a Spreadsheet
Inordertomeetcompliancerequirements,itisnecessarytohavecontrolofthe
reliabilityofinputdata.Usingthespreadsheetlockingoptions,youcanprevent
changestoallspreadsheetfeatures,fromtheappearanceofthedata(i.e.,display
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


106STATISTICAQuickReference
elements,variablespecifications)totheactualdataandanycaseselection
conditionsorweightsthataredefinedforthespreadsheet.Ofcourse,sometimes
changeshavetobemade(e.g.,whendataareincorrectlyentered).The
STATISTICASpreadsheetAuditTrailfacility,whenenabled,willrecordeachchange
madetothespreadsheet.
WithSTATISTICAEnterpriseproducts,onlyuserswithSystemAdministrator
permissionscanmodifySpreadsheetAuditTrailsettings.Formoreinformation,
seetheElectronicHelpforSTATISTICAEnterprisefacilities.
Withaspreadsheetopen,selecttheToolstab.ClickLockingtodisplaytheLock
Spreadsheetdialog.

Here,youcanspecifywhichaspectsofthespreadsheetthatyouwanttolock.
Whenuserstrytochangealockedfeature,amessagewillbedisplayed,informing
themthatthespreadsheetislocked.
SelecttheSpreadsheetdatacheckboxtopreventchangestotheactualdata
containedinthespreadsheet.Userswillbeunabletochangethedatavaluesand
themissingdatacode.Theywillalsobeunabletoperformanydatamanagement
operationsthataffectthespreadsheet(e.g.,changethedatatypeorthelengthfor
textvariables).Ifthischeckboxiscleared,userswillbeabletoeditthedata(e.g.,
byupdatingqueriesandSpreadsheetFormulasorbysimplytypinginnewvalues).
SelecttheDisplayelements(fonts,formats,etc.)checkboxtoprohibitthe
modificationoffontsandformatsusedinthespreadsheet.Optionsforchanging
thefontsize,color,type,andstyle(i.e.,bold,underline)willbedimmed.
Additionally,theoptionsforapplyingspreadsheetlayouts(accessiblebyselecting
theFormattabandclickingLayoutsintheSpreadsheetgroup)willbeunavailable.
SelecttheCaseselectionandweightscheckboxtopreventusersfromchanging
caseselectionconditionsandcaseweightsforthelockedspreadsheet.Userswill
notbeabletotoggletheuseofselectionconditionsorchangethecurrently
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference107
definedselectionconditions.MostoptionsontheSelectiontaboftheSpreadsheet
CaseSelectionConditionsdialogwillbedimmed;however,optionsontheother
tabsofthatdialog(e.g.,creatingsubsamples,applyingformatstoselection
conditions)arestillavailable.OptionsontheCaseWeightsdialogwillbe
unavailable.
SelecttheVariablespecificationscheckboxtopreventchangestothevariable
specifications(e.g.,measurementtype,missingdatacode,displayformat,long
variablename).Userswillstillbeabletoviewtheindividualvariablespecification
dialog(accessiblebydoubleclickingthevariableheader)andtheVariable
SpecificationsEditor;however,optionsforchangingthesespecificationswillbe
dimmed.
SelecttheAudittrailcheckboxtopreventchangestotheaudittrailsettings.Users
willbeunabletomodifytheaudittrailsettings.
Enterapasswordtousewhenlockingandunlockingthespreadsheet,confirmthe
password(whichiscontextsensitive),andclickOK.Althoughapasswordisnot
required,itisstronglyrecommended.Ifapasswordisnotenteredandconfirmed,
anyusercanunlockspreadsheetfeaturesbysimplyclearingtheselectedcheck
boxes.Notethatiflockshavealreadybeendefined,youmustenterthecorrect
passwordbeforelockscanbechangedormodified.
Nowtrymakingchangesinthespreadsheet;amessagewillbedisplayedinforming
youthattheoperationcannotbecompletedbecausethespreadsheetislocked.
Controlling Results and Traceability
Tomeetcompliancerequirements,anotherstepistoensurethatreportedresults
areundercontrol.STATISTICAprovidesoptionsforcreatingGxPreports.InGxP
mode,allresultsaresenttoareportwindow,andthewindowislocked.All
optionsforremovingresults(Cut,ExtractOriginal,Clear,etc.)andaddingresults
(Paste,Insert)aredisabled.STATISTICAcanalsoincludeacreationdateinall
reportsaswellasatimestampforallresultsthatareaddedfromresultsdialogs.
Theappearanceandcontentofthecreationdateandtimestamparecompletely
configurableandcanincludeuserandcomputerinformationinadditiontothe
timeanddate.Thus,inGxPmode,youcanknowwhentheresultswerecreated
andbywhom.Youcanalsobecertainthatresultshavenotbeenremoved.
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


108STATISTICAQuickReference
AnadditionalfeatureofGxPmodeisatraceabilityoption.WhenrunninginGxP
mode,STATISTICAautomaticallyverifieswhetherspreadsheetaudittrailsare
enabled.Iftheyare,STATISTICAincludesthespreadsheetnameandversion
numberinthereport.Sometimesversionnumbersarenotavailable,forexample,
ifaudittrailsarenotenabledortheresultsarecreatedfromanInplaceDatabase
connection.Whenthatisthecase,STATISTICAwillprovideanexplanationforwhy
aversionnumberisnotavailable.
Create a GxP Report
SelecttheHometab.IntheToolsgroup,clickOptionstodisplaytheOptions
dialog.Inthetreeview,selectOutputManager,locatedunderAnalyses/Graphs.
FromtheReportOutputdropdownlist,selecteitherSendtoMultipleReports
(oneforeachAnalysis/Graph)orSingleReport(commonforallAnalyses/
Graphs).
SelecttheLockedcheckboxtomaketheReportLocking(GxPReports)options
becomeavailableandtoensurethatdocumentscannotberemovedfromthe
report.OptionspertainingtoreportssuchasCut,Paste,Delete,Extract,etc.,will
bedisabled.

Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference109
Toincludeacreationstampatthetopofthefile,youcanacceptthedefault
formatintheCreationStampfield,orenteryourown.Thefollowingcodescanbe
usedinthisfield:&[Date],&[Time],&[User],and&[Computer].Anytextyouenter
willbedisplayedasis.
Toincludeatimestampaboveeachobjectasitisaddedtothereport,youcan
acceptthedefaultformatintheTimeStampfield,orenteryourown.The
followingcodescanbeusedinthisfield:&[Date],&[Time],&[User],and
&[Computer].
ClickOKintheOptionsdialog,andnowperformanyanalysis;e.g.,useBasic
StatisticstocreateaquickDescriptiveStatisticssummaryspreadsheet.Whenyou
clicktheSummarybutton,theresultswillbesenttoalockedreportthatliststhe
creator,date,time,etc.,oftheanalysis.

Example 3: STATISTICA Enterprise


STATISTICAEnterpriseproductsextendthefunctionalityofSTATISTICAapplications
byofferingcollaborativework,centraladministration,systemlevelcustomization,
andotherfeaturesnecessarywhenusingSTATISTICAapplicationsaspartofthe
enterpriselevelcomputersystems.
STATISTICAEnterpriseManagerisacomponentoftheSTATISTICAEnterprise
systemthatenablesuserstoconfigurevariousaspectsoftheEnterprisesystem
includinguseradministration,systemvieworganization,databaseconnection
maintenance,dataconfigurations,andanalysisconfigurations.
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


110STATISTICAQuickReference
Forthisexample,wewill:
1. Createanewuser
2. Createanewgroup
a. Assignpermissionstothegroup
b. Addtheuser(seeNo.1)tothegroup
3. Createasystemviewnode
4. Createanewdatabaseconnection
5. Createadataconfiguration
6. Createananalysisconfiguration
7. Runtheanalysisconfiguration
System View vs. Object View
Beforestartingthisexample,onethingshouldbenoted.InSTATISTICAEnterprise
Manager,ontheViewtab,youcanselecteitherSystemVieworObjectView.In
SystemView,objects,e.g.,dataconfigurationsandanalysisconfigurations,are
shownaschildnodes.InObjectView,objectsareshownaschildnodeswithin
theirrespectivecategories.Forthisexample,SystemViewshouldbeselected.
1. Create a New User
LaunchtheEnterpriseManager,andloginasauserwhoispartofthedefault
Administratorgroup.Inthetreeview(theleftpane),clicktheplussign nextto
theUserAdministrationnodetoexpandit,andthenselecttheUsersfolder.Inthe
propertiespage(therightpane),clicktheNewUserbuttontodisplaytheoptions
tocreateanewuser.IntheNamefield,enterTestUser1,anddefineapassword
andconfirmthepassword.
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference111

Then,clicktheCommitChangesbutton locatedatthetopoftheapplicationon
theQuickAccesstoolbartosavethechanges.Amessagewillbedisplayedthat
informsyouthattheuserdoesnthavepermissiontologin.ClicktheYesbuttonto
continue.
Wewillnowcreateagroup,givethegrouppermissions,andassignthenewuser
tothatgrouptoallowtheusertohavepermissiontologontotheEnterprise
Manager.Withthismethod,anypermissionchangeswillonlyneedtobeapplied
tothegroupinsteadoftheindividualusers,makingmaintenanceofusersin
STATISTICAEnterpriseeasier.
2. Create a New Group
IntheUserAdministrationnode,selecttheGroupsfolder,andintheproperties
page,clicktheNewGroupbuttontodisplaytheoptionstocreateanewgroup.In
theNamefield,enterTestGroup1.IntheGroupMembersframe,selectthecheck
boxadjacenttoTestUser1.Thiswilladdthepreviouslycreatedusertothegroup.
IntheGroupPermissionsframe,selectthecheckboxesadjacenttoAnalysisAdmin
(AADM)andWebUser(WUSR).Inthetreeview,clicktheplussign adjacentto
the

TestGroup1nodetoexpandit,andselectAnalysismodules.Intheproperties
page,clicktheSelectAllbuttontoselectallofthemodulesintheAvailable
analysismodule(s)list.
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


112STATISTICAQuickReference

ThiswillgiveusersofthisgrouppermissiontologontobothWebanddesktop
STATISTICAandrunalloftheavailableanalysesandreports.
ClicktheCommitChangesbutton

tosavethechanges.
Wehavenowcreatedthenecessaryuserandgroupsecuritytorunanalysesand
reports.Whencreatingthedata,analysis,andreportconfigurationsinthenext
steps,wewillassignthisgrouptothoseobjectstoallowonlyuserswithinthe
grouptorunthem.
3. Create a System View Node
NowwewillcreateaSystemViewnodetoholdthisexamplesdata,analyses,and
reportconfiguration.Inthetreeview,clicktheplussign adjacenttotheSystem
Viewnodetoexpandit.RightclickontheSTATISTICAEnterprisefolder,andfrom
theshortcutmenu,selectNewFolder.IntheFoldernametextboxinthe
propertiespage,enterTestExample1asthenewfoldersname.
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference113

ClickCommitChangestosavethechange.Thisfolderwillnowbeusedtohouse
thedata,analyses,andreportconfigurations.
4. Create a New Database Connection
RightclickontheDatabaseConnectionsnodeinthetreeview,andfromthe
shortcutmenu,selectNewDatabaseConnectiontodisplaytheDataLink
Propertiesdialog.

Forthisexample,wellusetheNorthwindsampledatabaseinstalledwith
MicrosoftSQLServer,soselectMicrosoftOLEDBProviderforSQLServer,andclick
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


114STATISTICAQuickReference
theNext>>button.TheDataLinkPropertiesdialogConnectiontabwillbe
displayed.
SelectaserverfromtheSelectorenteraservernamedropdownlist.
Then,selectthelogonoptionbuttonappropriatetoyourSQLServerNorthwind
databaseinstallation.SelecteithertheUseWindowsNTIntegratedsecurity
optionbutton,orselecttheUseaspecificusernameandpasswordoptionbutton
andenteraUsernameandPasswordintherespectivefields.
Next,selectNorthwindfromtheSelectthedatabaseontheserverdropdownlist.

ClicktheTestConnectionbuttontoattemptaconnectiontothespecifieddata
source.ApromptwillbedisplayedthatacknowledgesthattheTestconnection
succeeded.Ifitdidntsucceed,checkyouraccesspermissionstothefileand
ensurethatthesettingsarecorrect.Forexample,spellingerrorsandcase
sensitivitycancausefailedconnections.

ClickOKintheprompt,andclickOKintheDataLinkPropertiesdialog.Inthe
resultingpropertiespage,enterTestExampleConnection1intheNamefield.
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference115

Then,clicktheAccessPermissionsbutton.FromthelistofAvailableUsersand
Groups,selectTestGroup1,andthenclickthetoparrowbutton tomoveTest
Group1totheAccessPermissionslist.

Now,clicktheCommitChangesbutton.
WiththedatabaseconnectioncreatedtotheNorthwinddatabase,wewillnow
createadataconfigurationtoextractdatafromthedatabase.
5. Create a Data Configuration
RightclickontheTestExample1folderinthetreeview,andfromtheshortcut
menu,selectNewDataConfiguration.Inthepropertiespage,enterTestExample
1intheNamefield.ClickthearrownexttotheConnectionfield,andfromthe
dropdownlist,selectTestExampleConnection1.
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


116STATISTICAQuickReference

ClicktheNextStepbuttontodisplaythenewqueryoptions.

Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference117
ClicktheSQLWizardbuttontodisplaytheNewQuerydialog,whichwillopenin
STATISTICA.

DragtheOrderstablefromtheleftpaneintotheeditorviewer(theupperright
pane),andthenselect,inthefollowingorder,theOrderID,ShipVia,ShipCountry,
andFreightfields.

SelectthePreviewDatatabinthequerypropertiesview(lowerrightpane)and
clicktheRefresh

toolbarbutton(theredexclamationmark).Thiswilltestthe
querytoensurethatvaluesarebeingretrievedfromthedefinedquery.

Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


118STATISTICAQuickReference
ClicktheReturnDatatoSTATISTICA toolbarbutton(greenarrow)tosubmitthis
querybacktothedataconfiguration.

ClicktheOrderIDrowtohighlightit,andthenclicktheEditbuttontodisplay
optionstoedittheOrderIDcolumn.ClicktheAutoUpdatearrow,andfromthe
dropdownlist,selectFirstupdatecolumn.Thisenablesyoutodetectchangesin
theOrderIDcolumn.Inaddition,thecolumnissorted.

ClicktheNextStepbuttontoedittheShipViacolumn.ClicktheFilteringbuttonto
displaythefilteringoptions,andselecttheEnabledcheckboxtoallowfilteringon
theShipViacolumn.
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference119

ClicktheNextStepbuttontoreturntoShipViacolumneditingoptions,andthen
clicktheNextStepbuttontoedittheShipCountrycolumn.ClicktheFiltering
buttontodisplaythefilteringoptions,andselecttheEnabledcheckboxtoallow
filteringontheShipCountrycolumn.ClicktheNextStepbuttontoreturntothe
ShipCountrycolumneditingoptions,andthenclicktheNextStepbuttontoeditthe
Freightcolumn.ClicktheTargetTypearrow,andfromthedropdownlist,select
VariableCharacteristic.Thisoptionwillmakethiscolumnavailabletoperform
packagedSPCanalyses(thisisthecolumncontainingthedatatobeanalyzed).

Next,clicktheNextStepbuttontodisplaytheAccessPermissionsoptionsforthis
object.FromthelistofAvailableUsersandGroups,selectTestGroup1,andthen
clickthetoparrowbutton tomoveTestGroup1totheAccessPermissionslist.
Nowthisdataconfigurationwillbeexecutable(butnoteditable)bytheusersof
TestGroup1.
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


120STATISTICAQuickReference
ClicktheCommitChangesbuttontocommitthisnewdataconfigurationto
STATISTICAEnterpriseManager.
6. Create an Analysis Configuration
NowthatadataconfigurationhasbeendefinedtoextractdatafromtheNorthwind
database,ananalysisconfigurationtoanalyzethedataneedstobecreated.
Inthetreeview,rightclickontheTestExample1folder,andfromtheshortcut
menu,selectNewAnalysisConfigurationtodisplaytheSelectaData
Configurationdialog.SelecttheTestExample1object,andclicktheOKbutton.Ifa
dialogisdisplayedwiththestatement:Whenselected,thisoptionwillreplacethe
permissionsofthisAnalysiswiththoseoftheselectedData,clickOK.

ClicktheNextStepbuttontocontinuecreatingtheanalysisconfiguration(leaving
thedefaultnamethesameasthedataconfigurationforexpediencyonly).Click
theNextStepbuttononceagaintocontinueeditingtheanalysisconfiguration.

Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference121
InthepropertiespagefortheSPCCharacteristicsFreightcolumn,changethe
ChartTypetoIndividuals&MovingRange(asshownintheaboveillustration).
NootherSPCoptionsneedtobeconfigured,soselecttheRunoptionsnodeinthe
treeview,andselecttheShowSQLCriteriadialogcheckboxinthepropertiespage.

ThisoptionwillspecifythatSTATISTICApromptforfilteringonthosecolumnsthat
haveFilteroptionsinthedataconfiguration(if,whendefiningtheFilteroptions,
theyweresettoRequiredwhenfiltering,thisstepwouldnotberequiredasit
wouldalwaysforceafilteringpromptwhenrunninginthisexampleitwasnot
requiredtoforcefiltering).ClicktheCommitChangesbuttontosavethisanalysis
configurationtoSTATISTICAEnterprise.
7. Run the Analysis Configuration
ClosetheEnterpriseManager,andlogontoSTATISTICAastheTestUser1user
createdinStep1.SelecttheEnterprisetab,andintheEnterprisegroup,clickRun
Analysis/ReporttodisplaytheRunAnalysisorReportdialog(thisdialogmaybe
displayedautomaticallydependingonyourconfiguration).SelecttheTestExample
1analysis,andclicktheOKbutton;theSQLCriteriadialogwillbedisplayed.

Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


122STATISTICAQuickReference
ClicktheColumnarrow,andselectShipCountryfromthedropdownlist.Clickthe
browsebutton todisplaytheValueofShipCountrydialog,whichcontainsthe
listofavailableShipCountryvalues.SelectBrazilandclicktheOKbutton.

ClicktheFinishbuttontocompletethefilteringstep,extractthedata,andperform
apackagedanalysisontheFreightcolumn.

Custom User Interfaces


Notethatthissimpleexampleillustrateshowtoenableandrunananalysis
configurationusingthestandardSTATISTICAuserinterfaceandoutput
components.
However,oneofthemajorstrengthsofSTATISTICAEnterpriseistheeaseof
creatingcustomuserinterfaces(e.g.,fordifferentcategoriesofusersdepending
ontheirrolesintheorganization,expertise,ordataaccessprivileges).
Chapter2:StepbyStep Examples

Copyright StatSoft, 2011


STATISTICAQuickReference123
Youcaneasilycreateacustomizeduserinterfaceatanydegreeofcomplexity,
fromhighlysimplifiedones,e.g.,onethatcontainsonlythreeoptions:

toveryelaborateuserinterfacesofvirtuallyunlimitedflexibility:
Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


124STATISTICAQuickReference
PleaserefertotheSTATISTICAEnterprisedocumentation(ElectronicManual)for
moredetailsandexamples.
The STATISTICA Enterprise
Server Option
STATISTICAEnterpriseServerprovidesallofthefunctionalitydescribedinthis
exampleandalsoenablesoffloadingtaskstotheserverandremoteaccessviaa
browserinterface.

SeeAppendixBSTATISTICAEnterpriseServer,page263,formoreinformation.

Copyright StatSoft, 2011


STATISTICAQuickReference125
USER INTERFACE
General Features ................................................................................... 127
Multiple Analysis Support ..................................................................... 128
Three Alternative User Interfaces:
Interactive User Interface ................................................................ 130
STATISTICA Visual Basic and Controlling STATISTICA
from Other Applications .............................................................. 140
Web Browser-Based User Interface:
STATISTICA Enterprise Server .................................................... 141
Microsoft Office Integration ................................................................. 142
CHAPTER
3
3

Chapter2: StepbyStep Examples

Copyright StatSoft, 2011


126STATISTICAQuickReference

Copyright StatSoft, 2011


STATISTICAQuickReference127
USER INTERFACE
GENERAL FEATURES
Customized Operation
TheSTATISTICAsystemcanbecontrolledinseveralways.Thefollowingsections
summarizethefeaturesofthemainalternativeuserinterfacesofSTATISTICA:
1. Interactiveinterface(seepage130)
2. STATISTICAVisualBasic(seepage140)
3. Webbrowserbasedinterfaces(seepage141)
4. MicrosoftOfficeIntegration(seepage142)
However,notethat:
Manyaspectsoftheseuserinterfacesdonotexcludeeachother;thus,
dependingonyourspecificapplicationsandpreferences,youcancombine
them;
ThecustomizableQuickAccessToolbarandclassicmenuscanbeusedto
integratethealternativeuserinterfacesand,forexample,toprovidequick
accesstomacro(VisualBasic)programsorcommonlyusedfiles;and
Almostallfeaturesofthesealternativeuserinterfacescanbecustomized
(leadingtoadifferentappearanceandbehaviorofSTATISTICA);itis
generallyrecommendedthatyoucustomizeyoursysteminordertotake
fulladvantageofSTATISTICAspotentialtomeetyourpreferencesand
CHAPTER
3
3

Chapter3:UserInterface

Copyright StatSoft, 2011


128STATISTICAQuickReference
optimalrequirementsofthetasksthatyouneedtoaccomplish(see
CustomizationoftheInteractiveUserInterfaceonpage213).
Alternative Access to the Same
Facilities - Custom Styles of Work
Evenwithoutanycustomization,thedefaultsettingsofSTATISTICAoffer
alternativeuserinterfacemeansandsolutionstoachievethesameresults.This
alternativeaccessprinciplepresentineveryaspectofitsuserinterfaceenables
STATISTICAtosupportdifferentstylesofwork.Forexample,mostofthe
commonlyusedtoolscanbeaccessedalternatively:
Fromtheribbonbarortheclassicmenus
Viakeyboardshortcuts
Byusingtheclickablefieldsonthestatusbar
ViathecustomQuickAccesstoolbar(userdefinedtoolbarwithbuttonsand
specialcontrols,whichcanincludemacrosandcommands)
Fromtheshortcutmenusassociatedwithspecificobjects(cells,workbookicons,
partsofgraphs),whicharedisplayedbyrightclickingontheitem.
Itissuggestedthatyouexplorethealternativeuserinterfacefacilitiesof
STATISTICAbeforebecomingattachedtoonestyleoranother.
MULTIPLE ANALYSIS SUPPORT
Asmentionedbefore,youcanhaveseveralinstancesofSTATISTICAopenatthe
sametime.Eachofthemcanrunthesameordifferenttypesofanalyses
(traditionallycalledmodules),suchasBasicStatistics,MultipleRegression,ANOVA,
etc.Moreover,inoneSTATISTICAinstance,multipleanalysescanbeopen
simultaneously.Theycanbeofthesameoradifferentkind(e.g.,fiveMultiple
RegressionsandtwoANOVAs),andeachofthemcanbeperformedonthesameor
adifferentinputdatafile(multipleinputdatafilescanbeopenedsimultaneously).
Chapter3:UserInterface

Copyright StatSoft, 2011
STATISTICAQuickReference129
Individual analyses functional units of your work.Inordertofacilitate
takingadvantageofthismultitaskingfunctionality,yourworkwithSTATISTICAis
organizedintofunctionalunitscalledanalysesthatarerepresentedwithbuttons
ontheanalysisbaratthebottomoftheapplicationwindow(abovethestatusbar,
seethefollowingillustration,whereDescriptiveStatistics,ClusterAnalysis,and
CanonicalAnalysisarerunningsimultaneously).Consecutivebuttonsareaddedas
youstartnewanalyses.Avarietyofoptionsareprovidedtocontrol(and/or
permanentlyconfigure)thisaspectofSTATISTICA.

Bydefault,whenyouselectspecificoutputfromaresultsdialog,theoutput(a
spreadsheetoragraph)isdisplayedandthedialogisautomaticallyminimizedinto
itsrespectiveanalysisbuttonatthebottomofthescreen.Clickthatbutton(or
pressCTRL+R)todisplaythedialogagainandresumetheanalysis.
Aselectionofoptionspertainingtoanalysismanagementareavailableonthe
shortcutmenu(accessedbyrightclickingonananalysisbuttonontheanalysis
bar)relatedtoeachrespectiveanalysisbutton(asshownabove).
A useful hint for those with large screens. Ifyouhavealargescreen,youcan
turnoffthedefaultminimizationoftheanalysisdialogsandtakeadvantageofthe
factthatmostofthesedialogsaresmalland,thus,canremainontheworkspace
withoutinterferingwiththeviewingofanalysisresults.Youcanadjustthisoption
eitherforaparticularanalysis(cleartheAutoMinimizecommandontheanalysis
Chapter3:UserInterface

Copyright StatSoft, 2011


130STATISTICAQuickReference
buttonshortcutmenu,showninthepreviousimage),orgloballyfortheentire
program[selectAnalyses/GraphsinthetreepaneoftheOptionsdialog
(accessiblebyselectingtheToolstabandclickingOptions),andcleartheAuto
minimizedialogswhendisplayingoutputcheckbox].
WhenyourunmultipleanalysesandtheSTATISTICAworkspacebecomes
cluttered,youcanhideallwindowsrelatedtospecificanalyses(orclosethem
altogetherviatheanalysisbuttonshortcutmenucommandCloseAllAnalyses).
YoucanalsoopennewSTATISTICAinstances,whichoffersanothersimplewayto
organizeandmanageyour work.
INTERACTIVE USER INTERFACE
Overview
Main components of the interactive user interface of STATISTICA. Although
theinteractiveuserinterfaceofSTATISTICAisnottheonlyoneavailable(see
Chapter7CustomizingSTATISTICA,page213,andChapter8STATISTICAVisual
Basic,page219),inmostcasesitistheeasiestandmostcommonlyused.Many
componentsofthisuserinterfacecanbeseenintheSTATISTICAapplication
window.
First,similartomostsoftwareprograms,tabs,menubarsandvarioustoolbarsare
displayedatthetopofthewindow.Thesearecustomizableanddisplayedinthe
mostappropriatemannerforyourtasks.
Atthebottomofthewindow,theanalysisbar(containingminimized
analysis/graphdialogs)andthestatusbararedisplayed.Additionally,shortcut
menusareavailablewhenyourightclickinappropriateplaces.
Datafilescanbedisplayedinspreadsheets,workbooks,reports,orindividual
windows.Resultsspreadsheetsorgraphscanbedisplayedinworkbooks,reports,
orindividualwindows.Notethatadditionaldocuments(suchasWordorBitmap
images)canalsobedisplayedinspreadsheets,workbooks,orreports.Finally,
STATISTICAVisualBasiccodeisdisplayedinmacrowindows.
Normallyyouwouldnotsimultaneouslyseeallofthesefacilitiesandtoolsatone
time.YoualwayshavetheabilitytomaketheuserinterfaceofSTATISTICAas
Chapter3:UserInterface

Copyright StatSoft, 2011
STATISTICAQuickReference131
simpleorcomplexasyourparticularneedsandcomfortleveldemand(seepage
213).ThesevarioustoolsandfacilitiesaredescribedindetailintheElectronic
Manual(STATISTICAHelp).
Modules.WhileSTATISTICAoffersavarietyofstatisticalandgraphicalprocedures,
eachprocedurecanbeperformedinthesameinstanceofSTATISTICA.Thismeans
that,forexample,itispossibletocalculateresidualstatisticsusingoptionsinthe
MultipleRegressionmodule,thenimmediatelyusethatoutputintheFactor
Analysisoranotherexploratorymodulewithoutfirststartinganotherinstanceof
STATISTICA.Formoreinformationonusingresultsasinputdata,seeCanIUsethe
ResultsofMyAnalysistoPerformAnotherAnalysis?intheElectronicManual.
The Flow of Interactive Analysis
Startup Panel.WhenastatisticalprocedureisselectedfromtheStatistics,Data
Mining,orGraphstabs,itsrespectiveStartupPanelisdisplayed(asshownbelow;
BasicStatisticswasselectedfromtheStatisticstabBasegrouptodisplaythe
BasicStatisticsandTablesStartupPanel).

EachStartupPanelcontainsalistofthetypesofanalysesavailableinthat
particularmodule.Clickinganywhereoutsidethepanelautomaticallyminimizesit
asabuttonontheanalysisbar.Ifyoursystemincludesahighresolutionscreen,
youcanchangethisdefaultandkeeptheconsecutivedialogs(ineachanalysis
sequence)displayedontheworkspace.
Analysis specification and output selection (results) dialogs. Whenthe
desiredanalysisisselectedintheStartupPanel,theanalysisspecificationdialogis
displayed,inwhichyouselectthevariablestobeanalyzedandotheroptionsand
Chapter3:UserInterface

Copyright StatSoft, 2011


132STATISTICAQuickReference
featuresofthetasktobeperformed.Often,thesedialogscontainseveraltabsthat
grouptheoptions,analyses,and/orresultsinlogicalcategoriestomakeiteasierto
locatespecificfeatures.

Insomesimpleanalyses(suchasDescriptiveStatistics,shownintheillustration
above),theanalysisspecificationdialogalsoservesasanoutputselectiondialog
whereyoucanspecifythetypeandformatoftheoutput(e.g.,specific
spreadsheetsorgraphs).Mostanalyses,however,haveaseparateanalysis
specificationdialogandresultsdialog.
Spreadsheet facilities for scenario (what-if) analyses and customized
appearance.STATISTICAprovidesyouwiththecapabilitytoappend
supplementaryinformationaboutvariablemeasurementtypesandcasestatesto
yourspreadsheets.Thismetadatacanbeusedtocreateamorecomprehensive
descriptionofyourdataset,facilitatewhatiftypesofexploratoryanalyses,and
customizetheappearanceofcasesingraphs.
Case states and brushing.Youcanassigncasestatestocasesinorderto
customizetheappearanceofpointsingraphicaldisplays,thusmakingitveryeasy
toidentifyinfluentialandinterestingpoints.Awideselectionofsymbolsand
colorsisavailabletocustomizetheappearanceofselectedpoints.Notonlycan
casestatesbeassignedinthespreadsheetbeforeagraphiscreated,theycanalso
beassignedinteractivelyinthegraphviatheBrushingfacilities(accessibleby
clickingtheBrushingbutton intheCustomizeGraphgroupontheEdittab
whenagraphisdisplayed).Thecasestatesassignedinthegraphpropagateback
tothespreadsheet.Theabilitytoassigncasestatesineitherthespreadsheetor
graphfurtherfacilitatestheexploratoryvisualanalysisofdata.
Chapter3:UserInterface

Copyright StatSoft, 2011
STATISTICAQuickReference133
Measurement types and automatic variable pre-screening. Themodelingor
measurementtypeofavariablecanbeexplicitlydefinedinordertoindicatewhat
analysesandgraphsareappropriateforsuchavariable.Thesemeasurementtypes
willmapdirectlytosubsequentanalysesandgraphs,identifyingappropriate
variablesineachcase(e.g.,variablesoftypecategoricalwillbepresentwithinthe
listofcategoricalpredictorsavailableinaFactorialANOVA).

Inallvariableselectiondialogs(suchastheoneshownabove),theShow
appropriatevariablesonlyoptionisprovided,whichenablesyoutoprescreenor
filtervariablesaccordingtotheirMeasurementType(specifiedintheVariable
specificationdialog,accessiblebydoubleclickingonavariableheaderina
spreadsheet);ifthattypeisAuto,thentheAutomaticvariableprescreeningand
classificationoptions(locatedintheAnalysis/GraphoptionspaneoftheOptions
dialog,accessiblebyselectingtheToolstabandclickingOptions)determinehow
STATISTICAwillautomaticallydeterminetheMeasurementType.
Auto filtering (cloaking variables and cases).Filtering(accessiblebyselecting
theDatatabandclickingAutoFilterintheTransformationsgroup)isaquickand
easywaytodisplayaspecificportionofthedatainyourspreadsheetwithout
sortingthedataorcreatingasubset.Whenavariableisfiltered,onlythevalues
thatmeetthespecifiedcriteriaaredisplayedinthespreadsheet.Casesthatdonot
meetthecriteriaarehiddenfromsightbutnotremovedfromthespreadsheet
(e.g.,inthespreadsheetshownbelow,onlythecasesforGENDER=MALEare
displayed).

Althoughhidden,theyarestillavailableforstatisticalandgraphicalanalyses.
Chapter3:UserInterface

Copyright StatSoft, 2011


134STATISTICAQuickReference
Output.AsdescribedinmoredetailinChapter4FiveChannelsforOutputFrom
Analyses(page147)andasillustratedinExample1:Correlations(page11)and
Example2:ANOVA(page34),theconsecutiveoutputspreadsheetsandgraphsare
displayedinworkbooksbydefault.Theseworkbookscanbesavedandlater
reopened,makingiteasytoreturntospecificresultsasneeded.
Additionally,youcansendalloutputtoananalysisreport(seepage151),which
producesaneasilyorganized(viathereporttree),formatted,andprintedreportof
aspecificanalysis.Youcanalsochoosetosendallresults,regardlessofwhat
analysisitcomesfrom,toasinglereport.Lastly,theoutputcanbedirectedto
separatewindows.
Tospecifyoutputoptionsforasingleanalysisorsession,clickthe

button

intheanalysisorgraphspecificationdialogandselectOutputtodisplaythe
Analysis/GraphOutputManagerdialog.
Toaccessglobaloutputoptions,selecttheToolstab.ClickOptionstodisplaythe
Optionsdialog,andselectOutputManager.Or,selecttheHometabandclick
OptionsintheToolsgroup.Formoreinformation,seetheElectronicManual.
Features of Analyses
STATISTICAprovidesdirectaccesstoallstatisticalanalysesviatheStatisticstab:

andtheDataMiningtab:

andprovidesdirectaccesstoallgraphicalanalysisdialogsviatheGraphstab:
Chapter3:UserInterface

Copyright StatSoft, 2011
STATISTICAQuickReference135

Thesetabsareneverdisabled,i.e.,theyareavailablewheneveranyinputdata
documentisopen.
TheStatisticsandDataMiningtabsprovideaccesstoallavailableanalysistypes
withinSTATISTICA.TheGraphstabprovidesdirectaccesstoavarietyofcommonly
usedgraphtypes(e.g.,scatterplots,histograms,means/errorplots,etc.)aswellas
hierarchicalaccesstoallgraphtypesinSTATISTICAincluding2DGraphs,3D
SequentialandXYZGraphs,CategorizedGraphs,UserdefinedGraphs,BlockData
Graphs,InputDataGraphs,andMultiGraphLayouts.Comprehensivediscussions
ofallthevarioustypesofstatisticsandgraphsofferedbySTATISTICAareavailable
intheglossaryoftheElectronicManual.Seealso,AppendixC:STATISTICAFamily
ofProducts(page275)formoreinformationonallmembersofthecomprehensive
selectionofdataanalysisapplicationsfromtheSTATISTICAfamilyofproducts.
Using the analysis bar. TotakeadvantageofSTATISTICAsmultitasking
functionality(seeMultipleAnalysisSupport,page128),STATISTICAanalysesare
organizedasfunctionalunitsthatarerepresentedwithbuttonsontheanalysisbar
atthebottomoftheapplicationwindow(abovethestatusbar,seethenext
illustration,whereDescriptiveStatistics,ClusterAnalysis,andCanonicalAnalysis
arerunningsimultaneously).Consecutivebuttonsareaddedasyoustartnew
analyses.

Chapter3:UserInterface

Copyright StatSoft, 2011


136STATISTICAQuickReference
Minimizing dialogs (and a hint for users with large screens). Dependingon
yourpreferences,youcanchoosetominimizeallanalysisdialogswhenyouselect
anotherwindowinSTATISTICAoranotherapplication.BydefaulttheAuto
Minimizecommandisselected;however,whenyourscreenislargeenoughto
accommodateseveralwindows,itisrecommendedthatyouclearthiscommand.
Thiskeepstheanalysisdialogsonscreenwhiletherespectiveoutputcreatedfrom
thesedialogsisproduced,thusenablingyoutousethedialogsastoolbarsfrom
whichoutputcanbeselected.Seepage129forinformationonhowtoadjustthis
command.
Continuing analyses/graphs. Itiseasytocontinueananalysisorgraph(i.e.,to
changethefocustothecurrentdialogforaparticularanalysis).SelecttheTools
tab,clickAnalysisBar,andselectResumefromthedropdownmenu;orpress
CTRL+R;orclicktheanalysis/graphbuttonontheanalysisbar.Whenmultiple
analysesarerunning,youcanalsoselectthespecificanalysisfromtheSelect
Analysis/Graphsubmenu(asshowninthenextillustration).

Hiding windows. Tofurtherfacilitatetheorganizationofwindowsfromvarious


analyses,youcanhideallwindowsassociatedwithaparticularanalysiswhenthat
analysisisdeselected:selecttheToolstab,clickAnalysisBar,andfromthe
Optionssubmenu,selectHideonDeselect.Bydefault,thiscommandisnot
selected.Notethatthiscommandonlyapplieswhentheresultsaresentto
Chapter3:UserInterface

Copyright StatSoft, 2011
STATISTICAQuickReference137
individualwindows;seethediscussionoftheOutputManager(page147)for
moredetailsonmanagingoutputfromanalyses.Inaddition,thereisacommand
ontheHometabintheWindowsgrouptoclosealldocumentwindows:clickClose
All(orpressCTRL+Lonyourkeyboard),andacommandontheToolstabtocloseall
analyses:clickAnalysisBarandselectCloseAllAnalysesfromthedropdown
menu.
Bringing windows to the top. OntheToolstabclickAnalysisBar,andfromthe
OptionssubmenuselectBringtoToponSelecttoactivate(bringtothetopof
STATISTICA)allwindowsassociatedwithaparticularanalysiswhenthatanalysisis
selected,replacingwhateverdialogswereontop.Thiscommandalsofacilitates
theorganizationofindividualwindowsfromvariousanalyses.Bydefault,this
commandisselected.Notethatthiscommandonlyapplieswhentheresultsare
senttoindividualwindows;seethediscussionoftheOutputManager(page147)
formoredetailsonmanagingoutputfromanalyses.
Hiding the summary box. Bydefault,asummaryboxislocatedatthetopof
certainresultsdialogs(suchasMultipleRegressionResults)andcontainsbasic
summaryinformationabouttheanalysis.Youcanhideanindividualsummarybox
byclickingthe buttoninthelowerrightcornerofthesummarybox.Youcanalso
suppressthedisplayofallsummaryboxesgloballybyselectingtheToolstab,
clickingAnalysisBar,andselectingHideSummaryBoxfromtheOptionssubmenu.
Document Types
STATISTICAusessevenprincipaldocumenttypes:
Workbooks(seepages148and169)
Spreadsheets(multimediatables)(seepage173)
Reports(seepages151and180)
Graphs(seepages182and189)
Macros(STATISTICAVisualBasicprograms)(seepages183and219)
STATISTICAProjectFiles(seepage184)
DataMinerRecipesProjectFiles(seepage61)
Chapter3:UserInterface

Copyright StatSoft, 2011


138STATISTICAQuickReference
Usingthesesevendocumenttypes,youcanmanagedataofvarioustypes,
performdataentryandanalyses,generategraphsofthehighestquality,develop
customapplicationsofanydegreeofcomplexity,andcreatecustomformatted
reports.
Youcanquicklyaccessthemostrecentlyuseddocuments.ClicktheSTATISTICA
Startmenu

(inthelowerleftcornerofthescreen)andselectDocuments.

IntheGeneraloptionspaneoftheOptionsdialog(accessiblebyselectingthe
ToolstabandclickingOptions),youcanspecifyhowmanyrecentlyused
documentstodisplay(thedefaultis16).Formoredetailedinformationabouteach
documenttype,seetheoverviewsforworkbooks,spreadsheets,reports,graphs,
andmacrosonpage169;forfurtherinformation,seetheElectronicManual.
Tabs related to types of active document windows.Eachofthemaintypesof
STATISTICAdocumentwindows(seepage137)managesdatainadifferentway
and,thus,offersdifferentcustomizationandmanagementoptions.These
differencesarereflectedinthetabsthataccompanyeachtypeofwindow.Menu
commandsandbuttonsforeachofthemaintypesofdocumentsaredescribedin
detailintheElectronicManual.
Thetabsthatareavailablewhenworkbooksareopendependonthetypeof
documentthatiscurrentlyselectedintheworkbook.Therefore,whenyouare
Chapter3:UserInterface

Copyright StatSoft, 2011
STATISTICAQuickReference139
editingaspreadsheet,graph,report,ormacrowithinaworkbook,thetabs
relevantforthatdocumenttypeareavailable.Whenyouselectanemptynode
intheworkbooktreepane,bydefault,theWorkbooktabisdisplayed.
User-defined toolbars. Inadditiontothevarietyoftoolbarsprovidedonthe
STATISTICAclassicmenus(ontheribbonbar,clickthe iconintheupperleft
cornertodisplaytheclassicmenus),youcanalsocreateuserdefinedtoolbars.
ThesetoolbarscanincludeanycommandavailableinSTATISTICA,aswellasspecial
controls(i.e.,fontname,fontsize,graphstyles,etc.).Thetoolbarscanbegiven
anynameandcanbedesignatedtoopendependingontheactivedocumenttype.
Also,youcancustomizealltoolbars(includingexistingtoolbars)byadding
commandsandspecialcontrols.
Tocreateatoolbar(oreditanexistingone)usetheoptionsontheToolbarstabof
theCustomizedialog,accessiblebyselectingCustomizefromtheToolsmenu.
Customizingatoolbarisaseasyasdraggingcommandsfromthedialogtothe
toolbar,asshownintheillustrationbelow.

Shapesandlocationsoftoolbarscanbeeasilyadjusted(e.g.,alltoolbarscanbe
dockedorfreefloating).Alloftheseoptionsmakeitpossibleforyoutocreate
uniquetoolbarsthatprovideyouwithaveryspecializeduserinterface.The
ElectronicManualincludessimpletofollow,stepbystepinstructionsonhowto
Chapter3:UserInterface

Copyright StatSoft, 2011


140STATISTICAQuickReference
makecustomizations.Specifically,seeCreateaNewToolbarintheElectronic
Manualformoredetails.
TheQuickAccesstoolbarlocatedatthetopoftheribbonbarcanbecustomizedas
well;seeCustomizeQuickAccessToolbarintheElectronicManual.
User-defined menus. Customizingtheclassicmenusisequallyeasyandcanbe
performedusingtheMenutaboftheCustomizedialog(seetheElectronicManual
fordetails).
STATISTICA VISUAL BASIC
AND CONTROLLING STATISTICA
FROM OTHER APPLICATIONS
TheindustrystandardSTATISTICAVisualBasiclanguage(integratedinto
STATISTICA)providesanalternativeuserinterfacetotheentirefunctionalityof
STATISTICA,anditoffersincomparablymorethanjustasupplementary
applicationprogramminglanguagethatcanbeusedtowritecustomextensions.
STATISTICAVisualBasictakesfulladvantageoftheobjectmodelarchitectureof
STATISTICAandcanbeusedtoaccessprogrammaticallyeveryaspectandvirtually
everydetailofthefunctionalityofSTATISTICA.Eventhemostcomplexanalyses
andgraphscanberecordedintoVisualBasicmacrosandlaterberunrepeatedlyor
editedandusedasbuildingblocksofotherapplications.STATISTICAVisualBasic
addsanarsenalofmorethan14,000newfunctionstothestandard
comprehensivesyntaxofVisualBasic,thuscomprisingoneofthelargestand
richestdevelopmentenvironmentsavailable.FormoreinformationonSTATISTICA
VisualBasic,seeChapter8(page219).
Controlling STATISTICA from other applications. Oneofthefeaturesthat
makestheSTATISTICAVisualBasicenvironmentsopowerfulistheabilityto
integrateandmanipulatevariousapplicationsandtheirenvironmentswithina
singlemacro.Forexample,youcanrecordorwriteaSTATISTICAVisualBasic
programthatcomputespredictionsviatheSTATISTICATimeSeriesmoduleand
executethatprogramfromwithinanExcelspreadsheetoraWorddocument.The
exchangeofinformationbetweendifferentapplicationsisaccomplishedby
exposingthoseapplicationstotheVisualBasicprogramsasObjects.So,for
Chapter3:UserInterface

Copyright StatSoft, 2011
STATISTICAQuickReference141
example,youcanrunstatisticalanalysesintheSTATISTICABasicStatisticsmodule
fromaVisualBasicprograminExcelbydeclaringinsidetheprogramanobjectof
typeStatistica.Application.
Onceanobjecthasbeencreated,theVisualBasicprogramthenhasaccesstothe
propertiesandmethodscontainedinthatobject.Propertiescanbemostly
thoughtofasfunctions,methodscanbemostlythoughtofassubroutinesthat
performcertainoperationsorcomputationsinsidetherespectiveapplication
object.YoucancallSTATISTICAproceduresdirectlyfrommanyotherapplications
andprogramminglanguages(e.g.,C++,Java,andothers).
WEB BROWSER-BASED USER
INTERFACE: STATISTICA
ENTERPRISE SERVER
Inadditiontothetwobasictypesofuserinterfacesdescribedintheprevious
sections,theentireSTATISTICAfamilyofproductsalsooptionallyoffersabrowser
baseduserinterface,whereallinteractionswiththeapplicationinvolvingquerying
databases,datamanagementoperations,dataanalysis,ordatamining,aswellas
generatingreportsandcollaborativework,canbeperformedwithouthavingany
STATISTICAapplicationinstalledonthelocalcomputer,usingonlyabrowser.This
alternativeuserinterfacerequiresthataClientServerversionoftherespective
STATISTICAapplicationbeinstalled.
STATISTICAEnterpriseServerisahighlyscalable,enterpriselevel,fullyWeb
enableddataanalysisanddatabasegatewayapplicationsystemthatisbuilton
distributedprocessingtechnologyandfullysupportsmultitierClientServer
architectureconfigurations.STATISTICAEnterpriseServerexposestheanalytic,
query,reporting,andgraphicsfunctionalityofSTATISTICAthrougheasytouse,
interactive,standardWebinterfaces.Alternatively,itenablesusersofthedesktop
version(thickclient)tooffloadcomputationallyintensiveanalyticsanddatabase
operationstotheServer.Itisofferedasacomplete,readytoinstallapplication
withaninteractive,Internetbrowserbased(pointandclick)userinterface
(thinclient)thatmakesitpossibleforuserstointeractivelycreatedatasets,run
analyses,andreviewoutput.However,STATISTICAEnterpriseServerisbuiltusing
Chapter3:UserInterface

Copyright StatSoft, 2011


142STATISTICAQuickReference
openarchitectureandincludes.NETcompatibledevelopmentkittools(based
entirelyonindustrystandardsyntaxconventionssuchasVBScript,C++/C#,HTML,
Java,andXML)thatenablesITdepartmentpersonneltocustomizeallmain
componentsofthesystemorexpanditbybuildingonitsfoundations,for
example,byaddingnewcomponentsand/orcompanyspecificanalyticor
databasefacilities.
Asmentioned,STATISTICAServerisprovidedwithanInternetbrowserbaseduser
interface(intheformofsimpletonavigateandeasytousedialogs)enablingyou
tospecifyanalysesandreviewresults.However,toolsareprovidedtocustomize
thesedialogsandeasilysetupnewuserinterfacesortoaddnewfunctions.For
example,asimpledialogwithonlythreebuttonscanbecreatedinthebrowser,
andclickingeachbuttonwillrunaseriesofanalysesandgenerateadetailed
report.STATISTICAEnterpriseServerapplicationsaddanewdimensionandan
endlessarrayofpossibilitiestotheentirelineofSTATISTICADataAnalysis,Data
Mining,andQualityControl/SixSigmasoftware.
ThesystemiscompatiblewithallmajorWebserversoftwareplatforms(e.g.,UNIX
Apache,andMicrosoftIIS),worksinbothMicrosoft.NETandSun/Java
environments,anddoesnotrequireanychangestotheexistingfirewalland
Internet/Intranetsecuritysystems
Formoreinformation,pleaserefertoAppendixBSTATISTICAEnterpriseServer,
page263.
MICROSOFT OFFICE INTEGRATION
IfMicrosoftOfficeisinstalledonthesamemachineasSTATISTICA,Excel
spreadsheetscanbeopeneddirectlywithinSTATISTICAandusedasadatasource
foranalyses,andWorddocumentscanbeusedasadestinationforreports(see
page143;seealsopage154).
Excel as a data source.STATISTICAcanopenExceldocumentsintheSTATISTICA
workspacethroughthestandardOpendialog.WhenanExcelworkbookis
selected,adialogwillbedisplayedthatenablesyoutoimportthefileintoa
standardSTATISTICASpreadsheetortokeepthedocumentinExcelform,i.e.,as
anExcelwindowwithinSTATISTICA.
Chapter3:UserInterface

Copyright StatSoft, 2011
STATISTICAQuickReference143
OncetheExceldocumentisopened,youhaveaccesstoallthemenusandtoolbars
thatExcelsupports.Thus,youcaneditandupdateformulas,changethe
formatting,copy/paste,drag/dropeverythingthatyouwouldnormallydoifyou
werewithintheExcelapplication.
ThemainstrengthinExcelintegrationisthattheExceldocumentscanbeusedasa
datasourceforanalyses.SimplyhavetheExceldocumentwindowselectedwhen
startingananalysis,andtheanalysiswillsourcefromtheExceldocument.When
initiallyrunningtheanalysis,STATISTICAwilldisplayadialoginwhichyoucan
specifywhatrangeoftheExceldocumentshouldbeusedasthedatasourceandif
aparticularroworcolumnistobeusedasvariablenamesorcasenames.These
settingsareassignedtotheExceldocumentsoyouwillonlyneedtospecifythem
once.
NotonlycanSTATISTICAusetheExcelfileasadatasource,butautoupdatingcan
bespecifiedaswell.Ifyoucreateanautoupdatinggraphandthenchangethe
Excelfilebyenteringnewdataorreevaluatingformulas,thegraphwillalsobe
updated.
Word as a report destination.YoucanalsoopenandeditWorddocuments
withintheSTATISTICAworkspace.Worddocumentscanbeopenedusingthe
standardOpendialog,andwhenperformingstatisticalanalysesorcreatinggraphs,
outputcanbeingdirectedtoaWorddocument.Anyoutputthatcanbedirectedto
aSTATISTICAReportiscapableofbeingdirectedtoaWorddocument.
AswithExcel,whentheWorddocumentisopen,youhaveaccesstoallthe
toolbarsandmenusthataresupportedwithintheWordapplication.Youcan
performanyformattingandeditingthatWordsupportswithinitsapplication.
WhensendingspreadsheetanalyticalresultstoWord,STATISTICAwilltake
advantageofWordstableeditingfacilityandconvertthespreadsheetintoatable.
Formultipagespreadsheets,youcancontrolwheretobreaktherowsand
columns.Thesespreadsheetswillbebrokenbycolumnssuchaswillbeallowed
withoutexceedingthepagewidth.Allrowsforagivensetofcolumnswillbe
renderedbeforethenextsetofspreadsheetcolumnsisrenderedintheWord
document.ThissolutionenablesthepresentationofspreadsheetsinWordthat
arenativelyeditableinWord,displaytheentirecontentsofthespreadsheet,and
printandpaginatecorrectly.
Chapter3:UserInterface

Copyright StatSoft, 2011


144STATISTICAQuickReference

SIX CHANNELS FOR


OUTPUT FROM ANALYSES
Overview ................................................................................................. 147
1. STATISTICA Workbooks .................................................................... 148
2. Stand-Alone Windows ....................................................................... 150
3. Reports ............................................................................................... 151
4. Microsoft Word .................................................................................. 154
5. Output to the Web ............................................................................. 155
6. SharePoint or STATISTICA Document Management
System (SDMS) ........................................................................... 163
CHAPTER
4
4

Copyright StatSoft, 2011


STATISTICAQuickReference147
SIX CHANNELS FOR OUTPUT
FROM ANALYSES
OVERVIEW
Whenyouperformananalysis,STATISTICAgeneratesoutputintheformof
multimediatables(spreadsheets)andgraphs.Therearesixbasicchannelsto
whichyoucandirectalloutput:
1. STATISTICAWorkbooks(page148)
2. StandaloneWindows(page150)
3. Reports(page151)
4. MicrosoftWord(page154)
5. TheWeb(page155)
6. SharePointorSTATISTICADocumentManagementSystem(SDMS)(page
163)
Thefirstfouroutputchannelslistedabovearecontrolledbytheoptionsinthe
OutputManager(accessiblebyselectingOutputManagerfromtheStartbutton

dropdownmenulocatedintheupperleftcorneroftheribbonbar,seepage
23forfurtherdetailsonboththeglobalOutputManagerintheOptionsdialog
andtheAnalysis/GraphOutputManagerdialog).Thereareanumberofwaysto
outputtotheWeb,dependingontheversionofSTATISTICAyouhave.SharePoint
isaccessiblefromwithinSTATISTICA,andSDMSisanadditionalproductavailable
fromStatSoft.
CHAPTER
4
4

Chapter4:OutputfromAnalyses

Copyright StatSoft, 2011


148STATISTICAQuickReference
Thesemeansforoutputcanbeusedinmanycombinations(e.g.,aworkbookand
reportsimultaneously)andcanbecustomizedinavarietyofways.Also,alloutput
objects(spreadsheetsandgraphs)placedineachoftheoutputchannelscan
containotherembeddedandlinkedobjectsanddocuments,soSTATISTICAoutput
canbehierarchicallyorganizedinavarietyofways.EachoftheSTATISTICAoutput
channelshasitsuniqueadvantages,asdescribedinthefollowingsections.More
comprehensiveoverviewsofeachofthedocumenttypesassociatedwiththe
respectivechannelsofoutputareincludedinChapter5STATISTICADocuments
(page167).
The auto save and recovery features.AllSTATISTICAdocuments(i.e.,input
spreadsheets,workbooks,reports,andmacros)thataccumulatetheresultsof
yourwork(e.g.,dataentry,editing,oroutputcollection)overanextendedperiod
oftimesupporttheAutoSavefeature,whichisconfigurableintheGeneral
optionspaneoftheOptionsdialog(accessiblebyselectingtheToolstaband
clickingOptions).Thisfacilitywillautomaticallysavethecontentsofyourwork
periodically(e.g.,every10minutes)and,thus,giveyoutheoptiontoretrievedata
thatotherwisecouldbelostincaseofapoweroutageorasystemfailure.
1. STATISTICA WORKBOOKS
Workbooksarethedefaultwayofmanagingoutput(formoreinformation,see
page169).Eachoutputdocument(e.g.,aSTATISTICASpreadsheetorGraph,as
wellasaWordorExceldocument)isstoredasatabintheworkbook.
Documentscanbeorganizedintohierarchiesoffoldersordocumentnodes(by
default,oneiscreatedforeachnewanalysis)usingatreeview,inwhichindividual
documents,folders,orentirebranchesofthetreecanbeflexiblymanaged.

Chapter4:OutputfromAnalyses

Copyright StatSoft, 2011
STATISTICAQuickReference149
Forexample,selectionsofdocumentscanbeextracted(e.g.,dragcopiedordrag
moved)toareportwindowortotheapplicationworkspace(i.e.,theSTATISTICA
applicationbackgroundwheretheywillbedisplayedinstandalonewindows).
Entirebranchescanbeplacedintootherworkbooksinavarietyofwaysinorderto
buildspecificfolderorganization,etc.
Technicallyspeaking,workbooksareActiveXdocumentcontainers(seepage238
forinformationonActiveXtechnology,seealsotheElectronicManual).
Workbooksarecompatiblewithavarietyofforeignfileformats(e.g.,Office
documents)thatcanbeeasilyinsertedintoworkbooksandinplaceedited.
User notes and comments in workbooks.Workbooksofferpowerfuloptionsto
efficientlymanageevenextremelylargeamountsofoutput,andtheymaybethe
bestoutputhandlingsolutionforbothnovicesandadvancedusers.Itmight
appearthatonepossibledrawbackisthatusercomments(e.g.,notes)and
supplementaryinformationcannotbeastransparentlyinsertedintothestream
oftheworkbookoutputastheycanintraditional,wordprocessorstylereports,
suchasSTATISTICAReports(seethenextsection).However,notethat:
AllSTATISTICAdocumentscaneasilybeannotated,botha)directly,by
typingtextintographs,tables,andreports,andb)indirectly,byentering
notesintotheCommentsboxoftheDocumentPropertiesdialog(accessed
byselectingPropertiesfromtheStartbutton

dropdownmenulocated
intheupperleftcorneroftheribbonbar),and
Formatteddocumentswithnotesandcomments(intheformoftextfiles,
STATISTICAReportdocuments,WordPadorwordprocessordocuments,
etc.)caneasilybeinsertedanywhereinthehierarchicalorganizationof
outputinworkbooks.Moreover,suchsummarynotesorcomment
documentscanbemadenodesforgroupsofsubordinateobjectstowhich
thenoteisrelatedtofurtherenhancetheirorganization.
Saving workbooks as Web pages.Workbookscanbesavedas*.html(Web)files
byselectingSaveAsontheHometabintheFilegroupfromtheSavemenu,andin
theSaveAsdialog,choosingWebPage(*.htm;*.html)fromtheSaveastype
dropdownlist.SavingasaWebpagewillcreatean*.htmlfileinthespecified
directorythatcanbeopenedwithstandardinternetbrowserssuchasMicrosoft
Chapter4:OutputfromAnalyses

Copyright StatSoft, 2011


150STATISTICAQuickReference
InternetExplorer.WhensavingtheworkbookasaWebpage,STATISTICAalso
createsasubdirectorythatcontainsalltheimagesreferencedbytheWebpage.

TheWebpageoutputcontainsan.htmlbasedtreecontrolthatenablesyouto
navigateanddisplaythevariousworkbookimages,similartotheactualworkbook.
2. STAND-ALONE WINDOWS
STATISTICAoutputdocumentscanalsobedirectedtoaqueueofstandalone
windows;theQueueLengthcanbecontrolledintheOutputManageroptions
paneoftheOptionsdialog(accessiblebyselectingtheToolstabandclicking
Options).

Thecleardisadvantageofthisoutputmodeisitstotallackoforganizationandits
naturaltendencytocluttertheapplicationworkspace(someprocedurescan
generatehundredsoftablesorgraphswithaclickofthebutton).
Chapter4:OutputfromAnalyses

Copyright StatSoft, 2011
STATISTICAQuickReference151
Oneoftheadvantagesofthiswayofhandlingoutputisthatyoucaneasilycustom
arrangetheseobjectswithintheSTATISTICAapplicationworkspace(e.g.,tocreate
multiple,easytoidentifyreferencedocumentstobecomparedtothenew
output).However,notethatinordertoachievethateffect,youdonotneedto
configuretheoutputaheadoftimeandgeneratealargenumberof(mostly
unwanted)separatewindowsthatcancluttertheworkspace.Instead,individual,
specificoutputobjectsdirectedtoandstoredintheothertwochannels
(workbooksandreports)caneasilybedraggedoutfromtheirrespectivetree
viewsontotheapplicationworkspaceasneeded.
3. REPORTS
Whenperformingananalysis,theultimategoalistocreatemeaningfuloutputin
ordertogainanunderstandingofthedata.Themannerinwhichtheoutputis
producedisimportantaswell.STATISTICAoffersavarietyofmethodstoproduce
reportsthataccommodatethediverseneedsofusers.
STATISTICA Reports
STATISTICAReports(formoreinformation,seepage180)offeramoretraditionalway
ofhandlingoutputwhereeachobject(e.g.,aSTATISTICASpreadsheetorGraph,oran
Excelspreadsheet)isdisplayedsequentiallyinawordprocessorstyledocument.

However,thetechnologybehindthissimpleeditoroffersyouveryrich
functionality.Forexample,liketheworkbook(seeSTATISTICAWorkbooks,page
148),theSTATISTICAReportisalsoanActiveXcontainer(forinformationon
Chapter4:OutputfromAnalyses

Copyright StatSoft, 2011


152STATISTICAQuickReference
ActiveXtechnology,seepage238ortheElectronicManual)whereeachofits
objects(notonlySTATISTICASpreadsheetsandGraphs,butalsoanyotherActiveX
compatibledocuments,e.g.,Excelspreadsheets)remainsactive,customizable,and
inplaceeditable.
Theobviousadvantagesofthiswayofhandlingoutput(moretraditionalthanthe
workbook)aretheabilitytoinsertnotesandcommentsinbetweentheobjectsas
wellasitssupportforthemoretraditionalwayofquickscrollingthroughand
reviewingtheoutputtowhichsomeusersmaybeaccustomed(e.g.,theeditor
supportsvariablespeedscrolling).Also,onlythereportoutputincludesand
preservestherecordofthesupplementaryinformation,whichcontainsadetailed
logoftheoptionsspecifiedfortheanalyses(e.g.,selectedvariablesandtheir
labels,longnames,etc.,dependingonthelevelofsupplementaryinformation
specifiedintheOutputManager,seepage25).
Theobviousdrawback,however,ofthesetraditionalreportsistheinherentflat
structureimposedbytheirwordprocessorstyleformat,althoughthatiswhat
someusersorcertainapplicationsmayfavor.
Reports from Workbooks
WhenyouhaveaSTATISTICAWorkbookcontaininganalysesoutput,youmay
decideyouwanttotransferittoareport.
OpenaSTATISTICAWorkbookandselectallofthefiles,i.e.,selectthefirstfile,
presstheSHIFTkeyonyourkeyboard,andselectthelastfile.Then,clickAddto
ReportontheHometabintheOutputgroup.Allthefilesintheworkbookwillbe
duplicatedinaSTATISTICAReport.
RTF (Rich Text Format) Reports
RTF(RichTextFormat)isaMicrosoftstandardmethodofencodingformattedtext
andgraphicsforeasytransferbetweenapplications.Whenreportsaresavedin
RichTextFormat(*.rtf),allfileformattingispreservedsothatitcanbereadand
interpretedbyotherRTFcompatibleapplications(e.g.,Word).
Chapter4:OutputfromAnalyses

Copyright StatSoft, 2011
STATISTICAQuickReference153
TheSTATISTICAReportformat(.str)adherestoRTFconventions;however,saving
reportsinthedefaultSTATISTICAReportformatensuresthatthereportswillbe
openedinSTATISTICA,givingyoucompleteaccesstothereporttree.
InordertoopenaSTATISTICAreportinanRTFcompatibleapplication,openthe
report,selecttheHometab,clicktheSavearrow,andselectSaveAsfromthe
dropdownmenutodisplaytheSaveAsdialog.FromtheSaveastypedropdown
list,selectRichTextFiles(*.rtf),enteranameintheFilenamefield,andclickthe
Savebutton.YoucanthenopenthefileinanyRTFcompatibleapplication.
Acrobat (PDF) Reports
PDFistheacronymforPortableDocumentFormat;itistheindustrystandard
formatforstoringtextualandgraphicaldata.PDFoffersagraphicallyrich
appearanceandstructurethatmakesitidealforpresentationpurposes.
Additionally,PDFdocumentscanbeviewedinbothimageandtextualmode,i.e.,
youcaneitherselectdataasaformattedimageorasregulartext.
PDFisplatformindependent,andmostoperatingsystemsofferfreePDFviewing
applications(e.g.,AdobeAcrobatonWindowsandGhostscriptonLinux).
PDFhasbeenapprovedasanacceptabledocumentstorageformatforregulated
environmentsaccordingtotheFDAs21CFRPart11.
TosaveaSTATISTICAReportasaPDFfile,openthereport,selecttheHometab,
andthenselectSaveAsPDFfromtheSavemenu.TheOutputOptionsdialogwill
bedisplayed,whereyoucanchoosewhethertooutputspreadsheetsasObjects
(astheyaresizedinthereportwindow)orFullsizedSpreadsheets(onseparate
pages).Ifyoualwayswanttooutputspreadsheetsinthesamemanner,selectthe
Usethecurrentsettinganddonotdisplaythisdialogagaincheckbox.Clickthe
OKbuttontoclosetheOutputOptionsdialoganddisplaytheSavereportasPDF
dialog.UsetheSaveinfieldtoselecttheappropriatelocationinwhichtosavethe
document,enteranameintheFilenamefield,andclicktheSavebutton.
STATISTICAReports,Spreadsheets,andGraphscanallbesavedinPDFformat.
NotethatthesearenotsimplifiedPDFfiles(representingcompressedbitmapsof
therespectivedocumentpageimages)butfullfeaturedPDFfilesthatsupport
suchoperationsasselectivecopyingoftextinformation.
Chapter4:OutputfromAnalyses

Copyright StatSoft, 2011


154STATISTICAQuickReference
HTML Reports
YoumaywanttopostaSTATISTICAReportorWorkbookontheInternetforothers
toreview.WithSTATISTICA,youcansavereportsandworkbooksinHTML(an
acronymforHyperTextMarkupLanguage)format.HTMLusestagstoidentify
elementsofthedocument,suchastextorgraphics.
OpenaSTATISTICAReportorWorkbook,andselectSaveAsfromtheSavemenu
(locatedontheHometabintheFilegroup)todisplaytheSaveAsdialog.Fromthe
Saveastypedropdownlist,selectWebPage(*.html;*.htm)tosavethefilewith
an*.htmextension.
Notethatgraphsinthereportorworkbookaresavedas*.pngfilesinthesame
folderastheHTMfile.YoucansavegraphsasJPGfiles,instead.Todothis,click
Options(ontheHometabintheToolsgroup)todisplaytheOptionsdialog.Select
eitherReportsorWorkbooksinthetreeview,accordingtowhichdocumentyou
intendtosendtoan.htmdocument,selecttheJPEGformatoptionbuttoninthe
ExportHTMLimagesasgroupbox,andclickOK.
4. MICROSOFT WORD
WithSTATISTICA,youcanalsorouteoutputdirectlytoWordviatheOffice
Integrationfeatures.WhenWordisopenwithinSTATISTICA,Wordtoolbarsand
menusarealsoavailablethroughstandardActiveXDocumentinterfaces
technology.InSTATISTICA,youcanperformanyformattingandeditingthatWord
supportsinitsapplication.
WhensendingspreadsheetanalyticalresultstoWord,STATISTICAwilltake
advantageofWordstableeditingfacility,andconvertthespreadsheettoatable.
Formultipagespreadsheets,youcancontrolwheretobreaktherowsand
columns.Thesespreadsheetswillbebrokenbycolumnssuchaswillbeallowed
withoutexceedingthepagewidth.Allrowsforagivensetofcolumnswillbe
renderedbeforethenextsetofspreadsheetcolumnsisrenderedintheWord
document.ThissolutionenablesthepresentationofspreadsheetsinWordthat
Chapter4:OutputfromAnalyses

Copyright StatSoft, 2011
STATISTICAQuickReference155
arenativelyeditableinWord,displaystheentirecontentsofthespreadsheet,and
printsandpaginatescorrectly.
AswithstandardSTATISTICAReports(seepage151),Worddocumentscanstore
andpreservetherecordofsupplementaryinformation(e.g.,selectedvariables,
longnames,etc.).
TosendoutputtoaWorddocument,usetheoptionsintheOutputManager
(accessiblebyselectingOutputManagerfromtheStartbutton

dropdown
menulocatedintheupperleftcorneroftheribbonbar;orbyselectingtheHome
tab,clickingOptionsintheToolsgroup,andselectingOutputManagerinthe
Optionsdialogtreeview).IntheMicrosoftWordOutputdropdownlist,select
eitherMultipleWorddocuments(oneforeachanalysis/graph),CommonWord
document(onesharedforallanalyses/graphs),or[SelectFile]tobrowsetoa
preexistingWorddocument.
AlthoughWorddocumentsdonotprovidethenavigationaltreeofaSTATISTICA
WorkbookorReport,theadvantagesinsendingoutputtoWorddocumentsare
many.BysendingresultstoaWorddocument,youhaveallthewordprocessing
featuresofWordavailable.Forexample,youcanattachtemplatestocreate
customizeddocuments,addtablesofcontentandindices,trackchanges,etc.
WheninsertingalargespreadsheetintoaWorddocument,STATISTICA
automaticallydetectshowmanyvariablescanfitoneachpageandpartitionsthe
spreadsheetintoseveralWordtables.Ifthespreadsheetusescasenames,those
nameswillbethefirstcolumnineachtable.
AdditionalbenefitsofsendingresultstoaWorddocumentincludeincreased
printingfunctionality(e.g.,printingtofiles,manualduplex)andtheabilitytosave
resultsasWebpages.
5. OUTPUT TO THE WEB
Knowledge Portal
STATISTICAEnterpriseServerReports,oranySTATISTICAReports(seeHTML
Reports,page154),canbedistributedthroughtheKnowledgePortal.The
Chapter4:OutputfromAnalyses

Copyright StatSoft, 2011


156STATISTICAQuickReference
KnowledgePortalenablesyoutopublishSTATISTICAdocuments(spreadsheets,
graphs,reports,orworkbooks)totheInternet.UserswithlimitedKnowledge
Portalpermissionscanthenviewthosedocuments.Youcancontrolwhocan
accessthesedocumentsbysettingpermissionsonthedocumentsanddirectories
usingstandardSTATISTICAEnterpriseServerrepositorytools.
TopublishcontentintheKnowledgePortal,firstcreateadirectoryinthe
STATISTICAEnterpriseServerrepositoryinthePortalfolder:logontothe
STATISTICAEnterpriseServerasauserwithAdministratorrights,andfromtheFile
menu,selectMyDirectoryOperationstodisplaytheMyDirectorydialog;the
contentwilllooksimilartothefollowingillustration.

TocreateafolderinthePortaldirectorytocontainyourreports,selectthePortal
folder,andthenclicktheCreatebuttontodisplaytheExplorerUserPrompt
dialog.Intheeditfield,enterthenewdirectorynameofSamplePortalFolder,and
clickOK.Adialogwillbedisplayedconfirmingthatthedirectory/Portal/Sample
PortalFolderwascreated.ClicktheShowMyDirectorybutton,andyouwillbe
returnedtotheMyDirectorydialog.SelecttheShowEmptyDirectoriescheckbox,
andthenclicktheRefreshbutton.ExpandthePortaldirectorybyclickingthe+
nexttothatfolder,andthenewSamplePortalFolderwillbedisplayed.
Chapter4:OutputfromAnalyses

Copyright StatSoft, 2011
STATISTICAQuickReference157

Notethatyoucancontrolwhocanreadandwritetothisfolderbyselectingthe
SamplePortalFolder,clickingtheSecuritybutton,andusingtheoptionstosetthe
userandgrouppermissionsforthefolderappropriately.
Publishing Content from STATISTICA
Enterprise Server
Nowthatthefolderhasbeencreated,youcanaddanalysisresultstoitforPortal
userstoviewusingeitherSTATISTICAEnterpriseServerorSTATISTICA.
InSTATISTICAEnterpriseServer,startwithatypicalanalysis.FromtheFilemenu,
selectOpenDataSpreadsheet.IntheSelectDataSourcedialog,selectthe
Datasetsfolderintheleftpane,selectthedatafileAdstudy.staintherightpane,
andclickOK.

Chapter4:OutputfromAnalyses

Copyright StatSoft, 2011


158STATISTICAQuickReference
ClosetheresultingSpreadsheetEditorwindow(wewontneeditinthisexample),
leavingjustthebrowserwindowdisplayingtheactivedatasourcesummary
informationforAdstudy.sta.
FromtheStatisticsBasicStatisticsandTablessubmenu,selectDescriptive
StatisticstodisplaythevariableselectiondialogandtheDescriptiveStatistics
specificationsdialog.Inthevariableselectiondialog,selectMEASURE01and
MEASURE02intheContinuousvariablescolumn.

IntheDescriptiveStatisticsspecificationsdialog,selectAllresultsintheDetailof
computedresultsreportedfield.

ClickOKtodisplaytheresultsforthisanalysis,consistingofseveralspreadsheets
andgraphs.
Chapter4:OutputfromAnalyses

Copyright StatSoft, 2011
STATISTICAQuickReference159

Now,topublishthispagesothatotheruserscanseeitfromtheKnowledgePortal,
clickthePublishbuttonintheupperrightportionofthewindow.ThePublish
Destinationdialogwillbedisplayed.HereyoucanselecttheSamplePortalFolder
thatyoucreated.Youalsocancontrolwhocanhaveaccesstothisparticularpage
byselectingtheIwanttodefinewhocanaccessthisoutputpagecheckbox.

ClicktheNextbutton,andthepagewillbesavedtotheselecteddestination.
Chapter4:OutputfromAnalyses

Copyright StatSoft, 2011


160STATISTICAQuickReference
Now,whenaKnowledgePortaluserlogson,theywillseethenewSamplePortal
Folderintheiroutputbrowser,fromwhichtheycanselectthenewlyadded
DescriptiveStatisticspage.
Publishing Content from
STATISTICA Desktop Applications
WiththeSTATISTICAEnterpriseServerintegrationfeatureofdesktopSTATISTICA,
youcanalsopublishSTATISTICAdocuments(spreadsheets,graphs,reports,and
workbooks)totheKnowledgePortaldirectlyfromwithintheSTATISTICA
application.
ThefirststepistoenableSTATISTICAEnterpriseServerintegration.SelecttheHome
tab,andintheToolsgroupclickOptionstodisplaytheOptionsdialog.Select
Server/Webinthetreeview,andintheoptionspane,selecttheEnableSTATISTICA
EnterpriseServerIntegrationcheckbox.Then,specifytheURLoftheSTATISTICA
EnterpriseServerandanyoptionalcustomconfigurationsettingsthatmayhave
beendefinedbyyoursystemadministratorwheninstallingSTATISTICAEnterprise
Server.Inthefollowingillustration,STATISTICAEnterpriseServerhasbeeninstalled
onserverx23;theinformationinyourdialogwillbedifferentdependingonwhere
STATISTICAEnterpriseServerisinstalledonyournetwork.

Chapter4:OutputfromAnalyses

Copyright StatSoft, 2011
STATISTICAQuickReference161
AfteryouclicktheOKbuttonintheOptionsdialog,notethatthereisanowa
ServertabdisplayedinSTATISTICAnexttotheHometab.Theonlycommandon
theServertabthatisavailableinitiallyisLogIn;selectthatcommand.Ifyouhave
enabledintegratedlogin(andyourWindowsaccountisenabledonthe
STATISTICAEnterpriseServer),youwillbeloggedinautomatically.Otherwise,you
willbepromptedforaSTATISTICAEnterpriseServerusernameandpassword.
Onceyouhaveloggedin,theothercommandsareavailableontheServertab.
Now,wewillcreateananalysisanduploadtheresultstotheKnowledgePortal.
OpentheAdstudy.stadatafile:selecttheHometab,clicktheOpenarrow,and
selectOpenExamplesfromthedropdownmenu;intheOpenaSTATISTICAData
Filedialog,doubleclickontheDatasetsfolder,andthendoubleclickonthe
Adstudy.stafiletoopenthatspreadsheetforuseinSTATISTICA.
Next,selecttheStatisticstab,andintheBasegroup,clickBasicStatisticsto
displaytheBasicStatisticsandTablesStartupPanel.SelectDescriptivestatistics.

ClickOKtodisplaytheDescriptiveStatisticsdialog.

Chapter4:OutputfromAnalyses

Copyright StatSoft, 2011


162STATISTICAQuickReference
Toensurethatalltheoutputfromthisanalysiswillbesenttoaworkbook,clickthe
Optionsbuttonontherightsideofthedialog,andfromthedropdownlist,select
Output.IntheAnalysis/GraphOutputManager,verifythattheWorkbookoption
buttonisselectedinthePlaceallresults(Spreadsheets,Graphs)ingroupbox.
ThenclickOKtoreturntotheDescriptiveStatisticsdialog.
ClicktheVariablesbuttontodisplaythevariableselectiondialog,select
MEASURE01andMEASURE02,andclickOKtoreturntotheDescriptiveStatistics
dialog.OntheQuicktab,clicktheSummary:Statisticsbuttontosendthoseresults
totheworkbook.TheDescriptiveStatisticsdialogwillbeminimizedsoyoucansee
theresults;restoreitbyclickingtheDescriptiveStatisticsbuttonontheAnalysis
Barinthelowerleftofthescreen.NowclicktheHistogramsbuttontogenerate
histogramsforeachselectedvariable.Theanalysisdialogisminimizedagain,and
theworkbookshouldlookasfollows.

ThisisthedocumentwewanttopublishtotheKnowledgePortal.OntheServer
tabintheFilegroup,clickSaveAs.TheSTATISTICAEnterpriseRepositorydialog
willbedisplayed,containingalistoffoldersyoucanreferenceintheSTATISTICA
EnterpriseServer.OpenthePortalfolder,selectSamplePortalFolder,andclickthe
OKbutton.ThiswilluploadtheworkbooktothatKnowledgePortaldirectory.

YoucanreviewthedocumentfromwithinSTATISTICAbyopeningabrowser
windowinsideoftheSTATISTICAworkspace.OntheServertabintheToolsgroup,
Chapter4:OutputfromAnalyses

Copyright StatSoft, 2011
STATISTICAQuickReference163
selectOpeninBrowser,andanewbrowserwindowwillbeopened,allowingyou
tologontotheSTATISTICAEnterpriseServer.
FromtheSTATISTICAEnterpriseServerFilemenu,chooseMyDirectory
Operations;inMyDirectory,youcannavigatetotheSamplePortalDirectory,and
seetheWorkbook1.stwfilethatwasuploaded.SelectthisfileandclicktheView
button,andtheworkbookwillbeopenedwithinthebrowser.

6. SHAREPOINT OR STATISTICA
DOCUMENT MANAGEMENT SYSTEM
(SDMS)
WithSTATISTICA,youcanalsorouteoutputtoeitherMicrosoftSharePointorto
theSTATISTICADocumentManagementSystem(SDMS).
SharePoint
WithSTATISTICASharePointintegration,youcanopen,checkout,checkin,and
uploadnewSTATISTICAfilestoSharePoint.
ToopenadocumentinSTATISTICAthatislocatedinSharePoint,selecttheHome
tab.ClicktheOpenarrow,andselectOpenDocument.IntheOpendialog,inthe
Lookindropdownlist,selecttheWebFoldertotheSharePointserverlocation
Chapter4:OutputfromAnalyses

Copyright StatSoft, 2011


164STATISTICAQuickReference
(seepage165),andthennavigatetothedocumentyouwanttoopen.Youwill
needtologontoSharePoint.
TosaveaSTATISTICAdocument(spreadsheet,workbook,macro,etc.)to
SharePoint,selecttheHometab.ClicktheSavearrow,andselectSaveAs.Inthe
SaveAsdialog,intheSaveindropdownlist,selecttheWebFoldertothe
SharePointserverlocation,andthennavigatetothelocationinwhichyouwantto
savethedocument.YouwillneedtologontoSharePoint.
TheSharePointoptionsCheckOut,CheckIn,andDiscardarelocatedontheHome
tabintheSharePointgroup.

TheseoptionscanalsobeaccessedbyclickingtheStartbuttonlocatedinthe
upperleftcorneroftheribbonbar.Theseoptionsbecomeavailableafteryouhave
openedadocumentfromSharePoint.
Chapter4:OutputfromAnalyses

Copyright StatSoft, 2011
STATISTICAQuickReference165

Beforeusingtheseoptions,youmustfirstcreateaWebFoldertotheSharePoint
serverlocation.Todothis,clicktheStartbuttoninthelowerleftcornerofthe
Windowstaskbar,andclickComputer.Rightclickinanyopenareaintheright
paneoftheComputerdialog,andfromtheshortcutmenu,selectAddanetwork
locationtodisplaytheAddNetworkLocationdialog.ClicktheNextbutton.
DoubleclickChooseacustomnetworklocation.IntheInternetornetworkaddress
field,entertheWebaddressofyourSharePointlocation:https://sharepoint...,or
clicktheBrowsebuttontobrowsetoandselectthelocation.ClickNext.
LogontoSharePoint,andclickOK.EnteranamefortheWebFolderintheTypea
nameforthisnetworklocationfield,andclickNext.YouwillseeCompletingthe
AddNetworkLocationWizard;selecttheOpenthisnetworklocationwhenIclick
Finishcheckbox,andthenclickFinish.ANetworkLocationWebFolderhasbeen
createdintheNetworkLocationsectionofComputerwiththelabelyouchose.
STATISTICA Document Management
System (SDMS)
STATISTICADocumentManagementSystem(SDMS)isacompletedatabase
solutionpackageformanagingdocuments.SDMSenablesyoutoquickly,
efficiently,andsecurelysavedocumentsofanytypetoasecurerepository
database,andthenmanagethem[e.g.,findthem,accessthem,searchfor
content,review,organize,edit(withtrailloggingandversioning),approve,etc.].
Chapter4:OutputfromAnalyses

Copyright StatSoft, 2011


166STATISTICAQuickReference

TheintuitiveuserinterfaceofSDMSmakesiteasytoperformalldocument
managementoperationsfromanycomputeronyournetworkorevenviathe
Internet.
IntheSTATISTICADocumentManagementSystem,everythingisdocumentedand
traceable.Forexample,documentsareneverdeleted.Whenadocumentisedited,
anewversionofthatdocumentiscreated,properlyauthenticated,andannotated
withelectronicsignatures.Authorizeduserscanberequiredtoexplicitlycheckout
thedocumentsfromtherepositoryandcheckthenewversionsintotherepository
withnotesanddocumentationregardingthenatureandpurposeoftheedits.
SDMSisspecificallydesignedtoensurecompliancewithFDA21CFRPart11
regulationsandSarbanesOxleylegislation,aswellasISO9000,9001,14001
documentationrequirements.
STATISTICADocumentManagementSystemseamlesslyintegrateswithall
STATISTICAproducts,fromdesktopandnetworkversions,toenterprisewide
installationssuchasSTATISTICAEnterpriseServerbasedworldwideinstallationsor
STATISTICAEnterprise/QC(forprocessanalysisandqualitycontrol/improvement).
SDMScanalsobeusedasastandalonesystem.
SDMSishighlyconfigurable,anditsfunctionalityiscompatiblewithotherapplications,
sothesystemcanbecustomizedtoaccommodateyourspecifictasksandcanbe
integratedseamlesslyintoexistingsystemsfordataanddocumentmanagement.

STATISTICA
DOCUMENTS
Workbooks ............................................................................................. 169
Spreadsheets (Multimedia Tables) ...................................................... 173
Reports ................................................................................................... 180
Graphs .................................................................................................... 182
Macros (STATISTICA Visual Basic Programs) ..................................... 183
STATISTICA Projects .............................................................................. 184
CHAPTER
5
5

Chapter5:STATISTICADocuments

Copyright StatSoft, 2011


168STATISTICAQuickReference

Copyright StatSoft, 2011


STATISTICAQuickReference169
STATISTICA DOCUMENTS
WORKBOOKS
Workbooks(introducedbrieflyonpage148)arethedefaultwayofmanaging
output.Theystoreeachoutputdocument(e.g.,aSTATISTICASpreadsheetor
Graph,aswellasaWordorExceldocument)asatab.

Technicallyspeaking,STATISTICAWorkbooksareoptimizedActiveX(seepage238)
containersthatcanefficientlyhandlelargenumbersofdocuments.The
documentscanbeorganizedintohierarchiesoffoldersordocumentnodes(by
default,oneiscreatedforeachnewanalysis)usingatreeview,inwhichindividual
documents,folders,orentirebranchesofthetreecanbeflexiblymanaged.
Forexample,selectionsofdocumentscanbeextracted(e.g.,dragcopiedordrag
moved)tothereportwindowortotheapplicationworkspace(i.e.,theSTATISTICA
applicationbackgroundwheretheyaredisplayedinstandalonewindows).
CHAPTER
5
5

Chapter5:STATISTICADocuments

Copyright StatSoft, 2011


170STATISTICAQuickReference
Entirebranchescanbeplacedintootherworkbooksinavarietyofwaysinorderto
buildaspecificfolderorganization,etc.
Eachworkbookcontainstwopanels:anExplorerstylenavigationtreeontheleft
andadocumentviewerontheright.
Thenavigationtree(workbooktree)canbesplitintovariousnodesthatareused
toorganizefilesinlogicalgroupings(e.g.,allanalysisoutputsorallmacroscreated
foraproject).
Tabsatthebottomofthedocumentviewer(workbookviewer)areusedtoeasily
navigatethechildrenofthecurrentlyselectednode.Youcanmovethetabstothe
top,right,orleftoftheworkbookviewerbyrightclickingononeofthetabsand
selectingadifferentlocationfromtheshortcutmenu.Oneadvantageoftheside
placementoftabsisthatmultiplerows(ratherthanonelongrow)areprovided(as
shownbelow).Thismakesiteasytoselectthedesiredtab.

Displayingtabscanalsobesuppressedtosavespace.UnlikemanyExplorerstyle
navigationandorganizationapplicationsthatonlyallowfolderstohavechildren,
theSTATISTICAWorkbookallowsanyiteminthetreetohavechildren.For
example,youcanaddaspreadsheettoyourworkbook,andthenaddallthe
graphsproducedusingthedatainthespreadsheetaschildrentothespreadsheet.
AvarietyofdraganddropfeaturesandClipboardproceduresareavailabletoaid
youinorganizingtheworkbooktree.
TheworkbookcanholdallnativeSTATISTICAdocumentsincludingspreadsheets,
graphs,reports,andmacros.ItcancontainothertypesofActiveXdocumentsas
well,includingExcelspreadsheets,Worddocuments,andothers.Ifyouwantto
Chapter5:STATISTICADocuments

Copyright StatSoft, 2011
STATISTICAQuickReference171
editthesedocuments,youcandosousingtheworkbookviewerpane.Toedita
Worddocument,doubleclickontheobjectintheworkbooktree.TheWord
documentopensintheviewer,andtheworkbookmenubarmergeswiththe
Wordmenubargivingyouaccesstoalloftheeditingfeaturesyouneed.
Workbookscanalsobeusedtostorealloutputfromaparticularanalysis.
Navigating the Workbook Tree
Theworkbooktreedisplaystheorganizationoffilesandfoldersintheworkbook,
displayedinanExplorerstyleformat.Itemswithplussignsnexttothemindicate
foldersorfilesthathavechildrenassociatedwiththem.Toexpandthetreefora
particularfolderorfile,clicktheplussignnexttoit.Theworkbookcansupportan
unlimitednumberoflevels,andindividualitemsfromthetreevieworentire
branchescanbeflexibly(interactively)managed(e.g.,draggingtocopyormove
betweenworkbooksorreports,etc.,orviatheshortcutmenu,asshownbelowin
thesecondimage).

Toselectaworkbookitemforrevieworediting,simplylocatethefileinthe
workbooktreeandclickonitsassociatedicon.Thedocumentwillbedisplayedin
theworkbookviewerpane.Notethatyoucanalsonavigatethroughthechildren
ofthecurrentlyselectednodeusingthenavigationtabsavailable(bydefault)at
thebottomoftheworkbookviewer.Youcaneasilymovethesenavigationtabsto
thetop,right,orleftoftheworkbookviewerbyrightclickingononeofthetabs
andselectingadifferentlocationfromtheshortcutmenuorselectingthe
appropriatecommandfromtheWorkbooktab,Toolsgroup,TabControlmenu.
Chapter5:STATISTICADocuments

Copyright StatSoft, 2011


172STATISTICAQuickReference
Notethattabsatthetopandbottomoftheviewerscrollsideways,whilemultiple
rowsoftabsareusedwhentabsareplacedtotheleftorrightoftheviewer.
Itemsinthetreeareidentifiedbytheiconnexttothem.The foldericon
representsafolderthatcancontainavarietyofdocumentsandsubfolders.The
foldericonwitharedarrowonitindicatesthatthescriptthatproducedthe
resultsinthatfolderhasbeenattachedtothefolder.ThisenablesSTATISTICAto
rerunorresumetheanalysis(formoredetails,seeChapter8STATISTICAVisual
Basic).The spreadsheet, report, macro,and graphiconsrepresent
STATISTICASpreadsheet,Report,Macro,andGraphdocuments,respectively.The
DataMinericonrepresentsaDataMinerworkspace.
AllnonSTATISTICAdocumentsarerepresentedbytheirrespectivedocument
icons.Forexample,Worddocumentsarerepresentedbythe Wordicon,and
Excelspreadsheetfilesarerepresentedbythe Excelspreadsheeticon.
Commandsforinserting,extracting,renaming,andremovingitemsfromthe
workbooktreeareavailablefromtheworkbooktreeshortcutmenu(accessedby
rightclickinganywhereinthe tree).

ThesecommandsarealsoaccessibleontheWorkbooktab.
Theworkbooktreecanbeorganizedandmodifiedusingdraganddropfeatures
(aswellasClipboardprocedures).Usekeysonyourkeyboardtospecifywhether
anitemistobemovedorcopied,andwhetheranitemistobeinsertedasachild
(i.e.,onelevelbelow)orasasibling(i.e.,onthesamelevel).
Chapter5:STATISTICADocuments

Copyright StatSoft, 2011
STATISTICAQuickReference173
Thefollowingtableillustratesfourdraganddropoptions:
Action Key Press Cursor Effect
MoveChild (none)
Movethefirstselecteditemonelevelbelow
thesecondselecteditem.
MoveSibling SHIFT

Movethefirstselecteditemdirectlybelow
andonthesamelevelasthesecond
selecteditem.
CopyChild CTRL

Copythefirstselecteditemonelevelbelow
thesecondselecteditem.
CopySibling SHIFT+CTRL

Copythefirstselecteditemdirectlybelow
andonthesamelevelasthesecond
selecteditem.
First,selecttheitem(s)thatyouwanttomoveorcopy.Dragtheselectiontoits
newlocationanddropit.Toselectasingleitem,clickontheitem(e.g.,
spreadsheet,graph,orreport).Toselectaparentnodeandallofitschildren,click
onthefolder.Notethathorizontaland/orverticalscrollingwithintheworkbook
treecanbeutilizedduringadraganddropoperation.
SPREADSHEETS
(MULTIMEDIA TABLES)
STATISTICASpreadsheetsarebasedonStatSoftsproprietarymultimediatable
technologyandareusedtomanagebothinputdataandthenumericortext(and,
optionally,anyothertypeof)output.Thebasicformofthespreadsheetisasimple
twodimensionaltablethatcanhandleapracticallyunlimitednumberofcases
(rows)andvariables(columns),andeachcellcancontainavirtuallyunlimited
numberofcharacters.Sound,video,graphs,animations,reportswithembedded
objects,oranyActiveXcompatibledocumentscanalsobeattached.
Chapter5:STATISTICADocuments

Copyright StatSoft, 2011


174STATISTICAQuickReference

BecauseSTATISTICASpreadsheetscanalsocontainmacrosandanyuserdefined
userinterface,thesemultimediatablescanbeusedasaframeworkforcustom
applications(e.g.,withalistboxofoptionsoraseriesofbuttonsplacedinthe
upperleftcorner),selfrunningpresentations,animations,simulations,etc.

Data file layout in spreadsheets.STATISTICAdataareorganizedintocasesand


variables.Ifyouareunfamiliarwiththisnotation,youcanthinkofcasesasthe
equivalentofrecordsinadatabasemanagementprogram(orrowsofa
spreadsheet),andvariablesastheequivalentoffields(orcolumnsofa
Chapter5:STATISTICADocuments

Copyright StatSoft, 2011
STATISTICAQuickReference175
spreadsheet).Eachcaseconsistsofasetofvaluesofvariables,andthefirst
columninthefilecan(optionally)containnamesofcases.
Thespreadsheetwindowcomprisesseveralbasiccomponents.

Title bar.Thetitlebardisplaysthenameofthespreadsheetfollowedbythe
spreadsheetextension(.sta).Ifthespreadsheetisaninputspreadsheet,thetitle
baralsodisplaysthenumberofvariablesbynumberofcases(e.g.,25vby50c).In
theimageshownabove,thetitlebarcontainsthetextData:Adstudy.sta(25vby
50c).
Info box.Youcanselecttheentirespreadsheetbyclickingonceinthelowerright
corner(themousepointerwillbethedefaultarrow)oftheinfobox,whichis
locatedintheupperleftcornerofthespreadsheetwindow.Toselecttheinfobox
only(forformatting),clickonceintheupperleftcorneroftheinfobox(themouse
pointerwillbeanoutlinedplussign ).Doubleclickintheinfoboxtoenteroredit
thetextintheinfobox(e.g.,additionaldetailsaboutthespreadsheet).Inthe
imageshownabove,theinfoboxcontainsthetextResponses(Peoria,IL).
Header.Theheaderislocatedimmediatelyabovethevariableheadersatthetop
ofthewindow.Doubleclicktheheadertoenteroredittextinformation.Toselect
theheaderonly(forformatting),clickonceintheupperleftcorner(themouse
pointerwillbeanoutlinedplussign ).PressCTRL+ENTERorALT+ENTERtoenteranew
line(notethatyouneedtoextendtheheightofthefieldtoseenewlinesthatyou
areadding).Intheimageshownabove,theheadercontainsthetextAdvertising
EffectivenessStudy.
Case headers.Thesecells,locatedatthefarleftofthewindow,containheader
informationforeachcase.Doubleclickonanycaseheadercelltoenteroredit
textinformation.Toselectthecaseheaderonly(forformatting),clickonceonthe
leftsideofthecaseheader(themousepointerwillbeanoutlinedplussign ).To
Chapter5:STATISTICADocuments

Copyright StatSoft, 2011


176STATISTICAQuickReference
selectthecaserow(forediting),clickonceonthemiddleorrightsideofthecase
header(themousepointerwillbeanoutlinedplussignwithanarrow ).To
selectablockofcaseheaders,(withoutselectingtheirrespectiverows),clickon
theleftsideofacaseheaderanddragthemousepointertoincludealldesired
caseheaders.Toautofitthecaseheaders,doubleclickonthefarrightsideofany
caseheader(themousepointerwillbeacrosswithadoubleheadedarrow ).In
thepreviousimage,thecaseheadercellscontainthefirstinitialsandlastnamesof
therespondentsinthestudy.Notethatcaseheadersareoptionalandyoucan
choosenottodisplaythem(selecttheViewtab,intheDisplaygroupclickDisplay
Options,andtoggleofftheCaseNamescommand);iftheyarenotdisplayed,the
casenumbersareshown.
Variable headers.Thesecells,locatedatthetopofeachcolumn,containheader
informationforeachvariable.Todisplaydetailsaboutanindividualvariable,
doubleclickonthevariableheadercell.Toselectthevariableheaderonly(for
formatting)clickonceintheupperportionofthevariableheader(themouse
pointerwillbeanoutlinedplussign ).Toselectthevariablecolumn(forediting)
clickonceinthelowerportionofthevariableheader(themousepointerwillbean
outlinedplussignwithanarrow ).Toautofitthevariablecolumn,doubleclick
ontherightsideofthevariableheader(themousepointerwillbeacrosswitha
doubleheadedarrow ).Inthepreviousimage,thefirsttwovariableheadercells
containthetextGENDERandADVERT.Youhavetheoptiontochangehowthe
variableheadercellsdisplayinformationsothattheyshowthecolumnnumber
associatedwiththevariable,thevariablelongname,and/oranabbreviationofthe
displaytypesforthevariablesinthespreadsheet.Eachoftheseoptionsis
availableontheViewtabintheDisplaygroup;clickVariableHeaders.
Data (and in-cell formatting options).Theremainderofthespreadsheet
containsdatathatpertaintothecasesandvariablesandanyoptionalattachedor
linkedobjects(multimediaobjects,macros,customuserinterface).Textincells
canbeofpracticallyunlimitedlength(inmostSTATISTICAconfigurationsitis
limitedto1,000characterstoprotectagainstinadvertentpastingofunwanted
largeamountsofdataintoonecell).Textincellscanbeextensivelyformatted
includingwrappingthetext,differentfonts,andfontattributes.
Chapter5:STATISTICADocuments

Copyright StatSoft, 2011
STATISTICAQuickReference177
Input vs. Output Spreadsheets
STATISTICAofferstheabilitytoopenandusemanyspreadsheetsatthesametime,
allowingyoutoworkwithseveraldifferentinputdatafilessimultaneously.In
additiontostoringdata,STATISTICAusesspreadsheetstodisplaythenumeric
outputfromitsanalyses.BecauseSTATISTICAmakesnodistinctioninthefeatures
supportedforaninputspreadsheet(fromwhichSTATISTICAretrievesitsdata)and
anoutputspreadsheet(wheretheresultsofananalysisaredisplayed),itiseasyto
usetheresultsofoneanalysisasinputdataforfurtheranalyses.
Anyspreadsheetopenedfromadiskfileisautomaticallytreatedasaninput
spreadsheet,andanynumberofinputspreadsheetscanbeopenatatime.To
avoidconfusion,however,anoutputspreadsheet(containingtheresultsofan
analysis)isnotautomaticallyavailableasinputdataforanalysis.Itmustfirstbe
designatedasaninputspreadsheetbeforebeingusedforfurtheranalyses.
Additionally,inputspreadsheetsreportthenumberofvariablesandcasesforthat
spreadsheetinthetitlebar.Forexample,ifExp.sta(88vby48c)isinthetitlebar,it
isaninputspreadsheet;ifExp.staisinthetitlebar,itisnotaninputspreadsheet.
Todesignateanoutputspreadsheetasaninputspreadsheet,selectthe
spreadsheet(i.e.,ensurethespreadsheethasthefocus).Then,ontheDatatabin
theModegroup,selecttheInputcheckbox.Nowyoucanbeginananalysis,and
STATISTICAwillusethedatafromthespecifiedinputspreadsheetfortheanalysis.
Notethatifyouswitchbacktoanotherspreadsheetthathaspreviouslybeen
designatedasaninputspreadsheet,itcanstillbeusedforanalysesaswell.
Inaworkbook,onlyonespreadsheetcanbeselectedforanalysesatatime,evenif
theworkbookcontainsseveralinputspreadsheets.Thisspreadsheetiscalledthe
ActiveInputspreadsheet,anditsicon(intheworkbooktree)isframedinred.
Bydefault,whenanoutputspreadsheetisdesignatedasaninputspreadsheet,
STATISTICAautomaticallyselectsitastheActiveInputspreadsheet.Toselect
anotherinputspreadsheetforactiveinput,selecttheActiveInputcheckboxon
theWorkbooktabintheItemsgroup,orselectUseasActiveInputfromthe
workbooktreeshortcutmenu.
Chapter5:STATISTICADocuments

Copyright StatSoft, 2011


178STATISTICAQuickReference

Itisalsopossibletoleaveastandalonespreadsheetopenbutdesignateitas
unavailableforanalysis.Todothis,selectthespreadsheet,andcleartheInput
checkboxontheDatatabintheModegroup.NowSTATISTICAautomatically
defaultstothemostrecentlyselectedinputspreadsheetforanalysis,ignoringall
spreadsheetsthatarenotdesignatedasinputspreadsheets.
STATISTICA Spreadsheet
OLE DB Provider
InadditiontousingspreadsheetsasdatasourcesforanalysesinSTATISTICA,
spreadsheetscanalsosupplydatatootherdatabaseawareapplicationsbyusing
theStatSoftOLEDBProviderforSTATISTICASpreadsheets.ThisOLEDBdriveris
installedwithSTATISTICA,andallowsreadonlyaccesstodatainSTATISTICA
SpreadsheetsusingtheindustrystandardStructuredQueryLanguage(SQL).You
canaccesstheOLEDBProvideratanypointthesystemallowsyoutochoosea
databaseconnection,usingthestandardMicrosoftDataLinkProperties.
Toaccessthisfunctionality,selecttheDatatab.IntheManagegroup,click
ExternalDataandfromthedropdownlist,selectCreateQuery.IntheDatabase
Connectiondialog,clicktheNewbuttontodisplaytheDataLinkPropertiesdialog,
whereyouselectStatSoftOLEDBProviderforSTATISTICASpreadsheets.
Chapter5:STATISTICADocuments

Copyright StatSoft, 2011
STATISTICAQuickReference179

ClicktheNextbuttontodisplaytheConnectiontab.

TheDataSourcefieldspecifiesthedirectorypathwherethespreadsheetis
located.Whencreatingthequery,youcanchooseindividualspreadsheetfiles
withinthatdirectory.ThefollowingexampleusesSTATISTICAQuery,andhas
definedaconnectiontotheSpreadsheetOLEDB,specifyingthepathofthe
STATISTICAExamplesfolder.Eachspreadsheetwithinthefoldershowsupasa
potentialtable.

Chapter5:STATISTICADocuments

Copyright StatSoft, 2011


180STATISTICAQuickReference
ThesespreadsheetscanbereferencedinFROMclauses,specificvariablenames
selectedasfieldsinSELECTclauses,andcasesdefinedwithWHEREclauses.Joins
betweenmultiplespreadsheetsaresupportedaswell,usingstandardJOINclauses.

UsingtheStatSoftOLEDBProviderforSTATISTICASpreadsheetsenablesyouto
provideSTATISTICASpreadsheetdatatoanyapplication(includingSTATISTICA
itself)thatcanusetheindustrystandardOLEDBinterfaceforqueryingdata.
REPORTS
Reports(brieflyintroducedonpage150)inSTATISTICAofferamoretraditional
wayofhandlingoutput(comparedtoworkbooks)aseachobject(e.g.,a
STATISTICASpreadsheetorGraph,oranExcelspreadsheet)isdisplayed
sequentiallyinawordprocessorstyledocument.

Chapter5:STATISTICADocuments

Copyright StatSoft, 2011
STATISTICAQuickReference181
However,thetechnologybehindthissimplereportoffersyourichfunctionality.
Forexample,liketheworkbook,eachSTATISTICAReportisalsoanActiveX(see
page238)containerwhereeachofitsobjects(notonlySTATISTICASpreadsheets
andGraphs,butalsoanyotherActiveXcompatibledocuments,e.g.,Word
documents)isactive,customizable,andinplaceeditable.Reportsarestoredinthe
STRfileformat,whichisaStatSoftextensionoftheMicrosoftRTF(RichText
Format,*.rtf)format.STRfilessharetheRTFformattinginformationand
additionallytheyincludethetreeviewinformation(whichcannotbestoredinthe
standardRTFfiles).Hence,reportfilesarebydefaultsavedwiththefilename
extension*.str,buttheycanalsobesavedasstandardRTFfiles(inwhichcasethe
treeinformationwillnotbepreserved).
Theobviousadvantagesofthiswayofhandlingoutput(moretraditionalthanthe
workbook)aretheabilitytoinsertnotesandcommentsinbetweentheobjects
aswellasitssupportforthemoretraditionalwayofquicklyscrollingthroughand
reviewingtheoutputtowhichsomeusersmaybeaccustomed.Also,onlythe
reportoutputincludesandpreservesarecordofthesupplementaryinformation,
whichcontainsadetailedlogoftheoptionsspecifiedfortheanalyses(e.g.,
selectedvariablesandtheirlabels,longnames,etc.,dependingonthelevelof
supplementaryinformationspecifiedintheOutputManager,seepage25).
Theobviousdrawback,however,ofthesetraditionalreportsistheinherentflat
structureimposedbytheirwordprocessorstyleformat,thoughthatiswhatsome
usersofcertainapplicationsmayfavor.
Navigating the Report Tree
Thereporttreedisplaystheorganizationoffilesinthereport.Thefilesare
displayedinanExplorerstyleformat;however,unlikeworkbooksthatcansupport
anynumberoflevels,thereportsupportsonlyoneleveloffiles.
YoucanembedanytypeofSTATISTICAdocumentinareport,including
spreadsheets,graphs,andanalyses.InadditiontoSTATISTICAdocumenttypes,
youcanembedothertypesofActiveX/OLEobjectsinareport,includingExcel
spreadsheets,Worddocuments,bitmapimages,andothers.Toeditoneofthese
typesofembeddeddocuments,doubleclickonthedocument.Thefileopensin
Chapter5:STATISTICADocuments

Copyright StatSoft, 2011


182STATISTICAQuickReference
theviewer,andthereporttoolbarmergeswiththetoolbarfromtheembedded
filesnativeapplication,givingyouaccesstoalloftheeditingfeaturesyou need.
Itemsinthetreeareidentifiedbytheiconnexttothem.The spreadsheet,
macro,and graphiconsrepresentSTATISTICASpreadsheet,Macro,andGraph
documents,respectively.The DataMinericonrepresentsaDataMiner
workspace.AllnonSTATISTICAdocumentsarerepresentedbytheirdocument
icons.Forexample,Worddocumentsarerepresentedbythe Wordicon,and
Excelspreadsheetfilesarerepresentedbythe Excelspreadsheeticon.
Thereporttreecanbeorganizedandmodifiedusingdraganddropfeaturesas
wellasClipboardprocedures.

Commandsforinserting,extracting,renaming,andremovingitemsfromthe
reporttreeareavailablefromthereporttreeshortcutmenu(accessedbyright
clickinganywhereinthetree,asshownabove).
GRAPHS
GraphsrepresentanotherdistinctivetypeofSTATISTICAdocuments,andthey
offerrichfunctionalitybothintermsofthevarietyofwaysinwhichgraphscanbe
createdinSTATISTICAandintheselectionofgraphcustomizationtools.
SimilartotheotherSTATISTICAdocuments,graphsareActiveXcontainers(see
page238),whichmeansthattheycancontainavarietyofcompatibledocuments
(e.g.,Visiodrawings,Adobeillustrations,Excelspreadsheets,etc.).STATISTICA
GraphsarealsoActiveXobjectsand,therefore,canbelinkedtoorembeddedinto
Chapter5:STATISTICADocuments

Copyright StatSoft, 2011
STATISTICAQuickReference183
othercompatibledocuments(e.g.,Worddocuments)wheretheycanbeinplace
editedbysimplydoubleclickingonthem.
GraphsarediscussedinmoredetailinChapter6Graphs.
MACROS (STATISTICA
VISUAL BASIC PROGRAMS)
TheindustrystandardSTATISTICAVisualBasic(SVB)language(integratedinto
STATISTICA)offersanother(alternative)userinterfacetothefunctionalityof
STATISTICA,anditoffersincomparablymorethanjustasupplementary
applicationprogramminglanguagethatcanbeusedtowritecustomextensions.
NotethatSTATISTICAVisualBasicisnotMicrosoftVisualBasic6.0.StatSoftowns
andmaintainsthecodeforSTATISTICAVisualBasic.SVBiscompatiblewith
MicrosoftsVB.NET,MicrosoftsVisualBasicforApplications(VBA),andalsowith
MicrosoftsVisualBasic6.0(VB6).SVBscriptinglanguageisuniqueintermsofits
flexibilityandcompatibility,anditisalsoverypowerful.ItprovidesaccesstoVisual
BasicforApplications(usedforscriptingMicrosoftOfficeproducts)andaccessto
the.NETFrameworkwithinthesamefile(seeChapter10Programming
STATISITCAfrom.NET,page247).OtherAPIscanalsobeaccessedandleverage
theflexibilityofSVBsuchas,forexample,YahoosStockQuoteAPIorGoogle
AnalyticsAPI.SVBoffersapowerful64bitsolutionforsystemintegration,
expansion,andcustomdevelopment.
STATISTICAVisualBasictakesfulladvantageoftheobjectmodelarchitectureof
STATISTICAandisusedtoaccessprogrammaticallyeveryaspectandvirtuallyevery
detailofthefunctionalityofSTATISTICA.Eventhemostcomplexanalysesand
graphscanberecordedintoVisualBasicmacrosandlaterberunrepeatedlyor
editedandusedasbuildingblocksofotherapplications.STATISTICAVisualBasic
addsanarsenalofmorethan14,000newfunctionstothestandard
comprehensivesyntaxofVisualBasic,thuscomprisingoneofthelargestand
richestdevelopmentenvironmentsavailable.
Chapter5:STATISTICADocuments

Copyright StatSoft, 2011


184STATISTICAQuickReference

STATISTICAMacroscanbesavedinseveralformats,dependingonhowyouintend
tousethem(seetheSTATISTICAVisualBasicPrimerandtheElectronicManualfor
moreinformation).YoucanalsocopythemtotheClipboardandpastetheminto
otherprogramsordocuments.
STATISTICAVisualBasicisdiscussedinmoredetailinChapter8(page219).
STATISTICA PROJECTS
WhenperformingstatisticalanalysesandworkingwithSTATISTICAdocuments,
youwilloftenhavemanydifferentwindowsopen,andevendifferentanalysesin
differentstagesofprogress.STATISTICAprovidesameansforsavingyour
workspace,includinganyanalysesinprogress.YoucancloseSTATISTICAatany
pointduringananalysis,andwhenyoulaterreopentheproject,thepreviously
openedfilesandinprocessanalyseswillberestored.
TosaveaSTATISTICAProject,selecttheHometab,clicktheSavearrowinthe
Projectgroup,andselectSaveProjectAstodisplaytheSaveSTATISTICAProject
dialog.
Chapter5:STATISTICADocuments

Copyright StatSoft, 2011
STATISTICAQuickReference185

Inthisdialog,specifythepathandfilenameoftheSTATISTICAProjectfile(a
projectsextensionis.spf).Youcanalsospecifywhatitemstoincludeinthe
project.AllSTATISTICAdocumenttypescanbeselected(Spreadsheets,Graphs,
Workbooks,Macros,Reports,DataMinerprojects,InPlaceDatabaseprojects,
Analyses,andAnalysisresults).ForthoseSTATISTICAdocumentsthatarealready
storedondisk,youhavetheoptiontoeitherLinktotheexistingdocumentfile,or
tostoreacopyofthedocumentwithintheSTATISTICAProjectfile(Embedthe
documentintheproject).
InadditiontoSTATISTICAdocuments,projectfileswillalsosaveallinprogress
analyses.Theprojectfilewillstoretherecordedscriptsthatareautomatically
createdwheneveryanalysisisrun.Whentheprojectisreopened,thescriptsfor
theanalysesarererunagainsttheoriginaldataandtheanalysesdialogsaremade
visibleagaininexactlythestatetheywerewhentheprojectfilewassaved.
Projectfilesareaconvenientwaytosendinprogressanalysisstepsandresults
backandforthbetweenusersifyouelecttoembedthesaveddocumentsinthe
projectfile.Oneusercanrunanalysestoacertainpoint,andthensavetheproject
fileandpassittoanotheruser,whocanopentheprojectfileandcontinueexactly
wherethefirstuserstoppedtheanalyses.
Unlessyouconfigureitotherwise,STATISTICAwillautomaticallydisplayaprompt
askingifyouwanttosaveaprojectfilewhenquittingtheprogram,andwill
Chapter5:STATISTICADocuments

Copyright StatSoft, 2011


186STATISTICAQuickReference
automaticallyreopenthelastsavedprojectfilewhenstarting.Thus,STATISTICA
makesiteasytoquitforthedayandstartthenextsessionrightwhereyouleftoff.
NotethataprojectisastateofaninstanceofSTATISTICA.Thus,projectsarenot
likeotherdocumentsinthatyoucannotopenmorethanoneprojectinasingle
instanceofSTATISTICA.Adifferent(second)projectcanbeopenedinasecond
instanceofSTATISTICA.

Copyright StatSoft, 2011


STATISTICAQuickReference187
GRAPHS
Overview ................................................................................................. 189
Customization of Graphs ...................................................................... 190
General Categories of Graphs .............................................................. 198
Graphs of Input Data ............................................................................. 199
Graphs of Block Data ............................................................................ 202
Graphs Menu Graphs ............................................................................. 204
Graph Brushing and Case States ......................................................... 205
Other Specialized Graphs ..................................................................... 208
Creating Graphs via STATISTICA Visual Basic .................................... 209
CHAPTER
6
6

Chapter5:STATISTICADocuments

Copyright StatSoft, 2011


188STATISTICAQuickReference



Copyright StatSoft, 2011


STATISTICAQuickReference189
GRAPHS
OVERVIEW
Themostcommonapplicationofgraphsistoefficientlypresentandcommunicate
information(typically,numericaldata).However,graphicaltechniquesalsoprovide
powerfulanalyticaltoolsfortheexplorationofdataandverificationofhypotheses.
A broad selection of graphics options.STATISTICAincludesacomprehensive
selectionofgraphicalmethodsforbothdataanalysisandthepresentationof
results.AllgraphsinSTATISTICAincludeabroadselectionofbuiltin,interactive
analytictechniquesandextensivecustomizationtoolsthatenableyouto
interactivelycontrolvirtuallyallaspectsofthedisplay.Also,flexiblegraphics
managementfacilitiesareavailablethatareusedtointegratevariousgraphical
displaysandtobuilddynamiclinksbetweenapplications(e.g.,usingOLEObject
LinkingandEmbedding).
Comprehensive support for Visual Basic and other languages.STATISTICA
graphicaloptionscanalsobeaccessedprogrammatically(usingbuiltinSTATISTICA
VisualBasicorothercompatiblelanguages),whichcreatespracticallyunlimited
possibilitiesforproducinghighlycustomizedgraphicaldisplays.Thesecustom
graphscanlaterbepermanentlyaddedtoSTATISTICAsuserinterface(e.g.,
assignedtobuttonsontoolbarsoraddedtothemenus).
General categories of graphs.TheSTATISTICAsystemoffersavarietyofmethods
inwhichgraphscanberequestedordefined.Thesemethods(constitutingbroad
categoriesofgraphs,suchasinputdata,blockdata,andspecialized)arereviewed
inGeneralCategoriesofGraphsonpage198;theycomplementeachother,
CHAPTER
6
6

Chapter6:Graphs

Copyright StatSoft, 2011


190STATISTICAQuickReference
providingahighlevelofintegrationbetweennumbers(suchasrawdata,
intermediateresults,orfinalresults)andgraphicaldisplays.Forexample,
specializedgraphscanberequestedaspartoftheautomaticoutputfrom
statisticalprocedures,buttheycanalsoberequestedviaintegratedtoolsto
visualizevirtuallyanycombinationofnumbers(and/orlabels)thataredisplayedor
generatedbySTATISTICA.
CUSTOMIZATION
OF GRAPHS
Interactive graph customization.ThecustomizationoptionsinSTATISTICA
graphicsincludehundredsoffeaturesandtoolsthatcanbeusedtoadjustevery
detailofthedisplayandassociateddataprocessing.However,theseoptionsare
arrangedinahierarchicalmanner,sothoseusedmostoftenareaccessibledirectly
viashortcutsbydoubleclickingorrightclickingontherespectiveelementofthe
graph.
Permanent settings and automation options.Theinitial(default)settingsofall
ofthesefeaturescanbeeasilyadjustedsothateventhedefaultappearanceand
behaviorofSTATISTICAgraphswillmatchyourspecificneedsand/orwillrequire
verylittleinterventiononyourpart.Followingaresomeofthewaystomakethese
adjustments:
1. Options dialog.Perhapsthemoststraightforwardwaytoadjustthedefault
appearanceofgraphsisbymodifyingthegraphoptionsintheOptionsdialog
(selecttheToolstabandclickOptions).Mostcommonlyusedsettingscanbe
easilyadjustedthere(selectDisplayorSettings,locatedunderGraphs),and
theresultswillbereflectedinthedefaultstyles(seenumber2below)that
willbeusedbythesystemandassuch,theywillbeautomaticallysavedin
theSTATISTICAconfigurationfile(e.g.,differentsettingscanbeusedfor
differentprojects).Forfurtherdetails,seethedocumentationforthe
ConfigurationsoptionspaneoftheOptionsdialogintheElectronicManual.
2. Graph style system. Allofthenumerousfeaturesthataffectthe
appearanceofthegraph(fromaselementaryasthecolorofthefontinthe
footnotetoasgeneralastheglobalfeaturesofthegraphdocument)canbe
Chapter6:Graphs

Copyright StatSoft, 2011
STATISTICAQuickReference191
savedasindividualstyles.Thesestylescanbegivencustomnamesand
laterbereappliedusingsimpleshortcuts(suchaspressingaspecifickey
combinationorclickingabuttononacustomtoolbar).Anintelligentsystem
internallymanagesthesethousandsofstylesandtheircombinationsin
STATISTICAandhelpsyouachieveyourcustomizationobjectiveswitha
minimumamountofeffort.Alluserdefinedormodifiedstyleswillbesaved
automaticallyintheSTATISTICAconfigurationfile(e.g.,differentsetsor
systemsofstylescanbeusedfordifferentprojects).Forfurtherdetails,see
thedocumentationfortheConfigurationsoptionspaneoftheOptions
dialogintheElectronicManual.
3. User-defined graphs. Newtypesofgraphscanbedefinedinavarietyof
waysandcanbeaddedtothemenus,dialogs,ortoolbars.Ifacustomgraph
thatyouintendtouserepeatedlyisnotbuiltfromscratchbutisbasedon
oneoftheGraphsmenugraphsandisproducedbysomecombinationofthe
existinggraphcustomizationoptions,thenaddingittotheGraphsmenuasa
newtypeofgraphisassimpleasclickingtheAddAsUserdefinedGraphto
MenubuttonontheOptions2tabofthegraphspecificationdialog.Alluser
definedgraphspecificationswillbesavedautomaticallyintheSTATISTICA
configurationfile(e.g.,differentsetsofcustomgraphscanbeusedfor
differentprojects).Forfurtherdetails,seethedocumentationforthe
ConfigurationsoptionspaneoftheOptionsdialogintheElectronicManual.
4. STATISTICA Visual Basic.Finally,notethattherearenolimitstohow
deeplycustomizedyourSTATISTICAcustomgraphscanbe,because
STATISTICAVisualBasic(withallitspowerfulcustomdrawingtoolsaswellas
theSTATISTICAbasedlibraryofgraphicsprocedures)canbeusedtoproduce
virtuallyanygraphicsormultimediaoutputsupportedbythecontemporary
computerhardware.Thosecustomdevelopeddisplaysormultimediaoutput
canbeassignedtoSTATISTICAtoolbars,menus,ordialogsandbecomea
permanentpartofyourSTATISTICAapplication.
SeetheElectronicManual(STATISTICAHelp)forfurtherdetailsonthesegraph
customizationmethods.TheElectronicManualalsocontainstopicsdevotedto
specificcategoriesofgraphs,includesconceptualoverviewsandexamplesof
typicalapplications,anddiscussesdistinctivefunctionalpropertiesofthe
respectivetypesofgraphs.
Chapter6:Graphs

Copyright StatSoft, 2011


192STATISTICAQuickReference
ThedefaultsettingsofmostgraphsofferedinSTATISTICAfollowtheestablished
conventionsthatareeitherexplicitlydescribedintheliteratureonstatisticaland
technicalgraphing,ortheyrepresentstandardsthatarecommonlyacceptedby
majorscientificjournals(e.g.,SCIENCE).However,practicallyalldefaultsettingsof
STATISTICAcanbecustomizedtomeetspecificrequirementsofunusual
applications(seepage190).STATISTICAsgraphicsfacilitiesweredesignedtoplay
theroleofflexibletools,capableofproducingeffectsthatgofarbeyond
establishedpatternsandtemplates.
Inadditiontoacomprehensiveselectionofstandardstatisticalandtechnical
graphs,STATISTICAincludesnumerousuniquetypesofgraphsandgraph
customizationfacilities.TheGraphOptionsdialog,accessiblebydoubleclickingin
thebackgroundofagraph,orselectingtheToolstabandclickingGraphinthe
Optionsgroup,containsoptionsthataddressalloftherelevantcustomizable
featuresforaparticulargraph.Theoptionsaregroupedinclusterscontaining
logicallyrelateditems,andareanallinclusivesupersetofgraphshortcut
optionsaccessedbydoubleclickingspecificgraphfeatures.

Locatedatthebottomofgraphs,youllfindtheinteractivegraphicscontrols(see
thenextillustrations),whichenableyoutoadjustthetransparencyoftheplot
areasandmarkers,andtoscrollandpaninordertointeractivelyscalethegraph.
Morecontrolsarelocatedin3Dgraphstoenableinteractiverotation.Clickthe
wrenchiconadjacenttothesliderstodisplaytheGraphOptionsdialog.
Chapter6:Graphs

Copyright StatSoft, 2011
STATISTICAQuickReference193

Left:2DGraph
Below:EnlargedimageofPanning(scaling),Scrolling,
andTransparencyControls
Left:Sectiontobe
scalediscircled
Right:Scaledviewof
leftgraphscircledarea
Left:Scatterplotwith
denseconcentrationof
datapoints
Right:Transparency
Controlrevealshidden
trends
Left:PlotAreaTransparencyControlcircled;making
plotareastransparentallowsportionsoftheplotto
overlapwhilestillbeingvisible
InteractiveScrolling
InteractivePanning
Chapter6:Graphs

Copyright StatSoft, 2011


194STATISTICAQuickReference

WhileStatSoftstatisticiansdesignedmostofthegraphcustomizationoptions,itis
importanttosaythatSTATISTICAusershaveplayedasignificantroleintheir
creation.Infact,theselectionofgraphicsoptionsincludedinSTATISTICAisthe
resultofinputfromthousandsofuserswhoprovidedtheircommentsinresponse
toStatSoftsinquiries.ManyuniquefacilitiesofSTATISTICAGraphswere
introducedinresponsetousersideasandrequests.WeatStatSoftarevery
gratefulfortheinputfromourusers.
Asmentionedpreviously(anddiscussedindetailonpage198),therearevarious
methodstospecifySTATISTICAGraphs.Youcouldsaythatthesemethods
representdifferenttypesofinterfacesbetweennumbersandgraphs.
Forexample,thenumbersrepresentedinapiechartcansimplydepictvaluesofa
spreadsheetcolumn(e.g.,variableSales)intheconsecutivecasesofthe
spreadsheet(e.g.,caseslabeled:Year2008,Year2009,Year2010,...,etc.).

Thenumbersinasimilarpiechart,however,canrepresentresultsofcalculations.
Forexample,theslicesofthepiecanrepresentrelativefrequenciesof
observationsthatbelongtocertaincategoriescalculatedbyoneofthehistogram
Left:3DGraph;RotationControlscircled
Below:EnlargedimageofRotationandTransparency
Controls
Chapter6:Graphs

Copyright StatSoft, 2011
STATISTICAQuickReference195
orfrequencycategorizationprocedures(e.g.,numbersofyearswhentheSales
werebelow$10million,between$10and$20million,andabove$20million).

Regardlessofthemethodthatwasusedtocreateagraph(i.e.,regardlessof
wherethenumbersrepresentedinthegraphwereobtainedorhowtheywere
calculated),allSTATISTICAGraphcustomizationandmultigraphicsmanagement
facilitiescanbeusedtochangetheappearanceofthegraphorintegrateitwith
othergraphsordocuments.

Chapter6:Graphs

Copyright StatSoft, 2011


196STATISTICAQuickReference
Also,allintegratedanalyticfacilitiesthatareaccessiblefromwithingraphsin
STATISTICA(suchasfunctionfitting,smoothing,rotation,brushing,analytical
zooming,etc.)areavailableandcanbeappliedtothegraphregardlessofthe
sourceofthenumbersinthegraphorthemethodthatwasusedtocreateit.
ThegrapheditingfacilitiesofferedinSTATISTICAenableyoutocreatenotonly
highlycustomizedscientificandtechnicalpublicationreadydisplays:

Chapter6:Graphs

Copyright StatSoft, 2011
STATISTICAQuickReference197
andprecisedrawings:

butalsopresentationqualitydiagrams,posters,businesscharts,andother
displays:

Chapter6:Graphs

Copyright StatSoft, 2011


198STATISTICAQuickReference
thataredesignedtocommunicateinformationinaneffectiveandattractive
manner.
Graphsthataresavedintofilesorthatinanyotherwayhavebeentemporarily
detachedfromtheSTATISTICAapplication(e.g.,copiedtotheClipboardorlinked
toadocumentinanotherapplication)arecompleteobjects(technically
speaking,ActiveXobjects,seepage238)thatcontainnotonlyallcustomization
featuresandotherembeddedobjects,butalsoalldatathatarenecessaryto
continueeditingallaspectsofthedisplayortheanalysisofitscontents(fitting,
smoothing,etc.).
BecauseSTATISTICAGraphsareActiveXobjects,theycaneasilybelinkedtoor
embeddedinothercompatibledocuments(e.g.,ExcelorWorddocuments),where
theycanbeinplaceeditedbydoubleclickingonthem.STATISTICAGraphsarealso
ActiveXcontainersand,therefore,cancontainawidevarietyofembeddedor
linkeddocumentssuchasVisiodrawings,Adobeillustrations,Excelspreadsheets,
orWorddocuments.Moreover,STATISTICAsupportshierarchiesofembedded
objectsuptofourlevels,whichmeansthatitcanmanagedocumentscontaining
documents,containingdocuments,whichcontaindocuments.
GENERAL CATEGORIES
OF GRAPHS
Inadditiontothespecializedstatisticalgraphsthatareavailablefromtheoutput
dialogsinallstatisticalprocedures(seepage208),therearetwogeneral
categoriesorclassesofgraphsbothaccessiblefromtheGraphstab,shortcut
menus,andtheSTATISTICAStartbutton menu:
Inputdatagraphs(GraphsofInputData,seepage199)andGraphsmenu
graphs,(seepage204)and
GraphsofBlockData(seepage202).
Themostimportantdifferencebetweenthesetwogeneralcategoriesliesinthe
datathatthegraphtypesutilizeforgeneratingplots.
Input data graphs. GraphsofInputDataandtheirexpandedversiononthe
Graphstabproducestatisticalsummariesorotherrepresentationsoftherawdata
Chapter6:Graphs

Copyright StatSoft, 2011
STATISTICAQuickReference199
inthecurrentinputdataspreadsheet(typicallyforallthevariables,orforsubsets
ifcaseselectionconditionsareused).Notethatifgraphsofthisgeneralcategory
areproducedusingashortcutmenufromwithinaspreadsheetofresultsthatdoes
notcontaintheactualdata(e.g.,acorrelationmatrix),STATISTICAwillstillreachto
therespectiveinput(raw)datatoproducethegraph(e.g.,ascatterplotofthe
variablesidentifiedbytheselectedcellinthecorrelationmatrixfromwhichthe
shortcutmenuwasopened).
Graphs of Block Data. GraphsofBlockData,however,areentirelyindependent
oftheconceptofinputdataordatafile.Theyprovideageneraltoolto
visualizenumericvaluesinthecurrentlyselectedblockofanyspreadsheet(which
cancontainvaluesfromcustomdefinedsubsetsofnumericaloutputorarbitrarily
selectedsubsetsofrawdata).
Common features of the two categories of graphs. Thesetwogeneral
categoriesofgraphsofferthesamecustomizationoptionsandthesameselection
oftypesofgraphs.Forexample,youcancreatethesamehighlyspecialized
categorizedternarygraphfromtheinput(raw)datasetandfromacustomdefined
blockofvaluesrepresentingresultsofaparticulartest.
Thesetwogeneralcategoriesofgraphswillbebrieflydiscussedinthenexttwo
sections,followedbyasectionontheGraphstab,whichcontainsanexhaustive
selectionofallgraphsfromthefirstcategory(inputdatagraphs,oftenreferredto
asGraphsmenugraphs),aswellasaccesstoGraphsofBlockDataandother
options.
GRAPHS OF INPUT DATA
TheGraphsofInputDatacommandisavailablefromtheshortcutmenuofall
spreadsheets,anditoffersquickandsimplifiedaccesstothemostcommonlyused
typesofgraphsbasedonthecurrentinputdataset.
Chapter6:Graphs

Copyright StatSoft, 2011


200STATISTICAQuickReference

NotethatallthesegraphsarealsoavailableontheGraphstab,fromthe
STATISTICAStartmenu

onthestatusbar,orbyclickingtheGraphsGallery
buttononanygraphspecificationdialog.GraphsofInputDatadonotofferas
manyoptionsasthecorrespondingGraphsmenugraphs;however,theyare
quickertoselectbecauseunlikeGraphsmenugraphs:
GraphsofInputDatacanbecalleddirectlyfromthespreadsheetshortcut
menus,
GraphsofInputDatadonotrequireyoutoselectvariables(thevariable
selectionisdeterminedbythecurrentcursorpositionwithina
spreadsheet),and
GraphsofInputDatadonotrequireyoutoselectoptionsfromany
intermediatedialogs(defaultformatsoftherespectivegraphsare
produced).
GraphsofInputDataprocessdatadirectlyfromthecurrentinputdatafile,and
theytaketheircuesastowhichvariablestousefromthecurrentcursorposition
(inanytypeofspreadsheet).
Forexample,ifyourightclickasinglecorrelationinaresultsspreadsheetand
createaScatterplotbygraph,STATISTICAgeneratesa2Dscatterplotusingthe
originalrawvaluesofthetwovariablesrepresentedbythatcorrelation(seethe
IntroductoryExampleonpage11foramoredetailedexample).
AlthoughthemostconvenientwaytoselectGraphsofInputDataisviathe
spreadsheetshortcutmenu,youcanalsoselectthemfromtheGraphstaborthe
STATISTICAStartmenu

.Eithermethodwilldisplayasubmenufromwhichyou
Chapter6:Graphs

Copyright StatSoft, 2011
STATISTICAQuickReference201
canchooseoneofthestatisticalgraphsapplicabletothecurrentvariable(i.e.,to
thevariableindicatedbythecurrentcursorpositioninthespreadsheet).
Ifthespreadsheethasamatrixformatoraformatwhereacursorposition
indicatesnotonebuttwovariables(asintheillustrationshowingacorrelation
matrix,below),thenpredefinedbivariategraphsforthespecifiedpairofvariables
willbedirectlyavailablefromtheGraphsofInputDatasubmenus.

Otherwise,i.e.,whenthecurrentcursorpositionindicatesonlyonevariableasina
tableofdescriptivestatistics(asshowninthenextillustration),andifyouselect
anyofthebivariategraphsinthemenu,STATISTICAwillpromptyoutoselectthe
secondvariable.Forexample,ifyouselectScatterplotby,theSelectsecond
variabledialogwillbedisplayed,whereyouspecifybywhichvariableMeasure05
isgoingtobeplotted.

Chapter6:Graphs

Copyright StatSoft, 2011


202STATISTICAQuickReference
Ifmorethanonevariableisindicatedbyahighlightedsection(i.e.,whenablockis
selected),thentheGraphsofInputDatamenuwillapplytothefirstselected
variable.
WhengeneratingGraphsofInputData,STATISTICAtakesintoaccountthecurrent
caseselectionandweightingconditionsforthevariablesthatarebeingplotted.
Note,however,thatthecaseselectionorweightingconditionsneedtobe
specifiedforthecurrentspreadsheet(i.e.,viatheToolstabSelectionConditions
EditoptionsandtheToolstabWeightoptions)andnotjustlocallyforan
analysis(i.e.,selectedfromtherespectiveanalysis/graphspecificationdialogs
usingthe
and
buttons).ThelatterconditionswillbeignoredbytheGraphs
ofInputData.FormoreinformationonspecifictypesofGraphsofInputData,see
theElectronicManual.
GRAPHS OF BLOCK DATA
UnlikeGraphsofInputData,GraphsofBlockDatausethecurrentlyselected
(continuous)blockofdataintheactivespreadsheettospecifyinputdatafor
thegraph.

Notethatthesegraphsareentirelyindependentfromtheconceptofinputdata.
Theyprocessvalues(numbers)fromwhateveriscurrentlyselectedintheblock
andignorethemeaningofthosenumbers(e.g.,thenumberscanberawdataor
valuesofcorrelationcoefficients).Thesegraphsofferaneffectivemeansof
Chapter6:Graphs

Copyright StatSoft, 2011
STATISTICAQuickReference203
visualizing,exploring,andefficientlysummarizingnumericoutputfromanalyses
displayedinresultsspreadsheets(e.g.,histogramsofMonteCarlooutputscoresin
theSEPATHmodule,oraboxplotofaggregatedmeansfromamultivariate
multipleclassificationtableintheANOVAmodule).
AlthoughthemostconvenientwaytoselectGraphsofBlockDataisviathe
shortcutmenuassociatedwiththeblockselectedinaspreadsheet,Graphsof
BlockDataarealsoavailablefromtheGraphstabortheSTATISTICAStartmenu

.WhencreatingGraphsofBlockData,youcanselectfromdefaultgraphs(e.g.,
Histogram:BlockColumnsorLinePlot:BlockRows),oryoucancreateyourown
customgraphsforeithertheselectedcellsintherowsorcolumns,orofallcellsin
theselectedrowsorcolumns(i.e.,goingbeyondthevaluesthatareselectedinthe
block).
Default graphs. Usingthedefaultgraphs(thefirstsixcommandsontheGraphsof
BlockDatasubmenu,shownintheillustrationabove),youcancreatespecified
graphswithasingleclick.Forspecificinformationoneachdefaultgraph,referto
theElectronicManual.
Custom graphs.SelectanyofthefourCustomGraphcommandstodisplaythe
SelectGraphdialog,whichprovidesavarietyofoptionsforcreatingcustomized
graph.

Forspecificinformationoncustomgraphs,refertotheElectronicManual.
Customizing graphs.AswithmostfeaturesofSTATISTICA,GraphsofBlockData
arefullycustomizable.SelectCustomizeListfromtheBlockDataGraphsmenuto
displaytheCustomizeGraphMenudialog,whichprovidesoptionstoremove,
rename,oreditthecurrentlylistedgraphsaswellastoaddnew(userdefined)
graphstotheGraphsofBlockDatamenu.
Chapter6:Graphs

Copyright StatSoft, 2011


204STATISTICAQuickReference
Forexample,ifyouwanttoincludeanormalfitonthehistogramscreatedusing
Histogram:BlockColumns,selectHistogram:BlockColumnsintheCustomize
GraphMenudialog,clicktheEditbutton,andswitchtheGraphSubTypeto
NormalFit.AllsubsequentlycreatedHistogram:BlockColumnsplotswillincludea
normalfittothedata.
GRAPHS MENU GRAPHS
TheGraphstabprovidesacompleteselectionofallstatisticalgraphsavailablein
STATISTICA.TheseoptionsareavailablefromnotonlytheGraphstab,butalsothe
STATISTICAStartmenu

,andofferhundredsoftypesofgraphical
representationsandanalyticsummariesofdata.

Notethat,unlikeGraphsofBlockData(whicharealsoincludedonthistabin
ordertoofferafullcomplementofallgraphicaloptionsaccessiblefromasingle
control),allothergraphtypesfromtheGraphstabarenotlimitedtothevaluesin
thecurrentoutputspreadsheet.Instead,theyprocessdatadirectlyfromthe
currentinputspreadsheet,inthesamewaythe(previouslydiscussed)Graphsof
InputDatado.Theyrepresenteitherstandardmethodstographicallysummarize
rawdata(e.g.,variousscatterplots,histograms,orplotsofcentraltendenciessuch
asmedians)orstandardgraphicalanalytictechniques(e.g.,categorizednormal
probabilityplots,detrendedprobabilityplots,orplotsofconfidenceintervalsof
regressionlines).Whengeneratingthesegraphs,STATISTICAtakesintoaccount
thecurrentcaseselectionandweightingconditionsforthevariablesselectedtobe
plotted.
Graphsmenugraphsinclude2DGraphs,3DSequentialGraphs,3DXYZGraphs,
MatrixPlots,IconPlots,CategorizedGraphs,andUserDefinedGraphs.Notethat
theCommongroupontheGraphstabincludesthemostcommonlyusedtypesof
graphs(Histograms,Scatterplots,Mean/ErrorPlots,etc.),andtheMoregroup
Chapter6:Graphs

Copyright StatSoft, 2011
STATISTICAQuickReference205
containsacomprehensivelistofallgraphtypes.Seealso,TypesofGraphsMenu
GraphsintheElectronicManual.
GRAPH BRUSHING AND
CASE STATES
GraphsthatarecreatedfromtheGraphstabarehighlyinteractivewiththe
spreadsheetfromwhichtheywerecreated.Youcanidentifyandselectpointsin
thegraphandspecifythattheyaretobehighlightedinthesourcespreadsheet,
andviceversa.
Inadditiontoselectingpointsingraphsandspreadsheets,youcanidentify
propertiesofacaseinaspreadsheetthatwillbeusedwhenthegraphiscreated
fromthatdata.Thesepropertiesincludethepointmarkerstyleandcolor,and
whetherthepointistobeexcludedfromthegraphand/orfitcalculations.
Tostartbrushingwithinagraph,clickthebrushing
buttonontheEdittabintheCustomizeGraphgroup,or
rightclickinthebackgroundofagraphandselectShow
BrushingfromtheshortcutmenutodisplaytheBrushing
dialog,whichisshownintheillustrationtotheright.
WiththedefaultSelectionBrush,whichisSimple,youcan
drawarectangleonthegraphtoselectthepointscontained
intherectangle.Thefollowingillustrationdemonstratesthis
fortheexampledatasetAdstudy.sta,witha2Dscatterplot
ofMEASURE01byMEASURE02.
Notethattheupperleftthreepointshavebeenselectedby
thebrushingtool,whichhighlightsthepointsinthegraphas
wellasthecorrespondingcasesinthespreadsheetfrom
whichthegraphwascreated.
Chapter6:Graphs

Copyright StatSoft, 2011


206STATISTICAQuickReference

Alternatively,insteadofusingtheBrushingfacilities,youcanselectcasesinthe
spreadsheet(clickonthefarleftsideofthecasename)andthecorresponding
pointswillbemarkedinthegraph,asshowninthefollowingillustration,where
thefirstfivecasesintheAdstudy.staspreadsheethavebeenselected.

Chapter6:Graphs

Copyright StatSoft, 2011
STATISTICAQuickReference207
Youcanspecifyspreadsheetcasestatesfromeitheraspreadsheetoragraph.Ina
STATISTICASpreadsheet,rightclickonacasenametodisplaytheshortcutmenu,
whichcontainscommandsincludingOff,Label,MarkedPoints,andCaseStates.
Similarcommandsareavailablefromtheshortcutmenudisplayedwhenyouright
clickonthepointsinagraph.Thegraphwillusetheseoptionswhendisplayingthe
pointsrepresentedbythiscase.Forexample,ifyouselectLabel,the
correspondingpointswillbelabeled,asshowninthenextillustration.Notethat
thespreadsheetcasesaremarkedwithacasestateicontoindicatethatthecase
pointsarelabeled.

Rightclickonacasename,andfromtheshortcutmenuselectCaseStatesEdit
CaseStatestochangethecasemarkerand/orcolor.
NotethattheselectionofpointsisavailableforgraphtypesotherthanScatterplots.
Forhistograms,brushing/selectingahistogrambarwillselectthecorresponding
pointstothatbarinthespreadsheet.Thesameistrueoftheboxesinboxplots.
Usingcasestatesandbrushingandselectingpointsisparticularlyusefulwiththe
HiddenandExcludedcasestatesoptions.First,tomaketheseoptionsavailable,
displaytheOptionsdialog(selecttheToolstabandclickOptions),andinthetree
viewselectNavigation/Defaults(locatedunderSpreadsheets).Clearthe
Chapter6:Graphs

Copyright StatSoft, 2011


208STATISTICAQuickReference
CombineExcludedandHiddenCaseStatesintoOffstatecheckbox,andclickthe
OKbutton.
Then,selecttheDatatab,andintheCasesgroupclickCases.FromtheCaseStates
submenu,selectHiddentomarkacaseashidden,i.e.,thecasewillnotbevisible
ingraphs,butwillbeusedinanalyses.Youcanalsorightclickonacasename,and
fromtheshortcutmenuselectCaseStatesEditCaseStatestodisplaytheCase
Statedialog,whereyoucanselecttheHiddencheckbox.
SelectExcludedtomarkacaseasexcluded,i.e.,thecasewillnotbeusedinthe
computations;however,thecasewillbedisplayedinmostgraphtypes.Thecase
pointmarkerisdisplayed,butthecaseisremovedfromcomputations.The
Excludedcasestatealsoworksinconjunctionwithspreadsheetselection
conditions;anycasethathastheExcludedcasestatesetwillbetreatedasifthe
casewereexcludedbyselectionconditions.Therefore,usinggraphbrushingand
casestatesisaconvenienttooltointeractivelyremoveoutliersandthenrerun
analyseswiththepointsremoved.
WhentheCombineExcludedandHiddenCaseStatesintoOffstatecheckboxis
selectedintheOptionsdialogNavigation/Defaultsoptionspane,theHiddenand
ExcludedoptionsarereplacedwiththeOffoption.SelectOfftomarkacaseasHidden
andExcluded;thepointwillbeexcludedfromcomputationsandfromgraphs.
OTHER SPECIALIZED GRAPHS
InadditiontothestandardselectionofGraphsofInputData,GraphsofBlockData,
andGraphsmenugraphs,otherspecializedstatisticalgraphsthatarerelatedtoa
typeofanalysis(e.g.,clusteranalysisresults)areaccessibledirectlyfromresults
dialogs(i.e.,thedialogsthatcontainoutputoptionsfromthecurrentanalysis).
Chapter6:Graphs

Copyright StatSoft, 2011
STATISTICAQuickReference209

Thespecializedgraphsaredescribedinthedocumentationfortheanalysesfrom
whichtheycanbeproduced;forinformation,refertotheElectronicManual.
CREATING GRAPHS VIA
STATISTICA VISUAL BASIC
STATISTICAgraphicaloptionscanalsobeaccessedprogrammaticallyusingthe
builtinSTATISTICAVisualBasic(SVB)orothercompatiblelanguages.Therefore,
therearenolimitstohowdeeplycustomizedyourSTATISTICAgraphscanbe,
becauseSVB(withallitspowerfulcustomdrawingtoolsaswellastheSTATISTICA
basedlibraryofgraphicsprocedures)canbeusedtoproducevirtuallyanygraphics
ormultimediaoutputsupportedbythecontemporarycomputerhardware.
AnapplicationwritteninSTATISTICAVisualBasiccanoperateongraphsinthreeways:
Createanewgraphandthenmodify,print,orsaveit;
Accessanexistinggraphandthenmodifyit;
Chapter6:Graphs

Copyright StatSoft, 2011


210STATISTICAQuickReference
Openanexistinggraphfileandthenmodify,print,orsaveit.
EverygraphavailableinSTATISTICAcanbeproducedbySTATISTICAVisualBasic
andthencustomizedusingSTATISTICAproceduresorgeneraloptionsofferedin
thiscomprehensivelanguage.

AswithallotherfunctionsinSTATISTICAVisualBasic,functionstoaccessthe
graphicslibraryofSTATISTICAcanbeeasilyincorporatedintoSTATISTICAVisual
BasicprogramsviaahierarchicallyorganizedFunctionBrowser.Itcontainsshort
descriptionsofallfunctionsandoptionsthatcanbeinserteddirectlyintothesource
codeofyourprogram(i.e.,intotheSTATISTICAVisualBasicEditor,seepage225).

FormoreinformationonaccessingthegraphicslibrariesofSTATISTICAviathe
STATISTICAVisualBasicprogramminglanguage,refertotheElectronicManual.

CUSTOMIZING
STATISTICA
Customization of the Interactive User Interface ................................ 213
Customization of Documents ............................................................... 214
Local vs. Permanent Customizations .................................................. 215
General Defaults .................................................................................... 215
Graph Customization ............................................................................. 217
Maintaining Different Configurations of STATISTICA ........................ 218
Customized Configurations for Individual Users on a Network ........ 218
CHAPTER
7
7

Copyright StatSoft, 2011


STATISTICAQuickReference213
CUSTOMIZING
STATISTICA
STATISTICAofferstheflexibilityoffullycustomizableuserinterfacesandsupports
thenecessaryadjustmentofthestandarduserinterfacetobettersuityourspecific
needs.Infact,STATISTICAanticipatesyourneedsinthatitremembersvarious
choicesasyoumakethem.Forexample,ifyoulaunchananalysisfromthe
Advancedtabonananalysisspecificationdialog,theAdvancedtabwillbe
selected(insteadoftheQuicktab)thenexttimeyoudisplaythatdialog.
Practicallyallaspectsoftheuserinterfacecanbecustomizedstartingwithsuch
elementarycontrolsastheclassicmenus,QuickAccesstoolbar,andthekeyboard.
Theprocessforcustomizingthesescreencomponentsisquickandstraightforward
(forexample,seetheillustrationofcustomizingthetoolbaronpage139).Youcan
setbothglobalandlocalcustomizationsforgraphs,spreadsheets,workbooks,
reports,etc.,andmaintaindifferentconfigurationsofSTATISTICA(forasingleuser
aswellasfornetworkusers).Youcanalsodefineentirelynewuserinterfaces(see
pages139and140).
CUSTOMIZATION OF THE
INTERACTIVE USER INTERFACE
Asmentionedbefore,STATISTICAcontainsfacilitiestodefineentirelynewuser
interfaces(seepage139),includingtheInternetbrowserbaseduserinterfaces
(seepage141).However,practicallyallaspectsofthedefault,interactiveuser
interfacecanalsobeadjustedeasilyinavarietyofways.Forexample,youcanadd
CHAPTER
7
7

Chapter7:CustomizingSTATISTICA

Copyright StatSoft, 2011


214STATISTICAQuickReference
tothedefaultoptions,simplifythem,orkeepchangingthemasyourneeds
change.Dependingontherequirementsofthetaskstobeperformed,aswellas
yourpersonalpreferencesforparticularmodesofwork(andaestheticchoices),
youcansuppressallicons,toolbars,statusbars,longmenus,workbookfacilities,
draganddropfacilities,dynamic(automatic)linksbetweengraphsanddata,3D
effectsintables,and3Deffectsindialogboxes;requestbarebonessequential
outputwithsimple,paperwhitespreadsheetsandmonochromegraphs;andset
thesystemtoautomaticallymaintainnomorethanonesimplereportatatime.
Oralternatively,youcandefineelaboratelocalandglobaltoolbars;takefull
advantageofallspecialtoolsandcontrols,icons,toolbars,macros(e.g.,assign
particulartaskstospecificnewclassicmenucommands,theQuickAccesstoolbar,
orkeys),elaboratemultimediatables,workbookfacilities,anddraganddrop
facilities;establishmultipledynamic(automatic)linksbetweengraphsanddata
andinternallinksbetweengraphicalobjects;customizetheoutputwindowswith
colors,specialfonts,andhighlights;adjustthedefaultgraphstylesandtheir
displaymodes;andsendtheresultstoseparatehierarchicallyorganized
workbookstocreateanelaborate,multilayereddataanalysisenvironmentthat
facilitatestheexplorationofcomplexdatafilesandallowsyoutocompare
differentaspectsoftheoutput.
CUSTOMIZATION
OF DOCUMENTS
Thereisavarietyofcomprehensive,specializedtoolstocustomizethelayoutand
operationofSTATISTICAdocuments(seeChapter5STATISTICADocuments,page
167).Forexample,STATISTICAhasacomprehensivesystemofmanagingdefaults
ofeveryaspectofgraphsandcombiningcustomizationsintohierarchically
organizedstyles.Similarly,youcancreatecustomlayoutsandformatsfor
spreadsheets(multimediatables)andevencustomizeevents(e.g.,whathappens
whenyoudoubleclickonatable).SeetheElectronicManualforfurtherdetails.
Chapter7:CustomizingSTATISTICA

Copyright StatSoft, 2011
STATISTICAQuickReference215
LOCAL VS. PERMANENT
CUSTOMIZATIONS
ManyaspectsoftheappearanceofSTATISTICAcanbeadjustedfromboththe
ViewandToolstabs.Eachofthesetwomethods,however,hasadifferent
function.
View tab.ThechangesspecifiedontheViewtabaffectthecurrentappearanceof
STATISTICA(e.g.,hidestheStatusBar)orthecurrentdocumentwindow(e.g.,
spreadsheetgridlines).
Options dialog.TheoptionsavailableintheOptionsdialog(selecttheToolstab
andclickOptions)areusedtoadjustthepermanentprogramdefaults(discussed
inmoredetailinthenextsection).Note,however,thattheglobaloptionsthatare
applicabletodocumentsofaparticulartype(e.g.,agraphoraspreadsheet)will
notchangethecurrentdocument.Instead,theywillonlybestoredasprogram
defaultsthatwillaffectthecreationofthenext(i.e.,new)documentofthe
respectivetype.
Forexample,ifyouchangetheDefaultSpreadsheetLayoutintheNavigation/
DefaultsoptionspaneoftheOptionsdialog,youwillseethenewSpreadsheet
Layoutappliedonlywhenyoucreateanewspreadsheet.However,thesedefaults
willnotaffectanypreviouslysavedfilesbecausethosespreadsheetsaredisplayed
withthespecificappearancewithwhichtheyweresaved(usetheoptionsonthe
Viewtabtocustomizetheexistingobjects).
GENERAL DEFAULTS
Customization of the general system defaults.Thegeneraldefaultsettingsof
STATISTICAcanbeadjustedwiththeoptionsintheOptionsdialog(selecttheTools
tabandclickOptions).Theycontrol:
ThegeneralaspectsofthebehaviorofSTATISTICA(suchasmaximizing
STATISTICAonstartup,workbookandreportfacilities,filelocations,
customlists,etc.),
Chapter7:CustomizingSTATISTICA

Copyright StatSoft, 2011


216STATISTICAQuickReference
Thewayinwhichtheoutputisproduced(e.g.,inworkbooks,reports,etc.),
Thegeneralappearanceoftheapplicationwindow(icons,toolbars,etc.),
and
Theappearanceofdocumentwindows.
TheGeneraloptionspaneoftheOptionsdialogisshowninthenextillustration.

Alltheseandothergeneralsettingsareaccessibleregardlessofthetypeof
documentthatiscurrentlyactive(e.g.,aspreadsheetoragraph).Formore
informationaboutaspecificoptionspane,seetheElectronicManual(i.e.,pressF1
toviewtheSTATISTICAHelptopicdescribingtheoptionscurrentlydisplayed).
Switching between alternative sets of defaults (configurations).Optionsare
providedintheConfigurationsoptionspaneoftheOptionsdialogthatenableyou
tomaintainlibrariesofsettingsandswitchbetweenthemfordifferentprojects
(orusers).Forfurtherdetails,seeMaintainingDifferentConfigurationsof
STATISTICAonpage218andintheElectronicManual.
Chapter7:CustomizingSTATISTICA

Copyright StatSoft, 2011
STATISTICAQuickReference217
GRAPH CUSTOMIZATION
Interactive graph customization.ThecustomizationoptionsinSTATISTICA
graphicsincludehundredsoffeaturesandtoolsthatcanbeusedtoadjustevery
detailofthedisplayandassociateddataprocessing.Theseoptionsarearrangedin
ahierarchicalmanner,sothoseusedmostoftenareaccessibledirectlyvia
shortcutsbydoubleclickingorrightclickingonaspecificelementofthegraph.
Permanent settings and automation options.Theinitial(default)settingsofall
graphfeaturescanbeeasilyadjustedsothateventhedefaultappearanceand
behaviorofSTATISTICAGraphswillmatchyourspecificneedsand/orwillrequire
verylittleinterventiononyourpart.VariousaspectsofSTATISTICAGraphscanbe
permanentlyadjustedbyusing:
1.theOptionsdialog(selecttheToolstabandclickOptions),
2.thecomprehensivesystemofgraphstyles,
3.userdefinedgraphs,and
4.STATISTICAVisualBasic.
ThesefacilitiesarebrieflyreviewedinChapter6Graphs(page190).Formore
information,pleaserefertotheElectronicManual.
TherearenolimitstohowdeeplycustomizedyourSTATISTICAcustomgraphs
canbe,becauseSTATISTICAVisualBasic(withallitspowerfulcustomdrawingtools
aswellastheSTATISTICAbasedlibraryofgraphicsprocedures)canbeusedto
producevirtuallyanygraphicsormultimediaoutputsupportedbycontemporary
computerhardware.Thosecustomdevelopeddisplaysormultimediaoutputcan
beassignedtoSTATISTICAtoolbars,menus,ordialogsandbecomeapermanent
partofyourSTATISTICAapplication.
Chapter7:CustomizingSTATISTICA

Copyright StatSoft, 2011


218STATISTICAQuickReference
MAINTAINING DIFFERENT
CONFIGURATIONS OF STATISTICA
STATISTICAstoresallprogramsettingswhenyouexittheprogram,andrestores
themthenexttimeyoustarttheapplication.Youcancreatedifferent
configurationsofthesesettingsbyusingtheoptionsintheConfigurationsoptions
paneoftheOptionsdialog(selecttheToolstabandclickOptions).Withthe
configurationmanager,youcansavethecurrentprogramstateintoanewor
existingconfiguration,oryoucanrestartSTATISTICAusingadifferent
configuration.Otheroptionsincludetheabilitytoimportorexportconfigurations
toaseparatefilesotheycanbesharedamongSTATISTICAinstallations.
CUSTOMIZED CONFIGURATIONS
FOR INDIVIDUAL USERS ON A
NETWORK
Thesameprincipledescribedinthepreviousparagraphappliestonetwork
installationsofSTATISTICA.Onanetwork,STATISTICAisinstalledinonlyone
location(onaserver),buteachusercanstillconfigureSTATISTICAdifferently
becausethesettingconfigurationinformationisstoredlocally.Notethatyouneed
tochooseNetworkInstallationintheSTATISTICASetupprograminordertoinstall
itproperlyonanonlocaldrive(networkserver).Notethatanetworkversionof
STATISTICAisnecessarytoensureitsreliableoperationwhenusedbymorethan
oneuseratatimeorevenoneuserifSTATISTICAisnotinstalledonthelocal
system.

STATISTICA
VISUAL BASIC
Recording STATISTICA Visual Basic (SVB) Macros (Programs) ........ 224
Example: Recording an Analysis .......................................................... 230
ActiveX Objects and Documents (A Technical Note) ......................... 238
CHAPTER
8
8

Copyright StatSoft, 2011


STATISTICAQuickReference221
STATISTICA
VISUAL BASIC
TheSTATISTICAVisualBasic(SVB)language(integratedintoSTATISTICA)is
compatiblewiththeindustrystandardsandprovidesanotheruserinterfacetothe
functionalityofSTATISTICA,anditoffersincomparablymorethanjusta
supplementaryapplicationprogramminglanguagethatcanbeusedtowrite
customextensions.
NotethatSTATISTICAVisualBasicisnotMicrosoftVisualBasic6.0.StatSoftowns
andmaintainsthecodeforSTATISTICAVisualBasic.SVBiscompatiblewith
MicrosoftsVB.NET,MicrosoftsVisualBasicforApplications(VBA),andalsowith
MicrosoftsVisualBasic6.0(VB6).SVBscriptinglanguageisuniqueintermsofits
flexibilityandcompatibility,anditisalsoverypowerful.ItprovidesaccesstoVisual
BasicforApplications(usedforscriptingMicrosoftOfficeproducts)andaccessto
the.NETFrameworkwithinthesamefile(seeChapter10Programming
STATISITCAfrom.NET,page247).OtherAPIscanalsobeaccessedandleverage
theflexibilityofSVBsuchas,forexample,YahoosStockQuoteAPIorGoogle
AnalyticsAPI.SVBoffersapowerful64bitsolutionforsystemintegration,
expansion,andcustomdevelopment.
SVBtakesfulladvantageoftheobjectmodelarchitectureofSTATISTICAandis
usedtoaccessprogrammaticallyeveryaspectandvirtuallyeverydetailofthe
functionalityofSTATISTICA.Eventhemostcomplexanalysesandgraphscanbe
recordedintoVisualBasicmacrosandlaterberunrepeatedlyoreditedandused
asbuildingblocksofotherapplications.SVBaddsanarsenalofmorethan14,000
newfunctionstothestandardcomprehensivesyntaxofVisualBasic,thus
comprisingoneofthelargestandrichestdevelopmentenvironmentsavailable.
CHAPTER
8
8

Chapter8:STATISTICAVisualBasic

Copyright StatSoft, 2011


222STATISTICAQuickReference
Applications for STATISTICA Visual Basic programs.STATISTICAVisualBasic
programscanbeusedforawidevarietyofapplications,fromsimplemacros
recordedtoautomateaspecific(repeatedlyused)sequenceoftasks,toelaborate
customanalyticsystemscombiningthepowerofoptimizedproceduresof
STATISTICAwithcustomdevelopedextensionsfeaturingtheirownuserinterface.
Whenproperlylicensed,scriptsforanalysesdevelopedthiswaycanbeintegrated
intolargercomputingenvironmentsorexecutedfromwithinproprietary
corporatesoftwaresystemsorInternetorintranetportals.
SVBprogramscanalsobeattachedtovirtuallyallimportanteventsina
STATISTICAanalysissuchasopeningorclosingfiles,clickingoncellsin
spreadsheets,etc.;inthismanner,thebasicuserinterfaceofSTATISTICAcanbe
highlycustomizedforspecificapplications(e.g.,fordataentryoperations,etc.).
SeveralscriptinglanguagesareincludedinSTATISTICA.YoucanselectfromSVB,
EnhancedSVB,STATISTICAVisualBasic.NET,orR.
EnhancedSTATISTICAVisualBasicisasupersetofSTATISTICAVisualBasic,and
includesadditionalfeatures.STATISTICAVisualBasic.NETfeaturesdirect,native
accessto.NETAssemblies,i.e.,notthroughCOMInteropaswouldberequired
fromstandardSVB.
Risaprogramminglanguageandenvironmentforstatisticalcomputing.TheR
environmentanditssourcecodearefreelyavailableundertheGNUGPLlicense.
TheRcommunitymaintainsseveralcentralizedrepositoriesthatmakehundredsof
suchpackagesreadilyavailabletoallusersovertheInternet.NativeRscriptscan
berundirectlywithinSTATISTICA,STATISTICAEnterprise,andSTATISTICA
EnterpriseServer.

RoutputcanberetrievedasnativeSTATISTICASpreadsheetsandGraphs,and
managedviahighlyflexibleSTATISTICAWorkbookcontainers.
Chapter8:STATISTICAVisualBasic

Copyright StatSoft, 2011
STATISTICAQuickReference223

UsingtheRlanguagerequiresthatyouhaveRinstalledoneitherthesame
computerrunningSTATISTICAoracomputeraccessiblefromtheSTATISTICA
EnterpriseServerinordertouseitsspecializedroutinesandcapabilitiesto:
AddnewRbasedmodules
LeverageSTATISTICAssuperiorgraphics,flexiblespreadsheets,and
convenientworkbookcontainersforvariousdocumenttypestohandle
outputfromR
IntegrateRintoSTATISTICAEnterprisetomakespecializedRfunctionality
availableasreusableanalysistemplatesforusersnotfamiliarwiththeR
language,inasecure,rolebasedenterpriseanalysissystem
AddRbasedanalyticnodestoSTATISTICADataMiner,thusleveragingallR
capabilitiesinsideSTATISTICAandDataMinerworkspaces
BuildscalableRserversusingSTATISTICAEnterpriseServertohandle
securityandloadbalancing,andtotakeadvantageofmultipleprocessor
serverstorunRfordemandingand/orvalidatedenterpriseapplications
SeetheElectronicManualformoreinformationonthesescriptinglanguages.
Chapter8:STATISTICAVisualBasic

Copyright StatSoft, 2011


224STATISTICAQuickReference
RECORDING STATISTICA VISUAL
BASIC (SVB) MACROS (PROGRAMS)
Analysis Macros, Master (Log) Macros,
and Keyboard Macros
STATISTICAprovidesacomprehensiveselectionoffacilitiesforrecordingmacros,
i.e.,STATISTICAVisualBasic(SVB)programs,toautomaterepetitiveworkortobe
usedasameanstoautomaticallygenerateprogramsforfurthereditingand
modification.Themacroprogramsrecordedbythesefacilitiescanbesavedtobe
runasis,ortheycanbeusedasthebuildingblocksformorecomplexand
highlycustomizedVisualBasicapplicationprograms.AnalysisMacrosandMaster
Macrosfollowtheidenticalsyntaxandcanlaterbemodified,butbecauseofthe
differentwaysinwhicheachofthemiscreated,theyofferdistinctiveadvantages
anddisadvantagesforspecificapplications.
Analysis macros. SimpleAnalysisMacrosautomaticallyrecordthesettings,
selections,andchosenoptionsforaspecificanalysis.Notethatthetermanalysis
inSTATISTICAdenotesonetaskselectedeitherfromtheStatistics,DataMining,or
Graphstabsandcanbeverysmallandsimple(e.g.,onescatterplotrequested
fromtheGraphstab),orveryelaborate(e.g.,acomplexstructuralequation
modelinganalysisselectedbychoosingthatoptionfromtheStatisticstab,and
involvinghundredsofoutputdocuments).Afterselectinganyofthestatistical
optionsfromtheStatisticsorDataMiningtabsorgraphicsoptionsfromthe
Graphstab,allactionssuchasvariableselections,optionsettings,etc.,are
recordedbehindthescenes;atanytimeyoucantransferthisrecording(i.e.,the
VisualBasiccodeforthatmacro)totheVisualBasicEditorwindow.TheCreate
Macrocommandisavailablefromeveryanalysisdialogviathedropdownmenu
displayedbyclickingtheOptionsbuttonortheshortcutmenuaccessedbyright
clickingtheanalysisbuttonwhentheanalysisisminimized.
Master macros (logs).YoucanrecordaMasterMacroorMasterLogofanentire
session,whichcanconsistofoneormanyanalyses.Thisrecordingwillconnect
analysesperformedwithvariousanalysisoptionsfromtheStatistics,DataMining,
Chapter8:STATISTICAVisualBasic

Copyright StatSoft, 2011
STATISTICAQuickReference225
and/orGraphstabs.However,unlikesimpleAnalysisMacros,youcanturnthe
recordingofMasterMacrosonandoff.TheMasterMacrorecordingwillbegin
whenyouturnontherecording[selecttheToolstab,clickMacro,andselectStart
RecordingLogofAnalyses(MasterMacro)],anditwillendwhenyoustopthe
recording(clickMacro,andselectStopRecording).Inbetweentheseactions,all
fileselectionsanddatamanagementoperationsarerecorded,asaretheanalyses
andselectionsfortheanalyses,inthesequenceinwhichtheywerechosen.
Keyboard macros. Thistypeofmacrorecordingstoresthesequencesofkeyboard
input.WhenyouselecttheToolstab,clickMacro,andselectStartRecording
KeyboardMacro,STATISTICAwillrecordtheactualkeystrokesenteredviathe
keyboard.WhenyouStopRecording,aSTATISTICAVisualBasiceditorwindow
openswithasimpleprogramcontainingasingleSendKeyscommandwithsymbols
thatrepresentallthedifferentkeystrokesperformedduringtherecordingsession.
Notethatthistypeofmacroisverysimpleinthesensethatitwillnotrecordany
contextinwhichtherecordedkeystrokesarepressedandwillnotrecordtheir
meaning(i.e.,commandsthesekeystrokestrigger),butthisfeaturemakesthem
usefulforspecificapplications,e.g.,toautomateenteringtext,suchastitles,
selectionconditions,etc.
STATISTICA Visual Basic editor and debugger.Programscanbewrittenfrom
scratchusingtheSTATISTICAVisualBasicprofessionaldevelopmentenvironment,
whichfeaturesaprogrameditorwithapowerfuldebugger(withbreakpoints,etc.)
andmanyfacilitiesthataidinefficientcodebuilding.Thesefacilitiesaredescribed
indetailintheSTATISTICAElectronicManual.
WheneditingmacroprogramsbytypinginVisualBasiccommandsorprogram
commandsspecifictoSVB,theeditordisplaystypeaheadhelptoillustratethe
appropriatesyntax.Helponthemembersandfunctionsforeachclass(object)is
alsoprovidedinline.

Chapter8:STATISTICAVisualBasic

Copyright StatSoft, 2011


226STATISTICAQuickReference
Whenexecutingaprogram,youcansetbreakpointsintheprogram,stepthrough
itlinebyline,andobserveandchangethevaluesofvariablesinthemacro
programasitisrunning.

Alsoavailableisaninteractivedialogeditorthatenablesyoutobuilddialogboxes.

Tosummarize,STATISTICAVisualBasicisnotonlyapowerfulprogramming
language,butitrepresentsaverypowerful,professionalprogramming
environmentfordevelopingsimplemacrosaswellascomplexcustom
applications.
Visual Basic from other applications.SVBprogramscanalsobedevelopedby
enhancingVisualBasicprogramscreatedinotherapplications(e.g.,Excel)by
callingSTATISTICAfunctionsandprocedures.
Chapter8:STATISTICAVisualBasic

Copyright StatSoft, 2011
STATISTICAQuickReference227

Executing STATISTICA Visual


Basic Programs
STATISTICAVisualBasicprogramscanbeexecutedfromwithinSTATISTICA,but
becauseoftheindustrystandardcompatibilityofSVB,youcanalsoexecuteits
programsfromanyothercompatibleenvironment(e.g.,Excel,Word,orastand
aloneVisualBasiclanguage).Inpractice,youwouldtypicallycallSTATISTICA
functionsfromVisualBasicinanotherapplication.Note,however,thatwhenyou
runanSVBprogramorattempttocallSTATISTICAfunctionsfromanyother
application,allcallstotheSTATISTICAspecificfunctions(asopposedtothegeneric
functionsofMSVisualBasic)willbeexecutedonlyiftherespectiveSTATISTICA
librariesarepresentonthecomputerwheretheexecutiontakesplace.Thatis,you
mustbealicenseduseroftherespectiveSTATISTICAlibrariesofprocedures.Note
thatthislargelibraryofSTATISTICAfunctions(morethan14,000procedures)is
transparentlyaccessiblenotonlytoVisualBasic,butalsotocallsfromanyother
compatibleprogramminglanguageorenvironment,suchasC/C++,C#,orDelphi.
Performance of STATISTICA Visual Basic programs. Whiletheobvious
advantagesofVisualBasic(comparedtootherlanguages)areitseaseofuseand
familiaritytoaverylargenumberofcomputerusers,thepossibledrawbackof
VisualBasicprogramsisthattheydonotperformasfastasapplicationsdeveloped
inlowerlevelprogramminglanguages(suchasC).However,thatpotential
problemdoesnotapplytoSVBapplications,especiallythosethatrelymostlyon
executingcallstoSTATISTICAsanalytic,graphics,anddatamanagement
procedures.TheseproceduresfullyemploySTATISTICAtechnologyandperformat
aspeedcomparabletorunningtherespectiveproceduresinSTATISTICAdirectly.
Chapter8:STATISTICAVisualBasic

Copyright StatSoft, 2011


228STATISTICAQuickReference
Structure of STATISTICA Visual Basic.STATISTICAVisualBasicconsistsoftwo
majorcomponents:1)ThegeneralVisualBasicprogrammingenvironmentwith
facilitiesandextensionsfordesigninguserinterfaces(dialogs)andfilehandling,
and2)theSTATISTICAlibrarieswiththousandsoffunctionsthatprovideaccessto
practicallyallfunctionalityofSTATISTICA.
TheVisualBasicprogrammingenvironmentfollowstheindustrystandardsyntax
conventionsoftheMicrosoftVisualBasicLanguage;thefewdifferencespertain
mostlytothemannerinwhichdialogsarecreated(seeCustomDialogsand
CustomUserInterfacesintheSTATISTICAElectronicManual),andaredesignedto
offerprogrammers/developersmoreflexibilityinthewayuserinterfacesare
handledincomplexprograms.IntheSVBprogrammingenvironment,dialogscan
beentirelyhandledinsideseparatesubroutines,whichcanbeflexiblycombined
intolargermultipledialogprograms;MSVisualBasicisformbased,wherethe
formsordialogs,andalleventsthatoccurinthedialogs,arehandledinseparate
programunits.
Attaching Macros to Toolbars and Menus
ASTATISTICAVisualBasicprogramcanbesavedandthenattachedtoacustom
classicmenu/toolbarortotheQuickAccesstoolbarontheribbonbar.Thisenables
youtoeasilycustomizeandextendtheoperationandappearanceofSTATISTICA
withyourowncustommacros.Toutilizethesefacilities,savethemacroby
selectingSaveAsGlobalMacrofromtheFilemenu.Then,tocustomizethemenus
and/ortoolbars,selectCustomizefromtheToolsmenutodisplaytheCustomize
dialog.Toaddthemacrotoamenuortoolbar,choosetheCommand/Macrostab,
andselectMacrosfromtheCategorieslist.Allyourglobalmacroswillbelistedin
theCommandssectionofthetab.
Chapter8:STATISTICAVisualBasic

Copyright StatSoft, 2011
STATISTICAQuickReference229

YoucanthenselectanddragthespecificitemfromtheCommandslistontoany
menuortoolbar.Notethatasyourmousepointerhoversoveramenu,themenu
willexpand,enablingyoutoinserttheiteminanysubmenuaswell.Oncethe
macroisplacedonthemenuortoolbarwhiletheCustomizedialogisdisplayed,
youcanrightclickthemacroandchangetheappearanceandtextoftheitem,as
wellasaddicons.
Running Macros from a command line. WithSTATISTICA,youcanexecuteSVB
programsfromthecommandlinebyusingthe/RunMacro=commandline
parameter.Thesyntaxis:
statist.exe /RunMacro=macroname
wheremacronameisthefilenameofthemacro.Ifafullpathisnotspecified,
STATISTICAwillattempttorunthemacrofromtheapplicationscurrentlyselected
directory(whichisWindowsdefaultbehavior).
Ifthemacrodoesnotmaketheapplicationoranydocumentvisible(throughthe
Application.Visible = True,orsimilardocumentproperties),theSTATISITCA
instancewillautomaticallyshutdownwhencomplete.Iftheapplicationismade
visible,theapplicationwillremainvisibleafterthemacrocompletes,andyouwill
needtoshutdowntheprogram.
Chapter8:STATISTICAVisualBasic

Copyright StatSoft, 2011


230STATISTICAQuickReference
EXAMPLE: RECORDING
AN ANALYSIS
Thisexampleillustrateshowtorecordananalysisintoascriptthatcanbe
executedtoreruntheanalysis.Thenthescriptwillbeeditedandcombinedwith
anotherscripttocreateacustomizedscriptthatcanrunanalysesondemand.
Additionally,thisexampleshowshowyoucanuseattachedscriptstoautoupdate
andrerunanalysesfromresultsworkbooks.
StartbyopeningtheexampleAdstudydataset.SelecttheHometab,clickthe
Openarrow,andselectOpenExamplestodisplaytheOpenaSTATISTICAData
Filedialog.DoubleclickontheDatasetsfile,andthenopentheSTATISTICAdata
setAdstudy.sta.
Then,selecttheStatisticstab.IntheBasegroup,clickBasicStatisticstodisplay
theBasicStatisticsandTablesStartupPanel.SelectDescriptivestatistics.

ClicktheOKbuttontodisplaytheDescriptiveStatisticsdialog.
Chapter8:STATISTICAVisualBasic

Copyright StatSoft, 2011
STATISTICAQuickReference231

ClicktheVariablesbuttontodisplaytheSelectthevariablesfortheanalysis
dialog.SelectvariablesMEASURE01throughMEASURE23byclickingMEASURE01
anddraggingtoMEASURE23,andthenclickOK.
IntheDescriptiveStatisticsdialog,selecttheAdvancedtab,andnotethe
numerousoptionsavailable.

Forthisexample,wewillleavealloptionsattheirdefault.ClicktheSummary
buttontodisplaythedescriptivestatisticsfortheselectedvariables.

Chapter8:STATISTICAVisualBasic

Copyright StatSoft, 2011


232STATISTICAQuickReference
Whenyouproducetheresultsworkbook,theDescriptiveStatisticsdialogis
automaticallyminimizedsoyoucanseetheresults.Torestorethedialog,clickthe
DescriptiveStatisticsbuttonontheAnalysisBarinthelowerleftofthescreen.
Whileyouarerunningthisanalysis,STATISTICAautomaticallyrecordsallthe
analysisstepsbehindthescenes.YoucannowproduceaSTATISTICAVisualBasic
(SVB)macrotorecreatethisanalysis.IntheDescriptiveStatisticsdialog,clickthe
button,andselectCreateMacrofromthedropdownmenu.TheNew
Macrodialogwillbedisplayed,whereyoucannamethemacroandentera
description.Leavealltheentriesattheirdefaults,andclickOK.AnSVBmacro
windowwillbedisplayed,containingtherecordedDescriptiveStatisticssession.

Torunthismacro,selecttheDebugtab,andintheRungroup,clickRun(orpress
F5onyourkeyboard).TheexactDescriptiveStatisticsresultsthatweregenerated
intheinitialanalysiswillbereproduced.
LookattheSVBmacroforamoment.Towardthetop,oneofthelinesis:
Set newanalysis = Analysis (scBasicStatistics, ActiveInputDataSet)
ThisistellingthemacrothatitisgoingtoruntheBasicStatisticsanalysis,andthat
itwillbeusingtheactivedataset,thatis,thespreadsheetthatiscurrently
selectedwhenthemacroruns.
Afewlinesfurtherdownisasectionthatstartswith:
Dim oAD2 As STABasicStatistics.BasDescriptiveStatistics
Chapter8:STATISTICAVisualBasic

Copyright StatSoft, 2011
STATISTICAQuickReference233
andunderthatarepropertiessuchas:
.PairwiseDeletionOfMD = True
Thesepropertiescorrespondtoalltheoptionsthatwereavailableonthedifferent
tabsoftheDescriptiveStatisticsdialog.Everyoptioninthedialogisrepresented
byaproperty,andallthecurrentsettingsarerecorded.Ifyoudecidetoincludea
MedianandtheSumofeachofthevariables,itiseasytoaddthistotheSVB
macro;justfindthelinesthatread:
.Median = False
and
.Sum = False
andchangetheseto:
.Median = True
and
.Sum = True
Now,runthemacroagainbypressingF5.Anewresultsspreadsheetwillbeadded
totheworkbook,thistimewithnewcolumnsofMedianandSum:

Letskeepthemacrowindowopenandstartanewanalysisonthesamesample
dataset.SelecttheAdstudyspreadsheettobringittothefront.SelecttheGraphs
tab,andintheMoregroup,click2D.SelectNormalProbabilityPlotstodisplaythe
NormalProbabilityPlotsdialog.
Chapter8:STATISTICAVisualBasic

Copyright StatSoft, 2011


234STATISTICAQuickReference

ClicktheVariablesbutton,andintheSelectVariablesforProbabilityPlotdialog,
selectvariablesMEASURE01throughMEASURE03.ClickOKtoclosethisdialog,
andclickOKintheNormalProbabilityPlotsdialog.ThreeProbabilityPlotgraphs
willbeplacedintheresultsworkbook,oneforeachofthethreevariablesthat
wereselected.

ThestepsoftheProbabilityPlotanalysiswererecordedjustastheywereforthe
DescriptiveStatisticsanalysis.Tocreateanewmacrowiththesesteps,bringthe
NormalProbabilityPlotdialogtothefrontbyclickingthatbuttonontheAnalysis
Barinthelowerleftofthescreen,clickthe button,andselectCreate
Macrofromthedropdownmenu.IntheNewMacrodialog,clickOK,andanew
SVBMacrowindowisopenedwiththerecordedProbabilityPlotscript.
Chapter8:STATISTICAVisualBasic

Copyright StatSoft, 2011
STATISTICAQuickReference235

AswiththeDescriptiveStatisticsanalysis,alltheoptionsselectedinthe
ProbabilityPlotdialogarespecifiedaspropertieswithinthemacro.Forinstance,
tochangethisfromaNormalProbabilityPlottoaHalfNormalProbabilityPlot,
locatethefollowingline:
.GraphType = scProbNormal
andchangeitto:
.GraphType = scProbHalfNormal
Also,letsexpandthevariablestoincludevariableMEASURE04.Todothis,findthe
followingline:
.Variables = "3-5"
Thislinecorrespondstothevariablesselectedfortheplots.Sinceweselected
MEASURE01throughMEASURE03,andthesearevariablenumbers3through5
fromthedataset,thisstringwasrecorded.ToaddMESURE04(variablenumber6),
changethislineto:
.Variables = 3-6
NowrunthemacrobypressingF5.FournewgraphsareproducedasHalfNormal
ProbabilityPlotsforvariablesMEASURE01throughMEASURE04.
Thisexamplehasdemonstratedhowyoucanrunanyanalysis,andthencreatea
macrooftheanalysisthatcanbeeditedandrerun.Additionally,thisexamplehas
Chapter8:STATISTICAVisualBasic

Copyright StatSoft, 2011


236STATISTICAQuickReference
shownhowthesemacroscanbecombinedtodevelopmacrosthataremore
complex.Thisisthebuildingblockofcreatingyourownpowerfulcustomized
analysesusingtheSVBlanguage.
Rerunning Analyses from
Results Workbooks
Inthepreviousexample,youlearnedthatallanalysesinSTATISTICAwillrecordthe
stepsusedtoproducethem,andthesecanbeloadedintoamacrothatyoucan
editandrun.Whenananalysisproducesresultsthatareplacedinaworkbook,
STATISTICAautomaticallyassociatestherecordedscriptsstepstotheworkbook
folderthatcontainstheresults.Thisenablesyoutoeitherreruntheanalysisorto
resumeananalysis.
Thusfar,wehaveproducedseveralinstancesofrunningbothDescriptiveStatistics
andProbabilityPlots.Theresultsworkbooklookssimilartothefollowing
illustration.

Noticethatthereisaredarrowoneachworkbookfolder.Thisisanindicatorthat
thescriptthatproducedtheresultsinthatfolderhasbeenattachedtothefolder.
ThisenablesSTATISTICAtorerunorresumetheanalysis.
Torerunananalysis,rightclickononeofthefolderslabeledDescriptivestatistics
dialog,andfromtheshortcutmenu,selectRerunAnalysis.TheRerunAnalysis
dialogwillbedisplayed.
Chapter8:STATISTICAVisualBasic

Copyright StatSoft, 2011
STATISTICAQuickReference237

HereyoucanchoosetoUseoriginaldatasourceorUsenewdatasource.The
latteroptiongivesyouthepowerfulabilitytocreatetemplatesthatcanthenbe
appliedtonewdatasources.Inadditiontospecifyingthedatasource,youcan
choosetoReplacecurrentfoldercontentsorOutputtonewfolder.Inthis
example,leavethedefaults,andclickOK.Youwillseethatthecontentsofthe
folderarebrieflydeletedandthenaddedagainastheanalysisisrerun.
Onepurposeforthisfeatureistheabilitytoupdate/rerunresultsfromcomplex
analysesifnewdataisenteredintothespreadsheet.Forinstance,ifthedatain
theopendatafileAdstudy.stahasbeenchangedandtheanalysisisrerun,thenew
resultswillbecalculatedwiththenewdata.
Theresumeanalysisfunctionalityenablesyoutobringananalysisbacktothe
pointbeforetheresultsweregenerated,allowingyoutoselectdifferentoptionsor
continueananalysisinprogress.RightclickthesameDescriptivestatisticsdialog
folder,andfromtheshortcutmenu,selectResumeAnalysis.TheResumeAnalysis
dialogwillbedisplayed.Thisdialogalsocontainsoptionstospecifytheinputdata
source(originalornew).TheOutputoptionsforthenewresultsaretoOutputto
currentfolder(asifthisisjustanextensionofthepreviousanalysis)orOutputto
newfolder(asifthisisabrandnewanalysis).

Leavethedefaultsastheyare,andclickOK.TheDescriptiveStatisticsdialogwill
bedisplayed,withalltheoptionssettowhatwasusedwhentheselectedoutput
wascreated.SincethedefaultwastoOutputtocurrentfolder,clickingthe
Summarybuttonwillgeneratenewoutputtothesamefolder.
Chapter8:STATISTICAVisualBasic

Copyright StatSoft, 2011


238STATISTICAQuickReference
ActiveX OBJECTS AND
DOCUMENTS
(A TECHNICAL NOTE)
ThetermActiveXisusedindifferentcontexts,anditsdefinitionsstressdifferent
aspectsofthatconcept.ItsusewithinSTATISTICA,however,canbegroupedinto
twogeneralcategories:ActiveXobjectsandActiveXdocuments.
ActiveX objects.AnActiveXobjectiswhatwasoncereferredtoasanOLE(Object
LinkingandEmbedding)object.AtitsheartistheMicrosoftCOM(Component
ObjectModel)technologythatmakesitpossibleforobjectstobeaccessedina
uniformmanner.Throughtheuseofstandardprotocols,objectscreatedinone
applicationcanbestoredandeditedinadifferentapplication.Tosupportthis
functionality,thecontainingobjectneedstobeanActiveXobjectclient,andthe
applicationthatinitiallycreatedtheobjectneedstobeanActiveXobjectserver.
STATISTICAisboth.AsanActiveXobjectclient,STATISTICAallowsyoutoembed
andlinkobjectsfromotherapplicationsinspreadsheets,graphs,andreports.As
anActiveXobjectserver,itallowsyoutoembedandlinkspreadsheetsandgraphs
intootherapplications.
ActiveX documents.ActiveXdocumentstaketheActiveXcontrolsonestep
further,inthattheyallowentiredocumentstobeembeddedintoother
applications.AnActiveXdocumentcontainerallowsotherapplicationdocuments
tobeusedwithinit,andanActiveXdocumentserverallowsitsdocumentstobe
usedwithinanyActiveXdocumentcontainer.Again,STATISTICAdoesboth.
STATISTICAWorkbooksareActiveXdocumentcontainers,andallowdocuments
fromotherActiveXserverstobedisplayedwithintheworkbook.Examplesofthis
areWordandExcel;thesedocumentscanbeuseddirectlyfromwithina
STATISTICAWorkbook.Similarly,STATISTICASpreadsheets,Graphs,andReports
areActiveXdocumentservers,andtheyalsocanbeplacedwithinanyActiveX
documentcontainersuchasMicrosoftInternetExplorer.
Office integration and ActiveX documents.TheActiveXdocumenttechnology
hasspecialapplicationwithWordandExceldocuments.STATISTICAcanopen
theseparticulardocumentsnativelyintheirownwindowswithintheSTATISTICA
Chapter8:STATISTICAVisualBasic

Copyright StatSoft, 2011
STATISTICAQuickReference239
workspace.ThisOfficeintegrationenablesyoutouseExceldocumentsasdata
sourcesandWorddocumentsasreportsforanalyses.Whenthedocumentsare
openintheSTATISTICAwindow,theappropriatemenusandtoolbarsfor
Excel/Wordareavailableforuse.
Chapter8:STATISTICAVisualBasic

Copyright StatSoft, 2011


240STATISTICAQuickReference


Copyright StatSoft, 2007
STATISTICA QuickReference 241
STATISTICA
QUERY
Overview ................................................................................................. 243
Quick, Step-by-Step Instructions .......................................................... 244
In-Place Processing of Data on Remote Servers
(The IDP Technology Option) .......................................................... 245
OLAP Cubes ............................................................................................ 246
Large Database Files ............................................................................ 246

CHAPTER
9
9

Copyright StatSoft, 2011


STATISTICAQuickReference243
STATISTICA
QUERY
Note:Foranexplanationofalltechnicaltermsusedinthisoverview(e.g.,ODBC,
SQL,OLAP,etc.),pleaserefertotheglossaryintheSTATISTICAElectronicManual,
accessiblebyclickingHelpontheHelptabintheHelpgroup.
ThischapterincludesabriefoverviewofSTATISTICAQuery,aflexibletoolfor
accessingdatafromexternaldatabases.Italsoincludesinformationonretrieving
datafromOLAPCubeproviderssuchasMSOLEDBProviderforAnalysisServices
orSAPBusinessWarehouseMDX.
OVERVIEW
STATISTICAQueryisusedtoaccessdataeasilyfromawidevarietyofdatabases
(includingmanylargesystemdatabasessuchasOracle,MSSQLServer,Sybase,
etc.)usingMicrosoftsOLEDBconventions.OLEDBisapowerfuldatabase
technologythatprovidesuniversaldataintegrationoveranenterprisesnetwork,
frommainframetodesktop,regardlessofthedatatype.OLEDBoffersamore
generalizedandmoreefficientstrategyfordataaccessthantheolderODBC
conventionsbecauseitallowsaccesstomoretypesofdataandisbasedonthe
ComponentObjectModel(COM).
STATISTICAQuerysupportsmultipledatabasetables;specificrecords(rowsof
tables)canbeselectedbyenteringSQLstatements,whichSTATISTICAQuery
automaticallybuildsforyouasyouselectthecomponentsofthequeryviaa
simplegraphicalinterfaceand/orintuitivemenuoptionsanddialogs.Therefore,an
CHAPTER
9
9

Chapter9:STATISTICAQuery

Copyright StatSoft, 2011


244STATISTICAQuickReference
extensiveknowledgeofSQLisnotnecessaryinorderforyoutocreateadvanced
andpowerfulqueriesofdatainaquickandstraightforwardmanner.Multiple
queriesbasedononeormanydifferentdatabasescanalsobecreatedtoreturn
datatoanindividualspreadsheet,andyoucanmaintainconnectionstomultiple
externaldatabasessimultaneously.
STATISTICA QUERY: QUICK STEP-
BY-STEP INSTRUCTIONS
ThestepsnecessarytoretrieveexternaldataviaSTATISTICAQueryareoutlined
below:
1.SelecttheHometab.IntheFilegroup,clicktheOpenarrow.SelectOpen
ExternalDataCreateQuerytodisplaytheDatabaseConnectiondialog.
(YoucanalsoselecttheDatatab.IntheManagegroup,clickExternalData
andselectCreateQuerytodisplaytheDatabaseConnectiondialog.)Inthis
dialog,selectapredefineddatabaseconnection(theprovider,datasource
location,andadvancedsettingsoftheserverordirectoryonwhichthedata
resides).
Notethatifyouhavenotalreadycreatedthedatabaseconnection,youcan
dosobyclickingtheNewbuttonintheDatabaseConnectiondialog.The
DataLinkPropertiesdialogwillbedisplayed,whichwillguideyouthrougha
stepbystepwizardtocreateadatabaseconnection.Forspecific
documentationwhenyouareusingtheDataLinkPropertiesdialog,press
theF1keyonyourkeyboardtodisplaytheMicrosoftDataLinkHelp

.
2.AfteryouhaveselectedadatabaseconnectionandclickedtheOKbuttonin
theDataLinkPropertiesdialog,youwillhaveaccesstoSTATISTICAQueryin
whichyoucancreateaSQLstatementbyspecifyingthedesiredtables,
fields,joins,criteria,etc.(viatheTable,Join,andCriteriamenus)tobe
includedinyourquery.
Chapter9:STATISTICAQuery

Copyright StatSoft, 2011
STATISTICAQuickReference245

3.Onceyouhavespecifiedaquery,selectReturnDatatoSTATISTICAfromthe
Filemenu.TheReturningExternalDatatoSpreadsheetdialogwillbe
displayed,inwhichyoucanspecifythenameofthequery,whereyouwant
STATISTICAQuerytoputthedatathatthequeryreturns,andadditional
options.
SeetheElectronicManualforfurtherdetails.
IN-PLACE PROCESSING OF
DATA ON REMOTE SERVERS
(THE IDP TECHNOLOGY OPTION)
Thequeryfacilities(describedintheprevioussections),whenofferedaspartof
theenterpriseversionsofSTATISTICA(seeSTATISTICAEnterpriseSystems,page
278),areadditionallyenhancedbyoptionstoprocessdatafromremoteservers
inplace,thatis,withouthavingtoimportthemandcreatealocaldatafile.This
InPlaceDatabaseProcessing(IDP)technologyisparticularlyusefulforprocessing
extremelylargedatafileswhereitcanproducesignificantperformancegainsand
enableSTATISTICAuserstoprocessdatafilesthatexceedthestoragecapacityof
thelocaldeviceoreventheSTATISTICAEnterpriseServer.
Technical note.TheIDPtechnologyisbasedondistributedprocessing
architecture,wherethequeriesareperformedontheserverside(usingtheserver
Chapter9:STATISTICAQuery

Copyright StatSoft, 2011


246STATISTICAQuickReference
CPUresources)andtherespectiverecordssenttotheSTATISTICAcomputerwhere
theyaresimultaneously(asynchronously)processedastheybecomeavailable.
OLAP CUBES
OLAP(OnLineAnalyticProcessing)isagenerictermforasystemthatprovides
efficientaccesstosummarydataaboutverylargedatabases.Unlikeordinary
relationaldatabases,whichorganizedataasasetofwelldefined,two
dimensionaltables,anOLAPdatawarehouserepresentsdataatmanylevelsof
detailinmultidimensionaldatasetsknownascubes.WhenaSTATISTICAuser
wantstoperformananalysisagainstdatafromanOLAPCube,thedatamustbe
reducedtoatwodimensionalformcasesandvariablesthatcanbe
representedinaSTATISTICAspreadsheet.TheSTATISTICAQuerytoolprovidesa
graphical,draganddropinterfaceforspecifyingthedimensionsandlevelsof
detailthatwillbeextractedfromthecubetofeedintothequery.TheMDX
(MultiDimensionaleXpressions)modeistriggeredautomaticallywhenanOLAP
datasourceisselected.
CustomerswhorequireOLAPintegrationwillusuallyhavesophisticateddatabase
supporttechniciansthroughtheirinhouseinformationtechnologydepartment
whocanhelpdesignthesequeries.Becausetheconfigurationofthedimensionsin
anOLAPcubeisdeterminedbythecustomersdatabaseadministrators,StatSoft
canprovideonlylimitedassistanceinthisarea.
LARGE DATABASE FILES
STATISTICAproductsaredesignedforlargescaleanalytics;consequently,they
integratewellwithdatabasesystemsdesignedformanagingverylargeamountsof
data,suchasTeradataandothers.Forexample,STATISTICAcanbothextractdata
foranalysisfromTeradata,anditcanalsoscoreresultsdirectlyinsideTeradata
throughdeploymentcodecreatedbySTATISTICADataMinerandappliedtothe
Teradataasauserdefinedfunction,whichsignificantlyacceleratesprocessingof
largeamountsofdata.

PROGRAMMING
STATISTICA
FROM .NET
Adding the STATISTICA Object Library into Your .NET Project .......... 249
Manually Creating the COM Interop Library ....................................... 251
Supporting Multiple Versions of STATISTICA ...................................... 251
Instantiating STATISTICA ...................................................................... 252
The Library Version of STATISTICA ....................................................... 252
1
1
0
0

CHAPTER

Copyright StatSoft, 2011


STATISTICAQuickReference249
PROGRAMMING STATISTICA
FROM .NET
VirtuallyeveryaspectofSTATISTICAisexposedasasetofCOMinterfacesthatare
registeredonamachinewhenSTATISTICAisinstalled.Since.NETbasedlanguages
cannotcommunicatewithCOMdirectly,awrapperclasscalledtheCOMInterop
canbeutilizedtointegratetheSTATISTICAlibrariesintoyour.NETproject.The
COMInteroplayeriscreatedautomaticallybytheVisualStudio.NETIDEwhenyou
importaCOMinterface.TheCOMInteroplayerhandlesallofthedetailsregarding
interactingwiththeCOMlibrariesin.NET.WiththeCOMInteroplayerinplace,
theSTATISTICACOMinterfacesbehavelikeanyother.NETobject.
Adding the STATISTICA Object Library
into Your .NET Project
The.NETInteroplayeriscreatedautomaticallybyaddingthedesiredSTATISTICA
COMinterfacesintoyour.NETproject.STATISTICAObjectLibraryisthebase
STATISTICACOMlibrary.ToaddtheSTATISTICAObjectLibrarytoa.NETproject,
firstselectthedesired.NETprojectinSolutionExplorer,andthenselectAdd
Referencefromtheshortcutmenu(accessedbyrightclickingonthe.NETproject).

1
1
0
0

CHAPTER
Chapter10:Programmingfrom.NET

Copyright StatSoft, 2011


250STATISTICAQuickReference
TheAddReferencedialogwillbedisplayed.SelecttheCOMtab.Fromthe
ComponentNamelist,selectSTATISTICAObjectLibrary,andclickOK.

Atthispoint,thenecessaryCOMInteroplibraryiscreatedautomatically.Under
theprojectReferencesnode,youwillnowseetheentrySTATISTICA.

ThefileInterop.STATISTICA.dllisalsoaddedtotheprojectoutputdirectory.The
STATISTICACOMInteroplibraryisstoredinthisfile.ToviewtheSTATISTICAobject
libraryfromyour.NETproject,rightclickontheSTATISTICAreference,andfrom
theshortcutmenu,selectViewinObjectBrowser.

Chapter10:Programmingfrom.NET

Copyright StatSoft, 2011
STATISTICAQuickReference251
Manually Creating the
COM Interop Library
ItisalsopossibletocreatetheCOMInteroplibrarymanuallyandimportitinto
your.NETproject.Thisgivesyoutheabilitytospecifyadifferentnameforthe
InteropDLLaswellasdefineacustomnamespace.Theprogramthatenablesyou
tocreateanInteropisTLBIMP.EXE.FromaVisualStudiocommandprompt,
executeTLBIMPwithaninitialparameterofthetypelibrarysource.Intheexample
below,theoutputDLLnameandnamespacearealsospecified.

Inthisexample,wereferencethefileSTATIST.EXEsincethatexecutablecontains
theSTATISTICAObjectLibrarytypelibrary.OncetheInteropDLLisgenerated,you
canaddittoyour.NETprojectbyselectingAddReferencefromtheSolution
Explorerasbefore,butthistimeclicktheBrowsebuttontoselectthenewly
createdInteropDLL.
Supporting Multiple Versions
of STATISTICA
TosupportmultipleversionsofSTATISTICA,itisnecessarytomaintainseparate
STATISTICAObjectLibraryInteropDLLsforeachversionofSTATISTICAyouwantto
support.YoucanusetheTLBIMPcommandtogenerateInteropDLLsagainst
specificversionsofSTATIST.EXEandotherDLLs.Whendistributingtheapplication,
ensurethatthecorrectversionoftheSTATISTICAInteropDLLisdeployedwith
your.NETapplication.
Chapter10:Programmingfrom.NET

Copyright StatSoft, 2011


252STATISTICAQuickReference
Instantiating STATISTICA
BecauseofitsCOMarchitecture,STATISTICAcanbeincorporatedintomany
differentdevelopmentenvironments.WhenusingSTATISTICAfromanexternal
developmentenvironment,itisnecessarytohaveatoplevelobjectcalledthe
applicationobject.Theapplicationobjectistheapplicationitselfandwillcontain
otherobjects(forexample,spreadsheetsandgraphs),butaccesstotheseother
objectsisrestrictedunlesstheapplicationobjectisrunning.
AssumingyouareusingthedefaultnamespaceSTATISTICA,theinterfaceyou
shoulddeclareyourvariableasisSTATISTICA.Application.Tocreateaninstanceof
STATISTICA,setyourvariableequaltonew STATISTICA.ApplicationClass().
STATISTICA.Application pApp = (STATISTICA.Application)
new STATISTICA.ApplicationClass();
pApp.Visible = true;
WhenaninstanceoftheSTATISTICA.ApplicationClassiscreated,aSTATIST.EXE
processwillbelaunched.ThisisequivalenttolaunchingSTATISTICAfromtheStart
menu.TheSTATISTICAinstanceisinitiallyhiddenbutcanbemadevisible.Sinceit
isaseparateprocess,allcallstothisinstancearemadeoutofprocess.
The Library Version of STATISTICA
InadditiontotheSTATISTICA.Applicationobject,thereisalsoalighterweight,
higherperformanceversionoftheobjectcalledSTATISTICA.Library.TheLibrary
versionislicensedseparatelyandthereforemaynotbeavailablewithyour
installation.ItcontainsidenticalinterfacesastheSTATISTICA.Applicationlibrary.
AnyexistingcodethatusestheApplicationobjectcanbereplacedwiththeLibrary
object.
ThemainrestrictionisthattheSTATISTICAuserinterfacefeaturesarenotavailable
fromtheLibraryversion.Therefore,intheexampleabove,iftheApplicationobject
wasinstantiatedasanewSTATISTICA.LibraryClass,itwouldnotbepossibleto
maketheobjectvisible(andshowtheSTATISTICAinterface).
TheLibraryversionofSTATISTICAisloadedinprocess,whichmeansaccessingits
COMinterfacesismoreefficientthanusingtheApplicationversionoftheobject
Chapter10:Programmingfrom.NET

Copyright StatSoft, 2011
STATISTICAQuickReference253
(whichisloadedoutofprocess).Sinceitisloadedinprocess,multipleversionsof
thelibrarycannotbeinstantiated.Normally,youwouldonlyinstantiateone
LibraryobjectoroneApplicationobjectinyourprogram.
Chapter10:Programmingfrom.NET

Copyright StatSoft, 2011


254STATISTICAQuickReference

Copyright StatSoft, 2011


STATISTICA Quick Reference 255
GETTING
MORE HELP
Electronic Manual More than 100 Megabytes of
References, Illustrations, and Examples ........................................ 257
Other Technical Support Resources
and Facilities ..................................................................................... 258
A
A

APPENDIX
CHAPTER10:PROGRAMMING FROM .NET

Copyright StatSoft, 2011


256 STATISTICA Quick Reference

Copyright StatSoft, 2011


STATISTICAQuickReference257
GETTING MORE HELP
Electronic Manual
Themostconvenientplacetogetassistance
andaccessavastrepositoryofinformation
aboutSTATISTICAistheElectronicManual
(Help),whichcontainsmorethan100
Megabytesofreferences,illustrations,and
examples.
ToaccessHelpinSTATISTICA,selecttheHelp
tab.IntheHelpgroup,clickHelp.Youcanalso
clicktheHelp buttonintheupperright
cornerofanydialogtoaccessHelptopics
describingalltheoptionsinthatdialog.
Thishypertextdocumentoffersmuchmore
thanjustanexplanationoftheoptionsin
STATISTICA.Itincludesnumerousexamples,
overviews,andillustrations,aswellas
thousandsoftipsonhowtooptimizeyour
work.

APPENDIX
A
A

AppendixA:GettingMoreHelp

Copyright StatSoft, 2011


258STATISTICAQuickReference
TheSTATISTICAElectronicManualisextremely
comprehensive.ItoffersabuiltinStatistical
Advisor(seepage33)supplementedwiththe
completecontentsofStatSoftsawardwinning
ElectronicStatisticsTextbookandGlossary.
StatSoftsElectronicStatisticsTextbook,
locatedonthecompanyWebsite
(StatSoft.com),hasbeenrecommendedby
EncyclopediaBritannicaforitsQuality,
Accuracy,Presentation,andUsability.

Thisuniquetextbookhasbeenusedformany
yearsineducationalandresearchactivitiesat
universitiesandresearchorganizations
worldwide.
Other Technical
Support
Resources and
Facilities
Web site resources.StatSofts
Website,oneofthemost
visitedInternetaddresses
relatedtodataanalysis,offers
notonlyaccesstomany
resourcesthatareusefulfor
dataanalysisprofessionalsin
general,butitalsoincludes:
Acontinuouslyupdated
FrequentlyAskedQuestions
section,and
Adownloadareawhereusers
ofthecurrentversionof
STATISTICAproductscan
receivedownloadableupdatesoftheir
software.Weareconstantlyworkingon
increasingthecompatibilityofSTATISTICA
softwareevenwiththoseapplicationsthat
violatestandardconventions.Therefore,in
manycircumstances,downloadinganupdate
canhelpwhentheproblemthatyouare
experiencingiscausedbynonstandardsystem
configurationsorconflictswithother
applications.
E-mail technical support.Ifyourquestionis
notansweredinthelocationsmentioned,you
maysendemailtous.Pleaseincludeyour
serialnumber(inSTATISTICA,selecttheHelp
tab,andintheAboutgroup,clickSTATISTICA
toviewyourserialnumber)andinformation
aboutyourhardware[thetypeofprocessor
(CPU)andtheamountofmemory(RAM)and
diskspace]andtheversionoftheoperating
systemthatyouareusing.
AppendixA:GettingMoreHelp

Copyright StatSoft, 2011
STATISTICAQuickReference259
IfyouliveinNorthAmerica,sendyouremail
toinfo@StatSoft.com;otherwise,emailyour
localStatSoftoffice(seebelow).
Phone technical support.Youcanalsocall
yourlocalStatSoftofficetotalktoa
technician.IfyouliveinNorthAmerica,call
(918)7491119(theNorthAmericantechnical
supportofficehoursare9:00AMto5:00PM
CentralTime,MondaythroughFriday).
Ifyouliveinanotherlocation,pleasecontact
theofficethatservesyourspecificarea.To
locatethatoffice,selecttheHelptab.Inthe
Aboutgroup,clickSTATISTICAtodisplaythe
AboutSTATISTICAdialog,andthenselectthe
InternationalOfficestab.

Pleaseknowyourserialnumber(in
STATISTICA,selecttheHelptab,andinthe
Aboutgroup,clickSTATISTICAtoaccessyour
serialnumber),informationaboutyour
hardware[thetypeofprocessor(CPU)andthe
amountofmemory(RAM)anddiskspace],and
theversionoftheoperatingsystemthatyou
areusingbeforeyoucontactStatSofttechnical
supportoffices.
AppendixA:GettingMoreHelp

Copyright StatSoft, 2011


260STATISTICAQuickReference

Copyright StatSoft, 2011


STATISTICAQuickReference261

STATISTICA ENTERPRISE
SERVER
General Overview ................................................................................... 263
A Broad Choice of Analytic Facilities and Configurations................. 264
Functionality and Applications: The Advantages of
STATISTICA Enterprise Server .......................................................... 264
Advantages of Multithreading Technology ......................................... 265
STATISTICA Enterprise Server User Interface ..................................... 266
Compatibility with Industry Standards ................................................ 269
Architecture of the System (A Technical Note) .................................. 270
Competitive Advantages ....................................................................... 271
Knowledge Portal .................................................................................. 271
STATISTICA Enterprise Server Demo Movie ........................................ 271
B
B

APPENDIX
AppendixA:GettingMoreHelp

Copyright StatSoft, 2011


262STATISTICAQuickReference

Copyright StatSoft, 2011


STATISTICAQuickReference263
STATISTICA ENTERPRISE
SERVER
STATISTICAEnterpriseServerisahighlyscalable,
enterpriselevel,fullyWebenableddata
analysisanddatabasegatewayapplication
systemthatisbuiltondistributedprocessing
technologyandfullysupportsmultitierClient
Serverarchitectureconfigurations.STATISTICA
EnterpriseServerexposestheanalytic,query,
reporting,andgraphicsfunctionalityof
STATISTICAthrougheasytouse,interactive,
standardWebinterfaces.Alternatively,it
enablesusersofthedesktopversion(thick
client)tooffloadcomputationallyintensive
analyticsanddatabaseoperationstotheServer.
Itisofferedasacomplete,readytoinstall
applicationwithaninteractive,Internet
browserbased(pointandclick)userinterface
(thinclient)thatmakesitpossibleforusersto
interactivelycreatedatasets,runanalyses,and
reviewoutput.However,STATISTICAEnterprise
Serverisbuiltusingopenarchitectureand
includes.NETcompatibledevelopmentkittools
(basedentirelyonindustrystandardsyntax
conventionssuchasVBScript,C++/C#,HTML,
Java,andXML)thatenablesITdepartment
personneltocustomizeallmaincomponentsof
thesystemorexpanditbybuildingonits
foundations,forexample,byaddingnew
componentsand/orcompanyspecificanalytic
ordatabasefacilities.
Asmentioned,STATISTICAEnterpriseServeris
providedwithanInternetbrowserbaseduser
interface(intheformofsimpletonavigateand
easytousedialogs)enablingyoutospecify
analysesandreviewresults.However,toolsare
providedtocustomizethesedialogsandeasily
setupnewuserinterfacesortoaddnew
functions.Forexample,asimpledialogwith
onlythreebuttonscanbecreatedinthe
browser,andclickingeachbuttonwillruna
seriesofanalysesandgenerateadetailed
report.STATISTICAEnterpriseServer
applicationsaddanewdimensionandan
endlessarrayofpossibilitiestotheentirelineof
STATISTICADataAnalysis,DataMining,and
QualityControl/SixSigmasoftware.
ThesystemiscompatiblewithallmajorWeb
serversoftwareplatforms(e.g.,UNIXApache,
andMicrosoftIIS),worksinbothMicrosoft.NET
andSun/Javaenvironments,anddoesnot
requireanychangestotheexistingfirewalland
Internet/Intranetsecuritysystems.
B
B

APPENDIX
AppendixB:STATISTICAEnterpriseServer

Copyright StatSoft, 2011


264STATISTICAQuickReference
A Broad Choice of Analytic
Facilities and Configurations
TheSTATISTICAEnterpriseServersystemis
offeredasacompletesolutionthatincludesthe
analyticfunctionalityofanySTATISTICAproduct
oranycombinationofproducts,from
STATISTICABasetoDataMinerapplications.
TheminimuminstallationofSTATISTICA
EnterpriseServersoftwareincludestheanalytic
functionalityofSTATISTICABaseandalicense
for5concurrentusers(minimum).
Customerscaneitherorderaspecificversionof
STATISTICAEnterpriseServerincludingthe
analyticfunctionalitythattheyrequire(e.g.,
STATISTICABasefor10users),ortheycanadd
theEnterpriseServerfunctionality(asdescribed
inthissection)tosomeoralloftheseatsofthe
currentlylicensedSTATISTICAproduct(e.g.,add
theEnterpriseServerfunctionalityto20outof
50existinglicensesofSTATISTICAEnterprise).
Functionality and Applications:
The Advantages of STATISTICA
Enterprise Server
A powerful, enterprise-wide collaborative-
intelligence system.Anotherimportantwayto
takeadvantageoftheSTATISTICAEnterprise
Serverfunctionalityistouseitasthecoreand
naturalextensionofanyoftheSTATISTICA
enterprisesystems(e.g.,STATISTICADataMiner
applications).
Specifically,STATISTICAEnterpriseServercan
actasthecoreofanenterprisewidenetwork
systemthatenablestheparticipantstowork
collaborativelyandquicklyshareresults
(reports),aswellasscriptsofanalysesor
queries.Userorgrouppermissions(seethe
TechnicalNoteonpage270)canbeusedbythe
administratorstomanageaccessofspecific
groupsofuserstospecificdataorreports.The
accessibilityofitstoolsmakesSTATISTICA
EnterpriseServeraperfectsystemtofacilitate
collaborativeprojectsofemployeeswhoare
telecommutingortraveling.
Advantages of distributed processing, and
multi-tier Client-Server architecture.Users
willbenefitnotonlyfromthecollaborativework
toolsbutalsotheoptionstooffloadthe
computationallyintensiveortimeconsuming
taskstotheservercomputers.Specifically,
becausethemostpowerfulmultiprocessor
CPUs(and/orclustersofcomputers)areusually
usedasservers,userscanoffload
computationallyintensivetasks,and,for
example,runinthebackgroundqueriesthat
willscanterabytesofdataonremoteservers
andperformtimeconsuming,longsequences
ofanalysesorreports,whilekeepingtheend
userscomputerscompletelyfreetodoother
tasks.Becauseofitsdistributedprocessing
architecture,STATISTICAEnterpriseServer
scalesinahighlyefficientmannertotake
advantageofmultiprocessorCPUsand/or
multiplecomputersand,therefore,userscan
takefulladvantageofmultitierClientServer
architecture,where:
Tier1istheuserinterfaceontheclient
computer(aplainbrowserorSTATISTICA
thickclient,seeSTATISTICAClient,page265),
Tier2istheSTATISTICAEnterpriseServer
softwareandtheimplementationofthe
businessintelligencethatitmaycontain
(specificqueries,scriptsof
custom/proprietaryanalyses,etc.),and
Tier3isSTATISTICAdatabases(e.g.,
STATISTICADataWarehouse)orother
corporaterepositoriesofdata.
InthedesktopversionofSTATISTICA,all
computationsareperformedonthelocal
computer,andresourcesofothercomputers
areusedonlyinthecasewhentheInPlace
DatabaseProcessing(IDP),seepage245,
AppendixB:STATISTICAEnterpriseServer

Copyright StatSoft, 2011
STATISTICAQuickReference265
interfacetoexternaldatabasesisestablished.
IDPisatechnologythatreadsdata
asynchronouslydirectlyfromremotedatabase
servers(usingdistributedprocessingif
supportedbytheserver),andbypassesthe
needtoimportdataandcreatealocalcopyof
thedataset.Recordsofdataareretrievedand
senttotheSTATISTICAcomputer
asynchronouslybytheCPUofthedatabase
server,whileSTATISTICAsimultaneously
processesthemusingtheCPUofthelocal
computer.
WhenaClientServerversionofSTATISTICAis
used,thelocalcomputerdrivesonlytheuser
interfaceofSTATISTICA,andallcalculationsare
performedontheserver.TheClientServer
architectureoffersobviousadvantageswhen
yourprojectsarelarge(e.g.,computationally
intensiveorinvolvingprocessingofextremely
largedatasets)and,thus,whentheycanbe
offloadedtotheservers,freeingyourlocal
computertoperformotherjobs.
STATISTICA Client.Whilenocomponentsof
theSTATISTICAsystemarenecessaryonthe
clientcomputer(onlyabrowser),havingacopy
ofSTATISTICAinstalledontheclientsideadds
newpossibilities.Onecouldask,WhywouldI
wanttouseSTATISTICAEnterpriseServerifI
haveacopyofSTATISTICAinstalledonmy
laptop?TheansweristhathavingSTATISTICA
installedontheclientcomputerenablesyouto
takeadditionaladvantageofthemultitier
ClientServerarchitecture(seepage264)and
workinteractivelywithSTATISTICAinstalled
locallywhileoffloadingcertaintimeconsuming
taskstotheservermachine(s)and/orexchange
dataandoutputbetweenallthethreetiers.You
canrunSTATISTICAEnterpriseServerfrom
withindesktopSTATISTICAandflexiblycontrol
theinteractionbetweenthetwo.Avarietyof
optionsareavailabletosharetasksbetweenthe
desktopandservercomputer.

Also,whenyoureviewyourSTATISTICA
EnterpriseServeroutputinthebrowser,you
haveoptionstobringanyoralloutputobjects
toyourdesktopcomputerforfurther
processing.Forexample,aclickonasmall
buttonplacedoptionally(dependingontheuser
configuration)nexttoeveryoutputobject
(tableorgraph)senttoyourbrowserbythe
STATISTICAEnterpriseServersystemwilloffer
youtheoptiontodownloadthatobject(a
STATISTICAtableoragraph)totheclient
computerinitsnativeSTATISTICAformat(in.sta
or.stgfileformat)soyoucanworkwithit
offlineusingthelocallyinstalledSTATISTICA
tools.
Advantages of Multithreading
Technology
TheSTATISTICAEnterpriseServerplatformis
builtonadvanceddistributedprocessingand
multithreadingtechnologytosupportoptimal
managementoflargecomputationalloads.This
technologyenablesrapidprocessingofeven
verylargeandcomputationallyintensive
projects,takingfulladvantageofthemultiple
CPUsontheserver,orevenmultipleservers
workinginparallel.
Theillustrationonthenextpageshowsa
projectrunningonaquadprocessorserver,
alongwiththeserverperformancemonitor
demonstratingthefullutilizationofthe
resourcesofallfourCPUsexecutinginthe
multithreadingmodeasingle,computationally
intensiveSTATISTICADataMinerproject.
Inaddition,theSTATISTICAEnterpriseServer
architecturedeliversaplatformindependent,
Webbrowserbaseduserinterface,and
AppendixB:STATISTICAEnterpriseServer

Copyright StatSoft, 2011


266STATISTICAQuickReference
providesanultimate,largeenterpriselevel
abilitytomanageprojectsorgroupsofusers.
Ultimate scalability (parallel processing
technology).Oneoftheuniquefeaturesofthe
STATISTICAdistributedprocessingtechnologyis
thatitflexiblyscalesnotonlytotakeadvantage
ofallCPUsonthecurrentservercomputer(to
supportbothmultiplejobs/usersandalso
individual,computationallyintensiveprojects),
butitalsoscalestomultipleservercomputers
(clusters).Thisuniquefeatureisimportant,
sinceitdeliverssignificantperformancegains.
STATISTICAusestheparallelprocessing
technologyacrossseparatehardwareunits(as
somesupercomputersdo)and,therefore,if
youhave,forexample,threeserverswithfour
processorseach,STATISTICAcanrunan
individualprojectonall12processors(ifthe
scaleofthatprojectwarrantsthatmodeof
processing).
STATISTICA Enterprise Server
User Interface
WiththeSTATISTICAEnterpriseServer
implementationofSTATISTICA,userscan
interactivelyruntheprogramfromtheclient
machineina
Webbrowser
interfacethatis
similartothat
availableforthe
desktop
installation.
Therefore,the
clientsideofthe
application(the
frontend)can
berunonany
computerusing
onlyabrowser.
However,the
actual
computations
andother
operations
performedonthedatawillremainonthe
(remote)serverwithitsusuallymorepowerful
processorsandstorageresources(andtheywill
bemanagedusingtheoptimized,
multithreadinganddistributedprocessing
architectureofthesystemformaximum
performance).
Inessence,theuserinterfaceaspectsof
STATISTICAcanberunbyoneormultipleusers,
whiletheserverperformsallcomputationsand
dataoperations,enforcingthepropersecurity
andaccessprivilegesapplicabletothe
respectiveprojectsandclassesofusers,as
designedbythenetworkadministrator.
STATISTICAEnterpriseServeroffersa
straightforwarduserinterfacesupportinga
selectionofinteractivedataanalysis,data
AppendixB:STATISTICAEnterpriseServer

Copyright StatSoft, 2011
STATISTICAQuickReference267
mining,qualitycontrol,databasemanagement,
databasequery,andgraphcustomization
operations.
AfterloggingontotheSTATISTICAEnterprise
Serversystem,

youcanselectadatasource(adatasetoralive
databaseconnection),

reviewandeditthedataintheinteractive
SpreadsheetEditor,

selecttheanalysistobeperformedusingthe
standardmenusystem(orashortcutinthe
userdefinedMyMenu),

selectvariablesandspecifyoptionalanalysis
parameters,
AppendixB:STATISTICAEnterpriseServer

Copyright StatSoft, 2011


268STATISTICAQuickReference

andinteractivelyreviewtheoutput.

Avarietyofinteractivefacilitiestoperform
specialdatabase,qualitycontrol,ordatamining
operations(includinginteractivelybuildingdata
miningmodelsbydraggingarrowsinthemodel
workspace;seebelow)areprovided,andare
accessiblefromthestandardbrowser.

AppendixB:STATISTICAEnterpriseServer

Copyright StatSoft, 2011
STATISTICAQuickReference269
Inadditiontothesebuiltin,straightforward
userinterfacefacilities,STATISTICAEnterprise
Serveralsoincludesatoolkitthatenablesusers
tocustomizetheuserinterfaceanddevelop
customapplicationswithspecificallypredefined
functionality,packagedinawaythatmatches
therequirementsoftheirspecificapplications.
Compatibility with
Industry Standards
Theunsurpassedcompatibilitywithindustry
standardsisanotherinthelonglistofunique
advantagesofSTATISTICAEnterpriseServer.
STATISTICAEnterpriseServercanbedeployed
onanyofthepopularWebserverplatforms
(e.g.,aUNIXbasedApacheorIIS),and
therefore,itwillconformtotheexistinglocal
securityprotocols(firewalls)asrequiredbythe
corporateclient.
STATISTICAEnterpriseServerusesadvanced
proprietarytechnologydevelopedatStatSoftto
ensureitshighperformanceandscalability(e.g.,
multiple,multiprocessorSTATISTICAEnterprise
Servercomputersworkinginadistributed
processingenvironment).Thistechnologyis
builtonStatSoftsyearsofexperienceproviding
highperformance,scalableenterprisesystems
tomajorcorporationsintheUnitedStatesand
aroundtheworld.However,STATISTICA
EnterpriseServerisstillbasedontheindustry
standardcommunicationprotocols(e.g.,XML)
AppendixB:STATISTICAEnterpriseServer

Copyright StatSoft, 2011


270STATISTICAQuickReference
toensure1)itsplatformindependence,2)
smoothtransitiontofuturetechnologies,and3)
easeofcustomizationbytheclient.Notethat
theeaseofcustomizationisadditionally
boostedbythefactthatonlytheindustry
standardsyntaxconventions(suchasVBscript,
C++,HTML,andXML)areusedtocustomize,
configure,anddefineallthespecificanalytic
operationsandalloutputinSTATISTICA
EnterpriseServer.
Architecture of the System
(A Technical Note)
Althoughthegeneraldesignusestwo
computersinatypicalconfiguration,theWeb
server(e.g.,aUNIXbasedApachesystem)and
atleastoneSTATISTICAEnterpriseServer
(optionallyscalabletomultipleSTATISTICA
EnterpriseServers),
inmanycases,STATISTICAEnterpriseServer
couldbeinstalledonthesamemachineif
desired(whenIISisusedastheWebhost):
Thedesignallowsforaflexible,genericWeb
serverimplementationbyusingastandard
scriptinglanguageontheWebserver.The
purposeoftheWebserveristopackage
requestsfromtheuser(receivedfroma
browser),sendthesetotheSTATISTICA
EnterpriseServer,andthenprocessresponses
fromtheSTATISTICAEnterpriseServerfor
displaytotheusers(ontheirbrowsers).
CommunicationbetweentheWebserverand
theSTATISTICAEnterpriseServeris
accomplishedthroughtechnologybasedonthe
industrystandardXMLconventions.Thesystem
isfullycustomizable,andforcustomerswho
wanttodeveloptheirownmodificationsor
extensionsofthis(readytodeploy)system,it
providesdevelopmenttoolkitfacilitiesallowing
modificationofallaspectsofboththescripts
thatarebeingexecutedbySTATISTICA(onthe
STATISTICA
Enterprise
Serverside)and
theappearance
oftheuser
interface
exposedtothe
enduserson
the(browser
based)thin
clientside.Onlythemoststandard,commonly
knowntools(suchasVBorXML/HTML)are
usedtocustomizeorexpandthesystem.
TheactualWeb
pagedefinitions
andSTATISTICA
scriptstobe
executedare
storedina
designated
Repository
Facilityonthe
STATISTICA
AppendixB:STATISTICAEnterpriseServer

Copyright StatSoft, 2011
STATISTICAQuickReference271
EnterpriseServer,andtheyaremanagedina
queuelikefashion.Thesystemalsoincludesa
highlyoptimizedDistributedProcessing
Managerthathandlestheincomingprocessing
loadanddistributesitoptimallyovermultiple
threadsofSTATISTICAandmultipleSTATISTICA
EnterpriseServercomputers.
TheSTATISTICAEnterpriseServersoftware
systemalsoincludestheSTATISTICAVisualBasic
WebExtensions.TheseextensionstotheSVB
languageenablethescriptwritertoeitherlet
thesystemdisplaytheresultinggraphsand
spreadsheetsontheautomaticallygenerated
(output)Webpages,orcustomizethe
appearanceofthegeneratedoutputpagesby
addingHTMLdirectivesasappropriate.
Securityandauthenticationisakeydesign
featureintheSTATISTICAEnterpriseServer
applicationsystem.Atthebeginningofthe
session,userssignontothesystemwiththeir
usernameandpassword.System
administratorsareabletocontrolaccesstodata
sourcesandscriptsbasedeitheronuseror
grouppermissions.Thehighestlevelofthe
accessprivilegeallowsadvancedusers(or
administrators)toexecutevirtuallyarbitrary
scripts(e.g.,inordertoperformsystem
administrationormaintenanceoperations).This
levelrequiresadesignated(highest)access
privilegebecause,duetothegeneralnature
andpoweroftheSTATISTICAVisualBasic
language,itgivesaccess(totheauthorized
users)toallresourcesonthenetwork.
Notethatthissystemcanbeintegratedwith
thetraditional(i.e.,nonWebbased)
STATISTICAconcurrentnetworkoraSTATISTICA
enterprisesystemauthenticationscheme.
Competitive Advantages
ThecompetitiveadvantagesofSTATISTICA
EnterpriseServerapplicationsstartwiththe
completelistofuniquefeaturesofSTATISTICA
itself.Further,unlikethecompetingproducts,
weofferacompleteapplication(asolution)
withaWebbaseduserinterfaceandnotmerely
adevelopmentkit(althoughthedevelopment
kitfacilitiesarealsoavailabletoextendor
customizethesystem).Also,wedonotrequire
thataspecificWebserversoftwarebeinstalled
first(whichmayormaynotcomplywiththe
clientssecuritystandardsandotherpolicies).
Finally,oursystemiscontrolledbyindustry
standardVBscripts,C++,HTML,andXMLthat
canbeeasilymodifiedbyusersorsystem
administrators.Inaddition,ourdistributed
processingandmultithreadingtechnology
deliversperformanceandsystem
responsivenessthatisnotmatchedbyany
competingproducts.
Knowledge Portal
AdesignatedKnowledgePortalapplicationis
optionallyavailablethatenablesusersto
effectivelyandsecurelydistributeorganized
setsofoutputdocumentsovertheWeb.It
offerssupportforworkgroupsofusers(each
withdifferentaccessprivileges,andthusaccess
todifferentpartsofthedatabaseofoutput
documents),intuitivetreevieworganizationof
availablematerials,andoptionstobroadcast
documentsupdatedontheWebserverin
realtime.
STATISTICA Enterprise Server
Demo Movie
HowdoesSTATISTICAEnterpriseServerwork?
VisitStatSoftsWebsite,www.StatSoft.com,to
viewaninformativepresentationoftheunique
featuresofSTATISTICAdescribedhere.The
moviealsoincludesastepbystepexample
application.
AppendixB:STATISTICAEnterpriseServer

Copyright StatSoft, 2011


272STATISTICAQuickReference

STATISTICA FAMILY
OF PRODUCTS
General Purpose/Desktop Products .................................................... 275
STATISTICA Base .............................................................................. 275
STATISTICA Advanced Linear/Nonlinear Models ........................... 275
STATISTICA Multivariate Exploratory Techniques ......................... 276
STATISTICA Variance Estimation and Precision ............................ 276
STATISTICA Automated Neural Networks (SANN) ......................... 276
STATISTICA Power Analysis ............................................................. 276
Industrial Solutions, Six Sigma Tools .................................................. 276
STATISTICA Quality Control Charts ................................................. 276
STATISTICA Process Analysis .......................................................... 277
STATISTICA Design of Experiments ................................................. 277
STATISTICA Multivariate Statistical Process Control (MSPC) ...... 277
continued


C
C

APPENDIX


STATISTICA Enterprise Systems ........................................................... 278
STATISTICA Data Miner .................................................................... 278
STATISTICA Process Optimization .................................................. 278
STATISTICA Text Miner ..................................................................... 278
STATISTICA Sequence, Association and Link Analysis (SAL) ....... 279
STATISTICA Enterprise ..................................................................... 279
STATISTICA Enterprise/QC .............................................................. 279
STATISTICA Monitoring and Alerting Server (MAS) ....................... 280
STATISTICA ETL (Extract, Transform, and Load) ............................ 280
STATISTICA MultiStream ................................................................. 280
STATISTICA Enterprise Server ......................................................... 281
Scoring Solutions .................................................................................. 281
STATISTICA Live Score ..................................................................... 281
STATISTICA Credit Scoring .............................................................. 281
STATISTICA Scorecard .................................................................... 282
Data and Document Management ...................................................... 282
STATISTICA Document Management System (SDMS) .................. 282
STATISTICA PI Connector................................................................. 283
STATISTICA Data Warehouse .......................................................... 283
Vertical Market Applications ............................................................... 286
PROCEED ........................................................................................... 286
STATISTICA PowerSolutions ............................................................ 287

Copyright StatSoft, 2011


STATISTICAQuickReference275
STATISTICA FAMILY
OF PRODUCTS
Common system features.Inadditionto
comprehensive,leadingedgeanalytics,
STATISTICAproductsofferaselectionoffully
customizableuserinterfaces(withsimplified
shortcuttemplatesfornovices),flexible,
presentationqualityoutputmanagement
(includingavarietyofreportformats,suchas
.pdf,Word,.rtf,.html,andoutputtoWeb
portals),fullOLE/ActiveXsupport,andWeb
enablement.
Also,allproductsincludedatamanagement
optimizedtohandlelargedatasets,interactive
databasequerytools,andawideselectionof
dataimport/exportfacilities.STATISTICA
productscanmanagedatasetsofpractically
unlimitedsizeandofferquadrupleprecision
calculations;theysupportmultipleinputfiles,
multipleinstances,andmultitasking.Abroad
selectionofinteractivevisualizationand
graphics/drawingtoolsofthehighestqualityis
fullyintegratedintoeachproduct,andeach
includesacompletesetofautomationoptions
andaprofessionalVisualBasicand.NET
compatibledevelopmentenvironmentwith
morethan14,000externallyaccessible
functions.

GENERAL-PURPOSE
DESKTOP PRODUCTS
STATISTICA Base.Offersa
comprehensivesetofessential
statisticsinauserfriendlypackageandallthe
performance,power,andeaseofuseofthe
STATISTICAtechnology.
AllSTATISTICAgraphicstools
BasicStatistics,Breakdowns,andTables
DistributionFitting
MultipleLinearRegression
AnalysisofVariance
Nonparametrics,andmore
STATISTICA Advanced
Linear/Nonlinear Models.Offers
awidearrayofthemostadvancedmodeling
andforecastingtoolsonthemarket,including
automaticmodelselectionfacilitiesand
extensiveinteractivevisualizationtools.
GeneralLinearModels
GeneralizedLinear/NonlinearModels
GeneralRegressionModels
GeneralPartialLeastSquaresModels
NIPALSAlgorithm(PCA/PLS)
VarianceComponents
SurvivalAnalysis
CoxProportionalHazardsModels
APPENDIX
C
C

AppendixC:FamilyofProducts

Copyright StatSoft, 2011


276STATISTICAQuickReference
NonlinearEstimation
FixedNonlinearRegression
LogLinearAnalysisofFrequencyTables
TimeSeries/Forecasting
StructuralEquationModeling,andmore
STATISTICA Multivariate Exploratory
Techniques.Offersabroadselection
ofexploratorytechniquesforvarioustypesof
data,withextensive,interactivevisualization
tools.
ClusterAnalysis
FactorAnalysis
PrincipalComponents/ClassificationAnalysis
CanonicalAnalysis
DiscriminantAnalysis
GeneralDiscriminantAnalysisModels
Reliability/ItemAnalysis
ClassificationTrees
CorrespondenceAnalysis
MultidimensionalScaling,andmore
STATISTICA Variance Estimation and
Precision.Acomprehensivesetof
techniquesforanalyzingdatafromexperi
mentsthatincludebothfixedandrandom
effectsusingREML(RestrictedMaximum
LikelihoodEstimation).WithSTATISTICA
VarianceEstimationandPrecision,youcan
obtainestimatesofvariancecomponentsand
usethemtomakeprecisionstatementswhile
atthesametimecomparingfixedeffectsin
thepresenceofmultiplesourcesofvariation.
Variabilityplots
Multipleplotlayoutstoallowdirect
comparisonofmultipledependentvariables
Expectedmeansquaresandvariance
componentswithconfidenceintervals
Flexiblehandlingofmultipledependent
variables:analyzeseveralvariableswiththe
sameordifferentdesignsatonce
Graphdisplaysofvariancecomponents
STATISTICA Automated Neural
Networks (SANN).Containsthemost
comprehensiveneuralnetworkalgorithmsand
trainingmethods.
Automaticsearchforbestarchitectureand
networksolutions
MultilayerPerceptrons
RadialBasisFunctionNetworks
SelfOrganizingFeatureMaps
TimeSeriesNeuralNetworksforboth
RegressionandClassificationproblems
Avarietyofalgorithmsforfastandefficient
trainingofNeuralNetworkModelsincluding
GradientDescent,ConjugateGradient,and
BFGS
Numerousanalyticalgraphstoaidin
generatingresultsanddrawingconclusions
Samplingofdataintosubsetsforoptimizing
networkperformanceandenhancingthe
generalizationability
SensitivityAnalysis,LiftCharts,andROC
Curves
CreationofEnsemblesoutofalreadyexisting
standalonenetworks
C/C++/C#,PMML(PredictiveModelMarkup
Language),Java,STATISTICAEnterprise,and
SASNeuralNetworkCodeGeneratorsthat
areeasytodeploy
STATISTICA Power Analysis.An
extremelypreciseanduserfriendly
specializedtoolforanalyzingallaspectsof
statisticalpowerandsamplesizecalculation.
SampleSizeCalculation
ConfidenceIntervalEstimation
StatisticalDistributionCalculators,andmore
INDUSTRIAL SOLUTIONS,
SIX SIGMA TOOLS
STATISTICA Quality Control Charts.
Offersfullycustomizable(e.g.,callable
AppendixC:FamilyofProducts

Copyright StatSoft, 2011
STATISTICAQuickReference277
fromotherenvironments),easyandquickto
use,versatilechartswithaselectionof
automationoptionsanduserinterface
shortcutstosimplifyroutinework(a
comprehensivetoolforSixSigmamethods).
MultipleChart(SixSigmaStyle)Reportsand
displays
XbarandRCharts;XbarandSCharts;N
p
,
P,U,CCharts
ParetoCharts
ProcessCapabilityandPerformanceIndices
MovingAverage/RangeCharts,EWMA
Charts
ShortRunCharts(includingNominaland
Target)
CuSum(CumulativeSum)Charts
RunsTests
Interactive
Causesandactions,customizablealarms,
analyticbrushing,andmore
STATISTICA Process Analysis.A
comprehensivepackageforProcess
Capability,GageR&R,andotherquality
control/improvementapplications(a
comprehensivetoolforSixSigmamethods).
Process/CapabilityAnalysisCharts
Ishikawa(CauseandEffect)Diagrams
GageRepeatability&Reproducibility
VarianceComponentsforRandomEffects
WeibullAnalysis
Samplingplans,andmore
STATISTICA Design of Experiments.
FeaturesthelargestselectionofDOE
andrelatedvisualizationtechniquesincluding
interactivedesirabilityprofilers(a
comprehensivetoolforSixSigmamethods).
FractionalFactorialDesigns
MixtureDesigns
LatinSquares
SearchforOptimal2
(kp)
Designs
ResidualAnalysisandTransformations
Optimizationofsingle/multipleresponse
variables
CentralCompositeDesigns
TaguchiDesigns
MinimumAberration&Maximum
Unconfounding
2
(kp)
FractionalFactorialDesignswithBlocks
ConstrainedSurfaces
DandAOptimalDesigns
Desirabilityprofilers,andmore
STATISTICA Multivariate Statistical
Process Control (MSPC).Acomplete
solutionformultivariatestatisticalprocess
control,deployedwithinascalable,secure
analyticssoftwareplatform.
Univariateandmultivariatestatistical
methodsforqualitycontrol,predictive
modeling,anddatareduction
Functionstodeterminethemostcritical
process,rawmaterials,andenvironment
factorsandtheiroptimalsettingsfor
deliveringproductsofthehighestquality
Monitoringofprocesscharacteristics
interactivelyorautomaticallyduring
productionstages
Building,evaluating,anddeploying
predictivemodelsbasedontheknown
outcomesfromhistoricaldata
Historicalanalysis,dataexploration,data
visualization,predictivemodelbuildingand
evaluation,modeldeploymentto
monitoringserver
Interactivemonitoringwithdashboard
summarydisplaysandautomaticupdating
results
Automatedmonitoringwithrules,alarm
events,andconfigurableactions
MultivariatetechniquesincludingPartial
LeastSquares,PrincipalComponents,Neural
Networks,RecursivePartitioning(Tree)
AppendixC:FamilyofProducts

Copyright StatSoft, 2011


278STATISTICAQuickReference
Methods,SupportVectorMachines,
IndependentComponentsAnalysis,Cluster
Analysis,andmore
STATISTICA ENTERPRISE SYSTEMS
Inadditiontothecommonfeatures,
STATISTICAEnterpriseSystemsoptionallyoffer
awideselectionoftoolsforcollaborative
work,Webbrowserbaseduserinterfaces
(usingSTATISTICAEnterpriseServer),
specializeddatabases,andahighlyoptimized
interfacetoenterprisewidedatarepositories,
includingoptionstorapidlyprocesslargedata
setsfromremoteserversinplace,without
creatinglocalcopies.Deploymentandonsite
trainingservicesareavailable.
STATISTICA Data Miner.Themost
comprehensiveselectionofdata
miningsolutionsonthemarket,withanicon
based,extremelyeasytouseuserinterface
(optionallyWebbrowserbasedviaSTATISTICA
EnterpriseServer,seepage281)anda
deploymentengine.Itfeaturesaselectionof
completelyintegratedandautomated,ready
todeployasis(butalsoeasilycustomizable)
systemsofspecificdataminingsolutionsfora
widevarietyofbusinessapplications.A
designatedSPCversion(QCDataMiner)to
mine/analyzelargestreamsofQCdataisalso
available.Thedataminingsolutionsaredriven
bypowerfulproceduresfromfivemodules:
GeneralSlicer/DicerExplorer(withoptional
OLAP)
GeneralClassifier
GeneralModeler/MultivariateExplorer
GeneralForecaster
GeneralNeuralNetworksExplorer,and
more
STATISTICA Process Optimization.
AnaddontoDataMiner,STATISTICA
ProcessOptimizationisapowerfulsoftware
solutiondesignedtomonitorprocessesand
identifyandanticipateproblemsrelatedto
qualitycontrolandimprovementwith
unmatchedsensitivityandeffectiveness.
ProcessOptimizationintegratesallQuality
ControlCharts,ProcessCapabilityanalyses,
ExperimentalDesignprocedures,andSixSigma
methodswithacomprehensivelibraryof
cuttingedgetechniquesforexploratoryand
predictivedatamining.
PredictQCproblemswithcuttingedgedata
miningmethods
Discoverrootcausesofproblemareas
MonitorandimproveROI(ReturnOn
Investment)
Generatesuggestionsforimprovement
Monitorprocessesinrealtimeoverthe
Web
CreateanddeployQC/SPCsolutionsover
theWeb
Usemultithreadinganddistributed
processingtorapidlyprocessextremely
largestreamsofdata
STATISTICA Text Miner.Apowerful
softwaresolutionfortextmining,
documentretrieval,andminingof
unstructureddata.Anoptionaladdonproduct
forSTATISTICADataMiner,designedand
optimizedforaccessingandanalyzing
documents(unstructuredinformation)ina
varietyofformats:.txt(text),.pdf(Adobe),.ps
(PostScript
TM
),.html,.xml(Webformats),and
mostMicrosoftOfficeformats(e.g.,.doc,.rtf);
optimizedaccesstoWebpages(URL
addresses)isalsoprovided.
Efficientlyindexverylargecollectionsoftext
documents;identifykeytermsand
similaritiesbetweendocumentsandterms,
andextracttheinformationrelevanttoyour
missionandgoals
AppendixC:FamilyofProducts

Copyright StatSoft, 2011
STATISTICAQuickReference279
Applystublists(wordstoignore)and
languagespecificstemmingalgorithms
(variouslanguagesaresupported)
Includesnumerousoptionsforconverting
documentsintonumericinformationfor
furtherprocessing(e.g.,mapping,clustering,
predictivedatamining,classificationof
documents,etc.)
Fullsupportformultithreadedoperationon
multiprocessorserverinstallationsfor
extremelyfastindexingandsearchingof
hugedocumentrepositories
Canalsobeusedtoindex,analyze,andmine
otherunstructuredinput,suchassoundor
imagefiles(afterdomainspecificpre
processingisapplied)
FullyintegratedintotheSTATISTICAand
STATISTICAEnterpriseServersystems,
hence,thelargenumberofavailable
methodsforsupervisedandunsupervised
learning(clustering),mapping,data
visualization,etc.,aredirectlyand
immediatelyavailable;manyofthe
algorithmsavailableinSTATISTICAData
Miner,suchasthemachinelearning
algorithms(kNearestNeighbor,Naive
Bayesclassifiers,advancedSupportVector
MachinesandKernelclassifiers),are
particularlywellsuitedfortextminingorthe
analysisofotherunstructuredinformation
STATISTICA Sequence, Association
and Link Analysis (SAL).Designedto
addresstheneedsofclientsinretailing,
banking,insurance,etc.,industriesby
implementingthefastestknownhighly
scalablealgorithmwiththeabilitytodrive
AssociationandSequencerulesinonesingle
analysis.Theprogramrepresentsastand
alonemodulethatcanbeusedforbothmodel
buildinganddeployment.Alltoolsin
STATISTICADataMinercanbequicklyand
effortlesslyleveragedtoanalyzeanddrill
intoresultsgeneratedviaSTATISTICASAL.
Usesatreebuildingtechniquetoextract
AssociationandSequencerulesfromdata
Usesefficientandthreadsafelocal
relationaldatabasetechnologytostore
AssociationandSequencemodels
Handlesmultipleresponse,multiple
dichotomy,andcontinuousvariablesinone
analysis
PerformsSequenceAnalysiswhilemining
forAssociationrulesinasingleanalysis
SimultaneouslyextractsAssociationand
Sequencerulesformorethanone
dimension
Giventheabilitytoperform
multidimensionalAssociationandSequence
miningandthecapacitytoextractonlyrules
forspecificitems,theprogramcanbeused
forPredictiveDataMining
PerformsHierarchicalSingleLinkageCluster
Analysis,whichcandetectthemorelikely
clusterofitemsthatcanoccur.Thishas
extremelyuseful,practicalrealworld
applications,e.g.,inretailing.
STATISTICA Enterprise.Anintegrated
multiusersystemdesignedforgeneral
purposedataanalysisandbusinessintelligence
applicationsinresearch.STATISTICAEnterprise
canoptionallyofferthestatisticalfunctionality
availableinanyorallSTATISTICAproducts.
Integrationwithdatawarehouses
Intuitivequeryandfilteringtools
Easytouseadministrationtools
Automaticreportdistribution
Alarmnotification,andmore
STATISTICA Enterprise/QC.Designed
forlocalandglobalenterprisequality
controlandimprovementapplications
includingSixSigma.STATISTICAEnterprise/QC
AppendixC:FamilyofProducts

Copyright StatSoft, 2011


280STATISTICAQuickReference
offersahighperformancedatabase(oran
optimizedinterfacetoexistingdatabases),
realtimemonitoringandalarmnotificationfor
theproductionfloor,acomprehensivesetof
analyticaltoolsforengineers,sophisticated
reportingfeaturesformanagement,SixSigma
reportingoptions,andmuchmore.
Webenableduserinterfaceandreporting
tools;interactivequeryingtools
Userspecificinterfacesforoperators,
engineers,etc.
Groupwarefunctionalityforsharingqueries,
specialapplications,etc.
Openendedalarmnotificationincluding
cause/actionprompts
Scalable,customizable,andcanbe
integratedintoexistingdatabase/ERP
systems,andmore
STATISTICA Monitoring and Alerting
Server (MAS). Asystemthatenables
userstoautomatethecontinualmonitoringof
hundredsorthousandsofcriticalprocessand
productparameters.Theongoingmonitoring
isanautomatedandefficientmethodfor:
Monitoringmanycriticalparameters
simultaneously
Providingstatussnapshotsfromthe
resultsofthesemonitoringactivitiesto
personnelbasedontheirresponsibilities
DashboardsassociatedwithUsers/Groups
STATISTICA ETL (Extract, Transform,
and Load).Providesoptionsto
simplifyandfacilitateaccessto,aggregation,
andalignmentofdatafrommultipledatabases
whensomeofthedatabasescontainprocess
data(usingtheoptionalPIConnector)while
otherscontainstaticdata(e.g.,fromOracle
orMSSQLServer).Providesforadhoc
queryingandaligningofdataforsubsequent
analysessuchasadhoccharting,etc.,ofdata
describingaspecifictimeinterval.
TimeindexedSTATISTICAETLaggregates
datafrommultipledatasourcesbasedona
date/timestampvariable.Datamaybe
alignedbyminute,hour,day,week,month,
quarter,oryear.
IDBasedSTATISTICAETLaggregatesdata
frommultipledatasourcesbasedonan
identifiervariable(eithernumberortext)
andanoptionaltimevariable.Ifatime
variableisdefined,datamaybeoptionally
alignedbyNequalintervalsorNuser
specifiedintervals.
STATISTICA MultiStream.Asolution
packageforidentifyingand
implementingeffectivestrategiesforadvanced
multivariateprocessmonitoringandcontrol.
STATISTICAMultiStream

wasdesignedfor
processindustriesingeneral,butisparticularly
wellsuitedtohelppowergenerationfacilities
leveragetheirdata(collectedintoexisting
specializedprocessdatabasesformultivariate
andpredictiveprocesscontrol)foractionable
advisorysystems.
STATISTICAMultiStreamisacomplete
enterprisesystembuiltonarobust,advanced
clientserver(andfullyWebenabled)
architecture,offerscentraladministrationand
managementofdeploymentofmodels,aswell
ascuttingedgerootcauseanalysisand
predictivedataminingtechnology,andits
analyticsareseamlesslyintegratedwitha
builtindocumentmanagementsystem.
Automated(nonlinear)rootcauseanalysis
andfeatureselectionforthousandsof
parameterstoclearlyidentifywhichones
arethemostlikelyresponsibleforprocess
problems
Automatedandinteractivecommonality
analysistoidentifyparametersand
processesthatshiftedormovedfrom
AppendixC:FamilyofProducts

Copyright StatSoft, 2011
STATISTICAQuickReference281
normaloperationsduringparticulartime
intervals
Advancedlinearandnonlinear(e.g.,SVM,
RecursivePartitioning,NeuralNets)models
forcreatingsensitivemultivariatecontrol
schemesandworkflowstoidentify
multivariateshiftsanddriftsearly,before
theycauseproblems
Advanceddataminingalgorithmsfor
predictingandoptimizingkeyperformance
andqualityindicators
Trackshundredsofdatastreams
simultaneously
Deliverssimplesummariesrelevantto
criticalprocessparametersandoutcomes
viaefficientandsimpledashboardsand
drilldownworkflows
Deliversstandardandcustomizedanalytic
workflowsforrootcauseanalysis,
leveragingcuttingedgedataanalysisand
dataminingtechnologies
Warnsof(predicted)problemsand
equipmentfailuresbeforetheyoccur
(predictivealarming),thusavoidingcostly
shutdownsandunscheduledmaintenance
Watcheseverythingthatimpactsyour
processperformanceinrealtime
STATISTICA Enterprise Server.The
ultimateenterprisesystemthatoffers
fullWebenablement,includingtheabilityto
runSTATISTICAinteractivelyorinbatchfroma
Webbrowseronanycomputer(including
Linux,UNIX)andoffloadtimeconsumingtasks
totheservers(usingdistributedprocessing).
UsesmultitierClientServerarchitecture,
supportingmultithreadinganddistributed/
parallelprocessingthatscalestomultiple
servercomputers.
SCORING SOLUTIONS
STATISTICA Live Score.STATISTICA
EnterpriseServersoftwarewithinthe
STATISTICADataAnalysisandDataMining
Platform.Dataareaggregatedandcleaned
andmodelsaretrainedandvalidatedusingthe
STATISTICADataMinersoftware.Oncethe
modelsarevalidated,theyaredeployedtothe
STATISTICALiveScoreserver.STATISTICALive
Scoreprovidesmultithreaded,efficient,and
platformindependentscoringofdatafrom
lineofbusinessapplications.Someexamples
oftheuseofSTATISTICALiveScore:
Providescreditscorecardstocustomer
serviceapplications(e.g.,callcentersystems
andWebbasedapplications)
Enablescustomersegmentation,up
sell/crosssell,andcustomerchurn
identificationtocustomerserviceand
marketingrepresentatives
Providesproactivefrauddetectionalertsto
analysts
STATISTICA Credit Scoring.The
solutionforanycompanytobuildin
housemodelsforitsvariouscreditproducts
anddecisionmaking.STATISTICACreditScoring
coversallaspectsofthecreditscoringneeds
foryourcompany.
In-house model building.TheSTATISTICA
CreditScoringsoftwaresolutionenablesthe
developmentandevaluationofpredictive
modelstoevaluateandassignariskto
applicationsforcredit,eitherforarequest
foranewaccountorforrequestedchanges
(e.g.,balanceincrease)tothetermsofan
existingcreditaccount.
Scoring applications.STATISTICALive
Scoreenablescompaniestoscorecredit
applications;itcanbeeasilyintegratedwith
yourexistingcustomerservicesystems,self
serviceWebsitesforcustomers,etc.
AppendixC:FamilyofProducts

Copyright StatSoft, 2011


282STATISTICAQuickReference
Evaluate performance.STATISTICACredit
Scoringprovidesbuiltinmonitoringand
evaluationoftheongoingperformanceof
themodelstoenabletheevaluationof
outcomesandkeymetricsandtomake
decisionsaboutwhenmodelsmayneedto
beupdated.
WhatmakestheSTATISTICACreditScoring
solutionunique?
The Approach.STATISTICACreditScoring
includesbothtraditionalmethodsfor
developingcreditscoringmodels(suchas
scorecardsbasedonlogisticregression)as
wellasmoreadvancedmethodsfor
predictivemodelingthatoftenprovide
betteraccuracy,whichtranslatesinto
decreasedrisk,increasedapprovalrates,
andincreasedprofits.STATISTICACredit
ScoringincludesSTATISTICAScorecard,a
dedicatedsolutionfordevelopment,
evaluating,andmonitoringscorecards
includingstepsforFeatureSelection,
AttributeBuilding,ScorecardBuilding,
CutoffPointSelection,RejectInference,and
PopulationStability.
Real-time Scoring.STATISTICACredit
ScoringincludesSTATISTICALiveScore,the
solutionforenablingscoringdecisions
directlyfromcustomerapplicationsvia
CustomerServiceAgents,Websites,and
otherlineofbusinesssystems.
Sources of Data.Unlikegenericscorecards,
STATISTICACreditScoringcanbetailoredto
meetyourspecificneeds.Forexample,it
providestheflexibilitytoincludevarious
datasourcessuchasbehaviorscoring,
utilizingthetransactionalrecordofthe
accounttoinformrecommendationsfor
creditlineincreases,incentives,crosssellor
upsell,orotherchangesinterms.
Flexibility and Capabilities.STATISTICA
CreditScoringisspecifictobuildingcredit
scoringmodels,butthesameapproaches
andtechniquescanalsobeappliedto
modelingcustomerchurn,increasingthe
abilitytodetectfraud,responsemodeling
formarketingcampaigns,andother
applicationswithinyourcompany.
STATISTICA Scorecard.STATISTICA
Scorecardisadedicatedsolutionfor
developing,evaluating,andmonitoring
scorecards,includingstepsforFeature
Selection,AttributeBuilding,Scorecard
Building,CutoffPointSelection,Reject
Inference,andPopulationStability.
DATA AND DOCUMENT
MANAGEMENT
STATISTICA Document Management
System (SDMS).Acomplete,highly
scalable,databasesolutionpackagefor
managingelectronicdocuments.Withthe
STATISTICADocumentManagementSystem,
youcanquickly,efficiently,andsecurely
managedocumentsofanytype[e.g.,find
them,accessthem,searchforcontent,review,
organize,edit(withtrailloggingand
versioning),approve,etc.].
Extremelytransparentandeasytouse
Flexible,customizable(optionally
browser/Webenabled)userinterface
Electronicsignatures
Comprehensiveauditingtrails,approvals
Optimizedsearches
Documentcomparisontools
Security
SatisfiestheFDA21CFRPart11
requirements
SatisfiesISO9000(9001,14001)
documentationrequirements
Unlimitedscalability(fromdesktopor
networkClientServerversions,tothe
AppendixC:FamilyofProducts

Copyright StatSoft, 2011
STATISTICAQuickReference283
ultimatesize,Webbasedworldwide
systems)
Openarchitectureandcompatibilitywith
industrystandards
STATISTICA PI Connector. Allowsfor
directintegrationtodatastoredinthe
PIdatahistorian.TheSTATISTICAPIConnector
utilizesthePIuseraccesscontrolandsecurity
model,allowsforinteractivebrowsingoftags,
andtakesadvantagesofdedicatedPI
functionalityforinterpolationandsnapshot
data.STATISTICAintegratedwiththePIsystem
isbeingusedforstreamlinedandautomated
analysesforapplicationssuchasProcess
AnalyticalTechnology(PAT)inFDAregulated
industries,AdvancedProcessControl(APC)
systemsinChemicalandPetrochemical
industries,andadvisorysystemsforprocess
optimizationandcomplianceintheEnergy
Utilityindustry.
STATISTICA Data Warehouse.A
complete,powerful,scalable,and
customizableintelligentdatawarehouse
solution,whichalsooptionallyoffersthemost
completeanalyticfunctionalityavailableon
themarket,fullyintegratedintothesystem.
STATISTICADataWarehouseconsistsofasuite
ofpowerful,flexiblecomponentapplications,
including:
STATISTICADataWarehouseServer
Database
STATISTICADataWarehouseQuery
(featuringSTATISTICAEnterpriseServer
Query)
STATISTICADataWarehouseAnalyzer
(featuringSTATISTICAEnterpriseServerData
Miner,STATISTICAEnterpriseServerText
Miner,STATISTICAEnterpriseServerProcess
Optimization,orthecompletesetof
STATISTICAEnterpriseServeranalytics)
STATISTICADataWarehouseReporter
(featuringSTATISTICAKnowledgePortal
and/orSTATISTICAEnterpriseServer
InteractiveKnowledgePortal)
STATISTICADataWarehouseDocument
Repository(featuringSTATISTICAEnterprise
ServerDocumentManagementSystem)
STATISTICADataWarehouseScheduler
STATISTICADataWarehouseRealTime
MonitorandReporter(featuringSTATISTICA
EnterpriseServerorSTATISTICA
Enterprise/QCServer)
Ifyouarenewtodatawarehousing,StatSoft
consultantswillguideyoustepbystepthrough
theentireprocessofdesigningtheoptimal
datawarehousearchitecturefroma
comprehensivereviewofyourinformation
storageandextraction/analysisneeds,tothe
finaltrainingofyouremployeesandsupportof
yourdailyoperations.
Crucial features and benefits.Thecrucial
featuresandbenefitsofSTATISTICAData
Warehousesolutionsinclude,amongmany
others:
Completedatawarehousingapplication
tailoredtoyourbusiness
Platformindependentarchitecturefor
seamlessintegrationwithyourexisting
infrastructure
Facilitiestointegratedatafromawide
varietyofsources
Virtuallyunlimitedscalability
Optionstoupdate/synchronizedatafrom
multiplesourcesviaautomaticschedulersor
ondemand
CompletelyWebenabledsystem
architecturetoprovideultimateenterprise
functionalityforallcompanylocations
aroundtheworld(e.g.,accessviaWeb
browsersfromanylocation)
AppendixC:FamilyofProducts

Copyright StatSoft, 2011


284STATISTICAQuickReference
Advancedsecuritymodelandauthentication
ofusers
Completedocumentmanagementoptions
tooptimizemanagementofdocumentsof
anytypesandsatisfyregulatory
requirements(e.g.,FDA21CFRPart11,ISO
9000)
Advancedanalyticcomponentsto
clean/verifydataandtointegrate
automateddatamining,artificial
intelligence,andrealtimeprocess
monitoring
Optionstoautomaticallyrunandposton
KnowledgePortals(orbroadcast)highly
customizedreports,includinginteractive
(i.e.,drillable,sliceable,anduser
customizable)reportsandresultsof
advancedanalytics
Backupandarchivingoptions
Programmable,customizable,and
expandabletoadapttospecificmission
profiles(openarchitecture,exposedto
extensionsusingthemostindustrystandard
languages,suchasVB,C++,Java,HTML)
Builtonrobust,welltested,highlyscalable,
cuttingedgetechnologytoleverageyour
investment[includinghighlyoptimizedin
placedatabaseprocessing(IDP)technology,
truemultithreading,distributed/parallel
processing,andsupportforpoolingCPU
resourcesofmultipleserverstodeliver
supercomputerlikeperformance]
STATISTICADataWarehouseisacomplete
intelligentdatastorageandinformation
delivery/distributionsolutionthatenablesyou
tocustomizetheflowofinformationthrough
yourorganizationandprovideallauthorized
membersofyourorganizationwithflexible,
secure,andrapidaccesstocriticalinformation
andintelligentreporting.
Thesystemisvirtuallyplatformindependent
andwillfitintoanyexistingdatabase
architectureandhardwareenvironment.Itwill
efficientlycombineinformationfrommultiple
databaseformatsandsources(frommanual
dataentryformstolargebatteriesof
automaticdatacollectiondevices).Thesystem
canbefurtherenhancedthroughintegration
withotherfullycompatiblecomponentsofthe
STATISTICAlineofapplicationsandsolutions;
tonamejustafew:
STATISTICADataMinerforadvanceddata
miningandartificialintelligence(e.g.,neural
networks)basedsolutionstoprovide
decisionsupportthroughcuttingedge
methodsforknowledgeextractionand
prediction
QualityControlMinerandEnterprise/QCfor
tightintegrationwithqualitycontrol,
processcontrol,andyieldmanagement
activities
STATISTICATextMinerforautomatic
processingofunstructuredinformationin
documents,databases,orWebdirectories
(WebcrawlingofURLs)
STATISTICAKnowledgePortalforpresenting
summaryreports,charts,andactionitems
toendusers(management,salesforce,
engineers,etc.)throughsecureaccess
portalsviatheWeb;todeliverkey
intelligenceanddecisionsupportto
stakeholdersworldwide
Architecture and connectivity.STATISTICA
DataWarehouseconnectstoanyplatform,
database,ordatasource,andwillscaleto
businessesandapplicationsofanysize.The
programisbuiltonadatabaseanddatabase
schemacustomizedforyourparticular
business.Thesolutioncanbeinstalledeither
inclusiveofahighperformancedatabase
engine(SQLServer)orasa(virtual)database
schemacompatiblewithmostindustry
AppendixC:FamilyofProducts

Copyright StatSoft, 2011
STATISTICAQuickReference285
standarddatabases;therefore,itwill
seamlesslyintegrateintoexistingdatabase
systems.
BecauseSTATISTICADataWarehousedoesnot
dependononeparticulardatabasevendoror
hardwareplatform,itisitselfentirely
platformindependent.ThemainData
Warehousesoftwarewillconnecttoany
databaseformatand,hence,canefficiently
combineandpoolinformationfrommultiple
sources.
STATISTICADataWarehouseapplication
softwarewillrunonserverswithmultiple
processorsorbanksofmultipleprocessor
serversforsupercomputerlikeperformance.
Thesystemwillscaleeffortlesslyand
economicallytoevenhugedatasizesand
analysis(intelligence)problems.
Web enablement.STATISTICAData
Warehouseextractsinformationfromsources
anywhereintheworldanddelivers
intelligenceanywhereintheworld.
TheWebcomponentofthesystemisbuilton
theprovenSTATISTICAEnterpriseServer
technologythatisusedbyorganizations
worldwidetoprovidesecureaccessvia
standardWebbrowsers.UnlikeotherWeb
basedsolutions,STATISTICADataWarehouse
doesnotrequireanyadditionalcomponentsto
beinstalledonthe(thin)clientmachines.
Advanced security and authentication.The
STATISTICADataWarehouseimplementsa
detailedandsophisticatedsecuritysystemto
ensurethatyourproprietaryknowledgeand
intelligenceissafefromunauthorizedaccess.
Thesystemwilllikelybecomethemost
importantrepositoryofbusinessintelligence
anddecisionsupportresourcesinyour
organization.Therefore,thesecurityofthe
systemisacrucialprioritysothatthose
valuableresourcesareshieldedfrom
unauthorizedaccess.
STATISTICADataWarehouseimplementsthe
highestlevelofsecuritybyestablishinggroups
ofuserswithdifferentlevelsofauthority
(regardingtheinformationthatisaccessible
andtheoperationsthatcanbeperformed),
requiringregularlyupdatedpasswords,etc.
Specialmethodsarealsoinplacetodetectand
guardagainstsystematicelectronicintrusions
(hacking).
Document control.STATISTICAData
Warehouseenablesfulldocumentmanage
ment,compliantwithgovernmentand
industrystandards.
STATISTICADocumentManagementSystem
canbeseamlesslyintegratedintoyour
STATISTICADataWarehouseapplicationto
optimizetheflowofinformationwithinyour
organizationandthusincreaseyour
productivity.Thissystemcanalsobe
configuredtocomplywithall(corporate)
documentationmanagementpoliciesor
regulatoryrequirementsfordocument
security,audittrails,andelectronic
signatures/authentication(as,forexample,
stipulatedbyFDA21CFRPart11:Electronic
Records;ElectronicSignatures;orISO90014.5:
Documentanddatacontrol).
Advanced analytics.STATISTICAData
Warehousecanincorporatethemost
advanceddataanalysisandknowledge
extractionmethodsavailable;youcangofar
beyondOLAPtosimplifyandextract
knowledgeabouteventhemostcomplex
andinaccessibletootherapplications
patternsinthedata.
BecauseSTATISTICADataWarehouseisbuilt
fromthesamehighperformancecomponents
astheentireSTATISTICAlineofanalytic
solutionssoftware,thoseanalyticsolutions
AppendixC:FamilyofProducts

Copyright StatSoft, 2011


286STATISTICAQuickReference
caneasilyandseamlesslybeintegratedinto
yourDataWarehouse.STATISTICAoffersthe
mostcomprehensivesetoftoolsfordata
mining,textmining,dataanalysis,graphicsand
visualization,qualityandprocesscontrol
(includingSixSigma),etc.onthemarket.These
resourcesandtechnologiescanbeconnected
tothedatasourcesintheSTATISTICAData
Warehousetoleveragethemostadvanced
technologiesandalgorithmsavailablefor
analyzingandextractingkeyintelligencefrom
allsources.Forexample,youcanapply
hundredsofneuralnetworksarchitectures,
highestperformancetreeclassifiers(e.g.,
stochasticgradientboostingtrees),flexible
rootcauseanalyses,controlchartingmethods,
powerfulbusinessforecastingmethods,or
sophisticatedanalyticgraphicsmethodsto
convertrawdataintheDataWarehouseinto
usefulandactionableintelligencewithclear
implicationsfordecisionsaffectingyour
business.
Programmability and customizability.
STATISTICADataWarehouseisanopen
architecturesystemthatwillnotlockyouinto
arelationwithasinglevendororsolution;you
canrespondquicklytonewbusinessdemands
andrequirementsthatneedtobe
incorporatedintotheDataWarehouse.
Aswithallapplicationsandsolutionsinthe
STATISTICAfamilyofproducts,STATISTICA
DataWarehouseisfullyprogrammableand
customizable,usingindustrystandardpro
grammingtoolssuchasVisualBasic,C++,Java,
orHTML.Thisfeatureisofkeyimportance
whenyourbusinessdependsonyourabilityto
quicklyadapttonewinformationandbusiness
realities.Becauseyoucancustomizethe
systemwithoutbeingforcedtorelyonthe
programmersofasinglevendororknowledge
ofidiosyncraticscriptingconventions(required
bymanycompetingsolutions),youhavethe
freedomtodevelopyourproprietary
extensionstotheDataWarehouseandtoadd
notonlyyourownreportsbutalsocustom
analyticanddatatransformation/cleaning
procedures,usingwidelyavailableresources
andindustrystandardtools(e.g.,VB,C++,
Java,orHTMLtoolsandprogrammers).Of
course,StatSoftcanalwaysofferyouafull
complementofconsulting,systemintegration,
andprogrammingservicesdeliveredbyan
experiencedstaff.
VERTICAL MARKET APPLICATIONS
PROCEED. Aturnkeymanufacturing
softwaresolutionthatdistills
fundamentalcausalrelationshipsbetween
productsandtheprocessesthatproduce
them,usingdatathatisalreadycollectedand
managed.PROCEEDimplementsthepatent
pendingapproachdevelopedandprovenat
CaterpillarInc.andpoweredbytheSTATISTICA
EnterpriseAnalyticsSoftwarePlatform.
Hightechmanufacturingenterprisestoday
collectvastamountsofdata.
Dataabouttheproductionprocesses.
Dataabouttestsofrawmaterials,
subassemblies,andmaterialsinprocess.
Dataaboutthecriticaltoqualityattributes
offinishedproducts.
Allofthesedatacollectionandstorageefforts
continuetobefueledbyincreasesin
automation,technologyadvancesinthe
storagecapabilitiesofdatarepositories,and
theadvancesinsensorsandothertechniques
formeasurement.Todaysmanufacturersare
sittingonagoldmineofinformation...onlyif
theyareabletotranslateitintoactionable
information.
Collectingdataisnotsufficienttodrive
enterprisechange.Tocreatechange,weneed
totranslatethesedataintoknowledgeand
AppendixC:FamilyofProducts

Copyright StatSoft, 2011
STATISTICAQuickReference287
thencommunicatethatknowledgeinaformat
thatenablesthepeoplewhoareempowered
toactonit.NowisthetimeforthisReturnon
InvestmentfromdatausingPROCEED.
PROCEEDcombinesnovelandtraditional
knowledgeextractionmethodsto:
Deriveandvalidatesimpletocomplex
causalrelationshipsbetweenmanufacturing
processesandproductqualityoutcomes
Deployactionableinformationtoenable
processownersandknowledgeworkersto
comparewhatifscenariosandsimultane
ouslyoptimizemultiplecompeting
outcomes
STATISTICA PowerSolutions.A
solutionpackageaimedforuseat
powergenerationcompaniestooptimize
powerplantperformance,increaseefficiency,
andreduceemissions.Thisproductoffersa
highlyeconomicalalternativetomultimillion
dollarinvestmentsinneworupgraded
equipment(hardware).Basedonmorethan20
yearsofexperienceinapplyingadvanceddata
driven,predictivedatamining/optimization
technologiesforprocessoptimizationin
variousindustries,STATISTICAPowerSolutions
enablespowerplantstogetthemostoutof
theirexistingequipmentandcontrolsystems
byleveragingalldatacollectedattheirsitesto
identifyopportunitiesforimprovement,even
forolderdesignssuchascoalfiredCyclone
furnaces(aswellaswallfiredorTfired
designs).
AppendixC:FamilyofProducts

Copyright StatSoft, 2011


288STATISTICAQuickReference

QuickReference:Index

Copyright StatSoft, 2011
STATISTICAQuickReference289
INDEX
A
accept/rejectattribute,55
Acrobatreports,153
ActiveX,169,181,198,238
documents,238
objects,238
adhocbygroupanalyses,50
advancedlinear/nonlinear
models,275
Advancedtab,18
advice,statistical,33
aggregation,93
AIAGMSAmanual,55
AllSpecsbutton,14
analyses
attributegage,55
automating,40
autominimize,129
buttons,analysisbar,129,
135
closeall,130,137
manufacturing,55
quickvs.advanced,18
recording,230
rerun,236
resume,38,237
selection,16,17
analysisbar,129,130,135
analysisconfiguration,
STATISTICAEnterprise,120
analysismacros,224
analysisspecificationdialogs,
131
analysissummary,54
analysisworkbooks,22
Analysis/GraphOutput
Managerdialog,23,25
analyticfacilities,3
analyticsexamples,11
analyzinglargedata
problems,50
annotations,149
ANOVA
example,34
onewaydesigns,34
repeatedmeasuresdesigns,
34
appendsupplementary
information,132
applicationobject,252
arrangementoffactors,39
attributegageanalysis,55
audittraillogging,103
audittrail,spreadsheets,106
autofiltering,133
autosave,148
automatedneuralnetworks,
276
B
batchformulas,72,75
BFGSalgorithm,276
blockdatagraphs,199,202
block,deselect,17
brushing,132,205
Brushingdialog,205
bundles,variable,40
buttons
AllSpecs,14
ByGroup,44
Functions,74
OK,19
OpenData,13
Options,23,25,134
Spread,20
Summary,19
Variables,19
Zoom,20
ByGroupbutton,44
bygroupanalyses,47
example,43
C
C/C++,59,227,276
canonicalanalysis,276
capabilityanalysis.See
processcapabilityanalysis
caseheaders,175
caselabels,207
casestates,132,205
excluded,207
hidden,207
cases
filterduplicates,85
causeandeffectdiagrams,
277
cellformatting,spreadsheets,
176
centralcompositedesigns,
277
classicmenus,11,12
classificationtrees,276
cleaningdata,84
closeallanalyses,130,137
closeallwindows,137
clusteranalysis,276
codes,36,109
missingdata,90
COMInteroplibrary,251
compliancerequirements,
meeting,105
configurations,different,218
configurations,network,218
conjugategradientalgorithm,
276
copy,23
copywithheaders,23
correlationmatrix,16
correlationsexample,11
correlations,significant,21
correspondenceanalysis,276
Coxproportionalhazards
models,275
creationstamp,109
QuickReference:Index

Copyright StatSoft, 2011


290STATISTICAQuickReference
creditscorecards,281
creditscoring,63
CreditScoring,281
cumulativesumchart,277
customgraphs,203
customuserinterface,
STATISTICAEnterprise,122
customization,12,228
alternativeaccessto
facilities,128
appearanceofSTATISTICA,
213
differentconfigurationsof
STATISTICA,218
documents,214
generaldefaults,215
graphs,29,190,217
localvs.permanent,215
network,218
operationofSTATISTICA,
213
otherapplications,140
STATISTICAVisualBasic,
140,221
toolbars,139
userinterface,127,213
Customizedialog,139
D
data
accessingdirectlyfrom
databases,79
cleaningandfiltering,84
filterduplicatecases,85
filtersparse,87
IDbased,93,280
management,72
manufacturing,46
missing,89
onremoteservers,245
recoding,84,90
retrieveexternal,244
data(cont.)
timeindexed,93,280
transformation,286
transformationformulas,
72,75,76
dataanalysis,interactive,39
dataconfiguration,
STATISTICAEnterprise,115
datafiles
merge,91
opening,13
subsets,92
datamanagement
operations,14
DataMiner,57,278
DataMinerRecipes,59,63
datapreparation,65
dataredundancy,67
deployment,70
nodes(steps),64
projectfiles,61
summary,71
workbookfile,62
datamining,59
DataMiningtab,134
dataspreadsheets,13
Datatab,13,22
datawarehouse,283
DatabaseConnectiondialog,
80
databases,accessingdata
directlyfrom,79
debugger,STATISTICAVisual
Basic,225
defaultgraphs,203
defaults,215
alternativesets,216
deployment,62,70
descriptivestatisticsoptions,
48
designofexperiments,277

dialogs
Analysis/GraphOutput
Manager,23
analysisspecification,131
Analysis/GraphOutput
Manager,25
autominimize,136
Brushing,205
Customize,139
DatabaseConnection,80
FunctionBrowser,74
OpenaSTATISTICAData
File,13
Options,15,25,134,215
outputselection(results),
132
PrintSpreadsheet,24
results,132
selfprompting,19
StartupPanel,13,131
UserInterface,11
VariableBundleManager,
40
variableselection,19,133
variablespecifications,13
VariableSpecifications
Editor,14
WelcometoSTATISTICA,12
DIN55319,52
discriminantanalysis,276
distributionmodel,time
dependent,54
documentcustomization,214
documentmanagement
system,163,282
documenttypes,137
documents,recentlyused,
138
draganddrop,182
QuickReference:Index

Copyright StatSoft, 2011
STATISTICAQuickReference291
E
Edittab,29
ElectronicManual,26,33,36,
257
ElectronicStatisticsTextbook,
27,258
EnhancedSVB,222
Enterpriseinstallations,98
enterprisenetwork,279
enterprisesystems,278
enterprise/QCnetworks,279
EWMAchart,277
exampledatasets,45
examples
accessingdatadirectlyfrom
databases,79
analytics,11
ANOVA,34
bygroupanalyses,43
correlations,11
datapreparationcleaning
andfiltering,84
getexternaldatavia
STATISTICAQuery,244
inputdatadirectlyfrom
Excel,77
macrorecording,230
recordingananalysis,230
spreadsheetformulas,
batchformulas,72
STATISTICADataMiner
Recipes,63
STATISTICAEnterprise,109
STATISTICAEnterprise
Server,98
STATISTICAVisualBasic,
230
summaryresultspanels,51
usingSTATISTICAExtract,
TransformandLoad,93

examples(cont.)
usingSTATISTICAin
regulatedenvironments,
102
variablebundles,40
Excel,77,140,142,148,151,
169,180,182,198,238
inputdatadirectlyfrom,77
openinSTATISTICA,142
exploratorydataanalysis,44,
50
exportoutput,7
extract,transform,andload,
280
F
F1key,13
factoranalysis,276
factors,arrangement,39
filterdata.Seedatacleaning
andfiltering
filterduplicatecases,85
filtersparsedata,87
filteringvariables,133
fixednonlinearregression,
276
formulaeditor,72
formulas,14,72
multiple,75
results,73
spreadsheet,14
fractionalfactorialdesigns,
277
frauddetection,281
frequencytables,51
fromclauses,STATISTICA
Query,180
function
externallycallable,4,227,
228,275
internallyused,12,73,74,
104,196,226
FunctionBrowserdialog,74
Functionsbutton,74
G
gagerepeatability/
reproducibility,277
generaldiscriminantanalysis
models,276
generallinearmodels,36,275
generaloverview
analyticfacilities,3
softwaretechnology,6
uniquefeatures,4
Webenablement,7
generalpartialleastsquares
models,275
generalregressionmodels,275
generalizedlinear/nonlinear
models,275
globalmacros,228
gradientdescentalgorithm,
276
graphs,182,189
autoupdating,143
blockdata,199,202
brushing,205
casestates,205
categories,198
creatingviaSTATISTICA
VisualBasic,209
custom,203
customization,29,190,217
customizing,203
default,203
defaults,217
drawingtools,29
inputdata,198,199
piecharts,194
producedfrom
spreadsheets,28
shortcutmenus,29
specialized,208
QuickReference:Index

Copyright StatSoft, 2011


292STATISTICAQuickReference
graphs(cont.)
STATISTICAVisualBasic,
191
styles,190
summary,21,51
userdefined,191
graphsmenugraphs,204
Graphstab,134
grouppermissions,
STATISTICAEnterprise,111
GxP
applications,102
report,108
H
Help,26,33,36,257
Helptopics,19
hidesummarybox,137
hidewindows,136
HTML,278,286
HTMLoutput,154
I
IDbaseddata,93,280
IDP(inplacedatabase
technology),245
importdata,6,142,245,265
industrystandards,
compatibility,269
infobox,spreadsheets,175
inplacedatabasetechnology
(IDP),245
inputdatagraphs,198,199
inputspreadsheets,177
inputvs.output
spreadsheets,177
integratedlogin,98
interactivedataanalysis,39,
44
Ishikawadiagrams,277
ISO21747,52
J
Java,141,263,276
joinclauses,STATISTICA
Query,180
K
keyboardmacros,225
knearestneighbor,90
knowledgeportal,155,160
L
labels,cases,207
lagfunction,73
Latinsquares,277
Libraryobject,252
limits,specifying
upper/lower,54
LiveScore,281
lockspreadsheets,105
loglinearanalysisof
frequencytables,276
M
macros,40,183
analysis,224
attachtotoolbars/menus,
228
edit,235
global,228
keyboard,225
master,224
record,4,40,50,140,183,
221,224,225
runfromcommandline,
229
managingoutput,147
manufacturing
analyses,55
data,46
process,53
mastermacros,224
menus
spreadsheetshortcut,14
userdefined,140
mergedatafiles,91
MicrosoftOfficeintegration,
142,238
MicrosoftWordintegration,
143
MicrosoftWordreport,154
missingdata,59,62,79,87,
89,90,91
replacementof,90
setvalueof,89,90
mixturedesigns,277
modeldeployment,277
modules,131
monitoringandalerting
server,280
movingaverage/rangecharts,
277
multidimensionalscaling,276
multilayerperceptrons,276
multimediatables,13,173
multipleanalysissupport,128
multitaskingfunctionality,
129
multithreading,265
multivariateexploratory
techniques,276
multivariatestatistical
processcontrol,277
N
.NET,249
neuralnetworks,276,277
NIPALSalgorithm,275
nonlinearestimation,276
notes,inworkbooks,149
QuickReference:Index

Copyright StatSoft, 2011
STATISTICAQuickReference293
O
objectlibrary,249
objects
embedded,16
linked,16
Officeintegration,142,238
offloadtasks,98
OKbutton,19
OLAP,243,246
OLEDB,80,178,243
OLEobjects,181
onewayANOVAdesigns,34
onlinestatisticstextbook,258
openadatafile,13
OpenaSTATISTICADataFile
dialog,13
OpenDatabutton,13
options
autominimize,136
autosave,148
bringtotoponselect,137
hideondeselect,136
hidesummarybox,137
resumeanalysis,136
Optionsbutton,23,25,134
CreateMacro,232
Optionsdialog,15,25,134,
215
Graphs,190
optionspane,15
Oracle,243
outliers,28,59,208
recode,88
output,15,21
graphs,182
HTML,154
MicrosoftWord,154
PDF,153
reports,151,180
spreadsheets,173
standalonewindows,150
web,155
output(cont.)
workbooks,148,169
outputmanagement,134
OutputManager,23,25,134,
147
global,23
options,15
outputspreadsheets,177
output,managing,147
P
parallelprocessing,266
Paretochart,277
partialleastsquares,277
passwordencrypt
spreadsheets,105
PDFfiles,savingto,153
PDFreports,153
PIconnector,283
piecharts,194
poweranalysis,276
powergenerationfacilities,280
powergeneration,optimize
performance,287
principalcomponents,277
principalcomponents/
classificationanalysis,276
PrintSpreadsheetdialog,24
PROCEED,286
processanalysis,277
processcapabilityanalysis,52
processcapabilityindices
standards,52
processcapabilityresults,55
processindustries,280
processinvariantvariables,88
processmissingdata,89
processoptimization,278
processspecificationlimits,
54
process/capabilityanalysis
charts,277
programmingSTATISTICAfrom
.NET,249
projects.SeeSTATISTICA
Projects
Q
qualitycontrolcharts,276
qualitysixpacks,51
query,243
queryexample,244
QuickAccesstoolbar,12,24,
140
Quicktab,5,18,19,213
R
Rlanguage,4,222
radialbasisfunctionnetworks,
276
readonlyspreadsheets,105
recipe.SeeDataMiner
Recipes
recode
outliers,88
recodedata,72
recordinganalyses,230
recoveryfeatures,148
regulatedenvironments,102
reliability/itemanalysis,276
remotedatabases,245
remoteservers,245
remoteservers,inplace
processing,245
repeatedmeasuresANOVA
designs,34
reports,24,151,180
fromworkbooks,152
GxP,108
HTML,154
MicrosoftWord,154
multiple,25
openasaved,25
QuickReference:Index

Copyright StatSoft, 2011


294STATISTICAQuickReference
reports(cont.)
PDF,153
richtextformat,152
single,25
tree,181
requireuserstoenter
comments,104
resultsspreadsheet,21
resumeanalysis,38,136,237
ribbonbar,11,12
RTF
format,181
reports,152
runstest,277
S
samplingplans,277
SAP,243
Scorecard,282
selectclauses,STATISTICA
Query,180
selforganizingfeaturemaps,
276
sequenceassociationandlink
analysis,279
serverintegration,98
serversremote,inplacedata
processing,245
SharePoint,163
shortcutmenus,14
graphs,29
significantcorrelations,21
simpledescriptivestatistics,
51
singledocumentsummary
report,53
sixsigma,51
tools,276
specializedgraphs,208
specifyingupper/lowerlimits,
54
splitscrollinginspreadsheets,
30
Spreadbutton,20
spreadsheetauditlogviewer,
104
spreadsheets,13,173
appendcases,22
appendvariables,22
audittrail,106
autofillblock,32
batchformulas,75
block
autofill,32
copy,23,31
deselect,17
insert,32
move,31
caseheaders,175
cellformatting,176
copyablock,31
defaultlayout,215
draganddrop,31
formulas,14,72,74
header,175
infobox,175
input,177
inputvs.output,177
insertablock,32
lock,105
moveablock,31
output,177
passwordencrypt,105
passwordencryptionvs.
locking,104
printing,24
readonly,105
results,21
shortcutmenus,22
specifyasinput,22
splitscrolling,30
titlebar,175
variableheaders,176
SQL,80,83,243,284
standalonewindows,150
queuelength,150
Startmenu,17
StartupPanel,13,131
statist.exe,251
STATISTICA
controlfromother
applications,140
customizeappearance,213
generaloverview,3
Help,13,26,257
Libraryversion,252
multipleversionsupport,
251
objectlibrary,249
programmingfrom.NET,249
serialnumber,258,259
softwaretechnology,6
Startmenu,17
systemfeatures,275
technicalsupport,258
uniquefeatures,4
STATISTICAAdvanced
Linear/NonlinearModels,
275
STATISTICAAutomated
NeuralNetworks,276
STATISTICABase,275
STATISTICACreditScoring,281
STATISTICADataMiner,278
STATISTICADataMiner
Recipes,59,63
datapreparation,65
dataredundancy,67
deployment,70
nodes(steps),64
projectfiles,61
summary,71
workbookfile,62
STATISTICADataWarehouse,
283
STATISTICADesignof
Experiments,277
QuickReference:Index

Copyright StatSoft, 2011
STATISTICAQuickReference295
STATISTICADocument
ManagementSystem,163,
282
STATISTICAEnterprise,109,
279
createanalysis
configuration,120
createdataconfiguration,
115
createdatabase
connection,113
createnewgroup,111
createnewuser,110
createsystemviewnode,
112
customuserinterface,122
example,109
ObjectView,110
runanalysisconfiguration,
121
systemview,110
STATISTICAEnterprise
Manager,109
STATISTICAEnterpriseServer,
98,155,160,281
demo,271
knowledgeportal,155
publishingcontent,157
saveserverspace,102
schedulingfacilities,99
serverrepository,101
STATISTICAEnterprise
Systems,278
STATISTICAEnterprise/QC,
279
STATISTICAEnterpriseWide
DataMiningSystem,278
STATISTICAExtract,
Transform,andLoad,93,
280
STATISTICALiveScore,281
STATISTICAmodules,131
STATISTICAMonitoringand
AlertingServer,280
STATISTICAMultiStream,280
STATISTICAMultivariate
ExploratoryTechniques,
276
STATISTICAMultivariate
StatisticalProcessControl,
277
STATISTICAPIConnector,283
STATISTICAPowerAnalysis,
276
STATISTICAPowerSolutions,
287
STATISTICAProcessAnalysis,
277
STATISTICAProcess
Optimization,278
STATISTICAprojects,184
saving,184
STATISTICAQualityControl
Charts,276
STATISTICAQuery,79,179,
243
fromclauses,180
joinclauses,180
previewdata,83
retrieveexternaldata,244
selectclauses,180
whereclauses,180
STATISTICAScorecard,282
STATISTICASequence
AssociationandLink
Analysis,279
STATISTICAstartbutton,138
STATISTICAstartmenu,200
STATISTICATextMiner,278
STATISTICAVariance
EstimationandPrecision,
276
STATISTICAVisualBasic,40,
140,183,191,221
analysismacros,224
STATISTICAVisualBasic
(cont.)
creatinggraphs,209
editoranddebugger,225
example,230
executingprograms,227
keyboardmacros,225
mastermacros,224
methods,141
properties,141
structure,228
STATISTICAVisualBasic.NET,
222
STATISTICAworkbooks,148
StatisticalAdvisor,33,258
statisticsbygroups,49
Statisticstab,134
statisticstextbook,online,258
StatSoftwebsite,258
statusbar,130
STRformat,181
structuralequationmodeling,
276
subsets,creating,92
summarybox,137
Summarybutton,19
summarygraphs,21,51
summaryreport,53
summaryresultspanels,51
supplementaryinformation,
append,132
support,258
supportvectormachines,278
survivalanalysis,275
SVB,183,221
T
tableofalleffects,38
tabs
Advanced,18
Data,13,22
DataMining,134
QuickReference:Index

Copyright StatSoft, 2011


296STATISTICAQuickReference
tabs(cont.)
Edit,29
Graphs,134
Quick,18
Statistics,134
View,215
tabs,workbooks,139
Taguchidesigns,277
technicalsupport,258
Teradata,246
textminer,278
timeseriesneuralnetworks,
276
timeseries/forecasting,276
timestamp,109
timedependentdistribution
model,54
timeindexeddata,93,280
toolbars
customize,139
userdefined,139
traceability,108
traceabilityrequirements,103
treeview,15
tree,reports,181
U
userinterface
customization,127,213
generalfeatures,127
interactive,130
interactiveanalyses,131
STATISTICAEnterprise
Server,266
UserInterfacedialog,11
userdefinedmenus,140
userdefinedtoolbars,139
V
variabilityplots,46

variable
block,17,19
changeformat,14
changename,14
formula,14
processinvariant,88
selection,19
selectionconventions,19
specifications,13
VariableBundleManager
dialog,40
variablebundlesexample,40
Variablesbutton,19
variableheaders,176
variableselectiondialog,133
variablespecificationsdialog,
13
VariableSpecificationsEditor,
14
variables
automaticprescreening,
133
bundles,40
ToolTips,43
filtering,133
measurementtypes,133
organizelargesets,40
reorder,47
repeatedselection,40
variancecomponents,275
variancecomponentsfor
randomeffects,277
varianceestimationand
precision,276
Viewtab,215
VisualBasic,221
methods,141
properties,141
W
webbrowser,usingwith
STATISTICA,98
webenablement,7
weboutput,16,155
website,StatSoft,258
Weibullanalysis,277
WelcometoSTATISTICA
dialog,12
whereclauses,STATISTICA
Query,180
Word,140,142,143,148,
154,169,182,198,238
workbooks,22,148,169
draganddrop,172
icons,172
notesandcomments,149
overview,169
printdocumentfrom
within,24
redarrow,236
rerunninganalyses,236
saveaswebpages,150
tabs,138,170
tree,171
X
XbarandRcharts,277
XML,278
Z
Zoombutton,20

QuickReference

Copyright StatSoft, 2011
STATISTICAQuickReference297
QuickReference

Copyright StatSoft, 2011


298STATISTICAQuickReference

Das könnte Ihnen auch gefallen