Beruflich Dokumente
Kultur Dokumente
MANAGINGTHEDIGITALFIRM
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
BoB TriestoMasterItsData
Problem:BankofBaroda(BoB)isIndias6th largestbank.Hasconsiderable globalpresence,over2800branches,>1000ATMs,widerangeofproducts, differentkindsofcustomers,deliverychannels,subsidiaries,etc. Explosivegrowthcreatedinformationmanagementchallenges.Hadto processdualstatementsbasedonlegacyaswellascentralbankingsystem. Solutions: UseHPNeoview tocreateanenterprisewidesetofdata, preventingunnecessarydataduplication. Neoview consolidateslegacyapplicationsandindividualsmalldatasources intoDWH.Eliminatesoutdated,incompleteorincorrectlyformatteddata. Offersrealtime update,querying,reportingandconsolidation. DemonstratesITsroleinsuccessfuldatamanagement. Illustratesdigitaltechnologysroleinstoringandorganizingdata.
Session1214
FOUNDATIONSOFBUSINESS INTELLIGENCE:DATABASESAND INFORMATIONMANAGEMENT
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
OrganizingDatainaTraditionalFileEnvironment
OrganizingDatainaTraditionalFileEnvironment
TRADITIONALFILEPROCESSING
Fileorganizationconcepts
Database:Groupofrelatedfiles File:Groupofrecordsofsametype Record:Groupofrelatedfields Field:Groupofcharactersasword(s)ornumber
Describesanentity (person,place,thingonwhichwe storeinformation) Attribute:Eachcharacteristic,orquality,describing entity
E.g.,AttributesDateorGradebelongtoentityCOURSE
3 4
Theuseofatraditionalapproachtofileprocessingencourageseachfunctionalareainacorporationto developspecializedapplications.Eachapplicationrequiresauniquedatafilethatislikelytobeasubsetof themasterfile.Thesesubsetsofthemasterfileleadtodataredundancyandinconsistency,processing inflexibility,andwastedstorageresources.
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
DatabaseandInformationManagement
Components of an Application
ManagementInformationSystems
OrganizingDatainaTraditionalFileEnvironment
Problemswiththetraditionalfileenvironment(files maintainedseparatelybydifferentdepartments)
Dataredundancy:
Presenceofduplicatedatainmultiplefiles
Datainconsistency:
Sameattributehasdifferentvalues
Programdatadependence:
Whenchangesinprogramrequireschangestodata accessedbyprogram(Y2K,4digitto6digit,boolean)
Program(describethelocationandnatureofthe datarequiredforprocessing)
Userinterfacethatfacilitatesdataprocessing,reporting, queriesetc
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
TheDatabaseApproachtoDataManagement
TheDatabaseApproachtoDataManagement
HUMANRESOURCESDATABASEWITHMULTIPLEVIEWS
Database
Servesmanyapplicationsbycentralizingdataand controllingredundantdata
Databasemanagementsystem(DBMS)
Interfacesbetweenapplicationsandphysicaldatafiles Separateslogicalandphysicalviewsofdata Solvesproblemsoftraditionalfileenvironment
7
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
TheDatabaseApproachtoDataManagement
TheDatabaseApproachtoDataManagement
RELATIONALDATABASETABLES
RelationalDBMS
Representdataastwodimensionaltablescalledrelationsor files Eachtablecontainsdataonentityandattributes
Table:gridofcolumnsandrows
Rows(tuples):Recordsfordifferententities Fields(columns):Representsattributeforentity Keyfield:Fieldusedtouniquelyidentifyeachrecord Primarykey:Fieldintableusedforkeyfields Foreignkey:Primarykeyusedinsecondtableaslookupfieldto identifyrecordsfromoriginaltable
9 10
Arelationaldatabaseorganizesdataintheformoftwodimensionaltables.Illustratedherearetablesfor theentitiesSUPPLIERandPARTshowinghowtheyrepresenteachentityanditsattributes.Supplier NumberisaprimarykeyfortheSUPPLIERtableandaforeignkey forthePARTtable.
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
TheDatabaseApproachtoDataManagement
RELATIONALDATABASETABLES(cont.)
TheDatabaseApproachtoDataManagement
OperationsofaRelationalDBMS
Threebasicoperationsusedtodevelopuseful setsofdata
SELECT:Createssubsetofdataofallrecordsthat meetstatedcriteria JOIN:Combinesrelationaltablestoprovideuser withmoreinformationthanavailableinindividual tables PROJECT:Createssubsetofcolumnsintable, creatingtableswithonlytheinformationspecified
12
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
TheDatabaseApproachtoDataManagement
THETHREEBASICOPERATIONSOFARELATIONALDBMS
TheDatabaseApproachtoDataManagement
ObjectOrientedDBMS(OODBMS)
Storesdataandproceduresasobjects Objectscanbegraphics,multimedia,Javaapplets RelativelyslowcomparedwithrelationalDBMSfor processinglargenumbersoftransactions HybridobjectrelationalDBMS:Providecapabilities ofbothOODBMSandrelationalDBMS
Databasesinthecloud
Theselect,join,andprojectoperationsenabledatafromtwodifferenttablestobecombinedandonly selectedattributestobedisplayed.
TypicallylessfunctionalitythanonpremisesDBs AmazonWebServices,MicrosoftSQLAzure
14
13
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
TheDatabaseApproachtoDataManagement
TheDatabaseApproachtoDataManagement
MICROSOFTACCESS DATADICTIONARY FEATURES
MicrosoftAccesshasa rudimentarydatadictionary capabilitythatdisplays informationaboutthesize, format,andother characteristicsofeachfieldina database.Displayedhereisthe informationmaintainedinthe SUPPLIERtable.Thesmallkey icontotheleftof Supplier_Number indicates thatitisakeyfield.
CapabilitiesofDatabaseManagementSystems
Datadefinitioncapability:Specifiesstructureofdatabase content,usedtocreatetablesanddefinecharacteristics offields Datadictionary:Automatedormanualfilestoring definitionsofdataelementsandtheircharacteristics Datamanipulationlanguage:Usedtoadd,change, delete,retrievedatafromdatabase
StructuredQueryLanguage(SQL) MicrosoftAccessusertoolsforgenerationSQL
ManyDBMShavereportgenerationcapabilitiesfor creatingpolishedreports(CrystalReports)
15 16
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
TheDatabaseApproachtoDataManagement
EXAMPLEOFANSQLQUERY
TheDatabaseApproachtoDataManagement
ANACCESSQUERY
Illustratedhereishowthe queryinFigure67wouldbe constructedusingMicrosoft Accessquerybuilding tools.Itshowsthetables, fields,andselectioncriteria usedforthequery. FIGURE68
IllustratedherearetheSQLstatementsforaquerytoselectsuppliersforparts137or150.Theyproducea listwiththesameresultsasFigure65.
17
18
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
TheDatabaseApproachtoDataManagement
TheDatabaseApproachtoDataManagement
ANUNNORMALIZEDRELATIONFORORDER
DesigningDatabases
Conceptual(logical)design:Abstractmodelfrombusiness perspective Physicaldesign:Howdatabaseisarrangedondirectaccessstorage devices
Designprocessidentifies
Relationshipsamongdataelements,redundantdatabaseelements Mostefficientwaytogroupdataelementstomeetbusiness requirements,needsofapplicationprograms
Normalization
Streamliningcomplexgroupingsofdatatominimizeredundant dataelementsandawkwardmanytomanyrelationships
19
FIGURE69
20
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
TheDatabaseApproachtoDataManagement
NORMALIZEDTABLESCREATEDFROMORDER
TheDatabaseApproachtoDataManagement
Entityrelationshipdiagram
Usedbydatabasedesignerstodocumentthedata model Illustratesrelationshipsbetweenentities
Distributingdatabases:Storingdatabaseinmore thanoneplace
Partitioned:Separatelocationsstoredifferentparts ofdatabase Replicated:Centraldatabaseduplicatedinentirety atdifferentlocations
22
FIGURE610
21
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
TheDatabaseApproachtoDataManagement
ANENTITYRELATIONSHIPDIAGRAM
UsingDatabasestoImproveBusinessPerformanceandDecisionMaking
Verylargedatabasesandsystemsrequirespecial capabilities,tools
Toanalyzelargequantitiesofdata Toaccessdatafrommultiplesystems
Threekeytechniques
1.Datawarehousing 2.Datamining 3.Toolsforaccessinginternaldatabasesthroughthe Web
24
FIGURE611
ThisdiagramshowstherelationshipsbetweentheentitiesSUPPLIER,PART,LINE_ITEM,andORDERthat mightbeusedtomodelthedatabaseinFigure610.
23
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
UsingDatabasestoImproveBusinessPerformanceandDecisionMaking
TheDatabaseApproachtoDataManagement
COMPONENTSOFADATAWAREHOUSE
Datawarehouse:
Storescurrentandhistoricaldatafrommanycore operationaltransactionsystems Consolidatesandstandardizesinformationforuseacross enterprise,butdatacannotbealtered Datawarehousesystemwillprovidequery,analysis,and reportingtools
Datamarts:
Subsetofdatawarehouse Summarizedorhighlyfocusedportionoffirmsdatafor usebyspecificpopulationofusers Typicallyfocusesonsinglesubjectorlineofbusiness
25
FIGURE612 Thedatawarehouseextractscurrentandhistoricaldatafrommultipleoperationalsystemsinsidethe organization.Thesedataarecombinedwithdatafromexternalsourcesandreorganizedintoacentral databasedesignedformanagementreportingandanalysis.Theinformationdirectoryprovidesuserswith informationaboutthedataavailableinthewarehouse.
26
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
UsingDatabasestoImproveBusinessPerformanceandDecision Making
UsingDatabasestoImproveBusinessPerformanceandDecision Making
BusinessIntelligence:
Toolsforconsolidating,analyzing,andproviding accesstovastamountsofdatatohelpusersmake betterbusinessdecisions E.g.,HarrahsEntertainmentanalyzescustomersto developgamblingprofilesandidentifymost profitablecustomers Principletoolsinclude:
Softwarefordatabasequeryandreporting Onlineanalyticalprocessing(OLAP) Datamining
27
Onlineanalyticalprocessing(OLAP)
Supportsmultidimensionaldataanalysis
Viewingdatausingmultipledimensions Eachaspectofinformation(product,pricing,cost, region,timeperiod)isdifferentdimension E.g.,howmanywasherssoldintheEastinJune comparedwithotherregions?
OLAPenablesrapid,onlineanswerstoadhoc queries
28
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
TheDatabaseApproachtoDataManagement
MULTIDIMENSIONAL DATAMODEL
Theviewthatisshowingis productversusregion.Ifyou rotatethecube90degrees, thefacethatwillshow is productversusactualand projectedsales.Ifyourotate thecube90degreesagain,you willseeregionversusactual andprojectedsales.Other viewsarepossible. FIGURE613
UsingDatabasestoImproveBusinessPerformanceandDecision Making
Datamining:
MorediscoverydriventhanOLAP Findshiddenpatterns,relationshipsinlargedatabases andinfersrulestopredictfuturebehavior E.g.,Findingpatternsincustomerdataforonetoone marketingcampaignsortoidentifyprofitablecustomers. Typesofinformationobtainablefromdatamining
Associations Sequences Classification Clustering Forecasting
29
30
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
UsingDatabasestoImproveBusinessPerformanceandDecisionMaking
UsingDatabasestoImproveBusinessPerformanceandDecisionMaking
Predictiveanalysis
Usesdataminingtechniques,historicaldata,and assumptionsaboutfutureconditionstopredict outcomesofevents E.g.,Probabilityacustomerwillrespondtoan offer
Webmining
Discoveryandanalysisofusefulpatternsand informationfromWWW
E.g.,tounderstandcustomerbehavior,evaluateeffectivenessof Website,etc.
Webcontentmining
KnowledgeextractedfromcontentofWebpages
Textmining
Extractskeyelementsfromlargeunstructured datasets(e.g.,storedemails)
31 32
Webstructuremining
E.g.,linkstoandfromWebpage
Webusagemining
UserinteractiondatarecordedbyWebserver
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
UsingDatabasestoImproveBusinessPerformanceandDecisionMaking
TheDatabaseApproachtoDataManagement
LINKINGINTERNALDATABASESTOTHEWEB
DatabasesandtheWeb
ManycompaniesuseWebtomakesomeinternal databasesavailabletocustomersorpartners Typicalconfigurationincludes:
Webserver Applicationserver/middleware/CGIscripts Databaseserver(hostingDBM)
AdvantagesofusingWebfordatabaseaccess:
Easeofuseofbrowsersoftware Webinterfacerequiresfewornochangestodatabase InexpensivetoaddWebinterfacetosystem
33
FIGURE614
UsersaccessanorganizationsinternaldatabasethroughtheWebusingtheirdesktopPCsandWeb browsersoftware.
34
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
ManagingDataResources
ManagingDataResources
Establishinganinformationpolicy
Firmsrules,procedures,rolesforsharing,managing, standardizingdata Dataadministration:
Firmfunctionresponsibleforspecificpoliciesandproceduresto managedata
Ensuringdataquality
Morethan25%ofcriticaldatainFortune1000 companydatabasesareinaccurateorincomplete Mostdataqualityproblemsstemfromfaulty input Beforenewdatabaseinplace,needto: Identifyandcorrectfaultydata Establishbetterroutinesforeditingdataonce databaseinoperation
36
Datagovernance:
Policiesandprocessesformanagingavailability,usability, integrity,andsecurityofenterprisedata,especiallyasitrelatesto governmentregulations
Databaseadministration:
Defining,organizing,implementing,maintainingdatabase; performedbydatabasedesignandmanagementgroup
35
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
ManagementInformationSystems
FOUNDATIONSOFBUSINESSINTELLIGENCE: DATABASESANDINFORMATIONMANAGEMENT
ManagingDataResources
Dataqualityaudit:
Structuredsurveyoftheaccuracyandlevelof completenessofthedatainaninformationsystem
Surveysamplesfromdatafiles,or Surveyendusersforperceptionsofquality
END
Datacleansing
Softwaretodetectandcorrectdatathatare incorrect,incomplete,improperlyformatted,or redundant Enforcesconsistencyamongdifferentsetsofdata fromseparateinformationsystems
37 38