Beruflich Dokumente
Kultur Dokumente
TRANSFORMATION
LOADINGETL
SUBMITTEDBY:
SIMRANKOUR(228/13)
CONTENTS
CONCEPTOFDATAWAREHOUSE
FEATURES
ARCHITECTURE
ETLOverview
EXTRACTION
TRANSFORMATION
ETLDEMO
LOADING
ETLTOOLS
FEATURESOFETLTOOLS
ADVANTAGESOFETL
DISADVANTAGESOFETL
CONCLUSION
REFRENCES
INTRODUCTIONTODATA
WAREHOUSE
Adatawarehouseisarelationaldatabasewhich
isdevelopedwithaaimforqueryandanalysis
ratherthanfortransactionprocessing
Itcontainshistoricalandcommulativedata
derivedfromtransactiondatafromsingleor
multiplesources.
DWHisasingleversionoftruthfora
organizationandcreatedforapurposeofhelpin
decisionmakingandforecasting.
FEATURESOFDATAWAREHOUSE
SUBJECTORIENTED
INTEGRATED
TIMEVARIANT
NONVOLATILE
SUBJECTORIENTED
INTEGRATED
TIMEVARIANT
NONVOLATILE
ARCHITECTUREOFDATAWAREHOUSE
EXTRACTTRANSFORMATION
LOADING
ETLOVERVIEW
ExtractionTransformationLoadingETL
Togetdataoutofthesourceandloaditintothe
datawarehousesimplyaprocessofcopyingdata
fromonedatabasetoother.
ETLisadatamovementtechnologyspecifically
whereyouaregettingdatafromoneone
applicationsdatastoreandmovingittoanother
loaction.
EXTRACTION
ETLprocessinvolvesextractingthedatafromthesource
systems.
Eachdatasourcehasitsdistinctsetofcharacteristicsthat
needtobemanagedandintegratedintotheETLsystem.
ETLprocessneedstoeffectivelyintegratesystemsthat
havedifferent:
DBMS
OperatingSystems
Hardware
Communicationprotocols
Thelogicaldatamapdescribestherelationshipbetween
theextremestartingpointsandtheextremeendingpoints
ofyourETLsystemusuallypresentedinatable
DATAEXTRACTIONSTRATEGIES
FULLEXTRACTION
PARTIALEXTRACTIONWITHUPDATENOTIFICATION
PARTIALEXTRACTIONWITHOUTUPDATENOTIFICATION
TRANSFORMATION
Inthedatatransformationstage,aseriesofrulesor
functionsareappliedtotheextracteddatainorderto
prepareitforloadingintotheendtarget.
Somedatadoesnotrequireanytransformationatall;such
dataisknownas"directmove"or"passthrough"data.
Animportantfunctionoftransformationisthecleaningof
data,whichaimstopassonly"proper"datatothetarget.
DataExtractedintoastagingserverisarawdataand
cannotbeusedasitis.ItneedstobeCleansed,Mapped
andTransformed
BASICTRANFORMATIONTASKS
Selection
Matching
DataCleansingorEnrichment
ConsolidationsorSummarization
MAJORTRANFORMATIONTYPES
CHARACTERSETCONVERSION
ENCODINGHANDLING:mapping"Male"to"M"
CALCULATEDANDDERIVEDVALUES:sale_amount=
qty*unit_price
TRANSLATINGCODEDVALUES:ifthesourcesystem
codesmaleas"1"andfemaleas"2",butthe
warehousecodesmaleas"M"andfemaleas"F"
SUMMARIZATION,AGGREGATION
ETLDEMO
LOADING
DataLoadingfetchestheprepareddata,applies
ittothedatawarehouseandstoresitinthe
database.
TYPESOFLOADING
InitialLoadpopulatingallthedatawarehouse
tablesforthefirsttime.
IncrementalLoadapplyingongoingchangesas
necessaryinaperiodicmanner.
FullRefreshcompletelyerasingthecontentof
oneormoretableandreloadingwithfreshdata.
ETLTOOLS
SourceandTargetDataSystem
Connectivity
ScalabilityandPerformance
PrebuiltTransformationConnectors
DataProfilingandCleaning
RobustAdminstrationFeatures
EasyIntegrationwithWebservices
ADVANTAGESOFETL
Itskipsallthealltheapplicationslogic,andmove
directlytothedatalayer,havingatargetlocation
wherethedataisneededtobestored.
Thereisnoneedtowriteproceduresandcodes.
Insteadtheenvironmentworkswithgraphical
interfacewhereweareusingdraganddrop
interfacetoshowtheflowofdata.
ETLtoolsaregoodforbulkdatamovements,getting
largevolumeofdataandtransferingtheminbatch.
Theyaregoodforcomplexrulesandtranformations.
DISADVANTAGESOFETL
Wecannotedownunstructureddata
manipulation.
IthastobedependentonReportingTools.
Powercenterisnotcapableofgeneratingreports
ofitsown.
CONCLUSION
ETLisadatamovementtechnologyspecifically,
whereyouaregettingdatafromoneapplications
datastoreandmovingittoanotherlocation
ratherthantryingtointerfacetoanapplications
programminginterfaces.Soyouareskippingall
oftheapplicationslogic,andgoingrightthrough
thedatalayer.Andthen,youhaveatarget
locationwhereyouaretryingtolandthatdata.
REFERENCES
http://www.webopedia.com/TERM/E/ETL.html
www.sitesbay.com/java/featuresofEtl.
www.tutorialspoint.com/Etl.
www.wikipedia.org/Extract,transform,load.
www.Etlpoint.com/Etltutorial
THANKYOU