Beruflich Dokumente
Kultur Dokumente
Techniques
Abstract
Diabeticretinopathythemostcommondiabeticeyedisease,iscausedbycomplicationsthatoccurs
whenbloodvesselsintheretinaweakensordistracted.Itresultsinlossofvisionifearlydetectionis
notdone.Severaldataminingtechniqueservesdifferentpurposesdependingonthemodeling
objective.Theoutcomeofthevariousdataminingclassificationtechniqueswascomparedusing
rapidminertool.WehaveusedNaivebayesandSupportVectorMachinetopredicttheearly
detectionofeyediseasediabeticretinopathyandfoundthatNaivebayesmethodtobe83.37%
accurate.Theperformancewasalsomeasuredbysensitivityandspecificity.Theabovemethodology
hasalsoshownthatourdatamininghelpstoretrieveusefulcorrelationevenfromattributeswhich
arenotdirectindicatorsoftheclasswhichwearetryingtopredict.
ThecommonestcauseofblindnessamongworkingclassisDiabeticRetinopathywhichoftenleads
tothecompletelossofvision1.TheWorldHealthOrganization(WHO)hasestimatedthatDiabetic
Retinopathyisresponsiblefor4.8%ofthe37millioncasesofblindnessthroughouttheworld.
Thereforeapredictiontechniqueisconceivedsothatearlyprecautionsorcontrolscanbe
implemented.Peoplewithdiabetesaresusceptibletoimpairmentofothervitalorganssuchasheart,
kidneyandeyes2.AttheinitialstageofDiabeticRetinopathy,therewillbesomechangesinthe
visionthatcanbenoticed.Butovertime,DiabeticRetinopathycangetworsenandcausevisionloss.
Imageanalysistoolscanbeusedforautomateddetectionofthesevariousfeaturesandstagesof
DiabetesRetinopathyandcanbereferredtothespecialistaccordinglyforintervention.Thussuch
toolswillbeusefulforeffectivescreeningofDiabeticRetinopathypatients3.Prevalenceofhighrate
ofretinopathycasesfoundworldwideisduetodelayindiagnosisforretinopathysinceitis
asymptomatic4.Therefore,apredictiontechniquehasbeenconceivedsothatearlyprecautionsor
controlscanbeimplemented.
LanordStanleyetal.5devisedamethodtodiagnosediabeticsintheIndiancommunitywiththehelp
offoursimplequestionsviz.age,abdominalobesity,physicalactivityandfamilyhistoryalongwith
onemeasurementforwaistcircumference.
DiabetesDataAnalysisandPredictionModelDiscoveryUsingRapidMiner6analyzeaPima
Indiansdiabetesdatasetcontaininginformationaboutpatientswithandwithoutdiabetes.Thiswork
focusesondatapreprocessing,includingattributeidentificationandselection,outlierremoval,data
normalizationandnumericaldiscretization,visualdataanalysis,hiddenrelationshipsdiscovery,and
adiabetespredictionmodelconstruction.
IHDPS7prototypepredictsthepossibilityofpatientsgettingaheartdiseasefromtheClevelandheart
diseasedatabaseusingdataminingtechniquesdecisiontrees,naiveBayesandneuralnetworkwith9
medicalattributes.Theresultsshowthatthemosteffectivemodeltopredictpatientswithheart
diseasesisnaiveBayes(86.12%)followedbyneuralnetworkanddecisiontrees.Furthermore,it
canincorporateotherdataminingtechniquessuchastimeseries,clusteringandassociationrules.
EmpiricalStudyonthePerformanceofIntegratedHybridPredictionModelontheMedical
Datasets8systemhasbeenproposedtoimprovethediagnosticaccuracyofdiabeticdiseaseby
selectinginformativefeaturesofPimaIndiansDiabetesdataset.Thehybridpredictionmodel
proposedcombinestwodifferentfunctionalitiesofdataminingclusteringandclassificationwithF
scoreselectionapproachtoidentifytheoptimalfeaturesubsetofthePimaIndiansDiabetesdataset.
Theproposedmodelwasvalidatedusingfourparameters,namelytheaccuracyoftheclassifier,area
underROCcurve,sensitivityandspecificity.
Thetwotraditionalclassificationmethods(logisticregressionandFisherlineardiscriminant
analysis)andfourmachinelearningclassifiers(neuralnetworks,supportvectormachines,fuzzyc
mean,andrandomforests)werecompared9toclassifypersonswithandwithoutdiabetes.
Duringtherecentyearstherehavebeenmanystudiesonautomaticdiagnosisofdiabetes,diabetic
retinopathy,heartdiseaseetc.In10amethodhasbeenproposedforautomateddetectionand
classificationofvascularabnormalitiesusingseveraltechniquessuchasscaleandorientation,
selectiveGaborfilterbanks.In11KaplanMeiermethodtogenerateunivariatesurvivalcurvesto
identifypatientswhowereatahigherriskforretinopathy,andresultsshoweddurationofdiabetes,
systolicbloodpressure,glycosylatedhaemoglobin,albuminuria,genderanddiabetestherapywere
significantlyassociatedwiththeoccurrenceofretinopathy.
Study12wasmadetoevaluatetheefficiencyofthreeplantcomponentsviz,cinnamaldehyde,
cinnamicacidandcinnamylalcoholininhibitingAldoseReductase(AR),anenzymeassociatedwith
retinopathyofbothtype1andtype2diabeticpatients.
Aproduct13madefromwholeleafconcentrateofStevia,foundtoreducehyperglycaemiaintype2
diabeticwomen.
In14,itwassuggestedthatincreasedawarenessandtreatmentofdiabetesshouldbeginwith
prevention.
Accordingto15dataminingapplicationscanbedevelopedtoevaluatetheeffectivenessofmedical
treatments.
1. Methods
Dataminingtechniquewasusedtopredictthechancesofdiabeticretinopathy.Underthedata
explorationmode,
almostallattributeselectionmodulesapplicableforthedatatocollectoptimalsubsetofattributes
wereexplored.RapidMinerwaschosenasthedataminingtoolduetoitslearningoperatorsand
operatorframework,whichallowsformingnearlyarbitraryprocesses.
ThoughthereisavailabilityofClevelandClinicFoundationHeartDiseasedataset,forthesakeof
determiningtheaccuracyrateinIndianregion,wehavecollected300clinicalrecordsfromDr.
SeshaiahDiabetesCentre,Chennai,TamilNadu.Theclinicaldatasetspecificationprovides
concise,unambiguousdefinitionforitemsrelatedtodiabetes.
Typically,crossvalidationisusedtogenerateasetoftraining,validationfolds,andwecompared
theexpectederroronthevalidationfoldsaftertrainingonthetrainingfolds.Crossvalidationworks
werecarriedoutbyusingpartofthedatatotrainthemodel,andtherestofthedatasettotestthe
accuracyofthetrainedmodel.Inthiscase,wehavedividedthedatasetinto10partswithtraining
andtestingdataforeachpart.TheproposedarchitectureisgiveninFigure1.Theattributesdata
viewofeachrecordsareshowninTable1.