Data Mining

2016 International Conference on Electrical, Electronics, Communication, Computer and Optimization Techniques (ICEECCOT)
Prediction of Heart Disease at early stage using

Data Mining and Big Data Analytics: A Survey
Salma Banu N.K Suma Swamy
School of Computing and Information Technology, Department of Computer Science & Engineering,
REVA UNIVERSITY, Visvesvaraya Technological University,
Bengaluru, India Sir M. Visvesvaraya Institute of Technology
salmabanunk@reva.edu.in Bengaluru, India
sumaswamy_cs@ sirmvit.edu
Abstract—In this paper, the various technologies of data basically based on patient’s Electro cardiogram (ECG),
mining (DM) models for forecast of heart disease are discussed.
Echocardiography (ECHO) tests results and doctor’s
Data mining plays an important role in building an intelligent
model for medical systems to detect heart disease (HD) using
experience. ECG uses electrodes which has to be placed on
data sets of the patients, which involves risk factor associated the body of the patient from which the heart pulse is
with heart disease. Medical practitioners can help the patients recorded in the form of electrical activity. This signal
by predicting the heart disease before occurring. The large depolarizes for every heartbeat reflected through the
data available from medical diagnosis is analyzed by using electrical activity on the skin. Heart muscles keeps
data mining tools and useful information known as knowledge depolarizing
is extracted. Mining is a method of exploring massive sets of
ECHO is a diagnostic test that uses ultrasound waves to
data to take out patterns which are hidden and previously
unknown relationships and knowledge detection to help the
create an virtual replica reflection of muscles of the heart
better understanding of medical data to prevent heart disease. and that is used to detect cardiovascular diseases (CVD). It
There are many DM techniques available namely is one of the challenging tasks to diagnose, which needs
Classification techniques involving Naïve bayes (NB), Decision prior knowledge with good skills. Data mining is a non-
tree (DT), Neural network (NN), Genetic algorithm (GA), trivial extraction of implicit, previously unknown potential
Artificial intelligence (AI) and Clustering algorithms like K- useful information called as knowledge from the medical
NN, and Support vector machine (SVM). Several studies data using complex algorithms. Big data (BD) can be
have been carried out for developing prediction model using referred as huge record of information set. BD and DM are
individual technique and also by combining two or more two different things. The task carried out by these two
techniques. This paper provides a quick and easy review and methods are similar focusing on collecting the huge amount
understanding of available prediction models using data of data, handling them and preparing report on the data by
mining from 2004 to 2016. The comparison shows the accuracy taking out the information which is knowledgeable. DM is
level of each model given by different researchers. basically an activity of observing the patterns in the data
which is relevant and with particular information by using
Keywords—Data Mining; big data; CVD; risk factors; BD. The useful patterns with hidden patterns, unknown
accuracy; prediction rate; heart disease; DMT and data sets correlations are analytically handled for making
knowledgeable decision through this BD analytics process.
I. INTRODUCTION II. LITERATURE SURVEY
The healthcare industry has huge amount of medical data,
which is not mined. The medical data of the patients has A review is carried out on different techniques used by
hidden patterns which is necessary for data analysis in researchers in detection of HD. Enormous technologies of
detection of heart disease. Heart disease is a leading cause DM are involved in design of HD prediction model.
of death worldwide from past 15 years [1]. The heart is an 2004-2007
important organ of human body. If the blood circulation to
the body is insufficient, the organs of the body that is brain Carlos Ordonez (2004) presented improved study by using
and heart stop working and death occurs in few minutes. rule association for prediction of HD. The study involved
The risk factors associated are identified as age, family finding in detection of HD. The assessed data set covered
history, diabetes, hypertension, high cholesterol, tobacco, medical records of people having heart disease with
smoking, alcohol intake, obesity, physical inactivity, chest attributes like chest pain, blood pressure, cholesterol and
pain type and poor diet [5]. Information obtained by blood sugar for risk associated factors. The heart perfuser
examining the history record of the patient, it is possible to was measured and artery depletion was observed by the
segregate the record and give report on HD if it is positive author [2].
or negative. These factors lead us to detect HD. Diagnosis is
978-1-5090-4697-3/16/$31.00 ©2016 IEEE 256

Carlos Ordonez (2006) solved the limitations of Association measures. The accuracy obtained was 92.1%, 91.0% and
rule. Medical record was collected from “Emory University 89.6% for SVM, ANN and DT. A study carried out by
Radiology Department”. The algorithm which was designed comparing different prediction models of CVD patients with
could search for constraint attributes which intern decreased a cross validation of 10-fold which gives the glimpse of the
set of rules. This algorithm looked for association rules on ability to handle variety of data.
the trained data set and did the validation by testing them
independently. Bioinformatics significance of rules found Survival “1” with survivability and non-survival with “0”
were valued based on support, confidence. Again it was called binary or Boolean representation was subjected on
checked and evaluated using lift. The data consisted of raw data. Amongst 1000 CHD cases, 798 cases- survival
records of the patients with HD. Searching, training and and 202 cases- death [5].
testing reduced association rules and accuracy rate was high Latha et al. (2007) used Coactive Neuro Fuzzy Interface
[3]. Systems and GA in designing HD prediction model which
Peter Leijdekkers et al. (2006) designed “personalized HD was smart with less error for mean square. After analysis of
monitoring application which used smart phones as well as many methods, two techniques are combined those were NN
wireless (bio) sensors”. The analysis depends on sensors and GA. Both together form a hybrid method for prediction
which are real time ones embedded in smart devices which keeping the account of risk factor and optimize the NN
can collect data from environment as well and give an alarm weight. This was the first hybrid method. The primary aim
to the ambulance so that proper care is provided to the was to use this technique in medical decision support and to
patient from the experts and save the life of the patient and indicate the risk for reducing so that it helps patient in
make him out of danger. The alert message is sent to remote reducing the chances of HD further [7].
healthcare center through sensors for addressing the patient. Heon Gyu Lee et al. (2007) used multiparametric features
Then proper care is taken from cardiologist. The model also like linear and non-linear features of high-rate variability
caters in rehabilitation activities for speedy recovery of the (HRV) of 3 posture positions namely supine, left lateral and
patient by monitoring. This includes proper diet followed by right lateral position to find the HRV indices to detect
exercise. coronary artery disease (CAD). Bayesian classification,
The model was designed using “two methods ‘Microsoft’s associative classifier, classification based on multiple
Windows Mobile Pocket PC platform’ and ‘.Net Compact association rule (CMAR), C4.5 (DT) and SVM classifiers
Framework extended with OpenNETCF’ modules used in are used for predicting coronary artery diseases. Statistical
implementing the application. The information is fed on analysis were used for essential feature selection. Accuracy
server called “SQL CE” whose database is complex of SVM, CMAR, C4.5, NB (Tree Augmented NB- TAN)
specially meant for storing information on mobile devices and NB (Selection Tree Augmented NB- STAN) were 90%,
[4]. 80%, 78%, 81% and 85% respectively. SVM showed the
best performance [9].
Hongmei Yan et al. (2006) used Multilayer perception
(MLP) with 40 input variables for Input layer and the output 2008-2010
layer with 5 nodes. Improved backpropogation algorithm Sellappan Palaniappan et al. (2008) proposed an
was used to train the system. 352 medical records were “Intelligent Heart Disease Prediction System (IHDPS)”
collected for train & test the system. Assessment methods model designed by applying DT, NB and NN, DM
like cross validation, holdout and bootstrapping were techniques. Result obtained gave strength of each technique
applied to assess the system. MLP is well known for its which fulfilled the objects. Major queries like “what-if”
good architecture which makes this method most important were answered by IHDPS where general DT could not do.
in NN models also its algorithms are very easy and simple Patterns with relationship type with important knowledge
in understanding. MLP consists of 3 layers namely Input were developed by keeping risk factors of HD. IHDPS
layer, Hidden Layer and Output Layer. The hidden layer model was developed on web, making it more user friendly
was obtained by cascading learning process. The with scalability vertically as well as horizontally and
experimental results achieved were of 90% accuracy [10]. reliable. Risk factors namely age, blood sugar, BP, family
Yanwei X et al. (2007) developed data mining algorithms history and other fields were base for detection of HD. 909
for predicting survival of Coronary Heart Disease (CHD) information set with 15 attributes collected from
patients based on 1000 cases. Because the CHD prediction “Cleveland” HD database. The DM techniques namely NN,
technique needs a priority for addressing as it was a NB and DT were used. Amongst which NB gave best result
challenge for medical society. The work carried vast [6].
observation on the medical data for 6 months of 1000 CHD “IEHPS, heart attack prediction system” proposed by
records. Information of the survival rate was recorded. The Shantakumar B.Patil et al. (2009) was a smart & efficient
three major DM techniques were employed on 502 cases. technique designed using NN. Significant frequent patterns
“10-fold cross-validation” measures and estimates the are generated “by ‘K-means clustering’ and ‘MAFIA’
accuracy and also performance of these techniques. The algorithm”. [11].
measures used were “accuracy, sensitivity and specificity”.
Confusion matrix was obtained to calculate the three
257
Minas A. Karaolis et al. (2009) used “Event-related risk bagging algorithm and support vector machine were
factors” investigated for 1. Before the event: non-modifiable focused. The accuracy generated was 84.1% [13].
and modifiable (risk factors were family history,
hypertension, age, sex, smoking and blood sugar). 2. After Syed Umar et al. (2013) presented a system by combining
the event: modifiable-smoking, systolic/diastolic blood two techniques called as hybrid model by using global
pressure, cholesterol. 528 patients records collected from optimization benefits of GA specifically to initiate the NN
“Paphos district in Cyprus”. “Myocardial infarction (MI), weights. Backpropogation algorithm was used for learning
percutaneous coronary intervention (PCI) and coronary and training the neural network. A multi-layered feed-
artery bypass graft surgery (CABG)” were the events used forward network had “12-input, 10 hidden and 2 output
for investigation. The accuracy obtained were 66%, 75%, nodes”. Input count depends on final risk factor of the
and 75% for MI, PCI and CABG models respectively [18]. chronic. Training and learning is done by “The Levenberg-
Marquardt backpropogation algorithm”. Bias and weight is
K. Srinivas et al. (2010) analyzed the “behavioral Risk recorded and updated by network training function.
Factor Surveillance System” where it was surveyed by Implementation of the system was done by using “Matlab
testing and giving a self-report on CVD rate by telling that R2012a, Global Optimization Toolbox and the Neural
rate of CVD is high in coal mining region by carrying out Network Toolbox”. Risk factors of 50 patients was collected
tests on Singareni Collieries Company in Andhra Pradesh, and the results obtained showed training accuracy of 96.2%
India, compared to other regions after control for other risks. and a validation accuracy of 89% [14].
Diagnosis was done by collecting patient’s report which
gave records of distinguished measures saying about the I.S.Jenzi et al. (2013) designed a reliable classifier model
morbidity. using data mining technique, Association rules and
classification techniques like decision tree, Naïve Bayes and
Along with these attributes the data sets which are Neural Network. The relationship between key patterns was
symptoms of HD were also considered. Sensitivity and established. The data set consists of 14 attributes, where
accuracy were the two key measures for evaluation. class attribute is considered at the end of all attributes. The
GUI was developed in Microsoft .NET platform, with
(Accuracy and sensitivity) obtained was (0.825% & 0.88) interconnections done by using IKVM interface with some
for SVM, (0.825% & 0.8717) for DT C4.5 and (0.897%, Java libraries. The receiver operating characteristic (ROC)
0.9017) for NN- Multi layered perceptron [8]. curves was a visual tool used to represent the accuracy. The
2011-2013 results obtained was represented on ROC which has the area
under ROC of data mining was 0.807 which is found better
Peter, T.J. et al. (2012) used classification data mining for than Naïve Bayes [16].
prediction of heart disease. Input set has intrinsic linear
combination of variables which cannot be adapted at Shamsher Bahadur Patel et al. (2013) deduced 14 attributes
modeling nonlinear complex interaction in medical to 6 by using GA. Then Classifiers NB, classification by
domains. These limitations of conventional medical scoring clustering & DT were used to predict the diagnosis of HD.
systems are handled by using classification models. Genetic search was applied on the 14 attributes and no. of
attributes was reduced to 6. The 6 attributes were Resting
The data is cleaned by using classification method of DM BP (RBP), old peak (OLDPK), chestpaintype (CPTYPE),
where the complex and implicit relationship between number of major vessel colored (VSL), Exercise induced
interdependent variables is detected. Naïve bayes, k-NN, angina (EIA) and max heart rate achieved (THAL). This
decision tree and neural network were implemented on the reduced dataset was applied to 3 classification models. For
knowledge data. Naïve bayes performed better than other designing the model 4 computing evaluation measures used
methods. Accuracy of the classifiers was 83.70%, 76.66%, were namely, True positives (T-POS) refers to positive
75.18% and 78.148% of NB, DT, K-NN, NN respectively tuple, True Negative (T-Neg) refers to negative tuple, False
[12]. Negative (F-Neg) and False positive (F-POS). WEKA tool
Chaitrali S. et al. (2012) used 13 attributes like sex, blood was used for implementation. Accuracy obtained by DT,
pressure, and cholesterol for prediction of heart disease. NB, Classclust was 99.2%, 96.5% and 88.3% respectively.
Two more attributes called smoking and obesity was added. It was observed that DT DMT outperformed the other two
DM classification methods used were NN, DT and NB. DM technique, WEKA – Waikato Environment for
Accuracy obtained for these techniques were 100%, 99.62% Knowledge analysis [15].
and 90.74% respectively. Confusion matrix was obtained for 2014-2016
3 classification methods for 13 attribute data sets and
15attribute data set. The accuracy with 15 attribute was Hlaudi Daniel Masethe et al. (2014) proposed DM
100% for neural network [22]. algorithm namely J48, NB, “REPTREE CART” and Bayes
Net were used for predicting “heart attacks”. The data set
Mai Shouman et al. (2012) proposed a model by using was collected from hospital and doctors who were
single data mining technique and hybrid data mining practitioners in South Africa. 11 attributes considered were
technique. The kernel density, automatically defined groups, patient Identification Number, Gender, age, chest pain,
cardiogram, BP , rating of heartbeat, cholesterol, tobacco
258
consumption, hot drinks intake & diabetes level. The tool operation for searching techniques giving the target result
callled Waikote Environment for knowledge Analysis which is specific, beneficial and best.
(WEKA) was used for prediction of HD. WEKA tool was
significant in discovering analyzing and predicting patterns. In general we can call DM is the “Handler” and BD as its
Accuracy obtained were 99.0741, 99.222, 98.148, 99.0741 “Asset”. BD is referred as the tool which accesses the large
for J48, REPTREE, Naïve Bayes, Bayes Net and simple data sets, process them and produce expected result. In the
CART respectively, Bayes Net algorithm outperformed the field of medical lot of information about the patient gets
N B algorithm [19]. collected. The task of BD is to extract knowledge from such
huge, complicated, heterogeneous data sets and perform
G. Purusothama et al. (2015) compared different operation on this clinical data which is not in proper format
classification techniques to design risk prediction model for is a big challenge. Both DM and BD put together will help
HD. The two kinds of models compared were 1. Single in designing a model to predict the heart disease through
model to test data 2. Combined model to test data (hybrid patient’s medical information.
model). These two models were used for data analysis. The
authors in this survey have considered only the By using these two techniques it is possible to design model
classification techniques in case of single model and where we can check if the patient has HD or not based upon
combined model. Accuracy obtained for decision technique, considering risk factors and also to predict how long the
association rule, K-NN, artificial neural network, naive patient has to stay in the hospital for his/ her treatment.
bayes, hybrid approach were 76%, 55%, 58%, 85%, 86%, Hence by designing this new method will help the chronic
69% and 96% respectively. It was concluded that by patient to improve his health and avoid unwanted hospital
applying hybrid data mining techniques promising results stay which will reduce unnecessary expenses. The patients
were obtained in diagnosis of heart disease [17]. who are likely to get HD can be predicted by using the risk
Lokanath Sarangi et al. (2015) designed a cost efficient factors and necessary precautions and treatments can be
model by using Genetic Algorithm optimizer technique. The started at the earliest.
weights were optimized and fed as an input to the given It is possible to get promising outcome with winning
network. The accuracy achieved was 90% by using the solution by using DM and BD in designing a predictive
hybrid technique of GA and neural networks [20]. model by combining techniques available.
M.Satish, et al. (2015) used different DM techniques (DMT)
like Rule based, DT, NB, and ANN. An efficient approach
called pruning-classification association rule (PCAR) was IV. CARDIOVASULAR DISEASES (CVD) or HEART
used to generate association rules from HD warehouse for DISEASE (HD)
prediction of Heart Disease. Pima Indian Heart attack data The heart and blood vessel diseases are referred as
warehouse was used for pre-processing for mining. All the cardiovascular disease or coronary heart disease (CHD).
above mentioned DMT were explained [21]. HD is a general name for a variety of disease, conditions
III. DATA MINING AND BIG DATA ANALYTICS and disorders that affect the heart and blood vessels.
Symptoms of HD vary depending on the specific type of
Data mining (DM) and big data (BD) work on huge HD. World Health Organization in the year 2003 reported
records of information sets by gathering from different that 29.2% of total global deaths are due to CVD. It is
resources which are best for any field of business expected that CVD will be the leading cause of death in
applications or other clients. Considerable BD can be developing countries due to the change in the life style,
referred in terms of DM where BD is huge information set. work culture and food habits. If a patient has more than one
Traditionally when the data increased it was handled risk factor as specified below then the overall risk of
through different techniques when the current techniques coronary heart disease is much higher. The risk factors can
were expensive at that time. In earlier times MS excel be grouped into 2 categories 1) Risk factors that cannot
spread sheet handled the huge data sorting and playing with change 2) Risk factors that can change.
the data.
Risk factors that cannot change include age, gender and
Huge, homogeneous and complex data gets accumulated family history.
automatically through different sources of applications and
business. It is the responsibility of the database manager Age: Risk increases as a person gets older
who makes the decision to choose relevant and correct Gender: Before the age of 60 men are at greater risk than
chunk of information from the huge set. To extract women
knowledge from that piece of information DM techniques
can be used, as it include many techniques, algorithms, Family History: Risk may increase if close blood relatives
software packages and tools experienced early heart diseases.
The clients can frame there queries for accessing Risk factors that can be changed include blood cholesterol,
information from database through automation instead of high blood pressure and diabetes.
doing everything manually. Specifically DM has optimal
259
Blood Cholesterol: Raised or Altered level. The body makes Pecchia et al. Regression tree 79.31%
its own cholesterol for healthy cells. Because of different Techniques
dietary source, excess cholesterol gets accumulated in the
body which deposits in the Walls of arteries. This damages Fahim Sufi et Clustering Technique
and blocks the arteries, which results in heart attack. 2011 97%
al. Apriori Algorithm
High Blood pressure: The arteries get affected by high BP. 83.70%,
Healthy diet with reduced intake of salt, moderate body
weight, physical activeness; reduced stress factors prevent T John Peter et 76.66%,
2012 NB, DT, k-NN, NN
the increase in Blood pressure. al. 75.18%,
Diabetes: This is a high risk factor of coronary heart disease 78.148%
V. HOW DM IS USEFUL FOR THE MEDICAL FIELD: 100%,
Due to the vast use of computers in the hospitals also by 2012 Chaitrali S et al. NN, DT, NB 99.62%,
doctors who practice, a large amount of information is 90.74%
gathered. Huge set of data consist of relevant information of
the patient along with lot of other information which is the Maishowman et SVM with Bagging
2012 84.1%
noise. The entire set of data may be used by the al. Algorithm
practitioners but the data miners have to extract only
Syed Umar NN Backpropagation
specific concerned information know as knowledge. 2013 96.2%
Amin et al. Algorithm
Today as the research is emerging in a very rapid pace; it is
a major requirement to use the technology available to be 99.2%,
Shamsher DT, NB, Classification
helpful for the society globally. With the available mining 2013 96.5%,
tools it is possible to design a model which can be helpful Bahadur et al. Clustering
for the health care industry. The DM tools can provide us 88.3%
with accurate and time to time report needed for the 2013 I.S. Jenzi NB 80.7%
practitioners so that the patient is benefited.
99.0741%,
VI. COMPARISION TABLE
Hlaudi Daniel J48, REP TREE, NB, 99.222%,
Data Mining Technique/ 2014
Year Author Accuracy et al. CART 98.148%,
Algorithm 99.0741%
2004 Carlos Ordonez Association Rule - B.
Hongmei Yan 85.034%,
2006 Multilayer perceptron 90% 2014 Venkatalakshmi NB, DT
et al. 84.0136%
et al.
92.1%, Lokanath
2007 Yanwei X et al. SVM, ANN, DT 91.0%, 2015 GA Technique 90%
Sarangi et al.
89.6%
81%,
Heon Gyu Lee Bayesian classification, VII. CONCLUSION
2007 80%,
et al. CMAR, C4.5, SVM In this paper, a survey conducted from 2004 to 2015 gives
78%, 85% the idea of different models available and the different data
85.68%, mining techniques used. The accuracy obtained with these
models is also mentioned. It is observed that all the
2008 Sellappan et al. NN, Naïve Bayes, DT 86.12%, techniques available have not used big data analytics. Use of
80.4% big data analytics along with data mining will give
promising results to get the best accuracy in designing the
Minas A
2009 C4.5 (DT) Algorithm 82% prediction model.
Karolis
VIII. FUTURE WORK
82.5%,
K. Srinivas et The Main objective is to identify the key patterns and
2010 SVM, DT (C4.5), MLP 82.5%, features from the medical data of the patient by combining
al.
89.7% data mining techniques along with big data analytics to
predict the heart disease before it causes to help the medical
2011 Leandro Classification & 96.39%,
260
practitioners. The other objective will be to reduce the data 16. I.S.Jenzi, P.Priyanka, Dr.P.Alli, “A Reliable Classifier Model
sets and increase the accuracy of prediction model. Using Data Mining Approach for Heart Disease Prediction”,
International Journal of Advanced Research in Computer
Science and Software Engineering, 2013.
17. G. Purusothama and P. Krishnakumari, “A Survey of Data
mining techniques on risk prediction: Heart disease”, Indian
Journal of Science and Technology, 2015.
REFERENCES 18. Minas A. Karaolis, Joseph A. moutiris, Dementra Hadjipanayi,
“ Assessment of the risk factors of coronary heart events based
on data mining with decision trees”, IEEE transactions on
1. World Health Organization. World Health statistics Annual, information technology in biomedicine,2010.
Geneva, Switzerland: World Health Organization (2006), 19. Hlaudi Daniel Masethe, Mosima Anna Masethe, “Prediction of
available from: http://www.who.int/ mediacentre/ factsheets/ Heart Disease using classification Algorithm”, Proceedings of
fs310.pdf. the World Congress on Engineering & Computer Science 2014,
2. Carlos Ordonez. Improving heart disease prediction using WCECS.
constrained association rules. Presented in a Seminar at 20. Lokanath Sarangi, Mihir Narayan Mohanty, Srikanta Pattnaik,
University of Tokyo; 2004. “An Intelligent Decision Support System for Cardiac Disease
3. Association Rule Discovery With the Train and Test Approach Detection”, IJCTA, International Press 2015.
for Heart Disease Prediction, Carlos Ordonez IEEE 21. M.Satish, D Sridhar, “Prediction of Heart Disease in Data
Transactions On Information Technology In Biomedicine, VOL. Mining Technique”, International Journal of Computer Trends
10, NO. 2, APRIL 2006. & Technology (IJCTT), 2015.
4. Personal Heart Monitoring and Rehabilitation System using 22. Chaitrali S. Dangare Sulabha S Apte, “Improve study of Heart
Smart Phones Peter Leijdekkers, Valérie Gay, Proceedings of Disease prediction system using Data Mining Classification
the International Conference on Mobile Business (ICMB’06) 0- techniques”, International journal of computer application,
7695-2595-4/06 $20.00 © 2006 IEEE. 2012.
5. Yanwei X, Wang J, Zhao Z, Gao Y. Combination data mining
models with new medical data to predict outcome of coronary
heart disease. Proceedings International Conference on
Convergence Information Technology;2007. p. 868–72.
6. Sellappan Palaniappan, Rafiah Awang, "Intelligent Heart
Disease Prediction System Using Data Mining Techniques",
IJCSNS International Journal of Computer Science and
Network Security, Vol.8 No.8, August 2008.
7. Latha Parthiban and R.Subramanian, “Intelligent Heart Disease
Prediction System using CANFIS and Genetic Algorithm”,
International Journal of Biological and Life Sciences, Vol.3,
No.3, pp.157-160, 2007.
8. Analysis of coronary heart disease and prediction of heart attack
in coal mining regions using data mining techniques, K.
Srinivas,G. Raghavendra Rao ; A. Govardhan, in IEEE 5th
International Conference on Computer Science and Education
(ICCSE), 2010.
9. Heon Gyu Lee, Ki yong Noh and Keun Ho Ryu, “ Mining
Biosignal Data: Coronary Artery Disease diagnosis Using
Linear and Nonlinear Features of HRV”, Springer-Verlag Berlin
Heidelberg 2007.
10. Hongmei Yan, Yingtao Jiang, Jun Zheng, Chenglin Peng and
Qinghui Li, “ A Multilayer perceptron-based medical decision
support system for heart disease diagnosis”, ELSEVIER 2006.
11. Shantakumar B.Patil and Y.S.Kumaraswamy, “Intelligent and
Effective Heart Attack Prediction System Using Data Mining
and Artificial Neural Network”, European Journal of Scientific
Research, Vol.31, No.4, pp.642-656, 2009
12. An empirical study on prediction of heart disease using
classification data mining techniques, Peter, T.J, .
Somasundaram, K., 2012 International Conference on
Advances in Engineering, Science and Management (ICAESM).
13. Mai Shouman, Tim Turner, Rob Stocker, “ Using data mining
techniques in heart disease diagnosis and treatment”, IEEE
Japan-Egypt Conference on Electronics, Communications and
Computers, 2012.
14. Genetic Neural Network based Data mining in prediction of
Heart disease using risk factors. Syed Umar Amin1, Kavita
Agarwal2, Dr. Rizwan Beg Proceedings of 2013 IEEE
Conference on Information and Communication Technologies
(ICT 2013).
15. Shamsher Bahadur Patel, Pramod Kumar Yadav and Dr. D.P.
Shukla, “Predict the Diagnosis of Heart Disease Patients using
classification Mining Techniques”, IOSR Journal of
Agriculture and Veterinary Science (IOSR-JAVS), 2013.
261

Data Mining

Hochgeladen von

Dokumentinformationen

Copyright

Verfügbare Formate

Dieses Dokument teilen

Dokument teilen oder einbetten

Freigabeoptionen

Stufen Sie dieses Dokument als nützlich ein?

Sind diese Inhalte unangemessen?

Copyright:

Verfügbare Formate

Data Mining

Hochgeladen von

Copyright:

Verfügbare Formate

2016 International Conference on Electrical, Electronics, Communication, Computer and Optimization Techniques (ICEECCOT)

Prediction of Heart Disease at early stage using

978-1-5090-4697-3/16/$31.00 ©2016 IEEE 256

Das könnte Ihnen auch gefallen