Beruflich Dokumente
Kultur Dokumente
net/publication/330222999
CITATIONS READS
0 154
5 authors, including:
Alrence Halibas
Gulf College Oman
15 PUBLICATIONS 9 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
Graduate Attributes and Learning Outcomes: Proposed Framework and Assessment Process View project
Determining the Intervening Effects of Exploratory Data Analysis and Feature Engineering in Telecoms Customer Churn Modelling View project
All content following this page was uploaded by Alrence Halibas on 03 March 2019.
Abstract— Several machine learning classifiers have been Furthermore, this study attempts to determine the most
used for Autism Spectrum Disorder screening, however, appropriate binary classification model for the given
literature in finding the best classifier for this application datasets. It explores different machine learning algorithms
domain is inadequate. Hence, this paper presents a comparison that can effectively classify whether a child, adolescent or
of five (5) supervised machine learning algorithms: Decision adult is likely an ASD candidate. Likewise, this paper tries to
Tree, Naïve Bayes, k-nn, Random Tree, and Deep Learning get a good estimate and comparison of the algorithms’
using small datasets (n=1100) on child, adolescent and adult prediction performance in terms of their accuracy and
ASD screening in finding the most appropriate classifier. These classification error, precision and class recall, and Receiver
algorithms, which are evaluated using a broad set of prediction
Operating Characteristics (ROC).
performance metrics including accuracy, precision/recall
measures, and Receiver Operating Characteristics, are The succeeding sections of this paper are organized as
compared against each other. The experiment result suggests follows: Section II presents the background information on
that the Deep Learning classifier gives the best performance the chosen machine learning algorithms and performance
(with more than 96%) in almost all metrics while the Random metrics, Section III presents the experimental methods that
Tree classifier came out as the least performing classifier in all include the datasets and software used as well as the
the performance metrics. performance analysis and results, and Section IV presents the
Conclusion and Future Work.
Keywords— Autism, Bioinformatics, Machine Learning, Binary
Classification, Deep Learning
II. BACKGROUND INFORMATION
I. INTRODUCTION
Autism, or otherwise known as Autism Spectrum A. Supervised Machine Learning
Disorder (ASD), is a complex mental condition that is Machine Learning is a branch of Artificial Intelligence
exhibited from early childhood and primarily characterized (AI) that learns and discovers meaningful patterns in data to
by communication and social difficulties. It affects 1 in 150 make predictions [13]. It is built on mathematical principles
children worldwide [1]. Early diagnosis and intervention are of probability and statistics as well as computer science.
seen to be the best treatment for this disorder, hence, Supervised learning is a machine learning technique that
increasing interests and approaches in autism understanding learns from mapping input variables and output variables and
and diagnosis are prevalent [2]. uses this mapping for prediction [14]. Simply, it is learning
from examples having two sets of data, a training and a test
Nowadays, machine learning algorithms are rapidly used set [15].
in medical science [3] that transforms biomedical data into
valuable knowledge. It is widely used in bioinformatics to A typical supervised learning problem, as shown in Fig.
build predictive models for detection and diagnosis of 1, contains instance space X contains objects or attributes, a
diseases [4], medical image segmentation [5], gene finding, label space Y, and a prediction space Y’. [16] defines
protein folding prediction, and so many others [3]. classification as the task of learning a model that maps each
Supervised machine learning is now increasingly applied to attribute x to one of the predefined class labels y.
various bioinformatic problems [6]. Research on machine
learning to enhance autism diagnosis is seen to have a
potential and usefulness [7]. In fact, several studies on autism INPUT OUTPUT
diagnostics using machine learning have already been carried Classification
Attribute Set Model Class (or Label)
out by [8], [9]. The ongoing medical research in this field is (x) (y)
attributed to an increasing availability of online data sets and
low-cost computing [10]. In this regard, this study referenced
the study of [11] on ASD Screening. An artefact of the Fig. 1 A Classification Model
author’s work is a mobile application called ASD Test that is
available on Google Play and Mac App Store [12]. The A training sample (attribute) set is denoted as S =
application allows the users to answer a 10 autism-related ((x1,y1),…,(xm,ym)) Є (X ×Y)m that contains a predetermined
questions and suggests the likelihood of having autistic traits. number of examples where each xi Є X. The output is a
The datasets of this application are utilized for this study. model hs : X Y’ which learns from the sample set [17]. In
(5)
B. Software Specification
1. Datasets Used
2. Software Resources