Beruflich Dokumente
Kultur Dokumente
, MONTH 1
Abstract—Epidemic outbreaks are collective effects of ongoing scalable properties of machine learning (ML). Artificial intelli-
globalization, urbanisation, population mobility, climate change, gence (AI) that uses ML techniques is trending towards trans-
demographical change and evolution of newer strains of infec- formed provision of healthcare services to reduce economic
tious agents that result in high morbidity, mortality and huge
financial loss, such as COVID-19. Thus, the early prediction of the burden of a community or country. This revolutionary change
emergence of a disease can play a pivotal role to prevent a disease has high impact on improving global health. Accordingly,
to become epidemic. In this paper, our prime focus is to propose public healthcare can be supported in several ways in order to
a novel and smart machine learning based advanced model for provide awareness recommendations and safety precautions to
public healthcare to reduce and control epidemic outbreaks. the mass [4].
Collective knowledge from interconnected disciplines, shared
data repository, and diverse roles have been embedded into the The rise of AI has already been used to slow down the
proposed framework. The model includes machine learning as a spread of disease in epidemic situation through early stage
deployable method for predicting early risk factors using data risk prediction [5]. The example of using ML to detect weather
related to ecological, epidemiological, socio-economic and other and land-use patterns associated with dengue fever transmis-
factors. An evaluation based on actual COVID-19 related data sion in Manila is worth mentioning [6]. Furthermore, asthma
demonstrates that early prediction of the epidemic outbreak can
aid the healthcare authority to take appropriate steps to combat healthcare has been aided by ML through the use of weather,
epidemics in early-stage and prevent lives and financial losses historical and pollution data [7]. In fact, the drug development
globally. process has experienced spectacular shift from hype to hope.
Index Terms—Epidemic, outbreak, disease control, early pre- AI can be regarded as a viable solution to detect pandemic
diction, disease management, global health outbreaks or public health emergencies. Therefore, a modelling
framework can be applied to identify the precipitating factors
and forecast future outbreaks [8].
I. I NTRODUCTION The major contribution of this research is to design a
Epidemic outbreaks can pose a noteworthy threat to human high-level architecture of a universal model for early-stage
health. The spread of infectious diseases like COVID-19 are epidemic outbreak prediction and to find the necessity of
more extensive and expeditious than ever. The outbreaks are applying ML to interconnected shared data repository to
generally accelerated by mobility of human population, persis- advance the public healthcare. The proposed model for future
tent urbanization, and global environmental changes including intelligent public healthcare management aims to reduce the
climate, land use, demographics etc. Viruses and bacteria morbidity, mortality, and economical burden for eliminating
evolve quickly by natural selection and other mechanisms the annihilation. Our evaluation based on actual COVID-19
[1]. The evolution may arise different strains to come around related data demonstrates that early prediction of an outbreak
through recombination and mutation. For example, H1 N1 in- provides the necessary tools to combat epidemics and prevent
fluenza strain ("swine flu") had RNA segment from human and valuable lives.
bird viruses, as well as pig viruses from both North America The rest of the paper includes a description of our proposed
and Asia [2]. The devastating consequences of any outbreaks model followed by an evaluation section. Before concluding
are not only confined to mortality and morbidity of the affected the paper the limitations and future scopes have been pro-
region, but also impact global economy, trade, commerce, and vided.
traveling.
High mutation rate and rapid evolution of viruses lead to II. P ROPOSED M ODEL
the emergence of drug-resistant strains rather than eradication. This section describes our proposed model for future public
The discovery and development of new interventions are health management in case of a epidemic disease outbreak like
usually time-consuming. Meanwhile the infection can result in COVID-19. The core of this model is based on the role played
a significant loss to public health and economy. The emergent by different stakeholders and data shared among all parties.
threat posed by the epidemics can become a global public Fig. 1 shows a high-level architecture of our model, which
health concern, which enhances the need for prompt detection will be managed and controlled by AI entities, institutions,
and characterization of outbreaks precisely [3]. and domain experts. In this context, we elaborate the role
Contemporary researchers have found intelligent models of different entities, discuss the data domain, and highlight
as a way to predicting pandemic diseases based on diverse the workflow of the system consisting of many processes
factors, such as land use, human mobility information, spatial that are either computation processes or stakeholder-initiated
dynamics, and overland travel data, due to the flexible and processes.
JOURNAL OF LATEX CLASS FILES, VOL. , NO. , MONTH 2
B. Data Layers
This section describes shared data repository, data collec-
tion, and data integration and process layer.
1) Shared Data Repository Layer: In this layer, data con-
tributed by different entities are stored in a shared data reposi-
tory. This repository can be placed in cloud services, such as in
Azure, Amazon, and Google that enable big data analytics for
managing, processing, and storing the vast amount of hetero-
geneous data. Common data types for epidemic or pandemic
predictions are: demographic data, climate data, historical data,
etc. Besides, for treatment predictions, collective knowledge
of researchers and healthcare professionals, such as doctors,
nurses, and pharmacists will be involved.
2) Data Collection Layer: Data are obtained in both tradi-
tional and electronic manner to produce more authentic and
fast data. The traditional procedures, for example, surveys,
interviews are mostly authentic but the process is time-
consuming, whereas electronic procedures like social media
crawling and web application logs are convenient, automated,
Fig. 1: Proposed Model for Future Public Healthcare Manage- and faster. Our model suggests using both mechanisms to
ment collect data, as the sources can be different in category, size,
and specialization.
3) Data Integration and Process Layer: This layer mainly
deals with AI, ML, and data mining (DM) that can use the
A. Role of Different Entities
resources mentioned above to investigate patterns, clusters,
and predict diseases. DM and ML have a range of advanced
In this model, we consider various stakeholders and data techniques to pre-process, classify and cluster different types
providers from diverse genre. For instance, of data. The tools to use this computer-aided concept are avail-
able and enriched. AI will cover the knowledge integration and
• Government: The government acts as a major data processing and provide the means of using softbots to analyze
provider during the time of a pandemic. They can share an and forecast epidemic outbreaks.
enormous amount of valid data, for example, travel data,
flight schedules, and even private data to handle complex
situations by preserving the privacy of citizens.
• General population: This entity act as the user and the
client of healthcare services of this model. The social
media usage of this entity, chat-bot data, and their tele-
health reporting data can contribute to both data analysis
as well as automatic recommendations.
• Healthcare professionals: Real-time data can be ob-
tained from government-run hospitals, clinics, tele-
healthcare, and other heathcare service providers using
government management information system (GMIS) of a
country. Thus, healthcare professionals act as knowledge-
base data providers and can contribute to decision-making
by providing more accurate predictions of disease out-
breaks and viable precautions and treatments.
• Analysts: An analyst can be one who produces different
insights into data and detects interesting patterns of data.
For this model, we consider lab experts and data analysts,
who can support the proposed public health management
model with advanced datasets and findings.
• Industrial bodies: The industry and business can provide
healthcare and precaution supports. On the other hand,
they can also act as business insights and case report
based data providers. For example, telecom companies
can support sending bulk SMS for alerting people of a Fig. 2: Workflow of Proposed model
particular problem area.
JOURNAL OF LATEX CLASS FILES, VOL. , NO. , MONTH 3
TABLE IV: Prediction by RF classifier with accuracy mea- F. Evaluation and verification
sures.
To evaluate the performance of the classifier we considered
Prediction Predicted the widely accepted ML evaluation measures [13]:
Accuracy F-measure
item value i) Accuracy and F-measures in terms of true positive, true
District Dhaka 0.999 0.98
negative, false Positive, and false negative.
Gender Male 0.85 0.85
Symptom Dry-cough 0.98 0.98 ii) Precision-recall curve area (PRC area value), which
Pre-existing Cardiovascular 0.99 0.98 represents plot of Precision vs. Recall for all potential cut-
Condition offs for a test area [13].
To verify, we cross-matched the prediction with the status
of LD. From Table IV, it is evident that the prediction result
was created using WD, NHCD, and LD. The global knowledge matches the status of LD. This table contains the most signif-
of potential symptoms is provided by doctors and researchers icant prediction from NHCD dataset for COVID-19 disease.
around the world. NHCD data are from the public and autho- Again, a graph in Fig. 3, demonstrates the PRC area
rized healthcare experts. The data from LD are also verified plotting for each district, whereas the illustration Fig. 4 shows
and released data by the government. So, it can be observed weight in respective districts. Here, the weight is a simple
that different roles and data from different layers have been statistical value representing the number of confirmed COVID-
integrated to create a strong knowledge base. 19 patients in respective districts.
It is evident from Fig. 3 that by using ML classification
D. Selection algorithm technique a hidden pattern has been discovered. The plotting
Although there are several existing ML algorithms for pre- for the PRC area per district matches the statistical curve
diction, our problem and purpose require an algorithm that is of Fig. 4. Therefore, if the regular call data from NHCD
efficient for classification and regression. Besides, that specific was monitored by ML techniques like RF, more hygiene and
algorithm should be used commonly for healthcare prediction medical precautionary measurements could be undertaken by
[13]. Therefore, the ensemble decision tree technique of Ran- the infection prevention and control (IPC) focal for the risky
dom Forest (RF) was selected to train the classifier that is districts. Eventually, it could help the authorities to take action
highly scalable regardless of the number of dimensions [14]. against the significant districts before the disease outbreak.
JOURNAL OF LATEX CLASS FILES, VOL. , NO. , MONTH 5
Fig. 3: PRC area value for the class "District" using RF classification algorithm.
ACKNOWLEDGMENT
The authors would like to thank...
R EFERENCES
[1] E. Kussell, “Evolution in microbes,” Annual review of biophysics,
vol. 42, pp. 493–514, 2013.
[2] G. B. A. Flavia and K. Natarajaseenivasan, “Phylogenetic analysis
of h1n1 sequences from pandemic infections during 2009 in india,”
Bioinformation, vol. 5, no. 10, p. 416, 2011.
[3] W. H. Organization, Managing epidemics: key facts about major deadly
diseases. World Health Organization, 2018.
[4] H. Chen, D. Zeng, D. L. Buckeridge, M. I. Izadi, A. Verma, A. Okhma-
tovskaia, X. Hu, X. Shen, Z. Cao, F.-Y. Wang et al., “Ai for global
disease surveillance,” IEEE Intelligent Systems, vol. 24, no. 6, pp. 66–
82, 2009.
[5] M. Odlum and S. Yoon, “What can we learn about the ebola outbreak
from tweets?” American journal of infection control, vol. 43, no. 6, pp.
563–571, 2015.
[6] X. Cui, N. Yang, Z. Wang, C. Hu, W. Zhu, H. Li, Y. Ji, and C. Liu,
“Chinese social media analysis for disease surveillance,” Personal and
Ubiquitous Computing, vol. 19, no. 7, pp. 1125–1132, 2015.
[7] L. Luo, C. Liao, F. Zhang, W. Zhang, C. Li, Z. Qiu, and D. Huang,
“Applicability of internet search index for asthma admission forecast
using machine learning,” The International journal of health planning
and management, vol. 33, no. 3, pp. 723–732, 2018.
[8] K.-K. Mak and M. R. Pichika, “Artificial intelligence in drug develop-
ment: present status and future prospects,” Drug discovery today, vol. 24,
no. 3, pp. 773–780, 2019.
[9] “Coronavirus:,” online; accessed 16 May 2020. [Online]. Available:
https://www.who.int/health-topics/coronavirus#tab=tab_1
[10] “Coronavirus cases:.” [Online]. Available:
https://www.worldometers.info/coronavirus/
[11] “Corona info:,” online; accessed 15 May 2020. [Online]. Available:
https://corona.gov.bd/
[12] “Telemedicine service:,” online; accessed 10 May 2020.
[Online]. Available: https://dghs.gov.bd/index.php/en/mis-docs/dhis-2-
form/84-english-root/ehealth-eservice/490-telemedicine-service
[13] J. Han, J. Pei, and M. Kamber, Data mining: concepts and techniques.
Elsevier, 2011.
[14] F. Portela, M. F. Santos, J. Machado, A. Abelha, and Á. Silva, “Pervasive
and intelligent decision support in critical health care using ensembles,”
in International Conference on Information Technology in Bio-and
Medical Informatics. Springer, 2013, pp. 1–16.
[15] F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion,
O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg et al.,
“Scikit-learn: Machine learning in python,” the Journal of machine
Learning research, vol. 12, pp. 2825–2830, 2011.
[16] R. E. Levasseur, “People skills: Change management toolsâĂŤlewin’s
change model,” Interfaces, vol. 31, no. 4, pp. 71–73, 2001.
[17] J. C. de Man and J. O. Strandhagen, “An industry 4.0 research agenda
for sustainable business models,” Procedia Cirp, vol. 63, pp. 721–726,
2017.