[go: up one dir, main page]

0% found this document useful (0 votes)
93 views8 pages

Big Data Analytics For Healthcare Organization A S

This document discusses big data analytics for healthcare organizations. It describes how healthcare data is growing exponentially large due to advanced technology. Big data analytics can be used to analyze this data to uncover valuable insights that can improve healthcare outcomes. The document outlines the big data analytics process and discusses its benefits, such as providing innovative treatment techniques and high quality affordable healthcare. Challenges of big data analytics for healthcare are also mentioned, such as dealing with the large and complex datasets from various sources in different formats.

Uploaded by

Andi Marsali
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
93 views8 pages

Big Data Analytics For Healthcare Organization A S

This document discusses big data analytics for healthcare organizations. It describes how healthcare data is growing exponentially large due to advanced technology. Big data analytics can be used to analyze this data to uncover valuable insights that can improve healthcare outcomes. The document outlines the big data analytics process and discusses its benefits, such as providing innovative treatment techniques and high quality affordable healthcare. Challenges of big data analytics for healthcare are also mentioned, such as dealing with the large and complex datasets from various sources in different formats.

Uploaded by

Andi Marsali
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 8

Advances in Science, Technology and Engineering Systems Journal Vol. 2, No.

4, 189-196 (2017)
ASTESJ
www.astesj.com
ISSN: 2415-6698

Big Data Analytics for Healthcare Organization, BDA Process, Benefits and Challenges of BDA: A Review
Siva Sankara Reddy Donthi Reddy*, 1, Udaya Kumar Ramanadham2
1
Department of Computer Science & Engineering, BIHER, Bharath University, Chennai, Tamilanadu, India
2
Department of Information Technology, BIHER, Bharath University, Chennai, Tamilanadu, India

ARTICLE INFO ABSTRACT


Article history: Day by day, data grows exponentially large using advanced technology and it requires
Received: 10 July, 2017 effective analytical techniques to analyze the unknown and useful facts, patterns,
Accepted:05 September, 2017 associations and new trends which will provide new way for giving treatment to diseases
Online: 09 October, 2017 and to provide good quality healthcare at low cost for everyone. This paper describes
uncover valuable insights, various lifestyle choices, some social determinants, clinical and
Keywords:
financial factors that it may effect the overall health of an individual. It also presents how
Big Data
to analyze the facts by using big data analytics to improve the healthcare in the world and
Big Data Analytics
also describes the various steps involved in Big Data Analytics process and discusses its
Healthcare
advantages and challenges which show impact on healthcare organization.
BDA process
BDA Advantages

1. Introduction petabytes or exabytes. According to [3], with such fast and rapid
growth of data, U.S. healthcare alone will soon reach the zettabyte
In the digital world, data are generated as large sets from
(1021 gigabytes) scale. The main goal of healthcare industry is to
various sources. The fast transition from conventional to digital
analyse this big volume of data for unknown and useful facts,
technologies has contributed to the growth of big data. It provides
patterns, associations and trends with the help of machine learning
evolutionary breakthroughs in many fields with collection of large
algorithms, which can give new innovative techniques for
datasets. Big Data is generated everyday by diverse segments of
treatment of various diseases. The aim is to provide high quality
industries like business, finance, manufacturing, healthcare,
healthcare at lower cost to all. This can be a beneficial one for the
education, research and development etc. In general, it refers to
entire world. Big Data sources are showed in the following Figure
the collection of large and complex datasets which are difficult to
1.
store and process using traditional database management tools or
data processing applications. So there is need of developing and 2. Characteristics of Big Data:
using an effective, innovative tools and technologies offered by
Big Data. Data can be of structured, unstructured and semi- The 5 V’s of Big Data relevant to Healthcare are:
structured type. Different variety of data include the text, audio, i) Volume: As described earlier, healthcare industry
video, log files, sensor data etc. in petabytes and beyond. As the produces the variety of data with more growth rate.
data is too big from various sources in different form, it is According to EMC report and the research firm IDC, the
characterized as 5 V’s. The 5 V’s of Big Data are: Volume, healthcare data increases with 48 per cent annually. In 2013
Variety, Velocity, Veracity and Value [1].Volume represent the year, the healthcare data was 153 Exabyte’s and it may
size of the data - how large the data is. The size of the data can be increase to 2,314 Exabyte’s by 2020.[1-2]
represented in terabytes and petabytes. Variety represents the data
which appears in different forms. Velocity represents the motion ii) Variety: In the past, the healthcare organization was
of the data and the analysis of streaming of the data. Veracity generating clinical data of patients with similar symptoms,
represents the availability and accountability of various sizes of storing and analysing it to derive the most effective course
data. Value represents the high quality of data. The Big Data helps of treatment for the admitted patient. Now the healthcare
more to healthcare in the world [9].The healthcare organization industry is focusing on complete healthcare, by providing
has generated large amount of data till date, which is scaled in an effective treatment through analysis of a patient’s data
from various other sources also. This refers to the variety.
*
Corresponding Author: Udaya Kumar Ramanadham, Professor, Department of Generally, the varied health care data falls into one of the
Information Technology, BIHER, Bharath University, Chennai, Tamilanadu, three categories - i.e. structured, semi structured and
India, Contact No: (+91) 9789994242, E-Mail: rsukumar2007@gmail.com unstructured. Generally the following data is collected:

www.astesj.com 189
D. S. S. Reddy et al. / Advances in Science, Technology and Engineering Systems Journal Vol. 2, No. 4, 189-196 (2017)
clinical data from Clinical Decision Support systems for anesthesia, bedside heart monitors, etc.) can mean the
(CDSS) (physician’s notes, genomic data, behavioural difference between life and death.[8-9]
data, data in Electronic Health Records (EHR), Electronic v) Value: It refers to the quality of data. The data of EMR’s
Medical Records (EMR)), machine generated sensor data, and EHR’s are recognized as high value data normally. But
data from wearable devices, Medical Image data (from CT it is too difficult to certify the value of data from social
scan, MRI, X Ray’s etc.), medical claim related data, media. So, the effective analytical methods are needed for
hospital’s administrative data, national health register data, the high value data to lead for better quality, effective
medicine and surgical instruments expiry date healthcare solutions and innovations.
identification based on RFID data[3-6], social media data
like Twitter data, Facebook data, web pages, blogs and The following Figure 2 depicts the 5 V’s of Big Data in
various articles.[7] Healthcare.

Suppo
Volume Velocity
rt
Batch
Storage Terabytes
Cloud Records/Arch Real Time
Transactions Processes
Data Streams
Tools
Tables, Files
base
Statist
ics
5 V’s of Statistical
Structured
BIG Mobile Unstructured
Big Events
DATA Multi-Factor Data Correlations
Probabilistic Hypothetical
Analyze
Informat Trustworthiness
ion Value
Proces NoSQL Tera Variety Authenticity
sing bytes Orgin, Reputation
Availability
Accountability Veracity
Figure 1: Big Data Sources

iii) Velocity: It refers to the frequency and speed at which


data is generated, captured and shared. More data is Figure 2: Five V’s of Big Data in Healthcare
generated by consumers as well as businesses with in
shorter cycles, from hours, minutes, and seconds down to 3. Literature Survey:
milliseconds. The wearable devices and sensor devices By increasing digitization of healthcare information, it is
collect real time physiological data of patients rapidly. This needed to improve the quality of healthcare, results, and reducing
new data which is being generated every second is posed a the costs. The advanced tools and technologies are used in health
complex and critical challenge for data analysts. Social care organizations to generate valuable insights of digital
media data is also added to velocity as the users views, healthcare information. The organizations must analyse patient
posting data, feeding data scale up in seconds to enormous information to more accurately measure the risk and for better
amount in case of epidemics/national disasters. outcomes. At the same time, many organizations are working to
iv) Veracity: It refers to trustworthiness of data. Data is increase data transparency for producing latest insightful
accumulated in real-time and at a rapid pace, or velocity. knowledge.
The continuous flow of new data presents new challenges. To exchange health information between various providers
Just as the volume and variety of data that is collected and and payers, some integrated delivery networks can be formed. The
stored has changed, so is the velocity at which speed it is pharmaceutical companies are tied up to protect patients’ privacy
generated and that is necessary for accessing, analyzing, while making data available to qualified researchers outside the
and comparing as well as taking decisions based on the organization.
output. Most healthcare data has been traditionally static—
paper files, x-ray films, and scripts. Velocity represents Kiyana Zolfaghar et al. [17] has presented prediction model
regular monitoring, such as more daily diabetic glucose to give possible solutions for congestive heart failure incidents
measurements (or more continuous control by insulin using Mahout Framework. The raw data is pre-processed and
pumps), blood pressure readings, and EKGs. Meanwhile, converted to encoded format which will be given as input to the
in many medical situations, constant real-time data (trauma Mahout framework, using random forest algorithm.
monitoring for blood pressure, operating room monitors Joseph M. Woodside [18] has presented, inefficient vendors
can be identified, and who is poor in the member's lifestyle
www.astesj.com 190
D. S. S. Reddy et al. / Advances in Science, Technology and Engineering Systems Journal Vol. 2, No. 4, 189-196 (2017)
decisions and compliance with preventative care programs. For challenge considering the volume of data that is analyzed in
individuals, intensives can be given, such as cash, gift cards which making UM decisions. WellPoint was teamed the IBM on a new
are considered as one of the recommended changes in the health method to UM: using the cognitive system IBM Watson to
care system. provide approval guidelines for nursing staff, based on clinical
and patient data. WellPoint trained Watson with 25,000 historical
Existing analytical techniques can be applied to the vast
cases. The system uses hypothesis generation and evidence-based
amount of existing patient related health and medical data to reach learning to generate confidence-scored recommendations that
a deeper understanding of results, which can be applied at the help nurses make decisions about UM. The new system provides
point of care. Ideally, these data would inform each physician and responses to all requests in seconds, as opposed to 72 hours for
their patients during the decision-making process and used to urgent pre-authorization and three to five days for elective
identify the appropriate treatment option for that particular patient. procedures with the previous UM process. Encouraged with
success of the system, today 15,835 healthcare provider offices
4. BDA Initiatives for Healthcare Industry in the World use it. [15]
Most of the countries have initiated the number of big data
initiatives around the world. Some of the initiatives are described Seattle Children's Hospital and Regional Medical Centre is
as follows: using big data analytics as part of its Clinical Standard Work
(CSW) program, which defines patient populations and
New Zealand’s Ministry of Health has collaborated with New recommends an ideal protocol for each population, allowing
Zealand Society to Study about Diabetes. These have used SAS. ensuring that every patient at the hospital receives the same
[10] standard of care. The CSW program gets the enormous data from
Data analysis for providing a Virtual Diabetes Register which enterprise data warehouse (EDW) which currently integrates data
combines and filters the health information to determine from 10 sources across the hospital, including electronic medical
accurately that how many people are diagnosed with the diabetic records (EMRs) and billing systems. With the help of CSW
condition and predicting who can develop diabetes in the program, the doctors and nurses get complete information based
future.[12] on thousands of data points of each patient, they get answers to
complex queries about potential treatments and procedures, and
McKinley Children’s Centre of California provides child identify pathways of care for patients with particular needs,
welfare services in Los Angeles country. The organization serves regardless of provider. Clinicians can also evaluate treatment
for more than 700 children annually, which provides the services protocols for determining the resources which need to be allocated
such as residential care, foster care and adoptions, special by hospital. [16]
education, and mental health services. The center has launched an
innovative big data analytics for initiating the staff identifying the Seton Healthcare Family, based on Texas, Austin. It has used
variables that impact each child’s success and identifying the right the IBM Content and Predictive Analytics for Healthcare
combination of programs to improve outcomes. [13] solutions. The system gives an integrated view of relevant clinical
and operational information to drive more informed decision
The Data Science Institute of Columbia University, New making. By teaming unstructured content (History and Physical,
York has collaborated with the New York City Department of Discharge Summaries, Echocardiogram Reports, and Consult
Health and Mental Hygiene (NYC DOHMH) for working a Notes) with predictive analytics, Seton is able to identify patients
project that focuses on the detection of disease outbreaks in New likely for re-admission and introduce early interventions to reduce
York City restaurants. The main goal of this project is to identify cost, mortality rates, and improved patient quality of life.
and analyze the unprecedented volumes of user-contributed
opinions and comments on social media sites such as Twitter, Doctors at UCLA with the help of IBM Watson Foundations
Face book and Yelp, which host massive amounts of content by have recently started using data streaming technology in order to
users about their real-life experiences and opinions about make more informed decisions about brain functions and
restaurants. It will help to extract reliable indicators of otherwise- abnormalities. Using IBM Watson foundations, physicians are
unreported disease outbreaks associated with the restaurants. The able to gather data from sensors to analyze brain functions in real
NYC DOHMH analyses these indicators, as they are produced, to time. As a result of the use of this technology, patient care can be
decide when additional action is required to be taken. This project substantially improved, and doctors have more time to serve for
is developing non-traditional information extraction technology more patients.
over redundant, noisy, and often ungrammatical text-- for a public The U.K.’s National Health Service uses cloud analytics
health task of high importance to society at large. [14] software to pluck numerical and text data on health-care facilities
WellPoint, Inc. is an Indiana polis-based health Benefit from spread sheets and databases and presents it in plain English
Company wanted to reduce the waste of resources (money) by on its website, NHS Choices. This endeavor is helpful to its
improving the utilization management (UM) process, which citizens, as they can make better choice about their care based on
governs the pre-approval of healthcare insurance coverage for information of about 50,000 health care facilities. The software
many medical procedures. Its goals were to accelerate processing uses natural language generation techniques and to examine
of physicians’ treatment requests, save members’ time and through structured data and automatically present it in story form.
improve efficiencies in the approval process, while continuing to US State of North Carolina processes about 88 million claims
base UM decisions on medical evidence and clinical practice totaling about $12 billion annually from 66,000 providers who
guidelines(for ensuring consistency in process).It was a very big treat the state's two million Medicaid patients., The State’s
www.astesj.com 191
D. S. S. Reddy et al. / Advances in Science, Technology and Engineering Systems Journal Vol. 2, No. 4, 189-196 (2017)
Department of health and human services in collaboration with adult people and the generated data is made publicly accessible
IBM used big data analytics to help identify suspicious billing quarter wise.
patterns by healthcare providers. Using three years' worth of
In January 2015, President Obama announced a new
North Carolina Medicaid claims data, IBM data mining software,
biomedical research project that is “Precision Medicine” which
which featured special algorithms and modeling capabilities, was
will use the power of big data to help for the development of
applied to detect common fraud and abuse schemes. Almost 90%
specialized drugs to cure the diseases like cancer and diabetes.
reduction in fraud was achieved.
The program shall collect genetic data of one million Americans
Informatics for Integrating Biology and the Bedside (i2b2) is so scientists could develop drugs and treatments tailored to the
an NIH-funded National Centre for Biomedical Computing based characteristics of individual patients.
at Partners HealthCare System. The i2b2 Centre developed a
scalable informatics framework that will enable clinical 5. Big Data Analytics Process Steps:
researchers to use existing clinical data for discovery research and,
when combined with IRB-approved genomic data, facilitate the There are different steps in Big Data Analytics process.
design of targeted therapies for individual patients with diseases i) Acquisition and Storage of Data: As already described, the
having genetic origins. data is fed to the system through many external sources like
Facebook status, web pages, blogs, articles, social media data
The Human Connective project which led by Washington
like twitter feeds, clinical data from Clinical Decision
University, University of Minnesota, and Oxford University is
Support systems (CDSS), EMR, EHR, machine generated
working to map the human brain by making a comprehensive
sensor data, data from wearable devices, national health
connectivity diagram. It will produce invaluable information
register data, drug related data from Pharmaceutical
about brain connectivity, its relationship to behavior, and the
companies and many more [3]. This data can be stored either
contributions of genetic and environmental factors to individual
in database system or data warehouse. It is convenient to store
differences in brain circuitry and behavior. This will help to figure
such voluminous data on the cloud rather than on physical
reasons why certain people have certain brain disorders, help the
disks because the advancement of cloud computing. This is
physicians to easily diagnose and in certain cases prevention of
more cost effective and manageable way to store data.
mental or physical illness. Over a 3-year span (2012-2015), the
Human Connectome Project (HCP) has scanned 1,200 healthy
Discovery (Phase1)

Pre-
Acquisition Integration Analysis Interpretation
processing

Iterative Process: Quality Not Sufficient or New Questions Arise


Algorithm

(Model)
Application (phase 2)
Classification

Regression

Segmentation
Input Output Association
Apply
Data, some linked Algorithm Tailored Results Sequence
to Individuals

Refine Models

Figure3 Functional Architecture of Big Data Analytics Process

www.astesj.com 192
D. S. S. Reddy et al. / Advances in Science, Technology and Engineering Systems Journal Vol. 2, No. 4, 189-196 (2017)
ii) Cleaning of Data: Generally, the healthcare data is seen as The following Figure 3 depicts the functional architecture of
flaws like many patients don’t share their data completely Big Data Analytics process steps.
like data about their dietary habits, weight and lifestyle. In 5.1 Technology Support for Big Data Analytics in Health Care:
this type of cases the empty fields need to be filled
appropriately. Another example, the gender can be either at There are large number of open source and proprietary platforms
most one of two values i.e. male or female. In case any other and tools available in the market. Some of them are Hadoop, Map
value or no value is present then such entries need to updated Reduce, Storm, Grid Grain. Big Data Databases like Cassanadra,
and handled accordingly. The data from sensors, HBase, Mongo DB, Couch DB, Orient DB, Terrastore, Hive etc.
prescriptions, medical image data and social media data need
Data Mining tools like RapidMiner, Mahout, Orange, Weka,
to be expressed in a structured and suitable form for
Rattle, and KEEL etc. File Systems like HDFS and Gluster.
performing effective analysis.[2]
iii) Integration of Data: The BDA process makes use of data Programming Languages like Pig/PigLatin, R, and ECL. Big Data
where accumulated across various platforms. This data may Search Tools like Lucene, Solr etc. Data Aggregation and
be varied in metadata (the number of fields, type, and format). Transfer Tools like Sqoop, Flume, and Chukwa. Other tools like
The total data can be grouped correctly and consistently into Oozie, Zookeeper, Avro, and Terracotta. Some open source
a dataset which can be effectively used for data analysis platforms are also available like Lumify, IKANOW [11].
purpose. This is a very challenging task, considering the big
volume and variety of big data. The criteria for platform evaluation may be varied for different
iv) Querying, Analysis and Interpretation of Data: After organizations. Generally the ease of use, availability, the
cleaning of data and integration, the next step is to query the capability to handle voluminous data, support for visualization,
data. A query can be simple one like what is mortality rate in high quality assurance, cost, security can be some of the variables
a particular area? Or complex query like how many patients to decide upon the platform and tool to be used. Some of the
with diabetes would be likely to develop heart related platforms and tools are mentioned the following Table
problems in next 6 years? Based upon the complexity of the
query, the data analyst can choose appropriate platform and
analytic tool.
Table 1 Platforms & Tools for Big Data Analytics in Healthcare

Platforms Description Jaql is a functional, declarative


&Tools query language designed to
process large datasets. To
HDFS enables the underlying
Jaql facilitate parallel processing,
The Hadoop storage for the Hadoop cluster.
Jaql converts “‘high-level’
Distributed File It divides the data into smaller
queries into ‘low-level’ queries”
System (HDFS) parts and distributes it across the
consisting of MapReduce tasks.
various servers/nodes.
Zookeeper allows a centralized
MapReduce provides the
infrastructure with various
MapReduce interface for the distribution of
Services, providing
sub-tasks and the gathering of
Zookeeper synchronization across a cluster
outputs. When tasks are
of servers. Big Data analysis
executed, MapReduce tracks the
applications utilize these
processing of each server/node.
services to coordinate parallel
Pig programming language is
processing across big clusters.
configured to assimilate all
PIG and PIG types of data
HBase is a column-oriented
Latin (structured/unstructured, etc.). It
HBase database management system
is comprised of two key
that sits on top of HDFS. It uses
modules: the language itself,
a non-SQL approach.
called PigLatin, and the runtime
version in which the PigLatin Cassandra is also a distributed
code is executed. database system. It is designated
Cassandra as a top-level project modeled to
Hive is a runtime Hadoop
handle big data distributed
support architecture that
across many utility servers. It
leverages Structure Query
also provides reliable service
Hive Language (SQL) with the
with no particular point of
Hadoop platform. It permits
failure
SQL programmers to develop
(http://en.wikipedia.org/wiki/Ap
Hive Query Language (HQL)
ache_Cassandra) and it is a
statements akin to typical SQL
NoSQL system.
statements.

www.astesj.com 193
D. S. S. Reddy et al. / Advances in Science, Technology and Engineering Systems Journal Vol. 2, No. 4, 189-196 (2017)
Oozie, an open source project, iii) Insurance Companies: The government is reimbursed the
Oozie streamlines the workflow and large amount of expenditure for giving medical claims for
coordination among the tasks. patients. We can analyze, identify, predict and minimize
The Lucene project is used the possible frauds related to medical claims by using BDA.
Lucene widely for text [3]
analytics/searches and has been iv) Pharmaceutical Companies: By using BDA techniques
incorporated into several open effectively, the R&D can help pharmaceutical companies
source projects. Its scope to produce drugs that may be most effective for treating a
includes full text indexing and specific disease with in the shorter period.
library search for use within a v) Government: The BDA can help in improving the public
Java application. health surveillance and speed up the response to disease
Avro facilitates data outbreaks. The government can use demographic data,
Avro serialization services. historical data of disease outbreak, weather data, data from
Versioning and version control social media over disease keywords like cholera, flu etc.
are additional useful features. BDA can analyze this massive data to predict epidemics,
Mahout is yet another Apache finding correlation between the weather and likely
project whose goal is to generate occurrence of disease. Therefore preventive measures can
Mahout free applications of distributed be taken to avoid the same. [3]
and scalable machine learning
algorithms that support big data 7. Big Data Analytics - Challenges:
analytics on the Hadoop
platform. The advantages of big data are more for healthcare, but there
are number of challenges which can be broken up.
6. Big Data Analytics Benefits in Healthcare
i) Unstructured and Provenance of Data: The BDA process
The massive amount of data provides the opportunities for can collect data from different sources. Most of the data is
researchers in the Healthcare field to use tools and techniques for unstructured data like medical prescriptions, blogs, tweets,
opening the hidden answers. Big Data Analytics tools and status updates, and comments. It is necessary to generate
techniques can be applied in effective way on large sets of data right metadata for this unstructured data and transform it
then the following benefits will be given: into a structured format. The image and video data should
i) Individuals/Patients: Generally, when treatment is be structured for semantic content and search. By using data
given to a patient, then the historical data can be considered such analysis process, the provenance of data along with its
as a set of similar patients about the symptoms, drugs used metadata should be carried out so it is easy to track the
outcome/response of different patients. With the help of BDA, the processing steps when error generates [3]. Some intelligent
specific treatment is given for a patient based on his genomic data, processing techniques should be proposed to deal the data
location, weather, lifestyle, medical history, response to certain input from sensors and wearable devices. This will help to
medicines, allergies, family history etc. When the genome data is filter/derive the meaningful data, which can then be stored
fully explored for some kind of relation and it can be established on permanent storage. Therefore it will save space.
between the DNA and a particular disease. Then the specific line ii) Missing or Incomplete Data: Some patients may hide their
of treatment can be constructed for every individual. The patients personal information about his/her life style at the time of
will benefit in the following ways: filling forms or oral interviews by doctors. Some fields may
• Correct and effective treatment can be applied. be empty at the time of storing the data in digital format.
• Health related issues will be known in better way. Sometimes it may happen that some of the fields produce
• Preventive steps can be taken in time. wrong results. If analysis is done on the empty or wrong
• Continuous health monitoring at patients location using fields of data, then it may or may not get processed. In both
wearable wireless devices. the cases they produce wrong results. If we leave some
• Designing specialized treatment for patient. records as empty then the analysis may not on cumulative
• Life expectancy and quality will be found in advance. data. If we take wrong value fields then the analysis is
ii) Hospitals: By using effective BDA techniques on the data incorrect and unreliable. This type of issues will be
availability, the hospitals can get following benefits: addressed.
• Predict the patients staying and readmission information. iii) Quality of Data: When we consider data from social media,
then we need to ensure that data whether it is a valid data or
• New healthcare plans will be developed to prevent
not. So it is great challenge to determine the validation and
hospitalization.
quality of data.
• Various questions can be answered by analyzing the data
iv) Technical Challenges: There are different technical
using BDA tools and techniques regarding disease
challenges.
treatment.
• Data aggregation with different database management
• The hospital management can take and manage
systems is also a great challenge in BDA. By dividing
administrative decisions in the better way.
certain standard database design practices meant for a

www.astesj.com 194
D. S. S. Reddy et al. / Advances in Science, Technology and Engineering Systems Journal Vol. 2, No. 4, 189-196 (2017)
specific domain like healthcare, financial sector etc., it BDA solutions can also help clinicians and
can be made easier [3]. We are required more epidemiologists performing analyses across patient
technological standards and protocols for different populations and care venues to help identify disease
database management systems to integrate seamlessly. trends.
• The traditional algorithms can be scaled up to handle the • Clinical Operations: BDA can produce accurate
big volume of data in data mining processes or analysis. solutions for clinical operations without waiting for
The processors speed has come to a point beyond which longer time to take fast decisions.
it’s hard to increase in parallelogram process. So the • Policy, Financial and Administrative: BDA has
trend can be moved towards multi-core processors. In supported the decision makers to integrate and analyze
such a scenarios, we need statistical algorithms which data related to key performance indicators on policy and
can be parallelized otherwise the computing financial aspects.
performance will decrease when they handle complex 9. Conclusion and Future Work
big volume data.[9] The interactive response time is
another big problem while apart from this scaling Big Data Analytics in healthcare is evolving into a promising
complex query processing techniques to terabytes. [3] field for giving new insights from huge data sets and
• An analysis is more useful if a non-technical person is improving results while reducing costs. Its strength is high;
able to understand and interpret it. The large volume and however there are more challenges to overcome. Big Data
variety of data is too hard to represent it visually in a Analytics has the potential to transform the way healthcare
more understandable and easy way. A user should be providers from traditional ways to more suitable and right
able to perform the repeated analysis with the different tools and technologies to gain insight from their clinical and
set of assumptions, data sets and parameters. It will help other data repositories and make constructive decisions. In
the user to better understand the analysis process and the future we’ll see the rapid, widespread implementation and
verify whether the system works in a required way or not. use of big data analytics across the healthcare organizations
and the healthcare industry. To that end, the challenges must
• We need careful evaluation process to use the best
be discussed and see the overcoming measures. As big data
platform and tool for market floods. analytics become more important, more attention will be
v) Data Security: Data Security is another major challenge as required, due to some issues such as guaranteeing privacy,
more and more data is digitized. Most of the people are not safeguarding security, establishing standards and governance,
willing to share their personal data with a fear of security and continually improving the tools and technologies. Big
breach. If there is assurance for data security, then the data analytics and applications in healthcare are at an initial
problem can be managed. There should be strict government stage of development, but rapid advancements of Big Data
policies and norms for what data can be shared and what not. platforms and tools can accelerate their maturing process.
Apart from this, strong technological hardware and software Conflict of Interest
level security precautions and measures should be
implemented to prevent the hacking and interpreting The authors declare no conflict of interest.
malicious code. Acknowledgment
vi) Lack of Experts: There is a more shortage of qualified and I would like to thank to all people who help me prepare this paper
experienced data scientists in the world. So it is necessary to completely. I would also thank to my guide who help me and get
create an expertise in the field of data science to turn the proper suggestions. I would like to thank to all website and journal
promises of big data into reality. papers which I have referred to create my review paper
8. Innovative Ideas and Solutions: successfully.
The following are some possible new innovative ideas and The authors would like to thank all reviewers and Prof. Passerini
solutions of Big Data in Healthcare industry. Kazmerski, Editor for his valuable comments on the manuscript.
• Clinical Decision Support: BDA technologies predict References
outcomes or recommend alternative treatments to
[1] Jasleen Kaur Bains, “Big Data Analytics in Healthcare- Its Benefits, Phases
clinicians and patients at the point of care by and Challenges” , International Journal of Advanced Research in Computer
understanding, analyzing, categorizing and learning Science and Software Engineering, Volume 6, Issue 4, April 2016,Available
from them. online at: www.ijarcsse.com
Wullianallur Raghupathi and Viju Raghupathi, “Big data analytics in
• Personalized Care: By predicting and analyzing disease [2]
healthcare: promise and potential”, Health Information Science and Systems
symptoms in advance personalized care is taken (e.g., 2014, 2:3, Available: http://www.hissjournal.com/content/2/1/3
genomic DNA sequence for cancer care) in real time to [3] VivekWadhwa,”The rise of big data brings tremendous possibilities and
highlight best practice treatments to patients. These frightening
perils”,April2014.Available:http://www.washingtonpost.com/blogs/innovat
solutions may offer early detection and diagnosis before
ions/wp/2014/04/18/therise-of-big-data-brings-remendous-possibilities-
a patient develops disease symptoms. and-frightening-perils/
• Public And Population Health: BDA solutions that can [4] D. Agrawal et. al, “Challenges and Opportunities with Big Data”, Big Data
help in searching and identifying patient population via WhitePaper-Computing Research Association, Feb-2012, Available:
http://cra.org/ccc/docs/init/bigdatawhitepaper.pdf
social media data to predict flu outbreaks based on [5] Nambiar, R. ; Cisco Syst., Inc., San Jose, CA, USA ; Bhardwaj, R. ; Sethi,
consumers’ search, social content and query activity. A. ; Vargheese, R.,”A look at challenges and opportunities of Big Data
www.astesj.com 195
D. S. S. Reddy et al. / Advances in Science, Technology and Engineering Systems Journal Vol. 2, No. 4, 189-196 (2017)
analytics in healthcare”, IEEEConference 2013, Available:
http://ieeexplore.ieee.org/xpl/login.jsp?tp=&arnumber=6691753&url=http
%3A%2F%2Fieeexplore.ieee.org%2Fxpls%2Fabs_all.jsp%3Farnumber%3
D6691753
[6] Ahmed E. Youssef,” A Framework for Secure Healthcare Systems Based on
Big Data Analytics in Mobile Cloud Computing Environments”, The
International Journal of AmbientSystem and Applications 06-2014,
Available: http://airccse.org/journal/ijasa/papers/2214asa01.pdf
[7] J. Archenaa, E.A. Mary Anita,” A Survey of Big Data Analytics in
Healthcare and Government”, Procedia Computer Science, Elsevier,
Volume 50, 2015, Pages 408–413,Big Data,Cloud and Computing
Challenges, Available:
http://www.sciencedirect.com/science/article/pii/S1877050915005220
[8] Matthew Herland, Taghi M Khoshgoftaar and RandallWald, “A review of
data mining using bigdata in health informatics”, Herland et al. Journal of
Big Data 2014, Springer, 1:2 Available:
http://www.journalofbigdata.com/content/1/1/2
[9] MH Kuo, T Sahama, AW Kushniruk, EM Borycki, DK Grunwell, ―"Health
big data analytics: current perspectives, challenges and potential solutions",
International Journal of Big Data Intelligence ,Vol. 1, Issue 1, pp.114-126.
[10] Bernard Marr, "How Big Data Is Changing Healthcare", Available:
http://www.forbes.com/sites/bernardmarr/2015/04/21/how-big-data-is-
changing-healthcare/
[11] “Improve Healthcare Win $3,000,000”, Available:
http://www.heritagehealthprize.com/c/hhp
[12] Cynthia Harvey, “50 Top Open Source Tools for Big Data", Available:
http://www.datamation.com/data-center/50-top-open-source-tools-for-big-
data-1.html
[13] “Big Data Provides True Picture of Diabetic Population”, Available:
http://www.sas.com/en_us/news/sascom/2014q1/nz-ministry-of-health.html
http://www-01.ibm.com/common/ssi/cgi-
bin/ssialias?subtype=AB&infotype=PM&appname=SWGE_YT_YT_USE
N&htmlfid=YTC03753USEN&attachment=YTC03753USEN.PDF
[14] Health Analytics, Available: http://datascience.columbia.edu/health-
analytics
[15] http://www.ibm.com/smarterplanet/us/en/ibmwatson/assets/pdfs/WellPoint
_Case_Study_IMC14792.pdf
[16] Linda L. Briggs, “BigData means better care at Seattle's Children Hospital",
Available: http://tdwi.org/articles/2013/08/13/big-data-analytics-smarter-
care.aspx
[17] Kiyana Zolfaghar, Naren Meadem, Ankur teredesai, Senjuti Basu Roy, Si-
Chi Chin.“Big Data Solutions for Predicting Risk-of-Readmission for
Congestive Heart Failure Patients”.2013 IEEE International Conference on
Big Data, 978-1-4799-1293-3/13.
http://dx.doi.org/10.1109/bigdata.2013.6691760
[18] Joseph M. Woodside. Virtual Health Management, 2014 11th International
Conference on Information Technology New Generations 978-1-4799-3187-
3/14. http://dx.doi.org/10.1109/itng.2014.124

www.astesj.com 196

You might also like