Handbook on Intelligent Healthcare Analytics. Группа авторов
3
Big Data Knowledge System in Healthcare
P. Sujatha1*, K. Mahalakshmi2 and P. Sripriya3
1Department of Information Technology, Vels Institute of Science, Technology & Advanced Studies, Pallavaram, Chennai, India
2School of Computing Sciences, Vels Institute of Science, Technology & Advanced Studies, Pallavaram, Chennai, India & Women’s Christian College, Chennai, India
3Department of Computer Applications, Vels Institute of Science, Technology & Advanced Studies, Pallavaram, Chennai, India
Abstract
The present society is an Informational Society. The information is essential for industries for accurate results. Knowledge systems integrate the information from multiple sources to extract important insights for decision-making. Due to digitalization, the data available is abundant in all kinds of industry. In order to handle voluminous data and to predict the outcomes accurately, a new framework is evolved using big data and big data analytics. Big data has huge implications for knowledge management. Medical data in the healthcare industry is enormous. In recent years, the healthcare sector is changing from volume-based service industry into value-based service industry, because health is the most precious gift to human beings. Advanced technologies and its related analytical tools increase better healthcare practices and drive human beings into longer life spans. Big data knowledge system has a significant responsibility in the healthcare practices. Big data knowledge systems analyze the large volume of patients’ electronic health records and transform the medical information into knowledge for assisting the process of clinical decisions, for recommendation of medicines, and for better diagnoses of diseases. In this chapter, we discuss big data, its applications, and challenges of the big data knowledge system in the healthcare sector
Keywords: Knowledge system, big data, electronic health records, healthcare, big data knowledge system, big data analytics, medical data, clinical decisions
3.1 Introduction
Since the last decade, there is a constant development in information technology. The most recent trends in technology like social networks, smart phones, computers, and smart wearable devices lead to the growth of data. This growth in the data has given embarks on this latest notion described as big data. Big data is a large data set, which is generated at rapid speed. This data set is very complex, which goes beyond the capacity of traditional database and processing tools for storing, processing, and analyzing. Big data refers to diverse large amount of information created by the industries, machines, and the individual person. The production and usage of big data is persistently increasing. In today’s world, the volume of information far exceeds what the human mind can perceive. Big data requires sophisticated computer storage and analysis tools. In simple terms, big data is a huge amount of data in structured, semi-structured, and unstructured format. Day by day, the digital data volume is increasing steadily at a rapid pace. So, the big data is too large to process using the traditional technologies. Many companies realize the importance of the availability of big data. They use those large data to improve their decision-making and planning capabilities.
Healthcare big data is huge because of its abundant sources like electronic health record (EHR), the data from wearables and medical devices, genomic data, clinical research data, Internet of Things (IoT), the search engine data, and the social media data. It is very difficult to integrate big medical data with traditional information processing systems and databases because of its variety and volume. However, the tool and technology of big data is used for gathering, processing, and storing large volumes of medical data, which is diverse in nature. The role of big data in the healthcare industry is important due to availability and easy accessibility of vast amounts of medical data, increase in healthcare costs, and the need for personalized patient care. Big data analytics effectively accumulate, analyze, and take the knowledge from the medical data.
Big data knowledge systems in the healthcare industry are more important because all these data about patients are used for developing health recommender systems, clinical decision support system, disease prediction system, and knowledge discovery systems. The medical big data with knowledge system assists during the process of analyzing and extracting valuable knowledge. The extracted knowledge used by the healthcare professional for their clinical decisions. The big data knowledge system transforms the healthcare big data to useful information. The healthcare industries are providing better clinical decisions, for the quality patient care with lower cost, using the healthcare data analytics.
Meaning of big data, various dimensions of big data, tools and technologies used in big data, process of creating value from big data, big data analytics, role of big data knowledge system in healthcare, healthcare big data analytics, the applications of big data, and the challenges faced by big data in the healthcare industry are presented and discussed in the following sections.
3.2 Overview of Big Data
3.2.1 Big Data: Definition
Every day, the organization is producing an enormous quantity of data. These huge volumes of data compose “big data”. Big data, complex in nature, requires powerful technologies and advanced algorithms for its processing.
A formal definition of big data was given in [1]: “Big data is the information asset characterized by such a high volume, velocity, and variety to acquire specific technology and analytical methods for its transformation into value.”
Data has been increasing constantly in an unpredicted way from the last decade due to the digitalization and the advancements in technology. In common, the big data is being generated from following sources [14]:
Social data: The social data refers to social media data. It is generated from social media such as YouTube and Twitter. This data is mainly used in market analysis. The analysis on Facebook likes and comments and tweets on Twitter provide the details about the consumer behavior.
Machine data: Machine data refers to the data generated by machines, such as wearable, sensor devices, web logs, and satellites.
Transactional data: The transactional data are generated as a result of the transactions. The transactions can be online or offline. Examples of the transactional data are the delivery receipts, order, invoices, etc.
Human generated data: The human generated data is extracted from the emails, electronic medical reports, messages, etc.
Search engine data: The search engine data are generated from the browsers.
All the abovementioned data are in diverse formats such as comments, videos, email, and sensor data, most of which are in unstructured format. Big data is the huge size of a data set that grows exponentially with time. Examples of big data: Amazon product list, YouTube videos, Google search engine, and Jet engine data. Storing and processing of abovementioned big data is not possible with conventional databases because traditional databases can contain only gigabytes of data. But, the big data contains several petabytes of data. The big data solutions solve this entire problem with distributed storage and processing systems.
3.2.2 Big Data: Characteristics
The massive set of information generated with the utilization of the latest technologies is called big data. This large set of data is used for individual and organizational purposes. Previously, the information was generated, stored, and processed easily because of limited sources of data. The conventional database was the single supply of data. Most of the data in the conventional database was in structured format. Presently, data is in a wide variety of formats such as sensor data, email