Data Discovery from Large Public Key-Value Data
Abstract
This initial review attempts to provide foundation for understanding the use of big data in healthcare, with a view to explore how big data can be applied to particular areas to gain the maximum benefit for the targeted research. Big data analytics relates to healthcare as an option for solving information system complexities within healthcare. Although the five dimensions of big data are categorized separately, in fact, they intertwine. MapReduce is a programming model and an associated implementation for processing and generating large data. A MapReduce program is composed of a Map() procedure (method) that performs filtering and sorting (such as sorting students by first name into queues, one queue for each name) and a Reduce() method that performs a summary operation (such as counting the number of students in each queue, yielding name frequencies). The "MapReduce System" (also called "infrastructure" or "framework") orchestrates the processing by marshalling the distributed servers, running the various tasks in parallel, managing all communications and data transfers between the various parts of the system, and providing for redundancy and fault tolerance.
Full Text:
PDFCopyright (c) 2017 Edupedia Publications Pvt Ltd
![Creative Commons License](http://licensebuttons.net/l/by-nc-sa/4.0/88x31.png)
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
All published Articles are Open Access at https://journals.pen2print.org/index.php/ijr/
Paper submission: ijr@pen2print.org