A Detail Study on Big Data Analytics Using Hadoop Technologies
Abstract
Big data is a term that describes the large volume of data. That contains data in the form of both structured and un-structured data. These data sets are very large and complex so that it becomes difficult to process using traditional data processing applications. Big data is difficult to work with using most relational database management systems and desktop statistics and visualization packages, requiring instead "massively parallel software running on tens, hundreds, or even thousands of servers". The technologies used in Hadoop by big data application to handle the massive data are Hdfs, Map Reduce, Pig, Apache Hive, Hbase and Spark. These technologies handle massive amount of data in KB, MB, GB, TB, PB, EB, ZB, YB and BB.
Keywords: Apache hive, Big Data, Hadoop, Hbase, Map Reduce, Pig, SparkFull Text:
PDFCopyright (c) 2018 Edupedia Publications Pvt Ltd
![Creative Commons License](http://licensebuttons.net/l/by-nc-sa/4.0/88x31.png)
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
All published Articles are Open Access at https://journals.pen2print.org/index.php/ijr/
Paper submission: ijr@pen2print.org