Usage of HIVE Tool in Hadoop ECO System with Loading Data and User Defined Functions

Authors

  • Dr. K. Uma Pavan Kumar Associate Professor,Malla Reddy Institute of Technology, Hyderabad, India Author

DOI:

https://doi.org/10.61841/2y2xdm74

Keywords:

Hive,, -Import,, UDF,, -Map Reduce, Data Loading.

Abstract

The general usage of Hadoop is to store the bulk data with Hadoop Distributed File System and to process the data with Map Reduce. Apart from this the eco system provides extensive functionalities like usage of query-based logics to import the data from local path and Hadoop distributed path. This article presents the usage of Hive in the context of loading the bulk data and some simple analytics applicability. The Hive User Defined functions (UDF) creation and running with eclipse is the additional context of the paper. The work explains the parameters involved in the processing of the data loading and working with UDF’s so as to simplify the Map Reduce (MR) process with HIVE commands.The context of Map Reduce requires the complex coding skills, and the problem is only HDFS path is known to the MR, there is no approach of working with local file system. The basic advantage of Hive is to work with local path files and as well as HDFS path files. Similarly processing wise Hive simplifies coding and functions usage with the implementation of the simple commands.The case study taken in this article deals with various parameters like page views data, system_IP, View_time, user_id and page_url. The other case study we have taken is loading of the bulk data in the less time.The outcome of the work is loading of the data in the context of local path and Hadoop Distributed Path. Loading of the bulk data within seconds and recording of the time taken is the other outcome. The creation of the UDF and running of the tasks in HIVE is the resultant of the work. Apart from these considerations the research issues and possible extension works can be observed in the article.

 

Downloads

Download data is not yet available.

References

1. U. P. K. Kethavarapu,S.Saraswathi, “Ontology based job recommendation system with dynamic source updates by slowly changing source detection”, International Journal of Knowledge Engineering and Soft Data Paradigms, vol. 5, no. 3/4, pp. 164-173, 2016.

2. K.UmapavanKumar, S. V. N. Srinivasu, A. Ramaswamy Reddy, “Hadoop Cluster Performance with MR and Pig Latin in the Big Data”, International Journal of Innovative Technology and Exploring Engineering, vol. 8, no. 7, pp. 83-87, May 2019.

3. K.UmapavanKumar, “DWH security encapsulation with Bitmap Indexing Mechanisms”,IJETCSE, vol. 11, no. 2, pp. 10-15, Nov. 2014.

4. K.Umapavan Kumar, “Various Issues in Hadoop Distributed File System and MR future research

directions”, International Journal of Pure and Applied Mathematics, vol. 120, no. 6, pp. 4441-4451, 2018.

5. U. P. K.Kethavarapu, “The ten ingredients of data base systems for improving performance and their review leading to research problems”, IFRSA International Journal of Computing, vol. 2, no. 2, pp. 409- 415, Apr. 2012.

6. U. P. K. Kethavarapu, “Various Computing models in Hadoop eco system along withthe perspective of analytics using R and Machine learning”, International Journal of Computer Science and Information Security, vol. 14, pp. 17-23.

7. C. P. Chen and C.-Y. Zhang, “Data Intensive Applications, Challenges, Techniques and Technologies: A Survey on Big Data”, Information Science, vol. 275, pp. 314-347,2014.

8. J. Y. Monteith, J. D. McGregorand J. E.Ingram, “Hadoop and its EvolvingEcosystem”, 5th International Workshop on Software Eco System ,pp. 57-68, 2013.

9. Mausam j. Naik (2019) mapksignalling pathway: role in cancer pathogenesis. Journal of Critical Reviews, 6 (3), 1-6. doi:10.22159/jcr.2019v6i3.31778

10. Craddock, T.J.A., Tuszynski, J.A. On the role of the microtubules in cognitive brain functions (2007) NeuroQuantology, 5 (1), pp. 32-57.

11. Georgiev, D.D., Papaioanou, S.N., Glazebrook, J.F. Solitonic effects of the local electromagnetic field on neuronal microtubules (2007) NeuroQuantology, 5 (3), pp. 276-291.

Downloads

Published

30.06.2020

How to Cite

Kumar, D. K. U. P. (2020). Usage of HIVE Tool in Hadoop ECO System with Loading Data and User Defined Functions. International Journal of Psychosocial Rehabilitation, 24(4), 1058-1062. https://doi.org/10.61841/2y2xdm74