Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Framework to Monitor Big Data Processing in the Cloud


Affiliations
1 Department of Engineering and IT, Manipal University, Academic City, Dubai, United Arab Emirates
2 Cerner Healthcare Pvt. Limited, Flat No: B-74, Ganga Heights, HBR Layout, 24th cross, 18th Main, 5th Block, Bangalore-560043, India
     

   Subscribe/Renew Journal


Big data is processed on the cloud as a series of Map and Reduce jobs. Often, huge chunks of data is pushed onto the cloud for processing, thereby overloading the processor. As a result, the jobs working on such huge chunks of data run for many days, making it quite difficult for the Technical Analysts to keep track of such jobs and to find the ischolar_main cause of the delay in completing them. This paper proposes an idea to build a framework and explains the implemented details to overcome the problems currently faced in cloud data processing, thus enhancing the data processing activities in the cloud. The purpose of this implementation is to make Hadoop Big data processing management simpler by developing web based application for provisioning, managing, and monitoring data processing activities on Cloudera apache Hadoop clusters. This paper provides an intuitive, easy-to-use Hadoop Data processing monitoring management web dashboard backed by reporting framework to provide the collective view of the data as obtained from Job Trackers.

Keywords

Mapreduce, Hadoop, Data Processing, Data Monitoring.
User
Subscription Login to verify subscription
Notifications
Font Size

Abstract Views: 148

PDF Views: 2




  • Framework to Monitor Big Data Processing in the Cloud

Abstract Views: 148  |  PDF Views: 2

Authors

R. Vijaya Arjunan
Department of Engineering and IT, Manipal University, Academic City, Dubai, United Arab Emirates
Nagapandu Potti
Cerner Healthcare Pvt. Limited, Flat No: B-74, Ganga Heights, HBR Layout, 24th cross, 18th Main, 5th Block, Bangalore-560043, India

Abstract


Big data is processed on the cloud as a series of Map and Reduce jobs. Often, huge chunks of data is pushed onto the cloud for processing, thereby overloading the processor. As a result, the jobs working on such huge chunks of data run for many days, making it quite difficult for the Technical Analysts to keep track of such jobs and to find the ischolar_main cause of the delay in completing them. This paper proposes an idea to build a framework and explains the implemented details to overcome the problems currently faced in cloud data processing, thus enhancing the data processing activities in the cloud. The purpose of this implementation is to make Hadoop Big data processing management simpler by developing web based application for provisioning, managing, and monitoring data processing activities on Cloudera apache Hadoop clusters. This paper provides an intuitive, easy-to-use Hadoop Data processing monitoring management web dashboard backed by reporting framework to provide the collective view of the data as obtained from Job Trackers.

Keywords


Mapreduce, Hadoop, Data Processing, Data Monitoring.