Open Access Open Access  Restricted Access Subscription Access

Sentiment Analysis on Big Data Using Hadoop


Affiliations
1 Department of Computer Science, Sri Krishna Degree College, Bengaluru, India
2 Adarsh Institute of Management, and Information Technology (AIMIT), Bengaluru, India
 

Big Data is the term for collection of data sets so large&complex that it becomes difficult to process using onhand Database Management tools or traditional data processing applications. In this paper we take Twitter as an example to perform sentiment analysis. Twitter, one of the largest social media site receives millions of tweets every day on variety of important issues. This huge amount of raw data can be used for industrial , Social, Economic, Government policies or business purpose by organizing according to our requirement and processing. Hadoop is one of the best tool options for twitter data analysis as it works for distributed Big data , Streaming data , Time Stamped data , text data etc. This paper discuss how to use FLUME and HIVE tool for twitter post analysis. FLUME is used to extract real time twitter data into HDFS. Hive which is SQL like query language is used for some extraction and analysis.

Keywords

BIGDATA, FLUME, HIVE, HADOOP, MAPREDUCE, HDFS.
User
Notifications
Font Size

  • https://www.intechopen.com/books/earthquakes-tectonics...and.../tweeter-prediction
  • https://arxiv.org/pdf/cond-mat/0508476
  • https://en.wikipedia.org/wiki/Big_data
  • https://www.sas.com › Home › SAS Insights › Big Data Insights
  • https://en.wikipedia.org/wiki/Cloud_computing
  • https://creative.adobe.com/products/download/creative-cloud
  • https://aws.amazon.com

Abstract Views: 243

PDF Views: 125




  • Sentiment Analysis on Big Data Using Hadoop

Abstract Views: 243  |  PDF Views: 125

Authors

Manjula Prasad
Department of Computer Science, Sri Krishna Degree College, Bengaluru, India
P. Niveditha
Adarsh Institute of Management, and Information Technology (AIMIT), Bengaluru, India

Abstract


Big Data is the term for collection of data sets so large&complex that it becomes difficult to process using onhand Database Management tools or traditional data processing applications. In this paper we take Twitter as an example to perform sentiment analysis. Twitter, one of the largest social media site receives millions of tweets every day on variety of important issues. This huge amount of raw data can be used for industrial , Social, Economic, Government policies or business purpose by organizing according to our requirement and processing. Hadoop is one of the best tool options for twitter data analysis as it works for distributed Big data , Streaming data , Time Stamped data , text data etc. This paper discuss how to use FLUME and HIVE tool for twitter post analysis. FLUME is used to extract real time twitter data into HDFS. Hive which is SQL like query language is used for some extraction and analysis.

Keywords


BIGDATA, FLUME, HIVE, HADOOP, MAPREDUCE, HDFS.

References





DOI: https://doi.org/10.21095/ajmr%2F2017%2Fv0%2Fi0%2F122462