
Emoticon based Sentiment Analysis using Parallel Analytics on Hadoop


Affiliations
1 Department of IT, St. Joseph’s College (Autonomous), College Road, Tiruchirappalli - 620002, Tamil Nadu, India
2 Jamal Mohamed College, 7, Race Course Road, Khajanagar, Tiruchirappalli – 620020, Tamil Nadu, India
 

Objectives: The major objective of this approach is to provide a sentiment analysis architecture that can operate on streaming big data and deliver effective results within tolerable time limits. Method/Analysis: A mechanism for analyzing social networking messages to identify their sentiment levels is proposed. It is a generic model that can be used for product-specific or organization-specific analysis. The method also considers emoticons, which form an integral part of any expressed emotion. The entire process is carried out on the Hadoop architecture using the MapReduce paradigm. Findings: Experiments were conducted on a Hadoop cluster, with inputs passed from a client node connected to the cluster. The MapReduce programs were executed in six phases, each phase performing a single task in its map and reduce steps. The ROC plot exhibits excellent accuracy, with most of the points clustered in the top left region and some approaching 100% effectiveness. The PR plots show similarly strong results, with high positive retrieval rates. Incorporating emoticons plays a major role in increasing the efficiency of this approach. Novelty/Improvement: This approach uses Hadoop-based implementations involving Map and Reduce operations. It provides data scalability and improves the efficiency of the results within acceptable time limits.
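The abstract does not give the details of the six MapReduce phases, so the following is only a minimal single-process sketch of the general idea, assuming a simple additive scoring scheme: a map step emits a polarity score for every recognized word or emoticon in a message, and a reduce step sums the scores per message and assigns a label. The lexicons, the function names (`map_message`, `reduce_scores`, `classify`) and the scoring rule are all hypothetical and are not taken from the paper.

```python
from collections import defaultdict

# Hypothetical lexicons, for illustration only; not from the paper.
EMOTICON_POLARITY = {":)": 1, ":-)": 1, ":D": 1, ":(": -1, ":-(": -1}
WORD_POLARITY = {"good": 1, "great": 1, "love": 1, "bad": -1, "poor": -1, "hate": -1}


def map_message(message_id, text):
    """Map step: emit (message_id, score) for each token with a known polarity."""
    # Emit a zero first so that messages without any known token still reach the reducer.
    yield message_id, 0
    for token in text.split():
        if token in EMOTICON_POLARITY:
            # Emoticons are matched verbatim, before any punctuation stripping.
            yield message_id, EMOTICON_POLARITY[token]
        else:
            word = token.lower().strip(".,!?")
            if word in WORD_POLARITY:
                yield message_id, WORD_POLARITY[word]


def reduce_scores(pairs):
    """Reduce step: sum the scores per message and turn each total into a label."""
    totals = defaultdict(int)
    for message_id, score in pairs:
        totals[message_id] += score
    labels = {}
    for message_id, total in totals.items():
        if total > 0:
            labels[message_id] = "positive"
        elif total < 0:
            labels[message_id] = "negative"
        else:
            labels[message_id] = "neutral"
    return labels


def classify(messages):
    """Run the map step over all messages, then the reduce step, in one process."""
    pairs = []
    for message_id, text in messages.items():
        pairs.extend(map_message(message_id, text))
    return reduce_scores(pairs)


if __name__ == "__main__":
    sample = {
        "m1": "Great phone :)",
        "m2": "Battery is bad :(",
        "m3": "Arrived today",
    }
    print(classify(sample))
```

On a real cluster the map and reduce functions would run as separate distributed tasks, with the framework grouping the emitted pairs by key between them; the in-memory list used here only stands in for that shuffle step.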

Keywords

Emoticons, Hadoop, Map Reduce, Polarity Identification, Sentiment Analysis, Social Networking Data Processing.

Authors

V. Jude Nirmal
Department of IT, St. Joseph’s College (Autonomous), College Road, Tiruchirappalli - 620002, Tamil Nadu, India
D. I. George Amalarethinam
Jamal Mohamed College, 7, Race Course Road, Khajanagar, Tiruchirappalli – 620020, Tamil Nadu, India




DOI: https://doi.org/10.17485/ijst%2F2016%2Fv9i33%2F128213