Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Rich Semantic Sentiment Analysis Using Lexicon Based Approach


Affiliations
1 Department of Information Technology, G.H. Patel College of Engineering and Technology, India
     

   Subscribe/Renew Journal


Web is a huge repository of information, and a massive amount of data is generated everyday on online platforms. Information, can be facts and opinions, facts are objective statements about an event, and opinions are subjective statements that reflect the sentiments of a person towards an event. Research on sentiment analysis has increased tremendously in recent years due to its wide variety of applications. To analyze sentiments, certain methods have been proposed, which can be broadly categorized as supervised machine learning and lexicon based approaches. Supervised machine learning methods are giving high accuracy but these methods need training data and are domain dependent, while lexicon-based methods are not domain dependent. Although, building of lexicon is costly, but once constructed, it can be applied for a wide variety of domains, but still lexicon based methods are restricted to their dictionaries and are full-dependent on the presence of terms that explicitly reflect the sentiment, while in many cases the sentiment of a term is implicitly reflected by the semantics of its context. Therefore, we've proposed context aware, semantically rich (conceptual&contextual semantics) lexicon-based method which is different from traditional lexicon-based methods that assigns sentiment score and strength to terms in a dynamic way, and outperforms baselines.

Keywords

Sentiment Analysis, Lexicon, Supervised Machine Learning, Contextual And Conceptual Semantics.
Subscription Login to verify subscription
User
Notifications
Font Size

  • Orestes Appel et al., “A Hybrid Approach to the Sentiment Analysis Problem at the Sentence Level”, Knowledge-Based Systems, Vol. 108, pp. 110-124, 2016.
  • Bing Liu, “Sentiment Analysis and Opinion Mining, Synthesis Lectures on Human Language Technologies”, Morgan and Claypool Publishers, 2012.
  • Bing Liu, “Sentiment Analysis and Subjectivity”, Handbook of Natural Language Processing, 2010
  • Erik Cambria, Bjorn Schuller, Yunqing Xia and Catherine Havasi, “New Avenues in Opinion Mining and Sentiment Analysis”, IEEE Intelligent Systems, Vol. 28, No. 2, pp. 1521, 2013.
  • Mike Thelwall, Kevan Buckley, Georgios Paltoglou, Di Cai and Arvid Kappas, “Sentiment Strength Detection in Short Informal Text”, Journal of the American Society for Information Science and Technology, Vol. 61, No. 12, pp.2544-2558, 2010.
  • Noura Farra, Elie Challita, Rawad Abou Assi and Hazem Hajj, “Sentence-Level and Document-Level Sentiment Mining for Arabic Texts”, Proceedings of IEEE International Conference on Data Mining Workshops, pp.1114-1119, 2010.
  • A. Agarwal, F. Biadsy and K.R. McKeown, “Contextual Phrase-Level Polarity Analysis using Lexical Affect Scoring and Syntactic N-Grams”, Proceedings of 12th Conference of the European Chapter of the Association for Computational Linguistics, pp. 24-32, 2009.
  • Robert Remus and Christian Hanig, “Towards WellGrounded Phrase-Level Polarity Analysis”, Proceedings of 12th International Conference on Computational Linguistics and Intelligent Text Processing, pp. 380-392, 2011.
  • Alec Go, Richa Bhayani and Lei Huang, “Twitter Sentiment Classification using Distant Supervision”, CS224N Project Report, Stanford, pp. 1-12, 2009.
  • Albert Bifet and Eibe Frank. “Sentiment Knowledge Discovery in Twitter Streaming Data”, Proceedings of International Conference on Discovery Science, pp. 1-15, 2010.
  • Alexander Pak and Patrick Paroubek, “Twitter as a Corpus for Sentiment Analysis and Opinion Mining”, Proceedings of LREC Conference, Vol. 10, pp. 1320-1326, 2010.
  • Apoorv Agarwal.et al., “Sentiment Analysis of Twitter Data”, Proceedings of Workshop on Languages in Social Media, pp. 30-38, 2011.
  • Efthymios Kouloumpis, Theresa Wilson and Johanna D. Moore, “Twitter Sentiment Analysis: The Good the Bad and the OMG!”, Proceedings of 5th International AAAI Conference on Weblogs and Social Media, pp. 538-541, 2011.
  • Hassan Saif, Yulan He and Harith Alani, “Semantic sentiment analysis of twitter”, Proceedings of International Semantic Web Conference, 2012.
  • Stefano Baccianella, Andrea Esuli and Fabrizio Sebastiani, “SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining”, Proceedings of LREC Conference, Vol. 10, pp. 2200-2204, 2010.
  • Theresa Wilson, Janyce Wiebe and Paul Hoffmann. “Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis”, Proceedings of Conference on Human Language Technology and Empirical Methods in Natural Language Processing, pp. 347-354, 2005.
  • James W. Pennebaker, Matthias R. Mehl, and Kate G. Niederhoffer, “Psychological Aspects of Natural Language use: Our Words, Our Selves”, Annual Review of Psychology, Vol. 54, No. 1, pp. 547-577, 2003.
  • L. Zhang, R. Ghosh, M. Dekhil, M. Hsu, and B. Liu, “Combining Lexicon-based and Learning based Methods for Twitter Sentiment Analysis”, Technical Report, HP Laboratories, pp .1-7, 2011.
  • Andrius Mudinas, Dell Zhang, and Mark Levene, “Combining Lexicon and Learning based Approaches for Concept-Level Sentiment Analysis”, Proceedings of 1st International Workshop on Issues of Sentiment Discovery and Opinion Mining, pp. 1-7, 2012.
  • Ji Fang and Bi Chen, “Incorporating Lexicon Knowledge into SVM Learning to Improve Sentiment Classification”, Proceedings of Workshop on Sentiment Analysis where AI meets Psychology, pp. 94-100, 2011.
  • Hassan Saif, Yulan He and Harith Alani, “Alleviating Data Sparsity for Twitter Sentiment Analysis”, Proceedings 2nd Workshop on Making Sense of Microposts, pp. 2-9, 2012.
  • Hiroya Takamura, Takashi Inui and Manabu Okumura, “Extracting Semantic Orientations of Words using Spin Model”, Proceedings of 43rd Annual Meeting on Association for Computational Linguistics. Association for Computational Linguistics, pp. 133-140, 2005.
  • Michael Speriosu et al., “Twitter Polarity Classification with Label Propagation over Lexical Links and the Follower Graph”, Proceedings of 1st Workshop on Unsupervised Learning in NLP Association for Computational Linguistics, pp. 53-63, 2011.
  • Anthony Aue and Michael Gamon, “Customizing Sentiment Classifiers to New Domains: A Case Study”, Proceedings of Recent Advances in Natural Language Processing, Vol. 1. No. 3, pp. 1-7, 2005.
  • Hassan Saif, Yulan He, Miriam Fernandez and Harith Alani, “Contextual Semantics for Sentiment Analysis of Twitter”, Information Processing and Management, Vol. 52, No. 1, pp. 5-19, 2016.
  • Kyoungok Kim and Jaewook Lee, “Sentiment Visualization and Classification via Semi-Supervised NonlinearDimensionality Reduction”, Pattern Recognition, Vol. 47, No. 2, pp. 758-768, 2014.
  • Arnd Christian Konig and Eric Brill, “Reducing the Human Overhead in Text Categorization”, Proceedings of 12th ACM International Conference on Knowledge Discovery and Data Mining, pp. 598-603, 2006.
  • Bing Liu and Lei Zhang, “A Survey of Opinion Mining and Sentiment Analysis”, Mining Text Data, pp. 415-463, 2012.
  • Jose M.Chenlo and David E. Losada, “An Empirical Study of Sentence Features for Subjectivity and Polarity Classification”, Information Sciences, Vol. 280, pp. 275288, 2014.
  • Andrea Esuli and Fabrizio Sebastiani, “Sentiwordnet: A Publicly Available Lexical Resource for Opinion Mining”, Proceedings of LREC Conference, pp. 1-5, 2006.
  • Maite Taboada et al., “Lexicon-based methods for Sentiment Analysis”, Computational Linguistics, Vol. 37, No. 2, pp. 267-307, 2011.
  • Erik Cambria, “An Introduction to Concept-Level Sentiment Analysis”, Proceedings of Mexican International Conference on Artificial Intelligence, pp. 1-6, 2013.
  • Peter D. Turney, “Thumbs Up or Thumbs Down?: Semantic Orientation Applied to Unsupervised Classification of Reviews”, Proceedings of 40th Annual Meeting on Association for Computational Linguistics, pp. 417-424, 2002.
  • Peter D.Turney and Michael L. Littman, “Measuring Praise and Criticism: Inference of Semantic Orientation from Association”, ACM Transactions on Information Systems, Vol. 21, No. 4, pp. 315-346, 2003.
  • Rodrigo Moraes, Joao Francisco Valiati and Wilson P. Gaviao Neto, “Document-Level Sentiment Classification: An Empirical Comparison between SVM and ANN”, Expert Systems with Applications, Vol. 40, No. 2, pp. 621-633, 2013.
  • Tao Xu, Qinke Peng and Yinzhao Cheng, “Identifying the Semantic Orientation of Terms using S-HAL for Sentiment Analysis”, Knowledge-Based Systems, Vol. 35, pp. 279-289, 2012.
  • Xiaowen Ding, Bing Liu and Philip S. Yu, “A Holistic Lexicon-based Approach to Opinion Mining”, Proceedings of International Conference on Web Search and Data Mining, pp. 231-240, 2008.
  • Erik Cambria, Catherine Havasi and Amir Hussain, “SenticNet 2: A Semantic and Affective Resource for Opinion Mining and Sentiment Analysis”, Proceedings of FLAIRS Conference, pp. 202-207, 2012.
  • Lisette Garcia Moya, Henry Anaya Sanchez and Rafael Berlanga Llavori, “Retrieving product Features and Opinions from Customer Reviews”, IEEE Intelligent Systems, Vol. 28, No. 3, pp. 19-27, 2013.
  • Kushal Dave, Steve Lawrence and David M. Pennock, “Mining the Peanut Gallery: Opinion Extraction and Semantic Classification of Product Reviews”, Proceedings of 12th International Conference on World Wide Web, pp.519-528, 2003.
  • J. Yi, T. Nasukawa, R. Bunescu and W. Niblack, “Sentiment Analyzer: Extracting Sentiments about a given Topic using Natural Language Processing Techniques”, Proceedings of 3rd IEEE International Conference on Data Mining, pp. 1-8, 2003.
  • Satoshi Morinaga et al., “Mining Product Reputations on the Web”, Proceedings of 8th ACM International Conference on Knowledge Discovery and Data Mining, pp. 341-349, 2002.
  • Simon Tong and Daphne Koller, “Support Vector Machine Active Learning with Applications to Text Classification”, Journal of Machine Learning Research, Vol. 2, pp. 45-66, 2001.
  • Janyce Wiebe, “Learning Subjective Adjectives from Corpora”, Proceedings of International Conference on American Association for Artificial Intelligence, pp. 1-6, 2000.
  • Fabrizio Sebastiani, “Machine Learning in Automated Text Categorization”, ACM Computing Surveys, Vol. 34, No. 1, pp. 1-47, 2002.
  • Rudy Prabowo and Mike Thelwall, “Sentiment Analysis: A Combined Approach”, Journal of Informetrics, Vol. 3, No.2, pp. 143-157, 2009.
  • Arman Khadjeh Nassirtoussi et al., “Text Mining for Market Prediction: A Systematic Review”, Expert Systems with Applications, Vol. 41, No. 16, pp. 7653-7670, 2014.
  • Basant Agarwal and Namita Mittal, “Prominent Feature Extraction for Review Analysis: An Empirical Study”, Journal of Experimental and Theoretical Artificial Intelligence, Vol. 28, No. 3, pp. 485-498, 2016.
  • Asma Musabah Alkalbani et al., “Sentiment Analysis and Classification for Software as a Service Reviews”, Proceedings of IEEE 30th International Conference on Advanced Information Networking and Applications, pp. 5358, 2016.
  • Peter D. Turney and Patrick Pantel, “From Frequency to Meaning: Vector Space Models of Semantics”, Journal of Artificial Intelligence Research, Vol. 37, No. 1, pp. 141-188, 2010.

Abstract Views: 253

PDF Views: 3




  • Rich Semantic Sentiment Analysis Using Lexicon Based Approach

Abstract Views: 253  |  PDF Views: 3

Authors

Hedayatullah Lodin
Department of Information Technology, G.H. Patel College of Engineering and Technology, India
Prem Balani
Department of Information Technology, G.H. Patel College of Engineering and Technology, India

Abstract


Web is a huge repository of information, and a massive amount of data is generated everyday on online platforms. Information, can be facts and opinions, facts are objective statements about an event, and opinions are subjective statements that reflect the sentiments of a person towards an event. Research on sentiment analysis has increased tremendously in recent years due to its wide variety of applications. To analyze sentiments, certain methods have been proposed, which can be broadly categorized as supervised machine learning and lexicon based approaches. Supervised machine learning methods are giving high accuracy but these methods need training data and are domain dependent, while lexicon-based methods are not domain dependent. Although, building of lexicon is costly, but once constructed, it can be applied for a wide variety of domains, but still lexicon based methods are restricted to their dictionaries and are full-dependent on the presence of terms that explicitly reflect the sentiment, while in many cases the sentiment of a term is implicitly reflected by the semantics of its context. Therefore, we've proposed context aware, semantically rich (conceptual&contextual semantics) lexicon-based method which is different from traditional lexicon-based methods that assigns sentiment score and strength to terms in a dynamic way, and outperforms baselines.

Keywords


Sentiment Analysis, Lexicon, Supervised Machine Learning, Contextual And Conceptual Semantics.

References