
Automatic Sentiment Lexicon Construction for Punjabi


Affiliations
1 Department of Computer Science, Punjabi University, Patiala, India
 

Sentiment analysis has become a revenue-generation model, and the backbone of any sentiment analysis system is its sentiment lexicon. Using an available sentiment lexicon to develop a new sentiment lexicon for another language is an interesting area of study and the focus of this paper. Available resources, namely Hindi SentiWordNet, English SentiWordNet and Punjabi WordNet, are used to prepare a sentiment lexicon for the Punjabi language in which each entry carries its positive and negative scores and its ID. The prepared dataset contains only unique entries, which improves the reliability of the lexicon, and it is expanded by recursively collecting synonyms. The experimental results show that the developed Punjabi Sentiment Lexicon helps improve sentiment analysis and opinion mining tasks.

Keywords

SentiWordNet, Punjabi Sentiment Lexicon, Punjabi WordNet, Sentiment Analysis, English SentiWordNet, Hindi SentiWordNet, Dictionary.
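
The recursive synonym expansion mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function name `expand_lexicon`, the placeholder words (`w_good`, `w_fine`, and so on), the scores, and the in-memory synonym map standing in for Punjabi WordNet lookups are all assumptions made for illustration. The sketch also assumes that a synonym simply inherits the scores of the entry it was reached from and that the first score assigned to a word is kept, which is one simple way to keep entries unique.

```python
from collections import deque


def expand_lexicon(seed, synonyms):
    """Propagate (positive, negative) scores from seed words to their synonyms.

    seed:     dict mapping word -> (pos_score, neg_score)
    synonyms: dict mapping word -> list of synonyms
              (hypothetical stand-in for a WordNet synset lookup)
    """
    lexicon = dict(seed)   # one score pair per word keeps entries unique
    queue = deque(seed)    # words whose synonyms have not been visited yet
    while queue:
        word = queue.popleft()
        for syn in synonyms.get(word, ()):
            if syn not in lexicon:            # skip words already in the lexicon
                lexicon[syn] = lexicon[word]  # assumption: synonym inherits scores
                queue.append(syn)             # its own synonyms are visited later
    return lexicon


# Placeholder data, not taken from any real SentiWordNet or WordNet file.
seed = {"w_good": (0.75, 0.0), "w_bad": (0.0, 0.625)}
synonyms = {
    "w_good": ["w_fine", "w_nice"],
    "w_fine": ["w_good", "w_pleasant"],
    "w_bad": ["w_poor"],
}
lexicon = expand_lexicon(seed, synonyms)
```

The queue makes the expansion transitive without deep recursion: `w_pleasant` is reached only through `w_fine`, yet it still receives the scores that originated from `w_good`. A real system would likely need a rule for words reachable from both positive and negative seeds; this sketch simply keeps whichever assignment comes first.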



Authors

Diksha Goyal
Department of Computer Science, Punjabi University, Patiala, India
Gurpreet Singh Josan
Department of Computer Science, Punjabi University, Patiala, India

