Open Access Open Access  Restricted Access Subscription Access

Challenges in Morphological Analysis of Tamil Biomedical Texts


Affiliations
1 Department of Computer Science and Engineering, College of Engineering Guindy, Anna University, Chennai - 600025, Tamil Nadu, India
 

The purpose of a Morphological analyser is to explore the internal structure of the word and retrieve grammatical features and properties of a morphologically inflected word. Breaking down these amalgamated words is in itself a challenging job in the field of Natural Language Processing. The complexity further increases when the analysis is done on a more ancient and morphologically rich dataset like Tamil Siddha Medicinal documents. In this paper we list the different challenges we faced when trying to explore the syntactic and semantic features of Tamil Siddha texts for building a Tamil Biomedical NER. We also highlight the different fine tuning that was carried out on the analyser to overcome some of the difficulties and possible changes that can be done to improve the accuracy of the analyser in the given domain.

Keywords

Challenges, Morphological Analyzer, POS Tagging, Tamil Biomedicine.
User

Abstract Views: 163

PDF Views: 0




  • Challenges in Morphological Analysis of Tamil Biomedical Texts

Abstract Views: 163  |  PDF Views: 0

Authors

J. Betina Antony
Department of Computer Science and Engineering, College of Engineering Guindy, Anna University, Chennai - 600025, Tamil Nadu, India
G. S. Mahalakshmi
Department of Computer Science and Engineering, College of Engineering Guindy, Anna University, Chennai - 600025, Tamil Nadu, India

Abstract


The purpose of a Morphological analyser is to explore the internal structure of the word and retrieve grammatical features and properties of a morphologically inflected word. Breaking down these amalgamated words is in itself a challenging job in the field of Natural Language Processing. The complexity further increases when the analysis is done on a more ancient and morphologically rich dataset like Tamil Siddha Medicinal documents. In this paper we list the different challenges we faced when trying to explore the syntactic and semantic features of Tamil Siddha texts for building a Tamil Biomedical NER. We also highlight the different fine tuning that was carried out on the analyser to overcome some of the difficulties and possible changes that can be done to improve the accuracy of the analyser in the given domain.

Keywords


Challenges, Morphological Analyzer, POS Tagging, Tamil Biomedicine.



DOI: https://doi.org/10.17485/ijst%2F2015%2Fv8i23%2F136784