Open Access Open Access  Restricted Access Subscription Access

Authorship Identification for Tamil Classical Poem (Mukkoodar Pallu) using C4.5 Algorithm


Affiliations
1 Department of Computer Science and Engineering, SRM University, Kattankulathur, Chennai - 603203, Tamil Nadu, India
 

Objectives: To training classifier based on the features extracted from the poems of Mukkoodar Pallu, authors for various unknown poems can be classified. Methods/Analysis: The classification accuracy by performing classification in the dataset using C4.5 algorithm is illustrated in this paper. Findings: The results of performing classification on dataset that consists of features extracted from the dataset are shown in this paper. Features like number of characters, number of sentences and the classification accuracy when C4.5 algorithm is used is illustrated. Novelty/Improvement: By doing this, authors of various other poems in Tamil language can be identified which will be helpful to the society. Also a generalized authorship identification tool for all regional languages can be achieved.

Keywords

Authorship, Classification, Feature Selection, Tamil Articles.
User

Abstract Views: 175

PDF Views: 0




  • Authorship Identification for Tamil Classical Poem (Mukkoodar Pallu) using C4.5 Algorithm

Abstract Views: 175  |  PDF Views: 0

Authors

A. Pandian
Department of Computer Science and Engineering, SRM University, Kattankulathur, Chennai - 603203, Tamil Nadu, India
V. V. Ramalingam
Department of Computer Science and Engineering, SRM University, Kattankulathur, Chennai - 603203, Tamil Nadu, India
R. P Vishnu Preet
Department of Computer Science and Engineering, SRM University, Kattankulathur, Chennai - 603203, Tamil Nadu, India

Abstract


Objectives: To training classifier based on the features extracted from the poems of Mukkoodar Pallu, authors for various unknown poems can be classified. Methods/Analysis: The classification accuracy by performing classification in the dataset using C4.5 algorithm is illustrated in this paper. Findings: The results of performing classification on dataset that consists of features extracted from the dataset are shown in this paper. Features like number of characters, number of sentences and the classification accuracy when C4.5 algorithm is used is illustrated. Novelty/Improvement: By doing this, authors of various other poems in Tamil language can be identified which will be helpful to the society. Also a generalized authorship identification tool for all regional languages can be achieved.

Keywords


Authorship, Classification, Feature Selection, Tamil Articles.



DOI: https://doi.org/10.17485/ijst%2F2016%2Fv9i47%2F133814