Semi-Supervised Distributional Vector Generation Techniques for Text Classification

Mohammed Abdul Wajeed

Abstract

Text classification has long been a core research area in text mining. Supervised and unsupervised learning are the two well-known paradigms for classification. A relatively novel approach is semi-supervised learning, which lies midway between the two: it refines the learning process by combining a small labeled training set with the large amount of easily available unlabeled data. Semi-supervised learning has variants. In transductive learning, the labeled and unlabeled data are both given before the classifier is built, and the goal is to predict the class labels of the unlabeled data. In inductive learning, the labeled and unlabeled data are both used to build the model, and the goal is to predict the class labels of unseen data. This paper uses transductive learning to classify textual data by considering where words appear in a document. Words appearing in the introductory and concluding parts of a document may play a more important role in classification than words appearing elsewhere, so the approach assigns different weights to words based on their position in the document. Applying this idea when mapping textual data to numerical patterns yields variants of distributional vector generation. To account for large differences in document length, different normalization techniques are applied, giving eight different vectors. The non-parametric and simple-to-implement k-nearest neighbour algorithm is used for free-flow text classification.
The results indicate that semi-supervised text classification can be carried out without loss of classification accuracy when only limited labeled data is available, as the accuracies of the supervised and semi-supervised learning models coincide.
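
The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: the function names, the 20% cut-off for the introductory and concluding parts, the weight of 2.0 for those parts, the L2 normalization, and the cosine similarity are all assumptions chosen for illustration, and only one of the eight vector variants mentioned above is approximated.

```python
import math
from collections import Counter


def position_weighted_vector(tokens, edge_fraction=0.2, edge_weight=2.0, body_weight=1.0):
    """Map tokens to a term-weight vector, weighting the first and last
    `edge_fraction` of the document more heavily (assumed values)."""
    n = len(tokens)
    edge = max(1, int(n * edge_fraction)) if n else 0
    vec = Counter()
    for i, tok in enumerate(tokens):
        in_edge = i < edge or i >= n - edge  # introductory or concluding part
        vec[tok] += edge_weight if in_edge else body_weight
    return dict(vec)


def l2_normalize(vec):
    """Scale a vector to unit length so long and short documents are comparable."""
    norm = math.sqrt(sum(w * w for w in vec.values()))
    return {t: w / norm for t, w in vec.items()} if norm else dict(vec)


def cosine(a, b):
    """Cosine similarity of two unit-length sparse vectors (a dot product)."""
    if len(a) > len(b):
        a, b = b, a
    return sum(w * b.get(t, 0.0) for t, w in a.items())


def knn_predict(labeled_vectors, query, k=3):
    """Majority vote among the k labeled vectors most similar to `query`."""
    ranked = sorted(labeled_vectors, key=lambda item: cosine(item[0], query), reverse=True)
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


def classify_unlabeled(labeled_docs, unlabeled_docs, k=3):
    """Transductive setting: the unlabeled documents are known up front and
    are the only documents whose labels are predicted."""
    def vectorize(text):
        return l2_normalize(position_weighted_vector(text.lower().split()))

    labeled_vectors = [(vectorize(text), label) for text, label in labeled_docs]
    return [knn_predict(labeled_vectors, vectorize(text), k) for text in unlabeled_docs]
```

Calling `classify_unlabeled` with a list of `(text, label)` pairs and a list of unlabeled texts returns one predicted label per unlabeled text. Swapping `l2_normalize` for another normalization, or changing `edge_fraction` and `edge_weight`, would give further vector variants in the spirit of those the abstract mentions.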

Keywords

Distributional Vectors, KNN, Semi-Supervised, Text Classification, Transductive Learning.

Indian Journal of Science and Technology
