Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

An Enhanced Method for Efficient Information Retrieval from Resume Documents Using SPARQL


Affiliations
1 Department of Information Technology, Thiagarajar College of Engineering, Madurai, Tamil Nadu, India
2 Department of Computer Science and Engineering, Thiagarajar College of Engineering, Madurai, Tamil Nadu, India
     

   Subscribe/Renew Journal


It is more important to retrieve information from various types of documents like DOC, HTML, etc that contain vital information to be preserved and used in future. Information retrieval from these documents is mostly the manual effort. Though search algorithms do this retrieval, they may not be accurate as expected by the user. Also, some documents like candidates' resumes cannot be stored into the relational database as such because the number of fields is more. Much of manual efforts are put in use to analyze the various resumes to select the candidates who satisfy the specific criteria. To minimize the manual efforts and to get the results faster, this paper proposes the use of Semantic Web Technology like OWL, RDF and SPARQL to retrieve the information from the documents efficiently. This paper proposes to create the Ontology for the required domain as a first step. Based on the fields or tags in the owl file, the user is given a form to provide his personal and academic details. These data is converted into RDF/XML document. RDF files are retrieved and grouped based on some category. Query text is entered and the relevant records are retrieved from RDF documents using SPARQL. SPARQL is an RDF query language that enhances fast and efficient search of data when compared to other XML query languages like XPATH and XQUERY. Comparison between SPARQL and XPATH in terms of time taken to retrieve records is also analyzed in this paper.

Keywords

RDF, OWL, SPARQL, Document Filter, Information Retrieval.
User
Subscription Login to verify subscription
Notifications
Font Size

Abstract Views: 205

PDF Views: 2




  • An Enhanced Method for Efficient Information Retrieval from Resume Documents Using SPARQL

Abstract Views: 205  |  PDF Views: 2

Authors

P. Sheba Alice
Department of Information Technology, Thiagarajar College of Engineering, Madurai, Tamil Nadu, India
A. M. Abirami
Department of Information Technology, Thiagarajar College of Engineering, Madurai, Tamil Nadu, India
A. Askarunisa
Department of Computer Science and Engineering, Thiagarajar College of Engineering, Madurai, Tamil Nadu, India

Abstract


It is more important to retrieve information from various types of documents like DOC, HTML, etc that contain vital information to be preserved and used in future. Information retrieval from these documents is mostly the manual effort. Though search algorithms do this retrieval, they may not be accurate as expected by the user. Also, some documents like candidates' resumes cannot be stored into the relational database as such because the number of fields is more. Much of manual efforts are put in use to analyze the various resumes to select the candidates who satisfy the specific criteria. To minimize the manual efforts and to get the results faster, this paper proposes the use of Semantic Web Technology like OWL, RDF and SPARQL to retrieve the information from the documents efficiently. This paper proposes to create the Ontology for the required domain as a first step. Based on the fields or tags in the owl file, the user is given a form to provide his personal and academic details. These data is converted into RDF/XML document. RDF files are retrieved and grouped based on some category. Query text is entered and the relevant records are retrieved from RDF documents using SPARQL. SPARQL is an RDF query language that enhances fast and efficient search of data when compared to other XML query languages like XPATH and XQUERY. Comparison between SPARQL and XPATH in terms of time taken to retrieve records is also analyzed in this paper.

Keywords


RDF, OWL, SPARQL, Document Filter, Information Retrieval.