Open Access Open Access  Restricted Access Subscription Access

A Study of Information Retrieval Approaches in Duplicate Bug Detection


Affiliations
1 Department of CSE, GITAM University, Vizag - 530045, Andhra Pradesh, India
 

Background/Objectives: The aim of this paper was to explore various information retrieval approaches applied for detecting duplicate bug reports. Methods/Statistical Analysis: We have determined Data pre-processing, Textual Analysis, Similarity measurement, classification and clustering methods applied on bug reports of various open source browsers for detecting duplicate bug reports. Findings: Information Retrieval Approaches provide an efficient way of detecting duplicate bug reports. The result of our study states that Recall and precision are the two important aspects of performance analysis of duplicate bug detection methods. We can achieve a precision of 99% and recall of 98%, by using both textual and categorical similarity measurements. Application/Improvements: As a consequence it decreases the time and effort spent on fixing the same bug repeatedly.

Keywords

Duplicate bug, Information Retrieval, Precision, Recall, Similarity Measures.
User

Abstract Views: 163

PDF Views: 0




  • A Study of Information Retrieval Approaches in Duplicate Bug Detection

Abstract Views: 163  |  PDF Views: 0

Authors

D. Swapna
Department of CSE, GITAM University, Vizag - 530045, Andhra Pradesh, India
K. Thammi Reddy
Department of CSE, GITAM University, Vizag - 530045, Andhra Pradesh, India

Abstract


Background/Objectives: The aim of this paper was to explore various information retrieval approaches applied for detecting duplicate bug reports. Methods/Statistical Analysis: We have determined Data pre-processing, Textual Analysis, Similarity measurement, classification and clustering methods applied on bug reports of various open source browsers for detecting duplicate bug reports. Findings: Information Retrieval Approaches provide an efficient way of detecting duplicate bug reports. The result of our study states that Recall and precision are the two important aspects of performance analysis of duplicate bug detection methods. We can achieve a precision of 99% and recall of 98%, by using both textual and categorical similarity measurements. Application/Improvements: As a consequence it decreases the time and effort spent on fixing the same bug repeatedly.

Keywords


Duplicate bug, Information Retrieval, Precision, Recall, Similarity Measures.



DOI: https://doi.org/10.17485/ijst%2F2016%2Fv9i43%2F123678