The PDF file you selected should load here if your Web browser has a PDF reader plug-in installed (for example, a recent version of Adobe Acrobat Reader).

If you would like more information about how to print, save, and work with PDFs, Highwire Press provides a helpful Frequently Asked Questions about PDFs.

Alternatively, you can download the PDF file directly to your computer, from where it can be opened using a PDF reader. To download the PDF, click the Download link above.

Fullscreen Fullscreen Off


Background/Objectives: The aim of this paper was to explore various information retrieval approaches applied for detecting duplicate bug reports. Methods/Statistical Analysis: We have determined Data pre-processing, Textual Analysis, Similarity measurement, classification and clustering methods applied on bug reports of various open source browsers for detecting duplicate bug reports. Findings: Information Retrieval Approaches provide an efficient way of detecting duplicate bug reports. The result of our study states that Recall and precision are the two important aspects of performance analysis of duplicate bug detection methods. We can achieve a precision of 99% and recall of 98%, by using both textual and categorical similarity measurements. Application/Improvements: As a consequence it decreases the time and effort spent on fixing the same bug repeatedly.

Keywords

Duplicate bug, Information Retrieval, Precision, Recall, Similarity Measures.
User