A dataset from change history to support evaluation of software maintenance tasks

Bogdan Dit, Andrew Holtzhauer, Denys Poshyvanyk, Huzefa Kagdi

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

33 Scopus citations

Abstract

Approaches that support software maintenance need to be evaluated and compared against existing ones, in order to demonstrate their usefulness in practice. However, oftentimes the lack of well-established sets of benchmarks leads to situations where these approaches are evaluated using different datasets, which results in biased comparisons. In this data paper we describe and make publicly available a set of benchmarks from six Java applications, which can be used in the evaluation of various software engineering (SE) tasks, such as feature location and impact analysis. These datasets consist of textual description of change requests, the locations in the source code where they were implemented, and execution traces. Four of the benchmarks were already used in several SE research papers, and two of them are new. In addition, we describe in detail the methodology used for generating these benchmarks and provide a suite of tools in order to encourage other researchers to validate our datasets and generate new benchmarks for other subject software systems. Our online appendix: http://www.cs.wm.edu/semeru/data/msr13/

Original languageEnglish
Title of host publication2013 10th Working Conference on Mining Software Repositories, MSR 2013 - Proceedings
Pages131-134
Number of pages4
DOIs
StatePublished - 2013
Event10th International Working Conference on Mining Software Repositories, MSR 2013 - San Francisco, CA, United States
Duration: 18 May 201319 May 2013

Publication series

NameIEEE International Working Conference on Mining Software Repositories
ISSN (Print)2160-1852
ISSN (Electronic)2160-1860

Conference

Conference10th International Working Conference on Mining Software Repositories, MSR 2013
Country/TerritoryUnited States
CitySan Francisco, CA
Period18/05/1319/05/13

Keywords

  • Datasets
  • Feature location
  • Generate benchmarks
  • Impact analysis

Fingerprint

Dive into the research topics of 'A dataset from change history to support evaluation of software maintenance tasks'. Together they form a unique fingerprint.

Cite this