Configuring and assembling information retrieval based solutions for software engineering tasks

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

Information Retrieval (IR) approaches are used to leverage textual or unstructured data generated during the software development process to support various software engineering (SE) tasks (e.g., concept location, traceability link recovery, change impact analysis, etc.). Two of the most important steps for applying IR techniques to support SE tasks are preprocessing the corpus and configuring the IR technique, and these steps can significantly influence the outcome and the amount of effort developers have to spend for these maintenance tasks. We present the use of Genetic Algorithms (GAs) to automatically configure and assemble an IR process to support SE tasks. The approach named IR-GA determines the (near) optimal solution to be used for each step of the IR process without requiring any training. We applied IR-GA on three different SE tasks and the results of the study indicate that IR-GA outperforms approaches previously used in the literature, and that it does not significantly differ from an ideal upper bound that could be achieved by a supervised approach and a combinatorial approach.

Original languageEnglish
Title of host publicationProceedings - 2016 IEEE International Conference on Software Maintenance and Evolution, ICSME 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages641-646
Number of pages6
ISBN (Electronic)9781509038060
DOIs
StatePublished - 12 Jan 2017
Event32nd IEEE International Conference on Software Maintenance and Evolution, ICSME 2016 - Raleigh, United States
Duration: 2 Oct 201610 Oct 2016

Publication series

NameProceedings - 2016 IEEE International Conference on Software Maintenance and Evolution, ICSME 2016

Conference

Conference32nd IEEE International Conference on Software Maintenance and Evolution, ICSME 2016
Country/TerritoryUnited States
CityRaleigh
Period2/10/1610/10/16

Keywords

  • Information retrieval
  • Parametrization
  • Reproducibility of experiments
  • Search-based software engineering
  • Text-based software engineering

Fingerprint

Dive into the research topics of 'Configuring and assembling information retrieval based solutions for software engineering tasks'. Together they form a unique fingerprint.

Cite this