Using data fusion and web mining to support feature location in software

Meghan Revelle, Bogdan Dit, Denys Poshyvanyk

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

86 Scopus citations

Abstract

Data fusion is the process of integrating multiple sources of information such that their combination yields better results than if the data sources are used individually. This paper applies the idea of data fusion to feature location, the process of identifying the source code that implements specific functionality in software. A data fusion model for feature location is presented which defines new feature location techniques based on combining information from textual, dynamic, and web mining analyses applied to software. A novel contribution of the proposed model is the use of advanced web mining algorithms to analyze execution information during feature location. The results of an extensive evaluation indicate that the new feature location techniques based on web mining improve the effectiveness of existing approaches by as much as 62%.

Original languageEnglish
Title of host publication18th IEEE International Conference on Program Comprehension, ICPC 2010
Pages14-23
Number of pages10
DOIs
StatePublished - 2010
Event18th IEEE International Conference on Program Comprehension, ICPC 2010 - Braga, Minho, Portugal
Duration: 30 Jun 20102 Jul 2010

Publication series

NameIEEE International Conference on Program Comprehension

Conference

Conference18th IEEE International Conference on Program Comprehension, ICPC 2010
Country/TerritoryPortugal
CityBraga, Minho
Period30/06/102/07/10

Keywords

  • Data fusion
  • Dynamic analysis
  • Feature location
  • Information retrieval
  • Web mining

Fingerprint

Dive into the research topics of 'Using data fusion and web mining to support feature location in software'. Together they form a unique fingerprint.

Cite this