TY - JOUR
T1 - Integrating information retrieval, execution and link analysis algorithms to improve feature location in software
AU - Dit, Bogdan
AU - Revelle, Meghan
AU - Poshyvanyk, Denys
PY - 2013/4
Y1 - 2013/4
N2 - Data fusion is the process of integrating multiple sources of information such that their combination yields better results than if the data sources are used individually. This paper applies the idea of data fusion to feature location, the process of identifying the source code that implements specific functionality in software. A data fusion model for feature location is presented which defines new feature location techniques based on combining information from textual, dynamic, and web mining or link analyses algorithms applied to software. A novel contribution of the proposed model is the use of advanced web mining algorithms to analyze execution information during feature location. The results of an extensive evaluation on three Java systems indicate that the new feature location techniques based on web mining improve the effectiveness of existing approaches by as much as 87%.
AB - Data fusion is the process of integrating multiple sources of information such that their combination yields better results than if the data sources are used individually. This paper applies the idea of data fusion to feature location, the process of identifying the source code that implements specific functionality in software. A data fusion model for feature location is presented which defines new feature location techniques based on combining information from textual, dynamic, and web mining or link analyses algorithms applied to software. A novel contribution of the proposed model is the use of advanced web mining algorithms to analyze execution information during feature location. The results of an extensive evaluation on three Java systems indicate that the new feature location techniques based on web mining improve the effectiveness of existing approaches by as much as 87%.
KW - Concept location
KW - Feature identification
KW - Information retrieval
KW - Program comprehension
KW - Software evolution and maintenance
KW - Web mining
UR - http://www.scopus.com/inward/record.url?scp=84880701407&partnerID=8YFLogxK
U2 - 10.1007/s10664-011-9194-4
DO - 10.1007/s10664-011-9194-4
M3 - Article
AN - SCOPUS:84880701407
SN - 1382-3256
VL - 18
SP - 277
EP - 309
JO - Empirical Software Engineering
JF - Empirical Software Engineering
IS - 2
ER -