Preprocessing CVS data for fine-grained analysis
- 1 January 2004
- proceedings article
- Published by Institution of Engineering and Technology (IET)
- Vol. 2004, 2-6
- https://doi.org/10.1049/ic:20040466
Abstract
All analyses of version archives have one phase in common: the preprocessing of data. Preprocessing has a direct impact on the quality of the results returned by an analysis. In this paper we discuss four essential preprocessing tasks necessary for a fine-grained analysis of CVS archives: (a) data extraction, (b) transaction recovery, (c) mapping of changes to fine-grained entities, and (d) data cleaning. We formalize the concept of sliding time windows and show how commit mails can relate revisions to transactions. We also present two approaches that map changes to the affected building blocks of a file, e.g., functions or sections.
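The sliding time window idea for transaction recovery can be illustrated with a minimal sketch. It is not the paper's formalization: it assumes that revisions belong to one transaction when they share author and log message and each follows the previous one within a maximum gap. The `Revision` record, the function name `recover_transactions`, and the 200-second default for `max_gap` are hypothetical choices made for illustration.

```python
from collections import defaultdict
from typing import NamedTuple


class Revision(NamedTuple):
    """One file revision as it might be extracted from a CVS log (hypothetical record)."""
    path: str
    author: str
    message: str
    timestamp: int  # seconds since the epoch


def recover_transactions(revisions, max_gap=200):
    """Group revisions into transactions with a sliding time window.

    Assumption: two revisions belong to the same transaction if they have the
    same author and log message and the later one was committed at most
    `max_gap` seconds after the previous revision in that transaction.
    The window slides: it is measured from the most recent revision,
    not from the first one.
    """
    # Revisions with a different author or message can never share a transaction.
    groups = defaultdict(list)
    for rev in revisions:
        groups[(rev.author, rev.message)].append(rev)

    transactions = []
    for revs in groups.values():
        revs.sort(key=lambda r: r.timestamp)
        current = [revs[0]]
        for rev in revs[1:]:
            if rev.timestamp - current[-1].timestamp <= max_gap:
                current.append(rev)           # still inside the sliding window
            else:
                transactions.append(current)  # gap too large: close the transaction
                current = [rev]
        transactions.append(current)

    # Return transactions in chronological order of their first revision.
    transactions.sort(key=lambda t: t[0].timestamp)
    return transactions
```

With a fixed window, a long commit whose revisions are spread over more than `max_gap` seconds in total would be split; measuring the gap from the latest revision keeps such a commit together as long as no single pause exceeds the limit.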