Identification of data cohesive subsystems using data mining techniques
- 27 November 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
The activity of reengineering and maintaining large legacy systems involves the use of design recovery techniques to produce abstractions that facilitate the understanding of the system. We present an approach to design recovery based on data mining. This approach derives from the observation that data mining can discover unsuspected non-trivial relationships among elements in large databases. This observation suggests that data mining can be used to elicit new knowledge about the design of a subject system and that it can be applied to large legacy systems. We describe the ISA methodology which uses data mining to identify data cohesive subsystems. We were able to decompose COBOL systems into subsystems by using this approach. Our experience shows that data mining can identify data cohesive subsystems without any previous knowledge of the subject system. Furthermore, data mining can produce meaningful results regardless of system size making this approach especially appropriate to the analysis of large undocumented systems.Keywords
This publication has 17 references indexed in Scilit:
- Using a relational database for software maintenance: A case studyPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Recast: reverse engineering from COBOL to SSADM specificationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Extracting object-oriented specification from procedurally oriented programsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Recovering business rules from structured analysis specificationsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- The KDD process for extracting useful knowledge from volumes of dataCommunications of the ACM, 1996
- Mining association rules between sets of items in large databasesPublished by Association for Computing Machinery (ACM) ,1993
- Reverse-engineering cobol via formal methodsJournal of Software Maintenance: Research and Practice, 1993
- The C information abstraction systemIEEE Transactions on Software Engineering, 1990
- Recognizing a program's design: a graph-parsing approachIEEE Software, 1990
- Extracting and restructuring the design of large systemsIEEE Software, 1990