An architecture for distributing the computation of software clustering algorithms

13 November 2002

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

p. 181-190
https://doi.org/10.1109/wicsa.2001.948427

Abstract

Collections of general purpose networked workstations offer processing capability that often rivals or exceeds supercomputers. Since networked workstations are readily available in most organizations, they provide an economic and scalable alternative to parallel machines. The authors discuss how individual nodes in a computer network can be used as a collection of connected processing elements to improve the performance of a software engineering tool that we developed. Our tool, called Bunch, automatically clusters the structure of software systems into a hierarchy of subsystems. Clustering helps developers understand complex systems by providing them with high-level abstract (clustered) views of the software structure. The algorithms used by Bunch are computationally intensive and, hence, we would like to improve our tool's performance in order to cluster very large systems. The paper describes how we designed and implemented a distributed version of Bunch, which is useful for clustering large systems.

Keywords

This publication has 12 references indexed in Scilit:

Chava: reverse engineering and tracking of Java applets
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Automatic clustering of software systems using a genetic algorithm
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Using automatic clustering to produce high-level system organizations of source code
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Extracting concepts from file names; a new file clustering criterion
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Data intensive distributed computing: A medical application example
Published by Springer Nature ,1999
Programming languages for distributed applications
New Generation Computing, 1998
Quantifying the Performance Differences between PVM and TreadMarks
Journal of Parallel and Distributed Computing, 1997
TreadMarks: shared memory computing on networks of workstations
Computer, 1996
Recent Enhancements To Pvm
The International Journal of Supercomputer Applications and High Performance Computing, 1995
A case for NOW (Networks of Workstations)
IEEE Micro, 1995