Scalability issues for high performance digital libraries on the World Wide Web
- 23 December 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
We investigate scalability issues involved in developing high performance digital library systems. Our observations and solutions are based on our experience with the Alexandria Digital Library (ADL) testbed under development at UCSB. The current ADL system provides on-line browsing and processing of digitized maps and other geo-spatially mapped data via the World Wide Web (WWW). A primary activity of the ADL system involves computation and disk I/O for accessing compressed multi-resolution images with hierarchical data structures, as well as other duties such as supporting database queries and on-the-fly HTML page generation. Providing multi-resolution image browsing services can reduce network traffic but impose some additional cost at the server. We discuss the necessity of having a multi-processor DL server to match potentially huge demands in simultaneous access requests from the Internet. We have developed a distributed scheduling system for processing DL requests, which actively monitors the usages of CPU, I/O channels and the interconnection network to effectively distribute work across processing units to exploit task and I/O parallelism. We present an experimental study on the performance of our scheme in addressing the scalability issues arising in ADL wavelet processing and file retrieval. Our results indicate that the system delivers good performance on these types of tasks.Keywords
This publication has 5 references indexed in Scilit:
- SWEB: towards a scalable World Wide Web server on multicomputersPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Alexandria digital library: Rapid prototype and metadata schemaPublished by Springer Nature ,1996
- Alexandria Digital LibraryCommunications of the ACM, 1995
- A scalable HTTP server: The NCSA prototypeComputer Networks and ISDN Systems, 1994
- Prediction-based dynamic load-sharing heuristicsIEEE Transactions on Parallel and Distributed Systems, 1993