Conceptual modeling for ETL processes
Top Cited Papers
- 8 November 2002
- proceedings article
- Published by Association for Computing Machinery (ACM)
Abstract
Extraction-Transformation-Loading (ETL) tools are pieces of software responsible for the extraction of data from several sources, their cleansing, customization and insertion into a data warehouse. In this paper, we focus on the problem of the definition of ETL activities and provide formal foundations for their conceptual representation. The proposed conceptual model is (a) customized for the tracing of inter-attribute relationships and the respective ETL activities in the early stages of a data warehouse project; (b) enriched with a 'palette' of a set of frequently used ETL activities, like the assignment of surrogate keys, the check for null values, etc; and (c) constructed in a customizable and extensible manner, so that the designer can enrich it with his own re-occurring patterns for ETL activitiesKeywords
This publication has 8 references indexed in Scilit:
- Arktos: towards the modeling, design, control and execution of ETL processesInformation Systems, 2001
- Data warehouse process managementInformation Systems, 2001
- Efficient resumption of interrupted warehouse loadsPublished by Association for Computing Machinery (ACM) ,2000
- AJAXPublished by Association for Computing Machinery (ACM) ,2000
- Fundamentals of Data WarehousesPublished by Springer Nature ,2000
- Architecture and quality in data warehouses: An extended repository approachInformation Systems, 1999
- A methodological framework for data warehouse designPublished by Association for Computing Machinery (ACM) ,1998
- THE DIMENSIONAL FACT MODEL: A CONCEPTUAL MODEL FOR DATA WAREHOUSESInternational Journal of Cooperative Information Systems, 1998