ETL is the process of preparing data from operational sources for the data warehouse. It consists of extracting, transforming, and loading, a series of steps performed before the data is published in the data warehouse. In other words, ETL (extract, transform, load) is the data processing phase between the incoming source data and the data warehouse. The purpose of ETL is to collect, filter, manipulate, and combine relevant data from various sources so that it can be stored in the data warehouse. ETL can also be used to integrate data with existing systems. The result of the ETL process is data that meets the data warehouse criteria: it is historical, integrated, summarized, and static, and it has a structure designed for analysis. The ETL process consists of three stages:
Extract
The first step of the ETL process is pulling data from one or more operational systems that act as data sources (usually the OLTP system, but the data can also come from sources outside the system's database). Most data warehouse projects combine data from different sources. Extraction also involves parsing and cleaning the extracted data to obtain the desired pattern or structure.
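As a rough illustration of extracting from more than one source, the sketch below pulls rows from an in-memory SQLite database standing in for the OLTP system and from a CSV export standing in for a source outside the database. The table name `orders`, its columns, and the function names are assumptions made up for this example.

```python
import csv
import io
import sqlite3


def extract_from_oltp(conn):
    """Pull raw order rows from a (hypothetical) operational table."""
    cur = conn.execute("SELECT id, customer, amount FROM orders ORDER BY id")
    return [
        {"id": r[0], "customer": r[1], "amount": r[2], "source": "oltp"}
        for r in cur
    ]


def extract_from_csv(text):
    """Pull raw rows from a CSV export that lives outside the database."""
    reader = csv.DictReader(io.StringIO(text))
    # CSV values arrive as strings; type conversion is left to the transform step.
    return [dict(row, source="csv") for row in reader]


# Stand-in for the OLTP system (assumed schema, for illustration only).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, customer TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(1, "alice", 10.5), (2, "bob", 20.0)],
)

# Stand-in for a file-based source outside the database.
csv_export = "id,customer,amount\n3,carol,7.25\n"

raw_rows = extract_from_oltp(conn) + extract_from_csv(csv_export)
```

Each row carries a `source` field so that later steps can tell where a record came from.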
Transform
The transform step cleans and reshapes the data obtained in the extract step so that it matches the structure of the data warehouse or data mart.
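A minimal sketch of such a cleanup, assuming a target structure with the fields `order_id`, `customer`, and `amount` (a hypothetical schema chosen for this example). Rows that cannot be converted are simply dropped here; a real pipeline might instead log or quarantine them.

```python
def transform(raw_rows):
    """Clean raw rows so they fit an assumed warehouse structure:
    order_id (int), customer (trimmed, title case), amount (float)."""
    cleaned = []
    for row in raw_rows:
        customer = str(row.get("customer", "")).strip()
        if not customer:
            continue  # drop rows without a customer name
        try:
            order_id = int(row["id"])
            amount = round(float(row["amount"]), 2)
        except (KeyError, TypeError, ValueError):
            continue  # drop rows whose id or amount cannot be parsed
        cleaned.append(
            {"order_id": order_id, "customer": customer.title(), "amount": amount}
        )
    return cleaned


raw_rows = [
    {"id": 1, "customer": " alice ", "amount": 10.5},
    {"id": "3", "customer": "carol", "amount": "7.25"},  # strings, e.g. from a CSV
    {"id": "x", "customer": "dave", "amount": "1"},      # unparseable id
    {"id": 4, "customer": "", "amount": 2},               # missing customer
]
clean_rows = transform(raw_rows)
```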
Load
The load phase inserts the data into the final target, i.e. the data warehouse. How often and over what range the existing data is replaced or supplemented depends on the design of the data warehouse, which is decided when the information needs are analyzed.
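The difference between replacing and supplementing the data can be sketched as follows, using an in-memory SQLite database as a stand-in for the warehouse. The table name `fact_orders`, the `mode` parameter, and the sample rows are assumptions for illustration, not part of any particular tool.

```python
import sqlite3


def load(conn, rows, mode="append"):
    """Write rows into a hypothetical fact_orders table.

    mode="replace" clears the table first (full refresh);
    mode="append" adds the rows to what is already there.
    """
    if mode not in ("append", "replace"):
        raise ValueError("mode must be 'append' or 'replace'")
    conn.execute(
        "CREATE TABLE IF NOT EXISTS fact_orders "
        "(order_id INTEGER, customer TEXT, amount REAL)"
    )
    with conn:  # commit on success, roll back on error
        if mode == "replace":
            conn.execute("DELETE FROM fact_orders")
        conn.executemany(
            "INSERT INTO fact_orders (order_id, customer, amount) "
            "VALUES (:order_id, :customer, :amount)",
            rows,
        )


def count_rows(conn):
    return conn.execute("SELECT COUNT(*) FROM fact_orders").fetchone()[0]


warehouse = sqlite3.connect(":memory:")

# Initial full load, then a later run that supplements the existing data.
load(warehouse, [
    {"order_id": 1, "customer": "Alice", "amount": 10.5},
    {"order_id": 2, "customer": "Bob", "amount": 20.0},
], mode="replace")
load(warehouse, [{"order_id": 3, "customer": "Carol", "amount": 7.25}], mode="append")
count_after_append = count_rows(warehouse)

# A full refresh discards the previous contents.
load(warehouse, [{"order_id": 4, "customer": "Dave", "amount": 1.0}], mode="replace")
count_after_replace = count_rows(warehouse)
```

Which mode fits depends on the warehouse design: a full refresh is simpler but loses history, while appending keeps the historical data that a warehouse is usually expected to hold.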
Best Open Source ETL Tools
- CloverETL
- HPCC
- Kettle
- GeoKettle
- Talend Studio