ETL - Introduction
ETL stands for Extract, Transform and Load. An ETL tool extracts data from different RDBMS source systems, transforms it (for example by applying calculations or concatenation), and then loads it into a Data Warehouse (DW) system. The data is loaded into the DW system in the form of dimension and fact tables.
Extraction
A staging area is required during an ETL load, for several reasons.
The source systems are available for data extraction only during a specific period of time, and this period is shorter than the total data-load time. A staging area lets you extract the data from the source system and hold it there before the time slot ends.
A staging area is required when you want to bring together data from multiple data sources, or when you want to join two or more systems. For example, you cannot run a single SQL query that joins two tables from two physically different databases.
The extraction time slots for different systems vary with their time zones and operational hours.
Data extracted from source systems can be used in multiple data warehouse systems, Operational Data Stores, etc.
ETL lets you perform complex transformations, which require an extra area to store the data.
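The staging idea above can be sketched in a few lines of Python. This is only an illustration, not how SAP BODS implements staging: the two source systems (`crm`, `erp`), the table names, and the helper `extract_to_staging` are all hypothetical, and in-memory SQLite databases stand in for the real sources. Each source is extracted into a staging table first, and the join then runs inside the staging database.

```python
import sqlite3

def extract_to_staging(source, query, staging, staging_table, columns):
    """Copy the rows returned by `query` on `source` into a new staging table."""
    rows = source.execute(query).fetchall()
    staging.execute(f"CREATE TABLE {staging_table} ({', '.join(columns)})")
    placeholders = ", ".join("?" for _ in columns)
    staging.executemany(
        f"INSERT INTO {staging_table} VALUES ({placeholders})", rows
    )
    staging.commit()
    return len(rows)

# Two physically separate source systems (hypothetical), simulated in memory.
crm = sqlite3.connect(":memory:")
crm.execute("CREATE TABLE customers (id INTEGER, name TEXT)")
crm.executemany("INSERT INTO customers VALUES (?, ?)", [(1, "Asha"), (2, "Ben")])

erp = sqlite3.connect(":memory:")
erp.execute("CREATE TABLE orders (customer_id INTEGER, amount REAL)")
erp.executemany(
    "INSERT INTO orders VALUES (?, ?)", [(1, 120.0), (1, 30.0), (2, 75.5)]
)

# Extract each source into the staging area while its time slot is open.
staging = sqlite3.connect(":memory:")
extract_to_staging(crm, "SELECT id, name FROM customers",
                   staging, "stg_customers", ["id", "name"])
extract_to_staging(erp, "SELECT customer_id, amount FROM orders",
                   staging, "stg_orders", ["customer_id", "amount"])

# The join is now possible because both tables live in one database.
joined = staging.execute(
    "SELECT c.name, o.amount FROM stg_customers c "
    "JOIN stg_orders o ON o.customer_id = c.id "
    "ORDER BY c.id, o.amount"
).fetchall()
print(joined)
```

Once the rows are in staging, the source connections can be closed, so later transformations do not depend on the sources' availability windows.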
Transform
In data transformation, you apply a set of functions to the extracted data before loading it into the target system. Data that does not require any transformation is known as direct move or pass-through data.
You can apply different transformations to data extracted from the source system. For example, you can perform customized calculations: if you want the sum of sales revenue and it is not stored in the database, you can apply the SUM formula during transformation and then load the result.
Similarly, if the first name and the last name are stored in different columns of a table, you can concatenate them before loading.
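The two transformations just described, concatenation and a SUM aggregate, might look like this in plain Python. The field names (`first_name`, `last_name`, `amount`) and the helper functions are assumptions made for the example; a real ETL tool would express the same steps through its own transform objects.

```python
from collections import defaultdict

def concat_name(row):
    """Return a copy of the row with first and last name joined into one field."""
    return {**row, "full_name": f"{row['first_name']} {row['last_name']}"}

def sum_sales(rows):
    """Aggregate the sales amount per customer (the SUM step)."""
    totals = defaultdict(float)
    for row in rows:
        totals[row["full_name"]] += row["amount"]
    return dict(totals)

# Hypothetical rows as they might arrive from the extraction step.
extracted = [
    {"first_name": "Asha", "last_name": "Rao", "amount": 120.0},
    {"first_name": "Asha", "last_name": "Rao", "amount": 30.0},
    {"first_name": "Ben", "last_name": "Lee", "amount": 75.5},
]

transformed = [concat_name(row) for row in extracted]
totals = sum_sales(transformed)
print(totals)
```

Rows that need neither step would simply skip both functions, which corresponds to the pass-through case.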
Load
During the Load phase, data is loaded into the end target system, which can be a flat file or a Data Warehouse system.
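As a rough sketch of the two kinds of target mentioned here, the snippet below writes the same rows to a flat file (CSV) and to a warehouse table. The table name `fact_sales`, the column names, and both loader functions are hypothetical; an in-memory buffer and an in-memory SQLite database stand in for a real file and a real warehouse.

```python
import csv
import io
import sqlite3

# Hypothetical transformed rows that are ready to load.
rows = [("Asha Rao", 150.0), ("Ben Lee", 75.5)]

def load_to_flat_file(rows, handle):
    """Write the rows as CSV with a header line."""
    writer = csv.writer(handle, lineterminator="\n")
    writer.writerow(["customer", "total_sales"])
    writer.writerows(rows)

def load_to_warehouse(rows, conn):
    """Insert the rows into a fact table, creating it if needed."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS fact_sales (customer TEXT, total_sales REAL)"
    )
    conn.executemany("INSERT INTO fact_sales VALUES (?, ?)", rows)
    conn.commit()

# Flat-file target (an in-memory buffer here instead of a file on disk).
buffer = io.StringIO()
load_to_flat_file(rows, buffer)

# Data warehouse target (an in-memory SQLite database as a stand-in).
dw = sqlite3.connect(":memory:")
load_to_warehouse(rows, dw)
loaded_count = dw.execute("SELECT COUNT(*) FROM fact_sales").fetchone()[0]
```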