Home > Software > Data-Warehouse > DataStage
Interview Questions   Tutorials   Discussions   Programs   Discussion   

DataStage - What are the stages will use implementing SCD2/DATE field?

asked mar September 20, 2014 06:33 AM  

What are the stages will use implementing SCD2/DATE field?


1 Answers

answered By vishnoiprem   0  

SCD type 2 implementation in Datastage

Slowly changing dimension Type 2 is a model where the whole history is stored in the database. An additional dimension record is created and the segmenting between the old record values and the new (current) value is easy to extract and the history is clear. The fields ‘effective date’ and ‘current indicator’ are very often used in that dimension and the fact table usually stores dimension key and version number. SCD 2 implementation in Datastage The job described and depicted below shows how to implement SCD Type 2 in Datastage. It is one of many possible designs which can implement this dimension. For this example, we will use a table with customers data (it’s name is DCUSTOMERSCD2) which has the following structure and data: D_CUSTOMER dimension table before loading

Datastage SCD2 job design

The most important facts and stages of the CUSTSCD2 job processing: • The dimension table with customers is refreshed daily and one of the data sources is a text file. For the purpose of this example the CUSTID=ETIMAA5 differs from the one stored in the database and it is the only record with changed data. It has the following structure and data: SCD 2 – Customers file extract:

There is a hashed file (Hash_NewCust) which handles a

  • lookup of the new data coming from the text file. A T001_Lookups transformer does a lookup into a hashed file and maps new and old values to separate columns. SCD 2 lookup transformer

    A T002CheckDiscrepacies_exist transformer compares old and new values of records and passes through only records that differ. SCD 2 check discrepancies transformer

    A T003 transformer handles the UPDATE and INSERT actions of a record. The old record is updated with current indictator flag set to no and the new record is inserted with current indictator flag set to yes, increased record version by 1 and the current date. SCD 2 insert-update record transformer

    ODBC Update stage (ODWCustomersSCD2Upd) – update action ‘Update existing rows only’ and the selected key columns are CUSTID and RECVERSION so they will appear in the constructed where part of an SQL statement. ODBC Insert stage (ODWCustomersSCD2Ins) – insert action ‘insert rows without clearing’ and the key column is CUSTID. DCUSTOMER dimension table after Datawarehouse refresh

   add comment

Your answer

Join with account you already have



Ready to start your tutorial with us? That's great! Send us an email and we will get back to you as soon as possible!