Learn DataStage in 30 days

Please go through this content to learn DataStage.

Introduction to Data Warehousing

What is Data Warehousing?
Why is a Data Warehouse required?
Types of Systems
What is OLAP, and how does it contrast with OLTP?
SDLC – Data Warehousing Life Cycle

Database Design and Data Modeling

Source data
Staging Area
Target data
✔Analysis & Reporting
ODS

What is Extract, Transform, Load (ETL)?

Introduction to Extraction, Transformation and Loading
Types of ETL tools
Key tools in the market

Multi-Dimensional Modeling

What is dimensional modeling?
Difference between ER modeling and dimensional modeling
✔What is a Dimension?
What is a Fact?
Star Schema
Snowflake Schema
Difference between Star and Snowflake schemas
✔Fact Table
✔Different types of facts
✔Dimension Tables
✔Factless Fact Table
✔Conformed Dimensions
✔Non-conformed Dimensions
✔Junk Dimensions
✔Monster Dimensions
✔Degenerate Dimensions
✔What are slowly changing Dimensions?
✔Different types of SCDs
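
As a small illustration of the slowly changing dimension topic above, here is a minimal Python sketch of a Type 2 SCD, where a changed attribute closes the current row and adds a new version instead of overwriting it. This is not DataStage code. The function name `apply_scd2` and the column names (`surrogate_key`, `business_key`, `start_date`, `end_date`, `is_current`) are assumptions chosen for the example.

```python
from datetime import date


def apply_scd2(dim_rows, business_key, attrs, load_date):
    """Apply one incoming record to a Type 2 dimension held as a list of dicts.

    Hypothetical helper for illustration only.
    """
    # Find the current version of this business key, if there is one.
    current = next(
        (r for r in dim_rows
         if r["business_key"] == business_key and r["is_current"]),
        None,
    )

    # Nothing changed: keep the dimension as it is.
    if current is not None and current["attrs"] == attrs:
        return dim_rows

    # Something changed: expire the old version rather than overwrite it.
    if current is not None:
        current["end_date"] = load_date
        current["is_current"] = False

    # Insert the new version under a fresh surrogate key.
    next_key = max((r["surrogate_key"] for r in dim_rows), default=0) + 1
    dim_rows.append({
        "surrogate_key": next_key,
        "business_key": business_key,
        "attrs": dict(attrs),
        "start_date": load_date,
        "end_date": None,
        "is_current": True,
    })
    return dim_rows


customer_dim = []
apply_scd2(customer_dim, "C1", {"city": "Pune"}, date(2020, 1, 1))
apply_scd2(customer_dim, "C1", {"city": "Pune"}, date(2020, 6, 1))    # unchanged
apply_scd2(customer_dim, "C1", {"city": "Mumbai"}, date(2021, 6, 1))  # new version
```

After these three calls the dimension holds two rows for customer C1: the expired Pune row and the current Mumbai row. A Type 1 SCD would instead overwrite the city in place and keep no history.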

IBM InfoSphere DataStage and QualityStage, Version 9.1: Contents

Introduction to DataStage
✔Difference between DataStage 7.5.2 and 9.1
What’s new in DataStage 9.1?
✔What is the way ahead for DataStage?
✔IBM Information Server architecture
✔DataStage within the IBM Information Server architecture
✔Difference between Server Jobs and Parallel Jobs
✔Difference between Pipeline Parallelism and Partition Parallelism
✔Partition techniques (Round Robin, Random, Hash, Entire, Same, Modulus, Range, DB2, Auto)
✔Configuration File
✔Difference between SMP and MPP (Cluster) architectures
✔DataStage components (Server components / Client components)
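
To make the partition techniques in the list above more concrete, here is a minimal Python sketch of four of them: Round Robin, Modulus, Hash and Entire. This is not DataStage code. It assumes rows are plain dictionaries, the function names are hypothetical, and `zlib.crc32` is only a stand-in hash, not the hash function the parallel engine uses.

```python
import zlib


def round_robin(rows, n):
    """Deal rows out in turn: row i goes to partition i % n."""
    parts = [[] for _ in range(n)]
    for i, row in enumerate(rows):
        parts[i % n].append(row)
    return parts


def modulus(rows, n, key):
    """Partition on an integer key column: partition = key value % n."""
    parts = [[] for _ in range(n)]
    for row in rows:
        parts[row[key] % n].append(row)
    return parts


def hash_partition(rows, n, key):
    """Partition on a hash of the key, so equal keys share a partition.

    crc32 is an assumption for the example, chosen because it is deterministic.
    """
    parts = [[] for _ in range(n)]
    for row in rows:
        bucket = zlib.crc32(str(row[key]).encode("utf-8")) % n
        parts[bucket].append(row)
    return parts


def entire(rows, n):
    """Give every partition a full copy of the data."""
    return [list(rows) for _ in range(n)]


# Sample data, made up for the example.
cities = ["NY", "LA", "NY", "SF", "LA", "NY"]
rows = [{"id": i, "city": c} for i, c in enumerate(cities)]

by_turn = round_robin(rows, 3)
by_id = modulus(rows, 2, "id")
by_city = hash_partition(rows, 3, "city")
copies = entire(rows, 3)
```

Round Robin gives evenly sized partitions but scatters related rows. Key-based methods such as Hash and Modulus keep rows with the same key in the same partition, which is what stages like Join, Aggregator and Remove Duplicates need in order to produce correct results in parallel.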

Designer

✔Introduction to Designer
✔Repository
✔Palette
✔Types of Links
✔System (SMP) & Cluster system (MPP)
✔Parameter passing
✔Exporting and importing the jobs
✔Importing Methods (Flat file, Txt, Xls and Database files)
✔OSH Importing Method
✔Configuration file

File Stages

✔Sequential file
✔Dataset file
✔File set
✔Lookup file set
✔Difference between Sequential file/Dataset/File set

Database Stages

✔Overview of iWay, Classic Federation and Netezza
✔Dynamic RDBMS
✔Oracle Enterprise
✔ODBC Enterprise
✔Stored Procedure

DataStage Director

✔Introduction to DataStage Director
✔Job Status View
✔View logs
✔Scheduling
✔Batch Creation
✔Cleaning resources using Administrator

Processing Stages

Change Data Capture Stage
✔Compare Stage
✔Difference Stage
✔Aggregator Stage
✔Transformer Stage
✔Difference between BASIC Transformer and Transformer
✔Surrogate Key Generator Stage
✔Join Stage
✔Merge Stage
✔Lookup Stage
✔Difference between Join/Lookup/Merge
✔Difference between Join/Lookup
✔Remove Duplicates
✔Switch
✔Pivot
✔Modify
✔Funnel
✔Generic stage
Sort stage
✔Different types of sorting and sort stage
✔Different types of combining and collecting techniques
✔Filter
✔External filter
✔Difference between Filter, External Filter and Switch stages
✔SCD stage
✔Encode and decode stages
✔FTP stage
✔Adding job parameters to a job
✔Parameter set
✔Difference between partitioning and repartitioning
✔Runtime column propagation
✔Schema files

Debugging Stages

✔Head
✔Tail
✔Peek
✔Row Generator
✔Column Generator
✔Sample

Containers

✔Shared Containers
✔Local Containers

JOB SEQUENCE AREA

✔Job Activity
✔Job sequencer
✔Start loop Activity
✔End loop Activity
✔Notification Activity
✔Terminator Activity
✔Nested Condition Activity
✔Exception handling Activity
✔Execute Command Activity
✔Wait for file Activity
✔User variable Activity
✔Adding Check Points
✔Restartable

DS-DIRECTOR

✔About DS Director
✔Validation
✔Scheduling
✔Status
✔View logs
✔Monitoring
✔Suppress and Demote the Warnings
✔Peek view

DATASTAGE ADMINISTRATOR

✔Create Project
✔Delete Project
✔Protect Project
✔Environment variables
✔Auto purge
✔RCP
✔OSH
✔Executing Commands
✔Multiple Instances
✔Job Sequence Settings

IBM INFOSPHERE QUALITYSTAGE

✔Data Quality
✔Data Quality Stages
✔Investigate Stage
✔Standardize Stage
✔Match Frequency Stage
✔Reference Match Stage
✔Unduplicate Match Stage
✔Survive Stage
✔Viewing audit details in operational dashboard reports

UNIX Basics

✔Basic file commands
✔What are UNIX shell scripts

You can learn DataStage concepts by working through the course above at your own pace; 30 days is recommended.

Recommended Reading:

Data Warehousing:

  • The Data Warehouse Toolkit by Ralph Kimball (John Wiley and Sons, 1996)
  • Building the Data Warehouse by William Inmon (John Wiley and Sons, 1996)
