Ray Wurlod

P.O. Box 1214

ABN 57 092 448 518

North Sydney  N.S.W.  2060

Education and Consulting Services

Australia

 

Email: rayw@mindless.com

 

 

DataStage™ Fundamentals – Enterprise Edition

 

This page outlines the elements of the DataStage Fundamentals – Enterprise Edition class, one of the services offered by this business.

It is an instructor-led class that can be delivered on-site (in a training room equipped with one PC per student and connectivity to a DataStage server).

 

Duration

 

Four days

 

 

Objectives

 

Having completed this class the student will be able:

        to use DataStage tools to construct ETL tasks using DataStage parallel jobs

        to compile and execute those tasks

        to construct a hierarchy of control for those tasks

 

 

Contents

 

  1. Beginning DataStage: what DataStage is, how it works, what it is not
  2. Architectures: design-time, compile-time and run-time
  3. Developer's Toolkit: introduction to the client tools that a developer uses
  4. Parallelism Concepts: pipeline and partition parallelism and how each is implemented in parallel jobs
  5. Configuration: how parallel execution is controlled
  6. Repository Manager: metadata creation, import, export and management
  7. Creating a Job: standard, structured technique for constructing a DataStage job
  8. Designer: editing jobs, job parameters, stages, links and their properties
  9. Data Sets and File Sets: parallel data storage; creation, use and management
  10. Director: execution and inspection/review tool
  11. Sequential Files: processing data in text files of various kinds
  12. Combining Data: horizontal combination (lookup, join, merge, funnel) and vertical combination (aggregation)
  13. Transforming Data: Modify and Transformer stage types
  14. Accessing Tables: Enterprise, API and bulk loader stage types
  15. Job Sequences: using a GUI to construct control hierarchies

 

 

Courseware

 

Each student receives the following two documents.

        Student notes: Powerpoint presentation with detailed notes pages.

        Lab Exercises:  Detailed instructions for "hands on" exercises to reinforce learning

 

 

DataStage is a trademarks of International Business Machines Corporation.

Formally the product name is IBM® Websphere® DataStage.

IBM and Websphere are registered trademarks of International Business Machines Corporation.

 

This page is copyright © 2006, Ray Wurlod. All rights reserved.