What are active and passive stages in DataStage?
Active Stage : Active stage model the flow of data and provide mechanisms for combining data streams, aggregating data and converting data from one data type to another. Eg, Transformer, aggregator, sort, Row Merger etc. Passive Stage : A Passive stage handles access to Database for the extraction or writing of data.
How is parallelism achieved in DataStage?
Partition Parallelism divides the incoming stream of data into subsets that will be processed separately by a separate node/processor. The details of these processors are to be defined in the DataStage Configuration File(to be dealt with in later topics).
Which statement is accurate regarding usage of database stages?
Native database stages match columns by name. DataStage provides native Oracle database stages. Database stages cannot create tables and must be done externally.
Which hash file is used when limited amount of data?
static hash file
These are static has files and dynamic hash files. A static hash file is used when a limited amount of data is loaded to the target database. A dynamic hash file is used when we don’t know the exact amount of data to be transferred to the target database. Q40).
What is a passive stage?
Passive stages: Stages that are used to read the data from database and write data to the database is called passive stages.
What are the passive stages in DataStage?
Actives stages, such as the Transformer and Aggregator, perform processing tasks, while passive stages, such as the Sequential File stage and Hashed File stage, are reading or writing data sources and provide services to the active stages. At its simplest, active stages become processes.
How do you optimize DataStage jobs?
To optimize an InfoSphere DataStage job, do the following steps:
- Start the Designer client and attach to the project that contains the job.
- Open the job that you want to optimize.
- Set the options and properties that control optimization.
- Optimize the job.
- View the optimization log.
- Save the optimized job as a new job.
Are there any real time interview questions for DataStage?
Nonetheless, DataStage real-time interview questions will help to get shortlisted for a job position. Let’s begin with IBM DataStage Interview questions and answers.
How is data stored in a DataStage file?
It stores data in binary in the internal format of Datastage. so it takes less time to read/write from ds to any other source/target. You cannot view the data without Datastage. It Creates 2 types of files to store the data. Descriptor File: Which is created in a defined folder/path.
What are the two types of lookups in DataStage?
There are two types of lookups in DataStage, Normal Lookup and Sparse Lookup. In the case of Normal Lookup, data is first stored in memory, and lookup is performed later. In the case of Sparse lookup, data is saved to the database directly. It is faster than the normal lookup. Q25).
What kind of job can you get in DataStage?
The job role in DataStage comprises the knowledge of data warehouse, ETL, data storage, configuration, design, various stages, and modules in data stages. DataStage is used to integrate various systems (multiple systems) and processes high volumes of data.