Get in Touch

Course Outline

Advanced Transformation Building Blocks

  • Handling complex data types.
  • Managing fields, metadata, and dynamic structures.
  • Leveraging reusable transformation patterns.

Parameters, Variables, and Job-Oriented Design

  • Understanding runtime variables and scoping.
  • Parameterizing transformations for flexibility.
  • Designing parent-child job structures.

Database Integration and Lookup Strategies

  • Mastering advanced lookup steps.
  • Implementing effective caching strategies.
  • Designing efficient join operations.

Working with Files, APIs, and External Systems

  • Processing JSON and XML data formats.
  • Interacting with REST and SOAP services.
  • Executing streaming and batch data loads.

Error Handling and Data Quality Techniques

  • Capturing and routing error data.
  • Applying data validation patterns.
  • Conducting auditing and logging procedures.

Performance Tuning Essentials

  • Optimizing step design for efficiency.
  • Addressing memory usage and threading considerations.
  • Identifying and resolving bottlenecks.

Introduction to Repository-Based Development

  • Utilizing the Pentaho repository.
  • Managing version control.
  • Adopting team collaboration practices.

Deployment and Migration Practices

  • Moving jobs across different environments.
  • Managing configurations.
  • Following operational best practices.

Summary and Next Steps

Requirements

  • A solid grasp of ETL fundamentals.
  • Prior experience using Pentaho Data Integration.
  • Foundational knowledge of data warehousing concepts.

Target Audience

  • ETL developers.
  • Data engineers.
  • Technical professionals seeking to expand their PDI expertise.
 21 Hours

Number of participants


Price per participant

Testimonials (2)

Upcoming Courses

Related Categories