Stitch ETL – A Simple, extensible ETL built for data teams

Learn Stitch ETL, migrating data between Snowflake, AWS S3 and AWS PostgreSql

The course is about Stitch, a product owned by Talend.

What you’ll learn

  • Stitch ETL from scratch.
  • Data Migration.
  • Data Replication.
  • Streaming Data Pipeline.

Course Content

  • Introduction –> 2 lectures • 4min.
  • Signing Up –> 1 lecture • 2min.
  • Integration –> 2 lectures • 20min.
  • Destination –> 3 lectures • 23min.
  • Replication –> 2 lectures • 10min.

Stitch ETL - A Simple, extensible ETL built for data teams

Requirements

The course is about Stitch, a product owned by Talend.

 

What is Stitch?

Stitch is a cloud-first, open source platform for rapidly moving data. A simple, powerful ETL service, Stitch connects to various data sources and replicates that data to a destination.

 

• Stitch helps you replicate data into cloud data warehouses

• Stitch rapidly moves data from 130+ sources into a cloud data warehouse with no coding

• Stitch is Simple, extensible ETL built for data teams

 

This course starts with,

• Introduction of Stitch

• Signing up with Stitch

• Creating sources of AWS S3, AWS RDS PostgreSql

• Creating the targets of Snowflake, AWS S3 and AWS RDS PostgreSql

• Replicate the data from source to target

 

It enables to,

• Extract data from various sources

• Load into the leading cloud data platforms

• Analyze the data with the leading BI tools

 

Replication

Stitch’s replication process consists of three distinct phases:

  1. Extract: Stitch pulls data from your data sources and persists it to Stitch’s data pipeline through the Import API.
  2. Prepare: Data is lightly transformed to ensure compatibility with the destination.
  3. Load: Stitch loads the data into your destination.

A single occurrence of these three phases is called a replication job. You can keep an eye on a replication job’s progress on any integration’s Summary page.

 

Stitch integrated with the target systems such as,

• Amazon Redshift

• AWS S3

• Delta Lake on Databricks

• Google BigQuery

• Microsoft Azure Synapse Analytics

• Microsoft SQL Server

• Panoply

• PostgreSQL

• Snowflake

 

This course is for,

• ETL Developers

• Data Engineers

• Data Architects

• Data Migration Specialists

• Data Integration Specialists

Get Tutorial