Data Engineering with Google Dataflow and Apache Beam

Data Engineering with Google Dataflow and Apache Beam

First steps to Extract, Transform and Load data using Apache Beam and Deploy Pipelines on Google Dataflow

What you’ll learn

  • Apache Beam
  • ETL
  • Python
  • Google Cloud
  • DataFlow
  • Google Cloud Storage

Requirements

  • Basic Python
  • Free GCP Account

Description

This course wants to introduce you to the Apache Foundation’s newest data pipeline development framework: The Apache Beam, and how this feature is becoming popular in partnership with Google Dataflow. In a summary, we want to cover the following topics:
1. Understand your inner workings
2. What are your benefits
3. Explain how to use on your local machine without installation via Google Colab for development
4. Its main functions
5. Configure Apache Beam python SDK locallyvice
6. How to deploy this resource on Google Dataflow to a Batch pipeline
This course is dynamic, you will be receiving updates whenever possible.
It is important to remember that this course does not teach Python, but uses it. So, get comfortable with knowing Python basics, defining a function, creating objects and data types.
Also, if you are interested in learning section 4, which consists of deploying a pipeline on Google Dataflow, you will need to have a free counter in GCP. It’s a simple process, but it requires a credit card!
Requirements:
· Basic knowledge of Python
· Have Python 3.7 or greater installed locally (from section 4)
· Free account at GCP (from section 4)
Schedule:
· Section 2 – Concepts
· Section 3 – Main Functions
· Section 4 – Apache Beam on Google Dataflow

Who this course is for:

  • Data Engineers
  • Data Analysts
  • Business Intelligence Professionals
  • Open Source Fans
  • ETL Engineers

Last updated: 10/2021 | Size: 726 MB
Click to get:
Source: https://www.udemy.com/course/data-engineering-with-google-dataflow-and-apache-beam/


PySpark – Build DataFrames with Python, Apache Spark, and SQL
PySpark – Build DataFrames with Python, Apache Spark, and SQL
04.27.2021
Apache Spark Project for Beginners: A Complete Project Guide
Apache Spark Project for Beginners: A Complete Project Guide
05.28.2020
Apache Ranger : Fine-Grained Access Control
Apache Ranger : Fine-Grained Access Control
10.06.2021
Apache Cassandra v3 NoSQL. Data backup with Python3, AWS S3 Course
Apache Cassandra v3 NoSQL. Data backup with Python3, AWS S3 Course
04.10.2021

No comments.

Add Commenent
reload, if the code cannot be seen

or
Registration
Lostpassword