Introduction to Data Engineering: Building Your First Pipeline

Caitlin C. Johnson
Last Updated: 1 February 2020

Overview

In this tutorial, you will learn how to build a data pipeline in Python that extracts data from a public API, transforms it with Pandas, and loads it into a Postgres database. The primary purpose of this project is to teach the fundamentals of data engineering. By the end of this tutorial, my goal is for you to understand some of the best practices for ETL (extract, transform, load) and to have a foundation in data engineering that you can build on.
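
To make the three stages concrete, here is a minimal sketch of what such a pipeline might look like. It is not the tutorial's actual code: the function names, the `id` column, the table name, and the connection details are all hypothetical, and it assumes the API returns a JSON array of objects. The load step assumes SQLAlchemy and a Postgres driver such as psycopg2 are installed.

```python
import json
import urllib.request

import pandas as pd


def extract(url: str) -> list[dict]:
    """Fetch JSON records from an API (assumes the body is a JSON array of objects)."""
    with urllib.request.urlopen(url, timeout=30) as resp:
        return json.load(resp)


def transform(records: list[dict]) -> pd.DataFrame:
    """Normalize column names, drop duplicate rows, and drop rows with no id.

    The "id" column is a hypothetical key used only for illustration.
    """
    df = pd.DataFrame.from_records(records)
    df.columns = [c.strip().lower() for c in df.columns]
    df = df.drop_duplicates().dropna(subset=["id"])
    df["id"] = df["id"].astype(int)
    return df.reset_index(drop=True)


def load(df: pd.DataFrame, table: str, con) -> None:
    """Write the frame to a database table, replacing the table if it exists."""
    df.to_sql(table, con=con, if_exists="replace", index=False)


def run_pipeline(api_url: str, db_url: str, table: str = "records") -> None:
    """Run extract, transform, and load in order.

    db_url is a SQLAlchemy URL, for example
    "postgresql+psycopg2://user:password@localhost:5432/tutorial" (placeholder values).
    """
    from sqlalchemy import create_engine  # imported here so transform() works without it

    engine = create_engine(db_url)
    load(transform(extract(api_url)), table, engine)
```

Calling `run_pipeline` with a real API URL and database URL would run all three stages. Keeping each stage in its own function means the transform step can be tested on in-memory records without touching the network or the database.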

Target Audience

If you have beginner experience with Python and you’re looking to jump into the data engineering world, then this tutorial will be particularly useful for you.

Tech Stack

Language: Python

Database: Postgres Relational Database

Tools: Pandas, TablePlus

Resume

Completing the tutorial will allow you to put the following on your resume:

Created a data pipeline in Python to extract data from a public API, transform it with Pandas, and load it into a Postgres database.

Prerequisites

You should have a basic understanding of programming in Python.

Price

$20.00
