Skip to content

My projects from the Data Engineer with AWS Nanodegree Program by Udacity.

Notifications You must be signed in to change notification settings

iamsamucoding/udacity-data-engineering-with-aws-nanodegree

Repository files navigation

Udacity Data Engineering with AWS Nanodegree Program

My projects from the Data Engineer with AWS Nanodegree Program by Udacity.

This program aimed to learn to design data models, build data warehouses and data lakes, automate data pipelines, and work with massive datasets.

  • Create user-friendly relational and NoSQL data models
  • Create scalable and efficient data warehouses
  • Work efficiently with massive datasets
  • Build and interact with a cloud-based data lake
  • Automate and monitor data pipelines
  • Develop proficiency in Spark, Airflow, and AWS tools

Course 1: Data Modeling

Learn to create relational and NoSQL data models to fit the diverse needs of data consumers. Use ETL to build databases in PostgreSQL and Apache Cassandra.

Contents

  • Introduction to Data Modeling
  • Relational Data Models
  • NoSQL Data Models

Projects

Course 2: Cloud Data Warehouses

Learn to create cloud-based data warehouses. Sharpen your data warehousing skills, deepen your understanding of data infrastructure, and be introduced to data engineering on the cloud using Amazon Web Services (AWS).

Contents

  • Introduction to the Data Warehouses
  • Introduction to the Cloud with AWS
  • Implementing Data Warehouses on AWS

Project

Course 3: Data Lake with Spark

Learn about Spark, Data Lakes, and Lakehouses with Amazon Web Services.

Contents

  • Introduction to Spark and Data Lakes
  • Big Data Ecosystem, Data Lakes, and Spark
  • Spark Essentials
  • Using Spark in AWS
  • Ingesting and Organizing Data in a Lakehouse

Project

  • Build a Data Lakehouse in AWS with Spark and AWS Glue
    • Build data pipelines that use Apache Spark to store, filter, process, and transform data from STEDI users for data analytics and machine learning applications.
  • Check out this project!

Course 4: Data Pipelines with Airflow

Learn about how to implement Data Pipelines with Airflow and AWS.

Contents

  • Introduction to Automating Data Pipelines
  • Data Pipelines
  • Airflow and AWS
  • Data Quality
  • Production Data Pipelines

Project

Certificate of completion

https://confirm.udacity.com/e/b15ef03a-19a2-11ee-a5c5-b39e4d9bff76

About

My projects from the Data Engineer with AWS Nanodegree Program by Udacity.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published