Google Cloud Certified Professional Data Engineer (UPDATE IN PROGRESS)

Training Architect
course instructor image
Matthew Ulasien
I am a technology enthusiast who specializes in helping people and making businesses run better through modern tech, as well as a fan of all things Google. Before starting with Linux Academy, I was a cloud consultant for small businesses, helping them future-proof outdated infrastructure with modern cloud solutions and transform the way they work. I enjoy spending time with my family, playing games (when I actually have free time), and keeping up with modern technology trends.

Course Introduction

Getting Started

Important Information About this "In Progress" Course
00:01:37
(IMPORTANT!) Course Update in Progress
00:03:11
Course Introduction
00:05:48
About the Training Architect
00:00:41
Intro to the Data Dossier - Interactive Study Guide
00:04:53
Course and Exam Overview
00:05:06
What is a Data Engineer
00:03:19

Foundational Concepts

Data Lifecycle
00:11:41
Batch and Streaming Data
00:05:28
Cloud Storage as Staging Ground
00:07:16
Database Types
00:07:21
Monitoring Unmanaged Databases
00:06:04
Live-Environment-Challenge: Google Cloud Data Engineer - Foundational Concepts
00:30:00

Managed Databases

Cloud SQL

Choosing a Managed Database
00:07:15
Cloud SQL Basics
00:05:07
Cloud SQL Hands On
00:12:46
Importing Data
00:12:10
SQL Query Best Practices
00:02:48

Datastore

Datastore Overview
00:09:21
Data Organization
00:16:03
Queries and Indexing
00:11:29
Data Consistency
00:06:18

Bigtable

Bigtable Overview
00:07:44
Instance Configuration
00:18:11
Data Organization
00:05:28
Schema Design
00:08:37

Cloud Spanner

Cloud Spanner Overview
00:11:17
Data Organization and Schema
00:07:12
Hands On and Viewing Examples
00:11:46
Live-Environment-Challenge: QUIZ: Managed Databases on Google Cloud
00:30:00

Data Engineering Architecture

Real Time Messaging with Cloud Pub/Sub

Streaming Data Challenges
00:08:24
Cloud Pub/Sub Overview
00:12:28
Pub/Sub Hands On
00:18:24
Connecting Kafka to GCP
00:05:13

Data Pipelines with Cloud Dataflow

Data Processing Pipelines
00:05:24
Cloud Dataflow Overview
00:10:09
Key Concepts
00:09:43
Template Hands On
00:11:08
Streaming Ingest Pipeline Hands On
00:20:03
Additional Best Practices
00:10:11

Dataproc

Dataproc Overview
00:10:48
Configure Dataproc Cluster and Submit Job – Part 1
00:15:35
Configure Dataproc Cluster and Submit Job – Part 2
00:14:35
Migrating and Optimizing for Google Cloud
00:09:49
Best Practices for Cluster Performance
00:05:42
Live-Environment-Challenge: QUIZ: Data Ingest and Processing
00:30:00

Analyzing Data and Enabling Machine Learning

BigQuery

BigQuery Overview
00:14:43
Interacting with BigQuery
00:22:10
Load and Export Data
00:19:02
Optimize for Performance and Costs
00:15:29
Streaming Insert Example
00:08:38
BigQuery Logging and Monitoring
00:08:18
BigQuery Best Practices
00:14:53
Live-Environment-Challenge: QUIZ: BIGQUERY
00:30:00

Machine Learning

What is Machine Learning?
00:14:45
Working with Neural Networks
00:15:08
Preventing Overfitted Training Data
00:07:40

Cloud Machine Learning Engine (ML Engine)

ML Engine Overview
00:21:16
ML Engine Hands On Part 1
00:23:11
ML Engine Hands On Part 2
00:11:04

Pretrained Machine Learning API's

Pre-trained ML API's
00:12:31
Vision API Demo
00:13:28
Live-Environment-Challenge: QUIZ: MACHINE LEARNING ON GOOGLE CLOUD
00:30:00

Datalab

Datalab Overview
00:08:45
Datalab Demo
00:17:48

Data Visualization

Cleaning Your Data with Dataprep

What is Dataprep?
00:08:58
Dataprep Demo Part 1
00:14:15
Dataprep Demo Part 2
00:16:32
Dataprep Demo Part 3
00:11:48

Building Data Visualizations with Data Studio

Data Studio Introduction
00:09:31
Data Studio Demo
00:28:19
Live-Environment-Challenge: QUIZ: DATALAB/DATAPREP/DATA STUDIO
00:30:00

Monitoring and Orchestration

Orchestrating Data Workflows with Cloud Composer

Cloud Composer Overview
00:08:26
Hands On - Cloud Composer
00:15:25

Course Conclusion

Final Steps

Additional Study Resources
00:02:56
Additional Hands On and Practice Resources
00:04:43
What's Next After Certification?
00:03:41
Get Recognized!
00:01:01
(IMPORTANT!) Course Update in Progress
00:03:11
Live-Environment-Challenge: Data Engineer - Final Exam
02:00:00

Details

The Google Cloud Professional Data Engineer is able to harness the power of Google's big data capabilities and make data-driven decisions by collecting, transforming, and visualizing data. Through designing, building, maintaining, and troubleshooting data processing systems with a particular emphasis on the security, reliability, fault-tolerance, scalability, fidelity, and efficiency of such systems, a Google Cloud data engineer is able to put these systems to work.


This course will prepare you for the Google Cloud Professional Data Engineer exam by diving into all of Google Cloud's data services. With interactive demonstrations and an emphasis on hands-on work, you will learn how to master each of Google's big data and machine learning services and become a certified data engineer on Google Cloud.


Download the Data Dossier: https://interactive.linuxacademy.com/diagrams/TheDataDossier.html


Study Guides

Data Dossier Link

https://www.lucidchart.com/documents/view/0ca44a63-4ea4-4d78-8367-2465512d21be/0

Data Dossier PDF

This is the PDF version of the Data Dossier Lucidchart document. Be aware that each layer is on a separate page.

Instructor Deck

Community

certificate ribbon icon

Earn a Certificate of Completion

When you complete this course, you’ll receive a certificate of completion as proof of your accomplishment.

Looking For Team Training?

Learn More