AWS Certified Big Data - Specialty Certification

Training Architect
course instructor image
Fernando Medina Corey
I’m Fernando Medina Corey - a data engineer and technical course author. I love prototyping applications with new datasets and explaining new technical concepts and tools to developers, data engineers and software architects. I also try to stay involved in my local tech scene and frequently speak and teach at meetups and events.

Course Introduction

Getting Started

Course Introduction
00:06:18
About the Course Author
00:01:10
About the AWS Big Data Specialty Course Exam
00:03:55
Course Features and Tools
00:04:44

Domain 1 - Collection

AWS IoT

IoT Overview
00:04:31
IoT Core
00:05:55
IoT Thing Registry
00:05:42
IoT Authorization and Authentication
00:08:15
IoT Message Broker
00:17:01
IoT Shadow Service
00:07:39
IoT Rules Engine
00:06:09
IoT Job Service
00:04:51
Hands-On-Lab: Configuring Your First IoT Device
00:30:00

AWS Greengrass

Greengrass Overview
00:06:58

Kinesis

What is Kinesis?
00:03:50
Getting Data Into Kinesis
00:07:32
Kinesis Data Streams
00:11:24
Kinesis Data Firehose
00:09:23
Kinesis Data Firehose Destinations and Creating an Example Delivery Stream
00:22:00
Kinesis Limitations and Pricing
00:09:06
Hands-On-Lab: Creating and Configuring Kinesis Streams and Kinesis Firehose in AWS
01:00:00
Hands-On-Lab: Building a Pipeline to Ingest and Analyze Streaming Data
00:30:00

AWS Direct Connect

AWS Direct Connect Essentials
00:05:15

AWS Snowball and AWS Snowmobile

AWS Snowball and Snowmobile Overview
00:02:01

AWS Database Migration Service

AWS Database Migration Service Overview
00:06:57
Hands-On-Lab: Migrating a Database Using DMS
00:30:00

Domain 2 - Storage

Amazon S3

Import S3 and DynamoDB Content Note
00:01:39
S3 Essentials
00:13:36
Moving Data to S3 with Multipart and Single Operation Uploads
00:01:55
S3 Permissions
00:13:56
Amazon S3 Encryption
00:08:07
Storage Classes and Lifecycle Policies
00:11:45
S3 Miscellany
00:04:46
S3 Performance
00:04:51

Amazon DynamoDB

DynamoDB Overview
00:03:39
DynamoDB Core Concepts
00:10:23
Provisioned Throughput
00:09:40
DynamoDB Read Operations
00:04:26
Local and Global Secondary Indexes
00:09:18
Common DynamoDB Errors and Limits
00:09:49
DynamoDB Global Tables, Atomic Counters, and Conditional Writes
00:05:54
DynamoDB Pricing
00:03:38
DynamoDB Streams
00:21:25
DynamoDB and S3 Recap
00:06:18
Hands-On-Lab: Creating DynamoDB tables
00:30:00
Hands-On-Lab: Configuring DynamoDB Streams
01:00:00
Hands-On-Lab: DynamoDB Read Operation Performance
00:30:00
Hands-On-Lab: DynamoDB Tables and Global Secondary Indexes
00:30:00

Domain 3 - Processing

Amazon Elastic MapReduce (EMR)

EMR Overview
00:12:31
Configuring and Launching an EMR Cluster
00:13:23
Securing an EMR Cluster
00:08:26
Working with PySpark and S3
00:14:39
Hands-On-Lab: Querying EMR Using Hive
01:00:00

Amazon Machine Learning (AML)

Introduction to Amazon Machine Learning (AML)
00:04:27
Using AWS Datasources with AML
00:08:02
Training Machine Learning Models
00:15:22

Amazon SageMaker

Introduction to SageMaker
00:03:45
AML vs. Amazon SageMaker
00:02:22

Lambda

Lambda Overview and Processing
00:23:29

AWS Data Pipeline

AWS Data Pipeline
00:22:56

AWS Glue

Introduction to AWS Glue
00:07:25

Essential Big Data Tools and Concepts

Overview of Big Data Tooling
00:01:06
Big Data Tools Terminology
00:04:23
Core Big Data Tools
00:09:56
Big Data Tools - GUIs and Data Exploration
00:04:48
Machine Learning Tools
00:01:59

Domain 4 - Analysis

Elasticsearch

ElasticSearch Overview
00:05:26
Elasticsearch Clusters and Access Policies
00:15:00
Elasticsearch and VPC
00:02:26
Elasticsearch, Kibana, and AWS Lambda
00:15:06
Hands-On-Lab: Analyzing and Visualizing data with AWS Elasticsearch Service and Kibana
00:30:00

Athena

Amazon Athena Overview
00:11:28

Redshift

Introduction to Amazon Redshift
00:15:53
Redshift Cluster Configuration
00:15:30
Redshift Distribution Styles
00:09:24
Redshift Column Compression
00:08:48
Choosing Redshift Sort Keys
00:11:55
Loading and Unloading Data with Redshift
00:12:27
Hands-On-Lab: Migrating Redshift Data to and from S3
01:00:00

Kinesis Analytics

Kinesis Analytics Overview
00:15:56

Domain 5 - Visualization

Quicksight

Quicksight Overview
00:13:10

Data Visualization Tools

JavaScript Visualization Tools
00:04:58
Hands-On-Lab: Preparing S3 Data for Public Visualizations
00:30:00

Domain 6 - Security

Access

IAM Basics
00:03:24
IAM for Big Data
00:02:37

Auditing

CloudTrail Essentials
00:09:51

Encryption

Database Encryption
00:03:18

Course Conclusion

Final Steps

How to Prepare for the Exam
00:03:36
What's Next after the Certification?
00:02:00
Get Recognized
00:01:01
Live-Environment-Challenge: AWS Big Data Specialty Certification - Practice Exam
02:00:00

Details

Big data technologies are some of the most exciting and in-demand skills. These tools power large companies such as Google and Facebook and it is no wonder AWS is spending more time and resources developing certifications, and new services to catalyze the move to AWS big data solutions.


This course will provide you with much of the required knowledge needed to be prepared to take the AWS Big Data Specialty Certification. We will cover the different AWS (and non-AWS!) products and services that appear on the exam. Importantly - we will not cover material you should already have a solid understanding of such as AWS Identity and Access Management, and global infrastructure. For those foundational concepts, definitely review the AWS Certified Developer - Associate Level course here on Linux Academy.


Access The Data Dispatch: https://interactive.linuxacademy.com/diagrams/thedatadispatch.html


Join the Linux Academy community slack for chat here: https://inuxacademy-community-slack.herokuapp.com/ and join the #containers channel.


Study Guides

Whitepaper - Use Amazon Elasticsearch Service to Log and Monitor (Almost) Everything

Use Amazon Elasticsearch Service to Log and Monitor (Almost) Everything

Whitepaper - Streaming Data Solutions on AWS with Amazon Kinesis

Whitepaper - Best Practices for Migrating from RDBMS to Amazon DynamoDB

Whitepaper - Lamdba Architecture for Batch and Stream Processing

Whitepaper - Getting Started with Amazon Aurora

Whitepaper - Database Caching Strategies Using Redis

Whitepaper - Data Warehousing on AWS

The Data Dispatch

https://interactive.linuxacademy.com/diagrams/thedatadispatch.html

Whitepaper - Big Data Analytics Options on AWS

githublink.txt

Instructor Deck

Community

certificate ribbon icon

Earn a Certificate of Completion

When you complete this course, you’ll receive a certificate of completion as proof of your accomplishment.

Looking For Team Training?

Learn More