Working with PySpark and S3

Length: 00:14:39

Lesson Summary:

In this video, you'll see how you can use your EMR cluster to interact with data stored in S3. Specifically, you will process CSV data inside of S3 and output it to S3 in a JSON format after processing it to calculate some particular results.


This lesson is only available to Linux Academy members.

Sign Up To View This Lesson
Or Log In

Looking For Team Training?

Learn More