Course Overview
TOPIn this course, you will learn how to build an operational data lake that supports analysis of both structured and unstructured data. You will learn the components and functionality of the services involved in creating a data lake. You will use AWS Lake Formation to build a data lake, AWS Glue to build a data catalog, and Amazon Athena to analyze data. The course lectures and labs further your learning with the exploration of several common data lake architectures.
Scheduled Classes
TOPWhat You'll Learn
TOPCO1. Plan and design a data lake using established data lake methodologies.
- CO2. Describe the components and services required for building a data lake on AWS.
- CO3. Explain how to secure a data lake on AWS using appropriate permissions.
- CO4. Compare the ways data can be ingested, stored, and transformed in a data lake on AWS.
- CO5. Analyze and visualize data stored in a data lake on AWS.
- CO6. Build and automate deployment of a data lake on AWS.
- C07. Describe the role of a data lake within a modern data architecture.
- Apply data lake methodologies in planning and designing a data lake
Outline
TOPDescribe the value of data lakes
- Compare data lakes and data warehouses
- Describe the components of a data lake
- Recognize common architectures built on data lakes
- Describe the relationship between data lake storage and data ingestion
- Describe AWS Glue crawlers and how they are used to create a data catalog
- Identify data formatting, partitioning, and compression for efficient storage and query
- Lab 1: Set up a simple data lake
- Recognize how data processing applies to a data lake
- Use AWS Glue to process data within a data lake
- Describe how to use Amazon Athena to analyze data in a data lake
- Describe the features and benefits of AWS Lake Formation
- Use AWS Lake Formation to create a data lake
- Understand the AWS Lake Formation security model
- Lab 2: Build a data lake using AWS Lake Formation
- Automate AWS Lake Formation using blueprints and workflows
- Apply security and access controls to AWS Lake Formation
- Match records with AWS Lake Formation FindMatches
- Visualize data with Amazon QuickSight
- Lab 3: Automate data lake creation using AWS Lake Formation blueprints
- Lab 4: Data visualization using Amazon QuickSight
- Post course knowledge check
- Architecture review
- Course review
Prerequisites
TOPcompleted the Data Analytics Fundamentals digital course
- Recommended previous knowledge
Who Should Attend
TOPData platform engineers
- Solutions architects
- IT professionals