logo


your one source for IT & AV

Training Presentation Systems Services & Consulting Cloud Services Purchase Client Center Computer Museum
Arrow Course Schedule | Classroom Rentals | Student Information | Free Seminars | Client Feedback | Partners | Survey | Standby Discounts

Data Engineering on Google Cloud

SS Course: 56025

Course Overview

TOP
Get hands-on experience with designing and building data processing systems on Google Cloud. This course uses lectures, demos, and hands-on labs to show you how to design data processing systems, build end-to-end data pipelines, analyze data, and implement machine learning. This course covers structured, unstructured, and streaming data.                                                                  

Scheduled Classes

TOP
08/12/24 - TDV - Virtual-Instructor Led - Virtual-Instructor Led (click to enroll)

What You'll Learn

TOP
Design and build data processing systems on Google Cloud.
  • Process batch and streaming data by implementing autoscaling data pipelines on Dataflow.
  • Derive business insights from extremely large datasets using BigQuery.
  • Leverage unstructured data using Spark and ML APIs on Dataproc.
  • Enable instant insights from streaming data.
  • Understand ML APIs and BigQuery ML, and learn to use AutoML tocreate powerful models without coding.
  • This course teaches participants the following skills:

Outline

TOP
Viewing outline for:
Introduction to Data Engineering
  • Dataflow Streaming Features
  • High-Throughput BigQuery and Bigtable Streaming Features
  • Advanced BigQuery Functionality and Performance
  • Introduction to Analytics and AI
  • Prebuilt ML Model APIs for Unstructured Data
  • Big Data Analytics with Notebooks
  • Production ML Pipelines
  • Custom Model Building with SQL in BigQuery ML
  • Custom Model Building with AutoML
  • Building a Data Lake
  • Building a Data Warehouse
  • Introduction to Building Batch Data Pipelines
  • Executing Spark on Dataproc
  • Serverless Data Processing with Dataflow
  • Manage Data Pipelines with Cloud Data Fusion and Cloud Composer
  • Introduction to Processing Streaming Data
  • Serverless Messaging with Pub/Sub

Prerequisites

TOP
Basic proficiency with a common query language such as SQL.
  • Experience with data modeling and ETL (extract, transform, load) activities.
  • Experience with developing applications using a common programming language such as Python.
  • Familiarity with machine learning and/or statistics.
  • To get the most of out of this course, participants should have:

    Who Should Attend

    TOP
    Extracting, Loading, Transforming, cleaning, and validating data
    • Designing pipelines and architectures for data processing
    • Integrating analytics and machine learning capabilities into data pipelines
    • Querying datasets, visualizing query results and creating reports
    • This class is intended for developers who are responsible for:

    Next Step Courses

    TOP