Course Overview
TOPProcessing streaming data is becoming increasingly popular as streaming enables businesses to get real-time metrics on business operations. This course covers how to build streaming data pipelines on Google Cloud. Pub/Sub is described for handling incoming streaming data. The course also covers how to apply aggregations and transformations to streaming data using Dataflow, and how to store processed records to BigQuery or Bigtable for analysis. Learners get hands-on experience building streaming data pipeline components on Google Cloud by using QwikLabs.
Scheduled Classes
TOPWhat You'll Learn
TOPInterpret use-cases for real-time streaming analytics.
- Manage data events using the Pub/Sub asynchronous messaging service.
- Write streaming pipelines and run transformations where necessary.
- Interoperate Dataflow, BigQuery and Pub/Sub for real-time streaming and analysis
Outline
TOP
Viewing outline for:
This module introduces the course and agenda
- This modules talks about challenges with processing streaming data
- This module talks about using Pub/Sub to ingest incoming streaming data
- This module revisits Dataflow and focuses on its streaming data processing capabilities
- This modules covers BigQuery and Bigtable for streaming data
- This module dives into more advanced features of BigQuery
- This module recaps the topics covered in course
- PDF links to all modules
Prerequisites
TOPExperience analyzing and visualizing big data, implementing cloud-based big data solutions, and transforming/processing datasets.
- Google Cloud Big Data and Machine Learning Fundamentals (or equivalent experience)
- Some knowledge of Java
Who Should Attend
TOPThis class is intended for data analysts, data scientists and programmers who want to build for out-of-the-ordinary scenarios such as high availability, resiliency, high-throughput, real-time streaming analytics on leveraging Google Cloud.