Training schedule

IN-COMPANY TRAINING PROGRAMS

Contact Giovanni Lanzani if you want to know more about custom data & AI training for your teams. He’ll be happy to help you!

This four-day instructor-led class provides you with a hands-on introduction to designing and building data processing systems on Google Cloud Platform. Through a combination of presentations, demos, and hands-on labs, you will learn how to design data processing systems, build end-to-end data pipelines, analyze data, and carry out machine learning. The course covers structured, unstructured, and streaming data.

This training is for you if…

you have:

  • Completed the Google Cloud Fundamentals: Big Data and Machine Learning course (#8325), or have equivalent experience
  • Basic proficiency with a common query language such as SQL
  • Experience with data modeling and extract, transform, load (ETL) activities
  • Experience developing applications using a common programming language such as Python
  • Familiarity with Machine Learning and/or statistics


This training is not for you if…

you have:

  • Not completed the Google Cloud Fundamentals: Big Data and Machine Learning course (#8325) and do not have equivalent experience
  • No familiarity with Machine Learning and/or statistics
  • Little or no experience with data modeling


What you'll learn

  • Design and build data processing systems on Google Cloud Platform
  • Process batch and streaming data by implementing autoscaling data pipelines on Cloud Dataflow
  • Derive business insights from extremely large datasets using Google BigQuery
  • Train, evaluate, and predict with machine learning models using TensorFlow and Cloud ML
  • Leverage unstructured data using Spark and ML APIs on Cloud Dataproc
  • Enable instant insights from streaming data

The schedule

1. Serverless Data Analysis with BigQuery

  • What is BigQuery
  • Advanced Capabilities
  • Performance and pricing
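
As a flavour of the serverless analysis covered in this module, here is a minimal sketch of running a standard-SQL query with the BigQuery Python client. The project ID is a placeholder and the query runs against a public dataset.

    # Run a standard-SQL query against a BigQuery public dataset and print the results.
    # "my-gcp-project" is a placeholder; credentials come from the environment.
    from google.cloud import bigquery

    client = bigquery.Client(project="my-gcp-project")

    query = """
        SELECT name, SUM(number) AS total
        FROM `bigquery-public-data.usa_names.usa_1910_2013`
        GROUP BY name
        ORDER BY total DESC
        LIMIT 10
    """

    for row in client.query(query).result():
        print(row["name"], row["total"])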

2. Serverless, Autoscaling Data Pipelines with Dataflow
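
As a rough illustration of the autoscaling pipelines this module builds, here is a minimal Apache Beam (Python SDK) word count; the bucket paths are placeholders, and the same code runs on Dataflow when launched with the DataflowRunner.

    # Minimal Apache Beam sketch: count words in text files on Cloud Storage.
    # Bucket paths are placeholders; add --runner=DataflowRunner options to run on Dataflow.
    import apache_beam as beam

    with beam.Pipeline() as pipeline:
        (
            pipeline
            | "Read" >> beam.io.ReadFromText("gs://my-bucket/input/*.txt")
            | "Split" >> beam.FlatMap(lambda line: line.split())
            | "Count" >> beam.combiners.Count.PerElement()
            | "Format" >> beam.MapTuple(lambda word, count: f"{word},{count}")
            | "Write" >> beam.io.WriteToText("gs://my-bucket/output/wordcounts")
        )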

3. Getting Started with Machine Learning

  • What is machine learning (ML)
  • Effective ML: concepts, types
  • Evaluating ML
  • ML datasets: generalization
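
To make the generalization point concrete, here is a tiny sketch of holding out an evaluation set; the scikit-learn toy dataset and model stand in for whatever the labs use.

    # Generalization sketch: train on one split, evaluate on data the model has never seen.
    from sklearn.datasets import load_iris
    from sklearn.linear_model import LogisticRegression
    from sklearn.model_selection import train_test_split

    X, y = load_iris(return_X_y=True)
    X_train, X_eval, y_train, y_eval = train_test_split(X, y, test_size=0.2, random_state=42)

    model = LogisticRegression(max_iter=1000).fit(X_train, y_train)
    print("train accuracy:", model.score(X_train, y_train))
    print("eval accuracy: ", model.score(X_eval, y_eval))  # the number that matters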

4. Building ML Models with TensorFlow

  • Getting started with TensorFlow
  • TensorFlow graphs and loops + lab
  • Monitoring ML training
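
As a rough sketch of this material, here is a small Keras model trained on synthetic data, with TensorBoard attached so training can be monitored; the data and log directory are placeholders for the lab's dataset and bucket.

    # Define, train, and monitor a small TensorFlow/Keras model.
    import numpy as np
    import tensorflow as tf

    X = np.random.rand(1000, 10).astype("float32")
    y = (X.sum(axis=1) > 5.0).astype("float32")

    model = tf.keras.Sequential([
        tf.keras.layers.Dense(16, activation="relu", input_shape=(10,)),
        tf.keras.layers.Dense(1, activation="sigmoid"),
    ])
    model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])

    # The TensorBoard callback writes training curves you can watch while the job runs.
    model.fit(X, y, epochs=5, validation_split=0.2,
              callbacks=[tf.keras.callbacks.TensorBoard(log_dir="./logs")])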

5. Scaling ML Models with CloudML

  • Why Cloud ML?
  • Packaging up a TensorFlow model
  • End-to-end training
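
Packaging a TensorFlow model for Cloud ML essentially means wrapping the training code in a small Python package with a command-line entry point. The sketch below is a hypothetical trainer/task.py; the flag names other than --job-dir (which is typically passed through by the training service) are illustrative.

    # Hypothetical trainer/task.py: the entry point invoked inside the packaged trainer.
    import argparse

    def train_and_export(train_data, job_dir, epochs):
        """Placeholder for the TensorFlow training loop built in the previous module."""
        print(f"training on {train_data} for {epochs} epochs, exporting to {job_dir}")

    if __name__ == "__main__":
        parser = argparse.ArgumentParser()
        parser.add_argument("--train-data", required=True)
        parser.add_argument("--job-dir", required=True)
        parser.add_argument("--epochs", type=int, default=10)
        args = parser.parse_args()
        train_and_export(args.train_data, args.job_dir, args.epochs)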

6. Feature Engineering

  • Creating good features
  • Transforming inputs
  • Synthetic features
  • Preprocessing with Cloud ML
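
As a sketch of "transforming inputs" and "synthetic features", the tf.feature_column API can bucketize a numeric input and cross it with a categorical one; the column names and boundaries below are illustrative.

    # Feature-engineering sketch: bucketize a raw input and cross it with a categorical
    # column to create a synthetic feature.
    import tensorflow as tf

    pickup_lat = tf.feature_column.numeric_column("pickup_latitude")
    lat_buckets = tf.feature_column.bucketized_column(
        pickup_lat, boundaries=[40.6, 40.7, 40.8, 40.9])

    day_of_week = tf.feature_column.categorical_column_with_vocabulary_list(
        "day_of_week", ["Mon", "Tue", "Wed", "Thu", "Fri", "Sat", "Sun"])

    # Synthetic feature: latitude bucket crossed with day of week.
    lat_x_day = tf.feature_column.crossed_column(
        [lat_buckets, day_of_week], hash_bucket_size=100)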

7. ML Architectures

  • Wide and deep
  • Image analysis
  • Embeddings and sequences
  • Recommendation systems
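
For the wide-and-deep idea specifically, a rough Keras sketch looks like this: a linear "wide" path over sparse or crossed features concatenated with a multi-layer "deep" path over dense features; the input sizes are invented for illustration.

    # Wide-and-deep sketch: wide and deep paths joined before a single output.
    import tensorflow as tf

    wide_in = tf.keras.Input(shape=(100,), name="wide_features")  # e.g. one-hot / crossed features
    deep_in = tf.keras.Input(shape=(10,), name="deep_features")   # e.g. dense numeric features

    deep = tf.keras.layers.Dense(64, activation="relu")(deep_in)
    deep = tf.keras.layers.Dense(32, activation="relu")(deep)

    merged = tf.keras.layers.concatenate([wide_in, deep])
    output = tf.keras.layers.Dense(1, activation="sigmoid")(merged)

    model = tf.keras.Model(inputs=[wide_in, deep_in], outputs=output)
    model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])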

8. Google Cloud Dataproc Overview

  • Introducing Google Cloud Dataproc
  • Creating and managing clusters
  • Defining master and worker nodes
  • Leveraging custom machine types and preemptible worker nodes
  • Creating clusters with the Web Console
  • Scripting clusters with the CLI
  • Using the Dataproc REST API
  • Dataproc pricing
  • Scaling and deleting Clusters
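
Clusters are typically created from the Web Console or the gcloud CLI as listed above; for completeness, here is a rough sketch with the Dataproc Python client. The project, region, cluster name, and machine types are placeholders, and the exact request shape depends on the client-library version.

    # Rough sketch: create a small Dataproc cluster programmatically.
    from google.cloud import dataproc_v1

    project_id, region = "my-gcp-project", "europe-west4"

    cluster_client = dataproc_v1.ClusterControllerClient(
        client_options={"api_endpoint": f"{region}-dataproc.googleapis.com:443"}
    )

    cluster = {
        "project_id": project_id,
        "cluster_name": "training-cluster",
        "config": {
            "master_config": {"num_instances": 1, "machine_type_uri": "n1-standard-2"},
            "worker_config": {"num_instances": 2, "machine_type_uri": "n1-standard-2"},
        },
    }

    operation = cluster_client.create_cluster(
        request={"project_id": project_id, "region": region, "cluster": cluster}
    )
    print("created:", operation.result().cluster_name)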

9. Running Dataproc Jobs

  • Controlling application versions
  • Submitting jobs
  • Accessing HDFS and GCS
  • Hadoop
  • Spark and PySpark
  • Pig and Hive
  • Logging and monitoring jobs
  • Accessing master and worker nodes with SSH
  • Working with PySpark REPL (command-line interpreter)
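
A PySpark job submitted to a Dataproc cluster is ordinary Spark code; the GCS connector on the cluster lets it read gs:// paths directly. A minimal sketch (the bucket path is a placeholder):

    # A PySpark word count you could submit to a Dataproc cluster.
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.appName("gcs-wordcount").getOrCreate()

    lines = spark.read.text("gs://my-bucket/input/*.txt")
    counts = (lines.rdd
              .flatMap(lambda row: row.value.split())
              .map(lambda word: (word, 1))
              .reduceByKey(lambda a, b: a + b))

    for word, count in counts.take(10):
        print(word, count)

    spark.stop()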

10. Integrating Dataproc with Google Cloud Platform

  • Initialization actions
  • Programming Jupyter/Datalab notebooks
  • Accessing Google Cloud Storage
  • Leveraging relational data with Google Cloud SQL
  • Reading and writing streaming data with Google Cloud Bigtable
  • Querying data from Google BigQuery
  • Making Google API Calls from notebooks
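
To illustrate accessing Cloud Storage and making Google API calls from a notebook, here is a small sketch with the Cloud Storage Python client; the bucket and object names are placeholders, and download_as_text requires a reasonably recent client library.

    # Notebook sketch: list objects under a prefix and read one of them.
    from google.cloud import storage

    client = storage.Client()

    for blob in client.list_blobs("my-bucket", prefix="data/"):
        print(blob.name, blob.size)

    text = client.bucket("my-bucket").blob("data/sample.csv").download_as_text()
    print(text[:200])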

11. Making Sense of Unstructured Data with Google’s Machine Learning APIs

  • Google’s Machine Learning APIs
  • Common ML Use Cases
  • Vision API
  • Natural Language API
  • Translate
  • Speech API
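
As a taste of these APIs, the sketch below scores the sentiment of a short piece of text with the Natural Language API; the exact request shape varies slightly between client-library versions.

    # Score the sentiment of a short text with the Natural Language API.
    from google.cloud import language_v1

    client = language_v1.LanguageServiceClient()
    document = language_v1.Document(
        content="The new release is fantastic and installation was painless.",
        type_=language_v1.Document.Type.PLAIN_TEXT,
    )
    sentiment = client.analyze_sentiment(request={"document": document}).document_sentiment
    print(f"score={sentiment.score:.2f}, magnitude={sentiment.magnitude:.2f}")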

12. Need for Real-Time Streaming Analytics

  • What is Streaming Analytics?
  • Use-cases
  • Batch vs. Streaming (Real-time)
  • Related terminologies
  • GCP products that help you build highly available, resilient, high-throughput, real-time streaming analytics (a review of Pub/Sub and Dataflow)

13. Architecture of Streaming Pipelines

  • Streaming architectures and considerations
  • Choosing the right components
  • Windowing
  • Streaming aggregation
  • Events, triggers

14. Stream Data and Events into PubSub

  • Topics and Subscriptions
  • Publishing events into Pub/Sub
  • Subscribing options: Push vs Pull
  • Alerts
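
Publishing an event into Pub/Sub is a one-liner once the client is set up; the project and topic IDs below are placeholders.

    # Publish one event to a Pub/Sub topic.
    from google.cloud import pubsub_v1

    publisher = pubsub_v1.PublisherClient()
    topic_path = publisher.topic_path("my-gcp-project", "sensor-events")

    # Message bodies are bytes; attributes (here a hypothetical sensor_id) are string metadata.
    future = publisher.publish(topic_path, b'{"temperature": 21.5}', sensor_id="A17")
    print("published message id:", future.result())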

15. Build a Stream Processing Pipeline

  • Pipelines, PCollections and Transforms
  • Windows, Events, and Triggers
  • Aggregation statistics
  • Streaming analytics with BigQuery
  • Low-volume alerts
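
Putting the pieces of this module together, here is a rough sketch of a streaming Beam pipeline: read events from Pub/Sub, window them, aggregate per window, and stream the results into BigQuery. The topic, table, and schema are placeholders.

    # Streaming sketch: Pub/Sub -> fixed windows -> per-window counts -> BigQuery.
    import apache_beam as beam
    from apache_beam.options.pipeline_options import PipelineOptions
    from apache_beam.transforms.window import FixedWindows

    options = PipelineOptions(streaming=True)

    with beam.Pipeline(options=options) as pipeline:
        (
            pipeline
            | "Read" >> beam.io.ReadFromPubSub(topic="projects/my-project/topics/sensor-events")
            | "Window" >> beam.WindowInto(FixedWindows(60))        # 60-second windows
            | "KeyAll" >> beam.Map(lambda msg: ("all_events", 1))
            | "Count" >> beam.CombinePerKey(sum)                   # events per window
            | "ToRow" >> beam.Map(lambda kv: {"key": kv[0], "event_count": kv[1]})
            | "Write" >> beam.io.WriteToBigQuery(
                "my-project:streaming_demo.event_counts",
                schema="key:STRING,event_count:INTEGER",
                write_disposition=beam.io.BigQueryDisposition.WRITE_APPEND,
            )
        )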

16. High Throughput and Low-Latency with Bigtable

  • Latency considerations
  • What is Bigtable
  • Designing row keys
  • Performance considerations
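
A small sketch of the row-key idea with the Bigtable Python client: the instance, table, and "readings" column family are assumed to exist already, and the key puts the entity ID before the timestamp so writes don't hotspot a single node.

    # Write and read one row with the Bigtable client; all names are placeholders.
    from google.cloud import bigtable

    client = bigtable.Client(project="my-gcp-project")
    table = client.instance("sensors-instance").table("sensor_readings")

    # Row key design: entity ID first, then timestamp, keeps one device's rows contiguous.
    row_key = b"device-42#20240101T120000"

    row = table.direct_row(row_key)
    row.set_cell("readings", b"temperature", b"21.5")
    row.commit()

    cell = table.read_row(row_key).cells["readings"][b"temperature"][0]
    print(cell.value)  # b'21.5'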

17. Data Visualization with Google Data Studio

  • What is Google Data Studio?
  • From data to decisions

Certification

Google Cloud Authorised Trainer

Meet your trainer

Thomas van Latum

Google Cloud Professional Certified & Google Authorized Trainer

Thomas is an experienced consultant with a demonstrated history of working in the information technology and services industry. He is highly skilled in Data Engineering, Development, and Google Cloud Platform.

He is passionate about understanding and applying new, innovative technologies in practice.

Flexible delivery

The Right Format For Your Preferred Learning Style

In-Classroom & In-Company Training
Online, Instructor-Led Training
Hybrid and Blended Learning
Self-Paced Training
Get in touch with the experts

Have any questions?

Contact Giovanni Lanzani, our Managing Director of Learning and Development, if you want to know more. He’ll be happy to help you!


You can also reach him by phone at +31 6 51 20 6163.

Course: Data Engineering on Google Cloud

Book now