Beginning with foundations, this training explains how Apache Beam and Dataflow work together to meet your data processing needs without the risk of vendor lock-in.The section on developing pipelines covers how you convert your business logic into data processing applications that can run on Dataflow. This training culminates with a focus on operations, which reviews the most important lessons for operating a data application on Dataflow, including monitoring, troubleshooting, testing, and reliability.
This training is for you if…
- A basic understanding of Java or Python programming language is required.
This training is not for you if…
- Little or no understanding of Java or Python programming language is required.
Clients we've helped
What you'll learn
- Plan and implement a well-architected logging and monitoring infrastructure
- Define Service Level Indicators (SLIs) and Service Level Objectives (SLOs)
- Create effective monitoring dashboards and alerts
- Monitor, troubleshoot, and improve Google Cloud infrastructure
- Analyze and export Google Cloud audit logs
- Find production code defects, identify bottlenecks, and improve performance
- Optimize monitoring costs
Module 1: Introduction
Module 2: Beam Portability
Module 3: Separating Compute and Storage with Dataflow
Module 4: IAM, Quotas, and Permissions
Module 5: Security
Module 6: Beam Concepts Review
Module 7: Windows, Watermarks, Triggers
Module 8: Sources and Sinks
Module 9: Schemas
Module 10: State and Timers
Module 11: Best Practices
Module 12: Dataflow SQL and DataFrames
Module 13: Beam Notebooks
Module 14: Monitoring
Module 15: Logging and Error Reporting
Module 16: Troubleshooting and Debug
Module 17: Performance
Module 18: Testing and CI/CD
Module 19: Reliability
Module 20: Flex Templates
Module 21: Summary
After this course the next step within the learning journey is the Professional Data Engineer Exam.
Get In Touch!
Contact Max Driessen now if you want to learn more and take your cloud skills to the next level!
Constantijn VisinescuCloud Consultant
Constantijn has been a hardcore Google “evangelist” for the past 5+ years. The main reason he likes Google Cloud Platform is because things tend to “just work”, leaving him much more time to build actual features for his customers. He is also a GCP authorized instructor.
Structured, to-the-point, good combination of theory and practical examples, very knowledgeable trainer who can explain concepts very well
It was a hands-on and tangible course. We could apply what we learned in a matter of minutes. The trainer did a great job of answering ad-hoc questions that complemented the material. We appreciated the fact that we could apply what we were taught directly to our company.
I liked every aspect of this training and would like to thank the trainers. They did an excellent job of explaining how to use Spark for data science. This is the fourth GoDataDriven training I’ve followed. All were great, but this was the best one so far.
Climbing a steep Python and Machine Learning curve in three days. This would have taken me months on my own.