Building Batch Data Pipelines on Google Cloud
Seminar / Corporate Training
Target Audience
This course is intended for developers who are responsible for designing pipelines and architectures for data processing.
Prerequisites
- Experience with data modeling and ETL (extract, transform, load) activities.
- Experience with developing applications by using a common programming language such as Python or Java.
Contents
- Review the different methods of data loading (EL, ELT, and ETL) and when to use each.
- Run Hadoop on Dataproc, use Cloud Storage, and optimize Dataproc jobs.
- Build your data processing pipelines by using Dataflow.
- Manage data pipelines with Data Fusion and Cloud Composer.
Certification
Google Cloud Certified Professional Data Engineer (PDE)