When does class start/end?
Classes begin promptly at 9:00 am, and typically end at 5:00 pm.
The two key components of any data pipeline are data lakes and warehouses. This course highlights use-cases for each type of storage and dives into the available data lake and warehouse solutions on...
Read MoreThe two key components of any data pipeline are data lakes and warehouses. This course highlights use-cases for each type of storage and dives into the available data lake and warehouse solutions on Google Cloud in technical detail. Also, this course describes the role of a data engineer, the benefits of a successful data pipeline to business operations, and examines why data engineering should be done in a cloud environment. This is the first course of the Data Engineering on Google Cloud series. After completing this course, enroll in the Building Batch Data Pipelines on Google Cloud course.
This course is intended for developers who are responsible for: Querying datasets, visualizing query results, and creating reports. Specific job roles include: Data Engineer, Data Analyst, Database Administrators, Big Data Architects
To benefit from this course, participants should have completed “Google Cloud Big Data and Machine Learning Fundamentals” or have equivalent experience. Participant should also have: • Basic proficiency with a common query language such as SQL. • Experience with data modeling and ETL (extract, transform, load) activities. • Experience with developing applications using a common programming language such as Python. • Familiarity with machine learning and/or statistics
Introduction
Introduction to Data Engineering
Building a Data Lake
Building a Data Warehouse
Summary
Course Resources