When does class start/end?
Classes begin promptly at 9:00 am, and typically end at 5:00 pm.
This hands-on Data Engineering Bootcamp teaches attendees the foundations of data engineering using Python and Spark SQL. Students learn how to build production-ready data-driven solutions and gain a...
Read MoreThis hands-on Data Engineering Bootcamp teaches attendees the foundations of data engineering using Python and Spark SQL. Students learn how to build production-ready data-driven solutions and gain a comprehensive understanding of data engineering.
This Data Engineer Bootcamp training is targeted to Data Engineers
Some working experience in any programming language; the students will be introduced to programming in Python. Basic understanding of SQL and data processing concepts, including data grouping and aggregation.
Chapter 1 - Big Data Concepts and Systems Overview for Data Engineers
Chapter 2 - Defining Data Engineering
Chapter 3 - Data Processing Phases
Chapter 4 - Python 3 Introduction
Chapter 5 - Python Variables and Types
Chapter 6 - Control Statements and Data Collections
Chapter 7 - Functions and Modules
Chapter 8 - File I/O and Useful Modules
Chapter 9 - Practical Introduction to NumPy
Chapter 10 - Practical Introduction to pandas
Chapter 11 - Data Grouping and Aggregation with pandas
Chapter 12 - Repairing and Normalizing Data
Chapter 13 - Data Visualization in Python
Chapter 14 - Python as a Cloud Scripting Language
Chapter 15 - Introduction to Apache Spark
Chapter 16 - The Spark Shell
Chapter 17 - Spark RDDs
Chapter 18 - Parallel Data Processing with Spark
Chapter 19 - Introduction to Spark SQL
Lab Exercises