Cloudera Streaming Analytics: Using Apache Flink and SQL Stream Builder on CDP

Price: $1,725 USD (GSA: $1,477.33)
Course Code: STREAM-CDP
Duration: 2 days
Available Formats: Classroom

During this two-day instructor-led training course, participants will learn development and operations for Cloudera Streaming Analytics, a framework for low-latency processing and analytics powered by Apache Flink and Cloudera's innovative SQL Stream Builder. Through extensive hands-on exercises, students will gain experience deploying and managing a Flink cluster, developing and running Flink applications, and using SQL Stream Builder's continuous SQL to perform analytics on streaming data.

Skills Gained

During this course, you will learn how to:

  • Deploy a Flink cluster using Cloudera Manager
  • Develop Flink batch and streaming applications
  • Run and view Flink jobs
  • Transform data streams
  • Use watermarks and windows to analyze streaming data
  • Analyze data with Cloudera SQL Stream Builder
  • Monitor Flink application metrics

Who Can Benefit

This course is designed for those who have experience with administration and application development on the Cloudera platform.

Prerequisites

Students must have at least basic familiarity with Java and Linux. Our Cloudera Training for Apache Kafka course, or equivalent experience with Apache Kafka, is a recommended prerequisite.

Course Details

Overview

  • Introduction to Apache Flink and Stream Processing
  • Typical Use Cases
  • Related Products

Service Deployment

  • Planning Requirements
  • Installation
  • Flink Dashboard
  • Exercise: Running a Flink Program

Flink Basics

  • Execution Environment
  • Flink Application Structure
  • Create a Flink Project
  • Build a Flink Program
  • Exercise: Building a Simple Flink Program

Architecture

  • Logical
  • Physical
  • Parallelism
  • Fault Tolerance
  • Data Storage

DataStream API

  • DataStream API Overview
  • Data Types and Serialization
  • Sources and Sinks
  • Transformations
  • Exercise: Batch Processing Using Flink
  • Exercise: Creating a Flink Streaming Application
  • Using Kafka as a Source and Sink
  • Exercise: Creating a Streaming Application Using a Kafka Source

Flink SQL and Table API

  • Streaming Concepts
  • Programming Options
  • Integrations
  • Exercise: Using Flink SQL and Kafka

Stateful Stream Processing

  • Stateful Streaming Applications
  • Checkpoints
  • Event Time Processing
  • Watermarks
  • Windows
  • Exercise: Tumbling Windows with Event Time

Cloudera SQL Stream Builder

  • Overview
  • Streaming SQL Console
  • Data Providers
  • SQL Stream Jobs
  • Exercise: Using SQL Stream Builder

Monitoring

  • Metrics
  • Logging
  • Backpressure
  • Resource Configuration
  • Exercise: Monitoring