Course Outline
Introduction
Overview of Spark Streaming Features and Architecture
- Supported data sources
- Core APIs
Preparing the Environment
- Dependencies
- Spark and streaming context
- Connecting to Kafka
Processing Messages
- Parsing inbound messages as JSON
- ETL processes
- Starting the streaming context
Performing Windowed Stream Processing
- Slide interval
- Checkpoint delivery configuration
- Launching the environment
Prototyping the Processing Code
- Connecting to a Kafka topic
- Retrieving JSON from the data source using Paw
- Variations and additional processing
Streaming the Code
- Job control variables
- Defining values to match
- Functions and conditions
Acquiring Stream Output
- Counters
- Kafka output (matched and non-matched)
Troubleshooting
Summary and Conclusion
Requirements
- Experience with Python and Apache Kafka
- Familiarity with stream-processing platforms
Audience
- Data engineers
- Data scientists
- Programmers
Customer Reviews (4)
The trainer was very willing to answer all the questions I asked.
Caterina - Stamtech
Course - Developing APIs with Python and FastAPI
It was a tough course, as we had to cover a lot in a short time frame. Our trainer knew a lot about the subject and delivered the content to address our requirements. There was a lot of content to learn, but our trainer was helpful and encouraging. He answered all our questions in good detail, and we feel that we learned a lot. The exercises were well prepared and the tasks were tailored to our needs. I enjoyed this course.
Bozena Stansfield - New College Durham
Course - Build REST APIs with Python and Flask
The trainer passed on practical knowledge and experience.
Rumel Mateusz - Pojazdy Szynowe PESA Bydgoszcz SA
Course - GUI Programming with Python and PyQt
As I was the only participant, the training could be adapted to my needs.