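Course Outline
Introduction
- Overview of Spark and Hadoop features and architecture
- Understanding big data
- Python programming basics
Getting Started
- Setting up Python, Spark, and Hadoop
- Understanding data structures in Python
- Understanding the PySpark API
- Understanding HDFS and MapReduce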
Integrating Spark and Hadoop with Python
- Implementing Spark RDDs in Python
- Processing data with MapReduce
- Creating distributed datasets in HDFS
Machine Learning with Spark MLlib
Processing Big Data with Spark Streaming
Working with Recommender Systems
Working with Kafka, Sqoop, and Flume
Apache Mahout with Spark and Hadoop
Troubleshooting
Summary and Next Steps
Requirements
- Experience with Spark and Hadoop
- Python programming experience
Audience
- Data scientists
- Developers
21 hours
Customer Reviews (3)
The fact that we were able to take with us most of the information, course materials, presentations, and exercises, so that we can look over them and perhaps redo what we didn't understand the first time or improve what we already did.
Raul Mihail Rat - Accenture Industrial SS
Course - Python, Spark, and Hadoop for Big Data
I liked that it managed to lay the foundations of the topic and go to some quite advanced exercises. Also provided easy ways to write/test the code.
Ionut Goga - Accenture Industrial SS
Course - Python, Spark, and Hadoop for Big Data
The live examples