sparksql
7 小时 通常来说是1天,包括中间休息。
Audience
Spark SQL是Apache Spark用于处理结构化和非结构化数据的模块。 Spark SQL提供有关数据结构以及正在执行的计算的信息。此信息可用于执行优化。 Spark SQL两个常见用途是:
- 执行SQL查询。
- 从现有Hive安装中读取数据。
在这个由讲师指导的实时培训(现场或远程)中,参与者将学习如何使用Spark SQL分析各种类型的数据集。
在培训结束时,参与者将能够:
课程格式
课程自定义选项
Machine Translated
Introduction
Overview of Data Access Approaches (Hive, databases, etc.)
Overview of Spark Features and Architecture
Installing and Configuring Spark
Understanding Dataframes in Spark
Defining Tables and Importing Datasets
Querying Data Frames using SQL
Carrying out Aggregations, JOINs and Nested Queries
Uploading and Accessing Data
Querying Different Types of Data
Querying Data Lakes with SQL
Troubleshooting
Summary and Conclusion
We are looking to expand our presence in China!
If you are interested in running a high-tech, high-quality training and consulting business.
Apply now!















.jpg)



.jpg)














.jpg)












