Apache Spark MLlib Training

Course Code

spmllib

Duration

35 hours (typically 5 days, including breaks)

Requirements

Knowledge of one of the following:

  • Java
  • Scala
  • Python
  • SparkR.

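Overview

MLlib is Spark's machine learning (ML) library. Its goal is to make practical machine learning scalable and easy. It consists of common learning algorithms and utilities, including classification, regression, clustering, collaborative filtering, and dimensionality reduction, as well as lower-level optimization primitives and higher-level pipeline APIs.

It is divided into two packages:

  • spark.mllib contains the original API built on top of RDDs.

  • spark.ml provides a higher-level API built on top of DataFrames for constructing ML pipelines.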

Audience

This course is aimed at engineers and developers who want to use the machine learning library built into Apache Spark.

Course Outline

spark.mllib: data types, algorithms, and utilities

  • Data types
  • Basic statistics
    • summary statistics
    • correlations
    • stratified sampling
    • hypothesis testing
    • streaming significance testing
    • random data generation
  • Classification and regression
    • linear models (SVMs, logistic regression, linear regression)
    • naive Bayes
    • decision trees
    • ensembles of trees (Random Forests and Gradient-Boosted Trees)
    • isotonic regression
  • Collaborative filtering
    • alternating least squares (ALS)
  • Clustering
    • k-means
    • Gaussian mixture
    • power iteration clustering (PIC)
    • latent Dirichlet allocation (LDA)
    • bisecting k-means
    • streaming k-means
  • Dimensionality reduction
    • singular value decomposition (SVD)
    • principal component analysis (PCA)
  • Feature extraction and transformation
  • Frequent pattern mining
    • FP-growth
    • association rules
    • PrefixSpan
  • Evaluation metrics
  • PMML model export
  • Optimization (developer)
    • stochastic gradient descent
    • limited-memory BFGS (L-BFGS)

spark.ml: high-level APIs for ML pipelines

  • Overview: estimators, transformers and pipelines
  • Extracting, transforming and selecting features
  • Classification and regression
  • Clustering
  • Advanced topics
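
The estimator, transformer, and pipeline ideas named in the spark.ml outline can be illustrated with a small plain-Python sketch. This is not the pyspark.ml API: every class below (Transformer, Estimator, Tokenizer, KeywordEstimator, KeywordModel, Pipeline, PipelineModel) is a hypothetical stand-in, and lists of dicts stand in for DataFrames. It only shows the pattern: a transformer maps a dataset to a new dataset, an estimator is fitted on a dataset to produce a transformer (the model), and a pipeline chains such stages.

```python
# Plain-Python sketch of the estimator / transformer / pipeline pattern.
# All classes are hypothetical stand-ins, NOT the real pyspark.ml classes.
# A "dataset" here is simply a list of dicts.


class Transformer:
    """Maps a dataset to a new dataset, usually by adding a column."""

    def transform(self, rows):
        raise NotImplementedError


class Estimator:
    """Learns from a dataset and returns a fitted Transformer (a model)."""

    def fit(self, rows):
        raise NotImplementedError


class Tokenizer(Transformer):
    """Adds a 'words' column holding the lowercased tokens of 'text'."""

    def transform(self, rows):
        return [{**r, "words": r["text"].lower().split()} for r in rows]


class KeywordModel(Transformer):
    """Predicts 1.0 when a row contains any of the learned keywords."""

    def __init__(self, keywords):
        self.keywords = keywords

    def transform(self, rows):
        return [
            {**r, "prediction": 1.0 if self.keywords & set(r["words"]) else 0.0}
            for r in rows
        ]


class KeywordEstimator(Estimator):
    """Toy learner: keeps the words seen only in rows labelled 1.0."""

    def fit(self, rows):
        positive, negative = set(), set()
        for r in rows:
            (positive if r["label"] == 1.0 else negative).update(r["words"])
        return KeywordModel(positive - negative)


class PipelineModel(Transformer):
    """A fitted pipeline: applies each fitted stage in order."""

    def __init__(self, stages):
        self.stages = stages

    def transform(self, rows):
        for stage in self.stages:
            rows = stage.transform(rows)
        return rows


class Pipeline(Estimator):
    """Chains stages; fitting it fits every estimator stage in order."""

    def __init__(self, stages):
        self.stages = stages

    def fit(self, rows):
        fitted = []
        for stage in self.stages:
            if isinstance(stage, Estimator):
                stage = stage.fit(rows)  # estimator -> fitted model
            rows = stage.transform(rows)  # feed the output to the next stage
            fitted.append(stage)
        return PipelineModel(fitted)


training = [
    {"text": "spark makes ml simple", "label": 1.0},
    {"text": "hadoop mapreduce job", "label": 0.0},
]
model = Pipeline([Tokenizer(), KeywordEstimator()]).fit(training)
predictions = model.transform([{"text": "Spark rocks"}, {"text": "a mapreduce job"}])
```

The point of the design is that the fitted pipeline is itself a transformer, so the same sequence of feature steps is applied at training time and at prediction time.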
