From Data to Decision with Big Data and Predictive Analytics Training


Duration

21 hours (usually 3 days, including breaks)

Requirements

Understanding of traditional data management and analysis methods such as SQL, data warehouses, business intelligence and OLAP. Understanding of basic statistics and probability (mean, variance, probability, conditional probability, etc.).

Overview


If you are trying to make sense of the data you have access to, or want to analyse unstructured data available on the net (such as Twitter or LinkedIn), this course is for you.

It is mostly aimed at decision makers and people who need to choose what data is worth collecting and what is worth analyzing.

It is not aimed at people configuring the solution, although they will benefit from the big picture.

Delivery Mode

During the course delegates will be presented with working examples of mostly open source technologies.

Short lectures will be followed by presentations and simple exercises carried out by the participants.

Content and Software used

All software used is updated each time the course is run, so we work with the newest versions available.

The course covers the whole process, from obtaining, formatting, processing and analysing the data to automating the decision-making process with machine learning.

Course Outline

Quick Overview

  • Data Sources
  • Mining Data
  • Recommender systems
  • Target Marketing


  • Structured vs unstructured
  • Static vs streamed
  • Attitudinal, behavioural and demographic data
  • Data-driven vs user-driven analytics
  • Data validity
  • Volume, velocity and variety of data


  • Building models
  • Statistical Models
  • Machine learning

Data Classification

  • Clustering
  • kGroups, k-means, nearest neighbours
  • Ant colonies, birds flocking
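
To give a feel for the clustering topic above, here is a minimal sketch of k-means in plain Python. It is not the course's own material: the function name `kmeans`, the sample points and the fixed random seed are all assumptions made for illustration, and a real project would more likely use a library implementation.

```python
import random


def kmeans(points, k, iterations=20, seed=0):
    """Tiny k-means for 2-D points (illustrative sketch, not production code)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # start from k distinct data points
    clusters = []
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            distances = [(p[0] - c[0]) ** 2 + (p[1] - c[1]) ** 2 for c in centroids]
            clusters[distances.index(min(distances))].append(p)
        # Update step: move each centroid to the mean of its cluster.
        new_centroids = []
        for cluster, old in zip(clusters, centroids):
            if cluster:
                mean_x = sum(x for x, _ in cluster) / len(cluster)
                mean_y = sum(y for _, y in cluster) / len(cluster)
                new_centroids.append((mean_x, mean_y))
            else:
                new_centroids.append(old)  # keep the centroid of an empty cluster
        if new_centroids == centroids:
            break  # assignments are stable, so we have converged
        centroids = new_centroids
    return centroids, clusters


# Hypothetical data: two visually separate groups of points.
sample_points = [(1, 1), (1.5, 2), (1, 0.5), (8, 8), (9, 9.5), (8.5, 9)]
centroids, clusters = kmeans(sample_points, k=2)
```

The number of groups `k` has to be chosen up front, which is one of the practical decisions discussed under this topic.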

Predictive Models

  • Decision trees
  • Support vector machine
  • Naive Bayes classification
  • Neural networks
  • Markov Model
  • Regression
  • Ensemble methods


  • Benefit/Cost ratio
  • Cost of software
  • Cost of development
  • Potential benefits
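
As a rough illustration of the benefit/cost items above, the ratio can be computed as expected benefits divided by total costs. The function name and all figures below are hypothetical, and treating total cost as software cost plus development cost is an assumption based on the items listed here.

```python
def benefit_cost_ratio(potential_benefit, software_cost, development_cost):
    """Return benefits divided by total cost (a value above 1 means benefits exceed costs)."""
    total_cost = software_cost + development_cost
    if total_cost <= 0:
        raise ValueError("total cost must be positive")
    return potential_benefit / total_cost


# Hypothetical figures, in any single currency.
ratio = benefit_cost_ratio(
    potential_benefit=300_000,
    software_cost=50_000,
    development_cost=100_000,
)
```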

Building Models

  • Data Preparation (MapReduce)
  • Data cleansing
  • Choosing methods
  • Developing model
  • Testing Model
  • Model evaluation
  • Model deployment and integration
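
The steps above can be sketched end to end on a toy data set. This is only an assumption-laden illustration: the records (a customer's monthly spend and whether they responded to an offer) are invented, and the chosen method is a decision stump, i.e. a one-level decision tree, picked for brevity rather than because the course prescribes it.

```python
def clean(records):
    """Data cleansing: drop rows with a missing feature or a missing label."""
    return [(x, y) for x, y in records if x is not None and y is not None]


def count_correct(threshold, rows):
    """Count rows where the rule 'spend >= threshold' matches the label."""
    return sum((x >= threshold) == y for x, y in rows)


def train_stump(train):
    """Developing the model: pick the midpoint threshold that fits the training rows best."""
    xs = sorted(x for x, _ in train)
    candidates = [(a + b) / 2 for a, b in zip(xs, xs[1:])]
    return max(candidates, key=lambda t: count_correct(t, train))


def accuracy(threshold, rows):
    """Model evaluation: share of rows classified correctly."""
    return count_correct(threshold, rows) / len(rows)


# Hypothetical raw data: (monthly spend, responded to offer); None marks a missing value.
raw = [
    (120, False), (300, False), (None, True), (950, True), (870, True),
    (200, False), (640, True), (410, None), (150, False), (720, True),
]

rows = clean(raw)                    # data cleansing
train, test = rows[:5], rows[5:]     # hold back some rows for testing
threshold = train_stump(train)       # developing the model
test_accuracy = accuracy(threshold, test)  # testing and evaluation
```

Holding back rows that the model never saw during training is what makes the evaluation step meaningful; deployment would then amount to applying the learned threshold to new customers.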

Overview of Open Source and commercial software

  • Selection of R-project package
  • Python libraries
  • Hadoop and Mahout
  • Selected Apache projects related to Big Data and Analytics
  • Selected commercial solution
  • Integration with existing software and data sources


Guaranteed to run even with a single delegate!
Public classroom course
From ¥54050
Participants and the trainer are in different locations, and the class is held online.

The more delegates, the greater the savings per delegate. The table reflects the price per delegate and is for illustration purposes only; actual prices may differ.

Number of Delegates    Public classroom course    Private remote custom course
1                      ¥54050                     ¥51650
2                      ¥38100                     ¥36750
3                      ¥32783                     ¥31783
4                      ¥30125                     ¥29300




Venue                   Date                           Price (remote / classroom)
钰海环球金融中心        Tuesday, 2017-08-15 09:30      ¥51650 / ¥54050
深圳 - 新世界中心       Wednesday, 2017-08-23 09:30    ¥51650 / ¥58820
苏州 - 晋合广场         Monday, 2017-08-28 09:30       ¥51650 / ¥55850
上海 - 六八八广场       Monday, 2017-08-28 09:30       ¥51650 / ¥54050
厦门 - 国际银行大厦     Wednesday, 2017-09-13 09:30    ¥51650 / ¥55850