在 深圳 完成此讲师指导的现场培训的参与者将获得对 Big Data 及其相关技术、方法和工具的实用、真实理解。
参与者将有机会通过动手练习将这些知识付诸实践。小组互动和教师反馈构成了课程的重要组成部分。
本课程首先介绍了 Big Data 的基本概念,然后进入用于执行 Data Analysis 的程式设计语言和方法。最后,我们讨论了支援 Big Data 存储、分散式处理和 Scala 特性的工具和基础设施。
This instructor-led, live training in 深圳 (online or onsite) is aimed at intermediate-level database administrators, developers, and analysts who wish to master advanced SQL functionalities for complex data operations and database management.
By the end of this training, participants will be able to:
Perform advanced querying techniques using unions, subqueries, and complex joins.
Add, update, and delete data, tables, views, and indexes with precision.
Ensure data integrity through transactions and manipulate database structures.
Create and manage databases efficiently for robust data storage and retrieval.
IDC 预测,到 2020 年,IT 行业将达到 5 万亿美元,比现在大约 1.7 万亿美元,而该行业 80% 的增长将由这些第三平台技术推动。从长远来看,这些技术将成为处理日益增长的数字信息复杂性的关键工具。Big Data 是智能行业解决方案之一,它允许政府根据分析大量数据(相关和不相关、结构化和非结构化)所揭示的模式采取行动,从而做出更好的决策。
“理解这些大量的Big Data需要尖端的工具和技术,这些工具和技术可以从大量不同的信息流中分析和提取有用的知识,”白宫科技政策办公室的Tom Kalil和Fen Zhao在OSTP博客上的一篇文章中写道。
白宫在2012年建立国家Big Data研究与开发计划(National Big Data Research and Development Initiative)时,在帮助各机构发现这些技术方面迈出了一步。该计划包括超过 2 亿美元,以充分利用 Big Data 的爆炸式增长以及分析它所需的工具。
Big Data 带来的挑战几乎与它的承诺一样令人生畏。高效存储数据是这些挑战之一。与往常一样,预算紧张,因此机构必须最大限度地降低每兆字节的存储价格,并使数据易于访问,以便用户可以在需要时以需要的方式获取数据。备份海量数据加剧了这一挑战。
有效分析数据是另一个重大挑战。许多机构采用商业工具,使他们能够筛选海量数据,发现可以帮助他们更有效地运营的趋势。(MeriTalk 最近的一项研究发现,联邦 IT 高管认为 Big Data 可以帮助机构节省超过 5000 亿美元,同时实现任务目标。
定制开发的 Big Data 工具还使机构能够满足分析其数据的需求。例如,橡树岭国家实验室的计算数据分析小组已将其食人鱼数据分析系统提供给其他机构。该系统帮助医学研究人员找到了一种联系,可以在主动脉瘤发作之前提醒医生注意主动脉瘤。它还用于更平凡的任务,例如筛选简历以将求职者与招聘经理联系起来。
课程 - Big Data Business Intelligence for Govt. Agencies
机器翻译
很多实际的例子,处理同一问题的不同方法,有时还不那么明显的技巧如何改进当前的解决方案
Rafal - Nordea
课程 - Apache Spark MLlib
机器翻译
培训师对概念有很好的把握
Josheel - Verizon Connect
课程 - Amazon Redshift
机器翻译
analytical functions
khusboo dassani - Tech Northwest Skillnet
课程 - SQL Advanced
The live examples
Ahmet Bolat - Accenture Industrial SS
课程 - Python, Spark, and Hadoop for Big Data
how the trainor shows his knowledge in the subject he's teachign
john ernesto ii fernandez - Philippine AXA Life Insurance Corporation
课程 - Data Vault: Building a Scalable Data Warehouse
I enjoyed the Maven training and how to configure it. I like to use Java programming language.
Robert Cost - Corning Incorporated
课程 - Apache ActiveMQ
trainer's knowledge
Fatma Badi - Dubai Electricity & Water Authority
课程 - Big Data - Data Science
very interactive...
Richard Langford
课程 - SMACK Stack for Data Science
Sufficient hands on, trainer is knowledgable
Chris Tan
课程 - A Practical Introduction to Stream Processing
During the exercises, James explained me every step whereever I was getting stuck in more detail. I was completely new to NIFI. He explained the actual purpose of NIFI, even the basics such as open source. He covered every concept of Nifi starting from Beginner Level to Developer Level.
Firdous Hashim Ali - MOD A BLOCK
课程 - Apache NiFi for Administrators
Trainer's preparation & organization, and quality of materials provided on github.
Mateusz Rek - MicroStrategy Poland Sp. z o.o.
课程 - Impala for Business Intelligence
Open discussion with trainer
Tomek Danowski - GE Medical Systems Polska Sp. Z O.O.
课程 - Process Mining
Get to learn spark streaming , databricks and aws redshift
Lim Meng Tee - Jobstreet.com Shared Services Sdn. Bhd.
课程 - Apache Spark in the Cloud
Very useful in because it helps me understand what we can do with the data in our context. It will also help me
Nicolas NEMORIN - Adecco Groupe France
课程 - KNIME Analytics Platform for BI
That I had it in the first place.
Peter Scales - CACI Ltd
课程 - Apache NiFi for Developers
Instructor very knowledgeable and very happy to stop and explain stuff to the group or to an individual.
Paul Anstee - Northrop Grumman
课程 - Apache Accumulo Fundamentals
Nice training, full of interesting topics. After each topic helpful examples were provided.
Pawel Wojcikowski - MicroStrategy Poland Sp. z o.o.
课程 - Teradata Fundamentals
practical things of doing, also theory was served good by Ajay
Dominik Mazur - Capgemini Polska Sp. z o.o.
课程 - Hadoop Administration on MapR
practice tasks
Pawel Kozikowski - GE Medical Systems Polska Sp. Zoo
课程 - Python and Spark for Big Data (PySpark)
Recalling/reviewing keypoints of the topics discussed.
Paolo Angelo Gaton - SMS Global Technologies Inc.
课程 - Building Stream Processing Applications with Kafka Streams
The VM I liked very much
The Teacher was very knowledgeable regarding the topic as well as other topics, he was very nice and friendly
I liked the facility in Dubai.
Safar Alqahtani - Elm Information Security
课程 - Big Data Analytics in Health
I genuinely enjoyed the hands passed exercises.
Yunfa Zhu - Environmental and Climate Change Canada
课程 - Foundation R
I generally liked the fernando's knowledge.
Valentin de Dianous - Informatique ProContact INC.
课程 - Big Data Architect
Richard's training style kept it interesting, the real world examples used helped to drive the concepts home.
Jamie Martin-Royle - NBrown Group
课程 - From Data to Decision with Big Data and Predictive Analytics