Big Data Analytics Using Spark

Learn how to analyze large datasets using Jupyter notebooks, MapReduce and Spark as a platform.

Computer Science, Big Data, Data Science

University of California, San Diego

edX

Time to complete: about 10 weeks
Level: Advanced
Language: English

Note: because of changes on the hosting platform, any listed start dates are for reference only.

What you will learn

Programming Spark using PySpark

Identifying the computational tradeoffs in a Spark application

Performing data loading and cleaning using Spark and Parquet

Modeling data through statistical and machine learning methods

Course overview

In data science, data is called “big” if it cannot fit into the memory of a single standard laptop or workstation.

The analysis of big datasets requires a cluster of tens, hundreds, or thousands of computers. Using such clusters effectively requires distributed file systems, such as the Hadoop Distributed File System (HDFS), and corresponding computational models, such as Hadoop, MapReduce, and Spark.
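
To make the MapReduce model mentioned above concrete, here is a minimal single-machine sketch in plain Python. It is not taken from the course and does not use Hadoop or Spark; the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) and the sample data are illustrative assumptions. On a real cluster the map and reduce steps would run on different machines, and the shuffle step would move data across the network.

```python
from collections import defaultdict
from itertools import chain


def map_phase(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle step: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce step: combine all values for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence (hypothetical helper)."""
    mapped = chain.from_iterable(map_phase(line) for line in lines)
    grouped = shuffle_phase(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    # Made-up sample input, standing in for lines read from a distributed file.
    sample = ["big data needs big clusters", "Spark processes big data"]
    print(word_count(sample))
```

Each phase only sees its own inputs, which is what lets a cluster run many map and reduce tasks in parallel.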

In this course, part of the Data Science MicroMasters program, you will learn what the bottlenecks are in massively parallel computation and how to use Spark to minimize them.

You will learn how to perform supervised and unsupervised machine learning on massive datasets using the Machine Learning Library (MLlib).

In this course, as in the other ones in this MicroMasters program, you will gain hands-on experience using PySpark within the Jupyter notebooks environment.

Prerequisites

The previous courses in the MicroMasters program: DSE200x, DSE210x, and DSE220x.
