Big Data Essentials: HDFS, MapReduce and Spark RDD

Yandex
Coursera
  • Approximately 45 hours to complete
  • Intermediate level
  • English
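Note: This course is offered jointly by Coursera and Linkshare. Because conditions on the hosting platform may change, any start dates shown are for reference only.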

Course Overview

Have you heard of technologies such as HDFS, MapReduce and Spark? Have you always wanted to learn these tools but lacked concise starting material? Then don’t miss this course!

In this 6-week course you will:
– learn some basic technologies of the modern Big Data landscape, namely: HDFS, MapReduce and Spark;
– be guided both through systems internals and their applications;
– learn about distributed file systems, why they exist and what function they serve;
– grasp the MapReduce framework, a workhorse for many modern Big Data applications;
– apply the framework to process texts and solve sample business cases;
– learn about Spark, the next-generation computational framework;
– build a strong understanding of Spark basic concepts;
– develop the skills to apply these tools to create solutions in finance, social networks, telecommunications and many other fields.

Your learning experience will be as close to real life as possible, with the chance to evaluate your practical assignments on a real cluster. No mocking: a friendly, considerate atmosphere will make your learning smooth and enjoyable.

Get ready to work with real datasets alongside real masters!

Special thanks to:
– Prof. Mikhail Roytberg, APT dept., MIPT, who was the initial reviewer of the project and the supervisor and mentor of half of the BigData team. He was the one who helped get this show on the road.
– Oleg Sukhoroslov (PhD, Senior Researcher at IITP RAS), who has been teaching MapReduce, Hadoop and friends since 2008. Now he is leading the infrastructure team.
– Oleg Ivchenko (PhD student, APT dept., MIPT), Pavel Akhtyamov (MSc student, APT dept., MIPT) and Vladimir Kuznetsov (Assistant at P.G. Demidov Yaroslavl State University), superbrains who developed and now maintain the infrastructure used for practical assignments in this course.
– Asya Roitberg, Eugene Baulin, Marina Sudarikova. These people never sleep: they babysit this course day and night to make your learning experience productive, smooth and exciting.

Syllabus

Welcome

What are BigData and distributed file systems (e.g. HDFS)?

Solving Problems with MapReduce

Solving Problems with MapReduce (practice week)

Introduction to Apache Spark

Introduction to Apache Spark (practice week)

Real-World Applications

