What you'll learn
How to use HBase to implement low-latency NoSQL data stores.
How to use Storm to implement real-time streaming analytics solutions.
How to use Spark for high-performance interactive data analysis.
Course overview
In this four-week course, you’ll learn how to implement low-latency and streaming Big Data solutions using Hadoop technologies like HBase, Storm, and Spark on Microsoft Azure HDInsight.
Note: To complete the hands-on elements in this course, you will require an Azure subscription and a Windows, Linux, or Mac OS X client computer. You can sign up for a free Azure trial subscription (a valid credit card is required for verification, but you will not be charged for Azure services). Note that the free trial is not available in all regions. It is possible to complete the course and earn a certificate without completing the hands-on practices.
This course is the second in a series that explores big data and advanced analytics techniques with HDInsight. It builds on the batch processing techniques covered in DAT202.1x: Processing Big Data with Hadoop in Azure HDInsight.
Syllabus
Module 1: Using HBase for NoSQL Data
Module 2: Using Storm for Streaming Data
Module 3: Using Spark for Interactive Analysis
Module 4: Final Exam
Prerequisites
Familiarity with Hadoop clusters and Hive in HDInsight
Familiarity with database concepts and basic SQL query syntax
Familiarity with basic programming constructs (for example, variables, loops, and conditional logic); experience with Java or C# is useful but not essential
A willingness to learn actively and persevere when troubleshooting technical problems is essential