Seaborn的统计数据可视化

你将学到什么

Produce and customize various chart types with Seaborn

Apply feature selection and feature extraction methods with scikit-learn

Build a boosted decision tree classifier with XGBoost

课程概况

Welcome to this project-based course on Statistical Data Visualization with Seaborn. Producing visualizations is an important first step in exploring and analyzing real-world data sets. As such, visualization is an indispensable method in any data scientist’s toolbox. It is also a powerful tool to identify problems in analyses and for illustrating results. In this project, we will employ the statistical data visualization library, Seaborn, to discover and explore the relationships in the Breast Cancer Wisconsin (Diagnostic) data set. We will use the results from our exploratory data analysis (EDA) in the previous project, Breast Cancer Diagnosis – Exploratory Data Analysis to: drop correlated features, implement feature selection and feature extraction methods including feature selection with correlation, univariate feature selection, recursive feature elimination, principal component analysis (PCA) and tree based feature selection methods. Lastly, we will build a boosted decision tree classifier with XGBoost to classify tumors as either malignant or benign.

This course runs on Coursera’s hands-on project platform called Rhyme. On Rhyme, you do projects in a hands-on manner in your browser. You will get instant access to pre-configured cloud desktops containing all of the software and data you need for the project. Everything is already set up directly in your internet browser so you can just focus on learning. For this project, you’ll get instant access to a cloud desktop with Python, Jupyter, and scikit-learn pre-installed.

Notes:
– You will be able to access the cloud desktop 5 times. However, you will be able to access instructions videos as many times as you want.
– This course works best for learners who are based in the North America region. We’re currently working on providing the same experience in other regions.

课程大纲

Project: Statistical Data Visualization with Seaborn

Welcome to this project-based course on Statistical Data Visualization with Seaborn. Producing visualizations is an important first step in exploring and analyzing real-world data sets. As such, visualization is an indispensable method in any data scientist's toolbox. It is also a powerful tool to identify problems in analyses and for illustrating results. In this project, we will employ the statistical data visualization library, Seaborn, to discover and explore the relationships in the Breast Cancer Wisconsin (Diagnostic) data set. We will use the results from our exploratory data analysis (EDA) in the previous project, Breast Cancer Diagnosis – Exploratory Data Analysis to: drop correlated features, implement feature selection and feature extraction methods including feature selection with correlation, univariate feature selection, recursive feature elimination, principal component analysis (PCA) and tree based feature selection methods. Lastly, we will build a boosted decision tree classifier with XGBoost to classify tumors as either malignant or benign.

课程项目

Project Overview

Importing Libraries and Data

Dropping Correlated Columns from Feature List

Classification using XGBoost (minimal feature selection)

Univariate Feature Selection

Recursive Feature Elimination with Cross-Validation

Plot CV Scores vs Number of Features Selected

Feature Extraction using Principal Component Analysis

Statistical Data Visualization with Seaborn

你将学到什么

课程概况

课程大纲

课程项目

同类课程

IBM 人工智能工程专业证书

人工智能工作流程：企业模型部署

基于Google云平台的TensorFlow无服务器机器学习 – 法语版

Google云平台大数据与机器学习基础

这些课程也不错哦

使用 HTML、CSS 开发一个网站

Microsoft Excel – 使用Excel数据透视表进行数据分析

盖伊‧川崎的创业教练课 (繁体中文)

在线日本语JLPT N2课程（全10 课）

摄影后期处理全面提高篇

在线日本语JLPT N4课程（全15课）

市场营销策略

用户界面设计

声明：MOOC中国十分重视知识产权问题，我们发布之课程均源自下列机构，版权均归其所有，本站仅作报道收录并尊重其著作权益。感谢他们对MOOC事业做出的贡献！

© 2008-2022 CMOOC.COM 慕课改变你，你改变世界