你将学到什么
Become familiar with working with relational databases, using SQL based languages such as MySql, dealing with formatted data (XML, JSON, etc.)
Use Python to work with and analyze data from databases as well as from the web
课程概况
Data is the lifeblood of an organization. Competency in programming is an essential skill for successfully extracting information and knowledge from data.
The goal of this course is to introduce learners to the basics of programming in Python and to give a working knowledge of how to use programs to deal with data.
In this course, we will first cover the basics of programming and then focus on using Python on the entire data management process from data acquisition to analysis of data big data and small data.
This is an intensive hands-on course that will equip and reward learners with proficiency in data management skills.
课程大纲
Week 1: Python review
Week 2: Python review
Week 3: Data interchange formats: JSON and XML
Week 4: Web scraping and web crawling
Week 5: Database basics: Relational databases
Week 6: SQL
Week 7: Data analysis and visualization I
Week 8: Data analysis and visualization II
Week 9: Text mining
Week 10: Analysis of networks
Week 11: Machine learning: Part 1
Week 12: Machine learning: Part 2
预备知识
We will review basic Python programming concepts in week 1 and 2 and no prior programming experience is necessary. But, if you have had some exposure to programming you will get more out of this class.
No specific math capability is required though it will be helpful if you are familiar with the basic concepts of algebra, set theory, and probability.