使用Microsoft Cognitive Services开发AI Vision应用程序

你将学到什么

Identify, categorize, and tag images

Interpret images for characteristics like faces, mood, age, and scene

Perform Optical Character Recognition (OCR) within images

Intelligently manipulate photos

Build an image classifier app

课程概况

Microsoft Cognitive Services is a set of cloud-based intelligence services and APIs for building richer, smarter, and more sophisticated applications. The Vision APIs available in Microsoft Cognitive Services offer many ready-to-use and easy-to-consume features that help you use Artificial Intelligence (AI) to solve your business problems. In this practical course, take an in-depth look at Vision APIs, work through hands-on exercises to learn how to piece them together, and find out how to put them to work in your organization.

Start with an overview of Microsoft Cognitive Services, and then take a look at the Computer Vision API, which gathers actionable information in images. Learn about optical character recognition (OCR) and image manipulation. Explore the Face API to detect, identify, analyze, organize, and tag photos and to create metadata around them. Get the details on the Emotion API, which helps personalize user experience with emotion recognition. And dig into the Custom Vision API, which enables you to customize computer vision models as required for your own specific use cases.

Leverage the latest best practices and Fluent Design principles, as you learn how to create Windows 10 Universal Windows Platform applications that can run on multiple devices, including desktops, tablets, phones, HoloLens, and Xbox consoles. With a prerequisite of proficiency in a C-based programming language like C, C#, C++, or Java, follow along with the instructor as you work through the labs to replicate and modify code in the examples.

Wrap up the course by creating an application that analyzes and tags images found in Twitter timelines for the presence of human faces and their recognized emotion, based on information returned from the Computer Vision, Face, Emotion, and Custom Vision APIs. The app can optionally even attempt to locate the source and physical location of people, places, or organizations referenced in the content. From a general overview to specific use cases and hands-on practice, this course gives you what you need to create AI apps with off-the-shelf features in Cognitive Services Vision APIs.

课程大纲

Modue1: Computer Vision Overview
Module2: Computer Vision
Module 3: OCR and Image Manipulation
Module 4: Custom Vision
Module 5: Final Project

预备知识

Intermediate Computer Science coding skills in a C based language such as C, C#, C++, Java. Course will primarily use C#, knowledge of C# is recommended, but not a prerequisite.