Warning: WP Redis: Connection refused in /www/wwwroot/cmooc.com/wp-content/plugins/powered-cache/includes/dropins/redis-object-cache.php on line 1433
用Microsoft Cognitive Services开发人工智能语音应用 | MOOC中国 - 慕课改变你,你改变世界

用Microsoft Cognitive Services开发人工智能语音应用

Developing AI Speech Apps with Microsoft Cognitive Services

Explore Microsoft Cognitive Speech Services, including language translation, speech and speaker recognition, and customized language models, then create your own AI app that can translate, recognize, synthesize, and perform authentication using speech.

1471 次查看
微软
edX
  • 完成时间大约为 5
  • 中级
  • 英语
注:因开课平台的各种因素变化,以上开课日期仅供参考

你将学到什么

Translate spoken content into other languages

Perform speech synthesis and recognition

Replace standard authentication with speaker verification

Integrate speech commanding into app experiences

Identify speakers via voice identification

Build a speech and speaker recognition app

课程概况

Microsoft Cognitive Services is a set of cloud-based intelligence services and APIs for building richer, smarter, and more sophisticated applications. The Speech APIs available in Microsoft Cognitive Services offer many ready-to-use and easy-to-consume features that help you use Artificial Intelligence (AI) to solve your business problems. In this practical course, take an in-depth look at Speech APIs, work through hands-on exercises to learn how to piece them together, and find out how to put them to work in your organization.

Start with an overview of Microsoft Cognitive Services, and then take a look at the Bing Speech API, which provides algorithms, exposed as simple REST-based service calls, to convert audio to text, understand speech intent, and convert text back to speech for natural responsiveness. Explore the Translator Speech API to add end-to-end, real-time, speech translation to applications and services. Get the details on the Speaker Recognition API, designed to perform speaker verification and identification. And dig into the Custom Speech API, which enables you to customize speech language models to perform domain-specific and use case-specific speech recognition.

Leverage the latest best practices and Fluent Design principles, as you learn how to create Windows 10 Universal Windows Platform applications that can run on multiple devices, including desktops, tablets, phones, HoloLens, and Xbox consoles. With a prerequisite of proficiency in a C-based programming language like C, C#, C++, or Java, follow along with the instructor as you work through the labs to replicate and modify code in the examples.

Wrap up the course by creating an application that authenticates users via speaker verification and searches relevant an popular news articles based on information returned from the Bing News Search API. The app can even optionally translate news headlines into your language of choice, using the Translator Speech API. From a general overview to specific use cases and hands-on practice, this course gives you what you need to create AI apps with off-the-shelf features in Cognitive Services Speech APIs.

课程大纲

Module 1: Bing Speech: Introduction to Microsoft Cognitive Service Bing Speech concepts and best practices, as well as integrating speech recognition and synthesis into applications.
Module 2: Translator Speech: Introduction to Microsoft Cognitive Service Translator Speech concepts and best practices, as well as integrating real-time speech translation into applications.
Module 3: Speaker Recognition: Introduction to Microsoft Cognitive Service Speaker Recognition concepts and best practices, as well as integrating speaker identification and verification into applications.
Module 4: Custom Speech Introduction to Microsoft Cognitive Service Custom Speech concepts, as well as integrating custom language models and speech recognition into applications.
Module 5: Final Project: Developing a Universal Windows Platform (UWP) application using various aspects of Microsoft Cognitive Speech Services.

预备知识

Intermediate coding skills in a C based language such as C, C#, C++, Java.  Course will primarily use C#, knowledge of C# is recommended, but not a prerequisite.

千万首歌曲。全无广告干扰。
此外,您还能在所有设备上欣赏您的整个音乐资料库。免费畅听 3 个月,之后每月只需 ¥10.00。
Apple 广告
声明:MOOC中国十分重视知识产权问题,我们发布之课程均源自下列机构,版权均归其所有,本站仅作报道收录并尊重其著作权益。感谢他们对MOOC事业做出的贡献!
  • Coursera
  • edX
  • OpenLearning
  • FutureLearn
  • iversity
  • Udacity
  • NovoEd
  • Canvas
  • Open2Study
  • Google
  • ewant
  • FUN
  • IOC-Athlete-MOOC
  • World-Science-U
  • Codecademy
  • CourseSites
  • opencourseworld
  • ShareCourse
  • gacco
  • MiriadaX
  • JANUX
  • openhpi
  • Stanford-Open-Edx
  • 网易云课堂
  • 中国大学MOOC
  • 学堂在线
  • 顶你学堂
  • 华文慕课
  • 好大学在线CnMooc
  • (部分课程由Coursera、Udemy、Linkshare共同提供)

© 2008-2022 CMOOC.COM 慕课改变你,你改变世界