Special offers now — see discounted courses.
day
:
hour
:
min
:
sec
See special offers
Azure AI for Developers: Azure AI Speech

Azure AI for Developers: Azure AI Speech

55mIntermediate2025-04-15

Authors

Marco Casalaina

Marco Casalaina

Course details

Using pre-built or customizable speech models, Azure AI Speech allows developers to build multimodal, multilingual, voice-enabled AI apps. In this course, instructor Marco Casalaina begins by outlining the basic features and capabilities of Azure Speech and identifies the most common use cases. Then, through hands-on instruction, he covers speech to text models and transcriptions, text to speech tools and voices, and avatar creation. The course wraps up with coverage of advanced Azure Speech capabilities.

Learning objectives
Identify common use cases for Azure AI Speech.
Customize speech to text models to fit specific needs.
Build and test text to speech audio content.
Build custom avatars and integrate gestures for enhanced communication.

Skills covered

Azure AI ServicesProgramming FoundationsCloud AdministrationArtificial Intelligence FoundationsCloud PlatformsArtificial Intelligence (AI)Cloud ComputingMicrosoftSoftware DevelopmentDeep Dive (X:Y)

Concepts

0. Introduction

  • 01 - What this course is about
  • 02 - What you should know

1. Azure Speech in Action - Common Use Cases

  • 03 - Common scenarios for Azure AI Speech

2. Speech to Text and Transcription

  • 04 - How speech to text works
  • 05 - Transcription
  • 06 - Customizing speech to text
  • 07 - Choosing between the OpenAI Whisper and Azure Speech models
  • 08 - Speech translation

3. Text to Speech

  • 09 - Text to speech - Azure Voice Gallery
  • 10 - Audio content creation
  • 11 - Custom voices

4. Avatars

  • 12 - Combining speech with avatars
  • 13 - Building custom avatars
  • 14 - Live chat avatars

5. Advanced Speech Capabilities

  • 15 - Video translation
  • 16 - Pronunciation assessment
  • 17 - Using Azure Content Understanding for audio and video
  • 18 - Azure Speech vs. real-time LLMs

Conclusion

  • 19 - More resources on Azure Speech

Related courses

Related learn paths

About us

LyndaKade is a leading learning platform that helps people learn business, software, technology, and creative skills to achieve personal and professional goals.

Phone numberAparat ChannelTelegram SupportTelegram ChannelInstagram Page

All rights to this site belong to LyndaKade.

Terms of Service|Privacy Policy

نماد الکترونیک enamad در صورت اتصال با آی‌پی داخل کشور، نمایش داده خواهد شد.
logo-samandehi - لوگو ساماندهی
zarinpal
zibal