Azure AI for Developers: Azure AI Speech
55mIntermediate2025-04-15
Authors

Marco Casalaina
Course details
Using pre-built or customizable speech models, Azure AI Speech allows developers to build multimodal, multilingual, voice-enabled AI apps. In this course, instructor Marco Casalaina begins by outlining the basic features and capabilities of Azure Speech and identifies the most common use cases. Then, through hands-on instruction, he covers speech to text models and transcriptions, text to speech tools and voices, and avatar creation. The course wraps up with coverage of advanced Azure Speech capabilities.
Learning objectives
Identify common use cases for Azure AI Speech.
Customize speech to text models to fit specific needs.
Build and test text to speech audio content.
Build custom avatars and integrate gestures for enhanced communication.
Learning objectives
Identify common use cases for Azure AI Speech.
Customize speech to text models to fit specific needs.
Build and test text to speech audio content.
Build custom avatars and integrate gestures for enhanced communication.
Skills covered
Azure AI ServicesProgramming FoundationsCloud AdministrationArtificial Intelligence FoundationsCloud PlatformsArtificial Intelligence (AI)Cloud ComputingMicrosoftSoftware DevelopmentDeep Dive (X:Y)
Concepts
0. Introduction
- 01 - What this course is about
- 02 - What you should know
1. Azure Speech in Action - Common Use Cases
- 03 - Common scenarios for Azure AI Speech
2. Speech to Text and Transcription
- 04 - How speech to text works
- 05 - Transcription
- 06 - Customizing speech to text
- 07 - Choosing between the OpenAI Whisper and Azure Speech models
- 08 - Speech translation
3. Text to Speech
- 09 - Text to speech - Azure Voice Gallery
- 10 - Audio content creation
- 11 - Custom voices
4. Avatars
- 12 - Combining speech with avatars
- 13 - Building custom avatars
- 14 - Live chat avatars
5. Advanced Speech Capabilities
- 15 - Video translation
- 16 - Pronunciation assessment
- 17 - Using Azure Content Understanding for audio and video
- 18 - Azure Speech vs. real-time LLMs
Conclusion
- 19 - More resources on Azure Speech
Related courses
- Building Agents Using the Azure AI Foundry Agent Service
- Building Apps with Azure AI Language and Python
- Microsoft Azure AI Fundamentals (AI-900) Cert Prep by Microsoft Press
- Azure AI for Developers: Content Safety and Responsible AI
- Azure AI for Developers: Using the Azure AI Model Catalog
- Azure AI for Developers: Process Images with Azure AI
- Azure AI for Developers: LLMs and SLMs
- Azure AI for Developers: Building AI Agents