Course Overview
TOPThis 1 day course focuses on building intelligent applications that can see, interpret, and reason over images and documents using different multimodal models and agent-based tools. Learners explore how visual and document inputs can be combined with language models to enable structured extraction, analysis, and decision-making workflows. The course emphasizes practical patterns for extracting information, orchestrating tools, and grounding model responses in visual data.
Scheduled Classes
TOPOutline
TOPExtract insights from visual data on Azure
Module 1: Develop a vision-enabled generative AI application
- Use a vision-capable model in the Microsoft Foundry portal
- Develop a vision-based chat app
- Exercise - Develop a vision-enabled chat app
Module 2:Generate images with AI
- What are image-generation models?
- Explore image-generation models in Microsoft Foundry portal
- Create a client application that uses an image generation model
- Exercise - Generate images with AIModule 3: Generate videos with Microsoft Foundry
- Deploy a video generating model
- Generate video from a prompt
- Generate video in Python
- Exercise - Generate video with Sora 2 in Microsoft Foundry
Module 4: Analyze images with Content Understanding
- What is Content Understanding?
- Analyze images with Content Understanding
- Exercise - Analyze images with Content Understanding
Module 5: Create a multimodal analysis solution with Azure Content Understanding
- What is Azure Content Understanding?
- Create a Content Understanding analyzer
- Use the Content Understanding API
- Exercise - Extract information from multimodal content
Module 6: Create an Azure Content Understanding client application
- Prepare to use the AI Content Understanding API
- Create a Content Understanding analyzer
- Analyze content
- Exercise - Develop a Content Understanding client application
Module 7: Extract data with Azure Document Intelligence
- What is Azure Document Intelligence?
- Use the Document Intelligence Studio
- Use prebuilt models
- Train and use custom models
- Exercise - Analyze documents with Document Intelligence
Module 8: Create a knowledge mining solution with Azure AI Search
- What is Azure AI Search?
- Extract data with an indexer
- Enrich extracted data with AI skills
- Search an index
- Persist extracted information in a knowledge store
- Exercise - Create a knowledge mining solution
Prerequisites
TOP- Familiarity with core Azure concepts, including subscriptions, resource groups, and basic Azure services.
- Basic understanding of AI and machine learning concepts, especially as they relate to visual or image data.
- General awareness of Azure AI services or prior exposure to Azure-based AI solutions.
- Experience working with data or applications in a cloud environment is helpful but not required.
Who Should Attend
TOPThis course is designed for developers, AI engineers, and technical professionals who want to build applications that work with images and documents using multimodal, agent-driven approaches. It s best suited for learners with basic programming experience and a general understanding of cloud or AI concepts.