Multimodal LLM Workflows in Vertex AI Training Course
Vertex AI provides tools for building multimodal large language model (LLM) workflows that combine text, audio, and image data in a single pipeline. With extended context windows and configurable Gemini API parameters, the platform supports applications in planning, complex reasoning, and cross-modal analysis.
This instructor-led, live training (available online or onsite) is designed for intermediate to advanced practitioners seeking to design, build, and optimize multimodal AI workflows within the Vertex AI ecosystem.
Upon completion of this training, participants will be able to:
- Utilize Gemini models to handle multimodal inputs and generate corresponding outputs.
- Develop long-context workflows to address complex reasoning challenges.
- Architect pipelines that effectively integrate text, audio, and image analysis.
- Optimize Gemini API parameters to enhance performance while ensuring cost efficiency.
Course Format
- Interactive lectures and facilitated discussions.
- Practical, hands-on labs focused on multimodal workflows.
- Project-based exercises demonstrating applied multimodal use cases.
Course Customization Options
- To request customized training for this course, please contact us to arrange a session.
Course Outline
Introduction to Multimodal LLMs in Vertex AI
- Overview of multimodal capabilities in Vertex AI
- Gemini models and supported modalities
- Use cases in enterprise and research
Setting Up the Development Environment
- Configuring Vertex AI for multimodal workflows
- Working with datasets across modalities
- Hands-on lab: environment setup and dataset preparation
Long Context Windows and Advanced Reasoning
- Understanding long-context workflows
- Use cases in planning and decision-making
- Hands-on lab: implementing long-context analysis
Cross-Modal Workflow Design
- Combining text, audio, and image analysis
- Chaining multimodal steps in pipelines
- Hands-on lab: designing a multimodal pipeline
Working with Gemini API Parameters
- Configuring multimodal inputs and outputs
- Optimizing inference and efficiency
- Hands-on lab: tuning Gemini API parameters
Advanced Applications and Integrations
- Interactive multimodal agents and assistants
- Integrating external APIs and tools
- Hands-on lab: building a multimodal application
Evaluation and Iteration
- Testing multimodal performance
- Metrics for accuracy, alignment, and drift
- Hands-on lab: evaluating multimodal workflows
Summary and Next Steps
Requirements
- Proficiency in Python programming
- Experience in developing machine learning models
- Familiarity with multimodal data types (text, audio, image)
Audience
- AI researchers
- Senior software developers
- Machine learning scientists
Related Courses
Advanced LangGraph: Optimization, Debugging, and Monitoring Complex Graphs
35 Hours
LangGraph is a framework designed for building stateful, multi-actor LLM applications as composable graphs, offering persistent state and precise control over execution.
This instructor-led, live training (available online or on-site) is tailored for advanced-level AI platform engineers, DevOps professionals specializing in AI, and ML architects who aim to optimize, debug, monitor, and operate production-grade LangGraph systems.
By the conclusion of this training, participants will be able to:
- Design and optimize complex LangGraph topologies to enhance speed, reduce costs, and ensure scalability.
- Engineer system reliability through retries, timeouts, idempotency, and checkpoint-based recovery mechanisms.
- Debug and trace graph executions, inspect state, and systematically reproduce production issues.
- Instrument graphs with logs, metrics, and traces, deploy them to production environments, and monitor SLAs and operational costs.
Course Format
- Interactive lectures and discussions.
- Extensive exercises and hands-on practice.
- Real-time implementation in a live-lab environment.
Course Customization Options
- To request a customized training session for this course, please contact us to arrange.
Building Coding Agents with Devstral: From Agent Design to Tooling
14 Hours
Devstral is an open-source agentic coding model from Mistral AI, built for creating and operating coding agents that interact with codebases, developer tools, and APIs to boost engineering productivity.
This instructor-led live training (available online or onsite) targets intermediate to advanced ML engineers, developer-tooling teams, and SREs aiming to design, implement, and optimize coding agents using Devstral.
Upon completing this training, participants will be able to:
- Configure and set up Devstral for coding agent development.
- Design agentic workflows for exploring and modifying codebases.
- Integrate coding agents with developer tools and APIs.
- Apply best practices for secure and efficient agent deployment.
Course Format
- Interactive lectures and discussions.
- Extensive exercises and practical sessions.
- Hands-on implementation within a live lab environment.
Customization Options
- To arrange a tailored version of this course, please contact us.
Open-Source Model Ops: Self-Hosting, Fine-Tuning and Governance with Devstral & Mistral Models
14 Hours
Devstral and Mistral are open-source AI technologies designed to enable flexible deployment, fine-tuning, and scalable integration.
This instructor-led live training, available online or onsite, targets intermediate to advanced machine learning engineers, platform teams, and research engineers who want to self-host, fine-tune, and govern Mistral and Devstral models in production environments.
Upon completing this training, participants will be able to:
- Set up and configure self-hosted environments for Mistral and Devstral models.
- Apply fine-tuning techniques to enhance domain-specific performance.
- Implement versioning, monitoring, and lifecycle governance processes.
- Ensure security, compliance, and responsible usage of open-source models.
Course Format
- Interactive lectures and discussions.
- Hands-on exercises focused on self-hosting and fine-tuning.
- Live-lab implementation of governance and monitoring pipelines.
Course Customization Options
- To request customized training for this course, please contact us to arrange.
LangGraph Applications in Finance
35 Hours
LangGraph is a framework designed for constructing stateful, multi-actor LLM applications as composable graphs, featuring persistent state and precise control over execution flow.
This instructor-led, live training session (available online or onsite) targets intermediate to advanced professionals who aim to design, implement, and manage LangGraph-based financial solutions with appropriate governance, observability, and compliance.
Upon completing this training, participants will be able to:
- Design finance-specific LangGraph workflows that align with regulatory and audit requirements.
- Integrate financial data standards and ontologies into graph state and tooling.
- Implement reliability, safety, and human-in-the-loop controls for critical processes.
- Deploy, monitor, and optimize LangGraph systems to enhance performance, reduce costs, and meet SLAs.
Course Format
- Interactive lectures and discussions.
- Extensive exercises and practical application.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request customized training for this course, please contact us to arrange.
LangGraph Foundations: Graph-Based LLM Prompting and Chaining
14 Hours
LangGraph is a framework designed for constructing graph-structured LLM applications that facilitate planning, branching, tool usage, memory management, and controllable execution.
This instructor-led, live training, available online or onsite, targets beginner-level developers, prompt engineers, and data practitioners aiming to design and build reliable, multi-step LLM workflows using LangGraph.
Upon completing this training, participants will be able to:
- Explain core LangGraph concepts (nodes, edges, state) and their appropriate use cases.
- Create prompt chains that branch, invoke tools, and maintain memory.
- Integrate retrieval mechanisms and external APIs into graph workflows.
- Test, debug, and evaluate LangGraph applications for reliability and safety.
Course Format
- Interactive lectures and guided discussions.
- Hands-on labs and code walkthroughs in a sandbox environment.
- Scenario-based exercises focused on design, testing, and evaluation.
Course Customization Options
- To request customized training for this course, please contact us to arrange.
LangGraph in Healthcare: Workflow Orchestration for Regulated Environments
35 Hours
LangGraph enables stateful, multi-actor workflows driven by large language models (LLMs), offering precise control over execution paths and state persistence. In the healthcare sector, these capabilities are essential for ensuring compliance, interoperability, and the development of decision-support systems that align with established medical workflows.
This instructor-led, live training (available online or on-site) is designed for intermediate to advanced-level professionals seeking to design, implement, and manage LangGraph-based healthcare solutions while addressing regulatory, ethical, and operational challenges.
Upon completion of this training, participants will be able to:
- Design healthcare-specific LangGraph workflows with compliance and auditability as core priorities.
- Integrate LangGraph applications with medical ontologies and standards such as FHIR, SNOMED CT, and ICD.
- Apply best practices to ensure reliability, traceability, and explainability within sensitive healthcare environments.
- Deploy, monitor, and validate LangGraph applications in production healthcare settings.
Course Format
- Interactive lectures and discussions.
- Practical exercises featuring real-world case studies.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized version of this training, please contact us to arrange.
LangGraph for Legal Applications
35 Hours
LangGraph is a framework designed for building stateful, multi-actor LLM applications as composable graphs, offering persistent state and precise control over execution.
This instructor-led, live training (available online or on-site) is tailored for intermediate to advanced professionals aiming to design, implement, and operate LangGraph-based legal solutions, ensuring robust compliance, traceability, and governance controls.
Upon completing this training, participants will be able to:
- Design legal-specific LangGraph workflows that maintain auditability and regulatory compliance.
- Integrate legal ontologies and document standards into graph state and processing logic.
- Implement guardrails, human-in-the-loop approvals, and fully traceable decision paths.
- Deploy, monitor, and maintain LangGraph services in production environments with effective observability and cost management.
Course Format
- Interactive lectures and group discussions.
- Extensive hands-on exercises and practical sessions.
- Real-time implementation within a live-lab environment.
Course Customization Options
- To request a customized version of this training, please contact us to arrange a session.
Building Dynamic Workflows with LangGraph and LLM Agents
14 Hours
LangGraph is a framework designed for creating graph-structured LLM workflows that support branching, tool usage, memory management, and controlled execution.
This instructor-led, live training (available online or on-site) is tailored for intermediate-level engineers and product teams who aim to integrate LangGraph's graph logic with LLM agent loops to develop dynamic, context-aware applications such as customer support agents, decision trees, and information retrieval systems.
By the end of this training, participants will be able to:
- Design graph-based workflows that coordinate LLM agents, tools, and memory.
- Implement conditional routing, retries, and fallback mechanisms for robust execution.
- Integrate retrieval systems, APIs, and structured outputs into agent loops.
- Evaluate, monitor, and strengthen agent behavior to ensure reliability and safety.
Course Format
- Interactive lectures and facilitated discussions.
- Guided labs and code walkthroughs in a sandbox environment.
- Scenario-based design exercises and peer reviews.
Course Customization Options
- To request a customized training session for this course, please contact us to arrange.
LangGraph for Marketing Automation
14 Hours
LangGraph is a graph-based orchestration framework that enables conditional, multi-step LLM and tool workflows, ideal for automating and personalizing content pipelines.
This instructor-led, live training (online or onsite) is aimed at intermediate-level marketers, content strategists, and automation developers who wish to implement dynamic, branching email campaigns and content generation pipelines using LangGraph.
By the end of this training, participants will be able to:
- Design graph-structured content and email workflows with conditional logic.
- Integrate LLMs, APIs, and data sources for automated personalization.
- Manage state, memory, and context across multi-step campaigns.
- Evaluate, monitor, and optimize workflow performance and delivery outcomes.
Course Format
- Interactive lectures and group discussions.
- Hands-on labs implementing email workflows and content pipelines.
- Scenario-based exercises on personalization, segmentation, and branching logic.
Course Customization Options
- To request customized training for this course, please contact us to arrange.
Le Chat Enterprise: Private ChatOps, Integrations & Admin Controls
14 Hours
Le Chat Enterprise is a secure, customizable, and governed conversational AI solution designed for organizations. It supports role-based access control (RBAC), single sign-on (SSO), connectors, and enterprise app integrations to enable private ChatOps.
This instructor-led live training (available online or onsite) targets intermediate-level product managers, IT leads, solution engineers, and security/compliance teams. Participants will learn to deploy, configure, and govern Le Chat Enterprise within enterprise environments.
Upon completion of this training, participants will be able to:
- Set up and configure Le Chat Enterprise for secure deployments.
- Enable RBAC, SSO, and compliance-driven controls.
- Integrate Le Chat with enterprise applications and data stores.
- Design and implement governance and admin playbooks for ChatOps.
Course Format
- Interactive lecture and discussion.
- Extensive exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request customized training for this course, please contact us to arrange.
Cost-Effective LLM Architectures: Mistral at Scale (Performance / Cost Engineering)
14 Hours
Mistral is a high-performance family of large language models optimized for cost-effective production deployment at scale.
This instructor-led, live training (online or onsite) is aimed at advanced-level infrastructure engineers, cloud architects, and MLOps leads who wish to design, deploy, and optimize Mistral-based architectures for maximum throughput and minimum cost.
Upon completing this training, participants will be able to:
- Implement scalable deployment patterns for Mistral Medium 3.
- Apply batching, quantization, and efficient serving strategies.
- Optimize inference costs while maintaining performance.
- Design production-ready serving topologies for enterprise workloads.
Course Format
- Interactive lecture and discussion.
- Extensive exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request customized training for this course, please contact us to arrange.
Productizing Conversational Assistants with Mistral Connectors & Integrations
14 Hours
Mistral AI is an open artificial intelligence platform that empowers teams to construct and incorporate conversational assistants into both enterprise-internal and customer-facing operational workflows.
This instructor-led live training (available online or onsite) is tailored for beginner to intermediate-level product managers, full-stack developers, and integration engineers looking to design, integrate, and commercialize conversational assistants by leveraging Mistral connectors and integrations.
Upon completing this training, participants will be equipped to:
- Connect Mistral conversational models with enterprise and SaaS connectors.
- Deploy retrieval-augmented generation (RAG) to ensure grounded and accurate responses.
- Create UX patterns for both internal and external chat assistants.
- Integrate assistants into product workflows for practical, real-world applications.
Course Format
- Interactive lectures and discussions.
- Practical integration exercises.
- Live-lab development of conversational assistants.
Customization Options
- To request a customized version of this course, please contact us to arrange details.
Enterprise-Grade Deployments with Mistral Medium 3
14 Hours
Mistral Medium 3 is a high-performance, multimodal large language model designed for production-grade deployment across enterprise environments.
This instructor-led, live training (online or onsite) is aimed at intermediate-level to advanced-level AI/ML engineers, platform architects, and MLOps teams who wish to deploy, optimize, and secure Mistral Medium 3 for enterprise use cases.
By the end of this training, participants will be able to:
- Deploy Mistral Medium 3 using API and self-hosted options.
- Optimize inference performance and costs.
- Implement multimodal use cases with Mistral Medium 3.
- Apply security and compliance best practices for enterprise environments.
Course Format
- Interactive lecture and discussion.
- Extensive exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request customized training for this course, please contact us to arrange.
Mistral for Responsible AI: Privacy, Data Residency & Enterprise Controls
14 Hours
Mistral AI is an open and enterprise-ready AI platform that provides features for secure, compliant, and responsible AI deployment.
This instructor-led, live training (online or onsite) is aimed at intermediate-level compliance leads, security architects, and legal/ops stakeholders who wish to implement responsible AI practices with Mistral by leveraging privacy, data residency, and enterprise control mechanisms.
By the end of this training, participants will be able to:
- Implement privacy-preserving techniques in Mistral deployments.
- Apply data residency strategies to meet regulatory requirements.
- Set up enterprise-grade controls such as RBAC, SSO, and audit logs.
- Evaluate vendor and deployment options for compliance alignment.
Course Format
- Interactive lecture and discussion.
- Compliance-focused case studies and exercises.
- Hands-on implementation of enterprise AI controls.
Course Customization Options
- To request customized training for this course, please contact us to arrange.
Multimodal Applications with Mistral Models (Vision, OCR, & Document Understanding)
14 Hours
Mistral models are open-source AI technologies that are now expanding into multimodal workflows, supporting both language and vision tasks for enterprise and research applications.
This instructor-led, live training (online or onsite) is aimed at intermediate-level ML researchers, applied engineers, and product teams who wish to build multimodal applications with Mistral models, including OCR and document understanding pipelines.
By the end of this training, participants will be able to:
- Set up and configure Mistral models for multimodal tasks.
- Implement OCR workflows and integrate them with NLP pipelines.
- Design document understanding applications for enterprise use cases.
- Develop vision-text search and assistive UI functionalities.
Course Format
- Interactive lecture and discussion.
- Hands-on coding exercises.
- Live-lab implementation of multimodal pipelines.
Course Customization Options
- To request customized training for this course, please contact us to arrange.