Get in Touch

Course Outline

Introduction to Cambricon and MLU Architecture

  • Survey of Cambricon’s AI chip portfolio
  • Overview of MLU architecture and instruction pipelines
  • Supported model types and applicable use cases

Setting Up the Development Toolchain

  • Installation of BANGPy and Neuware SDK
  • Configuring environments for Python and C++
  • Ensuring model compatibility and preprocessing steps

Model Development Using BANGPy

  • Management of tensor structures and shapes
  • Construction of computation graphs
  • Support for custom operations within BANGPy

Deployment via Neuware Runtime

  • Converting and loading models
  • Controlling execution and inference
  • Best practices for edge and data center deployment

Performance Optimization

  • Tuning layers and managing memory mapping
  • Utilizing execution tracing and profiling tools
  • Identifying and resolving common bottlenecks

Integrating MLU into Applications

  • Leveraging Neuware APIs for application integration
  • Supporting streaming and multi-model scenarios
  • Implementing hybrid CPU-MLU inference setups

End-to-End Project and Use Case

  • Lab: Deploying vision or NLP models
  • Conducting edge inference with BANGPy integration
  • Evaluating accuracy and throughput

Summary and Next Steps

Requirements

  • Fundamental understanding of machine learning model architectures
  • Proficiency in Python and/or C++ programming
  • Familiarity with concepts of model deployment and acceleration

Audience

  • Developers specializing in embedded AI
  • ML engineers focusing on edge or datacenter deployments
  • Developers working with Chinese AI infrastructure
 21 Hours

Testimonials (1)

Related Categories