Building Data Pipelines with Apache Kafka Training Course
Apache Kafka is a distributed streaming platform. It has become the de facto standard for building data pipelines and addresses a wide range of data processing use cases: it can function as a message queue, distributed log, stream processor, and more.
We will begin with foundational concepts of data pipelines in general, then delve into the core principles of Kafka. Additionally, we will explore key components such as Kafka Streams and Kafka Connect.
This course is available as onsite live training in Uzbekistan or online live training.Course Outline
- Data pipelines 101: ingestion, storage, processing
- Kafka fundamentals: topics, partitions, brokers, replication, and more
- Producer and Consumer APIs
- Kafka Streams as a processing layer
- Kafka Connect for integration with external systems
- Best practices and tuning for Kafka
Requirements
A basic understanding of Java 8 or Scala is recommended. If you wish to run examples locally, please install Docker and Docker Compose.
Need help picking the right course?
uzbekistan@nobleprog.com or +919818060888
Building Data Pipelines with Apache Kafka Training Course - Enquiry
Building Data Pipelines with Apache Kafka - Consultancy Enquiry
Testimonials (2)
Possibility to perform independent exercises in the training environment.
Tomasz - PKO Zycie Towarzystwo Ubezpieczen S.A.
Course - Kafka for Administrators
The trainer tried to make the most complicated topics , explain it in simpler way
Calvin Raj Antony - SICPA SA
Course - Administration of Kafka Message Queue
Related Courses
Administration of Confluent Apache Kafka
21 HoursConfluent Apache Kafka is a distributed event streaming platform built for high-throughput, fault-tolerant data pipelines and real-time analytics.
This instructor-led live training (available online or on-site) targets intermediate-level system administrators and DevOps professionals who want to learn how to install, configure, monitor, and troubleshoot Confluent Apache Kafka clusters.
By the end of this training, participants will be able to:
- Grasp the components and architecture of Confluent Kafka.
- Deploy and manage Kafka brokers, Zookeeper quorums, and essential services.
- Configure advanced features such as security, replication, and performance tuning.
- Utilize management tools to monitor and maintain Kafka clusters.
Course Format
- Interactive lectures and discussions.
- Extensive exercises and practice sessions.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request customized training for this course, please contact us to arrange it.
Apache Kafka Connect
7 HoursThis instructor-led, live training in Uzbekistan (available online or on-site) is intended for developers who wish to integrate Apache Kafka with existing databases and applications for processing, analysis, and other use cases.
By the end of this training, participants will be able to:
- Use Kafka Connect to ingest large amounts of data from a database into Kafka topics.
- Ingest log data generated by application servers into Kafka topics.
- Make all collected data available for stream processing.
- Export data from Kafka topics into secondary systems for storage and analysis.
Big Data Streaming for Developers
14 HoursMaster the implementation of complete big data streaming scenarios. Gain skills in real-time data preparation and maintenance using Informatica, Edge, Kafka, and Spark. This training addresses software versions 10.2.1 and later.
Confluent Apache Kafka: Cluster Operations and Configuration
16 HoursConfluent Apache Kafka is an enterprise-grade distributed event streaming platform built on Apache Kafka. It supports high-throughput, fault-tolerant data pipelines and real-time streaming applications.
This instructor-led, live training (online or onsite) is aimed at intermediate-level engineers and administrators who wish to deploy, configure, and optimize Confluent Kafka clusters in production environments.
By the end of this training, participants will be able to:
- Install, configure, and operate Confluent Kafka clusters with multiple brokers.
- Design high-availability setups using Zookeeper and replication techniques.
- Tune performance, monitor metrics, and apply recovery strategies.
- Secure, scale, and integrate Kafka with enterprise environments.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Building Kafka Solutions with Confluent
14 HoursThis instructor-led, live training (online or onsite) is designed for engineers who wish to use Confluent (a distribution of Kafka) to build and manage a real-time data processing platform for their applications.
By the end of this training, participants will be able to:
- Install and configure Confluent Platform.
- Leverage Confluent's management tools and services to operate Kafka more efficiently.
- Store and process incoming stream data.
- Optimize and manage Kafka clusters.
- Secure data streams.
Course Format
- Interactive lectures and discussions.
- Extensive exercises and hands-on practice.
- Real-world implementation in a live-lab environment.
Course Customization Options
- This course is based on the open source version of Confluent: Confluent Open Source.
- To request a customized training session for this course, please contact us to arrange.
A Practical Introduction to Stream Processing
21 HoursIn this instructor-led, live training session in Uzbekistan (onsite or remote), participants will learn how to set up and integrate various Stream Processing frameworks with existing big data storage systems, software applications, and microservices.
By the end of this training, participants will be able to:
- Install and configure different Stream Processing frameworks, such as Spark Streaming and Kafka Streaming.
- Understand and select the most suitable framework for specific tasks.
- Process data continuously, concurrently, and on a record-by-record basis.
- Integrate Stream Processing solutions with existing databases, data warehouses, data lakes, and more.
- Integrate the most appropriate stream processing library with enterprise applications and microservices.
Distributed Messaging with Apache Kafka
14 HoursDesigned for enterprise architects, developers, system administrators, and professionals seeking to master high-throughput distributed messaging systems, this course provides comprehensive insights into Apache Kafka. If you have specific focus areas, such as exclusively system administration, the curriculum can be customized to align with your unique requirements.
Kafka for Administrators
21 HoursThis instructor-led, live training in Uzbekistan (online or onsite) is aimed at beginner-level / intermediate-level / advanced-level system administrators and operations engineers who wish to use Apache Kafka to deploy, secure, monitor, and troubleshoot Kafka clusters.
By the end of this training, participants will be able to: explain Kafka architecture and KRaft mode, operate and secure Kafka clusters, monitor performance and reliability, and resolve common production issues.
Apache Kafka for Developers
21 HoursDesigned for intermediate-level developers aiming to build big data applications using Apache Kafka, this instructor-led live training in Uzbekistan (online or onsite) offers comprehensive guidance.
Upon completion of this course, participants will have the ability to:
- Create Kafka producers and consumers to send and retrieve data.
- Connect Kafka with external systems via Kafka Connect.
- Build streaming applications utilizing Kafka Streams & ksqlDB.
- Link a Kafka client application to Confluent Cloud for cloud-based Kafka deployments.
- Acquire practical skills through hands-on exercises and real-world use cases.
Apache Kafka for Python Programmers
7 HoursThis instructor-led, live training in Uzbekistan (online or onsite) is designed for data engineers, data scientists, and programmers who wish to utilize Apache Kafka features for data streaming with Python.
By the end of this training, participants will be able to use Apache Kafka to monitor and manage conditions in continuous data streams using Python programming.
Kafka Fundamentals for Java Developers
14 HoursThis instructor-led, live training in Uzbekistan (online or onsite) targets intermediate-level Java developers who wish to integrate Apache Kafka into their applications for reliable, scalable, and high-throughput messaging.
Upon completion of this training, participants will be able to:
- Comprehend Kafka's architecture and its core components.
- Provision and configure a Kafka cluster.
- Produce and consume messages using Java.
- Deploy Kafka Streams for real-time data processing.
- Guarantee fault tolerance and scalability within Kafka applications.
Administration of Kafka Message Queue
14 HoursThis instructor-led, live training in Uzbekistan (online or onsite) is aimed at intermediate-level system administrators who wish to harness Kafka's message queuing features effectively.
By the end of this training, participants will be able to:
- Understand Kafka's message queuing capabilities and architecture.
- Configure Kafka topics for message queuing scenarios.
- Produce and consume messages using Kafka.
- Monitor and manage Kafka as a message queue.
Security for Apache Kafka
7 HoursThis instructor-led, live training in Uzbekistan (online or onsite) is aimed at software testers who wish to implement network security measures into an Apache Kafka application.
By the end of this training, participants will be able to:
- Deploy Apache Kafka onto a cloud based server.
- Implement SSL encryption to prevent attacks.
- Add ACL authentication to track and control user access.
- Ensure credible clients have access to Kafka clusters with SSL and SASL authentication.
Apache Kafka and Spring Boot
7 HoursThis instructor-led, live training in Uzbekistan (online or onsite) is aimed at intermediate-level developers who wish to learn the fundamentals of Kafka and integrate it with Spring Boot.
By the end of this training, participants will be able to:
- Understand Kafka and its architecture.
- Learn how to install, configure, and set up a basic Kafka environment.
- Integrate Kafka with Spring Boot.
Stream Processing with Kafka Streams
7 HoursKafka Streams is a client-side library designed for building applications and microservices that exchange data with the Apache Kafka messaging system. Traditionally, Apache Kafka has depended on Apache Spark or Apache Storm to process data between message producers and consumers. By invoking the Kafka Streams API directly within an application, data can be processed in real time within Kafka itself, eliminating the need to forward data to a separate cluster for processing.
In this instructor-led live training, participants will learn how to integrate Kafka Streams into a set of sample Java applications that send and receive data via Apache Kafka for stream processing.
By the end of this training, participants will be able to:
- Understand the key features of Kafka Streams and its advantages over other stream processing frameworks
- Process stream data directly within a Kafka cluster
- Develop Java or Scala applications or microservices that integrate with Kafka and Kafka Streams
- Write concise code to transform input Kafka topics into output Kafka topics
- Build, package, and deploy the application
Target Audience
- Developers
Course Format
- A combination of lecture, discussion, exercises, and extensive hands-on practice
Notes
- To request a customized version of this course, please contact us to arrange.