Thank you for sending your enquiry! One of our team members will contact you shortly.
Thank you for sending your booking! One of our team members will contact you shortly.
Course Outline
- Section 1: Introduction to Big Data / NoSQL
- Overview of NoSQL
- Understanding the CAP theorem
- When NoSQL is the right choice
- Columnar storage mechanisms
- The NoSQL ecosystem
- Section 2 : Cassandra Basics
- Design principles and architecture
- Understanding Cassandra nodes, clusters, and datacenters
- Keyspaces, tables, rows, and columns
- Partitioning, replication, and tokens
- Quorum and consistency levels
- Labs : Interacting with Cassandra via CQLSH
- Section 3: Data Modeling – part 1
- Introduction to CQL
- CQL data types
- Creating keyspaces and tables
- Selecting appropriate columns and types
- Defining primary keys
- Designing row and column layouts
- Time to live (TTL)
- Executing queries with CQL
- Performing CQL updates
- Working with collections (list, map, set)
- Labs : Various data modeling exercises using CQL; experimenting with queries and supported data types
- Section 4: Data Modeling – part 2
- Creating and utilizing secondary indexes
- Composite keys (partition keys and clustering keys)
- Handling time series data
- Best practices for time series modeling
- Using counters
- Lightweight transactions (LWT)
- Labs : Creating and applying indexes; modeling time series data
- Section 5 : Data Modeling Labs : Group design session
- Presentation of multiple use cases across various domains
- Collaborative group work to develop system designs and data models
- Discussion and analysis of different design approaches
- Lab : Implementation of one selected scenario
- Section 6: Cassandra drivers
- Introduction to the Java driver
- Performing CRUD (Create, Read, Update, Delete) operations using the Java client
- Executing asynchronous queries
- Labs : Utilizing the Java API for Cassandra
- Section 7 : Cassandra Internals
- Understanding Cassandra’s underlying design
- Working with SSTables, MemTables, and the commit log
- The read and write pathways
- Caching strategies
- vnodes
- Section 8: Administration
- Guidelines for hardware selection
- Overview of Cassandra distributions
- Installing Cassandra
- Conducting performance benchmarks
- Tools for monitoring performance and node activities
- DataStax OpsCenter
- Troubleshooting Cassandra performance issues
- Investigating node crashes
- Understanding data repair, deletion, and replication
- Additional troubleshooting tools and tips
- Cassandra best practices (compaction, garbage collection)
- Section 9: Bonus Lab (time permitting)
- Implementing a music streaming service, such as Pandora or Spotify, on Cassandra
Requirements
- Familiarity with the Java programming language
- Comfort working within a Linux environment (command-line navigation, editing files using vi or nano)
Lab environment:
Participants will be provided with a fully functional Cassandra environment. An SSH client and a web browser are required to access the cluster.
Zero Install : No installation of Cassandra on personal devices is necessary.
21 Hours
Testimonials (1)
It was informative.