Maxtrain.com - info@maxtrain.com - 513-322-8888 - 866-595-6863
Introduction to Cassandra
Description
Introduction to Cassandra Training
Welcome to the “Introduction to Cassandra” course, designed to equip you with the skills necessary to excel in using the Cassandra database, a leader in scalability, high availability, and fault tolerance within the NoSQL database realm. In this two-day, hands-on training, you will dive into the intricacies of Cassandra’s architecture, learning to expand clusters seamlessly and enhance your applications’ capabilities.
Our course covers the essentials of tunable and eventual consistency, data modeling, and practical applications, providing a deep dive into the features that make Cassandra a top choice for modern data management needs.
Through a balanced mix of lectures and hands-on lab sessions, this course offers a comprehensive overview of Cassandra’s internal workings and best practices in schema design. You’ll engage in group discussions, lab exercises, and reviews that are designed to solidify your understanding and practical skills.
By the end of this course, you’ll be well-prepared to implement effective and optimal data models in Cassandra, ensuring your readiness to tackle the challenges of distributed databases in today’s data-driven landscape.
Introduction to Cassandra Course Objectives
Throughout the course students will learn to:
- Master Cassandra Fundamentals: Understand the core principles of Cassandra, including its architecture, scalability, and consistency models.
- Develop Effective Data Models: Learn to design and implement efficient and scalable data models tailored to Cassandra’s unique environment.
- Hands-On Skill Enhancement: Apply your knowledge through practical lab sessions, reinforcing learning through real-world scenarios.
- Navigate Advanced Features: Explore advanced Cassandra functionalities to optimize your database solutions for high performance and reliability.
- Preparation for Practical Application: Equip yourself with the necessary tools and knowledge to proficiently use Cassandra in professional settings.
Prerequisites
To ensure a smooth learning experience and maximize the benefits of attending this course, you should have the following prerequisite skills:
- Since Cassandra is a type of database, it is crucial that participants have some fundamental knowledge about databases. Knowing SQL would be beneficial. This includes understanding concepts such as tables, records, indexes, and queries.
- While not specific to any one language, participants should be comfortable with general programming concepts like variables, data types, loops, conditionals, and functions.
- Some of the operations with Cassandra will require using CLI tools. Therefore, attendees should be comfortable with using a command line interface on their chosen operating system.
Audience
- This introductory-level course is geared for data engineers, database administrators, system architects, and software developers, or those who are new to or have basic familiarity with NoSQL databases and are interested in building robust, scalable data-driven applications.
- Professionals who are tasked with managing or designing distributed data systems, working in industries where data scalability and availability are of high importance, will find this course particularly useful.
- Any individual involved in decision-making processes around technology choices, architecture or data modeling would benefit from the unique insights and practical skills developed in this hands-on course, ensuring optimal usage of the Cassandra database in production environments.
Introduction to Cassandra Outline
Cassandra Overview:
- Why We Need Cassandra – Big Data Challenges vs RDBMS
- High-level Cassandra Overview
- Cassandra Features
- Basic Cassandra Installation and Configuration
Cassandra Architecture and CQL Overview:
- Cassandra Architecture Overview
- Cassandra Clusters and Rings
- Nodes and Virtual Nodes
- Data Replication in Cassandra
- Introduction to CQL
- Defining Tables with a Single Primary Key
- Using cqlsh for Interactive Querying
- Selecting and Inserting/Upserting Data with CQL
- Data Replication and Distribution
- Basic Data Types (including uuid, timeuuid)
Data Modeling and CQL Core Concepts:
- Defining a Compound Primary Key
- CQL for Compound Primary Keys
- Partition Keys and Data Distribution
- Clustering Columns
- Overview of Internal Data Organization
- Overview of Other Querying Capabilities
- ORDER BY, CLUSTERING ORDER BY, UPDATE, DELETE, ALLOW FILTERING
- Batch Queries
- Data Modeling Guidelines
- Denormalization
- Data Modeling Workflow
- Data Modeling Principles
- Primary Key Considerations
- Composite Partition Keys
- Defining with CQL
- Data Distribution with Composite Partition Key
- Overview of Internal Data Organization
- Lab: Composite Partition Key (Substantial lab)
Additional CQL Capabilities:
- Indexing
- Primary/Partition Keys and Pagination with token()
- Secondary Indexes and Usage Guidelines
- Cassandra collections
- Collection Structure and Uses
- Defining and Querying Collections (set, list, and map)
- Materialized View
- Usage Guidelines
Data Consistency In Cassandra:
- Overview of Consistency in Cassandra
- CAP Theorem
- Eventual (Tunable) Consistency in C* – ONE, QUORUM, ALL
- Choosing CL ONE
- Choosing CL QUORUM
- Achieving Immediate Consistency
- Overview of Other Consistency Levels
- Supportive Consistency Mechanisms
- Writing / Hinted Handoff
- Read Repair
- Nodetool repair
Internal Mechanisms:
- Ring Details
- Partitioners
- Gossip Protocol
- Snitches
- Write Path
- Overview / Commit Log
- Memtables and SSTables
- Write Failure
- Unavailable Nodes and Node Failure
- Requirements for Write Operations
- Read Path Overview
- Read Mechanism
- Replication and Caching
- Deletion/Compaction Overview
- Delete Mechanism
- Tombstones and Compaction
OPTIONAL: Working with IntelliJ:
- Configuring JDBC Data Source for Cassandra
- Reading Schema Information
- Querying and Editing Tables
$1995.00
|
3 Days Course |