Cassandra Training In Bangalore

Apache Cassandra
=====
* Course Id : NSQL-CASS
* Duration : 32 Hours

Overview
—–
* Cassandra is a distributed NoSQL database from Apache designed to manage very large volumes of data across many servers
* The training covers Cassandra concepts, features, architecture, and data model, as well as how to install, configure, and monitor the open-source database
* You will gain hands-on experience with a high-volume database management system that is commonly used alongside Hadoop in Big Data environments
* You will also learn database backup and restore, performance tuning, and hosting a Cassandra database in the cloud

Pre-Requisites
—–
* Participants should have a working knowledge of the Linux command line
* An understanding of a programming language such as Java, and of basic database concepts, is preferred

Training Objectives
—–
* Understand NoSQL databases and where Cassandra is used
* Install and manage a single-node Cassandra cluster
* Understand the Apache Cassandra architecture
* Design and model applications for Cassandra
* Learn about keyspaces and tables
* Understand Admin Operations for Managing a Cluster
* Implement Backup and Recovery Strategies for Cassandra
* Host Cassandra on Cloud

Course Structure
—–
* Our technical courses emphasize hands-on work (typically 80% hands-on, 20% theory)
* Students learn to apply the material to real-world problems

Materials Provided
—–
* PDF of slides and hands-on exercises
* Access to an instance with the lab environment

Software Requirements
—–
Any one of the following:
* Any current internet browser
* VNC client
* RDP client

Hardware Requirements
—–
* Processor: 1.2 GHz
* RAM: 512 MB
* Disk space: 1 GB
* Network connection to the Internet with low latency (< 250 ms)

Day-wise Course Outline
—–

## Day 1
—–

* Unit 1 : Introduction to Big Data and Cassandra
* Unit 2 : Cassandra Data Model
## Day 2
—–

* Unit 3 : Cassandra Architecture
* Unit 4 : Deep Dive into Cassandra Database
## Day 3
—–

* Unit 5 : Node Operations in a Cluster
* Unit 6 : Managing and Monitoring the Cluster
## Day 4
—–

* Unit 7 : Backup and Restore and Performance Tuning
* Unit 8 : Hosting Cassandra Database on Cloud

Detailed Cassandra Course Outline
—–
Unit 1 : Introduction to Big Data and Cassandra
* Introduction to Big Data and its Problems
* Volume, Variety, Velocity, Veracity and Value
* Traditional Database Management System
* Limitations of RDBMS
* NoSQL databases
* Common characteristics of NoSQL databases
* CAP theorem
* How Cassandra addresses these limitations

Unit 2 : Cassandra Data Model
* Introduction to Database Model
* Analogy between RDBMS and Cassandra Data Model
* Database Elements: Cluster, Keyspace, Column Family/Table, Column
* Column Family Options
* Columns
* Wide Rows, Skinny Rows
* Static and dynamic tables

Unit 3 : Cassandra Architecture
* Cassandra as a Distributed Database
* Key Cassandra Elements
* Memtable
* Commit log
* SSTables
* Replication Factor
* Data Replication in Cassandra
* Gossip protocol – Detecting failures
* Gossip: Uses
* Snitch: Uses
* Data Distribution
* Staged Event-Driven Architecture (SEDA)
* Managers and Services
* Virtual Nodes
* Write path and Read path
* Consistency level
* Repair
* Incremental repair
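
As a small illustration of how the replication factor and consistency level topics in this unit relate, the sketch below computes the QUORUM size for a given replication factor and checks the usual overlap rule (read replicas + write replicas > replication factor). The function names are hypothetical and chosen for this example only; the snippet does not talk to a cluster.

```python
def quorum(replication_factor: int) -> int:
    """Number of replicas that must respond at QUORUM: floor(RF / 2) + 1."""
    if replication_factor < 1:
        raise ValueError("replication factor must be at least 1")
    return replication_factor // 2 + 1


def reads_overlap_writes(replication_factor: int,
                         write_replicas: int,
                         read_replicas: int) -> bool:
    """True when every read replica set intersects every write replica set."""
    return read_replicas + write_replicas > replication_factor


rf = 3
q = quorum(rf)
print(q)                                 # 2
print(reads_overlap_writes(rf, q, q))    # True: QUORUM writes + QUORUM reads
print(reads_overlap_writes(rf, 1, 1))    # False: ONE writes + ONE reads
```

With a replication factor of 3, QUORUM tolerates one unavailable replica while still guaranteeing that a read touches at least one replica that received the latest write.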

Unit 4 : Deep Dive into Cassandra Database
* Replication Factor
* Replication Strategy
* Defining columns and data types
* Defining a partition key
* Recognizing a partition key
* Specifying a descending clustering order
* Updating data
* Tombstones
* Deleting data
* Using TTL
* Updating a TTL
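
The topics in this unit map onto a handful of CQL statements. The sketch below collects them as Python strings so they can be pasted into `cqlsh` or passed to a driver; the keyspace `training`, the table `sensor_readings`, and all column names and values are hypothetical, and `SimpleStrategy` with a replication factor of 1 is assumed only because it suits a single-node lab setup.

```python
# All keyspace, table, and column names below are made up for illustration.

# Replication strategy and replication factor are set per keyspace.
CREATE_KEYSPACE = """
CREATE KEYSPACE IF NOT EXISTS training
WITH replication = {'class': 'SimpleStrategy', 'replication_factor': 1};
"""

# sensor_id is the partition key; reading_time is a clustering column
# stored in descending order, so the newest reading comes first.
CREATE_TABLE = """
CREATE TABLE IF NOT EXISTS training.sensor_readings (
    sensor_id    text,
    reading_time timestamp,
    temperature  double,
    PRIMARY KEY ((sensor_id), reading_time)
) WITH CLUSTERING ORDER BY (reading_time DESC);
"""

# The inserted values expire 86400 seconds (one day) after the write.
INSERT_WITH_TTL = """
INSERT INTO training.sensor_readings (sensor_id, reading_time, temperature)
VALUES ('s-1', '2024-01-01 00:00:00+0000', 21.5)
USING TTL 86400;
"""

# A TTL is changed by writing the value again with a new TTL.
UPDATE_TTL = """
UPDATE training.sensor_readings USING TTL 3600
SET temperature = 21.5
WHERE sensor_id = 's-1' AND reading_time = '2024-01-01 00:00:00+0000';
"""

# A delete does not remove data immediately; it writes a tombstone.
DELETE_ROW = """
DELETE FROM training.sensor_readings
WHERE sensor_id = 's-1' AND reading_time = '2024-01-01 00:00:00+0000';
"""

STATEMENTS = [CREATE_KEYSPACE, CREATE_TABLE, INSERT_WITH_TTL, UPDATE_TTL, DELETE_ROW]

for statement in STATEMENTS:
    print(statement.strip())
    print()
```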

Unit 5 : Node Operations in a Cluster
* Cassandra nodes
* Specifying seed nodes
* Bootstrapping a node
* Adding (Commissioning) a node to the cluster
* Removing (Decommissioning) a node
* Removing a dead node
* Repair
* Read Repair
* What’s new in incremental repair
* Run a Repair Operation
* Cassandra and Spark Implementation

Unit 6 : Managing and Monitoring the Cluster
* Cassandra monitoring tools
* Logging
* Tailing
* Using Nodetool Utility
* Using JConsole
* Learning about OpsCenter
* Runtime Analysis Tools
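
For the Nodetool topic, a minimal sketch of how routine `nodetool` calls might be scripted from Python is shown below. The helper names, the keyspace `training`, and the snapshot tag `before_upgrade` are assumptions made for this example; `status`, `repair`, and `snapshot -t` are standard `nodetool` subcommands. The runner returns `None` when `nodetool` is not on the PATH, so the snippet is safe to run on a machine without Cassandra.

```python
import shutil
import subprocess
from typing import List, Optional


def nodetool_command(subcommand: str, *args: str) -> List[str]:
    """Build the argument list for a nodetool invocation."""
    return ["nodetool", subcommand, *args]


def run_if_available(command: List[str], timeout: int = 60) -> Optional[str]:
    """Run the command and return its stdout, or None if the tool is missing."""
    if shutil.which(command[0]) is None:
        return None
    result = subprocess.run(command, capture_output=True, text=True,
                            check=False, timeout=timeout)
    return result.stdout


# Hypothetical keyspace and snapshot tag, used for illustration only.
status = nodetool_command("status")
repair = nodetool_command("repair", "training")
snapshot = nodetool_command("snapshot", "-t", "before_upgrade", "training")

for command in (status, repair, snapshot):
    print(" ".join(command))
```

Calling `run_if_available(status)` on a lab node would then return the cluster status text; commands such as `repair` are better triggered deliberately than from a demo script.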

Unit 7 : Backup and Restore and Performance Tuning
* Creating a Snapshot
* Restoring from a Snapshot
* RAM and CPU recommendations
* Hardware choices
* Selecting storage
* Types of Storage to Avoid
* Cluster connectivity, security and the factors that affect distributed system performance
* End-to-end performance tuning of Cassandra clusters against very large data sets
* Load balance and streams

Unit 8 : Hosting Cassandra Database on Cloud
* Security
* Ongoing Support of Cassandra Operational Data
* Hosting a Cassandra Database on Cloud
