SlideShare a Scribd company logo
1 of 34
Download to read offline
+
NoSQL – Part 2
CAP Theorem & Column Oriented
Mohammad Sadegh Salehi
Dr.Baraani
Winter2015 Sheikh Bahaie
University
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
2
Winter 2015
Agenda
—Review NoSQL
—Dynamo and BigTable
—NoSQL Classification
—Key-value Stores
—Column Oriented
—Casandra
—Why Casandra
—Question
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
3
Winter 2015
What is NoSQL
review
 Stands for Not Only SQL
 Class of non-relational data storage systems
 Usually do not require a fixed table schema nor do
they use the concept of joins
 All NoSQL offerings relax one or more of the ACID
properties (will talk about the CAP theorem)
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
4
Winter 2015
Dynamo and BigTable
 Three major papers were the seeds of the NoSQL
movement
• BigTable (Google)
• Dynamo (Amazon)
—Gossip protocol (discovery and error detection)
—Distributed key-value data store
—Eventual consistency
• CAP Theorem (discuss in a sec ..)
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
5
Winter 2015
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
7
Winter 2015
What kinds of NoSQL
Review
 NoSQL solutions fall into two major areas:
• Key/Value or ‘the big hash table’.
—Amazon S3 (Dynamo)
—Voldemort
—Scalaris
• Schema-less which comes in multiple flavors, column-
based, document-based or graph-based.
—Cassandra (column-based)
—CouchDB (document-based)
—Neo4J (graph-based)
—HBase (column-based)
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
8
Winter 2015
Key-Value Stores
 Extremely simple interface
• Data model: (key, value) pairs
• Operations:
—Insert(key,value),
—Fetch(key),
—Update(key),
—Delete(key).
 Implementation: efficiency, scalability, fault-
tolerance
• Records distributed to nodes based on key
• Replication
• Single-record transactions,“eventual consistency”
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
9
Winter 2015
Key-Value Data Stores
 Storing Session Information
 User Profiles, Preferences: Almost every user has
a unique userID as well as preferences such as
language, color, timezone, which products the
user has access to , and so on.
Suitable Use Cases
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
10
Winter 2015
Key-Value Data Stores
 As we want the shopping carts to be available
all the time, across browsers, machines, and
sessions, all the shopping information can be put
into value where the key is the userID
Shopping Cart Data
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
11
Winter 2015
Key-Value Data Stores
 Relationships among data
 Multi-operation Transactions
 Query by Data
 Operations by Sets
Not to Use
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
12
Winter 2015
Column-oriented
 Store data in column order
 Allow key-value pairs to be stored (and retrieved
on key) in a massively parallel system,
• Data model: families of attributes defined in a schema,
new attributes can be added,
• Storing principle: big hashed distributed tables,
• Properties: partitioning (horizontally and/or vertically),
high availability etc. completely transparent to
application,
Intro
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
13
Winter 2015
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
14
Winter 2015
Cassandra
 Apache Cassandra™ is a free
 Distributed…
 High performance…
 Extremely scalable…
 Fault tolerant (i.e. no single point of failure)…
 Post-relational database solution.
 Cassandra can serve as both real-time datastore and as a
read-intensive database.
 Compiles to: C++, Java, PHP, Ruby, Erlang, Perl, ...
Thrift
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
15
Winter 2015
Cassandra
Infographic
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
16
Winter 2015
Cassandra
 Originally developed at Facebook
 Follows the BigTable data model: column-oriented
 Uses the Dynamo Eventual Consistency model
 Written in Java
 Open-sourced and exists within the Apache family
 Uses Apache Thrift as it’s API
 Some of its myriad users:
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
17
Winter 2015
Cassandra
 keyspace: Usually the name of the application; e.g.,
'Twitter', 'Wordpress‘.
 column family: structure containing an unlimited
number of rows
• Simple
• Super (nested Column Families)
 column: a tuple with name, value and time stamp
• Each Column has
— Name
— Value
— Timestamp
 key: name of record
 super column: contains more columns
Data Model
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
18
Winter 2015
Cassandra – Data Model
keyspace
settings
column family
settings
column
name value timestamp
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
19
Winter 2015
Cassandra
Column Family & Super Column Family
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
20
Winter 2015
Cassandra
 Cassandra was designed with the understanding that
system/hardware failures can and do occur
 Peer-to-peer, distributed system
 All nodes the same
 Data partitioned among all nodes
in the cluster
 Custom data replication to ensure
fault tolerance
 Read/Write-anywhere design
Architecture Overview
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
21
Winter 2015
Cassandra
 Each node communicates with each other through the
Gossip protocol, which exchanges information across
the cluster every second,
 A commit log is used on each node to capture write
activity. Data durability is assured,
 Data also written to an in-memory
structure (memtable) and then to
disk once the memory structure is
full (an SStable).
Architecture Overview
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
22
Winter 2015
Why Cassandra?
 Gigabyte to Petabyte scalability
 Linear performance gains through adding nodes
 No single point of failure
 Easy replication / data distribution
 Multi-data center and Cloud capable
 No need for separate caching layer
 Tunable data consistency
 Flexible schema design
 Data Compression
 CQL language (like SQL)
 Support for key languages and platforms
 No need for special hardware or software
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
23
Winter 2015
Why Cassandra?
 Capable of comfortably scaling to petabytes
 New nodes = Linear performance increases
 Add new nodes online
Big Data Scalability
1
2
Double Throughput
Capabilities
1
2
3
4
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
24
Winter 2015
Why Cassandra?
 All nodes the same
 Customized replication affords tunable data redundancy
 Read/write from any node
 Can replicate data among different physical data center
racks
No Single Point of Failure
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
25
Winter 2015
Why Cassandra?
 Peer-to-peer architecture removes need for special
caching layer and the programming that goes with it
 The database cluster uses the memory from all
participating nodes to cache the data assigned to each
node
 No irregularities between a memory cache and database
are encountered
No Need for Caching Software
Database Server
Memcached Servers
Application Servers
Writes
Reads
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
26
Winter 2015
Why Cassandra?
 Uses Google’s Snappy data compression algorithm
 Compresses data on a per column family level
 Internal tests at DataStax show up to 80%+ compression
of raw data
 No performance penalty (and some increases in overall
performance due to less physical I/O)!
Data Compression
Portfolio Keyspace
Customer Column Family
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
27
Winter 2015
Why Cassandra?
 Very similar to RDBMS SQL syntax
 Create objects via DDL (e.g. CREATE…)
 Core DML commands supported: INSERT, UPDATE,
DELETE
 Query data with SELECT
CQL Language
Portfolio Keyspace
1
2
3
4
5
6
SELECT *
FROM USERS
WHERE STATE = ‘TX’;
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
28
Winter 2015
Comparison with MySQL
 MySQL > 50 GB Data
Writes Average : ~300 ms
Reads Average : ~350 ms
 Stats provided by Authors using facebook data.
 Cassandra > 50 GB Data
Writes Average : 0.12 ms
Reads Average : 15 ms
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
29
Winter 2015
Cassandra Tools
........DesktopnoSqlCassandra-sadeghnoSqlCassandra-sadegh.mp4
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
30
Winter 2015
Where to get Cassandra?
 Go to www.datastax.com
 DataStax makes free smart start installers available for
Cassandra that include:
• The most up-to-date Cassandra version that is production quality
• A version of DataStax OpsCenter, which is a visual, browser-
based management tool for managing and monitoring
Cassandra
• Drivers and connectors for popular development languages
• Same database and application
• Automatic configuration assistance for ensuring optimal
performance and setup for either stand-alone or cluster
implementations
• Getting Started Guide
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
31
Winter 2015
Where Can I Learn More?
www.datastax.com
 Free Online Documentation
 User/Customer Cas Studies
 Technical White Papers
 Software downloads
 Technical Articles
 User Forums
 Videos
 Tutorials
 FAQ’s
 Blogs
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
32
Winter 2015
Resources
Sites
 Cassandra
• http://cassandra.apache.org
 NoSQL News websites
• http://nosql.mypopescu.com
• http://www.nosqldatabases.com
 “a practical guide to noSQL”, Posted by Denise Miura on
March 17, 2011 at
• http://blogs.marklogic.com/2011/03/17/a-practical-
guide-to-nosql/
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
33
Winter 2015
Resources
Books
 “Cassandra The Definition Guide”, O'Reilly Media, nov2013
 “Cassandra Essential Toturial”, DataStax 2014
 “Professional NoSQL”, Wrox, 2011
 “NoSQL Distilled”, Martin Fowler, 2013
+
NoSQL (part 2) - CAP Theorem & Column Oriented
33
34
Winter 2015
Questions
+
Mohammad Sadegh Salehi
3adegh.ce@gmail.com
Thank You

More Related Content

What's hot

Introduction to snowflake
Introduction to snowflakeIntroduction to snowflake
Introduction to snowflakeSunil Gurav
 
Introduction to NOSQL databases
Introduction to NOSQL databasesIntroduction to NOSQL databases
Introduction to NOSQL databasesAshwani Kumar
 
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...Cathrine Wilhelmsen
 
Intro to Delta Lake
Intro to Delta LakeIntro to Delta Lake
Intro to Delta LakeDatabricks
 
Cassandra an overview
Cassandra an overviewCassandra an overview
Cassandra an overviewPritamKathar
 
Azure data platform overview
Azure data platform overviewAzure data platform overview
Azure data platform overviewJames Serra
 
NOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQLNOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQLRamakant Soni
 
Understanding Data Partitioning and Replication in Apache Cassandra
Understanding Data Partitioning and Replication in Apache CassandraUnderstanding Data Partitioning and Replication in Apache Cassandra
Understanding Data Partitioning and Replication in Apache CassandraDataStax
 
Simplify and Scale Data Engineering Pipelines with Delta Lake
Simplify and Scale Data Engineering Pipelines with Delta LakeSimplify and Scale Data Engineering Pipelines with Delta Lake
Simplify and Scale Data Engineering Pipelines with Delta LakeDatabricks
 
Why Data Virtualization? An Introduction
Why Data Virtualization? An IntroductionWhy Data Virtualization? An Introduction
Why Data Virtualization? An IntroductionDenodo
 
Mapping Data Flows Training deck Q1 CY22
Mapping Data Flows Training deck Q1 CY22Mapping Data Flows Training deck Q1 CY22
Mapping Data Flows Training deck Q1 CY22Mark Kromer
 
Introduction to column oriented databases
Introduction to column oriented databasesIntroduction to column oriented databases
Introduction to column oriented databasesArangoDB Database
 
MongoDB: Advance concepts - Replication and Sharding
MongoDB: Advance concepts - Replication and ShardingMongoDB: Advance concepts - Replication and Sharding
MongoDB: Advance concepts - Replication and ShardingKnoldus Inc.
 
Azure Synapse Analytics Overview (r2)
Azure Synapse Analytics Overview (r2)Azure Synapse Analytics Overview (r2)
Azure Synapse Analytics Overview (r2)James Serra
 
Oracle backup and recovery
Oracle backup and recoveryOracle backup and recovery
Oracle backup and recoveryYogiji Creations
 
introduction to NOSQL Database
introduction to NOSQL Databaseintroduction to NOSQL Database
introduction to NOSQL Databasenehabsairam
 

What's hot (20)

Cassandra Database
Cassandra DatabaseCassandra Database
Cassandra Database
 
Introduction to snowflake
Introduction to snowflakeIntroduction to snowflake
Introduction to snowflake
 
Introduction to NOSQL databases
Introduction to NOSQL databasesIntroduction to NOSQL databases
Introduction to NOSQL databases
 
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...
 
Intro to Delta Lake
Intro to Delta LakeIntro to Delta Lake
Intro to Delta Lake
 
Cassandra an overview
Cassandra an overviewCassandra an overview
Cassandra an overview
 
Azure data platform overview
Azure data platform overviewAzure data platform overview
Azure data platform overview
 
NOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQLNOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQL
 
Understanding Data Partitioning and Replication in Apache Cassandra
Understanding Data Partitioning and Replication in Apache CassandraUnderstanding Data Partitioning and Replication in Apache Cassandra
Understanding Data Partitioning and Replication in Apache Cassandra
 
Microsoft Fabric.pptx
Microsoft Fabric.pptxMicrosoft Fabric.pptx
Microsoft Fabric.pptx
 
Cassandra 101
Cassandra 101Cassandra 101
Cassandra 101
 
Introduction to SQL
Introduction to SQLIntroduction to SQL
Introduction to SQL
 
Simplify and Scale Data Engineering Pipelines with Delta Lake
Simplify and Scale Data Engineering Pipelines with Delta LakeSimplify and Scale Data Engineering Pipelines with Delta Lake
Simplify and Scale Data Engineering Pipelines with Delta Lake
 
Why Data Virtualization? An Introduction
Why Data Virtualization? An IntroductionWhy Data Virtualization? An Introduction
Why Data Virtualization? An Introduction
 
Mapping Data Flows Training deck Q1 CY22
Mapping Data Flows Training deck Q1 CY22Mapping Data Flows Training deck Q1 CY22
Mapping Data Flows Training deck Q1 CY22
 
Introduction to column oriented databases
Introduction to column oriented databasesIntroduction to column oriented databases
Introduction to column oriented databases
 
MongoDB: Advance concepts - Replication and Sharding
MongoDB: Advance concepts - Replication and ShardingMongoDB: Advance concepts - Replication and Sharding
MongoDB: Advance concepts - Replication and Sharding
 
Azure Synapse Analytics Overview (r2)
Azure Synapse Analytics Overview (r2)Azure Synapse Analytics Overview (r2)
Azure Synapse Analytics Overview (r2)
 
Oracle backup and recovery
Oracle backup and recoveryOracle backup and recovery
Oracle backup and recovery
 
introduction to NOSQL Database
introduction to NOSQL Databaseintroduction to NOSQL Database
introduction to NOSQL Database
 

Viewers also liked

Data Modeling with Cassandra Column Families
Data Modeling with Cassandra Column FamiliesData Modeling with Cassandra Column Families
Data Modeling with Cassandra Column Familiesgdusbabek
 
Usergrid Overview
Usergrid OverviewUsergrid Overview
Usergrid Overviewusergrid
 
Open Source Mobile Backend on Cassandra
Open Source Mobile Backend on CassandraOpen Source Mobile Backend on Cassandra
Open Source Mobile Backend on CassandraEd Anuff
 
Cybersecurity-Serverless-Graph DB
Cybersecurity-Serverless-Graph DBCybersecurity-Serverless-Graph DB
Cybersecurity-Serverless-Graph DBSukumar Nayak
 
Cassandra 2.0 to 2.1
Cassandra 2.0 to 2.1Cassandra 2.0 to 2.1
Cassandra 2.0 to 2.1Johnny Miller
 
How to find Zero day vulnerabilities
How to find Zero day vulnerabilitiesHow to find Zero day vulnerabilities
How to find Zero day vulnerabilitiesMohammed A. Imran
 
I Love APIs 2015: Apache Usergrid Web Scale Mobile APIs with Elastic Search a...
I Love APIs 2015: Apache Usergrid Web Scale Mobile APIs with Elastic Search a...I Love APIs 2015: Apache Usergrid Web Scale Mobile APIs with Elastic Search a...
I Love APIs 2015: Apache Usergrid Web Scale Mobile APIs with Elastic Search a...Apigee | Google Cloud
 
A Shortcut to Awesome: Cassandra Data Modeling By Example (Jon Haddad, The La...
A Shortcut to Awesome: Cassandra Data Modeling By Example (Jon Haddad, The La...A Shortcut to Awesome: Cassandra Data Modeling By Example (Jon Haddad, The La...
A Shortcut to Awesome: Cassandra Data Modeling By Example (Jon Haddad, The La...DataStax
 
Apresentação cassandra
Apresentação cassandraApresentação cassandra
Apresentação cassandraRichiely Paiva
 
DataStax: A deep look at the CQL WHERE clause
DataStax: A deep look at the CQL WHERE clauseDataStax: A deep look at the CQL WHERE clause
DataStax: A deep look at the CQL WHERE clauseDataStax Academy
 
NoSQL, Base VS ACID e Teorema CAP
NoSQL, Base VS ACID e Teorema CAPNoSQL, Base VS ACID e Teorema CAP
NoSQL, Base VS ACID e Teorema CAPAricelio Souza
 
NoSQL: onde, como e por quê? Cassandra e MongoDB
NoSQL: onde, como e por quê? Cassandra e MongoDBNoSQL: onde, como e por quê? Cassandra e MongoDB
NoSQL: onde, como e por quê? Cassandra e MongoDBRodrigo Hjort
 
Advanced excel 2010 & 2013 updated Terrabiz
Advanced excel 2010 & 2013 updated TerrabizAdvanced excel 2010 & 2013 updated Terrabiz
Advanced excel 2010 & 2013 updated TerrabizAhmed Yasir Khan
 
Key-Value Stores: a practical overview
Key-Value Stores: a practical overviewKey-Value Stores: a practical overview
Key-Value Stores: a practical overviewMarc Seeger
 
Data Modeling for Apache Cassandra
Data Modeling for Apache CassandraData Modeling for Apache Cassandra
Data Modeling for Apache CassandraDataStax Academy
 
Cassandra 3.0 Data Modeling
Cassandra 3.0 Data ModelingCassandra 3.0 Data Modeling
Cassandra 3.0 Data ModelingDataStax Academy
 

Viewers also liked (20)

Data Modeling with Cassandra Column Families
Data Modeling with Cassandra Column FamiliesData Modeling with Cassandra Column Families
Data Modeling with Cassandra Column Families
 
Usergrid Overview
Usergrid OverviewUsergrid Overview
Usergrid Overview
 
Open Source Mobile Backend on Cassandra
Open Source Mobile Backend on CassandraOpen Source Mobile Backend on Cassandra
Open Source Mobile Backend on Cassandra
 
Docker and CloudStack
Docker and CloudStackDocker and CloudStack
Docker and CloudStack
 
CQL3 in depth
CQL3 in depthCQL3 in depth
CQL3 in depth
 
Cybersecurity-Serverless-Graph DB
Cybersecurity-Serverless-Graph DBCybersecurity-Serverless-Graph DB
Cybersecurity-Serverless-Graph DB
 
Cassandra 2.0 to 2.1
Cassandra 2.0 to 2.1Cassandra 2.0 to 2.1
Cassandra 2.0 to 2.1
 
How to find Zero day vulnerabilities
How to find Zero day vulnerabilitiesHow to find Zero day vulnerabilities
How to find Zero day vulnerabilities
 
I Love APIs 2015: Apache Usergrid Web Scale Mobile APIs with Elastic Search a...
I Love APIs 2015: Apache Usergrid Web Scale Mobile APIs with Elastic Search a...I Love APIs 2015: Apache Usergrid Web Scale Mobile APIs with Elastic Search a...
I Love APIs 2015: Apache Usergrid Web Scale Mobile APIs with Elastic Search a...
 
Taming NoSQL with Spring Data
Taming NoSQL with Spring DataTaming NoSQL with Spring Data
Taming NoSQL with Spring Data
 
A Shortcut to Awesome: Cassandra Data Modeling By Example (Jon Haddad, The La...
A Shortcut to Awesome: Cassandra Data Modeling By Example (Jon Haddad, The La...A Shortcut to Awesome: Cassandra Data Modeling By Example (Jon Haddad, The La...
A Shortcut to Awesome: Cassandra Data Modeling By Example (Jon Haddad, The La...
 
Apresentação cassandra
Apresentação cassandraApresentação cassandra
Apresentação cassandra
 
DataStax: A deep look at the CQL WHERE clause
DataStax: A deep look at the CQL WHERE clauseDataStax: A deep look at the CQL WHERE clause
DataStax: A deep look at the CQL WHERE clause
 
NoSQL, Base VS ACID e Teorema CAP
NoSQL, Base VS ACID e Teorema CAPNoSQL, Base VS ACID e Teorema CAP
NoSQL, Base VS ACID e Teorema CAP
 
NoSQL: onde, como e por quê? Cassandra e MongoDB
NoSQL: onde, como e por quê? Cassandra e MongoDBNoSQL: onde, como e por quê? Cassandra e MongoDB
NoSQL: onde, como e por quê? Cassandra e MongoDB
 
CloudStack Architecture
CloudStack ArchitectureCloudStack Architecture
CloudStack Architecture
 
Advanced excel 2010 & 2013 updated Terrabiz
Advanced excel 2010 & 2013 updated TerrabizAdvanced excel 2010 & 2013 updated Terrabiz
Advanced excel 2010 & 2013 updated Terrabiz
 
Key-Value Stores: a practical overview
Key-Value Stores: a practical overviewKey-Value Stores: a practical overview
Key-Value Stores: a practical overview
 
Data Modeling for Apache Cassandra
Data Modeling for Apache CassandraData Modeling for Apache Cassandra
Data Modeling for Apache Cassandra
 
Cassandra 3.0 Data Modeling
Cassandra 3.0 Data ModelingCassandra 3.0 Data Modeling
Cassandra 3.0 Data Modeling
 

Similar to NoSQL Database- cassandra column Base DB

NewSQL - Deliverance from BASE and back to SQL and ACID
NewSQL - Deliverance from BASE and back to SQL and ACIDNewSQL - Deliverance from BASE and back to SQL and ACID
NewSQL - Deliverance from BASE and back to SQL and ACIDTony Rogerson
 
GCP Data Engineer cheatsheet
GCP Data Engineer cheatsheetGCP Data Engineer cheatsheet
GCP Data Engineer cheatsheetGuang Xu
 
Best Practices for Supercharging Cloud Analytics on Amazon Redshift
Best Practices for Supercharging Cloud Analytics on Amazon RedshiftBest Practices for Supercharging Cloud Analytics on Amazon Redshift
Best Practices for Supercharging Cloud Analytics on Amazon RedshiftSnapLogic
 
London Redshift Meetup - July 2017
London Redshift Meetup - July 2017London Redshift Meetup - July 2017
London Redshift Meetup - July 2017Pratim Das
 
Dynamic DDL: Adding structure to streaming IoT data on the fly
Dynamic DDL: Adding structure to streaming IoT data on the flyDynamic DDL: Adding structure to streaming IoT data on the fly
Dynamic DDL: Adding structure to streaming IoT data on the flyDataWorks Summit
 
Svccg nosql 2011_v4
Svccg nosql 2011_v4Svccg nosql 2011_v4
Svccg nosql 2011_v4Sid Anand
 
Data stores: beyond relational databases
Data stores: beyond relational databasesData stores: beyond relational databases
Data stores: beyond relational databasesJavier García Magna
 
Dynamic DDL: Adding Structure to Streaming Data on the Fly with David Winters...
Dynamic DDL: Adding Structure to Streaming Data on the Fly with David Winters...Dynamic DDL: Adding Structure to Streaming Data on the Fly with David Winters...
Dynamic DDL: Adding Structure to Streaming Data on the Fly with David Winters...Databricks
 
Lambda architecture @ Indix
Lambda architecture @ IndixLambda architecture @ Indix
Lambda architecture @ IndixRajesh Muppalla
 
Cloud Lambda Architecture Patterns
Cloud Lambda Architecture PatternsCloud Lambda Architecture Patterns
Cloud Lambda Architecture PatternsAsis Mohanty
 
Deploying your Data Warehouse on AWS
Deploying your Data Warehouse on AWSDeploying your Data Warehouse on AWS
Deploying your Data Warehouse on AWSAmazon Web Services
 
Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic...
Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic...Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic...
Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic...Amazon Web Services
 
Azure Stream Analytics
Azure Stream AnalyticsAzure Stream Analytics
Azure Stream AnalyticsMarco Parenzan
 
Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)
Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)
Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)Trivadis
 
Spark and Couchbase: Augmenting the Operational Database with Spark
Spark and Couchbase: Augmenting the Operational Database with SparkSpark and Couchbase: Augmenting the Operational Database with Spark
Spark and Couchbase: Augmenting the Operational Database with SparkSpark Summit
 
AWS Redshift Introduction - Big Data Analytics
AWS Redshift Introduction - Big Data AnalyticsAWS Redshift Introduction - Big Data Analytics
AWS Redshift Introduction - Big Data AnalyticsKeeyong Han
 
Jump Start on Apache Spark 2.2 with Databricks
Jump Start on Apache Spark 2.2 with DatabricksJump Start on Apache Spark 2.2 with Databricks
Jump Start on Apache Spark 2.2 with DatabricksAnyscale
 
Dynamodb Presentation
Dynamodb PresentationDynamodb Presentation
Dynamodb Presentationadvaitdeo
 

Similar to NoSQL Database- cassandra column Base DB (20)

NewSQL - Deliverance from BASE and back to SQL and ACID
NewSQL - Deliverance from BASE and back to SQL and ACIDNewSQL - Deliverance from BASE and back to SQL and ACID
NewSQL - Deliverance from BASE and back to SQL and ACID
 
Gcp data engineer
Gcp data engineerGcp data engineer
Gcp data engineer
 
GCP Data Engineer cheatsheet
GCP Data Engineer cheatsheetGCP Data Engineer cheatsheet
GCP Data Engineer cheatsheet
 
Best Practices for Supercharging Cloud Analytics on Amazon Redshift
Best Practices for Supercharging Cloud Analytics on Amazon RedshiftBest Practices for Supercharging Cloud Analytics on Amazon Redshift
Best Practices for Supercharging Cloud Analytics on Amazon Redshift
 
London Redshift Meetup - July 2017
London Redshift Meetup - July 2017London Redshift Meetup - July 2017
London Redshift Meetup - July 2017
 
MySQL Cluster
MySQL ClusterMySQL Cluster
MySQL Cluster
 
Dynamic DDL: Adding structure to streaming IoT data on the fly
Dynamic DDL: Adding structure to streaming IoT data on the flyDynamic DDL: Adding structure to streaming IoT data on the fly
Dynamic DDL: Adding structure to streaming IoT data on the fly
 
Svccg nosql 2011_v4
Svccg nosql 2011_v4Svccg nosql 2011_v4
Svccg nosql 2011_v4
 
Data stores: beyond relational databases
Data stores: beyond relational databasesData stores: beyond relational databases
Data stores: beyond relational databases
 
Dynamic DDL: Adding Structure to Streaming Data on the Fly with David Winters...
Dynamic DDL: Adding Structure to Streaming Data on the Fly with David Winters...Dynamic DDL: Adding Structure to Streaming Data on the Fly with David Winters...
Dynamic DDL: Adding Structure to Streaming Data on the Fly with David Winters...
 
Lambda architecture @ Indix
Lambda architecture @ IndixLambda architecture @ Indix
Lambda architecture @ Indix
 
Cloud Lambda Architecture Patterns
Cloud Lambda Architecture PatternsCloud Lambda Architecture Patterns
Cloud Lambda Architecture Patterns
 
Deploying your Data Warehouse on AWS
Deploying your Data Warehouse on AWSDeploying your Data Warehouse on AWS
Deploying your Data Warehouse on AWS
 
Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic...
Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic...Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic...
Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic...
 
Azure Stream Analytics
Azure Stream AnalyticsAzure Stream Analytics
Azure Stream Analytics
 
Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)
Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)
Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)
 
Spark and Couchbase: Augmenting the Operational Database with Spark
Spark and Couchbase: Augmenting the Operational Database with SparkSpark and Couchbase: Augmenting the Operational Database with Spark
Spark and Couchbase: Augmenting the Operational Database with Spark
 
AWS Redshift Introduction - Big Data Analytics
AWS Redshift Introduction - Big Data AnalyticsAWS Redshift Introduction - Big Data Analytics
AWS Redshift Introduction - Big Data Analytics
 
Jump Start on Apache Spark 2.2 with Databricks
Jump Start on Apache Spark 2.2 with DatabricksJump Start on Apache Spark 2.2 with Databricks
Jump Start on Apache Spark 2.2 with Databricks
 
Dynamodb Presentation
Dynamodb PresentationDynamodb Presentation
Dynamodb Presentation
 

More from sadegh salehi

Cloud intrusion detection System
Cloud intrusion detection SystemCloud intrusion detection System
Cloud intrusion detection Systemsadegh salehi
 
Interactive and Multimodal Pedagogy Using IWB
Interactive and Multimodal Pedagogy Using IWB  Interactive and Multimodal Pedagogy Using IWB
Interactive and Multimodal Pedagogy Using IWB sadegh salehi
 
Ontology development in protégé-آنتولوژی در پروتوغه
Ontology development in protégé-آنتولوژی در پروتوغهOntology development in protégé-آنتولوژی در پروتوغه
Ontology development in protégé-آنتولوژی در پروتوغهsadegh salehi
 
Prototype design pattern - الگوی طراحی Prototype
Prototype design pattern - الگوی طراحی PrototypePrototype design pattern - الگوی طراحی Prototype
Prototype design pattern - الگوی طراحی Prototypesadegh salehi
 
Backup and recovery in oracle
Backup and recovery in oracleBackup and recovery in oracle
Backup and recovery in oraclesadegh salehi
 
مکتب کلبیان
مکتب کلبیانمکتب کلبیان
مکتب کلبیانsadegh salehi
 

More from sadegh salehi (9)

Cloud intrusion detection System
Cloud intrusion detection SystemCloud intrusion detection System
Cloud intrusion detection System
 
Fault prediction
Fault predictionFault prediction
Fault prediction
 
Integration test
Integration testIntegration test
Integration test
 
Interactive and Multimodal Pedagogy Using IWB
Interactive and Multimodal Pedagogy Using IWB  Interactive and Multimodal Pedagogy Using IWB
Interactive and Multimodal Pedagogy Using IWB
 
Ontology development in protégé-آنتولوژی در پروتوغه
Ontology development in protégé-آنتولوژی در پروتوغهOntology development in protégé-آنتولوژی در پروتوغه
Ontology development in protégé-آنتولوژی در پروتوغه
 
Prototype design pattern - الگوی طراحی Prototype
Prototype design pattern - الگوی طراحی PrototypePrototype design pattern - الگوی طراحی Prototype
Prototype design pattern - الگوی طراحی Prototype
 
Backup and recovery in oracle
Backup and recovery in oracleBackup and recovery in oracle
Backup and recovery in oracle
 
Jame isfahan mosque
Jame isfahan mosqueJame isfahan mosque
Jame isfahan mosque
 
مکتب کلبیان
مکتب کلبیانمکتب کلبیان
مکتب کلبیان
 

Recently uploaded

Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)Suman Mia
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).pptssuser5c9d4b1
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSRajkumarAkumalla
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...roncy bisnoi
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)simmis5
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxupamatechverse
 

Recently uploaded (20)

Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 

NoSQL Database- cassandra column Base DB

  • 1. + NoSQL – Part 2 CAP Theorem & Column Oriented Mohammad Sadegh Salehi Dr.Baraani Winter2015 Sheikh Bahaie University
  • 2. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 2 Winter 2015 Agenda —Review NoSQL —Dynamo and BigTable —NoSQL Classification —Key-value Stores —Column Oriented —Casandra —Why Casandra —Question
  • 3. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 3 Winter 2015 What is NoSQL review  Stands for Not Only SQL  Class of non-relational data storage systems  Usually do not require a fixed table schema nor do they use the concept of joins  All NoSQL offerings relax one or more of the ACID properties (will talk about the CAP theorem)
  • 4. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 4 Winter 2015 Dynamo and BigTable  Three major papers were the seeds of the NoSQL movement • BigTable (Google) • Dynamo (Amazon) —Gossip protocol (discovery and error detection) —Distributed key-value data store —Eventual consistency • CAP Theorem (discuss in a sec ..)
  • 5. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 5 Winter 2015
  • 6. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 7 Winter 2015 What kinds of NoSQL Review  NoSQL solutions fall into two major areas: • Key/Value or ‘the big hash table’. —Amazon S3 (Dynamo) —Voldemort —Scalaris • Schema-less which comes in multiple flavors, column- based, document-based or graph-based. —Cassandra (column-based) —CouchDB (document-based) —Neo4J (graph-based) —HBase (column-based)
  • 7. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 8 Winter 2015 Key-Value Stores  Extremely simple interface • Data model: (key, value) pairs • Operations: —Insert(key,value), —Fetch(key), —Update(key), —Delete(key).  Implementation: efficiency, scalability, fault- tolerance • Records distributed to nodes based on key • Replication • Single-record transactions,“eventual consistency”
  • 8. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 9 Winter 2015 Key-Value Data Stores  Storing Session Information  User Profiles, Preferences: Almost every user has a unique userID as well as preferences such as language, color, timezone, which products the user has access to , and so on. Suitable Use Cases
  • 9. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 10 Winter 2015 Key-Value Data Stores  As we want the shopping carts to be available all the time, across browsers, machines, and sessions, all the shopping information can be put into value where the key is the userID Shopping Cart Data
  • 10. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 11 Winter 2015 Key-Value Data Stores  Relationships among data  Multi-operation Transactions  Query by Data  Operations by Sets Not to Use
  • 11. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 12 Winter 2015 Column-oriented  Store data in column order  Allow key-value pairs to be stored (and retrieved on key) in a massively parallel system, • Data model: families of attributes defined in a schema, new attributes can be added, • Storing principle: big hashed distributed tables, • Properties: partitioning (horizontally and/or vertically), high availability etc. completely transparent to application, Intro
  • 12. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 13 Winter 2015
  • 13. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 14 Winter 2015 Cassandra  Apache Cassandra™ is a free  Distributed…  High performance…  Extremely scalable…  Fault tolerant (i.e. no single point of failure)…  Post-relational database solution.  Cassandra can serve as both real-time datastore and as a read-intensive database.  Compiles to: C++, Java, PHP, Ruby, Erlang, Perl, ... Thrift
  • 14. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 15 Winter 2015 Cassandra Infographic
  • 15. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 16 Winter 2015 Cassandra  Originally developed at Facebook  Follows the BigTable data model: column-oriented  Uses the Dynamo Eventual Consistency model  Written in Java  Open-sourced and exists within the Apache family  Uses Apache Thrift as it’s API  Some of its myriad users:
  • 16. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 17 Winter 2015 Cassandra  keyspace: Usually the name of the application; e.g., 'Twitter', 'Wordpress‘.  column family: structure containing an unlimited number of rows • Simple • Super (nested Column Families)  column: a tuple with name, value and time stamp • Each Column has — Name — Value — Timestamp  key: name of record  super column: contains more columns Data Model
  • 17. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 18 Winter 2015 Cassandra – Data Model keyspace settings column family settings column name value timestamp
  • 18. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 19 Winter 2015 Cassandra Column Family & Super Column Family
  • 19. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 20 Winter 2015 Cassandra  Cassandra was designed with the understanding that system/hardware failures can and do occur  Peer-to-peer, distributed system  All nodes the same  Data partitioned among all nodes in the cluster  Custom data replication to ensure fault tolerance  Read/Write-anywhere design Architecture Overview
  • 20. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 21 Winter 2015 Cassandra  Each node communicates with each other through the Gossip protocol, which exchanges information across the cluster every second,  A commit log is used on each node to capture write activity. Data durability is assured,  Data also written to an in-memory structure (memtable) and then to disk once the memory structure is full (an SStable). Architecture Overview
  • 21. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 22 Winter 2015 Why Cassandra?  Gigabyte to Petabyte scalability  Linear performance gains through adding nodes  No single point of failure  Easy replication / data distribution  Multi-data center and Cloud capable  No need for separate caching layer  Tunable data consistency  Flexible schema design  Data Compression  CQL language (like SQL)  Support for key languages and platforms  No need for special hardware or software
  • 22. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 23 Winter 2015 Why Cassandra?  Capable of comfortably scaling to petabytes  New nodes = Linear performance increases  Add new nodes online Big Data Scalability 1 2 Double Throughput Capabilities 1 2 3 4
  • 23. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 24 Winter 2015 Why Cassandra?  All nodes the same  Customized replication affords tunable data redundancy  Read/write from any node  Can replicate data among different physical data center racks No Single Point of Failure
  • 24. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 25 Winter 2015 Why Cassandra?  Peer-to-peer architecture removes need for special caching layer and the programming that goes with it  The database cluster uses the memory from all participating nodes to cache the data assigned to each node  No irregularities between a memory cache and database are encountered No Need for Caching Software Database Server Memcached Servers Application Servers Writes Reads
  • 25. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 26 Winter 2015 Why Cassandra?  Uses Google’s Snappy data compression algorithm  Compresses data on a per column family level  Internal tests at DataStax show up to 80%+ compression of raw data  No performance penalty (and some increases in overall performance due to less physical I/O)! Data Compression Portfolio Keyspace Customer Column Family
  • 26. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 27 Winter 2015 Why Cassandra?  Very similar to RDBMS SQL syntax  Create objects via DDL (e.g. CREATE…)  Core DML commands supported: INSERT, UPDATE, DELETE  Query data with SELECT CQL Language Portfolio Keyspace 1 2 3 4 5 6 SELECT * FROM USERS WHERE STATE = ‘TX’;
  • 27. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 28 Winter 2015 Comparison with MySQL  MySQL > 50 GB Data Writes Average : ~300 ms Reads Average : ~350 ms  Stats provided by Authors using facebook data.  Cassandra > 50 GB Data Writes Average : 0.12 ms Reads Average : 15 ms
  • 28. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 29 Winter 2015 Cassandra Tools ........DesktopnoSqlCassandra-sadeghnoSqlCassandra-sadegh.mp4
  • 29. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 30 Winter 2015 Where to get Cassandra?  Go to www.datastax.com  DataStax makes free smart start installers available for Cassandra that include: • The most up-to-date Cassandra version that is production quality • A version of DataStax OpsCenter, which is a visual, browser- based management tool for managing and monitoring Cassandra • Drivers and connectors for popular development languages • Same database and application • Automatic configuration assistance for ensuring optimal performance and setup for either stand-alone or cluster implementations • Getting Started Guide
  • 30. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 31 Winter 2015 Where Can I Learn More? www.datastax.com  Free Online Documentation  User/Customer Cas Studies  Technical White Papers  Software downloads  Technical Articles  User Forums  Videos  Tutorials  FAQ’s  Blogs
  • 31. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 32 Winter 2015 Resources Sites  Cassandra • http://cassandra.apache.org  NoSQL News websites • http://nosql.mypopescu.com • http://www.nosqldatabases.com  “a practical guide to noSQL”, Posted by Denise Miura on March 17, 2011 at • http://blogs.marklogic.com/2011/03/17/a-practical- guide-to-nosql/
  • 32. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 33 Winter 2015 Resources Books  “Cassandra The Definition Guide”, O'Reilly Media, nov2013  “Cassandra Essential Toturial”, DataStax 2014  “Professional NoSQL”, Wrox, 2011  “NoSQL Distilled”, Martin Fowler, 2013
  • 33. + NoSQL (part 2) - CAP Theorem & Column Oriented 33 34 Winter 2015 Questions