SlideShare uma empresa Scribd logo
1 de 7
NoSQL
Thenraja Vettivelraj
Swansea University
Contents List
ABSTRACT
1. INTRODUCTION
2. MAIN FEATURES
2.1 COMPARISON WITH SQL
3. EXAMPLE - CASSANDRA
3.1 MAIN FEATURES OF APACHE CASSANDRA
3.2 WHY APACHE CASSANDRA?
3.3 APPLICATIONS
4. DRAWBACKS OF NOSQL
5. SUMMARY
6. REFERENCES
ABSTRACT
NoSQL is one of the emerging fields without any arguments. It is a very powerful and efficient tool in
data storage and manipulating the data. It has no fixed Schema, no Joins and it also avoided the
“ACID” properties. [Han, J. et al., 2011] And basically one of the advantages of the NoSQL is very
much faster than the SQL and also the operational cost will be low than the relational database. Due to
the current trend there is necessity in increase of Storage, Connectedness, Architecture and Semi-
Structure [Accessed: 25 Feb 2012].
1. INTRODUCTION
The term “NoSQL” means, it has so many interpretations at first many told that it is Non-Relational
database and others say that “NOSQL” stands for Not Only SQL. And now-a-days they are calling the
term “NOSQL” as an Umbrella term for all the databases and the data stores which don’t follow the
relational database and also it is not a single technology or a product but it is a class of products,
collection of diverse and matter of about how to manipulate and store the data [Accessed: 24 Feb
2012]
It's a term basically hit the market on 1998 [Accessed: 24 Feb 2012] and now for the past 3-4 years it
has its own place in the market because of its tremendous growth. Massive scalability, Lower cost,
Schema flexibility, Massive Data Stores and high availability [Accessed: 24 Feb2012]. Some of the
main applications of the NoSQL are Search Engines, Data Processing and Social Website. NoSQL
does not support Joins and but it supports ACID properties.
There are four main data models in NoSQL namely
Key-Value Stores
Big Table Clones
Document Databases
Graph Databases
In these we have to choose the right one for our job [Accessed: 25 Feb 2012]. Some of the very
examples of NOSQL databases are Cassandra which is used by Facebook (Social Networking Site)
and it comes under the Key-Value store. It has the capability to handle data very huge Terabyte (TB)
of data in a single day because of its users. Big Table is an example for BigTable Clones and they
reasoned for developing their own database in order to increase the control the performance and
scalability. Google uses for its Search Engine, Gmail, Orkut and other Google applications. Neo4j is a
very good example for Graph database and it is written in Java. Apache CouchDB which is an
example for Document database written in Erlang. In the Figure 1 they have compared the four
different data models of NoSQL in a graph size versus complexity.
2. MAIN FEATURES
CAP theorem-Consistency, Availability and Partition tolerance. According to [Accessed: 11 Mar
2012] “Available, Partition-Tolerant (AP) Systems achieve "eventual consistency" through
replication and verification. Examples of AP systems is Cassandra, CouchDB
Consistency means that each client always has the same view of the data.
Availability means that all clients can always read and write.
Partition tolerance means that the system works well across physical network
partitions.”
Size
Complexity
Figure 1: Comparison on NoSQL data models
2.1 COMPARISON WITH SQL
When we compare with SQL, NoSQL slightly have the upper hand because of scalability and
performance. Uses map reduce, CQL instead of SQL language.
3. EXAMPLE - CASSANDRA
Cassandra is one of the well known NoSQL database and it is used widely because it has the
capability to handle large amounts of structured data without any failure and it will be ease of use.
It is written in Java and it requires JVM (Java Virtual Machine) to be installed in the system before
you start your Server and also is of key-value store type. Basically Cassandra supports CQL
(Cassandra Query Language). DataStax is one of the third party distributions of the Cassandra and it
has the Cassandra CQL Shell where we have to create the Keyspace and Column family.
Figure 2: Cassandra CQL Shell where keyspace and column family created
Key-value
stores
Big table clones
Document databases
Graph databases
Keyspace is the outer most grouping of our data and it also a collection of column family and typically
each application will have one keyspace name. They are the management and configuration part for
the column family. And one most important thing about the keyspace is the replicating factor. In the
above we created the strategy class as Simple strategy, other than this there is Network strategy
topology. And we can create multiple number of nodes. Then created the Column family named
example. Normally there are two types of column namely
Standard column family and
Super column family
Cassandra consists of three simple methods. They are insert, get and delete.
Standard column family
Super Colum family
Figure 3: Cassandra Data Modelling
3.1 MAIN FEATURES OF APACHE CASSANDRA
Partitioning
This is one of the main features in Cassandra because the data we are storing will be partitioned
dynamically and stored in the cluster over the set of available nodes by using the Hash mechanism.
By consistent hashing we will get a fixed circular space or “ring”. Each node has been assigned with a
random which denotes the position in the ring. Each data stored has been assigned a specific key in
the ring.
Figure 4: Ring View of Cassandra Test cluster
The above shown is the ring view of the Cassandra test cluster which has a token value and also it has
some other information like IP, Size and Load which is available in Web Interface of Datastax
(http://localhost:8888/opscenter/index.html) by default.
Scaling the cluster
Cassandra can also support multi node. When a new node is added into the existing system which
already has one node will split up the workload of other node and hence will be responsible for the
same job what the other node does. This can be done by the Bootstrap algorithm by some node in
command line utility or by the Cassandra web dashboard.
Figure 5: Cassandra dashboard
3.2 WHY APACHE CASSANDRA?
There are many factors that why I should have Cassandra mainly because it has the capability to
handle TB or PB’s of data in a peer to peer architecture, it follows CQL (Cassandra Query Language)
which is alike SQL, peer to peer architecture, Data will be replicated to multiple nodes and hence
there won’t be single point of failure, cloud enabled, data will be replicated to more than one location
in case of disaster recovery scenarios so there will be durability and high availability, transparent fault
detection and recovery which follows gossip protocol, ease of use and no special hardware is required
to run.
3.3 APPLICATIONS
Companies like Accenture, Twitter, Facebook and many more companies were using the NoSQL
database in one or other way because of its main features. Not only in industries but also in
Educational and other government sectors also slowly started using the NoSQL database. For example
“Burt uses Cassandra in their software to help advertisers and agencies improve the efficiency and
effect of online campaigns” [Accessed: 11 Mar 2012].
4. DRAWBACKS OF NOSQL
Unlike the SQL it doesn't have ACID properties. So we cannot expect the degree of reliability what
we get in the SQL database. Many were unfamiliar with this technology. Unlike the other commercial
SQL databases here we won't get enough support for the product, since many of the NoSQL were
only limited support.
5. SUMMARY
Like Graph database, Key-value database, Big table Clones, Document database it has made a very
big impact in the database field and most of them are Open source. So in my point of view I am sure
that many will soon migrate towards NoSQL from SQL. So in the next two to four years we can
expect a major change in the database field because of its scalability and its other features, but
chances are less that it will replace the SQL databases. Each database has its Pros and Cons and it’s
our duty to choose the right one.
6. REFERENCES
[Accessed: 24 Feb2012] Slideshare.net (2010) NoSQL databases. [Online] Available at:
http://www.slideshare.net/marin_dimitrov/nosql-databases-3584443
[Accessed: 24 Feb 2012] Perdue, T. (1998) NoSQL - An Overview of NoSQL. [online] Available at:
http://newtech.about.com/od/databasemanagement/a/Nosql.htm
[Accessed: 24 Feb 2012] Tiwari, S. (2011) Professional NoSQL. [e-book] Wrox Programmer to
Programmer. Available through: Google Books
http://books.google.co.uk/books?id=tv5iO9MnObUC&printsec=frontcover&dq=nosql&hl=en&sa=X
&ei=5vw_T9CABMG_0QWtzqyPDw&ved=0CEQQ6AEwAg#v=onepage&q=nosql&f=false
[Han, J. et al. , 2011] Han, J. et al. (2011)"Survey on NoSQL database," Pervasive Computing and
Applications (ICPCA), 2011 6th International Conference on , vol., no., pp.363-366, 26-28 Oct. 2011
doi: 10.1109/ICPCA.2011.6106531
[Accessed: 4 Mar 2012] Slideshare.net (2010) NoSQL or not NoSQL? [Online] Available at:
http://www.slideshare.net/ruflin/nosql-or-not-nosql
[Accessed: 25 Feb 2012] Blogs.neotechnology.com (2009) NOSQL: scaling to size and scaling to
complexity - Emil's Neo Thoughts. [Online] Available at:
http://blogs.neotechnology.com/emil/2009/11/nosql-scaling-to-size-and-scaling-to-complexity.html
[Accessed: 25 Feb 2012] Slideshare.net (2011) A NOSQL Overview And The Benefits Of Graph
Databases (nosql east 2009). [Online] Available at: http://www.slideshare.net/emileifrem/nosql-east-
a-nosql-overview-and-the-benefits-of-graph-databases
[Accessed: 25 Feb 2012] Slideshare.net (2011) NOSQL for Dummies. [Online] Available at:
http://www.slideshare.net/thobe/nosql-for-dummies
Leavitt, N.; , "Will NoSQL Databases Live Up to Their Promise?," Computer , vol.43, no.2, pp.12-14,
Feb. 2010 doi: 10.1109/MC.2010.58
URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5410700&isnumber=5410692
[Accessed: 11 Mar 2012] Blog.nahurst.com (2010) Visual Guide to NoSQL Systems - Nathan Hurst's
Blog. [Online] Available at: http://blog.nahurst.com/visual-guide-to-nosql-systems
[Accessed: 11 Mar 2012] Datastax.com (2011) Cassandra Users | DataStax. [online] Available at:
http://www.datastax.com/cassandrausers

Mais conteúdo relacionado

Mais procurados

Relational RDBMS : MySQL, PostgreSQL and SQL SERVER
Relational RDBMS  : MySQL, PostgreSQL and SQL SERVERRelational RDBMS  : MySQL, PostgreSQL and SQL SERVER
Relational RDBMS : MySQL, PostgreSQL and SQL SERVERDalila Chouaya
 
Vskills Apache Cassandra sample material
Vskills Apache Cassandra sample materialVskills Apache Cassandra sample material
Vskills Apache Cassandra sample materialVskills
 
Cassandra-Based Image Processing: Two Case Studies (Kerry Koitzsch, Kildane) ...
Cassandra-Based Image Processing: Two Case Studies (Kerry Koitzsch, Kildane) ...Cassandra-Based Image Processing: Two Case Studies (Kerry Koitzsch, Kildane) ...
Cassandra-Based Image Processing: Two Case Studies (Kerry Koitzsch, Kildane) ...DataStax
 
Introducing ms sql_server_updated
Introducing ms sql_server_updatedIntroducing ms sql_server_updated
Introducing ms sql_server_updatedleetinhf
 
Big Data - Hadoop Ecosystem
Big Data -  Hadoop Ecosystem Big Data -  Hadoop Ecosystem
Big Data - Hadoop Ecosystem nuriadelasheras
 
Oracle vs. MS SQL Server
Oracle vs. MS SQL ServerOracle vs. MS SQL Server
Oracle vs. MS SQL ServerTeresa Rothaar
 
Sql server 2012 dba online training
Sql server 2012 dba online trainingSql server 2012 dba online training
Sql server 2012 dba online trainingsqlmasters
 
Deploying data tier applications sql saturday dc
Deploying data tier applications sql saturday dcDeploying data tier applications sql saturday dc
Deploying data tier applications sql saturday dcJoseph D'Antoni
 
Wengines, Workflows, and 2 years of advanced data processing in Apache OODT
Wengines, Workflows, and 2 years of advanced data processing in Apache OODTWengines, Workflows, and 2 years of advanced data processing in Apache OODT
Wengines, Workflows, and 2 years of advanced data processing in Apache OODTChris Mattmann
 
Aucfanlab Datalake - Big Data Management Platform -
Aucfanlab Datalake - Big Data Management Platform -Aucfanlab Datalake - Big Data Management Platform -
Aucfanlab Datalake - Big Data Management Platform -Aucfan
 
Azure Data Factory usage at Aucfanlab
Azure Data Factory usage at AucfanlabAzure Data Factory usage at Aucfanlab
Azure Data Factory usage at AucfanlabAucfan
 
An overview of snowflake
An overview of snowflakeAn overview of snowflake
An overview of snowflakeSivakumar Ramar
 
Sql interview questions and answers
Sql interview questions and  answersSql interview questions and  answers
Sql interview questions and answerssheibansari
 
Oracle developer interview questions(entry level)
Oracle developer interview questions(entry level)Oracle developer interview questions(entry level)
Oracle developer interview questions(entry level)Naveen P
 

Mais procurados (20)

Unit 3 MongDB
Unit 3 MongDBUnit 3 MongDB
Unit 3 MongDB
 
Relational RDBMS : MySQL, PostgreSQL and SQL SERVER
Relational RDBMS  : MySQL, PostgreSQL and SQL SERVERRelational RDBMS  : MySQL, PostgreSQL and SQL SERVER
Relational RDBMS : MySQL, PostgreSQL and SQL SERVER
 
Oracle archi ppt
Oracle archi pptOracle archi ppt
Oracle archi ppt
 
Vskills Apache Cassandra sample material
Vskills Apache Cassandra sample materialVskills Apache Cassandra sample material
Vskills Apache Cassandra sample material
 
Cassandra-Based Image Processing: Two Case Studies (Kerry Koitzsch, Kildane) ...
Cassandra-Based Image Processing: Two Case Studies (Kerry Koitzsch, Kildane) ...Cassandra-Based Image Processing: Two Case Studies (Kerry Koitzsch, Kildane) ...
Cassandra-Based Image Processing: Two Case Studies (Kerry Koitzsch, Kildane) ...
 
Introducing ms sql_server_updated
Introducing ms sql_server_updatedIntroducing ms sql_server_updated
Introducing ms sql_server_updated
 
Big Data - Hadoop Ecosystem
Big Data -  Hadoop Ecosystem Big Data -  Hadoop Ecosystem
Big Data - Hadoop Ecosystem
 
Stretch db sql server 2016 (sn0028)
Stretch db   sql server 2016 (sn0028)Stretch db   sql server 2016 (sn0028)
Stretch db sql server 2016 (sn0028)
 
Oracle vs. MS SQL Server
Oracle vs. MS SQL ServerOracle vs. MS SQL Server
Oracle vs. MS SQL Server
 
Sql server 2012 dba online training
Sql server 2012 dba online trainingSql server 2012 dba online training
Sql server 2012 dba online training
 
Deploying data tier applications sql saturday dc
Deploying data tier applications sql saturday dcDeploying data tier applications sql saturday dc
Deploying data tier applications sql saturday dc
 
Wengines, Workflows, and 2 years of advanced data processing in Apache OODT
Wengines, Workflows, and 2 years of advanced data processing in Apache OODTWengines, Workflows, and 2 years of advanced data processing in Apache OODT
Wengines, Workflows, and 2 years of advanced data processing in Apache OODT
 
Aucfanlab Datalake - Big Data Management Platform -
Aucfanlab Datalake - Big Data Management Platform -Aucfanlab Datalake - Big Data Management Platform -
Aucfanlab Datalake - Big Data Management Platform -
 
Azure Data Factory usage at Aucfanlab
Azure Data Factory usage at AucfanlabAzure Data Factory usage at Aucfanlab
Azure Data Factory usage at Aucfanlab
 
Oracle database introduction
Oracle database introductionOracle database introduction
Oracle database introduction
 
An overview of snowflake
An overview of snowflakeAn overview of snowflake
An overview of snowflake
 
Nosql intro
Nosql introNosql intro
Nosql intro
 
Sql interview questions and answers
Sql interview questions and  answersSql interview questions and  answers
Sql interview questions and answers
 
MS-SQL SERVER ARCHITECTURE
MS-SQL SERVER ARCHITECTUREMS-SQL SERVER ARCHITECTURE
MS-SQL SERVER ARCHITECTURE
 
Oracle developer interview questions(entry level)
Oracle developer interview questions(entry level)Oracle developer interview questions(entry level)
Oracle developer interview questions(entry level)
 

Semelhante a NoSQL

CS828 P5 Individual Project v101
CS828 P5 Individual Project v101CS828 P5 Individual Project v101
CS828 P5 Individual Project v101ThienSi Le
 
Introduction to NoSQL
Introduction to NoSQLIntroduction to NoSQL
Introduction to NoSQLbalwinders
 
Challenges Management and Opportunities of Cloud DBA
Challenges Management and Opportunities of Cloud DBAChallenges Management and Opportunities of Cloud DBA
Challenges Management and Opportunities of Cloud DBAinventy
 
A Comparative Study of NoSQL and Relational Database.pdf
A Comparative Study of NoSQL and Relational Database.pdfA Comparative Study of NoSQL and Relational Database.pdf
A Comparative Study of NoSQL and Relational Database.pdfJennifer Roman
 
Non relational databases-no sql
Non relational databases-no sqlNon relational databases-no sql
Non relational databases-no sqlRam kumar
 
1. introduction to no sql
1. introduction to no sql1. introduction to no sql
1. introduction to no sqlAnuja Gunale
 
Assignment_4
Assignment_4Assignment_4
Assignment_4Kirti J
 
EVALUATING CASSANDRA, MONGO DB LIKE NOSQL DATASETS USING HADOOP STREAMING
EVALUATING CASSANDRA, MONGO DB LIKE NOSQL DATASETS USING HADOOP STREAMINGEVALUATING CASSANDRA, MONGO DB LIKE NOSQL DATASETS USING HADOOP STREAMING
EVALUATING CASSANDRA, MONGO DB LIKE NOSQL DATASETS USING HADOOP STREAMINGijiert bestjournal
 
Brief introduction to NoSQL by fas mosleh
Brief introduction to NoSQL by fas moslehBrief introduction to NoSQL by fas mosleh
Brief introduction to NoSQL by fas moslehFas (Feisal) Mosleh
 
Sql vs NO-SQL database differences explained
Sql vs NO-SQL database differences explainedSql vs NO-SQL database differences explained
Sql vs NO-SQL database differences explainedSatya Pal
 
DEE 431 Introduction to Mysql Slide 3
DEE 431 Introduction to Mysql Slide 3DEE 431 Introduction to Mysql Slide 3
DEE 431 Introduction to Mysql Slide 3YOGESH SINGH
 
SURVEY ON IMPLEMANTATION OF COLUMN ORIENTED NOSQL DATA STORES ( BIGTABLE & CA...
SURVEY ON IMPLEMANTATION OF COLUMN ORIENTED NOSQL DATA STORES ( BIGTABLE & CA...SURVEY ON IMPLEMANTATION OF COLUMN ORIENTED NOSQL DATA STORES ( BIGTABLE & CA...
SURVEY ON IMPLEMANTATION OF COLUMN ORIENTED NOSQL DATA STORES ( BIGTABLE & CA...IJCERT JOURNAL
 
Data management in cloud study of existing systems and future opportunities
Data management in cloud study of existing systems and future opportunitiesData management in cloud study of existing systems and future opportunities
Data management in cloud study of existing systems and future opportunitiesEditor Jacotech
 
SQL or NoSQL, is this the question? - George Grammatikos
SQL or NoSQL, is this the question? - George GrammatikosSQL or NoSQL, is this the question? - George Grammatikos
SQL or NoSQL, is this the question? - George GrammatikosGeorge Grammatikos
 
NoSQL Databases Introduction - UTN 2013
NoSQL Databases Introduction - UTN 2013NoSQL Databases Introduction - UTN 2013
NoSQL Databases Introduction - UTN 2013Facundo Farias
 
A NOVEL APPROACH FOR HOTEL MANAGEMENT SYSTEM USING CASSANDRA
A NOVEL APPROACH FOR HOTEL MANAGEMENT SYSTEM USING CASSANDRAA NOVEL APPROACH FOR HOTEL MANAGEMENT SYSTEM USING CASSANDRA
A NOVEL APPROACH FOR HOTEL MANAGEMENT SYSTEM USING CASSANDRAijfcstjournal
 

Semelhante a NoSQL (20)

No sql database
No sql databaseNo sql database
No sql database
 
CS828 P5 Individual Project v101
CS828 P5 Individual Project v101CS828 P5 Individual Project v101
CS828 P5 Individual Project v101
 
NoSQL Basics and MongDB
NoSQL Basics and  MongDBNoSQL Basics and  MongDB
NoSQL Basics and MongDB
 
Introduction to NoSQL
Introduction to NoSQLIntroduction to NoSQL
Introduction to NoSQL
 
Challenges Management and Opportunities of Cloud DBA
Challenges Management and Opportunities of Cloud DBAChallenges Management and Opportunities of Cloud DBA
Challenges Management and Opportunities of Cloud DBA
 
A Comparative Study of NoSQL and Relational Database.pdf
A Comparative Study of NoSQL and Relational Database.pdfA Comparative Study of NoSQL and Relational Database.pdf
A Comparative Study of NoSQL and Relational Database.pdf
 
Non relational databases-no sql
Non relational databases-no sqlNon relational databases-no sql
Non relational databases-no sql
 
Erciyes university
Erciyes universityErciyes university
Erciyes university
 
1. introduction to no sql
1. introduction to no sql1. introduction to no sql
1. introduction to no sql
 
Assignment_4
Assignment_4Assignment_4
Assignment_4
 
EVALUATING CASSANDRA, MONGO DB LIKE NOSQL DATASETS USING HADOOP STREAMING
EVALUATING CASSANDRA, MONGO DB LIKE NOSQL DATASETS USING HADOOP STREAMINGEVALUATING CASSANDRA, MONGO DB LIKE NOSQL DATASETS USING HADOOP STREAMING
EVALUATING CASSANDRA, MONGO DB LIKE NOSQL DATASETS USING HADOOP STREAMING
 
Brief introduction to NoSQL by fas mosleh
Brief introduction to NoSQL by fas moslehBrief introduction to NoSQL by fas mosleh
Brief introduction to NoSQL by fas mosleh
 
Sql vs NO-SQL database differences explained
Sql vs NO-SQL database differences explainedSql vs NO-SQL database differences explained
Sql vs NO-SQL database differences explained
 
DEE 431 Introduction to Mysql Slide 3
DEE 431 Introduction to Mysql Slide 3DEE 431 Introduction to Mysql Slide 3
DEE 431 Introduction to Mysql Slide 3
 
SURVEY ON IMPLEMANTATION OF COLUMN ORIENTED NOSQL DATA STORES ( BIGTABLE & CA...
SURVEY ON IMPLEMANTATION OF COLUMN ORIENTED NOSQL DATA STORES ( BIGTABLE & CA...SURVEY ON IMPLEMANTATION OF COLUMN ORIENTED NOSQL DATA STORES ( BIGTABLE & CA...
SURVEY ON IMPLEMANTATION OF COLUMN ORIENTED NOSQL DATA STORES ( BIGTABLE & CA...
 
the rising no sql technology
the rising no sql technologythe rising no sql technology
the rising no sql technology
 
Data management in cloud study of existing systems and future opportunities
Data management in cloud study of existing systems and future opportunitiesData management in cloud study of existing systems and future opportunities
Data management in cloud study of existing systems and future opportunities
 
SQL or NoSQL, is this the question? - George Grammatikos
SQL or NoSQL, is this the question? - George GrammatikosSQL or NoSQL, is this the question? - George Grammatikos
SQL or NoSQL, is this the question? - George Grammatikos
 
NoSQL Databases Introduction - UTN 2013
NoSQL Databases Introduction - UTN 2013NoSQL Databases Introduction - UTN 2013
NoSQL Databases Introduction - UTN 2013
 
A NOVEL APPROACH FOR HOTEL MANAGEMENT SYSTEM USING CASSANDRA
A NOVEL APPROACH FOR HOTEL MANAGEMENT SYSTEM USING CASSANDRAA NOVEL APPROACH FOR HOTEL MANAGEMENT SYSTEM USING CASSANDRA
A NOVEL APPROACH FOR HOTEL MANAGEMENT SYSTEM USING CASSANDRA
 

Último

TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 

Último (20)

TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 

NoSQL

  • 1. NoSQL Thenraja Vettivelraj Swansea University Contents List ABSTRACT 1. INTRODUCTION 2. MAIN FEATURES 2.1 COMPARISON WITH SQL 3. EXAMPLE - CASSANDRA 3.1 MAIN FEATURES OF APACHE CASSANDRA 3.2 WHY APACHE CASSANDRA? 3.3 APPLICATIONS 4. DRAWBACKS OF NOSQL 5. SUMMARY 6. REFERENCES
  • 2. ABSTRACT NoSQL is one of the emerging fields without any arguments. It is a very powerful and efficient tool in data storage and manipulating the data. It has no fixed Schema, no Joins and it also avoided the “ACID” properties. [Han, J. et al., 2011] And basically one of the advantages of the NoSQL is very much faster than the SQL and also the operational cost will be low than the relational database. Due to the current trend there is necessity in increase of Storage, Connectedness, Architecture and Semi- Structure [Accessed: 25 Feb 2012]. 1. INTRODUCTION The term “NoSQL” means, it has so many interpretations at first many told that it is Non-Relational database and others say that “NOSQL” stands for Not Only SQL. And now-a-days they are calling the term “NOSQL” as an Umbrella term for all the databases and the data stores which don’t follow the relational database and also it is not a single technology or a product but it is a class of products, collection of diverse and matter of about how to manipulate and store the data [Accessed: 24 Feb 2012] It's a term basically hit the market on 1998 [Accessed: 24 Feb 2012] and now for the past 3-4 years it has its own place in the market because of its tremendous growth. Massive scalability, Lower cost, Schema flexibility, Massive Data Stores and high availability [Accessed: 24 Feb2012]. Some of the main applications of the NoSQL are Search Engines, Data Processing and Social Website. NoSQL does not support Joins and but it supports ACID properties. There are four main data models in NoSQL namely Key-Value Stores Big Table Clones Document Databases Graph Databases In these we have to choose the right one for our job [Accessed: 25 Feb 2012]. Some of the very examples of NOSQL databases are Cassandra which is used by Facebook (Social Networking Site) and it comes under the Key-Value store. It has the capability to handle data very huge Terabyte (TB) of data in a single day because of its users. Big Table is an example for BigTable Clones and they reasoned for developing their own database in order to increase the control the performance and scalability. Google uses for its Search Engine, Gmail, Orkut and other Google applications. Neo4j is a very good example for Graph database and it is written in Java. Apache CouchDB which is an example for Document database written in Erlang. In the Figure 1 they have compared the four different data models of NoSQL in a graph size versus complexity. 2. MAIN FEATURES CAP theorem-Consistency, Availability and Partition tolerance. According to [Accessed: 11 Mar 2012] “Available, Partition-Tolerant (AP) Systems achieve "eventual consistency" through replication and verification. Examples of AP systems is Cassandra, CouchDB Consistency means that each client always has the same view of the data. Availability means that all clients can always read and write. Partition tolerance means that the system works well across physical network partitions.”
  • 3. Size Complexity Figure 1: Comparison on NoSQL data models 2.1 COMPARISON WITH SQL When we compare with SQL, NoSQL slightly have the upper hand because of scalability and performance. Uses map reduce, CQL instead of SQL language. 3. EXAMPLE - CASSANDRA Cassandra is one of the well known NoSQL database and it is used widely because it has the capability to handle large amounts of structured data without any failure and it will be ease of use. It is written in Java and it requires JVM (Java Virtual Machine) to be installed in the system before you start your Server and also is of key-value store type. Basically Cassandra supports CQL (Cassandra Query Language). DataStax is one of the third party distributions of the Cassandra and it has the Cassandra CQL Shell where we have to create the Keyspace and Column family. Figure 2: Cassandra CQL Shell where keyspace and column family created Key-value stores Big table clones Document databases Graph databases
  • 4. Keyspace is the outer most grouping of our data and it also a collection of column family and typically each application will have one keyspace name. They are the management and configuration part for the column family. And one most important thing about the keyspace is the replicating factor. In the above we created the strategy class as Simple strategy, other than this there is Network strategy topology. And we can create multiple number of nodes. Then created the Column family named example. Normally there are two types of column namely Standard column family and Super column family Cassandra consists of three simple methods. They are insert, get and delete. Standard column family Super Colum family Figure 3: Cassandra Data Modelling 3.1 MAIN FEATURES OF APACHE CASSANDRA Partitioning This is one of the main features in Cassandra because the data we are storing will be partitioned dynamically and stored in the cluster over the set of available nodes by using the Hash mechanism. By consistent hashing we will get a fixed circular space or “ring”. Each node has been assigned with a random which denotes the position in the ring. Each data stored has been assigned a specific key in the ring.
  • 5. Figure 4: Ring View of Cassandra Test cluster The above shown is the ring view of the Cassandra test cluster which has a token value and also it has some other information like IP, Size and Load which is available in Web Interface of Datastax (http://localhost:8888/opscenter/index.html) by default. Scaling the cluster Cassandra can also support multi node. When a new node is added into the existing system which already has one node will split up the workload of other node and hence will be responsible for the same job what the other node does. This can be done by the Bootstrap algorithm by some node in command line utility or by the Cassandra web dashboard. Figure 5: Cassandra dashboard 3.2 WHY APACHE CASSANDRA? There are many factors that why I should have Cassandra mainly because it has the capability to handle TB or PB’s of data in a peer to peer architecture, it follows CQL (Cassandra Query Language) which is alike SQL, peer to peer architecture, Data will be replicated to multiple nodes and hence
  • 6. there won’t be single point of failure, cloud enabled, data will be replicated to more than one location in case of disaster recovery scenarios so there will be durability and high availability, transparent fault detection and recovery which follows gossip protocol, ease of use and no special hardware is required to run. 3.3 APPLICATIONS Companies like Accenture, Twitter, Facebook and many more companies were using the NoSQL database in one or other way because of its main features. Not only in industries but also in Educational and other government sectors also slowly started using the NoSQL database. For example “Burt uses Cassandra in their software to help advertisers and agencies improve the efficiency and effect of online campaigns” [Accessed: 11 Mar 2012]. 4. DRAWBACKS OF NOSQL Unlike the SQL it doesn't have ACID properties. So we cannot expect the degree of reliability what we get in the SQL database. Many were unfamiliar with this technology. Unlike the other commercial SQL databases here we won't get enough support for the product, since many of the NoSQL were only limited support. 5. SUMMARY Like Graph database, Key-value database, Big table Clones, Document database it has made a very big impact in the database field and most of them are Open source. So in my point of view I am sure that many will soon migrate towards NoSQL from SQL. So in the next two to four years we can expect a major change in the database field because of its scalability and its other features, but chances are less that it will replace the SQL databases. Each database has its Pros and Cons and it’s our duty to choose the right one.
  • 7. 6. REFERENCES [Accessed: 24 Feb2012] Slideshare.net (2010) NoSQL databases. [Online] Available at: http://www.slideshare.net/marin_dimitrov/nosql-databases-3584443 [Accessed: 24 Feb 2012] Perdue, T. (1998) NoSQL - An Overview of NoSQL. [online] Available at: http://newtech.about.com/od/databasemanagement/a/Nosql.htm [Accessed: 24 Feb 2012] Tiwari, S. (2011) Professional NoSQL. [e-book] Wrox Programmer to Programmer. Available through: Google Books http://books.google.co.uk/books?id=tv5iO9MnObUC&printsec=frontcover&dq=nosql&hl=en&sa=X &ei=5vw_T9CABMG_0QWtzqyPDw&ved=0CEQQ6AEwAg#v=onepage&q=nosql&f=false [Han, J. et al. , 2011] Han, J. et al. (2011)"Survey on NoSQL database," Pervasive Computing and Applications (ICPCA), 2011 6th International Conference on , vol., no., pp.363-366, 26-28 Oct. 2011 doi: 10.1109/ICPCA.2011.6106531 [Accessed: 4 Mar 2012] Slideshare.net (2010) NoSQL or not NoSQL? [Online] Available at: http://www.slideshare.net/ruflin/nosql-or-not-nosql [Accessed: 25 Feb 2012] Blogs.neotechnology.com (2009) NOSQL: scaling to size and scaling to complexity - Emil's Neo Thoughts. [Online] Available at: http://blogs.neotechnology.com/emil/2009/11/nosql-scaling-to-size-and-scaling-to-complexity.html [Accessed: 25 Feb 2012] Slideshare.net (2011) A NOSQL Overview And The Benefits Of Graph Databases (nosql east 2009). [Online] Available at: http://www.slideshare.net/emileifrem/nosql-east- a-nosql-overview-and-the-benefits-of-graph-databases [Accessed: 25 Feb 2012] Slideshare.net (2011) NOSQL for Dummies. [Online] Available at: http://www.slideshare.net/thobe/nosql-for-dummies Leavitt, N.; , "Will NoSQL Databases Live Up to Their Promise?," Computer , vol.43, no.2, pp.12-14, Feb. 2010 doi: 10.1109/MC.2010.58 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5410700&isnumber=5410692 [Accessed: 11 Mar 2012] Blog.nahurst.com (2010) Visual Guide to NoSQL Systems - Nathan Hurst's Blog. [Online] Available at: http://blog.nahurst.com/visual-guide-to-nosql-systems [Accessed: 11 Mar 2012] Datastax.com (2011) Cassandra Users | DataStax. [online] Available at: http://www.datastax.com/cassandrausers