SlideShare a Scribd company logo
1 of 63
Download to read offline
The CIO's Guide to
     NoSQL
    Dan McCreary
    July 12th 2012
        Version 6
Agenda
    • What is NoSQL?
    • What Triggered the NoSQL Movement?
    • How is NoSQL distinct from Big Data and Cloud
      Computing?
    • Common Characteristics of NoSQL System
    • Business Benefits of NoSQL
    • Core NoSQL Concepts
    • Selected NoSQL Implementations
    • Recent NoSQL Developments
    • Selecting the Right NoSQL System
    • Next Step: Selecting the Right NoSQL Pilot Project
M
    D                                                            2
                    Copyright Kelly-McCreary & Associates, LLC
Manning NoSQL Books




M
    D        Kelly-McCreary & Associates, LLC
                                                3
Background for Dan McCreary
                 • Bell Labs
                 • NeXT Computer (Steve Jobs)
                 • Owner of Custom Object-Oriented
                   Software Consultancy
                 • Federal data integration (National
                   Information Exchange Model)
                 • Native XML/XQuery – 2006
                 • Advocate of NoSQL/XRX systems
                 • Working with Manning
                   Publications on NoSQL Topic

M
    D                                                      4
              Copyright Kelly-McCreary & Associates, LLC
NoSQL Definition
    The NoSQL movement is a set of concepts
    and technologies that allow the rapid and
    efficient processing of large data sets with a
    focus on performance and resiliency.




M
    D                                                         5
                 Copyright Kelly-McCreary & Associates, LLC
Sample of NoSQL Jargon
        Document orientation                     Indexing
                                                 B-Tree
        Schema free
                                                 Configurable durability
        MapReduce                                Documents for archives
        Horizontal scaling                       Functional programming
        Sharding and auto-sharding               Document Transformation
                                                 Document Indexing and Search
        Brewer's CAP Theorem                     Alternate Query Languages
        Consistency                              Aggregates
        Reliability                              OLAP
                                                 XQuery
        Partition tolerance
                                                 MDX
        Single-point-of-failure                  RDF
        Object-Relational mapping                SPARQL
        Key-value stores                         Architecture Tradeoff Modeling
                                                 ATAM
        Column stores
        Document-stores
        Memcached                  Note that within the context of NoSQL many
                                  of these terms have different meanings!
M
    D                                                                             6
                            Copyright Kelly-McCreary & Associates, LLC
Selecting a Database…
    "Selecting the right data storage solution is
      no longer a trivial task."

                    Does it               Yes
        Start      look like
                                                       Use Microsoft
                  document?                               Office

                  No


                 Use the                                        Stop
                 RDBMS
M
    D                                                                  7
                   Copyright Kelly-McCreary & Associates, LLC
Pressures on SQL Only Systems

                                       Scalability




         OLAP/BI/Data
          Warehouse                 SQL                               Social
                                                                     Networks



                                         Agile
                                        Schema
                                          Free


M
    D                                                                           8
                        Copyright Kelly-McCreary & Associates, LLC
Simplicity is a Virtue




    • Many systems derive their strength by dramatically limiting the
      features in their system
    • Simplicity allows database designers to focus on the primary
      business driver
    • Examples:
        – Touch screen interfaces
        – Key-value data stores
M
    D                                                               9
                       Copyright Kelly-McCreary & Associates, LLC
Historical Context
         Mainframe Era                                      MapReduce Era




    •   1 CPU                                      •     10,000 CPUs
    •   COBOL and FORTRAN                          •     Functional programming
    •   Punchcards and flat files                  •     MapReduce "server farms"
    •   $10,000 per CPU hour                       •     Pennies per CPU hour
M
    D                      Copyright Kelly-McCreary & Associates, LLC
                                                                                    10
Two Approaches to Computation
                               1930s and 40s




              John Von Neumann                                          Alonzo Church


    Manage state with a program counter.            Make computations act like math functions.

         Which is simpler? Which is cheaper? Which will scale to 10,000 CPUs?
M
     D                                                                                     11
                             Copyright 2010 Dan McCreary & Associates
Standard vs. MapReduce Prices
                                             John's Way     Alonzo's Way




M
                       http://aws.amazon.com/elasticmapreduce/#pricing

    D                                                                      12
               Copyright Kelly-McCreary & Associates, LLC
MapReduce CPUs Cost Less!
           40
                Cost Per CPU Hour (Cents)
           35
           30
           25
           20
           15
           10
            5
            0
                 Standard MapReduce                 Cuts cost from 32 to 6 cents per CPU hour!
                   CPU      CPU                     Perhaps Alanzo was right!


        Why? (hint: how "shareable" is this process)
M                                                      http://aws.amazon.com/elasticmapreduce/#pricing

    D                                                                                                    13
                            Copyright Kelly-McCreary & Associates, LLC
Perspectives


                   Object                             OLAP
        Native     Stores                             MDX
         XML
                 NoSQL for                       Graph
                  Web 2.0                        Stores
                    and
                  BigData
M                                         Perspective depends on your context
    D              Kelly-McCreary & Associates, LLC
                                                                                14
Architectural Tradeoffs

              "I want a fast car with good mileage."




        "I want a scaleable database with low cost that runs
        well on the 1,000 CPUs in our data center."
M
    D                    Kelly-McCreary & Associates, LLC
                                                               15
NoSQL on Google Trends




                                                !




M
    D                                               16
             Kelly-McCreary & Associates, LLC
Recent History
    • The term NoSQL became re-popularized
      around 2009
    • Used for conferences of advocates of non-
      relational databases
    • Became a contagious idea "meme"
    • First of many "NoSQL meetups" in San
      Francisco organized by Jon Oskarsson
    • Conversion from "No SQL" to "Not Only
      SQL" in recent year
M
    D                                               17
                 Kelly-McCreary & Associates, LLC
NoSQL and Web 2.0 Startups
    • Many web 2.0 startups did not use Oracle
      or MySQL
    • They built their own data stores influenced
      by Amazon’s Dynamo and Google’s
      BigTable in order to store and process
      huge amounts of data
    • In the social community or cloud
      computing applications, most of these data
      stores became OpenSource software
M
    D                                                18
                  Kelly-McCreary & Associates, LLC
Google MapReduce




    • 2004 paper that had huge impact of
      functional programming in the entire
      community
    • Copied by many organizations, including
      Yahoo
M
    D                                                        19
                Copyright Kelly-McCreary & Associates, LLC
Google Bigtable Paper




    • 2006 paper that gave focus to scaleable
      databases
    • designed to reliably scale to petabytes of
      data and thousands of machines
M
    D                                                         20
                 Copyright Kelly-McCreary & Associates, LLC
Amazon's Dynamo Paper
                                                           • Werner Vogels
                                                           • CTO - Amazon.com
                                                           • October 2, 2007
                                                           • Used to power Amazon's
                                                             S3 service
                                                           • One of the most
                                                             influential papers in the
                                                             NoSQL movement
                                                           • Service in 2012
    Giuseppe DeCandia, Deniz Hastorun, Madan Jampani, Gunavardhan Kakulapati, Avinash Lakshman, Alex Pilchin,
    Swami Sivasubramanian, Peter Vosshall and Werner Vogels, “Dynamo: Amazon's Highly Available Key-Value Store”,
    in the Proceedings of the 21st ACM Symposium on Operating Systems Principles, Stevenson, WA, October 2007.


M
      D                                                                                                             21
                                    Copyright Kelly-McCreary & Associates, LLC
NoSQL "Meetups"
    “NoSQLers came to share how they had
    overthrown the tyranny of slow, expensive
    relational databases in favor of more
    efficient and cheaper ways of managing
    data.”



          Computerworld magazine, July 1st, 2009
M
    D                                                22
                  Kelly-McCreary & Associates, LLC
Key Motivators
    • Licensing RDBMS on multiple CPUs
    • The Thee "V"s
        – Velocity – lots of data arriving fast
        – Volume – web-scale BigData
        – Variability – many exceptions
    • Desire to escape rigid schema design
    • Avoidance of complex Object-Relational
      Mapping (the "Vietnam" of computer
      science)
M
    D                                                    23
                      Kelly-McCreary & Associates, LLC
Many Processes Today Are Driven By…

        The constraints of yesterday…




    Challenge:
        Ask ourselves the question…
        Do our current method of solving problems with tabular data…
        Reflect the storage of the 1950s…
        Or our actual business requirements?
        What structures best solve the actual business problem?
M
                                                                       24
    D
                        Copyright 2008 Dan McCreary & Associates
No-Shredding!


                                My
                               Data
    • Relational databases take a single hierarchical document and
      shred it into many pieces so it will fit in tabular structures
    • Document stores prevent this shredding

M
                                                                       25
    D
                       Copyright 2008 Dan McCreary & Associates
Is Shredding Really Necessary?
                           • Every time you take
                             hierarchical data and
                             put it into a traditional
                             database you have to
                             put repeating groups in
                             separate tables and
                             use SQL “joins” to
                             reassemble the data

M
                                                          26
    D
               Copyright 2008 Dan McCreary & Associates
Object Relational Mapping
                           T1                            T2



                           T4                            T3
                                                              Relational
            Web Browser              Object Middle
                                                              Database
                                         Tier


        •    T1 – HTML into Objects
        •    T2 –Objects into SQL Tables
        •    T3 – Tables into Objects
        •    T4 – Objects into HTML
M
    D                                                                      27
                          Kelly-McCreary & Associates, LLC
"The Vietnam of Applications"
    • Object-relational mapping has become one of
      the most complex components of building
      applications today
    • A "Quagmire" where many projects get lost
    • Many "heroic efforts" have been made to
      solve the problem:
        – Hibernate
        – Ruby on Rails
    • But sometimes the way to avoid complexity is
      to keep your architecture very simple
M
    D                                                           28
                   Copyright Kelly-McCreary & Associates, LLC
Document Stores Need No Translation


               Document                                           Document


           Application Layer                                       Database

           •   Documents in the database (JSON or XML)
           •   Documents in the application
           •   No object middle tier
           •   No "shredding"
           •   No reassembly
           •   Simple!
M
                                                                              29
    D
                       Copyright 2010 Dan McCreary & Associates
The XML "Full Stack"

                XForms               REST-Interfaces


               Web Browser                                          XML database

        •   XML lives in the web browser (XForms)
        •   REST interfaces
        •   XML in the database (Native XML, XQuery)
        •   XRX Web Application Architecture
        •   No translation!

M
                                                                                   30
    D
                         Copyright 2010 Dan McCreary & Associates
"Schema Free"
    • Systems that automatically determine how to
      index data as the data is loaded into the
      database
    • No a priori knowledge of data structure
    • No need for up-front logical data modeling
        – …but some modeling is still critical
    • Adding new data elements or changing data
      elements is not disruptive
    • Searching millions of records still has sub-
      second response time
M
                                                               31
    D
                    Copyright 2010 Dan McCreary & Associates
Monoculture and Mono-architecture




M                                                           Image Source: Wikipedia
                                                                                      32
    D
                 Copyright 2010 Dan McCreary & Associates
Eric Evans

        “The whole point of seeking alternatives
        [to RDBMS systems] is that you need to
        solve a problem that relational databases
        are a bad fit for.”


                                                       Eric Evans
                                                       Rackspace
M
    D                                                               33
                    Kelly-McCreary & Associates, LLC
Evolution of Ideas in OpenSource
     New Database Ideas                                                           New Products
                                                    Proprietary Software


                                                                                    Product A

                Schema-free

                                                                                    Product B
                                              OpenSource
Auto-sharding                 MapReduce
                                                                                    Product B

              Cloud Computing


     •       How quickly can new ideas be recombined into new database products?
     •       OpenSource software has proved to be the most efficient way to quickly
             recombine new ideas into new products
M
         D                                                                                       34
                                     Copyright Kelly-McCreary & Associates, LLC
Storage Architectural Patterns
    Tables                               Trees




                                 Stars
    Triples




M
     D
    35           Copyright 2010 Dan McCreary & Associates
Finding the Right Match
        Schema-Free




                                      Standards Compliant




                                           Mature Query Language



            Use CMU's Architectural Tradeoff and Modeling (ATAM) Process
M
                                                                           36
    D             Copyright 2010 Dan McCreary & Associates
Avoidance of Unneeded Complexity
    • Relational databases provide a variety of
      features to ALWAYS support strict data
      consistency
    • Rich feature set and the ACID properties
      implemented by RDBMSs might be more
      than necessary for particular applications
      and use cases


M
    D                                                37
                  Kelly-McCreary & Associates, LLC
"Once Size Fits…"

    "One Size Does Not Fit All"
         James Hamilton Nov. 3rd, 2009




        http://perspectives.mvdirona.com/CommentView,guid,afe46691-a293-4f9a-8900-5688a597726a.aspx


M
    D                                                                                          38
                                   Kelly-McCreary & Associates, LLC
Different Thinking
        Sequential Processing                                Parallel Processing




    • The output of any step
      can be used in the                       • Each loop of XQuery FLOWR
      next step                                  statements are independent
    • State must be carefully                    thread (no side-effects)
      managed
M
    D                           Kelly-McCreary & Associates, LLC
                                                                                   39
Cloud Computing
    • High scalability
        – Especially in the horizontal direction (multi
          CPUs)
    • Low administration overhead
        – Simple web page administration




M
    D                                                     40
                      Kelly-McCreary & Associates, LLC
Databases work well in the cloud
    • Data warehousing specific databases for
      batch data processing and map/reduce
      operations
    • Simple, scalable and fast key/value-stores
    • Databases containing a richer feature set
      than key/value-stores fitting the gap with
      traditional
    • RDBMS while offering good performance and
      scalability properties (such as document
      databases).
M
    D                                                41
                  Kelly-McCreary & Associates, LLC
Auto-Sharding
     • When one database gets almost full it tells a "coordinator" system
       and the data automatically gets migrated to other systems
     • Systems have "Partition Tolerance"



                                                                          Warning Disk Full!
    Before: one disk 90% full:
                                                                          Time to "Shard"




    After: two disks 45% full:



M
      D                                                                                        42
                             Copyright Kelly-McCreary & Associates, LLC
Brewer's CAP Theorem

                             Consistency




                        You can not
                       have all three
                        so pick two!

        Availability                                       Partition Tolerance
M
    D                   Kelly-McCreary & Associates, LLC
                                                                                 43
Migrating to Partition Tolarance
                         Consistency




              CA                                           CP
                       RDBMS




    Availability                 AP                      Partition Tolerance
M
    D                                                                          44
                   Copyright Kelly-McCreary & Associates, LLC
Scale Up vs. Scale Out




           Scale Up                                              Scale Out
    • Make a single CPU as fast as                • Make Many CPUs work
      possible                                      together
    • Increase clock speed                        • Learn how to divide your
    • Add RAM                                       problems into independent
    • Make disk I/O go faster                       threads
M
    D                   Copyright Kelly-McCreary & Associates, LLC
                                                                             45
Sample of NO-SQL Systems
                                                      Document Stores
    Key-Value Stores
      Memcache


                                                                        XML


     Column Stores



      Graph Stores

         Object Stores



M
                                                                              46
     D
                         Copyright 2010 Dan McCreary & Associates
If you can't beat them…




M
    D         Kelly-McCreary & Associates, LLC
                                                 47
Key Value Stores
        Key   Value
                           • A table with two columns
                             and a simple interface
                                  – Add a key-value
                                  – For this key, give me the
                                    value
                                  – Delete a key
                           • Blazingly fast and easy to
                             scale

M
    D                                                              48
                      Copyright Kelly-McCreary & Associates, LLC
Types of Key-Value Stores
    •   Eventually‐consistent Key‐Value store
    •   Hierarchical Key-Value Stores
    •   Key-Value Stores In RAM
    •   Key Value Stores on Disk
    •   Ordered Key-Value Stores




M
    D                                                          49
                  Copyright Kelly-McCreary & Associates, LLC
Cassendra

    • Apache open source project
    • Originally developed by Facebook
    • Designed for highly distributed high-
      reliable systems
    • No single point of failure
    • Column-family data model

        http://www.cs.cornell.edu/projects/ladis2009/papers/lakshman-ladis2009.pdf

M
    D                                                                            50
                          Copyright Kelly-McCreary & Associates, LLC
MongoDB
    •   Open Source License
    •   Document/Collection centric
    •   Sharding built-in, automatic
    •   Stores data in JSON format
    •   Query language is JSON
    •   Can be 10x faster than MySQL
    •   Many languages (C++, JavaScript, Java,
        Perl, Python etc.)
M
    D                                                         51
                 Copyright Kelly-McCreary & Associates, LLC
Hadoop/Hbase
    • Open source implementation of
      MapReduce algorithm written in Java
    • Initially created by Yahoo
        – 300 person-years development
    • Column-oriented data store similar to
      Google's BigTable
    • Java interface
    • H-Base designed specifically to work with
      Hadoop and the Hadoop file system
M
    D                                                          52
                  Copyright Kelly-McCreary & Associates, LLC
CouchDB


    •   Commercial Company
    •   Apache Project
    •   Written in ERLANG
    •   RESTful JSON API
    •   Distributed, featuring robust, incremental
        replication with bi-directional conflict
        detection and management
M
    D                                                          53
                  Copyright Kelly-McCreary & Associates, LLC
Memcached



    • Free & open source in-memory caching system
    • Designed to speeding up dynamic web applications by
      alleviating database load
    • RAM resident key-value store for small chunks of arbitrary
      data (strings, objects) from results of database calls, API calls,
      or page rendering
    • Simple interface
    • Designed for quick deployment, ease of development
    • APIs in many languages

M
    D                                                                 54
                       Copyright Kelly-McCreary & Associates, LLC
MarkLogic
    • Native XML database designed to used by
      Petabyte data stores
    • ACID compliant
    • Role-based access control
    • Heavy use by federal agencies, document
      publishers and "high-variability" data
    • Arguably the most successful NoSQL
      company
M
    D                                                       55
               Copyright Kelly-McCreary & Associates, LLC
eXist
    • OpenSource native XML database
    • Strong support for XQuery and XQuery
      extensions
    • Heavily used by the Text Encoding Initiative
      (TEI) community and XRX/XForms communities
    • Ideal for metadata management
    • Integrated Lucene search and structured search



M
    D                                                         56
                 Copyright Kelly-McCreary & Associates, LLC
Riak
    •   Community and Commercial licenses
    •   A "Dynamo-inspired" database
    •   Written in ERLANG
    •   Query JSON or ERLANG




M
    D                                                         57
                 Copyright Kelly-McCreary & Associates, LLC
Hypertable
    • Open Source
    • Closely modeled after Google's Bigtable
      project
    • High performance distributed data storage
      system
    • Designed to support applications requiring
      maximum performance, scalability, and
      reliability
    • Hypertable Query Language (HQL) that is
      syntactically similar to SQL
M
    D                                                         58
                 Copyright Kelly-McCreary & Associates, LLC
Selecting a NoSQL Pilot Project
                                          • The "Goldilocks Pilot
                                            Project Strategy"
                                          • Not to big, not to
                                            small, just the right
                                            size
                                          • Duration
                                          • Sponsorship
                                          • Importance
                                          • Skills
                                          • Mentorship

M
                                                                    59
    D
               Copyright 2010 Dan McCreary & Associates
The Future of the NoSQL Movement
                    Growth                                                Diversity




        • Will data sets continue to grow at exponential rates?
        • Will new system options become more diverse?
        • Will new markets have different demands?
        • Will some ideas be "absorbed" into existing RDBMS vendors
          products?
        • Will the NoSQL community continue to be the place where new
          database ideas and products are incubated?
        • Will the job of doing high-quality architectural tradeoffs analysis
M         become easier?
    D                                                                                 60
                             Copyright Kelly-McCreary & Associates, LLC
Using the Wrong Architecture




        Start                                                       Finish




                Credit: Isaac Homelund – MN Office of the Revisor
M
    D
Using the Right Architecture



                                                             Finish
        Start




                Find ways to remove barriers to empowering
                the non programmers on your team.




M
    D
Questions
    Dan McCreary
    President, Kelly-McCreary & Associates
    dan@danmccreary.com




M
    D                                               63
                 Kelly-McCreary & Associates, LLC

More Related Content

What's hot

An Active and Hybrid Storage System for Data-intensive Applications
An Active and Hybrid Storage System for Data-intensive ApplicationsAn Active and Hybrid Storage System for Data-intensive Applications
An Active and Hybrid Storage System for Data-intensive ApplicationsXiao Qin
 
Analytics on Hadoop
Analytics on HadoopAnalytics on Hadoop
Analytics on HadoopEMC
 
Architecting Virtualized Infrastructure for Big Data
Architecting Virtualized Infrastructure for Big DataArchitecting Virtualized Infrastructure for Big Data
Architecting Virtualized Infrastructure for Big DataRichard McDougall
 
No sql and data scalability
No sql and data scalabilityNo sql and data scalability
No sql and data scalabilityRoger Xia
 
Building Big Data Applications
Building Big Data ApplicationsBuilding Big Data Applications
Building Big Data ApplicationsRichard McDougall
 
Lucene in the Cloud: Learn how GCE leveraged the power of search and Big Data...
Lucene in the Cloud: Learn how GCE leveraged the power of search and Big Data...Lucene in the Cloud: Learn how GCE leveraged the power of search and Big Data...
Lucene in the Cloud: Learn how GCE leveraged the power of search and Big Data...lucenerevolution
 
Seneca, Pittsburgh Supercomputer, and LSI
Seneca, Pittsburgh Supercomputer, and LSI Seneca, Pittsburgh Supercomputer, and LSI
Seneca, Pittsburgh Supercomputer, and LSI Jan Robin
 
Microsoft SQL Azure - Agility in the New Economy Technical Datasheet
Microsoft SQL Azure - Agility in the New Economy Technical DatasheetMicrosoft SQL Azure - Agility in the New Economy Technical Datasheet
Microsoft SQL Azure - Agility in the New Economy Technical DatasheetMicrosoft Private Cloud
 
ClassCloud: switch your PC Classroom into Cloud Testbed
ClassCloud: switch your PC Classroom into Cloud TestbedClassCloud: switch your PC Classroom into Cloud Testbed
ClassCloud: switch your PC Classroom into Cloud TestbedJazz Yao-Tsung Wang
 
DDN Accelerating-Decisions-Through-Enterprise-Hadoop-final
DDN Accelerating-Decisions-Through-Enterprise-Hadoop-finalDDN Accelerating-Decisions-Through-Enterprise-Hadoop-final
DDN Accelerating-Decisions-Through-Enterprise-Hadoop-finalIntelHealthcare
 
Accelerating micro strategy for real time bi
Accelerating micro strategy for real time biAccelerating micro strategy for real time bi
Accelerating micro strategy for real time biKognitio
 
Idc Reducing It Costs With Blades
Idc Reducing It Costs With BladesIdc Reducing It Costs With Blades
Idc Reducing It Costs With Bladespankaj009
 
Erlang Cache
Erlang CacheErlang Cache
Erlang Cacheice j
 
Apachecon Euro 2012: Elastic, Multi-tenant Hadoop on Demand
Apachecon Euro 2012: Elastic, Multi-tenant Hadoop on DemandApachecon Euro 2012: Elastic, Multi-tenant Hadoop on Demand
Apachecon Euro 2012: Elastic, Multi-tenant Hadoop on DemandRichard McDougall
 
SQL Server Data Mining - Taking your Application Design to the Next Level
SQL Server Data Mining - Taking your Application Design to the Next LevelSQL Server Data Mining - Taking your Application Design to the Next Level
SQL Server Data Mining - Taking your Application Design to the Next LevelMark Ginnebaugh
 
Ramakrishnan Keynote Ladis2009
Ramakrishnan Keynote Ladis2009Ramakrishnan Keynote Ladis2009
Ramakrishnan Keynote Ladis2009yarapavan
 

What's hot (18)

An Active and Hybrid Storage System for Data-intensive Applications
An Active and Hybrid Storage System for Data-intensive ApplicationsAn Active and Hybrid Storage System for Data-intensive Applications
An Active and Hybrid Storage System for Data-intensive Applications
 
Analytics on Hadoop
Analytics on HadoopAnalytics on Hadoop
Analytics on Hadoop
 
Architecting Virtualized Infrastructure for Big Data
Architecting Virtualized Infrastructure for Big DataArchitecting Virtualized Infrastructure for Big Data
Architecting Virtualized Infrastructure for Big Data
 
No sql and data scalability
No sql and data scalabilityNo sql and data scalability
No sql and data scalability
 
Building Big Data Applications
Building Big Data ApplicationsBuilding Big Data Applications
Building Big Data Applications
 
Lucene in the Cloud: Learn how GCE leveraged the power of search and Big Data...
Lucene in the Cloud: Learn how GCE leveraged the power of search and Big Data...Lucene in the Cloud: Learn how GCE leveraged the power of search and Big Data...
Lucene in the Cloud: Learn how GCE leveraged the power of search and Big Data...
 
Seneca, Pittsburgh Supercomputer, and LSI
Seneca, Pittsburgh Supercomputer, and LSI Seneca, Pittsburgh Supercomputer, and LSI
Seneca, Pittsburgh Supercomputer, and LSI
 
Microsoft SQL Azure - Agility in the New Economy Technical Datasheet
Microsoft SQL Azure - Agility in the New Economy Technical DatasheetMicrosoft SQL Azure - Agility in the New Economy Technical Datasheet
Microsoft SQL Azure - Agility in the New Economy Technical Datasheet
 
ClassCloud: switch your PC Classroom into Cloud Testbed
ClassCloud: switch your PC Classroom into Cloud TestbedClassCloud: switch your PC Classroom into Cloud Testbed
ClassCloud: switch your PC Classroom into Cloud Testbed
 
DDN Accelerating-Decisions-Through-Enterprise-Hadoop-final
DDN Accelerating-Decisions-Through-Enterprise-Hadoop-finalDDN Accelerating-Decisions-Through-Enterprise-Hadoop-final
DDN Accelerating-Decisions-Through-Enterprise-Hadoop-final
 
Accelerating micro strategy for real time bi
Accelerating micro strategy for real time biAccelerating micro strategy for real time bi
Accelerating micro strategy for real time bi
 
Big data and cloud
Big data and cloudBig data and cloud
Big data and cloud
 
Idc Reducing It Costs With Blades
Idc Reducing It Costs With BladesIdc Reducing It Costs With Blades
Idc Reducing It Costs With Blades
 
Erlang Cache
Erlang CacheErlang Cache
Erlang Cache
 
Apachecon Euro 2012: Elastic, Multi-tenant Hadoop on Demand
Apachecon Euro 2012: Elastic, Multi-tenant Hadoop on DemandApachecon Euro 2012: Elastic, Multi-tenant Hadoop on Demand
Apachecon Euro 2012: Elastic, Multi-tenant Hadoop on Demand
 
SQL Server Data Mining - Taking your Application Design to the Next Level
SQL Server Data Mining - Taking your Application Design to the Next LevelSQL Server Data Mining - Taking your Application Design to the Next Level
SQL Server Data Mining - Taking your Application Design to the Next Level
 
Ramakrishnan Keynote Ladis2009
Ramakrishnan Keynote Ladis2009Ramakrishnan Keynote Ladis2009
Ramakrishnan Keynote Ladis2009
 
Greenplum hadoop
Greenplum hadoopGreenplum hadoop
Greenplum hadoop
 

Viewers also liked

LDM Slides: How Data Modeling Fits into an Overall Enterprise Architecture
LDM Slides: How Data Modeling Fits into an Overall Enterprise ArchitectureLDM Slides: How Data Modeling Fits into an Overall Enterprise Architecture
LDM Slides: How Data Modeling Fits into an Overall Enterprise ArchitectureDATAVERSITY
 
Mastering DevOps With Oracle
Mastering DevOps With OracleMastering DevOps With Oracle
Mastering DevOps With OracleKelly Goetsch
 
DI&A Slides: Data Lake vs. Data Warehouse
DI&A Slides: Data Lake vs. Data WarehouseDI&A Slides: Data Lake vs. Data Warehouse
DI&A Slides: Data Lake vs. Data WarehouseDATAVERSITY
 
Data-Ed Slides: Data Architecture Strategies - Constructing Your Data Garden
Data-Ed Slides: Data Architecture Strategies - Constructing Your Data GardenData-Ed Slides: Data Architecture Strategies - Constructing Your Data Garden
Data-Ed Slides: Data Architecture Strategies - Constructing Your Data GardenDATAVERSITY
 
RWDG Webinar: The New Non-Invasive Data Governance Framework
RWDG Webinar: The New Non-Invasive Data Governance FrameworkRWDG Webinar: The New Non-Invasive Data Governance Framework
RWDG Webinar: The New Non-Invasive Data Governance FrameworkDATAVERSITY
 
Smart Data Webinar: Artificial General Intelligence - When Can I Get It?
Smart Data Webinar: Artificial General Intelligence - When Can I Get It?Smart Data Webinar: Artificial General Intelligence - When Can I Get It?
Smart Data Webinar: Artificial General Intelligence - When Can I Get It?DATAVERSITY
 

Viewers also liked (6)

LDM Slides: How Data Modeling Fits into an Overall Enterprise Architecture
LDM Slides: How Data Modeling Fits into an Overall Enterprise ArchitectureLDM Slides: How Data Modeling Fits into an Overall Enterprise Architecture
LDM Slides: How Data Modeling Fits into an Overall Enterprise Architecture
 
Mastering DevOps With Oracle
Mastering DevOps With OracleMastering DevOps With Oracle
Mastering DevOps With Oracle
 
DI&A Slides: Data Lake vs. Data Warehouse
DI&A Slides: Data Lake vs. Data WarehouseDI&A Slides: Data Lake vs. Data Warehouse
DI&A Slides: Data Lake vs. Data Warehouse
 
Data-Ed Slides: Data Architecture Strategies - Constructing Your Data Garden
Data-Ed Slides: Data Architecture Strategies - Constructing Your Data GardenData-Ed Slides: Data Architecture Strategies - Constructing Your Data Garden
Data-Ed Slides: Data Architecture Strategies - Constructing Your Data Garden
 
RWDG Webinar: The New Non-Invasive Data Governance Framework
RWDG Webinar: The New Non-Invasive Data Governance FrameworkRWDG Webinar: The New Non-Invasive Data Governance Framework
RWDG Webinar: The New Non-Invasive Data Governance Framework
 
Smart Data Webinar: Artificial General Intelligence - When Can I Get It?
Smart Data Webinar: Artificial General Intelligence - When Can I Get It?Smart Data Webinar: Artificial General Intelligence - When Can I Get It?
Smart Data Webinar: Artificial General Intelligence - When Can I Get It?
 

Similar to CIO Guide to NoSQL Databases

Accelerating big data with ioMemory and Cisco UCS and NOSQL
Accelerating big data with ioMemory and Cisco UCS and NOSQLAccelerating big data with ioMemory and Cisco UCS and NOSQL
Accelerating big data with ioMemory and Cisco UCS and NOSQLSumeet Bansal
 
Agile NoSQL With XRX
Agile NoSQL With XRXAgile NoSQL With XRX
Agile NoSQL With XRXDATAVERSITY
 
Big Challenges in Data Modeling: NoSQL and Data Modeling
Big Challenges in Data Modeling: NoSQL and Data ModelingBig Challenges in Data Modeling: NoSQL and Data Modeling
Big Challenges in Data Modeling: NoSQL and Data ModelingDATAVERSITY
 
NoSQL Now! NoSQL Architecture Patterns
NoSQL Now! NoSQL Architecture PatternsNoSQL Now! NoSQL Architecture Patterns
NoSQL Now! NoSQL Architecture PatternsDATAVERSITY
 
The CIOs Guide to NoSQL
The CIOs Guide to NoSQLThe CIOs Guide to NoSQL
The CIOs Guide to NoSQLDATAVERSITY
 
Navigating NoSQL in cloudy skies
Navigating NoSQL in cloudy skiesNavigating NoSQL in cloudy skies
Navigating NoSQL in cloudy skiesshnkr_rmchndrn
 
Database Revolution - Exploratory Webcast
Database Revolution - Exploratory WebcastDatabase Revolution - Exploratory Webcast
Database Revolution - Exploratory WebcastInside Analysis
 
Database revolution opening webcast 01 18-12
Database revolution opening webcast 01 18-12Database revolution opening webcast 01 18-12
Database revolution opening webcast 01 18-12mark madsen
 
NoSQLDatabases
NoSQLDatabasesNoSQLDatabases
NoSQLDatabasesAdi Challa
 
AWS case study: real estate portal
AWS case study: real estate portalAWS case study: real estate portal
AWS case study: real estate portalAndreas Chatzakis
 
The Art & Sience of Optimization
The Art & Sience of OptimizationThe Art & Sience of Optimization
The Art & Sience of OptimizationHertzel Karbasi
 
Scaling Your Database in the Cloud
Scaling Your Database in the CloudScaling Your Database in the Cloud
Scaling Your Database in the CloudRightScale
 
Cloud Computing Tutorial - Jens Nimis
Cloud Computing Tutorial - Jens NimisCloud Computing Tutorial - Jens Nimis
Cloud Computing Tutorial - Jens NimisJensNimis
 
Towards CloudML, a Model-Based Approach to Provision Resources in the Clouds
Towards CloudML, a Model-Based Approach  to Provision Resources in the CloudsTowards CloudML, a Model-Based Approach  to Provision Resources in the Clouds
Towards CloudML, a Model-Based Approach to Provision Resources in the CloudsSébastien Mosser
 
Enterprise Architecture
Enterprise ArchitectureEnterprise Architecture
Enterprise ArchitectureRaman Kannan
 
The Microsoft BigData Story
The Microsoft BigData StoryThe Microsoft BigData Story
The Microsoft BigData StoryLynn Langit
 
The IBM Netezza datawarehouse appliance
The IBM Netezza datawarehouse applianceThe IBM Netezza datawarehouse appliance
The IBM Netezza datawarehouse applianceIBM Danmark
 

Similar to CIO Guide to NoSQL Databases (20)

Accelerating big data with ioMemory and Cisco UCS and NOSQL
Accelerating big data with ioMemory and Cisco UCS and NOSQLAccelerating big data with ioMemory and Cisco UCS and NOSQL
Accelerating big data with ioMemory and Cisco UCS and NOSQL
 
Agile NoSQL With XRX
Agile NoSQL With XRXAgile NoSQL With XRX
Agile NoSQL With XRX
 
Big Challenges in Data Modeling: NoSQL and Data Modeling
Big Challenges in Data Modeling: NoSQL and Data ModelingBig Challenges in Data Modeling: NoSQL and Data Modeling
Big Challenges in Data Modeling: NoSQL and Data Modeling
 
NoSQL Now! NoSQL Architecture Patterns
NoSQL Now! NoSQL Architecture PatternsNoSQL Now! NoSQL Architecture Patterns
NoSQL Now! NoSQL Architecture Patterns
 
The CIOs Guide to NoSQL
The CIOs Guide to NoSQLThe CIOs Guide to NoSQL
The CIOs Guide to NoSQL
 
Navigating NoSQL in cloudy skies
Navigating NoSQL in cloudy skiesNavigating NoSQL in cloudy skies
Navigating NoSQL in cloudy skies
 
Database Revolution - Exploratory Webcast
Database Revolution - Exploratory WebcastDatabase Revolution - Exploratory Webcast
Database Revolution - Exploratory Webcast
 
Database revolution opening webcast 01 18-12
Database revolution opening webcast 01 18-12Database revolution opening webcast 01 18-12
Database revolution opening webcast 01 18-12
 
NoSQLDatabases
NoSQLDatabasesNoSQLDatabases
NoSQLDatabases
 
No sql
No sqlNo sql
No sql
 
AWS case study: real estate portal
AWS case study: real estate portalAWS case study: real estate portal
AWS case study: real estate portal
 
Azure and cloud design patterns
Azure and cloud design patternsAzure and cloud design patterns
Azure and cloud design patterns
 
The Art & Sience of Optimization
The Art & Sience of OptimizationThe Art & Sience of Optimization
The Art & Sience of Optimization
 
Scaling Your Database in the Cloud
Scaling Your Database in the CloudScaling Your Database in the Cloud
Scaling Your Database in the Cloud
 
Cloud Computing Tutorial - Jens Nimis
Cloud Computing Tutorial - Jens NimisCloud Computing Tutorial - Jens Nimis
Cloud Computing Tutorial - Jens Nimis
 
Towards CloudML, a Model-Based Approach to Provision Resources in the Clouds
Towards CloudML, a Model-Based Approach  to Provision Resources in the CloudsTowards CloudML, a Model-Based Approach  to Provision Resources in the Clouds
Towards CloudML, a Model-Based Approach to Provision Resources in the Clouds
 
Enterprise Architecture
Enterprise ArchitectureEnterprise Architecture
Enterprise Architecture
 
The Microsoft BigData Story
The Microsoft BigData StoryThe Microsoft BigData Story
The Microsoft BigData Story
 
No sql database
No sql databaseNo sql database
No sql database
 
The IBM Netezza datawarehouse appliance
The IBM Netezza datawarehouse applianceThe IBM Netezza datawarehouse appliance
The IBM Netezza datawarehouse appliance
 

More from DATAVERSITY

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...DATAVERSITY
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceDATAVERSITY
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data LiteracyDATAVERSITY
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for YouDATAVERSITY
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?DATAVERSITY
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?DATAVERSITY
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling FundamentalsDATAVERSITY
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectDATAVERSITY
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at ScaleDATAVERSITY
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?DATAVERSITY
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...DATAVERSITY
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsDATAVERSITY
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayDATAVERSITY
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise AnalyticsDATAVERSITY
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best PracticesDATAVERSITY
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?DATAVERSITY
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best PracticesDATAVERSITY
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageDATAVERSITY
 

More from DATAVERSITY (20)

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data Literacy
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for You
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic Project
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at Scale
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and Forwards
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement Today
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best Practices
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive Advantage
 

Recently uploaded

Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 

Recently uploaded (20)

Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 

CIO Guide to NoSQL Databases

  • 1. The CIO's Guide to NoSQL Dan McCreary July 12th 2012 Version 6
  • 2. Agenda • What is NoSQL? • What Triggered the NoSQL Movement? • How is NoSQL distinct from Big Data and Cloud Computing? • Common Characteristics of NoSQL System • Business Benefits of NoSQL • Core NoSQL Concepts • Selected NoSQL Implementations • Recent NoSQL Developments • Selecting the Right NoSQL System • Next Step: Selecting the Right NoSQL Pilot Project M D 2 Copyright Kelly-McCreary & Associates, LLC
  • 3. Manning NoSQL Books M D Kelly-McCreary & Associates, LLC 3
  • 4. Background for Dan McCreary • Bell Labs • NeXT Computer (Steve Jobs) • Owner of Custom Object-Oriented Software Consultancy • Federal data integration (National Information Exchange Model) • Native XML/XQuery – 2006 • Advocate of NoSQL/XRX systems • Working with Manning Publications on NoSQL Topic M D 4 Copyright Kelly-McCreary & Associates, LLC
  • 5. NoSQL Definition The NoSQL movement is a set of concepts and technologies that allow the rapid and efficient processing of large data sets with a focus on performance and resiliency. M D 5 Copyright Kelly-McCreary & Associates, LLC
  • 6. Sample of NoSQL Jargon Document orientation Indexing B-Tree Schema free Configurable durability MapReduce Documents for archives Horizontal scaling Functional programming Sharding and auto-sharding Document Transformation Document Indexing and Search Brewer's CAP Theorem Alternate Query Languages Consistency Aggregates Reliability OLAP XQuery Partition tolerance MDX Single-point-of-failure RDF Object-Relational mapping SPARQL Key-value stores Architecture Tradeoff Modeling ATAM Column stores Document-stores Memcached Note that within the context of NoSQL many of these terms have different meanings! M D 6 Copyright Kelly-McCreary & Associates, LLC
  • 7. Selecting a Database… "Selecting the right data storage solution is no longer a trivial task." Does it Yes Start look like Use Microsoft document? Office No Use the Stop RDBMS M D 7 Copyright Kelly-McCreary & Associates, LLC
  • 8. Pressures on SQL Only Systems Scalability OLAP/BI/Data Warehouse SQL Social Networks Agile Schema Free M D 8 Copyright Kelly-McCreary & Associates, LLC
  • 9. Simplicity is a Virtue • Many systems derive their strength by dramatically limiting the features in their system • Simplicity allows database designers to focus on the primary business driver • Examples: – Touch screen interfaces – Key-value data stores M D 9 Copyright Kelly-McCreary & Associates, LLC
  • 10. Historical Context Mainframe Era MapReduce Era • 1 CPU • 10,000 CPUs • COBOL and FORTRAN • Functional programming • Punchcards and flat files • MapReduce "server farms" • $10,000 per CPU hour • Pennies per CPU hour M D Copyright Kelly-McCreary & Associates, LLC 10
  • 11. Two Approaches to Computation 1930s and 40s John Von Neumann Alonzo Church Manage state with a program counter. Make computations act like math functions. Which is simpler? Which is cheaper? Which will scale to 10,000 CPUs? M D 11 Copyright 2010 Dan McCreary & Associates
  • 12. Standard vs. MapReduce Prices John's Way Alonzo's Way M http://aws.amazon.com/elasticmapreduce/#pricing D 12 Copyright Kelly-McCreary & Associates, LLC
  • 13. MapReduce CPUs Cost Less! 40 Cost Per CPU Hour (Cents) 35 30 25 20 15 10 5 0 Standard MapReduce Cuts cost from 32 to 6 cents per CPU hour! CPU CPU Perhaps Alanzo was right! Why? (hint: how "shareable" is this process) M http://aws.amazon.com/elasticmapreduce/#pricing D 13 Copyright Kelly-McCreary & Associates, LLC
  • 14. Perspectives Object OLAP Native Stores MDX XML NoSQL for Graph Web 2.0 Stores and BigData M Perspective depends on your context D Kelly-McCreary & Associates, LLC 14
  • 15. Architectural Tradeoffs "I want a fast car with good mileage." "I want a scaleable database with low cost that runs well on the 1,000 CPUs in our data center." M D Kelly-McCreary & Associates, LLC 15
  • 16. NoSQL on Google Trends ! M D 16 Kelly-McCreary & Associates, LLC
  • 17. Recent History • The term NoSQL became re-popularized around 2009 • Used for conferences of advocates of non- relational databases • Became a contagious idea "meme" • First of many "NoSQL meetups" in San Francisco organized by Jon Oskarsson • Conversion from "No SQL" to "Not Only SQL" in recent year M D 17 Kelly-McCreary & Associates, LLC
  • 18. NoSQL and Web 2.0 Startups • Many web 2.0 startups did not use Oracle or MySQL • They built their own data stores influenced by Amazon’s Dynamo and Google’s BigTable in order to store and process huge amounts of data • In the social community or cloud computing applications, most of these data stores became OpenSource software M D 18 Kelly-McCreary & Associates, LLC
  • 19. Google MapReduce • 2004 paper that had huge impact of functional programming in the entire community • Copied by many organizations, including Yahoo M D 19 Copyright Kelly-McCreary & Associates, LLC
  • 20. Google Bigtable Paper • 2006 paper that gave focus to scaleable databases • designed to reliably scale to petabytes of data and thousands of machines M D 20 Copyright Kelly-McCreary & Associates, LLC
  • 21. Amazon's Dynamo Paper • Werner Vogels • CTO - Amazon.com • October 2, 2007 • Used to power Amazon's S3 service • One of the most influential papers in the NoSQL movement • Service in 2012 Giuseppe DeCandia, Deniz Hastorun, Madan Jampani, Gunavardhan Kakulapati, Avinash Lakshman, Alex Pilchin, Swami Sivasubramanian, Peter Vosshall and Werner Vogels, “Dynamo: Amazon's Highly Available Key-Value Store”, in the Proceedings of the 21st ACM Symposium on Operating Systems Principles, Stevenson, WA, October 2007. M D 21 Copyright Kelly-McCreary & Associates, LLC
  • 22. NoSQL "Meetups" “NoSQLers came to share how they had overthrown the tyranny of slow, expensive relational databases in favor of more efficient and cheaper ways of managing data.” Computerworld magazine, July 1st, 2009 M D 22 Kelly-McCreary & Associates, LLC
  • 23. Key Motivators • Licensing RDBMS on multiple CPUs • The Thee "V"s – Velocity – lots of data arriving fast – Volume – web-scale BigData – Variability – many exceptions • Desire to escape rigid schema design • Avoidance of complex Object-Relational Mapping (the "Vietnam" of computer science) M D 23 Kelly-McCreary & Associates, LLC
  • 24. Many Processes Today Are Driven By… The constraints of yesterday… Challenge: Ask ourselves the question… Do our current method of solving problems with tabular data… Reflect the storage of the 1950s… Or our actual business requirements? What structures best solve the actual business problem? M 24 D Copyright 2008 Dan McCreary & Associates
  • 25. No-Shredding! My Data • Relational databases take a single hierarchical document and shred it into many pieces so it will fit in tabular structures • Document stores prevent this shredding M 25 D Copyright 2008 Dan McCreary & Associates
  • 26. Is Shredding Really Necessary? • Every time you take hierarchical data and put it into a traditional database you have to put repeating groups in separate tables and use SQL “joins” to reassemble the data M 26 D Copyright 2008 Dan McCreary & Associates
  • 27. Object Relational Mapping T1 T2 T4 T3 Relational Web Browser Object Middle Database Tier • T1 – HTML into Objects • T2 –Objects into SQL Tables • T3 – Tables into Objects • T4 – Objects into HTML M D 27 Kelly-McCreary & Associates, LLC
  • 28. "The Vietnam of Applications" • Object-relational mapping has become one of the most complex components of building applications today • A "Quagmire" where many projects get lost • Many "heroic efforts" have been made to solve the problem: – Hibernate – Ruby on Rails • But sometimes the way to avoid complexity is to keep your architecture very simple M D 28 Copyright Kelly-McCreary & Associates, LLC
  • 29. Document Stores Need No Translation Document Document Application Layer Database • Documents in the database (JSON or XML) • Documents in the application • No object middle tier • No "shredding" • No reassembly • Simple! M 29 D Copyright 2010 Dan McCreary & Associates
  • 30. The XML "Full Stack" XForms REST-Interfaces Web Browser XML database • XML lives in the web browser (XForms) • REST interfaces • XML in the database (Native XML, XQuery) • XRX Web Application Architecture • No translation! M 30 D Copyright 2010 Dan McCreary & Associates
  • 31. "Schema Free" • Systems that automatically determine how to index data as the data is loaded into the database • No a priori knowledge of data structure • No need for up-front logical data modeling – …but some modeling is still critical • Adding new data elements or changing data elements is not disruptive • Searching millions of records still has sub- second response time M 31 D Copyright 2010 Dan McCreary & Associates
  • 32. Monoculture and Mono-architecture M Image Source: Wikipedia 32 D Copyright 2010 Dan McCreary & Associates
  • 33. Eric Evans “The whole point of seeking alternatives [to RDBMS systems] is that you need to solve a problem that relational databases are a bad fit for.” Eric Evans Rackspace M D 33 Kelly-McCreary & Associates, LLC
  • 34. Evolution of Ideas in OpenSource New Database Ideas New Products Proprietary Software Product A Schema-free Product B OpenSource Auto-sharding MapReduce Product B Cloud Computing • How quickly can new ideas be recombined into new database products? • OpenSource software has proved to be the most efficient way to quickly recombine new ideas into new products M D 34 Copyright Kelly-McCreary & Associates, LLC
  • 35. Storage Architectural Patterns Tables Trees Stars Triples M D 35 Copyright 2010 Dan McCreary & Associates
  • 36. Finding the Right Match Schema-Free Standards Compliant Mature Query Language Use CMU's Architectural Tradeoff and Modeling (ATAM) Process M 36 D Copyright 2010 Dan McCreary & Associates
  • 37. Avoidance of Unneeded Complexity • Relational databases provide a variety of features to ALWAYS support strict data consistency • Rich feature set and the ACID properties implemented by RDBMSs might be more than necessary for particular applications and use cases M D 37 Kelly-McCreary & Associates, LLC
  • 38. "Once Size Fits…" "One Size Does Not Fit All" James Hamilton Nov. 3rd, 2009 http://perspectives.mvdirona.com/CommentView,guid,afe46691-a293-4f9a-8900-5688a597726a.aspx M D 38 Kelly-McCreary & Associates, LLC
  • 39. Different Thinking Sequential Processing Parallel Processing • The output of any step can be used in the • Each loop of XQuery FLOWR next step statements are independent • State must be carefully thread (no side-effects) managed M D Kelly-McCreary & Associates, LLC 39
  • 40. Cloud Computing • High scalability – Especially in the horizontal direction (multi CPUs) • Low administration overhead – Simple web page administration M D 40 Kelly-McCreary & Associates, LLC
  • 41. Databases work well in the cloud • Data warehousing specific databases for batch data processing and map/reduce operations • Simple, scalable and fast key/value-stores • Databases containing a richer feature set than key/value-stores fitting the gap with traditional • RDBMS while offering good performance and scalability properties (such as document databases). M D 41 Kelly-McCreary & Associates, LLC
  • 42. Auto-Sharding • When one database gets almost full it tells a "coordinator" system and the data automatically gets migrated to other systems • Systems have "Partition Tolerance" Warning Disk Full! Before: one disk 90% full: Time to "Shard" After: two disks 45% full: M D 42 Copyright Kelly-McCreary & Associates, LLC
  • 43. Brewer's CAP Theorem Consistency You can not have all three so pick two! Availability Partition Tolerance M D Kelly-McCreary & Associates, LLC 43
  • 44. Migrating to Partition Tolarance Consistency CA CP RDBMS Availability AP Partition Tolerance M D 44 Copyright Kelly-McCreary & Associates, LLC
  • 45. Scale Up vs. Scale Out Scale Up Scale Out • Make a single CPU as fast as • Make Many CPUs work possible together • Increase clock speed • Learn how to divide your • Add RAM problems into independent • Make disk I/O go faster threads M D Copyright Kelly-McCreary & Associates, LLC 45
  • 46. Sample of NO-SQL Systems Document Stores Key-Value Stores Memcache XML Column Stores Graph Stores Object Stores M 46 D Copyright 2010 Dan McCreary & Associates
  • 47. If you can't beat them… M D Kelly-McCreary & Associates, LLC 47
  • 48. Key Value Stores Key Value • A table with two columns and a simple interface – Add a key-value – For this key, give me the value – Delete a key • Blazingly fast and easy to scale M D 48 Copyright Kelly-McCreary & Associates, LLC
  • 49. Types of Key-Value Stores • Eventually‐consistent Key‐Value store • Hierarchical Key-Value Stores • Key-Value Stores In RAM • Key Value Stores on Disk • Ordered Key-Value Stores M D 49 Copyright Kelly-McCreary & Associates, LLC
  • 50. Cassendra • Apache open source project • Originally developed by Facebook • Designed for highly distributed high- reliable systems • No single point of failure • Column-family data model http://www.cs.cornell.edu/projects/ladis2009/papers/lakshman-ladis2009.pdf M D 50 Copyright Kelly-McCreary & Associates, LLC
  • 51. MongoDB • Open Source License • Document/Collection centric • Sharding built-in, automatic • Stores data in JSON format • Query language is JSON • Can be 10x faster than MySQL • Many languages (C++, JavaScript, Java, Perl, Python etc.) M D 51 Copyright Kelly-McCreary & Associates, LLC
  • 52. Hadoop/Hbase • Open source implementation of MapReduce algorithm written in Java • Initially created by Yahoo – 300 person-years development • Column-oriented data store similar to Google's BigTable • Java interface • H-Base designed specifically to work with Hadoop and the Hadoop file system M D 52 Copyright Kelly-McCreary & Associates, LLC
  • 53. CouchDB • Commercial Company • Apache Project • Written in ERLANG • RESTful JSON API • Distributed, featuring robust, incremental replication with bi-directional conflict detection and management M D 53 Copyright Kelly-McCreary & Associates, LLC
  • 54. Memcached • Free & open source in-memory caching system • Designed to speeding up dynamic web applications by alleviating database load • RAM resident key-value store for small chunks of arbitrary data (strings, objects) from results of database calls, API calls, or page rendering • Simple interface • Designed for quick deployment, ease of development • APIs in many languages M D 54 Copyright Kelly-McCreary & Associates, LLC
  • 55. MarkLogic • Native XML database designed to used by Petabyte data stores • ACID compliant • Role-based access control • Heavy use by federal agencies, document publishers and "high-variability" data • Arguably the most successful NoSQL company M D 55 Copyright Kelly-McCreary & Associates, LLC
  • 56. eXist • OpenSource native XML database • Strong support for XQuery and XQuery extensions • Heavily used by the Text Encoding Initiative (TEI) community and XRX/XForms communities • Ideal for metadata management • Integrated Lucene search and structured search M D 56 Copyright Kelly-McCreary & Associates, LLC
  • 57. Riak • Community and Commercial licenses • A "Dynamo-inspired" database • Written in ERLANG • Query JSON or ERLANG M D 57 Copyright Kelly-McCreary & Associates, LLC
  • 58. Hypertable • Open Source • Closely modeled after Google's Bigtable project • High performance distributed data storage system • Designed to support applications requiring maximum performance, scalability, and reliability • Hypertable Query Language (HQL) that is syntactically similar to SQL M D 58 Copyright Kelly-McCreary & Associates, LLC
  • 59. Selecting a NoSQL Pilot Project • The "Goldilocks Pilot Project Strategy" • Not to big, not to small, just the right size • Duration • Sponsorship • Importance • Skills • Mentorship M 59 D Copyright 2010 Dan McCreary & Associates
  • 60. The Future of the NoSQL Movement Growth Diversity • Will data sets continue to grow at exponential rates? • Will new system options become more diverse? • Will new markets have different demands? • Will some ideas be "absorbed" into existing RDBMS vendors products? • Will the NoSQL community continue to be the place where new database ideas and products are incubated? • Will the job of doing high-quality architectural tradeoffs analysis M become easier? D 60 Copyright Kelly-McCreary & Associates, LLC
  • 61. Using the Wrong Architecture Start Finish Credit: Isaac Homelund – MN Office of the Revisor M D
  • 62. Using the Right Architecture Finish Start Find ways to remove barriers to empowering the non programmers on your team. M D
  • 63. Questions Dan McCreary President, Kelly-McCreary & Associates dan@danmccreary.com M D 63 Kelly-McCreary & Associates, LLC