SlideShare uma empresa Scribd logo
1 de 19
Baixar para ler offline
Data services in a Service Oriented
Architecture

 Syed M Shaaf
 Red Hat


 @sshaaf
 20th April 2012
Problem: Data Challenges

                 Tremendous value in existing information assets, but...

              Time consuming and costly to implement new applications
                           that leverage this information



Challenges
      Different physical structure
      Different terminology and meaning
                                                                            Data Gap
      Different interfaces
      May need to federate/integrate
      May be “locked in” to database
      Must ensure performance
                                                             Operational    Packaged
      Maintain/Improve security            Data Warehouse
                                                             Data Stores   Applications
EDS – Open source solution



EDS is a data virtualization
 system that allows applications
 to use data from multiple,
 heterogenous data stores.
What is EDS?
●   EDS is an open source solution for scalable
    information integration through a relational
    abstraction.
●   EDS focuses on:
    ●   Real-time integration performance
    ●   Feature-full integration via SQL/Procedures/XQuery
    ●   Providing JDBC access
●   EDS enables:
    ●   Data Services / SOA
    ●   Legacy / JPA integration
Turn the data you have into the
     information you need
Architecture
Architecture
Query Plan

●   Parsing
●   Resolving
●   Validating
●   Rewriting
●   Logical plan optimization -
●   Processing plan conversion -
Query Plan
 SELECT e.title, e.lastname FROM Employees AS e JOIN
 Departments AS d ON e.dept_id = d.dept_id WHERE year(e.birthday) >= 1970
 AND d.dept_name = 'Engineering'
Optimizations




  Access patterns for handling criteria
  and pushdown leverages the source
  database
Optimizations




  Making use of Dependant joins and
  optional joins
Cursors and batching



  By default all results are cursored and
  all results are in a batch. Set the
  processor batch size, connector
  batch size or fetchsize via jdbc
Buffer Management




  Buffers are stored in memory and/or
  on disk
Processing



 Joins are done by default as merge-
 sort and sorting algorithm is multi
 pass merge-sort
Handling Load
●   Memory Usage – the BufferManager acts as a memory
    manager for batches (with passivation) to ensure that
    memory will not be exhausted.
●   Non-blocking source queries – rather than waiting for
    source query results processor thread detach from the
    plan and pick up a plan that has work.
●   Time slicing – plans produce batches for a time slice
    before re-queuing and allowing their thread to do other
    work (preemptive control only between batches)
●   Caching – ResultSets, processing plans, internal
    materialized views, etc.
More on Caching

●   See the caching guide and
    http://community.jboss.org/wiki/AHowToGuideForMaterializationcachingViewsInTeiid

●   Admins can primarily control prepared plan and result
    set caching. Procedure plans are also automatically
    cached in the plan cache.
●   Scoping of cache entries is determined automatically
●   Internal materialization leverages EDS temp tables,
    which are in turn backed by the buffer manager.
●   Canonical value caching is dynamically used to cut
    down on the memory profile – can be disabled.
●   Internal caching of metadata at various levels.
Transactions
●   Three scopes
    ●   Global (through XAResource)
    ●   Local (autocommit = false)
    ●   Command (autocommit = true)
●   All scopes are handled by JBoss Transactions
    JTA
●   Command scope behavior is handled through
    txnAutoWrap={ON|OFF|DETECT}
●   Isolation level is set on a per connector basis.
Demo
Where to find it

●   http://access.redhat.com

Mais conteúdo relacionado

Mais procurados

BTech_IT_Oracle_RAC_DBA_with_3.7_Years_Exp
BTech_IT_Oracle_RAC_DBA_with_3.7_Years_ExpBTech_IT_Oracle_RAC_DBA_with_3.7_Years_Exp
BTech_IT_Oracle_RAC_DBA_with_3.7_Years_Exp
BHIMASANKARA GADAMSETTI
 
zahidCvFinal(Updated Jan 17)-2
zahidCvFinal(Updated Jan 17)-2zahidCvFinal(Updated Jan 17)-2
zahidCvFinal(Updated Jan 17)-2
Zahid Ayub
 
Avanthi Guduru ( Oracle DBA) Resume
Avanthi Guduru ( Oracle DBA) ResumeAvanthi Guduru ( Oracle DBA) Resume
Avanthi Guduru ( Oracle DBA) Resume
Avanthi Guduru
 
Karthikeyan_Oracle_dba
Karthikeyan_Oracle_dbaKarthikeyan_Oracle_dba
Karthikeyan_Oracle_dba
Karthikeyan P
 
Himanshu_Oracle_DBA_Resume
Himanshu_Oracle_DBA_ResumeHimanshu_Oracle_DBA_Resume
Himanshu_Oracle_DBA_Resume
Himanshu Jain
 

Mais procurados (20)

Rachita Gupta
Rachita GuptaRachita Gupta
Rachita Gupta
 
Introduction to nosql
Introduction to nosqlIntroduction to nosql
Introduction to nosql
 
Big data and polyglot solutions
Big data and polyglot solutionsBig data and polyglot solutions
Big data and polyglot solutions
 
BTech_IT_Oracle_RAC_DBA_with_3.7_Years_Exp
BTech_IT_Oracle_RAC_DBA_with_3.7_Years_ExpBTech_IT_Oracle_RAC_DBA_with_3.7_Years_Exp
BTech_IT_Oracle_RAC_DBA_with_3.7_Years_Exp
 
zahidCvFinal(Updated Jan 17)-2
zahidCvFinal(Updated Jan 17)-2zahidCvFinal(Updated Jan 17)-2
zahidCvFinal(Updated Jan 17)-2
 
Physical design relational_db_devanshu
Physical design relational_db_devanshuPhysical design relational_db_devanshu
Physical design relational_db_devanshu
 
Avanthi Guduru ( Oracle DBA) Resume
Avanthi Guduru ( Oracle DBA) ResumeAvanthi Guduru ( Oracle DBA) Resume
Avanthi Guduru ( Oracle DBA) Resume
 
Parameter substitution in Aginity Workbench
Parameter substitution in Aginity WorkbenchParameter substitution in Aginity Workbench
Parameter substitution in Aginity Workbench
 
Performance Benchmarking of Key-Value Store NoSQL Databases
Performance Benchmarking of Key-Value Store NoSQL Databases Performance Benchmarking of Key-Value Store NoSQL Databases
Performance Benchmarking of Key-Value Store NoSQL Databases
 
Karthikeyan_Oracle_dba
Karthikeyan_Oracle_dbaKarthikeyan_Oracle_dba
Karthikeyan_Oracle_dba
 
Mohammed Gulam
Mohammed GulamMohammed Gulam
Mohammed Gulam
 
Oracle dba
Oracle dbaOracle dba
Oracle dba
 
Satheesh Oracle DBA Resume
Satheesh Oracle DBA ResumeSatheesh Oracle DBA Resume
Satheesh Oracle DBA Resume
 
Himanshu_Oracle_DBA_Resume
Himanshu_Oracle_DBA_ResumeHimanshu_Oracle_DBA_Resume
Himanshu_Oracle_DBA_Resume
 
Mysql certification in chennai
Mysql certification in chennaiMysql certification in chennai
Mysql certification in chennai
 
Managing user Online Training in IBM Netezza DBA Development by www.etraining...
Managing user Online Training in IBM Netezza DBA Development by www.etraining...Managing user Online Training in IBM Netezza DBA Development by www.etraining...
Managing user Online Training in IBM Netezza DBA Development by www.etraining...
 
ORACLE DBA RESUME
ORACLE DBA RESUMEORACLE DBA RESUME
ORACLE DBA RESUME
 
Prabhu_dba
Prabhu_dbaPrabhu_dba
Prabhu_dba
 
netezza-pdf
netezza-pdfnetezza-pdf
netezza-pdf
 
Microsoft azure database offerings
Microsoft azure database offeringsMicrosoft azure database offerings
Microsoft azure database offerings
 

Destaque

Redhat rhev 31-update by syedmshaaf
Redhat rhev 31-update by syedmshaafRedhat rhev 31-update by syedmshaaf
Redhat rhev 31-update by syedmshaaf
Syed Shaaf
 
Doc103 Red Hat Comparison Whitepaper
Doc103 Red Hat Comparison WhitepaperDoc103 Red Hat Comparison Whitepaper
Doc103 Red Hat Comparison Whitepaper
mikhail.mikheev
 
Red hat enterprise_virtualization_load
Red hat enterprise_virtualization_loadRed hat enterprise_virtualization_load
Red hat enterprise_virtualization_load
silviucojocaru
 
NFV Infrastructure Manager with High Performance Software Switch Lagopus
NFV Infrastructure Manager with High Performance Software Switch Lagopus NFV Infrastructure Manager with High Performance Software Switch Lagopus
NFV Infrastructure Manager with High Performance Software Switch Lagopus
Hirofumi Ichihara
 
Symantec rhev 31-update by syed m shaaf
Symantec rhev 31-update by syed m shaafSymantec rhev 31-update by syed m shaaf
Symantec rhev 31-update by syed m shaaf
Syed Shaaf
 

Destaque (20)

Redhat rhev 31-update by syedmshaaf
Redhat rhev 31-update by syedmshaafRedhat rhev 31-update by syedmshaaf
Redhat rhev 31-update by syedmshaaf
 
Cc presentation
Cc presentationCc presentation
Cc presentation
 
Doc103 Red Hat Comparison Whitepaper
Doc103 Red Hat Comparison WhitepaperDoc103 Red Hat Comparison Whitepaper
Doc103 Red Hat Comparison Whitepaper
 
RHEVM - Snapshots
RHEVM - SnapshotsRHEVM - Snapshots
RHEVM - Snapshots
 
Rhev overview-doc
Rhev overview-docRhev overview-doc
Rhev overview-doc
 
RHEVM - Live Storage Migration
RHEVM - Live Storage MigrationRHEVM - Live Storage Migration
RHEVM - Live Storage Migration
 
What is the KISS principle
What is the KISS principleWhat is the KISS principle
What is the KISS principle
 
RedHat Virtualization Manager
RedHat Virtualization ManagerRedHat Virtualization Manager
RedHat Virtualization Manager
 
Optimising nfv service chains on open stack using docker
Optimising nfv service chains on open stack using dockerOptimising nfv service chains on open stack using docker
Optimising nfv service chains on open stack using docker
 
Getting to know the Grid - Goto Aarhus 2013
Getting to know the Grid - Goto Aarhus 2013Getting to know the Grid - Goto Aarhus 2013
Getting to know the Grid - Goto Aarhus 2013
 
Gluster Storage
Gluster StorageGluster Storage
Gluster Storage
 
SDN the network becomes the application
SDN the network becomes the applicationSDN the network becomes the application
SDN the network becomes the application
 
Red Hat Enterprise Linux and NFS by syedmshaaf
Red Hat Enterprise Linux and NFS by syedmshaafRed Hat Enterprise Linux and NFS by syedmshaaf
Red Hat Enterprise Linux and NFS by syedmshaaf
 
Red hat enterprise_virtualization_load
Red hat enterprise_virtualization_loadRed hat enterprise_virtualization_load
Red hat enterprise_virtualization_load
 
NFV Infrastructure Manager with High Performance Software Switch Lagopus
NFV Infrastructure Manager with High Performance Software Switch Lagopus NFV Infrastructure Manager with High Performance Software Switch Lagopus
NFV Infrastructure Manager with High Performance Software Switch Lagopus
 
Symantec rhev 31-update by syed m shaaf
Symantec rhev 31-update by syed m shaafSymantec rhev 31-update by syed m shaaf
Symantec rhev 31-update by syed m shaaf
 
Dev Conf 2017 - Meeting nfv networking requirements
Dev Conf 2017 - Meeting nfv networking requirementsDev Conf 2017 - Meeting nfv networking requirements
Dev Conf 2017 - Meeting nfv networking requirements
 
Containers - What are they and Atomic
Containers - What are they and AtomicContainers - What are they and Atomic
Containers - What are they and Atomic
 
Developing microservices with wildfly swarm and deploying on openshift
Developing microservices with wildfly swarm and deploying on openshiftDeveloping microservices with wildfly swarm and deploying on openshift
Developing microservices with wildfly swarm and deploying on openshift
 
2011-12-08 Red Hat Enterprise Virtualization for Desktops (RHEV VDI) with Cis...
2011-12-08 Red Hat Enterprise Virtualization for Desktops (RHEV VDI) with Cis...2011-12-08 Red Hat Enterprise Virtualization for Desktops (RHEV VDI) with Cis...
2011-12-08 Red Hat Enterprise Virtualization for Desktops (RHEV VDI) with Cis...
 

Semelhante a Mow2012 data services

An Active and Hybrid Storage System for Data-intensive Applications
An Active and Hybrid Storage System for Data-intensive ApplicationsAn Active and Hybrid Storage System for Data-intensive Applications
An Active and Hybrid Storage System for Data-intensive Applications
Xiao Qin
 
1. Briefly describe the major components of a data warehouse archi.docx
1. Briefly describe the major components of a data warehouse archi.docx1. Briefly describe the major components of a data warehouse archi.docx
1. Briefly describe the major components of a data warehouse archi.docx
monicafrancis71118
 
Meta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinarMeta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinar
Kognitio
 

Semelhante a Mow2012 data services (20)

EDB Database Servers and Tools
EDB Database Servers and Tools EDB Database Servers and Tools
EDB Database Servers and Tools
 
Cloud Computing Ambiance using Secluded Access Control Method
Cloud Computing Ambiance using Secluded Access Control MethodCloud Computing Ambiance using Secluded Access Control Method
Cloud Computing Ambiance using Secluded Access Control Method
 
Enterprise-class security with PostgreSQL - 2
Enterprise-class security with PostgreSQL - 2Enterprise-class security with PostgreSQL - 2
Enterprise-class security with PostgreSQL - 2
 
IMC Summit 2016 Breakout - Pandurang Naik - Demystifying In-Memory Data Grid,...
IMC Summit 2016 Breakout - Pandurang Naik - Demystifying In-Memory Data Grid,...IMC Summit 2016 Breakout - Pandurang Naik - Demystifying In-Memory Data Grid,...
IMC Summit 2016 Breakout - Pandurang Naik - Demystifying In-Memory Data Grid,...
 
Climbing the beanstalk
Climbing the beanstalkClimbing the beanstalk
Climbing the beanstalk
 
Oracle database 12c introduction- Satyendra Pasalapudi
Oracle database 12c introduction- Satyendra PasalapudiOracle database 12c introduction- Satyendra Pasalapudi
Oracle database 12c introduction- Satyendra Pasalapudi
 
An Active and Hybrid Storage System for Data-intensive Applications
An Active and Hybrid Storage System for Data-intensive ApplicationsAn Active and Hybrid Storage System for Data-intensive Applications
An Active and Hybrid Storage System for Data-intensive Applications
 
J2EE Batch Processing
J2EE Batch ProcessingJ2EE Batch Processing
J2EE Batch Processing
 
Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010
Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010
Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010
 
Hadoop and Voldemort @ LinkedIn
Hadoop and Voldemort @ LinkedInHadoop and Voldemort @ LinkedIn
Hadoop and Voldemort @ LinkedIn
 
Resume..
Resume..Resume..
Resume..
 
1. Briefly describe the major components of a data warehouse archi.docx
1. Briefly describe the major components of a data warehouse archi.docx1. Briefly describe the major components of a data warehouse archi.docx
1. Briefly describe the major components of a data warehouse archi.docx
 
Meta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinarMeta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinar
 
SQL PASS Taiwan 七月份聚會-1
SQL PASS Taiwan 七月份聚會-1SQL PASS Taiwan 七月份聚會-1
SQL PASS Taiwan 七月份聚會-1
 
Iod session 3423 analytics patterns of expertise, the fast path to amazing ...
Iod session 3423   analytics patterns of expertise, the fast path to amazing ...Iod session 3423   analytics patterns of expertise, the fast path to amazing ...
Iod session 3423 analytics patterns of expertise, the fast path to amazing ...
 
Resume
ResumeResume
Resume
 
Greenplum Architecture
Greenplum ArchitectureGreenplum Architecture
Greenplum Architecture
 
SAMADMohammad
SAMADMohammadSAMADMohammad
SAMADMohammad
 
Apache Tez : Accelerating Hadoop Query Processing
Apache Tez : Accelerating Hadoop Query ProcessingApache Tez : Accelerating Hadoop Query Processing
Apache Tez : Accelerating Hadoop Query Processing
 
The Six pillars for Building big data analytics ecosystems
The Six pillars for Building big data analytics ecosystemsThe Six pillars for Building big data analytics ecosystems
The Six pillars for Building big data analytics ecosystems
 

Mais de Syed Shaaf (6)

Build and manage private and hybrid cloud
Build and manage private and hybrid cloudBuild and manage private and hybrid cloud
Build and manage private and hybrid cloud
 
Red Hat JBoss Technical update
Red Hat JBoss Technical updateRed Hat JBoss Technical update
Red Hat JBoss Technical update
 
OpenShift and next generation application development
OpenShift and next generation application developmentOpenShift and next generation application development
OpenShift and next generation application development
 
Unix to Red Hat Enterprise Linux
Unix to Red Hat Enterprise Linux Unix to Red Hat Enterprise Linux
Unix to Red Hat Enterprise Linux
 
Conduct JBoss EAP 6 seminar
Conduct JBoss EAP 6 seminarConduct JBoss EAP 6 seminar
Conduct JBoss EAP 6 seminar
 
Technical update KVM and Red Hat Enterprise Virtualization (RHEV) by syedmshaaf
Technical update KVM and Red Hat Enterprise Virtualization (RHEV) by syedmshaafTechnical update KVM and Red Hat Enterprise Virtualization (RHEV) by syedmshaaf
Technical update KVM and Red Hat Enterprise Virtualization (RHEV) by syedmshaaf
 

Mow2012 data services

  • 1. Data services in a Service Oriented Architecture Syed M Shaaf Red Hat @sshaaf 20th April 2012
  • 2. Problem: Data Challenges Tremendous value in existing information assets, but... Time consuming and costly to implement new applications that leverage this information Challenges  Different physical structure  Different terminology and meaning Data Gap  Different interfaces  May need to federate/integrate  May be “locked in” to database  Must ensure performance Operational Packaged  Maintain/Improve security Data Warehouse Data Stores Applications
  • 3. EDS – Open source solution EDS is a data virtualization system that allows applications to use data from multiple, heterogenous data stores.
  • 4. What is EDS? ● EDS is an open source solution for scalable information integration through a relational abstraction. ● EDS focuses on: ● Real-time integration performance ● Feature-full integration via SQL/Procedures/XQuery ● Providing JDBC access ● EDS enables: ● Data Services / SOA ● Legacy / JPA integration
  • 5. Turn the data you have into the information you need
  • 8. Query Plan ● Parsing ● Resolving ● Validating ● Rewriting ● Logical plan optimization - ● Processing plan conversion -
  • 9. Query Plan SELECT e.title, e.lastname FROM Employees AS e JOIN Departments AS d ON e.dept_id = d.dept_id WHERE year(e.birthday) >= 1970 AND d.dept_name = 'Engineering'
  • 10. Optimizations Access patterns for handling criteria and pushdown leverages the source database
  • 11. Optimizations Making use of Dependant joins and optional joins
  • 12. Cursors and batching By default all results are cursored and all results are in a batch. Set the processor batch size, connector batch size or fetchsize via jdbc
  • 13. Buffer Management Buffers are stored in memory and/or on disk
  • 14. Processing Joins are done by default as merge- sort and sorting algorithm is multi pass merge-sort
  • 15. Handling Load ● Memory Usage – the BufferManager acts as a memory manager for batches (with passivation) to ensure that memory will not be exhausted. ● Non-blocking source queries – rather than waiting for source query results processor thread detach from the plan and pick up a plan that has work. ● Time slicing – plans produce batches for a time slice before re-queuing and allowing their thread to do other work (preemptive control only between batches) ● Caching – ResultSets, processing plans, internal materialized views, etc.
  • 16. More on Caching ● See the caching guide and http://community.jboss.org/wiki/AHowToGuideForMaterializationcachingViewsInTeiid ● Admins can primarily control prepared plan and result set caching. Procedure plans are also automatically cached in the plan cache. ● Scoping of cache entries is determined automatically ● Internal materialization leverages EDS temp tables, which are in turn backed by the buffer manager. ● Canonical value caching is dynamically used to cut down on the memory profile – can be disabled. ● Internal caching of metadata at various levels.
  • 17. Transactions ● Three scopes ● Global (through XAResource) ● Local (autocommit = false) ● Command (autocommit = true) ● All scopes are handled by JBoss Transactions JTA ● Command scope behavior is handled through txnAutoWrap={ON|OFF|DETECT} ● Isolation level is set on a per connector basis.
  • 18. Demo
  • 19. Where to find it ● http://access.redhat.com