SlideShare uma empresa Scribd logo
1 de 175
LLeeccttuurree 33 
DDaattaa AArrcchhiitteeccttuurreess SSyynntthheessiiss 
DDaattaa PPrroocceessssiinngg 
DDaattaa MMiinniinngg 
MMooddeelliinngg --AAllggoorriitthhmm –– MMeetthhoodd 
 IInntteeggrraattiioonn iinnttoo tthhee IInnffoorrmmaattiioonn SSyysstteemm 
Conference - Study : Jean-Antoine Moreau (Engineer - Lecturer) 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP.
DDAATTAA SSCCIIEENNCCEE 
LLeessssoonn 33 
Conference - Study : Jean-Antoine Moreau (Engineer - Lecturer) 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP.
JAM 3 / 175 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
« Tell me and I forget. Teach me and I 
remember. Involve me and I learn. » 
Benjamin Franklin
JAM 4 / 175 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
• Volumes of data are: 
– more and more important; 
– distributed; 
– interconnected; 
– heterogeneous; 
– mobile.
JAM 5 / 175 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
• Data volumes are more difficult to: 
– integrate 
– analyze in real-time
JAM 6 / 175 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
These new requirements 
these new conditions 
involve 
new data models 
new methods of modeling
JAM 7 / 175 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
new data models 
new methods of modeling 
information systems architectures 
Large-Scale Distributed Data Intensive Information Systems.
JAM 8 / 175 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Simple modeling:
JAM 9 / 175 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Layers’s modeling
JAM 10 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Massively parallel architectures
JAM 11 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
diagram
JAM 12 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
data streaming 
streaming bulk data
JAM 13 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Basic diagram :
JAM 14 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
• Changeover / transition of DBMS 
(DATABASE MANAGEMENT SYSTEM) to 
NoSQL or Hybrid DBMS.
JAM 15 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
summary table
JAM 16 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Data Base use
JAM 17 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
• New data models: 
– tables; 
– key values; 
– documents; 
– graphs; 
– streams; 
• multiple sizes; 
• size (volume) changing in time; 
• different versions.
JAM 18 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Modeling includes : 
– in-depth of data; 
– the position in time; 
– release management.
JAM 19 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Modeling includes : 
– Real time applied to each object; 
• each attribute; 
• each relationship between objects; 
– Simple, composition, coupling, cardinality; 
• each relationship between process.
JAM 20 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Modeling of data distributed 
multi-site architectures 
and cloud
JAM 21 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
• Massively parallel architectures; 
• Massively parallel processing; 
• Massively parallel computing.
JAM 22 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
• Online Storage 
• Database Machine 
MapReduce 
Oracle 
…
JAM 23 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
• Using 
– In-Memory DataBase (IMDB); 
– Memory DataBase System (MMDB); 
– Memory Resident DataBase : 
• database management system that primarily relies 
on main memory for computer data storage.
JAM 24 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Using the In-Memory 
New form of modelisation
JAM 25 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Example of modelisation:
JAM 26 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
SSttrreeaammiinngg ddaattaa 
• How to analyze data streams in real time? 
• How to analyze process flows in real time? 
• How to use and cross these streams with the 
information system data?
JAM 27 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
• Data evolution : 
– Open data; 
– Interconnection of data: 
• social networks; 
• linked open data; 
– Big Data: 
• volume; 
• variety; 
• velocity; 
• truthfulness; 
• validity.
JAM 28 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
• Evolution data: 
– Internet: 
• Data distribution; 
• Heterogeneity of the data; 
• Data mobility. 
– Flows: 
• Continuous data stream; 
• Complex events.
DDAATTAA SSCCIIEENNCCEE -- LL33 
JAM 29 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
in-depth data 
time 
Location/ localization 
Data Stream 
Version xX 
Data Stream 
Version xY 
Data Stream 
Version xZ
JAM 30 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Computer Applications 
The issues 
• IT operations in real-time from distributed data; 
• Crossing and visualization of public and private data; 
• Complex Big Data analysis.
JAM 31 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Big Data 
Data Analytic
JAM 32 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Transactions distributed over the network 
data stream processing in real time 
Complex Event Processing (CEP)
JAM 33 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Technology 
• Flash memory; 
• HDD (Hard Disk Drive) - SSD (Solid-State Drive); 
• Multi-core processing; 
• Combination CPU (Central Processing Unit) / GPU (Graphics 
Processing Unit); 
• Prioritization of parallel computing; 
• Server virtualization (VM virtual Machines); 
• …
JAM 34 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Technology 
• Virtualization of disk storage: 
– NAS (Network Attached Storage) 
– SAN (Storage Area Network); 
• New architectures DBMS (database management system).
JAM 35 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
SAN 
Storage Area Network 
with a Cluster
JAM 36 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Databases 
• Adaptation: 
– patterns; 
– requests; 
– transactions; 
– views; 
– integrity constraints; 
– triggers; 
– managing indexes; 
– changing rules of consistency; 
– traceability rules.
JAM 37 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Databases 
ACID database models 
ACID : 
– AAtomicity; 
– CConsistency; 
– IIsolation; 
– DDurability.
JAM 38 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Data Modeling in a Hybrid NoSQL / Relational
JAM 39 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Data Model 
Data Modeling
JAM 40 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Data models 
Relational : 
– value domain; 
– link; 
– algebra and calculus; 
– concept of zero. 
SQL 2: 
– the data type; 
– insulation levels; 
– the portablité. 
SQL 3: 
– Nested tables; 
– Complex types; 
– Extensions (expand) objects.
JAM 41 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
New models: 
– key values; 
– tables; 
– documents; 
– graph; 
– stream.
JAM 42 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Access to the database 
• The object-relational impedance mismatch. 
• API (Application Programmable Interface): 
– Ado (Microsoft ActiveX Data Objects); 
– JDBC (Java DataBase Connectivity); 
– SQLJ (program can call the JDBC ); 
– JPA (Java Persistence API); 
– JDD (Java Decision Diagrams); 
• Using Language Integrated Query (example : Microsoft .NET 
Framework ).
JAM 43 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Data Analysis 
• Decision-making system and OLAP 
– multidimensional databases; 
• OLAP system (On-Line Analytical Processing); 
• OLTP (Online transaction processing) - class of software programs capable of 
supporting transaction-oriented applications; 
• TPC (TPC-H / TPC-E) 
– Transaction Processing Performance Council; 
• Simulates a Decision Support System or Business Intelligence database environment. 
– Multi-dimensional data.
JAM 44 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Data Analysis 
• Data cubes: 
– array of data; 
– multidimensional matrix.
JAM 45 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
array of data multidimensional matrix
JAM 46 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Business Intelligence 
Data Mining 
Extraction of knowledge from the data.
JAM 47 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
ddiissttrriibbuutteedd aarrcchhiitteeccttuurreess
JAM 48 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
• multi-tier architectures: 
– 2-Tiered architecture; 
– architecture technology to 3-tier architecture; 
– N-tier architecture.
JAM 49 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
2 Tiered :
JAM 50 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Three-Tier Architecture
JAM 51 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Three-Tier Architecture
JAM 52 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
« We know what we are, but know not 
what we may be. » 
William Shakespeare 
A short break, 
You can ask your questions.
JAM 53 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Architectures 
• RIA: Rich Internet application; 
• RDA: Rich Desktop Application.
JAM 54 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
RIA: Rich Internet application
JAM 55 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
client network server 
thick 
thin 
rich 
application
JAM 56 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
• Using : 
– HTML 5 (HyperText Mark-Up Language); 
– CSS 3 (Cascading Style Sheets); 
– Java Script; 
– Ajax (Asynchronous JavaScript and XML).
JAM 57 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Coupling Server / Application 
Data server
JAM 58 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Data Protection : 
– encryption; 
– discretionary access control; 
– control of multi-level access.
JAM 59 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Service-Oriented Architecture 
• Web services 
method of communication between two electronic devices over a network; 
• WSDL 
Web Services Description Language; 
• UDDI 
Universal Description, Discovery and Integration; 
• SOAP 
Simple Object Access protocol;
JAM 60 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
• SOA (Service-Oriented Architecture), deployment of distributed 
services on ESB (Enterprise Service Bus); 
• EAI functions (Enterprise Application Integration – Workflow Process). 
• WOA (Web Oriented Architecture) 
– use technologies : 
• POX (Plain Old XML); 
• REST (Representational State Transfer) .
JAM 61 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
• Massively distributed architectures; 
• Massively distributed systems; 
• Massively distributed computing.
JAM 62 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
• Cluster : 
– Parallelism; 
• Grid : 
– Virtual Infrastructure; 
– Parallelism and Distribution; 
• P2P : 
– Distribution;
JAM 63 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Diagram 
Architecture Master/Slaves
JAM 64 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
• and the cclloouudd: 
 parallelism of computer centers distributed across the 
planet 
 GGlloobbaalliizziinngg tthhee tteecchhnniiccaall aarrcchhiitteeccttuurreess .
JAM 65 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Example : 
Cloud architecture 
including API to connect 
on a social network.
JAM 66 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Management of the distributed data 
• Integration of heterogeneous data; 
• Unifying the distributed data; 
• DBMS transaction (database management system) distributed.
JAM 67 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Integration of heterogeneous data 
• Federation of data sources; 
• Mediator & Adapter / Architectures; 
• Integration of schema.
JAM 68 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Integration of heterogeneous data 
• Data models for the integration : 
– Relational model; 
– Object; 
– XML; 
• data model that's based on an XML file 
• the XML Data Model makes it easier to integrate XFDL forms with 
other applications. 
– XFDL : Extensible Forms Description Language is a class of the 
Extensible Markup Language (XML) originally specified in World 
Wide Web Consortium (W3C) ; 
– key-value model.
JAM 69 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Integration of heterogeneous data 
Visualization of the integrated data : 
– Dataviz (software) ; 
– Dashboards; 
– …
JAM 70 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Unifying distributed data 
• Query optimization processing; 
• Intermediate results of integration; 
• Products (some examples) : 
– Google Fusion Tables (or simply Fusion Tables) is a web service provided by Google 
for data management; 
– IBM InfoSphere Information Server for Data Integration ; 
– Oracle Data Integrator (data integration platform); 
– SAP BusinessObjects (Data Integrator and Data Quality Management); 
– Talend for Data Integration (Talend Open Studio for Data Integration).
JAM 71 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Distributed DBMS 
Transaction Management in Distributed Database.
JAM 72 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
• Transaction Management in Distributed Database: 
– Two-Phase Commit Protocol (2PC) 
• type of atomic commitment protocol (ACP); 
– Fault tolerance; 
– Scalability; 
• Data replication: 
– consistency of copies; 
– propagation of updates; 
– adjusting the consistency rules of the databases; 
ORACLE, IBM DB2, Microsoft SQL server, SAP sybase, PostgreSQL and Postgres-XC.
JAM 73 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
• Parallel data processing : 
– Massive parallelism of data; 
• massively data-parallel applications on GPU clusters; 
– (GPU Graphics Processing Unit); 
• data-parallel ... 
– Database machine; 
– Also the basic techniques.
JAM 74 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Massive parallelism of data 
GGooaallss
JAM 75 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
• Goals : 
– Linear Speed up; 
– Linear Scale up; 
– Scale out; 
– Elasticity; 
– Adaptability.
JAM 76 / 
Shared-Memory 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Multiprocessor Architectures 
SMP 
shared memory multiprocessor 
NUMA 
Non Uniform Memory Architecture
JAM 77 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
SSMMPP 
shared memory multiprocessor
JAM 78 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
NNUUMMAA 
Non Uniform Memory Architecture
JAM 79 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Multiprocessor Architectures 
• Cluster shared-disk; 
• Cluster shared-computing; 
• …
JAM 80 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Example 1 :
JAM 81 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Example 2:
JAM 82 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
using servers
JAM 83 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
tree – organization - cluster
JAM 84 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
• Types of data parallelism: 
– Inter-application; 
– Application; 
– Inter-operation; 
– Pipeline.
JAM 85 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Example : Data Processing
JAM 86 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Example : Signal Processing
JAM 87 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Parallel data processing 
basic techniques
JAM 88 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Data storage 
Partitioning and placement strategies in database machines 
Partitioning - Sharing replication indexing
JAM 89 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
performances 
Parallelism of applications Balance of charges Transactions
JAM 90 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Fall-over 
management 
High availability 
Savepoints
JAM 91 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
DDaattaabbaassee MMaacchhiinnee 
Products : 
• IBM DB2 pureScale; 
• SQL Server 2012 Parallel Data Warehouse; 
• ORACLE RAC (Real Application Clusters) et Exadata; 
• SAP (sysbase); 
• Teradata Aster Data; 
• MySQL cluster (write-scalable, real-time, ACID-compliant transactional 
database ); 
• HP Vertica (software-based analytics platform); 
• ParAccel (Analytic DataBase and Plateform); 
• EMC Greenplum (Data Computing Appliance DCA , massively parallel processing 
database MPP).
JAM 92 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
MMaappRReedduuccee
JAM 93 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
MMaappRReedduuccee
JAM 94 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
MMaappRReedduuccee
JAM 95 / 
MMaappRReedduuccee Map Reduce 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Big Data Result
JAM 96 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
MapReduce 
• Data storage; 
• Architectures; 
• Parallel programming; 
• Integration.
JAM 97 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
MapReduce Layers - Architecture 
With Api, for example using IBM API.
JAM 98 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
MapReduce 
Processing 
Split: the input data-set into independent chunks
JAM 99 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Data Storage 
• DDistributed FFile SSystem: 
Google File System (scalable distributed file 
system for large distributed data-intensive 
applications ); 
Hadoop Distributed File System (HDFS) is a 
distributed, scalable, and portable.
JAM 100 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
HDFS 
layers
JAM 101 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
HDFS diagram :
JAM 102 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
HDFS diagram 
Simple architecture :
JAM 103 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
HDFS 
Cluster diagram
JAM 104 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Flows on Global File System / Cluster
JAM 105 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Master / Slave Diagram:
JAM 106 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
MMaappRReedduuccee using 
• GFS (Global File System) 
– shared disk file system for Linux computer 
clusters; 
– symmetric Cluster Architecture; 
• HPFS 
– High Performance File System.
JAM 107 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Example :
JAM 108 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Meta Data on Serveur / Cluster
JAM 109 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Map Reduce 
GFS / HPFS Architectures : 
– Master node; 
• The architecture of GFS follows the master-slave model. 
– Distribution of chunks (fragment of information); 
– Replicating chunks.
JAM 110 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Map Reduce 
IT architecture models 
• Partitioning according to the input files; 
• Fault tolerance; 
• Balancing loads.
JAM 111 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Map Reduce 
IT architecture models 
• key–value model; 
• Function 
Map 
Reduce
JAM 112 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Business Model IT Architecture Model
JAM 113 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Business IT architecture and process
JAM 114 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
IT architecture models 
Internet 
Load Balencer 
Vm (1) Vm (2) ….. Vm (n) 
Real machine ( 1 …n) 
Virtual
JAM 115 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
diagram - modeling :
JAM 116 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. A short break, 
You can ask your questions. 
« The direction in which 
education starts a man 
will determine his future in 
life. » 
Plato
JAM 117 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Hadoop & MapReduce
JAM 118 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Switch 
Hadoop slave 
servers 
Hadoop master 
and slave server Test server
JAM 119 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Switch 
Hadoop slave 
servers 
Hadoop master 
and slave server Test server
Hadoop HDFS 
JAM 120 / 
Hadoop 
Manage File Transfer 
Hadoop Gateway Server 
inspect process loading Loading 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
aggregate automate load
JAM 121 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Hadoop BD Architecture and Layers
JAM 122 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Cluster
JAM 123 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Cluster 
Flow
JAM 124 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
MapReduce Architecture
JAM 125 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
MapReduce
JAM 126 / 
MapReduce Architecture 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
methods and algorithms
JAM 127 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
MapReduce Architecture
JAM 128 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Map Reduce
JAM 129 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
PPaarraalllleell pprrooggrraammmmiinngg with MapReduce framework 
Tools: 
– Pig (workflow interface – Pig script); 
– Hive (SQL Interface); 
– … 
• Software publisher: 
– cloudera, datameer, greenplem HD, hurence, Intel, Talend,Vmware.
The software layers 
Apache ZooKeeper is a centralized service for maintaining configuration information, naming, providing distributed synchronization, and 
providing group services 
JAM 130 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Apache Ambari is an operational framework for provisioning, managing and monitoring Apache Hadoop clusters.
JAM 131 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
MapReduce integration into the information system 
• Database Machine; 
• Cloud: 
– Outsourcing of the Big data processing; 
• Tools : 
– Google cloud (Platform, Print, BigQuery …), Amazon web services, Microsoft 
HDInsight (Microsoft's Hadoop-based service), IBM InfoSphere BigInsights (built on 
the Apache Hadoop software framework - for analyzing massive volumes) , 
ORACLE BigData Appliance (engineered system for acquiring, organizing, and 
loading unstructured data into an Oracle Database), Cloudera Impala (open source 
massively parallel processing (MPP) SQL query engine for data stored in a computer 
cluster running Apache ).
JAM 132 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Model of Information Systems 
in the Cloud environment.
JAM 133 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
User Applications 
Integrated 
Services 
Single Data Strore 
Service 
Information System 
Internet 
and mobile 
and tablet 
ERP 
data warehouse
JAM 134 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Exemple for data warehouse
JAM 135 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Cloud Architecture 
Layers
JAM 136 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Cloud Architecture 
Platform as a service 
Infrastructure as a service
JAM 137 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Cloud Architecture: 
Access Security.
JAM 138 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Data streaming 
• SCA Streaming Computing Architecture; 
• Streaming SQL; 
• Processing and stream management of the 
data flow.
JAM 139 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
SCA Streaming Computing Architecture 
• Concepts: 
– Data stream; 
– Continuous queries; 
– Temporary data; 
• Flow modeling: 
– Sequences; 
– Time series; 
• Parallel streaming.
JAM 140 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
SCA Streaming Computing Architecture 
• Concepts: 
– Data stream; 
– Continuous queries; 
– Temporary data; 
• Flows modeling: 
– Sequences; 
– Time series; 
• Parallel streaming.
JAM 141 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Streaming Graph
JAM 142 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Streaming SQL 
• Filtering; 
• Merger; 
• Flows aggregation ; 
• Join query between flows and the persistent 
data of the information system.
JAM 143 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33
JAM 144 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Processing data flows 
• Query stream; 
• Set of queries on the stream; 
• Stream mining; 
• Automatic classification of flows. 
• Products: Tibco streambase (push-based real-time analytics solution), ORACLE 
Streams, IBM Infosphere Streams (stream computing solution to enable real-time 
analytic processing of data), Microsoft streamInsight (platform for complex event 
processing), Telegraph CQ, LighStream (solution for real-time messaging over HTTP 
and WebSockets), Apach Samza (stream processing framework), Storm (has use cases: 
realtime analytics, online machine learning, continuous computation, distributed RPC, 
ETL ).
JAM 145 / 
A short break, You can ask your questions. 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
“Better three hours too soon than a minute too late.” 
William Shakespeare
JAM 146 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
NNooSSQQLL DDBBMMSS
JAM 147 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
NoSQL DBMS 
• NoSQL architecture; 
• NoSQL approaches; 
• Integrating SQL into NoSQL.
JAM 148 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
aarrcchhiitteeccttuurree 
ssooffttwwaarree llaayyeerrss 
example
JAM 149 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
aarrcchhiitteeccttuurree 
ssooffttwwaarree llaayyeerrss 
example
JAM 150 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
NoSQL architectures 
• Scalability; 
• Architecture 3 Tier in the cloud (cloud computing); 
• NoSQL : 
– RDBMS object; 
DBMS XML.
JAM 151 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Architecture in the cloud 
example
JAM 152 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
AArrcchhiitteeccttuurree iinn tthhee cclloouudd 
example
JAM 153 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
• DBMS key value: 
– Amazon dynamo DB (NoSQL database service that is offered by 
Amazon.com), Amazon simple DB (distributed database written in 
Erlang by Amazon.com), Apache Cassandra (massively scalable 
open source NoSQL database), Voldemort (a distributed key-value 
storage system, Data is automatically replicated over multiple 
servers, Processing Data Streams at LinkedIn), LinkedIn Espresso 
(horizontally scalable, indexed, timeline-consistent, document-oriented, 
highly available NoSQL data store).
JAM 154 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Using, Schema
JAM 155 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Example, Diagram
JAM 156 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
• Large DBMS table : 
– Google Big table 
• a compressed, high performance, and proprietary data storage 
system built on Google File System; 
– Hadoop Hbase 
• open source, non-relational, scalable, distributed database 
modeled; 
– Apache Acumulo 
• high performance data storage and retrieval system with cell-level 
access control.
JAM 157 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33
JAM 158 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Hadoop Hbase 
Layers
JAM 159 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Hbase in the 
Hadoop Ecosystem
JAM 160 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
• Document database (managing document-oriented information): 
– MongoDB; 
• Agile and Scalable; 
– Ubuntu Couche DB.
JAM 161 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
• Document database : 
– MongoDB; 
– Ubuntu Couche DB
JAM 162 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
• Graph database : 
Graph databases apply graph theory to the 
storage of information about the relationships between 
entries; 
Graph database is a database that uses graph 
structures with nodes, edges, and properties to represent 
and store data.
JAM 163 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
• Graph database : 
– Google Pregel (designed for the Google cluster 
architecture), (use for processing graph algorithms); 
– Neo4j (highly scalable, robust – ACID, native graph 
database, JAVA Graph Database; 
– DEX (scalable graph database management system - 
allows for the integration of multiple data sources).
JAM 164 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Graph Model
JAM 165 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Graph properties 
Model
JAM 166 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Graph roles 
Model
JAM 167 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
SQL / NoSQL integration 
Decrease in consistency.
JAM 168 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Synthesis scheme
JAM 169 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
The management of the calculations distribution.
JAM 170 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
EntryProcessor 
MapEntry 
 send the code (computer program) to the data 
rather than sending the data to the code (computer program).
JAM 171 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
EntryProcessor 
MapEntry 
 send the code (computer program) to the data 
rather than sending the data to the code (computer program). 
COHERENCE
JAM 172 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
EntryProcessor 
• Decrease the amount of data passing 
through the network; 
• Répartion of the load on different nodes of 
the cluster; 
• Distribution of concurrent operations 
without explicit lock;
JAM 173 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
DDAATTAA SSCCIIEENNCCEE -- LL33 
Parallel Execution Process 
• Method; 
• Distributed execution on a cluster; 
• Execution parallelized on the cluster; 
• Each cluster node handles its local data.
JAM 174 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
« We are born weak, we need strength; 
helpless, we need aid; foolish, we need 
reason. All that we lack at birth, all that we 
need when we come to man's estate, is the gift 
of education. » 
Jean-Jacques Rousseau
JAM 175 / 
Contact http://www.jean-antoine-moreau.fr.nf 
© Jean-Antoine Moreau 
This lesson, copying and reproduction prohibited 
Managing my copyright: ADAGP. 
End of the part of this lesson published on the internet.

Mais conteúdo relacionado

Semelhante a DATA SCIENCE Lesson 3 Data Architectures Data Processing Modeling -Algorithm - Method conference Jean-Antoine Moreau

Data Science Lesson 1 Jean-Antoine Moreau
Data Science Lesson 1 Jean-Antoine MoreauData Science Lesson 1 Jean-Antoine Moreau
Data Science Lesson 1 Jean-Antoine MoreauJean-Antoine Moreau
 
Big Data Lesson 3 Jean-Antoine Moreau
Big Data Lesson 3 Jean-Antoine MoreauBig Data Lesson 3 Jean-Antoine Moreau
Big Data Lesson 3 Jean-Antoine MoreauJean-Antoine Moreau
 
Architecture IT Conference Jean-Antoine Moreau
Architecture IT Conference Jean-Antoine MoreauArchitecture IT Conference Jean-Antoine Moreau
Architecture IT Conference Jean-Antoine MoreauJean-Antoine Moreau
 
EMF Views - EclipseCon 2014, Modeling Symposium
EMF Views - EclipseCon 2014, Modeling SymposiumEMF Views - EclipseCon 2014, Modeling Symposium
EMF Views - EclipseCon 2014, Modeling SymposiumHugo Bruneliere
 
Optimization of power systems - old and new tools
Optimization of power systems - old and new toolsOptimization of power systems - old and new tools
Optimization of power systems - old and new toolsOlivier Teytaud
 
Tools for Discrete Time Control; Application to Power Systems
Tools for Discrete Time Control; Application to Power SystemsTools for Discrete Time Control; Application to Power Systems
Tools for Discrete Time Control; Application to Power SystemsOlivier Teytaud
 
Collaboro - EclipseCon France 2013 - Ignite Talks Session
Collaboro - EclipseCon France 2013 - Ignite Talks SessionCollaboro - EclipseCon France 2013 - Ignite Talks Session
Collaboro - EclipseCon France 2013 - Ignite Talks SessionHugo Bruneliere
 
Artificial Intelligence on Data Centric Platform
Artificial Intelligence on Data Centric PlatformArtificial Intelligence on Data Centric Platform
Artificial Intelligence on Data Centric PlatformStratio
 
Monitoring : The art of knowing when and why things go wrong
Monitoring : The art of knowing when and why things go wrongMonitoring : The art of knowing when and why things go wrong
Monitoring : The art of knowing when and why things go wrongOpen Source School
 
Have you been stalking your servers?
Have you been stalking your servers?Have you been stalking your servers?
Have you been stalking your servers?morpht
 
practicing what you never preached: sorting and discarding from a practical ...
practicing what you never preached:  sorting and discarding from a practical ...practicing what you never preached:  sorting and discarding from a practical ...
practicing what you never preached: sorting and discarding from a practical ...FIAT/IFTA
 
XNAT_online_workshop_2020-05-13.pdf
XNAT_online_workshop_2020-05-13.pdfXNAT_online_workshop_2020-05-13.pdf
XNAT_online_workshop_2020-05-13.pdfMichael394934
 
Collaboro - Community-Driven Language Development
Collaboro - Community-Driven Language DevelopmentCollaboro - Community-Driven Language Development
Collaboro - Community-Driven Language DevelopmentJavier Canovas
 
Data recovery slide show
Data recovery slide showData recovery slide show
Data recovery slide showtutannandi
 
Profiling your Java Application
Profiling your Java ApplicationProfiling your Java Application
Profiling your Java ApplicationVictor Rentea
 
IRJET- Review on Anti-Piracy Screening System
IRJET-  	  Review on Anti-Piracy Screening SystemIRJET-  	  Review on Anti-Piracy Screening System
IRJET- Review on Anti-Piracy Screening SystemIRJET Journal
 
Digital Multimedia Productions: Incroporating Wikis, the Jing Project and Oth...
Digital Multimedia Productions: Incroporating Wikis, the Jing Project and Oth...Digital Multimedia Productions: Incroporating Wikis, the Jing Project and Oth...
Digital Multimedia Productions: Incroporating Wikis, the Jing Project and Oth...jvp3
 
IRJET- Criminal Recognization in CCTV Surveillance Video
IRJET-  	  Criminal Recognization in CCTV Surveillance VideoIRJET-  	  Criminal Recognization in CCTV Surveillance Video
IRJET- Criminal Recognization in CCTV Surveillance VideoIRJET Journal
 

Semelhante a DATA SCIENCE Lesson 3 Data Architectures Data Processing Modeling -Algorithm - Method conference Jean-Antoine Moreau (20)

Data Science Lesson 1 Jean-Antoine Moreau
Data Science Lesson 1 Jean-Antoine MoreauData Science Lesson 1 Jean-Antoine Moreau
Data Science Lesson 1 Jean-Antoine Moreau
 
Big Data Lesson 3 Jean-Antoine Moreau
Big Data Lesson 3 Jean-Antoine MoreauBig Data Lesson 3 Jean-Antoine Moreau
Big Data Lesson 3 Jean-Antoine Moreau
 
Architecture IT Conference Jean-Antoine Moreau
Architecture IT Conference Jean-Antoine MoreauArchitecture IT Conference Jean-Antoine Moreau
Architecture IT Conference Jean-Antoine Moreau
 
EMF Views - EclipseCon 2014, Modeling Symposium
EMF Views - EclipseCon 2014, Modeling SymposiumEMF Views - EclipseCon 2014, Modeling Symposium
EMF Views - EclipseCon 2014, Modeling Symposium
 
0-introduction.pdf
0-introduction.pdf0-introduction.pdf
0-introduction.pdf
 
Optimization of power systems - old and new tools
Optimization of power systems - old and new toolsOptimization of power systems - old and new tools
Optimization of power systems - old and new tools
 
Tools for Discrete Time Control; Application to Power Systems
Tools for Discrete Time Control; Application to Power SystemsTools for Discrete Time Control; Application to Power Systems
Tools for Discrete Time Control; Application to Power Systems
 
Collaboro - EclipseCon France 2013 - Ignite Talks Session
Collaboro - EclipseCon France 2013 - Ignite Talks SessionCollaboro - EclipseCon France 2013 - Ignite Talks Session
Collaboro - EclipseCon France 2013 - Ignite Talks Session
 
PenO1: les 2
PenO1: les 2PenO1: les 2
PenO1: les 2
 
Artificial Intelligence on Data Centric Platform
Artificial Intelligence on Data Centric PlatformArtificial Intelligence on Data Centric Platform
Artificial Intelligence on Data Centric Platform
 
Monitoring : The art of knowing when and why things go wrong
Monitoring : The art of knowing when and why things go wrongMonitoring : The art of knowing when and why things go wrong
Monitoring : The art of knowing when and why things go wrong
 
Have you been stalking your servers?
Have you been stalking your servers?Have you been stalking your servers?
Have you been stalking your servers?
 
practicing what you never preached: sorting and discarding from a practical ...
practicing what you never preached:  sorting and discarding from a practical ...practicing what you never preached:  sorting and discarding from a practical ...
practicing what you never preached: sorting and discarding from a practical ...
 
XNAT_online_workshop_2020-05-13.pdf
XNAT_online_workshop_2020-05-13.pdfXNAT_online_workshop_2020-05-13.pdf
XNAT_online_workshop_2020-05-13.pdf
 
Collaboro - Community-Driven Language Development
Collaboro - Community-Driven Language DevelopmentCollaboro - Community-Driven Language Development
Collaboro - Community-Driven Language Development
 
Data recovery slide show
Data recovery slide showData recovery slide show
Data recovery slide show
 
Profiling your Java Application
Profiling your Java ApplicationProfiling your Java Application
Profiling your Java Application
 
IRJET- Review on Anti-Piracy Screening System
IRJET-  	  Review on Anti-Piracy Screening SystemIRJET-  	  Review on Anti-Piracy Screening System
IRJET- Review on Anti-Piracy Screening System
 
Digital Multimedia Productions: Incroporating Wikis, the Jing Project and Oth...
Digital Multimedia Productions: Incroporating Wikis, the Jing Project and Oth...Digital Multimedia Productions: Incroporating Wikis, the Jing Project and Oth...
Digital Multimedia Productions: Incroporating Wikis, the Jing Project and Oth...
 
IRJET- Criminal Recognization in CCTV Surveillance Video
IRJET-  	  Criminal Recognization in CCTV Surveillance VideoIRJET-  	  Criminal Recognization in CCTV Surveillance Video
IRJET- Criminal Recognization in CCTV Surveillance Video
 

Mais de Jean-Antoine Moreau

l'Intelligence Artificielle Jean-Antoine Moreau
l'Intelligence Artificielle Jean-Antoine Moreaul'Intelligence Artificielle Jean-Antoine Moreau
l'Intelligence Artificielle Jean-Antoine MoreauJean-Antoine Moreau
 
Management of the Performance Jean-Antoine Moreau
Management of the Performance Jean-Antoine MoreauManagement of the Performance Jean-Antoine Moreau
Management of the Performance Jean-Antoine MoreauJean-Antoine Moreau
 
Management de la Performance Jean-Antoine Moreau
Management de la Performance Jean-Antoine MoreauManagement de la Performance Jean-Antoine Moreau
Management de la Performance Jean-Antoine MoreauJean-Antoine Moreau
 
Stratégie Économique Jean-Antoine Moreau
Stratégie Économique Jean-Antoine MoreauStratégie Économique Jean-Antoine Moreau
Stratégie Économique Jean-Antoine MoreauJean-Antoine Moreau
 
Economic Strategy Jean-Antoine Moreau
Economic Strategy Jean-Antoine MoreauEconomic Strategy Jean-Antoine Moreau
Economic Strategy Jean-Antoine MoreauJean-Antoine Moreau
 
Stratégie Industrielle Jean-Antoine Moreau
Stratégie Industrielle Jean-Antoine MoreauStratégie Industrielle Jean-Antoine Moreau
Stratégie Industrielle Jean-Antoine MoreauJean-Antoine Moreau
 
Regional Economic Development Jean-Antoine Moreau
Regional Economic Development Jean-Antoine MoreauRegional Economic Development Jean-Antoine Moreau
Regional Economic Development Jean-Antoine MoreauJean-Antoine Moreau
 
MARKETING STRATEGY Jean-Antoine Moreau
MARKETING STRATEGY Jean-Antoine MoreauMARKETING STRATEGY Jean-Antoine Moreau
MARKETING STRATEGY Jean-Antoine MoreauJean-Antoine Moreau
 
Politique Industrielle Seconde Partie
Politique Industrielle Seconde PartiePolitique Industrielle Seconde Partie
Politique Industrielle Seconde PartieJean-Antoine Moreau
 
Politique industrielle Jean-Antoine Moreau
Politique industrielle Jean-Antoine MoreauPolitique industrielle Jean-Antoine Moreau
Politique industrielle Jean-Antoine MoreauJean-Antoine Moreau
 
Réindustrialisation,Politique Industrielle,Plan Industriel Jean-Antoine Moreau
Réindustrialisation,Politique Industrielle,Plan Industriel Jean-Antoine MoreauRéindustrialisation,Politique Industrielle,Plan Industriel Jean-Antoine Moreau
Réindustrialisation,Politique Industrielle,Plan Industriel Jean-Antoine MoreauJean-Antoine Moreau
 
Le Chômage en France Etude, Impacts sociétaux et économiques
Le Chômage en France Etude, Impacts sociétaux et économiquesLe Chômage en France Etude, Impacts sociétaux et économiques
Le Chômage en France Etude, Impacts sociétaux et économiquesJean-Antoine Moreau
 
Diagnostic Projet Jean-Antoine Moreau
Diagnostic Projet  Jean-Antoine MoreauDiagnostic Projet  Jean-Antoine Moreau
Diagnostic Projet Jean-Antoine MoreauJean-Antoine Moreau
 
Élaborer et Mesurer Une Stratégie d’Entreprise Modèle Méthode simple d’évalua...
Élaborer et Mesurer Une Stratégie d’Entreprise Modèle Méthode simple d’évalua...Élaborer et Mesurer Une Stratégie d’Entreprise Modèle Méthode simple d’évalua...
Élaborer et Mesurer Une Stratégie d’Entreprise Modèle Méthode simple d’évalua...Jean-Antoine Moreau
 
Systemic approach to commercial programming and commercial choices Jean-Antoi...
Systemic approach to commercial programming and commercial choices Jean-Antoi...Systemic approach to commercial programming and commercial choices Jean-Antoi...
Systemic approach to commercial programming and commercial choices Jean-Antoi...Jean-Antoine Moreau
 
La Gestion de Configuration des logiciels et du Système d’Information Cours J...
La Gestion de Configuration des logiciels et du Système d’Information Cours J...La Gestion de Configuration des logiciels et du Système d’Information Cours J...
La Gestion de Configuration des logiciels et du Système d’Information Cours J...Jean-Antoine Moreau
 

Mais de Jean-Antoine Moreau (20)

Histoire de la Drogue en France
Histoire de la Drogue en FranceHistoire de la Drogue en France
Histoire de la Drogue en France
 
l'Intelligence Artificielle Jean-Antoine Moreau
l'Intelligence Artificielle Jean-Antoine Moreaul'Intelligence Artificielle Jean-Antoine Moreau
l'Intelligence Artificielle Jean-Antoine Moreau
 
Blockchain Jean-Antoine Moreau
Blockchain   Jean-Antoine MoreauBlockchain   Jean-Antoine Moreau
Blockchain Jean-Antoine Moreau
 
Management of the Performance Jean-Antoine Moreau
Management of the Performance Jean-Antoine MoreauManagement of the Performance Jean-Antoine Moreau
Management of the Performance Jean-Antoine Moreau
 
Management de la Performance Jean-Antoine Moreau
Management de la Performance Jean-Antoine MoreauManagement de la Performance Jean-Antoine Moreau
Management de la Performance Jean-Antoine Moreau
 
Le Budget Jean-Antoine Moreau
Le Budget Jean-Antoine MoreauLe Budget Jean-Antoine Moreau
Le Budget Jean-Antoine Moreau
 
Stratégie Économique Jean-Antoine Moreau
Stratégie Économique Jean-Antoine MoreauStratégie Économique Jean-Antoine Moreau
Stratégie Économique Jean-Antoine Moreau
 
Economic Strategy Jean-Antoine Moreau
Economic Strategy Jean-Antoine MoreauEconomic Strategy Jean-Antoine Moreau
Economic Strategy Jean-Antoine Moreau
 
Stratégie Industrielle Jean-Antoine Moreau
Stratégie Industrielle Jean-Antoine MoreauStratégie Industrielle Jean-Antoine Moreau
Stratégie Industrielle Jean-Antoine Moreau
 
Regional Economic Development Jean-Antoine Moreau
Regional Economic Development Jean-Antoine MoreauRegional Economic Development Jean-Antoine Moreau
Regional Economic Development Jean-Antoine Moreau
 
MARKETING STRATEGY Jean-Antoine Moreau
MARKETING STRATEGY Jean-Antoine MoreauMARKETING STRATEGY Jean-Antoine Moreau
MARKETING STRATEGY Jean-Antoine Moreau
 
Politique Industrielle Seconde Partie
Politique Industrielle Seconde PartiePolitique Industrielle Seconde Partie
Politique Industrielle Seconde Partie
 
Politique industrielle Jean-Antoine Moreau
Politique industrielle Jean-Antoine MoreauPolitique industrielle Jean-Antoine Moreau
Politique industrielle Jean-Antoine Moreau
 
Réindustrialisation,Politique Industrielle,Plan Industriel Jean-Antoine Moreau
Réindustrialisation,Politique Industrielle,Plan Industriel Jean-Antoine MoreauRéindustrialisation,Politique Industrielle,Plan Industriel Jean-Antoine Moreau
Réindustrialisation,Politique Industrielle,Plan Industriel Jean-Antoine Moreau
 
Le Chômage en France Etude, Impacts sociétaux et économiques
Le Chômage en France Etude, Impacts sociétaux et économiquesLe Chômage en France Etude, Impacts sociétaux et économiques
Le Chômage en France Etude, Impacts sociétaux et économiques
 
Diagnostic Projet Jean-Antoine Moreau
Diagnostic Projet  Jean-Antoine MoreauDiagnostic Projet  Jean-Antoine Moreau
Diagnostic Projet Jean-Antoine Moreau
 
Élaborer et Mesurer Une Stratégie d’Entreprise Modèle Méthode simple d’évalua...
Élaborer et Mesurer Une Stratégie d’Entreprise Modèle Méthode simple d’évalua...Élaborer et Mesurer Une Stratégie d’Entreprise Modèle Méthode simple d’évalua...
Élaborer et Mesurer Une Stratégie d’Entreprise Modèle Méthode simple d’évalua...
 
Systemic approach to commercial programming and commercial choices Jean-Antoi...
Systemic approach to commercial programming and commercial choices Jean-Antoi...Systemic approach to commercial programming and commercial choices Jean-Antoi...
Systemic approach to commercial programming and commercial choices Jean-Antoi...
 
SQL Jean-Antoine Moreau
SQL  Jean-Antoine MoreauSQL  Jean-Antoine Moreau
SQL Jean-Antoine Moreau
 
La Gestion de Configuration des logiciels et du Système d’Information Cours J...
La Gestion de Configuration des logiciels et du Système d’Information Cours J...La Gestion de Configuration des logiciels et du Système d’Information Cours J...
La Gestion de Configuration des logiciels et du Système d’Information Cours J...
 

Último

Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Christo Ananth
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdfankushspencer015
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...ranjana rawat
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdfKamal Acharya
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations120cr0395
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdfKamal Acharya
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...Call Girls in Nagpur High Profile
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college projectTonystark477637
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 

Último (20)

Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdf
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 

DATA SCIENCE Lesson 3 Data Architectures Data Processing Modeling -Algorithm - Method conference Jean-Antoine Moreau

  • 1. LLeeccttuurree 33 DDaattaa AArrcchhiitteeccttuurreess SSyynntthheessiiss DDaattaa PPrroocceessssiinngg DDaattaa MMiinniinngg MMooddeelliinngg --AAllggoorriitthhmm –– MMeetthhoodd  IInntteeggrraattiioonn iinnttoo tthhee IInnffoorrmmaattiioonn SSyysstteemm Conference - Study : Jean-Antoine Moreau (Engineer - Lecturer) © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP.
  • 2. DDAATTAA SSCCIIEENNCCEE LLeessssoonn 33 Conference - Study : Jean-Antoine Moreau (Engineer - Lecturer) © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP.
  • 3. JAM 3 / 175 Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. « Tell me and I forget. Teach me and I remember. Involve me and I learn. » Benjamin Franklin
  • 4. JAM 4 / 175 Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 • Volumes of data are: – more and more important; – distributed; – interconnected; – heterogeneous; – mobile.
  • 5. JAM 5 / 175 Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 • Data volumes are more difficult to: – integrate – analyze in real-time
  • 6. JAM 6 / 175 Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 These new requirements these new conditions involve new data models new methods of modeling
  • 7. JAM 7 / 175 Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 new data models new methods of modeling information systems architectures Large-Scale Distributed Data Intensive Information Systems.
  • 8. JAM 8 / 175 Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Simple modeling:
  • 9. JAM 9 / 175 Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Layers’s modeling
  • 10. JAM 10 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Massively parallel architectures
  • 11. JAM 11 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 diagram
  • 12. JAM 12 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 data streaming streaming bulk data
  • 13. JAM 13 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Basic diagram :
  • 14. JAM 14 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 • Changeover / transition of DBMS (DATABASE MANAGEMENT SYSTEM) to NoSQL or Hybrid DBMS.
  • 15. JAM 15 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 summary table
  • 16. JAM 16 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Data Base use
  • 17. JAM 17 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 • New data models: – tables; – key values; – documents; – graphs; – streams; • multiple sizes; • size (volume) changing in time; • different versions.
  • 18. JAM 18 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Modeling includes : – in-depth of data; – the position in time; – release management.
  • 19. JAM 19 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Modeling includes : – Real time applied to each object; • each attribute; • each relationship between objects; – Simple, composition, coupling, cardinality; • each relationship between process.
  • 20. JAM 20 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Modeling of data distributed multi-site architectures and cloud
  • 21. JAM 21 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 • Massively parallel architectures; • Massively parallel processing; • Massively parallel computing.
  • 22. JAM 22 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 • Online Storage • Database Machine MapReduce Oracle …
  • 23. JAM 23 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 • Using – In-Memory DataBase (IMDB); – Memory DataBase System (MMDB); – Memory Resident DataBase : • database management system that primarily relies on main memory for computer data storage.
  • 24. JAM 24 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Using the In-Memory New form of modelisation
  • 25. JAM 25 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Example of modelisation:
  • 26. JAM 26 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 SSttrreeaammiinngg ddaattaa • How to analyze data streams in real time? • How to analyze process flows in real time? • How to use and cross these streams with the information system data?
  • 27. JAM 27 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 • Data evolution : – Open data; – Interconnection of data: • social networks; • linked open data; – Big Data: • volume; • variety; • velocity; • truthfulness; • validity.
  • 28. JAM 28 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 • Evolution data: – Internet: • Data distribution; • Heterogeneity of the data; • Data mobility. – Flows: • Continuous data stream; • Complex events.
  • 29. DDAATTAA SSCCIIEENNCCEE -- LL33 JAM 29 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. in-depth data time Location/ localization Data Stream Version xX Data Stream Version xY Data Stream Version xZ
  • 30. JAM 30 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Computer Applications The issues • IT operations in real-time from distributed data; • Crossing and visualization of public and private data; • Complex Big Data analysis.
  • 31. JAM 31 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Big Data Data Analytic
  • 32. JAM 32 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Transactions distributed over the network data stream processing in real time Complex Event Processing (CEP)
  • 33. JAM 33 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Technology • Flash memory; • HDD (Hard Disk Drive) - SSD (Solid-State Drive); • Multi-core processing; • Combination CPU (Central Processing Unit) / GPU (Graphics Processing Unit); • Prioritization of parallel computing; • Server virtualization (VM virtual Machines); • …
  • 34. JAM 34 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Technology • Virtualization of disk storage: – NAS (Network Attached Storage) – SAN (Storage Area Network); • New architectures DBMS (database management system).
  • 35. JAM 35 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 SAN Storage Area Network with a Cluster
  • 36. JAM 36 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Databases • Adaptation: – patterns; – requests; – transactions; – views; – integrity constraints; – triggers; – managing indexes; – changing rules of consistency; – traceability rules.
  • 37. JAM 37 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Databases ACID database models ACID : – AAtomicity; – CConsistency; – IIsolation; – DDurability.
  • 38. JAM 38 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Data Modeling in a Hybrid NoSQL / Relational
  • 39. JAM 39 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Data Model Data Modeling
  • 40. JAM 40 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Data models Relational : – value domain; – link; – algebra and calculus; – concept of zero. SQL 2: – the data type; – insulation levels; – the portablité. SQL 3: – Nested tables; – Complex types; – Extensions (expand) objects.
  • 41. JAM 41 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 New models: – key values; – tables; – documents; – graph; – stream.
  • 42. JAM 42 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Access to the database • The object-relational impedance mismatch. • API (Application Programmable Interface): – Ado (Microsoft ActiveX Data Objects); – JDBC (Java DataBase Connectivity); – SQLJ (program can call the JDBC ); – JPA (Java Persistence API); – JDD (Java Decision Diagrams); • Using Language Integrated Query (example : Microsoft .NET Framework ).
  • 43. JAM 43 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Data Analysis • Decision-making system and OLAP – multidimensional databases; • OLAP system (On-Line Analytical Processing); • OLTP (Online transaction processing) - class of software programs capable of supporting transaction-oriented applications; • TPC (TPC-H / TPC-E) – Transaction Processing Performance Council; • Simulates a Decision Support System or Business Intelligence database environment. – Multi-dimensional data.
  • 44. JAM 44 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Data Analysis • Data cubes: – array of data; – multidimensional matrix.
  • 45. JAM 45 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 array of data multidimensional matrix
  • 46. JAM 46 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Business Intelligence Data Mining Extraction of knowledge from the data.
  • 47. JAM 47 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 ddiissttrriibbuutteedd aarrcchhiitteeccttuurreess
  • 48. JAM 48 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 • multi-tier architectures: – 2-Tiered architecture; – architecture technology to 3-tier architecture; – N-tier architecture.
  • 49. JAM 49 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 2 Tiered :
  • 50. JAM 50 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Three-Tier Architecture
  • 51. JAM 51 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Three-Tier Architecture
  • 52. JAM 52 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. « We know what we are, but know not what we may be. » William Shakespeare A short break, You can ask your questions.
  • 53. JAM 53 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Architectures • RIA: Rich Internet application; • RDA: Rich Desktop Application.
  • 54. JAM 54 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 RIA: Rich Internet application
  • 55. JAM 55 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 client network server thick thin rich application
  • 56. JAM 56 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 • Using : – HTML 5 (HyperText Mark-Up Language); – CSS 3 (Cascading Style Sheets); – Java Script; – Ajax (Asynchronous JavaScript and XML).
  • 57. JAM 57 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Coupling Server / Application Data server
  • 58. JAM 58 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Data Protection : – encryption; – discretionary access control; – control of multi-level access.
  • 59. JAM 59 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Service-Oriented Architecture • Web services method of communication between two electronic devices over a network; • WSDL Web Services Description Language; • UDDI Universal Description, Discovery and Integration; • SOAP Simple Object Access protocol;
  • 60. JAM 60 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 • SOA (Service-Oriented Architecture), deployment of distributed services on ESB (Enterprise Service Bus); • EAI functions (Enterprise Application Integration – Workflow Process). • WOA (Web Oriented Architecture) – use technologies : • POX (Plain Old XML); • REST (Representational State Transfer) .
  • 61. JAM 61 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 • Massively distributed architectures; • Massively distributed systems; • Massively distributed computing.
  • 62. JAM 62 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 • Cluster : – Parallelism; • Grid : – Virtual Infrastructure; – Parallelism and Distribution; • P2P : – Distribution;
  • 63. JAM 63 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Diagram Architecture Master/Slaves
  • 64. JAM 64 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 • and the cclloouudd:  parallelism of computer centers distributed across the planet  GGlloobbaalliizziinngg tthhee tteecchhnniiccaall aarrcchhiitteeccttuurreess .
  • 65. JAM 65 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Example : Cloud architecture including API to connect on a social network.
  • 66. JAM 66 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Management of the distributed data • Integration of heterogeneous data; • Unifying the distributed data; • DBMS transaction (database management system) distributed.
  • 67. JAM 67 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Integration of heterogeneous data • Federation of data sources; • Mediator & Adapter / Architectures; • Integration of schema.
  • 68. JAM 68 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Integration of heterogeneous data • Data models for the integration : – Relational model; – Object; – XML; • data model that's based on an XML file • the XML Data Model makes it easier to integrate XFDL forms with other applications. – XFDL : Extensible Forms Description Language is a class of the Extensible Markup Language (XML) originally specified in World Wide Web Consortium (W3C) ; – key-value model.
  • 69. JAM 69 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Integration of heterogeneous data Visualization of the integrated data : – Dataviz (software) ; – Dashboards; – …
  • 70. JAM 70 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Unifying distributed data • Query optimization processing; • Intermediate results of integration; • Products (some examples) : – Google Fusion Tables (or simply Fusion Tables) is a web service provided by Google for data management; – IBM InfoSphere Information Server for Data Integration ; – Oracle Data Integrator (data integration platform); – SAP BusinessObjects (Data Integrator and Data Quality Management); – Talend for Data Integration (Talend Open Studio for Data Integration).
  • 71. JAM 71 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Distributed DBMS Transaction Management in Distributed Database.
  • 72. JAM 72 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 • Transaction Management in Distributed Database: – Two-Phase Commit Protocol (2PC) • type of atomic commitment protocol (ACP); – Fault tolerance; – Scalability; • Data replication: – consistency of copies; – propagation of updates; – adjusting the consistency rules of the databases; ORACLE, IBM DB2, Microsoft SQL server, SAP sybase, PostgreSQL and Postgres-XC.
  • 73. JAM 73 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 • Parallel data processing : – Massive parallelism of data; • massively data-parallel applications on GPU clusters; – (GPU Graphics Processing Unit); • data-parallel ... – Database machine; – Also the basic techniques.
  • 74. JAM 74 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Massive parallelism of data GGooaallss
  • 75. JAM 75 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 • Goals : – Linear Speed up; – Linear Scale up; – Scale out; – Elasticity; – Adaptability.
  • 76. JAM 76 / Shared-Memory Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Multiprocessor Architectures SMP shared memory multiprocessor NUMA Non Uniform Memory Architecture
  • 77. JAM 77 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 SSMMPP shared memory multiprocessor
  • 78. JAM 78 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 NNUUMMAA Non Uniform Memory Architecture
  • 79. JAM 79 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Multiprocessor Architectures • Cluster shared-disk; • Cluster shared-computing; • …
  • 80. JAM 80 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Example 1 :
  • 81. JAM 81 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Example 2:
  • 82. JAM 82 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 using servers
  • 83. JAM 83 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 tree – organization - cluster
  • 84. JAM 84 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 • Types of data parallelism: – Inter-application; – Application; – Inter-operation; – Pipeline.
  • 85. JAM 85 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Example : Data Processing
  • 86. JAM 86 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Example : Signal Processing
  • 87. JAM 87 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Parallel data processing basic techniques
  • 88. JAM 88 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Data storage Partitioning and placement strategies in database machines Partitioning - Sharing replication indexing
  • 89. JAM 89 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 performances Parallelism of applications Balance of charges Transactions
  • 90. JAM 90 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Fall-over management High availability Savepoints
  • 91. JAM 91 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 DDaattaabbaassee MMaacchhiinnee Products : • IBM DB2 pureScale; • SQL Server 2012 Parallel Data Warehouse; • ORACLE RAC (Real Application Clusters) et Exadata; • SAP (sysbase); • Teradata Aster Data; • MySQL cluster (write-scalable, real-time, ACID-compliant transactional database ); • HP Vertica (software-based analytics platform); • ParAccel (Analytic DataBase and Plateform); • EMC Greenplum (Data Computing Appliance DCA , massively parallel processing database MPP).
  • 92. JAM 92 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 MMaappRReedduuccee
  • 93. JAM 93 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 MMaappRReedduuccee
  • 94. JAM 94 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 MMaappRReedduuccee
  • 95. JAM 95 / MMaappRReedduuccee Map Reduce Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Big Data Result
  • 96. JAM 96 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 MapReduce • Data storage; • Architectures; • Parallel programming; • Integration.
  • 97. JAM 97 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 MapReduce Layers - Architecture With Api, for example using IBM API.
  • 98. JAM 98 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 MapReduce Processing Split: the input data-set into independent chunks
  • 99. JAM 99 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Data Storage • DDistributed FFile SSystem: Google File System (scalable distributed file system for large distributed data-intensive applications ); Hadoop Distributed File System (HDFS) is a distributed, scalable, and portable.
  • 100. JAM 100 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 HDFS layers
  • 101. JAM 101 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 HDFS diagram :
  • 102. JAM 102 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 HDFS diagram Simple architecture :
  • 103. JAM 103 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 HDFS Cluster diagram
  • 104. JAM 104 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Flows on Global File System / Cluster
  • 105. JAM 105 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Master / Slave Diagram:
  • 106. JAM 106 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 MMaappRReedduuccee using • GFS (Global File System) – shared disk file system for Linux computer clusters; – symmetric Cluster Architecture; • HPFS – High Performance File System.
  • 107. JAM 107 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Example :
  • 108. JAM 108 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Meta Data on Serveur / Cluster
  • 109. JAM 109 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Map Reduce GFS / HPFS Architectures : – Master node; • The architecture of GFS follows the master-slave model. – Distribution of chunks (fragment of information); – Replicating chunks.
  • 110. JAM 110 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Map Reduce IT architecture models • Partitioning according to the input files; • Fault tolerance; • Balancing loads.
  • 111. JAM 111 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Map Reduce IT architecture models • key–value model; • Function Map Reduce
  • 112. JAM 112 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Business Model IT Architecture Model
  • 113. JAM 113 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Business IT architecture and process
  • 114. JAM 114 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 IT architecture models Internet Load Balencer Vm (1) Vm (2) ….. Vm (n) Real machine ( 1 …n) Virtual
  • 115. JAM 115 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 diagram - modeling :
  • 116. JAM 116 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. A short break, You can ask your questions. « The direction in which education starts a man will determine his future in life. » Plato
  • 117. JAM 117 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Hadoop & MapReduce
  • 118. JAM 118 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Switch Hadoop slave servers Hadoop master and slave server Test server
  • 119. JAM 119 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Switch Hadoop slave servers Hadoop master and slave server Test server
  • 120. Hadoop HDFS JAM 120 / Hadoop Manage File Transfer Hadoop Gateway Server inspect process loading Loading Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 aggregate automate load
  • 121. JAM 121 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Hadoop BD Architecture and Layers
  • 122. JAM 122 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Cluster
  • 123. JAM 123 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Cluster Flow
  • 124. JAM 124 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 MapReduce Architecture
  • 125. JAM 125 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 MapReduce
  • 126. JAM 126 / MapReduce Architecture Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 methods and algorithms
  • 127. JAM 127 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 MapReduce Architecture
  • 128. JAM 128 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Map Reduce
  • 129. JAM 129 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 PPaarraalllleell pprrooggrraammmmiinngg with MapReduce framework Tools: – Pig (workflow interface – Pig script); – Hive (SQL Interface); – … • Software publisher: – cloudera, datameer, greenplem HD, hurence, Intel, Talend,Vmware.
  • 130. The software layers Apache ZooKeeper is a centralized service for maintaining configuration information, naming, providing distributed synchronization, and providing group services JAM 130 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Apache Ambari is an operational framework for provisioning, managing and monitoring Apache Hadoop clusters.
  • 131. JAM 131 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 MapReduce integration into the information system • Database Machine; • Cloud: – Outsourcing of the Big data processing; • Tools : – Google cloud (Platform, Print, BigQuery …), Amazon web services, Microsoft HDInsight (Microsoft's Hadoop-based service), IBM InfoSphere BigInsights (built on the Apache Hadoop software framework - for analyzing massive volumes) , ORACLE BigData Appliance (engineered system for acquiring, organizing, and loading unstructured data into an Oracle Database), Cloudera Impala (open source massively parallel processing (MPP) SQL query engine for data stored in a computer cluster running Apache ).
  • 132. JAM 132 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Model of Information Systems in the Cloud environment.
  • 133. JAM 133 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 User Applications Integrated Services Single Data Strore Service Information System Internet and mobile and tablet ERP data warehouse
  • 134. JAM 134 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Exemple for data warehouse
  • 135. JAM 135 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Cloud Architecture Layers
  • 136. JAM 136 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Cloud Architecture Platform as a service Infrastructure as a service
  • 137. JAM 137 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Cloud Architecture: Access Security.
  • 138. JAM 138 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Data streaming • SCA Streaming Computing Architecture; • Streaming SQL; • Processing and stream management of the data flow.
  • 139. JAM 139 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 SCA Streaming Computing Architecture • Concepts: – Data stream; – Continuous queries; – Temporary data; • Flow modeling: – Sequences; – Time series; • Parallel streaming.
  • 140. JAM 140 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 SCA Streaming Computing Architecture • Concepts: – Data stream; – Continuous queries; – Temporary data; • Flows modeling: – Sequences; – Time series; • Parallel streaming.
  • 141. JAM 141 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Streaming Graph
  • 142. JAM 142 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Streaming SQL • Filtering; • Merger; • Flows aggregation ; • Join query between flows and the persistent data of the information system.
  • 143. JAM 143 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33
  • 144. JAM 144 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Processing data flows • Query stream; • Set of queries on the stream; • Stream mining; • Automatic classification of flows. • Products: Tibco streambase (push-based real-time analytics solution), ORACLE Streams, IBM Infosphere Streams (stream computing solution to enable real-time analytic processing of data), Microsoft streamInsight (platform for complex event processing), Telegraph CQ, LighStream (solution for real-time messaging over HTTP and WebSockets), Apach Samza (stream processing framework), Storm (has use cases: realtime analytics, online machine learning, continuous computation, distributed RPC, ETL ).
  • 145. JAM 145 / A short break, You can ask your questions. Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. “Better three hours too soon than a minute too late.” William Shakespeare
  • 146. JAM 146 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 NNooSSQQLL DDBBMMSS
  • 147. JAM 147 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 NoSQL DBMS • NoSQL architecture; • NoSQL approaches; • Integrating SQL into NoSQL.
  • 148. JAM 148 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 aarrcchhiitteeccttuurree ssooffttwwaarree llaayyeerrss example
  • 149. JAM 149 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 aarrcchhiitteeccttuurree ssooffttwwaarree llaayyeerrss example
  • 150. JAM 150 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 NoSQL architectures • Scalability; • Architecture 3 Tier in the cloud (cloud computing); • NoSQL : – RDBMS object; DBMS XML.
  • 151. JAM 151 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Architecture in the cloud example
  • 152. JAM 152 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 AArrcchhiitteeccttuurree iinn tthhee cclloouudd example
  • 153. JAM 153 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 • DBMS key value: – Amazon dynamo DB (NoSQL database service that is offered by Amazon.com), Amazon simple DB (distributed database written in Erlang by Amazon.com), Apache Cassandra (massively scalable open source NoSQL database), Voldemort (a distributed key-value storage system, Data is automatically replicated over multiple servers, Processing Data Streams at LinkedIn), LinkedIn Espresso (horizontally scalable, indexed, timeline-consistent, document-oriented, highly available NoSQL data store).
  • 154. JAM 154 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Using, Schema
  • 155. JAM 155 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Example, Diagram
  • 156. JAM 156 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 • Large DBMS table : – Google Big table • a compressed, high performance, and proprietary data storage system built on Google File System; – Hadoop Hbase • open source, non-relational, scalable, distributed database modeled; – Apache Acumulo • high performance data storage and retrieval system with cell-level access control.
  • 157. JAM 157 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33
  • 158. JAM 158 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Hadoop Hbase Layers
  • 159. JAM 159 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Hbase in the Hadoop Ecosystem
  • 160. JAM 160 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 • Document database (managing document-oriented information): – MongoDB; • Agile and Scalable; – Ubuntu Couche DB.
  • 161. JAM 161 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 • Document database : – MongoDB; – Ubuntu Couche DB
  • 162. JAM 162 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 • Graph database : Graph databases apply graph theory to the storage of information about the relationships between entries; Graph database is a database that uses graph structures with nodes, edges, and properties to represent and store data.
  • 163. JAM 163 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 • Graph database : – Google Pregel (designed for the Google cluster architecture), (use for processing graph algorithms); – Neo4j (highly scalable, robust – ACID, native graph database, JAVA Graph Database; – DEX (scalable graph database management system - allows for the integration of multiple data sources).
  • 164. JAM 164 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Graph Model
  • 165. JAM 165 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Graph properties Model
  • 166. JAM 166 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Graph roles Model
  • 167. JAM 167 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 SQL / NoSQL integration Decrease in consistency.
  • 168. JAM 168 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Synthesis scheme
  • 169. JAM 169 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 The management of the calculations distribution.
  • 170. JAM 170 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 EntryProcessor MapEntry  send the code (computer program) to the data rather than sending the data to the code (computer program).
  • 171. JAM 171 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 EntryProcessor MapEntry  send the code (computer program) to the data rather than sending the data to the code (computer program). COHERENCE
  • 172. JAM 172 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 EntryProcessor • Decrease the amount of data passing through the network; • Répartion of the load on different nodes of the cluster; • Distribution of concurrent operations without explicit lock;
  • 173. JAM 173 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. DDAATTAA SSCCIIEENNCCEE -- LL33 Parallel Execution Process • Method; • Distributed execution on a cluster; • Execution parallelized on the cluster; • Each cluster node handles its local data.
  • 174. JAM 174 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. « We are born weak, we need strength; helpless, we need aid; foolish, we need reason. All that we lack at birth, all that we need when we come to man's estate, is the gift of education. » Jean-Jacques Rousseau
  • 175. JAM 175 / Contact http://www.jean-antoine-moreau.fr.nf © Jean-Antoine Moreau This lesson, copying and reproduction prohibited Managing my copyright: ADAGP. End of the part of this lesson published on the internet.