Slides from Lenses session at Redis Conf 19
The Rise of DataOps on Streaming data, Lenses as a DataOps platform with SQL on Redis and Kafka.
Gain visibility and unlock your data scientists.
1. The Rise of DataOps - SQL on Redis
Andrew Stevenson
Lenses, CTO
2. Speaker – Andrew Stevenson
CTO at lenses.io
C++, Data Warehousing, Big/Fast Data
Always realtime
Clearing & Settlement
HFT
Investment Banking
Energy
Netherlands
lenses.io
DevOps
We all know what DevOps is about, at least I hope you do. It's about developer and operations practices coming together to ship products faster, creating a conveyor belt that delivers software by combining both disciplines: we get CI/CD, monitoring, logging, metrics and better testing, and so improved software quality.
But the important point here is that it's tech focused: developers and operations.
DataOps sits at a higher level. We heard from Thomas from Google this morning that the higher the abstraction, the more value you add.
Every company I know is trying to be data driven. We have data scientists, data engineers, business analysts and data warehouses; the protagonist is data.
At Lenses we see three pillars forming DataOps: streaming flows (think Redis Gears here, though we focus on real-time data); data governance, meaning auditing and security, and there have been some dubious data ethics at companies recently; and data visibility.
OK, so what does a DataOps platform look like?
Well, we have data sources, usually lots of them, some streaming, some not, anything from flat files to stock exchange feeds.
We have data storage, typically more than one kind: S3 for cold storage, an RDBMS, a KV store. It's varied, because you have different access patterns and different needs.
You also have some form of data transport, ideally a distributed log that supports high throughput and low latency with ordering guarantees, something like Redis Streams, Kafka or Pulsar.
You also need processing, to transform and manipulate the data, and ideally somewhere to run it, like Kubernetes.
You need monitoring.
And you need visibility – we just talked about how important that is to enabling data-driven organisations.
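As a sketch of those transport semantics (a toy model for illustration, not any particular product's API), a distributed log is just an ordered, append-only sequence that consumers read from an offset:

```python
class AppendOnlyLog:
    """Toy model of a distributed log partition: records are appended
    in order and each one gets a monotonically increasing offset."""

    def __init__(self):
        self._records = []

    def append(self, record):
        # Returns the offset assigned to the record, like a produce ack.
        self._records.append(record)
        return len(self._records) - 1

    def read(self, from_offset=0):
        # Consumers track their own position and can replay from any offset.
        return self._records[from_offset:]


log = AppendOnlyLog()
for payload in ["trade-1", "trade-2", "trade-3"]:
    log.append(payload)

print(log.read(from_offset=1))  # ['trade-2', 'trade-3']
```

The ordering guarantee falls out of the structure itself: offsets are assigned on append, so every consumer sees the same sequence.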
This is where it becomes interesting: say you want to move from Kafka to Redis Streams; ideally you'd prefer not to rewrite your application landscape.
So our weapon of choice to power Lenses is SQL.
SQL is everywhere in data: SQL on this, SQL on that. And why is that?
Nearly everybody knows some SQL, and let's not forget that big or fast data was around before the recent fad. Many big data teams come from a data warehousing background; that was my personal journey.
It has its flaws. For example, syntax varies from vendor to vendor, and not everything can be done via SQL; you wouldn't write a machine learning algorithm in SQL.
I've used SQL successfully for all sorts of things, from simple ETL loads to real-time trade reconciliation, value-at-risk reporting and trade analysis, and at scale.
Onto the Lenses SQL Engine.
It has three main components. You see four, but the Connect query isn't actually part of Lenses yet.
The table query, which is like querying a database; the continuous query, which is like tailing a file; and the SQL processors, which are like the T in ETL.
So the two query APIs we have implemented for Redis are the table query and the continuous query. This slide shows an example of each.
We have a stream of events being appended at the bottom, 1 to 12, with new events arriving all the time. The messages have a schema which contains a currency field, and we want all records with currency GBP.
If we query using the Table API we get all the messages, from the start until the time of the query, that match the predicate where currency equals GBP; this returns events 1, 3 and 5.
With the Continuous Query API we get the new events arriving after the query start time that match the predicate: events 7, 8, 9, 11, 12 and so on as new events arrive.
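The two query modes can be sketched in a few lines of plain Python (a model of the semantics only, using the event numbers from the slide; this is not the Lenses implementation):

```python
# Events 1-12 from the slide: the GBP records are 1, 3, 5, 7, 8, 9, 11, 12.
history = [
    {"offset": 1, "currency": "GBP"}, {"offset": 2, "currency": "EUR"},
    {"offset": 3, "currency": "GBP"}, {"offset": 4, "currency": "USD"},
    {"offset": 5, "currency": "GBP"}, {"offset": 6, "currency": "EUR"},
]
arrivals = [
    {"offset": 7, "currency": "GBP"}, {"offset": 8, "currency": "GBP"},
    {"offset": 9, "currency": "GBP"}, {"offset": 10, "currency": "EUR"},
    {"offset": 11, "currency": "GBP"}, {"offset": 12, "currency": "GBP"},
]

def table_query(log, predicate):
    """Table query: scan everything from the start up to query time."""
    return [e["offset"] for e in log if predicate(e)]

def continuous_query(stream, predicate):
    """Continuous query: yield matching events as they arrive after start."""
    for e in stream:
        if predicate(e):
            yield e["offset"]

is_gbp = lambda e: e["currency"] == "GBP"
print(table_query(history, is_gbp))                    # [1, 3, 5]
print(list(continuous_query(iter(arrivals), is_gbp)))  # [7, 8, 9, 11, 12]
```

Same predicate, two very different shapes: the table query is bounded and returns, while the continuous query is a generator that keeps producing for as long as events arrive.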
So how does it work? At a high level, each request against the websocket endpoint spins up an Akka Streams flow from the SQL received, which then streams the data back to the client.
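A rough model of that request lifecycle, with illustrative names rather than the actual Lenses internals: each request carries SQL text, a flow is built from it, and matching records are streamed back one at a time.

```python
import asyncio

EVENTS = [
    {"id": 1, "currency": "GBP"}, {"id": 2, "currency": "EUR"},
    {"id": 3, "currency": "GBP"}, {"id": 4, "currency": "USD"},
]

def build_flow(sql: str):
    """Stand-in for the SQL-to-flow step: here we only 'parse' a
    hard-coded WHERE currency = '...' filter out of the query text."""
    wanted = sql.split("=")[-1].strip().strip("'")

    async def flow():
        for event in EVENTS:
            if event["currency"] == wanted:
                yield event             # stream each match back to the client
                await asyncio.sleep(0)  # hand control back, as a real flow would
    return flow()

async def handle_request(sql: str):
    # One flow per websocket request; results stream until exhausted.
    return [e["id"] async for e in build_flow(sql)]

print(asyncio.run(handle_request("SELECT * FROM trades WHERE currency = 'GBP'")))
# [1, 3]
```

The point of the per-request flow is isolation: each client gets its own pipeline with its own position in the stream, rather than sharing a cursor.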
I don't say that SQL is config lightly.
SQL can be version controlled, and if we are using a container orchestrator like Kubernetes, we have just one Docker image to manage: we inject the SQL and deploy it in our CI/CD.
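A hypothetical sketch of what "SQL as config" can look like on Kubernetes (all names, labels and the image are illustrative, not the actual Lenses deployment): the SQL lives in a version-controlled ConfigMap and is injected into a single, generic processor image.

```yaml
# Hypothetical example: names and the image are placeholders.
apiVersion: v1
kind: ConfigMap
metadata:
  name: gbp-trades-sql
data:
  processor.sql: |
    SELECT *
    FROM trades
    WHERE currency = 'GBP'
---
apiVersion: apps/v1
kind: Deployment
metadata:
  name: gbp-trades
spec:
  replicas: 1
  selector:
    matchLabels: { app: gbp-trades }
  template:
    metadata:
      labels: { app: gbp-trades }
    spec:
      containers:
        - name: sql-processor
          image: example/sql-processor:latest   # one image for every processor
          env:
            - name: SQL                         # only the SQL changes per flow
              valueFrom:
                configMapKeyRef:
                  name: gbp-trades-sql
                  key: processor.sql
```

Changing the flow then becomes a ConfigMap change in Git, reviewed and rolled out through the same CI/CD pipeline as any other code.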