Aggregations at Scale for ShareChat —Using Kafka Streams and ScyllaDB

•

0 gostou•860 visualizações

ShareChat is a social media app with ~180 MAU and 50M DAU. We capture and aggregate various engagement metrics, viz. likes, views, shares, comments, etc., at a post level to curate better content for our users. In terms of numbers for the engagement metrics, we have writes and reads happening at a scale of 55k-60k ops/sec and 290k-300k ops/sec, respectively. With these engagement metrics directly impacting users, we need a datastore that would offer lower latencies and is highly available, resilient, and scalable. It would be better if we could achieve all of these at an optimal cost. This is to learn how we accomplished the abovementioned criteria by using in-house Kafka streams and ScyllaDB.

Tecnologia

Aggregations at Scale for
ShareChat — Using Kafka
Streams and ScyllaDB
Charan Movva, Technical Lead

■ About ShareChat
■ Why Streaming?
■ Requirements
■ Architecture and Deepdive
■ How is ScyllaDB helping us?
Agenda

ShareChat is India's largest home-grown regional social media platform.
■ We offer easy content consumption and sharing in 15 Indian languages
■ 125 Mn MAU
■ 1.3+ BN per month Shares
■ 31 Minutes per day
ShareChat

We capture a lot of client events around the engagement of a post
■ Multiple posts
■ Multiple levels of engagement
■ 370k-440k ops/sec
■ Showing these counters back to our users
■ Helps in curating the better content
Scale and Criticality of Engagement
Events

Possible different paradigms and issues wrt problem we are trying to solve
■ Request-response
■ Lowest-latency
■ 12500(12.5K) and 12599(12.5k) are same.
■ Batch processing
■ High-latency/high-throughput
■ Stream processing
■ Continuous and non-blocking
Why Stream Processing?

■ Windowed aggregations
■ Support for multiple windows.
■ Triggers
■ Easy onboarding of new counters in future.
■ Easy onboarding of new triggers and aggregation windows.
Requirements?

Under the Hood
■ Leveraging the features that kafka has to offer
■ Streams API
■ Topology
■ Aggregated value to Data Store(ScyllaDB)

Some Code
Some
● KStream<CounterKey, CounterIngest> counterIngestStream =
builder.stream(kafkaProperties.getCounterHitsTopicName(),
Consumed.with(counterKeySerdeSansSchemaRegistry,
counterIngestSerdeSansSchemaRegistry, timestampExtractor,
Topology.AutoOffsetReset.LATEST));
● KGroupedStream<CounterKey, CounterIngest> counterGroupedStream =
counterPriorityStream.groupByKey(Grouped.with(
counterKeySerde,counterIngestSerde));
● TimeWindowedKStream<CounterKey, CounterIngest> counterTimeWindowedStream =
counterGroupedStream.windowedBy(
TimeWindows.of(Duration.ofSeconds(priority.getSecondsThreshold()))
.grace(Duration.ofMillis(0)));

● Materialized<CounterKey, Integer, WindowStore<Bytes, byte[]>>
materialized = Materialized.<CounterKey, Integer, WindowStore<Bytes,
byte[]>>as(COUNTER_STATE_STORE_NAME + priorityLevel)
.withKeySerde(counterKeySerde).withValueSerde(Serdes.Integer())
.withRetention(Duration.ofSeconds(retentionDurationInSeconds));
● KStream<Windowed<CounterKey>, Integer> countStream =
counterTimeWindowedStream.aggregate
(countInitializer, countAggregator, streamName,materialized)
.suppress(Suppressed.untilWindowCloses(
Suppressed.BufferConfig.unbounded())).toStream();

Next Problem?
■ Heavy reads.
■ We need a datastore that could handle the increasing reads
with the best latency numbers possible.

Enter ScyllaDB
■ It is fast
■ Offers sub-millisecond latency
■ Better monitoring
■ Metrics visibility at DC, Cluster, Instance and shards
■ Min 50% lesser database costs
■ Well, it is the best

Battle Testing
■ Recent festival scale of 500K ops/sec.
■ The same setup handles the 5x-10x the current scale.
■ The cluster is stable even when the load crosses 90%.

We could not have been in this state without the contributions of these bright minds
■ Engineering: Shubham Dhal, Sanket Gawande, Prateek Bhargav
■ Dev Ops: Abhiroop Soni
■ Mentors/Leadership: Harshal Vora, Geetish Nayak, Chhaya Sharma
Also, you can learn more about the operational challenges and
the problems we’ve encountered in our blog post
@https://sharechat.com/blogs/engineering/streaming-aggregations-at-scale
The Team

Thank You
Stay in Touch
Charan Movva
charan@sharechat.co
https://twitter.com/iamCharanMovva
https://github.com/charanmovva
https://linkedin.com/in/charanmovva

Mais conteúdo relacionado

Semelhante a Aggregations at Scale for ShareChat —Using Kafka Streams and ScyllaDB

How Netflix Uses Amazon Kinesis Streams to Monitor and Optimize Large-scale N...Amazon Web Services

How Level Infinite Implemented CQRS and Event Sourcing on Top of Apache Pulsa...ScyllaDB

Introducing TiDB [Delivered: 09/27/18 at NYC SQL Meetup]Kevin Xu

WSO2 Analytics Platform: The one stop shop for all your data needsSriskandarajah Suhothayan

Apache Samza 1.0 - What's New, What's NextPrateek Maheshwari

ScaleDB Technical PresentationIvan Zoratti

Spark streaming: Best PracticesPrakash Chockalingam

Event Driven MicroservicesFabrizio Fortino

Implementing Real-Time IoT Stream Processing in Azure Chris Pietschmann (Microsoft MVP)

MongoDB World 2019: Near Real-Time Analytical Data Hub with MongoDBMongoDB

MongoDB Sharding Webinar 2014Dylan Tong

Webinar: Dyn + DataStax - helping companies deliver exceptional end-user expe...DataStax

Predicting Loan Delinquency at One Million Transactions per SecondRevolution Analytics

Amazon RedshiftJeff Patti

TiDB IntroductionMorgan Tocker

Using druid for interactive count distinct queries at scaleItai Yaffe

Storing State Forever: Why It Can Be Good For Your AnalyticsYaroslav Tkachenko

Altitude San Francisco 2018: Logging at the Edge Fastly

Tweaking performance on high-load projectsDmitriy Dumanskiy

Kafka Summit NYC 2017 - Scalable Real-Time Complex Event Processing @ Uberconfluent

Semelhante a Aggregations at Scale for ShareChat —Using Kafka Streams and ScyllaDB (20)

How Netflix Uses Amazon Kinesis Streams to Monitor and Optimize Large-scale N...

How Level Infinite Implemented CQRS and Event Sourcing on Top of Apache Pulsa...

Introducing TiDB [Delivered: 09/27/18 at NYC SQL Meetup]

WSO2 Analytics Platform: The one stop shop for all your data needs

Apache Samza 1.0 - What's New, What's Next

ScaleDB Technical Presentation

Spark streaming: Best Practices

Event Driven Microservices

Implementing Real-Time IoT Stream Processing in Azure

MongoDB World 2019: Near Real-Time Analytical Data Hub with MongoDB

MongoDB Sharding Webinar 2014

Webinar: Dyn + DataStax - helping companies deliver exceptional end-user expe...

Predicting Loan Delinquency at One Million Transactions per Second

Amazon Redshift

TiDB Introduction

Using druid for interactive count distinct queries at scale

Storing State Forever: Why It Can Be Good For Your Analytics

Altitude San Francisco 2018: Logging at the Edge

Tweaking performance on high-load projects

Kafka Summit NYC 2017 - Scalable Real-Time Complex Event Processing @ Uber

Mais de ScyllaDB

Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB

What Developers Need to Unlearn for High Performance NoSQLScyllaDB

Low Latency at Extreme Scale: Proven Practices & PitfallsScyllaDB

Dissecting Real-World Database Performance DilemmasScyllaDB

Beyond Linear Scaling: A New Path for Performance with ScyllaDBScyllaDB

Dissecting Real-World Database Performance DilemmasScyllaDB

Database Performance at Scale Masterclass: Workload Characteristics by Felipe...ScyllaDB

Database Performance at Scale Masterclass: Database Internals by Pavel Emelya...ScyllaDB

Database Performance at Scale Masterclass: Driver Strategies by Piotr SarnaScyllaDB

Replacing Your Cache with ScyllaDBScyllaDB

Powering Real-Time Apps with ScyllaDB_ Low Latency & Linear ScalabilityScyllaDB

7 Reasons Not to Put an External Cache in Front of Your Database.pptxScyllaDB

Getting the most out of ScyllaDBScyllaDB

NoSQL Database Migration Masterclass - Session 2: The Anatomy of a MigrationScyllaDB

NoSQL Database Migration Masterclass - Session 3: Migration LogisticsScyllaDB

NoSQL Data Migration Masterclass - Session 1 Migration Strategies and ChallengesScyllaDB

ScyllaDB Virtual WorkshopScyllaDB

DBaaS in the Real World: Risks, Rewards & TradeoffsScyllaDB

Build Low-Latency Applications in Rust on ScyllaDBScyllaDB

NoSQL Data Modeling 101ScyllaDB

Mais de ScyllaDB (20)

Developer Data Modeling Mistakes: From Postgres to NoSQL

What Developers Need to Unlearn for High Performance NoSQL

Low Latency at Extreme Scale: Proven Practices & Pitfalls

Dissecting Real-World Database Performance Dilemmas

Beyond Linear Scaling: A New Path for Performance with ScyllaDB

Dissecting Real-World Database Performance Dilemmas

Database Performance at Scale Masterclass: Workload Characteristics by Felipe...

Database Performance at Scale Masterclass: Database Internals by Pavel Emelya...

Database Performance at Scale Masterclass: Driver Strategies by Piotr Sarna

Replacing Your Cache with ScyllaDB

Powering Real-Time Apps with ScyllaDB_ Low Latency & Linear Scalability

7 Reasons Not to Put an External Cache in Front of Your Database.pptx

Getting the most out of ScyllaDB

NoSQL Database Migration Masterclass - Session 2: The Anatomy of a Migration

NoSQL Database Migration Masterclass - Session 3: Migration Logistics

NoSQL Data Migration Masterclass - Session 1 Migration Strategies and Challenges

ScyllaDB Virtual Workshop

DBaaS in the Real World: Risks, Rewards & Tradeoffs

Build Low-Latency Applications in Rust on ScyllaDB

NoSQL Data Modeling 101

Último

AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93

CNIC Information System with Pakdata Cf In Pakistandanishmna97

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software

"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays

Platformless Horizons for Digital AdaptabilityWSO2

Why Teams call analytics are critical to your entire businesspanagenda

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software

Understanding the FAA Part 107 License ..Christopher Logan Kennedy

Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays

Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra

Architecting Cloud Native ApplicationsWSO2

TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc

Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays

Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021

Corporate and higher education May webinar.pptxRustici Software

Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney

DBX First Quarter 2024 Investor PresentationDropbox

Aggregations at Scale for ShareChat —Using Kafka Streams and ScyllaDB

1. Aggregations at Scale for ShareChat — Using Kafka Streams and ScyllaDB Charan Movva, Technical Lead

2. ■ About ShareChat ■ Why Streaming? ■ Requirements ■ Architecture and Deepdive ■ How is ScyllaDB helping us? Agenda

3. ShareChat is India's largest home-grown regional social media platform. ■ We offer easy content consumption and sharing in 15 Indian languages ■ 125 Mn MAU ■ 1.3+ BN per month Shares ■ 31 Minutes per day ShareChat

4. We capture a lot of client events around the engagement of a post ■ Multiple posts ■ Multiple levels of engagement ■ 370k-440k ops/sec ■ Showing these counters back to our users ■ Helps in curating the better content Scale and Criticality of Engagement Events

5. Possible different paradigms and issues wrt problem we are trying to solve ■ Request-response ■ Lowest-latency ■ 12500(12.5K) and 12599(12.5k) are same. ■ Batch processing ■ High-latency/high-throughput ■ Stream processing ■ Continuous and non-blocking Why Stream Processing?

6. ■ Windowed aggregations ■ Support for multiple windows. ■ Triggers ■ Easy onboarding of new counters in future. ■ Easy onboarding of new triggers and aggregation windows. Requirements?

7. Architecture

8. Under the Hood ■ Leveraging the features that kafka has to offer ■ Streams API ■ Topology ■ Aggregated value to Data Store(ScyllaDB)

9. Topology

10. Some Code Some ● KStream<CounterKey, CounterIngest> counterIngestStream = builder.stream(kafkaProperties.getCounterHitsTopicName(), Consumed.with(counterKeySerdeSansSchemaRegistry, counterIngestSerdeSansSchemaRegistry, timestampExtractor, Topology.AutoOffsetReset.LATEST)); ● KGroupedStream<CounterKey, CounterIngest> counterGroupedStream = counterPriorityStream.groupByKey(Grouped.with( counterKeySerde,counterIngestSerde)); ● TimeWindowedKStream<CounterKey, CounterIngest> counterTimeWindowedStream = counterGroupedStream.windowedBy( TimeWindows.of(Duration.ofSeconds(priority.getSecondsThreshold())) .grace(Duration.ofMillis(0)));

11. ● Materialized<CounterKey, Integer, WindowStore<Bytes, byte[]>> materialized = Materialized.<CounterKey, Integer, WindowStore<Bytes, byte[]>>as(COUNTER_STATE_STORE_NAME + priorityLevel) .withKeySerde(counterKeySerde).withValueSerde(Serdes.Integer()) .withRetention(Duration.ofSeconds(retentionDurationInSeconds)); ● KStream<Windowed<CounterKey>, Integer> countStream = counterTimeWindowedStream.aggregate (countInitializer, countAggregator, streamName,materialized) .suppress(Suppressed.untilWindowCloses( Suppressed.BufferConfig.unbounded())).toStream();

12. Next Problem? ■ Heavy reads. ■ We need a datastore that could handle the increasing reads with the best latency numbers possible.

13. Enter ScyllaDB ■ It is fast ■ Offers sub-millisecond latency ■ Better monitoring ■ Metrics visibility at DC, Cluster, Instance and shards ■ Min 50% lesser database costs ■ Well, it is the best

14. Sample Metrics

15. Battle Testing ■ Recent festival scale of 500K ops/sec. ■ The same setup handles the 5x-10x the current scale. ■ The cluster is stable even when the load crosses 90%.

16. Extra Load

17. We could not have been in this state without the contributions of these bright minds ■ Engineering: Shubham Dhal, Sanket Gawande, Prateek Bhargav ■ Dev Ops: Abhiroop Soni ■ Mentors/Leadership: Harshal Vora, Geetish Nayak, Chhaya Sharma Also, you can learn more about the operational challenges and the problems we’ve encountered in our blog post @https://sharechat.com/blogs/engineering/streaming-aggregations-at-scale The Team

18. Thank You Stay in Touch Charan Movva charan@sharechat.co https://twitter.com/iamCharanMovva https://github.com/charanmovva https://linkedin.com/in/charanmovva

Aggregations at Scale for ShareChat —Using Kafka Streams and ScyllaDB

Recomendados

Recomendados

Mais conteúdo relacionado

Semelhante a Aggregations at Scale for ShareChat —Using Kafka Streams and ScyllaDB

Semelhante a Aggregations at Scale for ShareChat —Using Kafka Streams and ScyllaDB (20)

Mais de ScyllaDB

Mais de ScyllaDB (20)

Último

Último (20)

Aggregations at Scale for ShareChat —Using Kafka Streams and ScyllaDB