SlideShare a Scribd company logo
1 of 25
Building massive scale,
    fault tolerant,
job processing systems
    with Scala Akka
      framework
     Vignesh Sukumar
        SVCC 2012
About me

• Storage group, Backend Engineering at Box
• Love enterprise software!
• Interested in Big Data and building distributed
  systems in the cloud
About Box

• Leader in enterprise cloud collaboration and
  storage
• Cutting-edge work in backend, frontend,
  platform and engineering services
• A really fun place to work – we have a long
  slide!
Talk outline
• Job processing requirements
• Traditional & new models for job processing

• Akka actors framework
• Achieving and controlling high IO throughput
• Fine-grained fault tolerance
Typical architecture in a cloud storage
             environment
Practical realities

•Storage nodes are usually of varying
configurations (OS, processing power, storage
capacity, etc) mainly because of rapid evolution
in provisioning operations
•Some nodes are more over-worked than the
others (for ex, accepting live uploads)
•Billions of files; petabytes
Job processing requirements

• Iterate over all files (billions, petabyte scale):
  for ex, check consistency of all files

• High throughput

• Fault tolerant

• Secure
Traditional job processing model
Why traditional models fail in cloud
       storage environments
• Not scalable: petabyte scale, billions of files
• Insecure: cannot move files out of storage
  nodes
• No performance control: easy to overwhelm
  any storage node
• No fine grained fault tolerance
Compute on Storage

• Move job computation directly to storage
  nodes
• Utilize abundant CPU on storage nodes
• Metadata store still stays in a highly available
  system like a RDBMS
• Results from operations on a file are
  completely independent
Master – slave architecture
Benefits

• High IO throughput: Direct access; no transfer
  of files over a network
• Secure: files do not leave storage nodes
• Better performance control: compute can
  easily monitor system load and back off
• Better fault tolerance handling: finer grained
  handling of errors
Master node

• Responsible for accepting job submissions and
  splitting them to tasks for slave nodes
• Stateful: keeps durable copy of jobs and tasks
  in Zookeeper
• Horizontally scalable: service can be run on
  multiple nodes
Agent

• Runs directly on the storage nodes on a
  machine-independent JVM container
• Stateless: no task state is maintained
• Monitors system load with back-off
• Reports results directly to master without
  synchronizing with other agents
Implementation with the
  the Scala Akka Actor
       framework
Actors

• Concurrent threads abstraction with no
  shared state
• Exchange messages
• Asynchronous, non-blocking
• Multiple actors can map to a single OS thread
• Parent-children hierarchical relationship
Actors and messages
• Class MyActor extends Actor {
  def receive = {
    case MsgType1 => // do something
  }
}

// instantiation and sending messages
 val actorRef = system.actorOf(Props(new MyActor))
actorRef ! MsgType1
Agent Actor System
Achieving high IO throughput
• Parallel, asynchronous IO through “Futures”
val fileIOResult = Future {
  // issue high latency tasks like file IO
 }
val networkIOResult = Future { // read from network }

Futures.awaitAll(<wait time>, fileIOResult, networkIOResult)
fileIOResult onSuccess { // do something }
networkIOResult onFailure { // retry }
Controlling system throughput

• The problem: agents need to throttle
  themselves as storage nodes serve live traffic

• Adjust number of parallel workers dynamically
  through a monitoring service
Controlling throughput: Examples

•Parallelism parameters can be gotten from a
separate configuration service on a per node
basis
•Some machines can be speeded up and others
slowed down this way
•The configuration can be updated on a cron
schedule to speed up during weekends
Fine grained fault tolerance with
              Supervisors

• Parents of child actors can define specific
  fault-handling strategies for each failure
  scenario in their children
• Components can fail gracefully without
  affecting the entire system
Supervision strategy: Examples


Class TaskActor extends Actor {
  // create child workers
  override val supervisorStrategy = OneForOneStrategy(maxNrOrRetries = 3) {
   case SqlException => Resume // retry the same file
   case FileCorruptionException => Stop // don’t clobber it!
   case IOException => Restart // report and move on
}
Unit testing

• Scalatra test framework: very easy to read!
  TaskActorTest.receive(BadFileMsg) must throw
  FileNotFoundException
• Mocks for network and database calls
val mockHttp = mock[HttpExecutor]
TaskActorTest ! doHttpPost
there was atLeastOne(mockHttp).POST


• Extensive testing of failure injection scenarios
Takeaways
• Keep your architecture simple by modeling
  actor message flow along the same paths as
  parent-child actor hierarchy (i.e., no message
  exchange between peer child actors)
• Design and implement for component failures
• Write unit tests extensively: we did not have
  any fundamental level functionality breakage
• Box Engineering is awesome!

More Related Content

What's hot

Object Oriented Design in Software Engineering SE12
Object Oriented Design in Software Engineering SE12Object Oriented Design in Software Engineering SE12
Object Oriented Design in Software Engineering SE12
koolkampus
 

What's hot (20)

Process synchronization in Operating Systems
Process synchronization in Operating SystemsProcess synchronization in Operating Systems
Process synchronization in Operating Systems
 
Making Structured Streaming Ready for Production
Making Structured Streaming Ready for ProductionMaking Structured Streaming Ready for Production
Making Structured Streaming Ready for Production
 
Link state routing protocol
Link state routing protocolLink state routing protocol
Link state routing protocol
 
Naming in Distributed System
Naming in Distributed SystemNaming in Distributed System
Naming in Distributed System
 
Clock synchronization in distributed system
Clock synchronization in distributed systemClock synchronization in distributed system
Clock synchronization in distributed system
 
Transport layer
Transport layer Transport layer
Transport layer
 
Object Oriented Design in Software Engineering SE12
Object Oriented Design in Software Engineering SE12Object Oriented Design in Software Engineering SE12
Object Oriented Design in Software Engineering SE12
 
Kafka Streams: What it is, and how to use it?
Kafka Streams: What it is, and how to use it?Kafka Streams: What it is, and how to use it?
Kafka Streams: What it is, and how to use it?
 
Apache Kafka from 0.7 to 1.0, History and Lesson Learned
Apache Kafka from 0.7 to 1.0, History and Lesson LearnedApache Kafka from 0.7 to 1.0, History and Lesson Learned
Apache Kafka from 0.7 to 1.0, History and Lesson Learned
 
clock synchronization in Distributed System
clock synchronization in Distributed System clock synchronization in Distributed System
clock synchronization in Distributed System
 
Distributed Transactions: Saga Patterns
Distributed Transactions: Saga PatternsDistributed Transactions: Saga Patterns
Distributed Transactions: Saga Patterns
 
Integrating Public & Private Clouds
Integrating Public & Private CloudsIntegrating Public & Private Clouds
Integrating Public & Private Clouds
 
DeadLock in Operating-Systems
DeadLock in Operating-SystemsDeadLock in Operating-Systems
DeadLock in Operating-Systems
 
chapter 2 architecture
chapter 2 architecturechapter 2 architecture
chapter 2 architecture
 
Message passing in Distributed Computing Systems
Message passing in Distributed Computing SystemsMessage passing in Distributed Computing Systems
Message passing in Distributed Computing Systems
 
The CAP Theorem
The CAP Theorem The CAP Theorem
The CAP Theorem
 
Open mp directives
Open mp directivesOpen mp directives
Open mp directives
 
Software Architecture Patterns
Software Architecture PatternsSoftware Architecture Patterns
Software Architecture Patterns
 
Distributed system architecture
Distributed system architectureDistributed system architecture
Distributed system architecture
 
Object Oriented Analysis and Design
Object Oriented Analysis and DesignObject Oriented Analysis and Design
Object Oriented Analysis and Design
 

Similar to Building large scale, job processing systems with Scala Akka Actor framework

Agile Lab_BigData_Meetup_AKKA
Agile Lab_BigData_Meetup_AKKAAgile Lab_BigData_Meetup_AKKA
Agile Lab_BigData_Meetup_AKKA
Paolo Platter
 
Typesafe stack - Scala, Akka and Play
Typesafe stack - Scala, Akka and PlayTypesafe stack - Scala, Akka and Play
Typesafe stack - Scala, Akka and Play
Luka Zakrajšek
 

Similar to Building large scale, job processing systems with Scala Akka Actor framework (20)

Stream Computing (The Engineer's Perspective)
Stream Computing (The Engineer's Perspective)Stream Computing (The Engineer's Perspective)
Stream Computing (The Engineer's Perspective)
 
Agile Lab_BigData_Meetup_AKKA
Agile Lab_BigData_Meetup_AKKAAgile Lab_BigData_Meetup_AKKA
Agile Lab_BigData_Meetup_AKKA
 
Distributed Model Validation with Epsilon
Distributed Model Validation with EpsilonDistributed Model Validation with Epsilon
Distributed Model Validation with Epsilon
 
Typesafe stack - Scala, Akka and Play
Typesafe stack - Scala, Akka and PlayTypesafe stack - Scala, Akka and Play
Typesafe stack - Scala, Akka and Play
 
Indic threads pune12-typesafe stack software development on the jvm
Indic threads pune12-typesafe stack software development on the jvmIndic threads pune12-typesafe stack software development on the jvm
Indic threads pune12-typesafe stack software development on the jvm
 
Scaling tappsi
Scaling tappsiScaling tappsi
Scaling tappsi
 
Fastest Servlets in the West
Fastest Servlets in the WestFastest Servlets in the West
Fastest Servlets in the West
 
Fault tolerance
Fault toleranceFault tolerance
Fault tolerance
 
Latest (storage IO) patterns for cloud-native applications
Latest (storage IO) patterns for cloud-native applications Latest (storage IO) patterns for cloud-native applications
Latest (storage IO) patterns for cloud-native applications
 
Machine Learning With H2O vs SparkML
Machine Learning With H2O vs SparkMLMachine Learning With H2O vs SparkML
Machine Learning With H2O vs SparkML
 
Alluxio - Scalable Filesystem Metadata Services
Alluxio - Scalable Filesystem Metadata ServicesAlluxio - Scalable Filesystem Metadata Services
Alluxio - Scalable Filesystem Metadata Services
 
Graphene – Microsoft SCOPE on Tez
Graphene – Microsoft SCOPE on Tez Graphene – Microsoft SCOPE on Tez
Graphene – Microsoft SCOPE on Tez
 
Enhanced Reframework Session_16-07-2022.pptx
Enhanced Reframework Session_16-07-2022.pptxEnhanced Reframework Session_16-07-2022.pptx
Enhanced Reframework Session_16-07-2022.pptx
 
MongoDB: How We Did It – Reanimating Identity at AOL
MongoDB: How We Did It – Reanimating Identity at AOLMongoDB: How We Did It – Reanimating Identity at AOL
MongoDB: How We Did It – Reanimating Identity at AOL
 
Reactive programming with examples
Reactive programming with examplesReactive programming with examples
Reactive programming with examples
 
DataOps with Project Amaterasu
DataOps with Project AmaterasuDataOps with Project Amaterasu
DataOps with Project Amaterasu
 
Case Study: Migrating Hyperic from EJB to Spring from JBoss to Apache Tomcat
Case Study: Migrating Hyperic from EJB to Spring from JBoss to Apache TomcatCase Study: Migrating Hyperic from EJB to Spring from JBoss to Apache Tomcat
Case Study: Migrating Hyperic from EJB to Spring from JBoss to Apache Tomcat
 
Road Trip To Component
Road Trip To ComponentRoad Trip To Component
Road Trip To Component
 
OracleStore: A Highly Performant RawStore Implementation for Hive Metastore
OracleStore: A Highly Performant RawStore Implementation for Hive MetastoreOracleStore: A Highly Performant RawStore Implementation for Hive Metastore
OracleStore: A Highly Performant RawStore Implementation for Hive Metastore
 
John adams talk cloudy
John adams   talk cloudyJohn adams   talk cloudy
John adams talk cloudy
 

Recently uploaded

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 

Recently uploaded (20)

Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 

Building large scale, job processing systems with Scala Akka Actor framework

  • 1. Building massive scale, fault tolerant, job processing systems with Scala Akka framework Vignesh Sukumar SVCC 2012
  • 2. About me • Storage group, Backend Engineering at Box • Love enterprise software! • Interested in Big Data and building distributed systems in the cloud
  • 3. About Box • Leader in enterprise cloud collaboration and storage • Cutting-edge work in backend, frontend, platform and engineering services • A really fun place to work – we have a long slide!
  • 4. Talk outline • Job processing requirements • Traditional & new models for job processing • Akka actors framework • Achieving and controlling high IO throughput • Fine-grained fault tolerance
  • 5. Typical architecture in a cloud storage environment
  • 6. Practical realities •Storage nodes are usually of varying configurations (OS, processing power, storage capacity, etc) mainly because of rapid evolution in provisioning operations •Some nodes are more over-worked than the others (for ex, accepting live uploads) •Billions of files; petabytes
  • 7. Job processing requirements • Iterate over all files (billions, petabyte scale): for ex, check consistency of all files • High throughput • Fault tolerant • Secure
  • 9. Why traditional models fail in cloud storage environments • Not scalable: petabyte scale, billions of files • Insecure: cannot move files out of storage nodes • No performance control: easy to overwhelm any storage node • No fine grained fault tolerance
  • 10. Compute on Storage • Move job computation directly to storage nodes • Utilize abundant CPU on storage nodes • Metadata store still stays in a highly available system like a RDBMS • Results from operations on a file are completely independent
  • 11. Master – slave architecture
  • 12. Benefits • High IO throughput: Direct access; no transfer of files over a network • Secure: files do not leave storage nodes • Better performance control: compute can easily monitor system load and back off • Better fault tolerance handling: finer grained handling of errors
  • 13. Master node • Responsible for accepting job submissions and splitting them to tasks for slave nodes • Stateful: keeps durable copy of jobs and tasks in Zookeeper • Horizontally scalable: service can be run on multiple nodes
  • 14. Agent • Runs directly on the storage nodes on a machine-independent JVM container • Stateless: no task state is maintained • Monitors system load with back-off • Reports results directly to master without synchronizing with other agents
  • 15. Implementation with the the Scala Akka Actor framework
  • 16. Actors • Concurrent threads abstraction with no shared state • Exchange messages • Asynchronous, non-blocking • Multiple actors can map to a single OS thread • Parent-children hierarchical relationship
  • 17. Actors and messages • Class MyActor extends Actor { def receive = { case MsgType1 => // do something } } // instantiation and sending messages val actorRef = system.actorOf(Props(new MyActor)) actorRef ! MsgType1
  • 19. Achieving high IO throughput • Parallel, asynchronous IO through “Futures” val fileIOResult = Future { // issue high latency tasks like file IO } val networkIOResult = Future { // read from network } Futures.awaitAll(<wait time>, fileIOResult, networkIOResult) fileIOResult onSuccess { // do something } networkIOResult onFailure { // retry }
  • 20. Controlling system throughput • The problem: agents need to throttle themselves as storage nodes serve live traffic • Adjust number of parallel workers dynamically through a monitoring service
  • 21. Controlling throughput: Examples •Parallelism parameters can be gotten from a separate configuration service on a per node basis •Some machines can be speeded up and others slowed down this way •The configuration can be updated on a cron schedule to speed up during weekends
  • 22. Fine grained fault tolerance with Supervisors • Parents of child actors can define specific fault-handling strategies for each failure scenario in their children • Components can fail gracefully without affecting the entire system
  • 23. Supervision strategy: Examples Class TaskActor extends Actor { // create child workers override val supervisorStrategy = OneForOneStrategy(maxNrOrRetries = 3) { case SqlException => Resume // retry the same file case FileCorruptionException => Stop // don’t clobber it! case IOException => Restart // report and move on }
  • 24. Unit testing • Scalatra test framework: very easy to read! TaskActorTest.receive(BadFileMsg) must throw FileNotFoundException • Mocks for network and database calls val mockHttp = mock[HttpExecutor] TaskActorTest ! doHttpPost there was atLeastOne(mockHttp).POST • Extensive testing of failure injection scenarios
  • 25. Takeaways • Keep your architecture simple by modeling actor message flow along the same paths as parent-child actor hierarchy (i.e., no message exchange between peer child actors) • Design and implement for component failures • Write unit tests extensively: we did not have any fundamental level functionality breakage • Box Engineering is awesome!

Editor's Notes

  1. 1. Example of a job is to check consistency of all the files: this will involve iterating over every file on all storage nodes, reading file and verifying content integrity.
  2. Scalability: non-performant because of the IO bottleneck in getting files to the application cluster Insecure: application clusters can store the files locally. It’s easy to melt a single a storage node by reading or writing a lot to it Cannot perform fine grained fault tolerance