At Mist, we believe the best way to achieve reliability is to embrace uncontrolled unreliability. We run our microservices and stream processing entirely on Amazon EC2 Spot Instances, which introduces uncontrolled chaos. In this session, we start with an overview of our system and discuss how we achieve high reliability on our largest service, Live Aggregators (LA). LA is our in-house, real-time time-series aggregation system, which processes over a billion messages a day while aggregating over 100 million time series. We also share how we autoscale LA by predicting and adapting to changing loads, which results in server utilization of over 70%.