19. Elastic Stack - Best Practices
- Do not lose log data
- Monitor Logstash and Elasticsearch
- Data retention
- Test and verify configurations
- Log request/response models
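
The data retention point can be made concrete with a small sketch. It assumes daily indices named `logstash-YYYY.MM.dd` (a common Logstash convention, not something the slides specify) and a hypothetical helper `expired_indices` that only selects which indices fall outside the retention window. The actual deletion would be left to whatever tooling the cluster already uses.

```python
from datetime import date, datetime, timedelta


def expired_indices(index_names, today, retention_days, prefix="logstash-"):
    """Return the daily indices whose date is older than the retention window.

    Assumes index names of the form '<prefix>YYYY.MM.dd'; anything else is ignored.
    """
    cutoff = today - timedelta(days=retention_days)
    expired = []
    for name in index_names:
        if not name.startswith(prefix):
            continue  # not one of the daily log indices
        try:
            day = datetime.strptime(name[len(prefix):], "%Y.%m.%d").date()
        except ValueError:
            continue  # suffix is not a parseable date, leave the index alone
        if day < cutoff:
            expired.append(name)
    return sorted(expired)


# Illustrative index names only.
indices = [
    "logstash-2024.01.01",
    "logstash-2024.01.20",
    "logstash-2024.02.01",
    ".kibana",
]
print(expired_indices(indices, today=date(2024, 2, 10), retention_days=30))
```

Keeping the selection logic as a pure function makes it easy to test before it is wired to anything that deletes data, which fits the "test and verify configurations" bullet.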
24. Prometheus - Best Practices
- Use recording rules to precompute derived metrics
- Write unit tests for alert rules
- Alertmanager deployment
  - Run at least two instances of Alertmanager
  - Do not put a load balancer in front of the instances
  - Send alerts to all Alertmanager instances
- Unsee by Cloudflare (alert dashboard for Alertmanager)
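
The "send alerts to all instances, no load balancer" advice can be sketched as a simple fan-out. Prometheus does this itself when every Alertmanager instance is listed in its configuration; the code below only illustrates the same idea for a custom client. The function names `post_json` and `send_to_all` and the host names are hypothetical. The `/api/v2/alerts` path is the Alertmanager v2 API endpoint for posting alerts.

```python
import json
import urllib.request


def post_json(url, payload, timeout=5):
    """POST a JSON payload and return the HTTP status code."""
    body = json.dumps(payload).encode("utf-8")
    req = urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return resp.status


def send_to_all(alertmanagers, alerts, post=post_json):
    """Send the same alerts to every instance and return {base_url: succeeded}.

    Every instance is contacted directly, so a single unreachable instance
    (or a load balancer choosing only one) cannot cause an alert to be lost.
    """
    results = {}
    for base_url in alertmanagers:
        url = base_url.rstrip("/") + "/api/v2/alerts"
        try:
            status = post(url, alerts)
            results[base_url] = 200 <= status < 300
        except OSError:
            # urllib.error.URLError subclasses OSError; keep going so the
            # remaining instances still receive the alert.
            results[base_url] = False
    return results


if __name__ == "__main__":
    # Hypothetical host names; 9093 is Alertmanager's default port.
    instances = ["http://alertmanager-1:9093", "http://alertmanager-2:9093"]
    alert = {"labels": {"alertname": "DiskFull", "severity": "critical"}}
    print(send_to_all(instances, [alert]))
```

The instances deduplicate notifications among themselves when clustered, which is why sending the same alert to each of them is safe.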
29. Jaeger
- OpenTracing compatible data model and instrumentation libraries
- Features
  - Distributed transaction monitoring
  - Root cause analysis
  - Service dependency analysis
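
As a rough illustration of service dependency analysis, the sketch below derives caller-to-callee edges from a list of spans. The span dictionaries use a simplified, hypothetical shape (`trace_id`, `span_id`, `parent_id`, `service`). This is not Jaeger's storage schema or its actual algorithm, only the underlying idea that a child span in a different service than its parent implies a dependency.

```python
from collections import Counter


def service_dependencies(spans):
    """Count caller -> callee edges between distinct services.

    Span IDs are only unique within a trace, so spans are indexed by
    (trace_id, span_id).
    """
    by_id = {(s["trace_id"], s["span_id"]): s for s in spans}
    edges = Counter()
    for span in spans:
        parent = by_id.get((span["trace_id"], span.get("parent_id")))
        if parent is None:
            continue  # root span, or the parent span was not collected
        if parent["service"] != span["service"]:
            edges[(parent["service"], span["service"])] += 1
    return dict(edges)


# Illustrative spans from two traces.
spans = [
    {"trace_id": "t1", "span_id": "a", "parent_id": None, "service": "frontend"},
    {"trace_id": "t1", "span_id": "b", "parent_id": "a", "service": "orders"},
    {"trace_id": "t1", "span_id": "c", "parent_id": "b", "service": "orders"},
    {"trace_id": "t1", "span_id": "d", "parent_id": "b", "service": "db"},
    {"trace_id": "t2", "span_id": "a", "parent_id": None, "service": "frontend"},
    {"trace_id": "t2", "span_id": "b", "parent_id": "a", "service": "orders"},
]
print(service_dependencies(spans))
```

Spans that stay within one service (such as `c` above) add no edge, so the result reflects calls between services only.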