It's not in production unless it's monitored
- our infrastructure is agile now
- ephemeral
Continuous deployment
- continuous integration
- one click deploy
- feature flagging
- monitoring
- monitoring in operations is like testing in for developers
- alerting
- #monitoringsucks
Metrics
- business drivers
- application performance
- system resources
- network
Separate your concerns
Collection
- cost of collection should be zero
- logging
- logster
- logstash
- papertrail
- active support notifications
- lograge
- harness
- erik/metriks
Aggregation
- statsd
- rack-statsd
- statsd-instrument
Storage
- rrdtool
- constant storage size
- rollups
- graphite
- rrd
- http
- opentsdb
- hbase
- multidimensional
- saas
Visualizations
- Correlation
- metrics
- annotations
- arbitrary combination
- Dashboards
- shared understanding
- aberration detecting
- firefighting manual
- template dashboards, make them easy to create
Alerting
- tuning alerts
- trigger threshold
- cancel threshold
- re-arm window
- function
- windowing capabilities
- look for integrations
- aberrant behaviors