Metrics¶
Armada exposes metric data, for consumption by Prometheus.
Exporting¶
Metric data can be exported via:
- API: Prometheus exporter in the /metrics endpoint. The Armada chart includes the appropriate Prometheus scrape configurations for this endpoint.
- CLI: –metrics-output=<path> of apply command. The node exporter text file collector can then be used to export the produced text files to Prometheus.
Supported <action>s¶
The below tree of <action>s are measured. Supported prometheus labels are noted. Labels are inherited by sub-actions except as noted.
- apply:
- description: apply a manifest
- labels: manifest
- sub-actions:
- chart_handle:
- description: fully handle a chart (see below sub-actions)
- labels:
- chart
- action (install|upgrade|noop) (not included in sub-actions)
- sub-actions:
- chart_download
- chart_deploy
- chart_test
- chart_delete:
- description: delete a chart (e.g. due to FAILED status)
- labels: chart
Supported <metric>s¶
- failure_total: total failed attempts
- attempt_total: total attempts
- attempt_inprogress: total attempts in progress
- duration_seconds: duration of each attempt
Timeouts¶
The chart_handle and chart_test actions additionally include the following metrics:
- timeout_duration_seconds: configured chart timeout duration in seconds
- timeout_usage_ratio: = duration_seconds / timeout_duration_seconds
These can help identify charts whose timeouts may need to be changed to avoid potential failures or to acheive faster failures.
Chart concurrency¶
The chart_handle action additionally includes the following metric:
- concurrency_count: count of charts being handled concurrently
This can help identify opportunities for greater chart concurrency.