Operations grimoire/Prometheus
From Nasqueron Agora
Prometheus is an open-source monitoring tool to collect, store, and query time-series data.
It's currently installed on WindRiver and deployed by rOPS: roles/prometheus.
Prometheus is commonly used in conjunction with Grafana for visualizing the collected metrics, providing a comprehensive observability solution.
Add source to scrape
Sources are defined in rOPS: pillar/observability/prometheus.sls. All service values are resolved through rOPS: pillar/services/table.sls.
To add a source, you can follow D3388 as an example.
Troubleshoot
- Service discovery on Prometheus Web UI (*)
- Targets on Prometheus Web UI (*)
- Grafana dashboard
(*) These links require an SSH tunnel configured with -L 9090:windriver:9090.
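When the Targets page is long, the same information can be read from the Prometheus HTTP API, which exposes the target list at /api/v1/targets. The following is a minimal sketch, not part of rOPS: it assumes the SSH tunnel above is open, so that Prometheus answers on http://localhost:9090, and the names PROMETHEUS_URL, fetch_targets and unhealthy_targets are hypothetical helpers introduced only for illustration.

```python
import json
import urllib.request

# Assumption: local end of the SSH tunnel opened with -L 9090:windriver:9090
PROMETHEUS_URL = "http://localhost:9090"


def unhealthy_targets(payload):
    """Return (job, scrape URL, last error) for each active target not reported as up."""
    targets = payload.get("data", {}).get("activeTargets", [])
    return [
        (
            target.get("labels", {}).get("job", "?"),
            target.get("scrapeUrl", "?"),
            target.get("lastError", ""),
        )
        for target in targets
        if target.get("health") != "up"
    ]


def fetch_targets(base_url=PROMETHEUS_URL, timeout=5):
    """Fetch the target list from the Prometheus HTTP API and decode the JSON body."""
    with urllib.request.urlopen(f"{base_url}/api/v1/targets", timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    # Print one tab-separated line per target that is down or in an unknown state.
    for job, url, error in unhealthy_targets(fetch_targets()):
        print(f"{job}\t{url}\t{error}")
```

An empty output means every active target is reported as up. If a newly added source does not appear at all, the Service discovery page is the better place to look, since this sketch only inspects targets Prometheus already knows about.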