Prometheus Guide for the Monitoring Stack¶
Prometheus is the core metrics and alerting engine in this monitoring lab.
What It Does¶
- Scrapes metrics from configured targets
- Stores time-series data
- Evaluates alert rules
- Supplies metrics to Grafana
Important Files¶
prometheus/prometheus.ymlprometheus/rules/alerts.yml
Main Scrape Targets¶
- Prometheus
- Alertmanager
- Node Exporter
- cAdvisor
- Blackbox Exporter
- Blackbox HTTP probe jobs
Why This Layout Works¶
It combines host metrics, container metrics, and endpoint checks in one place. That gives a more complete view than using only one exporter type.
Example Alerts¶
TargetDownHostHighCPUHostHighMemoryContainerHighCPUContainerHighMemorySyntheticProbeFailed
Practical Improvements¶
- Add severity labels such as
warningandcritical - Add
for:windows to reduce alert noise - Add ownership labels for teams or services
- Add recording rules for repeated expensive queries