Monitoring
Prometheus metrics, Grafana dashboards, and pipeline run history
Prometheus Metrics
Raw metrics endpoint at /metrics for Prometheus scraping
API Documentation
Interactive Swagger/OpenAPI docs at /docs
Prometheus Configuration
Add this to your prometheus.yml to scrape ResilienceOps metrics:
scrape_configs:
  - job_name: 'resilienceops'
    scrape_interval: 15s
    static_configs:
      - targets: ['backend:8000']
    metrics_path: /metrics
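Before pointing Prometheus at the target, it can help to confirm that the endpoint responds. The following is a minimal sketch using only Python's standard library. The helper names `build_scrape_url` and `fetch_metrics` are hypothetical, plain HTTP is assumed (the Prometheus default when no `scheme` is set), and `backend:8000` is assumed to resolve only from inside the network where the backend service runs.

```python
from urllib.request import urlopen


def build_scrape_url(target: str, metrics_path: str = "/metrics", scheme: str = "http") -> str:
    """Join a static_configs target and metrics_path into the URL to scrape."""
    return f"{scheme}://{target}/{metrics_path.lstrip('/')}"


def fetch_metrics(url: str, timeout: float = 5.0) -> str:
    """Return the raw exposition text served at the metrics endpoint."""
    with urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")
```

Calling `fetch_metrics(build_scrape_url("backend:8000"))` from a host that can reach the backend should return the same text Prometheus scrapes. From a workstation, a different target such as `localhost:8000` may be needed, depending on how the port is published.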
Available Metrics
| Metric | Type | Description |
|---|---|---|
| resilienceops_events_total | counter | Total security events ingested |
| resilienceops_events_by_source | gauge | Events by source (guardduty/cloudtrail) |
| resilienceops_events_by_severity | gauge | Events by severity level |
| resilienceops_anomalies_total | gauge | Total anomalies detected |
| resilienceops_critical_incidents | gauge | Critical incidents (risk ≥ 70) |
| resilienceops_avg_risk_score | gauge | Average risk score |
| resilienceops_jira_tickets_total | counter | Jira tickets created |
| resilienceops_remediations_total | counter | Remediations generated |
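To read these metrics outside Prometheus, the text served at /metrics can be parsed directly. The sketch below is a simplified, standard-library parser for the Prometheus text exposition format, not a complete one (it ignores timestamps and escaped characters in label values). The function name `parse_metrics`, the `source` label name, and all sample values are assumptions made for illustration; the actual labels exposed by ResilienceOps may differ.

```python
import re
from typing import Dict, Tuple

# A sample line is: metric name, optional {labels}, then a numeric value.
SAMPLE_LINE = re.compile(r'^([a-zA-Z_:][a-zA-Z0-9_:]*)(?:\{(.*)\})?\s+(\S+)')
LABEL_PAIR = re.compile(r'([a-zA-Z_][a-zA-Z0-9_]*)="([^"]*)"')

LabelSet = Tuple[Tuple[str, str], ...]

# Invented sample output; the label name "source" is an assumption.
SAMPLE = """\
# HELP resilienceops_events_total Total security events ingested
# TYPE resilienceops_events_total counter
resilienceops_events_total 42
resilienceops_events_by_source{source="guardduty"} 30
resilienceops_events_by_source{source="cloudtrail"} 12
resilienceops_avg_risk_score 55.5
"""


def parse_metrics(text: str) -> Dict[Tuple[str, LabelSet], float]:
    """Map (metric name, sorted label pairs) to the sample value."""
    samples: Dict[Tuple[str, LabelSet], float] = {}
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and # HELP / # TYPE comment lines.
        if not line or line.startswith("#"):
            continue
        match = SAMPLE_LINE.match(line)
        if match is None:
            continue
        name, raw_labels, value = match.groups()
        labels = tuple(sorted(LABEL_PAIR.findall(raw_labels or "")))
        samples[(name, labels)] = float(value)
    return samples


samples = parse_metrics(SAMPLE)
```

A lookup such as `samples[("resilienceops_avg_risk_score", ())]` then returns the gauge value as a float. For anything beyond a quick check, the parser in the official `prometheus_client` package (`prometheus_client.parser.text_string_to_metric_families`) is likely a better fit than this hand-rolled version.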
Pipeline Run History