Setting Up a Prometheus Monitoring System

Build a complete monitoring and alerting system with Prometheus and Grafana to monitor servers, applications, and business metrics.

Prometheus is an open-source monitoring and alerting system. Paired with Grafana, it provides a complete monitoring solution.

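Architecture Components

  • Prometheus Server: scrapes and stores metrics
  • Grafana: data visualization
  • Alertmanager: alert management
  • Exporters: expose metrics for Prometheus to collect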

Docker Deployment

# docker-compose.yml
version: "3.8"
services:
  prometheus:
    image: prom/prometheus:latest
    ports:
      - "9090:9090"
    volumes:
      # Mount the scrape configuration from the configuration file section.
      - ./prometheus.yml:/etc/prometheus/prometheus.yml

  grafana:
    image: grafana/grafana:latest
    ports:
      - "3000:3000"
    environment:
      # Default admin password; change it before exposing Grafana.
      - GF_SECURITY_ADMIN_PASSWORD=admin

  node-exporter:
    image: prom/node-exporter:latest
    ports:
      - "9100:9100"
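
With the services above running, Grafana still needs a data source that points at Prometheus. One way to set this up without clicking through the UI is a Grafana provisioning file. The following is a minimal sketch, not part of the original setup: it assumes the file is saved as ./grafana/datasources.yml (a hypothetical path) and mounted into the grafana container under /etc/grafana/provisioning/datasources/.

```yaml
# grafana/datasources.yml (hypothetical file name and location)
apiVersion: 1

datasources:
  - name: Prometheus
    type: prometheus
    # "proxy" means the Grafana backend issues the queries, so the URL
    # is resolved inside the Compose network, not in the browser.
    access: proxy
    # "prometheus" is the Compose service name defined above.
    url: http://prometheus:9090
    isDefault: true
```

The service name is used instead of localhost because each container has its own network namespace.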

Configuration File

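# prometheus.yml
global:
  scrape_interval: 15s  # how often Prometheus scrapes each target

scrape_configs:
  # Prometheus scrapes its own metrics endpoint.
  - job_name: "prometheus"
    static_configs:
      - targets: ["localhost:9090"]

  # Inside the Compose network, node-exporter is reached by its service name;
  # "localhost" would point at the Prometheus container itself.
  - job_name: "node"
    static_configs:
      - targets: ["node-exporter:9100"]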
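
The configuration above only defines scrape targets. For alerts to fire, Prometheus also has to load a rules file and know where Alertmanager lives. The fragment below is a sketch of the two extra top-level sections. It rests on assumptions the original does not state: that the rules file is mounted into the container at /etc/prometheus/alert.rules, and that an Alertmanager container is reachable as alertmanager:9093 (a hypothetical service name; 9093 is Alertmanager's default port).

```yaml
# Additional top-level sections for prometheus.yml (sketch)

# Rule files to load; the path is an assumed mount point.
rule_files:
  - /etc/prometheus/alert.rules

# Where to send firing alerts; the host name is an assumed service name.
alerting:
  alertmanagers:
    - static_configs:
        - targets: ["alertmanager:9093"]
```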

Common Monitoring Metrics

  • CPU usage
  • Memory usage
  • Disk I/O
  • Network traffic
  • Application response time
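
As an illustration, the metrics listed above could be expressed as PromQL recording rules along these lines. This is a sketch under assumptions: the file name and the recorded series names are made up for the example, the node_* metrics are the ones node-exporter exposes on Linux, and http_request_duration_seconds is a hypothetical histogram that the application would have to export itself.

```yaml
# node_rules.yml (hypothetical file name)
groups:
  - name: common_metrics
    rules:
      # CPU usage (%): 100 minus the idle share, averaged across cores.
      - record: instance:node_cpu_usage:percent
        expr: 100 - (avg by (instance) (rate(node_cpu_seconds_total{mode="idle"}[5m])) * 100)

      # Memory usage (%): share of total memory that is not available.
      - record: instance:node_memory_usage:percent
        expr: 100 * (1 - node_memory_MemAvailable_bytes / node_memory_MemTotal_bytes)

      # Disk I/O: bytes read plus bytes written per second, all devices.
      - record: instance:node_disk_io_bytes:rate5m
        expr: sum by (instance) (rate(node_disk_read_bytes_total[5m]) + rate(node_disk_written_bytes_total[5m]))

      # Network traffic: bytes received plus sent per second, excluding loopback.
      - record: instance:node_network_bytes:rate5m
        expr: sum by (instance) (rate(node_network_receive_bytes_total{device!="lo"}[5m]) + rate(node_network_transmit_bytes_total{device!="lo"}[5m]))

      # Application response time: 95th percentile from an assumed histogram.
      - record: job:http_request_duration_seconds:p95
        expr: histogram_quantile(0.95, sum by (job, le) (rate(http_request_duration_seconds_bucket[5m])))
```

The same expressions can be pasted directly into the Prometheus expression browser or a Grafana panel; recording them is only worthwhile if they are queried often.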

Alerting Rules

# alert.rules
groups:
- name: system
  rules:
  - alert: HighCPU
    expr: 100 - (avg by(instance) (rate(node_cpu_seconds_total{mode="idle"}[5m])) * 100) > 80
    for: 5m
    labels:
      severity: warning
    annotations:
      summary: "High CPU usage on {{ $labels.instance }}"
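
The rule above only decides when an alert fires; delivery is handled by Alertmanager. A minimal routing configuration might look like the sketch below. The receiver name and the webhook URL are placeholders invented for this example, and the timing values are illustrative rather than recommended settings.

```yaml
# alertmanager.yml (sketch; receiver name and URL are placeholders)
route:
  receiver: default
  # Alerts sharing these labels are batched into one notification.
  group_by: ["alertname", "instance"]
  group_wait: 30s       # wait before sending the first notification for a group
  group_interval: 5m    # wait before notifying about new alerts in a group
  repeat_interval: 4h   # resend interval while an alert keeps firing

receivers:
  - name: default
    webhook_configs:
      # Placeholder endpoint; replace with a real webhook receiver.
      - url: "http://example.internal:5001/alerts"
```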
