Installation
Clone the project
git clone https://github.com/coreos/kube-prometheus.git
cd kube-prometheus
Run this first and wait until all pods are Running:
kubectl create -f manifests/setup
Then run this and again wait until all pods are Running:
kubectl create -f manifests/
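Rather than watching the pod list by hand, the wait between the two steps can be scripted. This is a minimal sketch, assuming the stack lands in the monitoring namespace (the namespace used by the manifests later in this post) and that a five-minute default timeout is long enough. The function name wait_for_monitoring_pods is made up for illustration.

```shell
#!/usr/bin/env bash
# Hypothetical helper: block until every pod in the monitoring namespace is Ready.
# The namespace and the default timeout are assumptions; adjust them to your cluster.
wait_for_monitoring_pods() {
  local timeout="${1:-300s}"
  kubectl wait --for=condition=Ready pods --all \
    --namespace monitoring --timeout="$timeout"
}

# Usage, from inside the cloned kube-prometheus directory:
#   kubectl create -f manifests/setup
#   wait_for_monitoring_pods
#   kubectl create -f manifests/
#   wait_for_monitoring_pods 600s
```

Note that kubectl wait reports an error if no pods exist yet, so it may need a short pause or a retry right after the create command.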
Accessing the web UIs
Expose the following services through Traefik:
service: prometheus-k8s 9090
service: grafana 3000
service: alertmanager-main 9093
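Before wiring up Traefik, it can help to confirm that each service answers on the port listed above. The sketch below uses kubectl port-forward for that, which is a different mechanism from the Traefik exposure this post relies on. It assumes the services live in the monitoring namespace, and forward_ui is a hypothetical helper name.

```shell
#!/usr/bin/env bash
# Hypothetical helper: forward a local port to one of the monitoring services.
# Assumes the monitoring namespace; the local port mirrors the service port.
forward_ui() {
  local service="$1" port="$2"
  kubectl --namespace monitoring port-forward "svc/${service}" "${port}:${port}"
}

# Usage (each call blocks until interrupted with Ctrl-C):
#   forward_ui prometheus-k8s 9090      # then open http://localhost:9090
#   forward_ui grafana 3000             # then open http://localhost:3000
#   forward_ui alertmanager-main 9093   # then open http://localhost:9093
```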
Data persistence
Before running kubectl create -f manifests/, edit /root/yaml/prometheus-operator/kube-prometheus/manifests/prometheus-prometheus.yaml and add a storage section under spec:
apiVersion: monitoring.coreos.com/v1
kind: Prometheus
metadata:
  labels:
    prometheus: k8s
  name: k8s
  namespace: monitoring
spec:
  alerting:
    alertmanagers:
    - name: alertmanager-main
      namespace: monitoring
      port: web
  storage:
    volumeClaimTemplate:
      spec:
        storageClassName: nfs
        resources:
          requests:
            storage: 10Gi
  baseImage: quay.io/prometheus/prometheus
  nodeSelector:
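The volumeClaimTemplate only works if a StorageClass named nfs actually exists in the cluster, and the resulting claim should show as Bound once Prometheus starts. Here is a hedged sketch for checking both, assuming the monitoring namespace. The helper name check_prometheus_storage is hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical helper: confirm the storage class exists, then list the claims.
# "nfs" is the class name used in this post; yours may differ.
check_prometheus_storage() {
  kubectl get storageclass nfs || return 1
  kubectl --namespace monitoring get pvc
}

# Usage:
#   check_prometheus_storage
```

Running it once before applying the manifests (to catch a missing storage class) and once afterwards (to see the claim status) covers both cases.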
Configuration
Configuring alert notifications
Save the following as alertmanager.yaml:
global:
  resolve_timeout: 5m
  smtp_smarthost: 'smtp.163.com:25'
  smtp_from: 'Rokin_NetGain@163.com'
  smtp_auth_username: 'Rokin_NetGain@163.com'
  smtp_auth_password: 'q59163303'
  smtp_hello: '163.com'
  smtp_require_tls: false
route:
  group_by: ['job', 'severity']
  group_wait: 30s
  group_interval: 5m
  repeat_interval: 12h
  receiver: default
  routes:
  - receiver: webhook
    match:
      alertname: CoreDNSDown
receivers:
- name: 'default'
  email_configs:
  - to: '12345678@qq.com'
    send_resolved: true
- name: 'webhook'
  webhook_configs:
  - url: 'http://dingtalk-hook.kube-ops:5000'
    send_resolved: true

Then recreate the secret from that file:
kubectl delete secrets -n monitoring alertmanager-main
kubectl create secret generic alertmanager-main --from-file=alertmanager.yaml -n monitoring
This replaces the alertmanager-main secret. Once it has been replaced, the configuration shown in the web UI changes as well.
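To confirm from the command line that the secret now holds the new configuration, its contents can be decoded and printed. This is a sketch that assumes the file was stored under the key alertmanager.yaml, which is what the --from-file flag above produces. The helper name show_alertmanager_config is hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print the Alertmanager config stored in the secret.
# Assumes the monitoring namespace and the key "alertmanager.yaml".
show_alertmanager_config() {
  kubectl --namespace monitoring get secret alertmanager-main \
    --output jsonpath='{.data.alertmanager\.yaml}' | base64 --decode
}

# Usage:
#   show_alertmanager_config
```

The output includes the SMTP password in plain text, so avoid pasting it into shared logs.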