Dmitry Kalashnik
c3716cb1e9
Add alerts and targets for prometheus LTS and relay
...
Closes-Bug: PROD-20724
Change-Id: I1f4839a4900a2d417d85a52ffef6e11e4bb2cac1
2018-06-20 12:19:36 +04:00
Michal Kobus
d257cdbfe2
Fix typo
...
Change-Id: I52958fe30538491b0c8b150b3a206b8009b537c1
2018-06-13 13:03:05 +02:00
Michal Kobus
9a358f7c17
Cosmetic changes for alerts
...
Change-Id: I9e6b2f4a5876e7d5697236166b4a6dc30cf4615a
Closes-bug: PROD-20466
2018-06-12 11:54:39 +02:00
Michal Kobus
355aa0b480
Alerts reworked
...
Change alerts names, severities and descriptions.
Change-Id: Ib06f08a6f336d28592d5f70e97aedfeb12eb603c
Closes-bug: PROD-19698
2018-05-10 16:36:08 +02:00
Bartosz Kupidura
8bdf3ed090
Add support for prometheus 2.0
...
New version changes:
* different alerts format
* rewritten storage (some config flags removed)
Closes-Bug: PROD-16609
Change-Id: I805fa322e4744e98177d6c3e29589ebc6fb917a2
2018-01-03 12:26:10 +01:00
Bartosz Kupidura
6fce6098d7
Add prometheus alerts
...
* PrometheusRushMode
* PrometheusRemoteStorageQueue
* AlertmanagerNotificationFailed
Change-Id: I5a875e7b9861f860bac501da55f0e8b20e799d52
2017-09-27 16:48:33 +02:00
Simon Pasquier
cd90c9f842
Trigger the target down alert after 2 minutes
...
Otherwise the alert fires as soon as Prometheus can't scrape a target.
It is too aggressive in case of transient connectivity issues or
endpoint restart.
Change-Id: Ib3de5b141db7a7f2397bf332844a9c44d38f2d3c
2017-09-12 15:14:21 +02:00
Olivier Bourdon
5b6b583c42
Add Prometheus alerts
...
Change-Id: I4ad10555d728d62c8e6504659d30558f95b410ac
2017-07-26 13:45:51 +02:00
Simon Pasquier
2959ab41f3
Rename Prometheus alerts for consistency
...
Change-Id: I96fb789bf73af22d56fc6c6980626647f87409d4
2017-07-24 15:38:28 +02:00
Bartosz Kupidura
5f644a1921
Typo
...
Change-Id: I7e04706411018b117e8d2ec523667f6048cf932e
2017-04-11 15:34:25 +02:00
Bartosz Kupidura
2b784c85b9
Add support.yml for alerts and recording rules
...
Change-Id: If1927033922c350257999f59ba3031445689e11b
2017-04-11 12:17:08 +02:00