bbd428bb7f
Added direct extraction support for prometheus exporters
2019-05-07 11:36:26 -04:00
Mateusz Matuszkowiak
f945e6091b
Removed kubernetes-cadvisor as it's deprecated in k8s v1.11
...
Instead of cadvisor we are advised to use the telegraf kubernetes
input plugin.
Change-Id: Ic9892e2f6a2adc305e550b943a669a606010aa1e
Closes-Bug: PROD-24334
2018-10-25 07:39:32 +00:00
Dmitry Kalashnik
57a124a3d2
Merge "Enable http_config section for webhook"
2018-10-01 13:23:51 +00:00
Ildar Svetlov
707af19587
Add alertmanager kubernetes sd into prometheus conf template
...
Change-Id: I60189136bcf7bc63346714f8c3f911cc6f863e09
Related-PROD: PROD-23277
2018-09-24 09:33:39 +00:00
Dmitry Kalashnik
c8d5390cd1
Enable http_config section for webhook
...
Change-Id: Ib7d94f9a31130de7c8e24fc2f123cb7a89048a9b
PROD-related: PROD-22142
2018-09-06 18:54:08 +04:00
Dmitry Kalashnik
6c2a6057fc
Add environment_label label to all prom jobs
...
Change-Id: I3730e56f572f0b569c3fea637b1b4106268de1af
Prod-related: PROD-22139
2018-09-05 17:26:13 +04:00
Dmitry Kalashnik
407c7776fb
Add condition to remote_storage and pushgateway dashboards
...
Change-Id: If12b9aa69c9d5e40502fe6fafdb296c0bc6a8cfe
Closes-Bug: PROD-21466
2018-07-17 15:24:26 +04:00
Ildar Svetlov
bc97b20276
Merge "Enable prometheus pushgateway persistent storage"
2018-07-06 11:57:34 +00:00
Michal Kobus
2b64974054
Enable prometheus pushgateway persistent storage
...
Change-Id: If539f26909bd56b168a806773bb9c6574c86bf40
Closes-bug: PROD-21073
2018-07-06 13:16:34 +02:00
Mateusz Matuszkowiak
7485855965
Source the exported vars before entrypoint exec
...
Change-Id: If40e943fee2ad928d8af64f61201800cc6f308de
Related-Bug: PROD-21141
2018-07-03 13:03:41 +02:00
Richard Felkl
6affd12cc0
Merge "Fix alertmanager dashboard"
2018-06-27 16:14:04 +00:00
Michal Kobus
01654de62e
Fix alertmanager dashboard
...
Closes-bug: PROD-20098
Change-Id: I6223afef600ec9abe449ae909b6a8a0f8ad40c6a
2018-06-27 14:03:49 +02:00
Dmitry Kalashnik
c61d874083
Fix telegraf meta enabling condition
...
Change-Id: I063bd5e948ba31990ffb8dd971e738731b41d996
Closes-Bug: PROD-20950
2018-06-27 15:58:14 +04:00
Mateusz Matuszkowiak
858fc1fb57
Be able to change duration of the gainsight execution
...
Change-Id: Ib46fd444ef87a6a76775ac37f6b4876aab6fd49c
Related-Bug: PROD-21034
2018-06-27 09:10:12 +02:00
Richard Felkl
a335a1678d
Merge "Dashboard rationalization"
2018-06-26 12:46:48 +00:00
Dmitry Kalashnik
da4fd1e307
Add separate host and instance variables.
...
Change-Id: I9bd4d144b382a7f31ad05c062cb5ba5d04bad067
Closes-Bug: PROD-20985
2018-06-26 15:15:53 +04:00
Michal Kobus
f7d07391fe
Dashboard rationalization
...
Change-Id: Ifb3f4091625b34cfefc484308f6102f7a36937a7
Closes-bug: PROD-20098
2018-06-26 12:23:43 +02:00
Dmitry Kalashnik
161a4f03d9
Merge "Add alerts and targets for prometheus LTS and relay"
2018-06-20 12:07:24 +00:00
Dmitry Kalashnik
2f831f3709
Merge "Fix warning color"
2018-06-20 12:07:11 +00:00
Dmitry Kalashnik
c3716cb1e9
Add alerts and targets for prometheus LTS and relay
...
Closes-Bug: PROD-20724
Change-Id: I1f4839a4900a2d417d85a52ffef6e11e4bb2cac1
2018-06-20 12:19:36 +04:00
Mateusz Matuszkowiak
520c9c6ebf
Added new container for prometheus to gainsight
...
Change-Id: I28c88a10537a2b4aee235501b3854e52a10c7ab8
Related-Bug: PROD-19567
2018-06-18 09:15:23 +00:00
Dmitry Kalashnik
81334b01a5
Fix warning color
...
Change-Id: I64b22607c0e55dd595ab640aa1d6f647974de73e
2018-06-15 12:15:20 +04:00
Michal Kobus
d257cdbfe2
Fix typo
...
Change-Id: I52958fe30538491b0c8b150b3a206b8009b537c1
2018-06-13 13:03:05 +02:00
Michal Kobus
9a358f7c17
Cosmetic changes for alerts
...
Change-Id: I9e6b2f4a5876e7d5697236166b4a6dc30cf4615a
Closes-bug: PROD-20466
2018-06-12 11:54:39 +02:00
Michal Kobus
355aa0b480
Alerts reworked
...
Change alert names, severities and descriptions.
Change-Id: Ib06f08a6f336d28592d5f70e97aedfeb12eb603c
Closes-bug: PROD-19698
2018-05-10 16:36:08 +02:00
Dmitry Kalashnik
38242186dd
Add alerta to prometheus
...
Change-Id: I0349936dc683475e18155cf5f8ca750d06482cfb
2018-04-25 15:39:59 +04:00
Dmitry Kalashnik
2eaddd2469
Merge "Allow to configure relabeling for alerts"
2018-04-24 14:20:46 +00:00
Dmitry Kalashnik
03a5b6d8eb
Allow to configure relabeling for alerts
...
Change-Id: I9fab8dc868c620330c5bdc8678d33a57c8256494
Prod-related: PROD-19595
2018-04-24 14:02:15 +04:00
Ildar Svetlov
53932d47a1
Collect dns targets from grains
...
Change-Id: If0ca7f7cb917537c0fbba9c6a4da479adada111d
2018-04-05 17:49:24 +04:00
Jakub Josef
ccd0b6eec4
Merge "Add relabel_configs section to targets config"
2018-03-27 11:59:59 +00:00
Mateusz Matuszkowiak
63051d0845
Remove handler as it's not needed
...
Change-Id: I11d29619665c053bd6b8f5b133c5143edb80fb5c
Closes-Bug: PROD-18802
2018-03-27 11:45:23 +02:00
Dmitry Kalashnik
93992a0742
Add relabel_configs section to targets config
...
Add host label to exporters grain generator
Change-Id: I0a8cbf27ebcf724fe2bf7f547a30080d94198592
PROD-related: PROD-18671
2018-03-27 13:45:06 +04:00
mcp-jenkins
846332b40d
Merge "Use max instead of min"
2018-03-20 13:31:01 +00:00
Mateusz Matuszkowiak
53234ab0e5
Use max instead of min
...
Since these services are replicated (two per service),
we should check for the max, not min, as one of the replicas
could be down, but still the API is Up.
Change-Id: I6dad9b74d65415f5f4f91e00c732792010487619
2018-03-20 14:12:30 +01:00
Jakub Josef
b13e62b904
Merge "Allow to deploy prometheus as standalone service"
2018-03-12 16:21:54 +00:00
Bartosz Kupidura
5e9fb1d347
Allow to deploy prometheus as standalone service
...
Now it is possible to install prometheus on a host and
use it to scrape data from other prometheus instances.
Prod-related: PROD-17350
Change-Id: I11d65920c0e9b685dead22a5551521fa15d13f25
2018-03-12 17:26:06 +04:00
Ildar Svetlov
3079228995
Add new fields to metric_relabel_config
...
Change-Id: I581216e5b6918c8c613c20fd141f287490509b59
2018-03-06 15:47:04 +04:00
Dmitry Kalashnik
be606a43c4
Allow to set custom template for email receivers
...
Change-Id: I9fb7fbd1946e564e1e245ea226c5bbe7440f25d4
Closes-Bug: PROD-18240
2018-03-05 17:34:51 +04:00
Dmitry Kalashnik
cdad46d5ad
Allow to use multimatching for alertmanager routes
...
Change-Id: I2c0d439055efc0d218c8f247aea264d425fca838
2018-03-01 16:35:41 +04:00
Bartosz Kupidura
8bdf3ed090
Add support for prometheus 2.0
...
New version changes:
* different alerts format
* rewritten storage (some config flags removed)
Closes-Bug: PROD-16609
Change-Id: I805fa322e4744e98177d6c3e29589ebc6fb917a2
2018-01-03 12:26:10 +01:00
Ondrej Smola
94baab3660
Added possibility to define two endpoints in one receiver + test pillars + update Readme
...
This option is useful when all alarms should be sent to two or more receivers simultaneously, without needing to use the continue parameter
Change-Id: I6b8d833776549614d0381378a2e92cb5bc05af77
2017-11-21 10:18:34 +01:00
mcp-jenkins
2f5ed47dc9
Merge "Add prometheus alerts"
2017-09-28 11:26:00 +00:00
Bartosz Kupidura
2f8a10309e
Merge "Add pushgateway and alertmanager dashboards"
2017-09-28 11:24:32 +00:00
Bartosz Kupidura
6fce6098d7
Add prometheus alerts
...
* PrometheusRushMode
* PrometheusRemoteStorageQueue
* AlertmanagerNotificationFailed
Change-Id: I5a875e7b9861f860bac501da55f0e8b20e799d52
2017-09-27 16:48:33 +02:00
Bartosz Kupidura
20a437a58e
Add pushgateway and alertmanager dashboards
...
Change-Id: I83d705833f73bd827f5b752b215d3be53dc278dd
2017-09-27 16:20:06 +02:00
Ildar Svetlov
5932165d11
Align Remote Storage Adapter Grafana dashboard
...
Change-Id: Ic61a852118eb932c837689d1eafbeba313adeb2d
Closes-bug: PROD-14712
2017-09-25 14:11:12 +04:00
Bartosz Kupidura
37925ad206
Get kubernetes status from grains
...
Change-Id: If921742f5f2601bcd974922d272f0d8f4aa2b0b0
2017-09-20 12:17:23 +02:00
Ales Komarek
22f7ab3572
Merge "Trigger the target down alert after 2 minutes"
2017-09-15 09:51:58 +00:00
Konstantin Hontar
7b14deaf31
Add Slack and mail receiver to AlertManager config
...
Change-Id: Ib82ef0a7d029f84a7f92a90384bac02a0870b521
2017-09-14 13:24:04 +02:00
Simon Pasquier
cd90c9f842
Trigger the target down alert after 2 minutes
...
Otherwise the alert fires as soon as Prometheus can't scrape a target.
It is too aggressive in case of transient connectivity issues or
endpoint restart.
Change-Id: Ib3de5b141db7a7f2397bf332844a9c44d38f2d3c
2017-09-12 15:14:21 +02:00