
Grafana (Tag)
Active, Public

Details

Description

This tag is for issues relating to Grafana and/or to one of the following related backend services:

Maintained by: SRE Observability

See also Observability-Metrics

Recent Activity

Today

ayounsi updated the task description for T413156: titan1002 unreachable for 2min.
Fri, Dec 19, 5:35 AM · Grafana, Observability-Metrics
ayounsi created T413156: titan1002 unreachable for 2min.
Fri, Dec 19, 5:34 AM · Grafana, Observability-Metrics

Yesterday

Stashbot added a comment to T383563: mw.track: support for histogram metrics.

Mentioned in SAL (#wikimedia-operations) [2025-12-18T22:11:19Z] <cwhite@deploy2002> Finished deploy [statsv/statsv@0751b0b]: T383563 (duration: 00m 10s)

Thu, Dec 18, 10:11 PM · Growth-Team (FY2025-26 Q2 Sprint 6), User-Michael, Patch-For-Review, patch-welcome, MediaWiki-Engineering, Observability-Metrics, Data-Engineering-Radar, MediaWiki-Platform-Team (Radar), Data-Engineering, MediaWiki-extensions-WikimediaEvents, Grafana, GrowthExperiments
Stashbot added a comment to T383563: mw.track: support for histogram metrics.

Mentioned in SAL (#wikimedia-operations) [2025-12-18T22:11:09Z] <cwhite@deploy2002> Started deploy [statsv/statsv@0751b0b]: T383563

Thu, Dec 18, 10:11 PM · Growth-Team (FY2025-26 Q2 Sprint 6), User-Michael, Patch-For-Review, patch-welcome, MediaWiki-Engineering, Observability-Metrics, Data-Engineering-Radar, MediaWiki-Platform-Team (Radar), Data-Engineering, MediaWiki-extensions-WikimediaEvents, Grafana, GrowthExperiments
gerritbot added a comment to T383563: mw.track: support for histogram metrics.

Change #1217213 merged by jenkins-bot:

[performance/statsv@master] statsv: Add support for histograms

https://gerrit.wikimedia.org/r/1217213

Thu, Dec 18, 10:10 PM · Growth-Team (FY2025-26 Q2 Sprint 6), User-Michael, Patch-For-Review, patch-welcome, MediaWiki-Engineering, Observability-Metrics, Data-Engineering-Radar, MediaWiki-Platform-Team (Radar), Data-Engineering, MediaWiki-extensions-WikimediaEvents, Grafana, GrowthExperiments

Tue, Dec 16

Urbanecm_WMF edited projects for T383563: mw.track: support for histogram metrics, added: Growth-Team (FY2025-26 Q2 Sprint 6); removed Growth-Team (FY2025-26 Q2 Sprint 5).
Tue, Dec 16, 5:39 PM · Growth-Team (FY2025-26 Q2 Sprint 6), User-Michael, Patch-For-Review, patch-welcome, MediaWiki-Engineering, Observability-Metrics, Data-Engineering-Radar, MediaWiki-Platform-Team (Radar), Data-Engineering, MediaWiki-extensions-WikimediaEvents, Grafana, GrowthExperiments

Thu, Dec 11

Michael added a comment to T383563: mw.track: support for histogram metrics.

@Krinkle @colewhite I've added you both as reviewers for the performance/statsv change needed for this. It has tests, but I don't actually know that domain very well, so any guidance would be appreciated if I still got something wrong.

Thu, Dec 11, 5:07 PM · Growth-Team (FY2025-26 Q2 Sprint 6), User-Michael, Patch-For-Review, patch-welcome, MediaWiki-Engineering, Observability-Metrics, Data-Engineering-Radar, MediaWiki-Platform-Team (Radar), Data-Engineering, MediaWiki-extensions-WikimediaEvents, Grafana, GrowthExperiments

Wed, Dec 10

Michael moved T383563: mw.track: support for histogram metrics from Doing to Code Review on the Growth-Team (FY2025-26 Q2 Sprint 5) board.
Wed, Dec 10, 6:50 PM · Growth-Team (FY2025-26 Q2 Sprint 6), User-Michael, Patch-For-Review, patch-welcome, MediaWiki-Engineering, Observability-Metrics, Data-Engineering-Radar, MediaWiki-Platform-Team (Radar), Data-Engineering, MediaWiki-extensions-WikimediaEvents, Grafana, GrowthExperiments
gerritbot added a comment to T383563: mw.track: support for histogram metrics.

Change #1217213 had a related patch set uploaded (by Michael Große; author: Michael Große):

[performance/statsv@master] statsv: Add support for histograms

https://gerrit.wikimedia.org/r/1217213

Wed, Dec 10, 2:53 PM · Growth-Team (FY2025-26 Q2 Sprint 6), User-Michael, Patch-For-Review, patch-welcome, MediaWiki-Engineering, Observability-Metrics, Data-Engineering-Radar, MediaWiki-Platform-Team (Radar), Data-Engineering, MediaWiki-extensions-WikimediaEvents, Grafana, GrowthExperiments

Tue, Dec 9

Michael moved T383563: mw.track: support for histogram metrics from Incoming to Doing on the Growth-Team (FY2025-26 Q2 Sprint 5) board.
Tue, Dec 9, 6:46 PM · Growth-Team (FY2025-26 Q2 Sprint 6), User-Michael, Patch-For-Review, patch-welcome, MediaWiki-Engineering, Observability-Metrics, Data-Engineering-Radar, MediaWiki-Platform-Team (Radar), Data-Engineering, MediaWiki-extensions-WikimediaEvents, Grafana, GrowthExperiments
Michael claimed T383563: mw.track: support for histogram metrics.
Tue, Dec 9, 6:46 PM · Growth-Team (FY2025-26 Q2 Sprint 6), User-Michael, Patch-For-Review, patch-welcome, MediaWiki-Engineering, Observability-Metrics, Data-Engineering-Radar, MediaWiki-Platform-Team (Radar), Data-Engineering, MediaWiki-extensions-WikimediaEvents, Grafana, GrowthExperiments
Michael added a comment to T383563: mw.track: support for histogram metrics.

However, I lack the knowledge about what happens afterward. dogstatsd seems to support metrics (|h), but I'm not sure what it does with that and how it is passed on to the next part of our pipeline. And we also need to somehow define the buckets to use.

The "dogstatsd native" histogram type passes control of bucket definitions to infrastructure configuration. This added friction and shared nature makes the option less appealing than at first glance.

Tue, Dec 9, 4:02 PM · Growth-Team (FY2025-26 Q2 Sprint 6), User-Michael, Patch-For-Review, patch-welcome, MediaWiki-Engineering, Observability-Metrics, Data-Engineering-Radar, MediaWiki-Platform-Team (Radar), Data-Engineering, MediaWiki-extensions-WikimediaEvents, Grafana, GrowthExperiments
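
For readers unfamiliar with the "|h" type mentioned in the comment above, here is a minimal sketch of the DogStatsD wire format for a histogram sample. It only builds the datagram string; it says nothing about how statsv or the rest of the pipeline handles it, or where buckets are defined. The function name, metric name, and tag are hypothetical.

```python
from typing import Dict, Optional


def format_histogram(
    name: str,
    value: float,
    tags: Optional[Dict[str, str]] = None,
    sample_rate: Optional[float] = None,
) -> str:
    """Build one DogStatsD histogram datagram: name:value|h[|@rate][|#k:v,...]."""
    parts = [f"{name}:{value}", "h"]  # "h" marks the sample as a histogram value
    if sample_rate is not None:
        parts.append(f"@{sample_rate}")
    if tags:
        # Tags are comma-separated key:value pairs after a leading "#".
        parts.append("#" + ",".join(f"{k}:{v}" for k, v in sorted(tags.items())))
    return "|".join(parts)


# Hypothetical metric name and tag, for illustration only.
print(format_histogram("example.match_ratio", 0.75, {"wiki": "enwiki"}))
```

Note that the datagram carries only the raw observation. Bucket boundaries are not part of the message, which is consistent with the point above that the native histogram type leaves bucket definitions to infrastructure configuration.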

Fri, Dec 5

colewhite added a comment to T411474: Grafana "MW deploy" "Train deployments" annotations broken on some dashboards.

@colewhite is there a way to search for all of the dashboards that have graphite annotations? I fixed the 3 that @hashar explicitly listed, but I'm not excited enough about this to want to manually check every dashboard.

Fri, Dec 5, 6:21 PM · Scap, Release-Engineering-Team (Radar), Grafana
bd808 updated the task description for T411474: Grafana "MW deploy" "Train deployments" annotations broken on some dashboards.
Fri, Dec 5, 12:58 AM · Scap, Release-Engineering-Team (Radar), Grafana
bd808 added a comment to T411474: Grafana "MW deploy" "Train deployments" annotations broken on some dashboards.

@colewhite is there a way to search for all of the dashboards that have graphite annotations? I fixed the 3 that @hashar explicitly listed, but I'm not excited enough about this to want to manually check every dashboard.

Fri, Dec 5, 12:47 AM · Scap, Release-Engineering-Team (Radar), Grafana
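
One possible way to approach the question above is to scan each dashboard's JSON model for annotation entries that point at a Graphite datasource. The sketch below is not an existing tool: it assumes the usual dashboard model layout (annotations under "annotations" -> "list", each with a "datasource" that is either an object with a "type" or a legacy datasource-name string). The helper name and the graphite_names parameter are hypothetical, and the legacy names in use on a given instance would need to be checked.

```python
from typing import Any, Dict, List, Sequence


def graphite_annotations(
    dashboard: Dict[str, Any],
    graphite_names: Sequence[str] = ("graphite",),
) -> List[str]:
    """Return the names of annotations that use a Graphite datasource."""
    found = []
    for ann in dashboard.get("annotations", {}).get("list", []):
        ds = ann.get("datasource")
        if isinstance(ds, dict):
            # Newer model: {"type": "...", "uid": "..."}
            is_graphite = ds.get("type") == "graphite"
        else:
            # Legacy model: a plain datasource name (assumed list of names).
            is_graphite = isinstance(ds, str) and ds.lower() in graphite_names
        if is_graphite:
            found.append(ann.get("name", "<unnamed>"))
    return found


# Hand-written sample model, not taken from a real dashboard.
sample = {
    "annotations": {
        "list": [
            {"name": "MW deploy", "datasource": {"type": "graphite", "uid": "abc"}},
            {"name": "Built-in", "datasource": {"type": "grafana", "uid": "-- Grafana --"}},
            {"name": "Train deployments", "datasource": "graphite"},
        ]
    }
}
print(graphite_annotations(sample))
```

The dashboard models themselves could be fetched through the Grafana HTTP API (the search endpoint to list dashboards, then the per-UID dashboard endpoint) and passed through this function one by one.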
bd808 updated the task description for T411474: Grafana "MW deploy" "Train deployments" annotations broken on some dashboards.
Fri, Dec 5, 12:45 AM · Scap, Release-Engineering-Team (Radar), Grafana
bd808 updated the task description for T411474: Grafana "MW deploy" "Train deployments" annotations broken on some dashboards.
Fri, Dec 5, 12:41 AM · Scap, Release-Engineering-Team (Radar), Grafana
bd808 added a comment to T411474: Grafana "MW deploy" "Train deployments" annotations broken on some dashboards.

For awareness, there's also a regression affecting annotations in Grafana: https://github.com/grafana/grafana/issues/110265

Fri, Dec 5, 12:36 AM · Scap, Release-Engineering-Team (Radar), Grafana

Thu, Dec 4

colewhite added a comment to T411474: Grafana "MW deploy" "Train deployments" annotations broken on some dashboards.

For awareness, there's also a regression affecting annotations in Grafana: https://github.com/grafana/grafana/issues/110265

Thu, Dec 4, 10:23 PM · Scap, Release-Engineering-Team (Radar), Grafana
colewhite added a comment to T383563: mw.track: support for histogram metrics.

However, I lack the knowledge about what happens afterward. dogstatsd seems to support metrics (|h), but I'm not sure what it does with that and how it is passed on to the next part of our pipeline. And we also need to somehow define the buckets to use.

Thu, Dec 4, 10:00 PM · Growth-Team (FY2025-26 Q2 Sprint 6), User-Michael, Patch-For-Review, patch-welcome, MediaWiki-Engineering, Observability-Metrics, Data-Engineering-Radar, MediaWiki-Platform-Team (Radar), Data-Engineering, MediaWiki-extensions-WikimediaEvents, Grafana, GrowthExperiments
Michael added a comment to T383563: mw.track: support for histogram metrics.

I'm now encountering this issue again for our current OKR work on the Revise Tone Structured Task and tracking how well we can match paragraphs (T407031).

Thu, Dec 4, 7:34 PM · Growth-Team (FY2025-26 Q2 Sprint 6), User-Michael, Patch-For-Review, patch-welcome, MediaWiki-Engineering, Observability-Metrics, Data-Engineering-Radar, MediaWiki-Platform-Team (Radar), Data-Engineering, MediaWiki-extensions-WikimediaEvents, Grafana, GrowthExperiments
gerritbot added a project to T383563: mw.track: support for histogram metrics: Patch-For-Review.
Thu, Dec 4, 7:27 PM · Growth-Team (FY2025-26 Q2 Sprint 6), User-Michael, Patch-For-Review, patch-welcome, MediaWiki-Engineering, Observability-Metrics, Data-Engineering-Radar, MediaWiki-Platform-Team (Radar), Data-Engineering, MediaWiki-extensions-WikimediaEvents, Grafana, GrowthExperiments
gerritbot added a comment to T383563: mw.track: support for histogram metrics.

Change #1215241 had a related patch set uploaded (by Michael Große; author: Michael Große):

[mediawiki/extensions/WikimediaEvents@master] feat(mw.track): add support for histogram metrics

https://gerrit.wikimedia.org/r/1215241

Thu, Dec 4, 7:27 PM · Growth-Team (FY2025-26 Q2 Sprint 6), User-Michael, Patch-For-Review, patch-welcome, MediaWiki-Engineering, Observability-Metrics, Data-Engineering-Radar, MediaWiki-Platform-Team (Radar), Data-Engineering, MediaWiki-extensions-WikimediaEvents, Grafana, GrowthExperiments
bd808 added a comment to T411474: Grafana "MW deploy" "Train deployments" annotations broken on some dashboards.

I took a shot at fixing https://grafana-rw.wikimedia.org/d/35WSHOjVk/application-servers-red-k8s. I think it works. I used {channel="scap"} |~ "Finished scap sync-world|Synchronized" for the MW deploy query. Just "(?i)finished|synchronized" seemed to pick up helmfile deployments, or maybe they were legacy scap3 things?

Thu, Dec 4, 1:11 AM · Scap, Release-Engineering-Team (Radar), Grafana
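
The difference between the two line filters in the comment above can be approximated outside Grafana. The sketch below uses Python's re.search as a stand-in for the unanchored LogQL "|~" regex filter (Loki itself uses Go RE2 syntax, so this is only an approximation). The first two sample lines are hypothetical; the last two reuse the deploy messages quoted earlier in this feed.

```python
import re

# The narrower pattern from the comment, and the broader case-insensitive one.
NARROW = re.compile(r"Finished scap sync-world|Synchronized")
BROAD = re.compile(r"(?i)finished|synchronized")

sample_lines = [
    "Finished scap sync-world: hypothetical message",           # hypothetical
    "Synchronized hypothetical/file.php: hypothetical message",  # hypothetical
    "Finished deploy [statsv/statsv@0751b0b]: T383563 (duration: 00m 10s)",
    "Started deploy [statsv/statsv@0751b0b]: T383563",
]


def matching(pattern: "re.Pattern[str]", lines: list) -> list:
    """Return the lines the pattern matches anywhere (unanchored search)."""
    return [line for line in lines if pattern.search(line)]


for label, pattern in (("narrow", NARROW), ("broad", BROAD)):
    print(label, len(matching(pattern, sample_lines)))
```

With these samples the broad pattern also picks up the "Finished deploy [...]" line, which illustrates how a loose filter can catch deployments other than MediaWiki syncs.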
bd808 updated subscribers of T411474: Grafana "MW deploy" "Train deployments" annotations broken on some dashboards.

@Michael pointed to https://grafana-rw.wikimedia.org/d/vGq7hbnMz/special3a-homepage-and-suggested-edits as a dashboard with some working annotations.

Thu, Dec 4, 12:56 AM · Scap, Release-Engineering-Team (Radar), Grafana

Tue, Dec 2

hashar added a comment to T411474: Grafana "MW deploy" "Train deployments" annotations broken on some dashboards.

Note from @colewhite, the scap logs are available at (requires NDA login) https://grafana.wikimedia.org/goto/XO75YxZDR?orgId=1

Tue, Dec 2, 7:03 PM · Scap, Release-Engineering-Team (Radar), Grafana
hashar edited projects for T411474: Grafana "MW deploy" "Train deployments" annotations broken on some dashboards, added: Scap; removed Deployments.

Scap has:

scap/main.py
def increment_stat(self, stat, all_stat=True, value=1):
    """Increment a stat in deploy.*
Tue, Dec 2, 6:54 PM · Scap, Release-Engineering-Team (Radar), Grafana
colewhite removed projects from T411474: Grafana "MW deploy" "Train deployments" annotations broken on some dashboards: Observability-Metrics, observability.

The replacement for this annotation tool is to use the Public Logs datasource in Grafana which is backed by Loki. Please let us know if the Observability team can be of further assistance.

Tue, Dec 2, 6:49 PM · Scap, Release-Engineering-Team (Radar), Grafana
colewhite renamed T411474: Grafana "MW deploy" "Train deployments" annotations broken on some dashboards from Graphana no more shows "MW deploy" "Train deployments" annotations to Grafana "MW deploy" "Train deployments" annotations broken on some dashboards.
Tue, Dec 2, 4:05 PM · Scap, Release-Engineering-Team (Radar), Grafana
hashar created T411474: Grafana "MW deploy" "Train deployments" annotations broken on some dashboards.
Tue, Dec 2, 10:45 AM · Scap, Release-Engineering-Team (Radar), Grafana

Nov 6 2025

daniel closed T409173: api-gateway: improve ratelimit metrics mappings as Resolved.

Deployed and confirmed

Nov 6 2025, 8:58 AM · Grafana, serviceops

Nov 5 2025

Maintenance_bot removed a project from T409173: api-gateway: improve ratelimit metrics mappings: Patch-For-Review.
Nov 5 2025, 5:32 PM · Grafana, serviceops
gerritbot added a comment to T409173: api-gateway: improve ratelimit metrics mappings.

Change #1202200 merged by jenkins-bot:

[operations/deployment-charts@master] api-gateway: Fix regex for api-gateway metrics

https://gerrit.wikimedia.org/r/1202200

Nov 5 2025, 4:44 PM · Grafana, serviceops
gerritbot added a project to T409173: api-gateway: improve ratelimit metrics mappings: Patch-For-Review.
Nov 5 2025, 4:38 PM · Grafana, serviceops
gerritbot added a comment to T409173: api-gateway: improve ratelimit metrics mappings.

Change #1202200 had a related patch set uploaded (by Clément Goubert; author: Clément Goubert):

[operations/deployment-charts@master] api-gateway: Fix regex for api-gateway metrics

https://gerrit.wikimedia.org/r/1202200

Nov 5 2025, 4:38 PM · Grafana, serviceops
Maintenance_bot removed a project from T409173: api-gateway: improve ratelimit metrics mappings: Patch-For-Review.
Nov 5 2025, 4:31 PM · Grafana, serviceops
gerritbot added a comment to T409173: api-gateway: improve ratelimit metrics mappings.

Change #1201599 merged by jenkins-bot:

[operations/deployment-charts@master] api-gateway: improve metrics mapping

https://gerrit.wikimedia.org/r/1201599

Nov 5 2025, 4:00 PM · Grafana, serviceops

Nov 4 2025

daniel updated the task description for T409173: api-gateway: improve ratelimit metrics mappings.
Nov 4 2025, 1:29 PM · Grafana, serviceops
gerritbot added a project to T409173: api-gateway: improve ratelimit metrics mappings: Patch-For-Review.
Nov 4 2025, 1:09 PM · Grafana, serviceops
gerritbot added a comment to T409173: api-gateway: improve ratelimit metrics mappings.

Change #1201599 had a related patch set uploaded (by Daniel Kinzler; author: Daniel Kinzler):

[operations/deployment-charts@master] api-gateway: improve metrics mapping

https://gerrit.wikimedia.org/r/1201599

Nov 4 2025, 1:09 PM · Grafana, serviceops
daniel added a comment to T409173: api-gateway: improve ratelimit metrics mappings.

This is what thanos shows for the ratelimit_service_rate_limit_wikimedia_route_name_default_rate_client_id prefix...

Nov 4 2025, 1:04 PM · Grafana, serviceops
daniel created T409173: api-gateway: improve ratelimit metrics mappings.
Nov 4 2025, 1:03 PM · Grafana, serviceops

Oct 23 2025

andrea.denisse claimed T401908: Define a policy for Grafana Alerting.

Hi folks, I'm working on updating the Grafana alerts Wikitech section. It's still a WIP but I'd greatly appreciate your feedback:

Oct 23 2025, 3:07 AM · SRE Observability (FY2025/2026-Q1), Grafana

Oct 20 2025

Maintenance_bot removed a project from T406689: Email alerts from Grafana stopped working?: Patch-For-Review.
Oct 20 2025, 4:31 PM · Observability-Alerting, Grafana
gerritbot added a comment to T406689: Email alerts from Grafana stopped working?.

Change #1196533 merged by Andrea Denisse:

[operations/puppet@production] alertmanager: Add Slack route for the rweb team

https://gerrit.wikimedia.org/r/1196533

Oct 20 2025, 4:16 PM · Observability-Alerting, Grafana

Oct 16 2025

Peter added a comment to T406689: Email alerts from Grafana stopped working?.

Great! So from the chat: we have #performance-alerts and then #test-platform-tools-reporting. The test-platform-tools-reporting is the channel where the Test Platform team send their alerts.

Oct 16 2025, 3:54 AM · Observability-Alerting, Grafana

Oct 15 2025

gerritbot added a project to T406689: Email alerts from Grafana stopped working?: Patch-For-Review.
Oct 15 2025, 10:20 PM · Observability-Alerting, Grafana
gerritbot added a comment to T406689: Email alerts from Grafana stopped working?.

Change #1196533 had a related patch set uploaded (by Andrea Denisse; author: Andrea Denisse):

[operations/puppet@production] alertmanager: Add Slack route for the rweb team

https://gerrit.wikimedia.org/r/1196533

Oct 15 2025, 10:20 PM · Observability-Alerting, Grafana
andrea.denisse added a comment to T406689: Email alerts from Grafana stopped working?.

#api-alerts channel

Ah cool, didn't know, I'll try that out, thank you!

Oct 15 2025, 4:10 PM · Observability-Alerting, Grafana
Peter added a comment to T406689: Email alerts from Grafana stopped working?.

#api-alerts channel

Ah cool, didn't know, I'll try that out, thank you!

Oct 15 2025, 3:49 PM · Observability-Alerting, Grafana