User Details
- User Since
- Nov 5 2018, 2:54 PM (365 w, 1 d)
- Availability
- Available
- IRC Nick
- cdanis
- LDAP User
- CDanis
- MediaWiki User
- CDanis (WMF) [ Global Accounts ]
Fri, Oct 31
Thu, Oct 30
Tue, Oct 28
Sounds good to me @Jdrewniak ! Thanks :)
Fri, Oct 24
For the record, I needed something similar but different for DNS Discovery records: https://gerrit.wikimedia.org/r/c/operations/puppet/+/1198583
Thu, Oct 23
tcp-proxy sounds good to me as a name.
Tue, Oct 21
Mon, Oct 20
Fri, Oct 17
Are there any early estimates of the expected %age increase in something like logged-in daily active users?
Tue, Oct 14
Fri, Oct 10
Wed, Oct 8
BTW, after looking at a few weeks of data, I suggest increasing the failure sampling fraction for these services. 10% or 20% would be absolutely fine, given their usage, and would give much more signal.
Tue, Oct 7
Mon, Oct 6
@Prototyperspective Can you please post the output of the traceroute step in these instructions?
joal please deploy refinery patch at your leisure, then I'll deploy a Turnilo patch and we can call this done 🎉
Oct 3 2025
Dashboards look good to me, let's call this finalized!
@brouberol Let's deploy together Monday, my morning/your afternoon?
Will deploy Puppet patches on Monday.
Sep 30 2025
OK, sounds good to me! I'll merge and deploy my patch now then, and can help you with the other one later.
@brouberol if you want, I'm happy to add this onto my existing patch against the same file: https://gerrit.wikimedia.org/r/c/operations/mediawiki-config/+/1192572
Sep 29 2025
Sep 26 2025
I stumbled across this post which has a few interesting takes, like using mdadm metadata v1.0 to keep the ESP partition bootable but still active in mdadm, and also, forcing a manual resync every boot in case UEFI writes to its active ESP pre-boot (since it doesn't know anything about mdadm, of course). I'm not 100% convinced but it doesn't sound too unreasonable?
Sep 25 2025
Sep 24 2025
Sep 22 2025
approved, and merged
Sep 19 2025
Sep 18 2025
Sep 16 2025
FWIW: I think the new considerations in the task description make it pretty reasonable to take ~1.1% of the total RAM already allocated to Ganeti globally and spend it on VMs that are a known chokepoint in our deployments and the administration and development of the site in general.
Luca, do you want an early test subject for the Sloth trial?
Sep 15 2025
Thanks @taavi, good points as always.
Sep 12 2025
Sep 11 2025
Thank you @JAllemandou !
Sep 10 2025
Oh, one other thing we might want to do, if mesh.tracing.service_name is unset, default it to .Release.Namespace.
Sep 8 2025
FTR: This is data that might potentially be useful for SDS1.3.
Looks like we might not actually need this? https://phabricator.wikimedia.org/T325131#8480824
Sep 4 2025
Sep 3 2025
Sorry, I must have missed it -- did we explicitly reject the idea of using a simple MW cron job to get data into Prometheus?
Aug 29 2025
Hi @Lydia_Pintscher , SRE can make some exception here. It seems warranted given the status quo in the broader sparql ecosystem.
Thank you, that helps a lot.
Aug 27 2025
Open questions / next steps:
- Is configuring the Puppet server with HDFS access acceptable?
I'm not sure if there's an easy way to re-use the logic already in Refine given the current setup of webrequest_sampled_live. If there is, that would be ideal from a DRY & business logic drift perspective. Can someone advise?
As one option for a workaround, running tunnelencabulator --tunnel-everything after applying the above patchset will allow you to bypass this temporarily.
Aug 21 2025
Aug 18 2025
Is that something we could hack up a Prometheus exporter for real quick? What's the method of fetching the sizes?
@JAllemandou now that we've removed client_port, how is the Druid storage usage of webrequest looking?
Aug 14 2025
Aug 13 2025
Aug 11 2025
Aug 7 2025
You got me curious, so I spent a little time digging on this -- as far as I can tell not much has changed in the real world since Daniel Stenberg's blog post.
Aug 6 2025
@JAllemandou uploaded some patches, please take a look :)
Aug 1 2025
Any updates on this work, or any guesses on when there might be time?