User Details
- User Since
- Oct 3 2014, 5:57 AM (578 w, 5 d)
- Availability
- Available
- LDAP User
- Giuseppe Lavagetto
- MediaWiki User
- GLavagetto (WMF) [ Global Accounts ]
Today
Triaging to UBN! as this is blocking activity for at least two hypotheses.
Mon, Nov 3
Thu, Oct 30
Wed, Oct 29
Anyone wanting to know what that tomcat is running can just use github.
Fri, Oct 24
Thu, Oct 23
Wed, Oct 22
Tue, Oct 21
The generated VCL code for wmfuniq-based rate-limiting looks like this:
For instance, I see some of the IP ranges from GCP are part of a global on-wiki block, so I don't think you'll be able to create edits from that range.
Hi, I'm not sure this task is tagged with the right tags.
Fri, Oct 17
For python3-conftool and dervied packages, you can simply patch the .gitlab-ci.yml file to generate debs also for trixie, and then get them from apt-staging; assuming apt-staging is setup for trixie.
Thu, Oct 16
I think you have misunderstood what this task was about: it's about specifically removing that ACL from varnish, not about removing the concept, which is central to how we're doing traffic filtering.
Mon, Oct 13
Tue, Oct 7
Marking as resolved as this task is obsolete at this point IMO.
Sep 23 2025
Sep 22 2025
Sep 18 2025
Coming to @SLyngshede-WMF's concern, I think some of them are valid, like having disjoint configuration going besides the actual content of a file, including the puppet breakage.
Sep 17 2025
I will tentatively close this task for now.
Sep 16 2025
Aug 22 2025
Aug 21 2025
Aug 19 2025
Sorry, for reasons I don't understand my first request for that page got a x-cache-status: pass, I see now it's cacheable at the edge.
A couple of things we might want to consider:
* Is there any reason not to cache for some time the sitemap api URLs at the edge? Even a few hours would be beneficial I think if we ever publicize this more.
- Is there a reason to include User_talk: pages and similar in the sitemap? these pages are rarely cached and relatively expensive to render in some cases, do we care about them being indexed in search engines?
Aug 18 2025
To clarify:
- If your error is coming from our filtering at the edge, you will get an error page containing "Error" in the <h1> tags
- Service Unavailable only happens when the negative response is coming from the backend.
I should also add - if this has happened in the last week, that might be connected to T400119 - if users have any browser extension that modifies their user-agent, for example removing it, that would cause failures now.
This task seems to coalesce different issues; having said that, we've had quite a few abusers lately that used old versions of Chrome as their user agent, so we had to block traffic at times. Most of the rules should be disabled soon, and we are trying to improve our detection of actual browsers vs forged ones.
Aug 15 2025
Aug 14 2025
Aug 13 2025
Aug 6 2025
We have 5M successful responses in the same period of time.
Sorry I hadn't realized the number of requests was this large, it reframes the problem a bit:
You missed changing the use of the abuse ACLs and removing the loading of netmaps for X-Public-Cloud and similar stuff as well.
Aug 5 2025
Aug 4 2025
I actually resolved the task by mistake: there's still some work to do on the varnish/haproxy side.
I fear this is a well known we've already encountered. You can see here https://grafana.wikimedia.org/d/zsdYRV7Vk/istio-sidecar?orgId=1&from=now-7d&to=now&timezone=utc&var-cluster=aWotKxQMz&var-namespace=revscoring-editquality-damaging&var-backend=$__all&var-response_code=503&var-quantile=0.5&var-quantile=0.95&var-quantile=0.99&viewPanel=panel-16 that most of the 503 have failure code UC which is envoy/istio for Upstream connection failed. In our experience, this can happen when a limit is not configured for the life of an upstream http connection.
I've taken a look from the side of the api.log mediawiki generates, and I was a bit surprised to find 16 identical calls over the span of an hour for the same revid that I found failing in logstash.
I thought about this a bit, and I think we need to distinguish between the grading system (which should express trust levels) and request feature (like bearing a valid session cookie, being identified as a bot...).
Aug 1 2025
To give a bit of context, over the last day we saw:
- 62 million valid requests with no user-agent
- 24.5 million valid requests with user agent okhttp/*
- 17 million valid requests with user-agent python-{requests,urllib}*
Please note, this solution is temporary: bots working from clouds will break repeatedly if they're not properly identified with us.