User Details
- User Since: Nov 13 2017, 5:57 PM (422 w, 2 d)
- Roles: Disabled
- LDAP User: Imarlier
- MediaWiki User: IMarlier (WMF)
Jan 10 2019
It is a Google group -- I've invited @kchapman and made her a manager of the group.
Dec 14 2018
On a very random note, I wanted to say that I enjoyed this:
Dec 11 2018
Had an extended conversation with @Eevans about this on IRC today. His values are fine with me -- as long as he and Core Platform feel that session storage is fast enough, I'm happy to take the increased latency in exchange for a move toward multi-master.
@Krinkle Opened CRs for coal and navtiming. arc-lamp is missing most of the scaffolding for a Python package (tox.ini, setup.py, etc.), so rather than adding that as part of this ticket, I'd suggest that the tox.ini file be created with a pinned version whenever that scaffolding is added.
Verified that tox.ini pinning works by changing the pinned version and then running tox locally.
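For context, pinning here just means listing exact versions in the tox.ini deps section. A minimal sketch of what that looks like, with purely illustrative package names and versions rather than the real dependency set for these repos:

```
[tox]
envlist = py3

[testenv]
# Illustrative pins only -- not the actual dependency list.
deps =
    requests==2.20.1
    pytest==4.0.2
commands = pytest
```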
Dec 10 2018
@aaron to provide feedback, will assign back once he has.
@aaron is going to see what else can be done to reduce spam, will then assign back to @ArielGlenn
Given that we mix and match pip to manage Python deps (which is good!) and Puppet to install them (which is bad!), I'd suggest using a VERY light hand with this.
Dec 5 2018
@phuedx Any chance that someone had an opportunity to look at this?
Had a conversation with an Apple Web Tech Evangelist; they are aware of this and it's assigned, but no release date is known.
@CCicalese_WMF @Legoktm Is this ready to review? We are a bit unclear on current status.
Same as T205369, which has been resolved.
@mwjames we believe that this is fixed, but we're waiting on confirmation of that. Could you please let us know if this is addressed for you?
Dec 4 2018
@jcrespo Why would we need to deploy MediaWiki in order to repoint when the master is switched? Wouldn't the proxy be responsible for that?
Dec 3 2018
@Smalyshev Guessing this should go back to you for followup?
I've been running this in a tmux session on a few of the wdqs servers (208.80.154.224 is the edge for the text cache cluster):

```
while :; do
  DSTAMP=$(date)
  CW=$(sudo netstat -anet | grep 208.80.154.224 | grep -c CLOSE_WAIT)
  echo "${DSTAMP}: ${CW}"
  sleep 1
done >> ~/close_waits.txt
```
@Smalyshev Yes, it would be slower, but it would also be diagnostic -- if persistent connections are disabled and the errors stop, we can be pretty confident that something about the way they're configured is what's causing this issue.
@Smalyshev Another thought: why not just disable pooling, and have the client close each connection after each request?
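For illustration only -- the updater's HTTP client would need the equivalent change on the Java side, but the difference in Python terms is roughly this (the URL is a placeholder, not the real endpoint):

```python
import requests

URL = "https://example.org/sparql"  # placeholder endpoint, not the actual service

# Pooled: a Session keeps idle keep-alive connections open between requests,
# which is where lingering CLOSE_WAIT sockets can accumulate if the far end closes.
session = requests.Session()
for _ in range(3):
    session.get(URL)

# Unpooled: ask for the connection to be torn down after each response,
# so nothing sits idle between requests.
for _ in range(3):
    requests.get(URL, headers={"Connection": "close"})
```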
@BBlack @ema A couple of questions for you about nginx:
- Do we have nginx configured to handle a specific number of requests on a given worker process/thread, and then shut that down?
- Is it possible for nginx to be restarted (interrupting existing persistent connections) due to config updates or the like, and if so, is there a record of times when that has happened?
Nov 30 2018
Hey, that looks a lot better! Nice work, @daniel !
Banner went live - happens every year
Nov 29 2018
@jcrespo I should have kept my answer simpler: I think it's fine to go ahead and do this. I would suggest changing one key (e.g., '10.64.0.12' becomes 'pc1') and deploying that. Wait 24-48 hours, until the hit rate has basically recovered, then change the second key. Wait another 24-48 hours, and change the last one.
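To spell out the reasoning behind staging the renames: my understanding is that entries are sharded across the pc hosts by hashing against these labels, so each rename sends a chunk of lookups to a different shard, where they miss until re-cached. A toy simulation of that effect, assuming rendezvous-style hashing over the labels (a simplification, not MediaWiki's actual code, and only the first label below is a real one from this ticket):

```python
import hashlib

def pick_server(key, labels):
    # Rendezvous-style hashing: the label with the highest hash(key, label) wins.
    return max(labels, key=lambda label: hashlib.md5(f"{key}/{label}".encode()).hexdigest())

keys = [f"page:{i}" for i in range(20_000)]

before    = ["10.64.0.12", "10.64.32.1", "10.64.48.1"]  # old labels (only the first is real)
one_step  = ["pc1", "10.64.32.1", "10.64.48.1"]         # renaming a single key, as suggested
all_steps = ["pc1", "pc2", "pc3"]                        # renaming everything at once

def fraction_remapped(old, new):
    # Keys whose shard assignment changes will miss until they are re-cached,
    # which is what drags the hit rate down after each rename.
    return sum(pick_server(k, old) != pick_server(k, new) for k in keys) / len(keys)

print(f"one label renamed:  {fraction_remapped(before, one_step):.0%} of keys remap")
print(f"all labels renamed: {fraction_remapped(before, all_steps):.0%} of keys remap")
```

The exact fractions depend on the real hashing, but doing one rename per deploy and letting the hit rate recover in between means the parsers never have to repopulate everything at once.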
