Using statistics, Bayesian inference, machine learning, and software/data engineering to solve problems and inform decisions in Product Analytics and improve product experimentation capabilities with Experiment Platform
User Details
- User Since
- Jul 27 2015, 4:15 PM (553 w, 1 d)
- Availability
- Available
- IRC Nick
- bearloga
- LDAP User
- Bearloga
- MediaWiki User
- MPopov (WMF) [ Global Accounts ]
Today
This needs to be blocked on T419021: LDAP groups for tiered access to GrowthBook
Fri, Feb 27
The short term fix is to pause the mobile TOC experiment on zhwiki.
zhwiki was removed from the experiment's target wikis as of an hour ago – the experiment is off zhwiki for the remainder of the experiment
Thu, Feb 26
Or maybe I should have created a new task instead?
I'm still frequently getting
QueuePool limit of size 10 overflow 10 reached, connection timed out, timeout 30.00 (Background on this error at: https://sqlalche.me/e/14/3o7r)
and intermittent 500 errors, including just now while trying to open https://superset.wikimedia.org/users/userinfo/
Wed, Feb 25
I'm not opposed to Michael's proposal but want to give it a thought. In the past GrowthExperiments had added hidden fields to the account creation form including information such as a variant assigned. I think the TestKitchen JS SDK should be able to add the subjectId information along with the variant and all be sent within the authentication request and received on the server side PHP hook that ensures account has been created. I will give it a thought and get back asap.
Thank you!!
Tue, Feb 24
Mon, Feb 23
Wouldn't client side be unreliable for capturing account_created events as they are prone to be cut-off by add-blockers, private browsing settings and others while the server side account creation is reliable
Fri, Feb 20
AMAZING WORK. I did some benchmarking (using the synth A/A/A test with 808K events) and the speed gains are EXQUISITE:
Thu, Feb 19
Wed, Feb 18
If you generate the version string by hashing the appropriate properties (e.g. include machine-readable name, exclude friendly name), then you don't need to worry about friendly database tinkerers keeping the version field up to date.
Tue, Feb 17
The scenario I mentioned is possible because GB are limited in what configs/dbs they regularly test on (ref. https://github.com/growthbook/growthbook/pull/5298) and we know we're basically the only ones using kerberos auth, so if anything emerges there we're the canary.
So, at first I was thinking production data lake because one check we need to be able to do with growthbook-next is presto connector and kerberos authentication. If GB releases a version that accidentally breaks either of those, we should detect that in gb-next rather than in main/prod instance.
Fri, Feb 13
Thanks so much for thinking through it and thank you for including potential code snippets to inform the implementation! I love the 2-tier record/ledger idea.
Thu, Feb 12
@Sfaci @KReid-WMF: @phuedx and I were talking about the future where metric specifications specify which contextual attributes should be collected per event, rather than per instrument/experiment. We've seen a need for this already, both with the exposure logging work and Editing & Growth's experiments where only edit_saved events need page_namespasce_id and page_revision_id but those aren't needed for any other events.
Santi and I confirmed that the contributors experiments events are now decorated with the correct contextual attributes.
@Sfaci: Sorry for last minute scope change, I just realized we also need performer_is_bot as one of the contextual attributes.
Wed, Feb 11
Via the background queue (implemented in DefaultEventSubmitter) and via a new method that calls navigator.sendBeacon() directly, e.g. Instrument#sendImmediately()
Tue, Feb 10
@cjming: By the way, the mechanism here doesn't have to be super complicated or "perfect". We just need a way of avoiding unnecessarily logging exposure if/when we can.
(Sounds good!)
- Notable observations
- v0.1 (initial release)
Mon, Feb 9
@MoritzMuehlenhoff: Oh that's a great point. Yes, the outcome is that @AKhatun_WMF would have admin rights on the analytics_product Airflow instance. If getting that outcome requires the membership you mentioned, please make the necessary change.
Thanks! I copied that into the acceptance criteria and updated exp_platform to experiment_platform
Fri, Feb 6
SuggestedEdits::isActivated( $this->getContext()->getUser()
seems more accurate for action: experiment_exposure event than
SuggestedEdits::isEnabledForAnyone( $this->wikiConfig )
We want the exposure event to only fire if the suggested edits module has been activated and contains either the control or the treatment experience. We don't want exposure event on a homepage visit where the module hasn't been activated and so there is no actual exposure.
As the approving party for both groups (and the person requesting this access), I approve @AKhatun_WMF's membership.
Thu, Feb 5
There is still some nuance here about what exactly counts as an exposure. But based on Kirsten's general support above, I'll go for the users visiting the homepage when SuggestedEdits are enabled for the wiki. Please let me know if that is not sufficiently precise enough!
Wed, Feb 4
@JVanderhoop-WMF's thoughts:
I think the use case here is really specific: only some folks in the treatment group created reading lists, and they want to ensure that those folks are told that it's now a beta feature, that their list lives on, etc.
This seems rare, and I am concerned about a slippery slope of providing more "individual" information rather than the aggregates that matter in our A/B testing context. (Though I'm not clear that I've articulated that feeling all too clearly here)
I think exposure logging guidance probably warrants its own guide/page which is then linked to from the conduct an experiment guide.
@MNeisler and I discussed this and arrived at the following proposal:
Thank you!
Mon, Feb 2
Some additional data to assist with investigation: https://docs.google.com/spreadsheets/d/1P3_8tGbg3Suvfa1q8TrgAXv_jk29_z7cfkOenXDuQYE/edit?gid=1285859969#gid=1285859969 (WMF internal only)
| wiki_id | assigned | subject_count | |
|---|---|---|---|
| 0 | arwiki | control | 787 |
| 1 | arwiki | treatment | 774 |
| 2 | enwiki | control | 34215 |
| 3 | enwiki | treatment | 36732 |
| 4 | frwiki | control | 4875 |
| 5 | frwiki | treatment | 5060 |
| 6 | ptwiki | control | 1497 |
| 7 | ptwiki | treatment | 1601 |
Jan 30 2026
@KReid-WMF @phuedx: I think Reader Growth has a pretty good pattern/practice that we can recommend in the docs as part of T414735: Update documentation with guidance on exposure logging.