Data-Platform-SRE (2024.05.06 - 2024.05.26) · Milestone
Archived · Public

Members (6)

Details

Description

Milestone for Data Platform SRE work

Recent Activity

Sep 5 2025

Gehel closed T363521: Completion suggester can promote a bad build, a subtask of T363694: Post incident tasks: Search missing results/unavailable for some eqiad users, as Resolved.
Sep 5 2025, 7:51 AM · Data-Platform-SRE (2024.05.06 - 2024.05.26), Discovery-Search (Current work), Sustainability (Incident Followup), SRE-OnFire

Jul 9 2025

bking closed T363697: Pybal: Depool nodes outside broadcast domain, a subtask of T363694: Post incident tasks: Search missing results/unavailable for some eqiad users, as Invalid.
Jul 9 2025, 4:12 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26), Discovery-Search (Current work), Sustainability (Incident Followup), SRE-OnFire

May 13 2025

bking closed T363702: LVS hosts: Monitor/alert when pooled nodes are outside broadcast domain, a subtask of T363694: Post incident tasks: Search missing results/unavailable for some eqiad users, as Resolved.
May 13 2025, 5:00 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26), Discovery-Search (Current work), Sustainability (Incident Followup), SRE-OnFire

May 8 2025

dcausse reopened T363521: Completion suggester can promote a bad build, a subtask of T363694: Post incident tasks: Search missing results/unavailable for some eqiad users, as Open.
May 8 2025, 10:15 AM · Data-Platform-SRE (2024.05.06 - 2024.05.26), Discovery-Search (Current work), Sustainability (Incident Followup), SRE-OnFire

Feb 11 2025

Gehel edited projects for T360697: Investigate/fix broken apifeatureusage index deletion, added: Discovery-Search (Current work); removed Discovery-Search.
Feb 11 2025, 3:25 PM · Discovery-Search (Current work), Data-Platform-SRE (2024.05.06 - 2024.05.26)

Jan 10 2025

Gehel moved T360598: kafka-main certificates expiring on 2024-04-04 from Backlog to Done on the Data-Platform-SRE (2024.05.06 - 2024.05.26) board.
Jan 10 2025, 4:59 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26), Data-Engineering, serviceops
Gehel edited projects for T360598: kafka-main certificates expiring on 2024-04-04, added: Data-Platform-SRE (2024.05.06 - 2024.05.26); removed Data-Platform-SRE.
Jan 10 2025, 4:58 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26), Data-Engineering, serviceops
Gehel moved T360697: Investigate/fix broken apifeatureusage index deletion from Backlog to Done on the Data-Platform-SRE (2024.05.06 - 2024.05.26) board.
Jan 10 2025, 4:40 PM · Discovery-Search (Current work), Data-Platform-SRE (2024.05.06 - 2024.05.26)
Gehel edited projects for T360697: Investigate/fix broken apifeatureusage index deletion, added: Data-Platform-SRE (2024.05.06 - 2024.05.26); removed Data-Platform-SRE.
Jan 10 2025, 4:40 PM · Discovery-Search (Current work), Data-Platform-SRE (2024.05.06 - 2024.05.26)
Gehel moved T361024: NEW BUG REPORT SSL certificate verification error when using internal API endpoints from conda-analytics and Jupyter on stat host from Backlog to Done on the Data-Platform-SRE (2024.05.06 - 2024.05.26) board.
Jan 10 2025, 4:38 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26), Data-Platform
Gehel edited projects for T361024: NEW BUG REPORT SSL certificate verification error when using internal API endpoints from conda-analytics and Jupyter on stat host, added: Data-Platform-SRE (2024.05.06 - 2024.05.26); removed Data-Platform-SRE.
Jan 10 2025, 4:37 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26), Data-Platform
Gehel edited projects for T317182: Move archiva to private IPs + CDN, added: Data-Platform-SRE (2024.05.06 - 2024.05.26); removed Data-Platform-SRE.
Jan 10 2025, 4:34 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26)
Gehel moved T354936: Review the use of scap + git-fat for Data Platform Engineering use cases from Backlog to Done on the Data-Platform-SRE (2024.05.06 - 2024.05.26) board.
Jan 10 2025, 4:34 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26)
Gehel edited projects for T354936: Review the use of scap + git-fat for Data Platform Engineering use cases, added: Data-Platform-SRE (2024.05.06 - 2024.05.26); removed Data-Platform-SRE.
Jan 10 2025, 4:34 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26)
Gehel moved T273086: Downloading from Archiva.wikimedia.org is slower than Maven Central from Backlog to Done on the Data-Platform-SRE (2024.05.06 - 2024.05.26) board.
Jan 10 2025, 4:33 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26)
Gehel edited projects for T273086: Downloading from Archiva.wikimedia.org is slower than Maven Central, added: Data-Platform-SRE (2024.05.06 - 2024.05.26); removed Data-Platform-SRE.
Jan 10 2025, 4:33 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26)
Gehel moved T327267: Create a DSE Kubernetes cluster with support for persistent storage from Ceph from Backlog to Done on the Data-Platform-SRE (2024.05.06 - 2024.05.26) board.
Jan 10 2025, 4:27 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26), Epic, Foundational Technology Requests
Gehel edited projects for T327267: Create a DSE Kubernetes cluster with support for persistent storage from Ceph, added: Data-Platform-SRE (2024.05.06 - 2024.05.26); removed Data-Platform-SRE.
Jan 10 2025, 4:26 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26), Epic, Foundational Technology Requests
Gehel moved T280905: Analytics coordinator failover improvements from Backlog to Done on the Data-Platform-SRE (2024.05.06 - 2024.05.26) board.
Jan 10 2025, 4:17 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26)
Gehel moved T361511: Try to reverse wipefs on host using DRAC/iLO and document from Backlog to Done on the Data-Platform-SRE (2024.05.06 - 2024.05.26) board.
Jan 10 2025, 4:17 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26)
Gehel edited projects for T280905: Analytics coordinator failover improvements, added: Data-Platform-SRE (2024.05.06 - 2024.05.26); removed Data-Platform-SRE.
Jan 10 2025, 4:15 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26)
Gehel edited projects for T361511: Try to reverse wipefs on host using DRAC/iLO and document, added: Data-Platform-SRE (2024.05.06 - 2024.05.26); removed Data-Platform-SRE.
Jan 10 2025, 4:14 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26)
Gehel moved T366137: Remove datahub from LVS from Backlog to Done on the Data-Platform-SRE (2024.05.06 - 2024.05.26) board.
Jan 10 2025, 3:51 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26)
Gehel moved T364037: Investigate why pools.json does not match https://config-master.wikimedia.org/pybal/${datacenter}/${service} T363702 from Backlog to Done on the Data-Platform-SRE (2024.05.06 - 2024.05.26) board.
Jan 10 2025, 3:51 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26), serviceops, Traffic
Gehel edited projects for T366137: Remove datahub from LVS, added: Data-Platform-SRE (2024.05.06 - 2024.05.26); removed Data-Platform-SRE (2023.12.01 - 2023.12.31).
Jan 10 2025, 3:51 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26)
Gehel edited projects for T365576: provision datahub-next subdomain, added: Data-Platform-SRE (2024.05.06 - 2024.05.26); removed Data-Platform-SRE (2023.12.01 - 2023.12.31).
Jan 10 2025, 3:50 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26)
Gehel edited projects for T365400: Enable blocked commands in Zookeeper management interface, added: Data-Platform-SRE (2024.05.06 - 2024.05.26); removed Data-Platform-SRE (2023.12.01 - 2023.12.31).
Jan 10 2025, 3:50 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26)
Gehel edited projects for T365010: Tear down the flink-operator on dse-k8s-eqiad, added: Data-Platform-SRE (2024.05.06 - 2024.05.26); removed Data-Platform-SRE (2023.12.01 - 2023.12.31).
Jan 10 2025, 3:50 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26)
Gehel edited projects for T364964: an-redacteddb1001 experiencing disk errors, added: Data-Platform-SRE (2024.05.06 - 2024.05.26); removed Data-Platform-SRE (2023.12.01 - 2023.12.31).
Jan 10 2025, 3:50 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26)
Gehel edited projects for T364795: Build the required container images for the cloudnativepg postgresql operator, added: Data-Platform-SRE (2024.05.06 - 2024.05.26); removed Data-Platform-SRE (2023.12.01 - 2023.12.31).
Jan 10 2025, 3:49 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26)
Gehel edited projects for T364037: Investigate why pools.json does not match https://config-master.wikimedia.org/pybal/${datacenter}/${service} T363702, added: Data-Platform-SRE (2024.05.06 - 2024.05.26); removed Data-Platform-SRE (2023.12.01 - 2023.12.31).
Jan 10 2025, 3:49 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26), serviceops, Traffic

Aug 15 2024

klausman closed T365479: Update kserve and knative-serving charts for new-style Calico network policies, a subtask of T287491: Allow to address Kubernetes API servers from NetworkPolicy, as Resolved.
Aug 15 2024, 2:14 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26), Patch-For-Review, serviceops, Prod-Kubernetes, Kubernetes

Jul 19 2024

Gehel closed T363521: Completion suggester can promote a bad build, a subtask of T363694: Post incident tasks: Search missing results/unavailable for some eqiad users, as Resolved.
Jul 19 2024, 12:47 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26), Discovery-Search (Current work), Sustainability (Incident Followup), SRE-OnFire

Jul 16 2024

CDanis closed T365855: Stop hardcoding k8s master (k8s API) endpoint IP addresses, a subtask of T287491: Allow to address Kubernetes API servers from NetworkPolicy, as Resolved.
Jul 16 2024, 7:24 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26), Patch-For-Review, serviceops, Prod-Kubernetes, Kubernetes
jijiki added a subtask for T287491: Allow to address Kubernetes API servers from NetworkPolicy: T365855: Stop hardcoding k8s master (k8s API) endpoint IP addresses.
Jul 16 2024, 3:19 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26), Patch-For-Review, serviceops, Prod-Kubernetes, Kubernetes

Jul 12 2024

jijiki closed T287491: Allow to address Kubernetes API servers from NetworkPolicy as Resolved.
Jul 12 2024, 10:26 AM · Data-Platform-SRE (2024.05.06 - 2024.05.26), Patch-For-Review, serviceops, Prod-Kubernetes, Kubernetes

Jun 11 2024

gerritbot added a comment to T287491: Allow to address Kubernetes API servers from NetworkPolicy.

Change #1031892 merged by jenkins-bot:

[operations/deployment-charts@master] cirrus-streaming-updater: remove zk network policies

https://gerrit.wikimedia.org/r/1031892

Jun 11 2024, 4:05 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26), Patch-For-Review, serviceops, Prod-Kubernetes, Kubernetes

Jun 5 2024

Albertvillanovadelmoral added a comment to T354687: Missing dumps with underscores on mirrors.

Thanks for the fix @xcollazo!

Jun 5 2024, 1:38 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26), Test Kitchen (Data Products Sprint 13), Dumps-Generation

Jun 4 2024

bking closed T361114: Alert Search Platform and/or DPE SRE when Wikidata is lagged, a subtask of T360993: WDQS lag propagation to wikidata not working as intended, as Resolved.
Jun 4 2024, 1:40 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26), MW-1.42-notes (1.42.0-wmf.25; 2024-04-02), Wikidata, Discovery-Search (Current work)
Maintenance_bot removed a project from T364795: Build the required container images for the cloudnativepg postgresql operator: Patch-For-Review.
Jun 4 2024, 1:30 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26)
CodeReviewBot added a comment to T364795: Build the required container images for the cloudnativepg postgresql operator.

brouberol merged https://gitlab.wikimedia.org/repos/data-engineering/postgresql-kubernetes/-/merge_requests/2

Jun 4 2024, 1:11 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26)

Jun 3 2024

JAllemandou added a comment to T365197: ISPDatabaseReader null pointer exception.

Task description updated with latest stack trace.

Jun 3 2024, 7:09 AM · Data-Platform-SRE (2024.05.27 - 2024.06.16), Patch-For-Review, Data-Engineering

May 31 2024

CodeReviewBot added a project to T364795: Build the required container images for the cloudnativepg postgresql operator: Patch-For-Review.

brouberol opened https://gitlab.wikimedia.org/repos/data-engineering/postgresql-kubernetes/-/merge_requests/2

May 31 2024, 8:33 AM · Data-Platform-SRE (2024.05.06 - 2024.05.26)

May 30 2024

CodeReviewBot added a comment to T365197: ISPDatabaseReader null pointer exception.

joal merged https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/717

May 30 2024, 5:33 PM · Data-Platform-SRE (2024.05.27 - 2024.06.16), Patch-For-Review, Data-Engineering
CodeReviewBot added a comment to T365197: ISPDatabaseReader null pointer exception.

joal opened https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/717

May 30 2024, 4:55 PM · Data-Platform-SRE (2024.05.27 - 2024.06.16), Patch-For-Review, Data-Engineering
JAllemandou reopened T365197: ISPDatabaseReader null pointer exception as Open.
May 30 2024, 4:54 PM · Data-Platform-SRE (2024.05.27 - 2024.06.16), Patch-For-Review, Data-Engineering

May 24 2024

Gehel archived Data-Platform-SRE (2024.05.06 - 2024.05.26).
May 24 2024, 12:22 PM
Gehel moved T360219: Migrate WMF Maven projects to new parent pom from Needs Review to In Progress on the Data-Platform-SRE (2024.05.06 - 2024.05.26) board.
May 24 2024, 12:19 PM · Data-Engineering-Radar, Data-Engineering, Java-Scala-Standardization
Maintenance_bot removed a project from T349619: Migrate roles to puppet7: Patch-For-Review.
May 24 2024, 11:31 AM · Patch-For-Review, Data-Platform-SRE (2024.06.17 - 2024.07.07), serviceops, collaboration-services, SRE-tools, Puppet-Core, Puppet (Puppet 7.0), Infrastructure-Foundations, SRE
Maintenance_bot removed a project from T364533: Migrate AQS and image-suggestions services to Calico Network Policies: Patch-For-Review.
May 24 2024, 11:31 AM · Data-Platform-SRE (2024.05.27 - 2024.06.16), Test Kitchen, Kubernetes