Page MenuHomePhabricator

Scott_French (Scott French)
User

Projects

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Saturday

  • Clear sailing ahead.

User Details

User Since
Jan 18 2024, 5:33 PM (75 w, 6 d)
Availability
Available
LDAP User
Scott French
MediaWiki User
SFrench-WMF [ Global Accounts ]

Recent Activity

Today

Scott_French added a comment to T378128: Upgrade httpd images to bullseye or bookworm.

FYI, I will be out next week and intend to pick this back up when I return. Assuming all goes smoothly with the miscweb migration, the next step is to rebase the "plain" httpd image stack on bookworm via https://gerrit.wikimedia.org/r/1162030 and deprecate the -bookworm track.

Thu, Jul 3, 1:16 AM · collaboration-services, Patch-For-Review, Infrastructure-Foundations, serviceops
Scott_French added a comment to T398245: Prepare WMF PHP 8.3 packages for bullseye.

Status update before I disappear for a bit:

Thu, Jul 3, 12:50 AM · serviceops
Scott_French closed T398523: Suppress ATSBackendErrorsHigh for wdqs2009.codfw.wmnet as Resolved.

It has been over 1h since https://gerrit.wikimedia.org/r/1166016 was merged, and subsequent puppet runs on the prometheus hosts should now have picked up the change. Closing this out as resolved by @RKemper.

Thu, Jul 3, 12:37 AM · Data-Platform-SRE, SRE

Yesterday

Scott_French added a project to T398523: Suppress ATSBackendErrorsHigh for wdqs2009.codfw.wmnet: Data-Platform-SRE.
Wed, Jul 2, 11:21 PM · Data-Platform-SRE, SRE
Scott_French created T398523: Suppress ATSBackendErrorsHigh for wdqs2009.codfw.wmnet.
Wed, Jul 2, 11:16 PM · Data-Platform-SRE, SRE
Scott_French closed T394556: Clean up UcfirstOverrides.php following PHP 7.4 -> 8.1 transition, a subtask of T319432: Migrate WMF production from PHP 7.4 to PHP 8.1, as Resolved.
Wed, Jul 2, 1:26 AM · Data-Engineering-Radar, Data-Engineering, Dumps-Generation, MediaWiki-Platform-Team, serviceops
Scott_French closed T394556: Clean up UcfirstOverrides.php following PHP 7.4 -> 8.1 transition as Resolved.
Wed, Jul 2, 1:26 AM · MW-1.43-notes, MW-1.44-notes, MW-1.45-notes (1.45.0-wmf.9; 2025-07-08), MediaWiki-Engineering, serviceops
Scott_French updated the task description for T394556: Clean up UcfirstOverrides.php following PHP 7.4 -> 8.1 transition.
Wed, Jul 2, 1:11 AM · MW-1.43-notes, MW-1.44-notes, MW-1.45-notes (1.45.0-wmf.9; 2025-07-08), MediaWiki-Engineering, serviceops
Scott_French updated the task description for T394556: Clean up UcfirstOverrides.php following PHP 7.4 -> 8.1 transition.
Wed, Jul 2, 12:47 AM · MW-1.43-notes, MW-1.44-notes, MW-1.45-notes (1.45.0-wmf.9; 2025-07-08), MediaWiki-Engineering, serviceops
Scott_French added a comment to T394556: Clean up UcfirstOverrides.php following PHP 7.4 -> 8.1 transition.

My sincere thanks for all of your help here @tstarling.

Wed, Jul 2, 12:45 AM · MW-1.43-notes, MW-1.44-notes, MW-1.45-notes (1.45.0-wmf.9; 2025-07-08), MediaWiki-Engineering, serviceops
Scott_French updated the task description for T394556: Clean up UcfirstOverrides.php following PHP 7.4 -> 8.1 transition.
Wed, Jul 2, 12:40 AM · MW-1.43-notes, MW-1.44-notes, MW-1.45-notes (1.45.0-wmf.9; 2025-07-08), MediaWiki-Engineering, serviceops

Tue, Jul 1

Scott_French added a comment to T394556: Clean up UcfirstOverrides.php following PHP 7.4 -> 8.1 transition.

Thank you very much, @tstarling. That final rename is now complete.

Tue, Jul 1, 11:04 PM · MW-1.43-notes, MW-1.44-notes, MW-1.45-notes (1.45.0-wmf.9; 2025-07-08), MediaWiki-Engineering, serviceops
Scott_French added a comment to T397907: Upgrade Excimer to 1.2.5 in production.

Alright, the remaining steps here are:

Tue, Jul 1, 9:02 PM · Patch-For-Review, serviceops
Scott_French added a comment to T398245: Prepare WMF PHP 8.3 packages for bullseye.

Once I'm able to pull the PCRE2 backport builds into component/php83, I should be able to start on the build process.

Tue, Jul 1, 7:15 PM · serviceops
Scott_French added a comment to T394556: Clean up UcfirstOverrides.php following PHP 7.4 -> 8.1 transition.

Alright, one straggler I missed before on thwiki:

Tue, Jul 1, 5:35 PM · MW-1.43-notes, MW-1.44-notes, MW-1.45-notes (1.45.0-wmf.9; 2025-07-08), MediaWiki-Engineering, serviceops
Scott_French updated the task description for T394556: Clean up UcfirstOverrides.php following PHP 7.4 -> 8.1 transition.
Tue, Jul 1, 4:40 PM · MW-1.43-notes, MW-1.44-notes, MW-1.45-notes (1.45.0-wmf.9; 2025-07-08), MediaWiki-Engineering, serviceops
Scott_French closed T396903: Rename pages, images, and users to reflect migration to PHP 8.1 (Unicode 14) title-casing behavior, a subtask of T394556: Clean up UcfirstOverrides.php following PHP 7.4 -> 8.1 transition, as Resolved.
Tue, Jul 1, 4:37 PM · MW-1.43-notes, MW-1.44-notes, MW-1.45-notes (1.45.0-wmf.9; 2025-07-08), MediaWiki-Engineering, serviceops
Scott_French closed T396903: Rename pages, images, and users to reflect migration to PHP 8.1 (Unicode 14) title-casing behavior as Resolved.
Tue, Jul 1, 4:37 PM · User-notice, MediaWiki-Engineering, serviceops
Scott_French updated the task description for T396903: Rename pages, images, and users to reflect migration to PHP 8.1 (Unicode 14) title-casing behavior.
Tue, Jul 1, 4:37 PM · User-notice, MediaWiki-Engineering, serviceops
Scott_French added a comment to T396903: Rename pages, images, and users to reflect migration to PHP 8.1 (Unicode 14) title-casing behavior.

The renames listed in the task description have now been completed, and a MediaWiki configuration change has been applied that ensures titles starting with previously overridden characters now canonicalize to their correct title-case equivalents (ensuring that, e.g., the former redirects to the latter). Remaining cleanup will be tracked in the parent task (T394556).

Tue, Jul 1, 4:37 PM · User-notice, MediaWiki-Engineering, serviceops
Scott_French added a comment to T394556: Clean up UcfirstOverrides.php following PHP 7.4 -> 8.1 transition.

Alright, all renames should be complete and the title-case mappings have been reverted to just the static override for Eszett. During the deployment, I spot-checked a number of previously overridden characters now canonicalize to their proper title-case equivalents.

Tue, Jul 1, 4:30 PM · MW-1.43-notes, MW-1.44-notes, MW-1.45-notes (1.45.0-wmf.9; 2025-07-08), MediaWiki-Engineering, serviceops
Scott_French added a comment to T394556: Clean up UcfirstOverrides.php following PHP 7.4 -> 8.1 transition.

comms: I've reached out to the user via Special:EmailUser with a heads-up about the upcoming rename.

Tue, Jul 1, 1:44 AM · MW-1.43-notes, MW-1.44-notes, MW-1.45-notes (1.45.0-wmf.9; 2025-07-08), MediaWiki-Engineering, serviceops
Scott_French added a comment to T397907: Upgrade Excimer to 1.2.5 in production.

Excimer 1.2.5-1 packages are now available in /var/cache/pbuilder/result/bullseye-amd64/ on build2001, but have not yet been included in the apt repository.

Tue, Jul 1, 12:07 AM · Patch-For-Review, serviceops

Mon, Jun 30

Scott_French renamed T396903: Rename pages, images, and users to reflect migration to PHP 8.1 (Unicode 14) title-casing behavior from Rename pages and images to reflect migration to PHP 8.1 (Unicode 14) title-casing behavior to Rename pages, images, and users to reflect migration to PHP 8.1 (Unicode 14) title-casing behavior.
Mon, Jun 30, 11:15 PM · User-notice, MediaWiki-Engineering, serviceops
Scott_French added a comment to T394556: Clean up UcfirstOverrides.php following PHP 7.4 -> 8.1 transition.

@Ladsgroup and @Zabe - Thank you both. It sounds like I do indeed need to pick up the change to support file renames. While I can do an initial test run with a local copy of the uppercaseTitlesForUnicodeTransition.php, I can't use that strategy with mwscript-k8s more generally, so I may need to backport https://gerrit.wikimedia.org/r/1164665 depending on timing.

Mon, Jun 30, 10:54 PM · MW-1.43-notes, MW-1.44-notes, MW-1.45-notes (1.45.0-wmf.9; 2025-07-08), MediaWiki-Engineering, serviceops
Scott_French added a comment to T397907: Upgrade Excimer to 1.2.5 in production.

My understanding is that your RequestTimeout change is a performance mitigation for the now-more-expensive manipulation of Excimer timers. Is there anything else that should be monitored as a result of that? (i.e., in addition to segfaults and exceptions more generally)

Mon, Jun 30, 9:39 PM · Patch-For-Review, serviceops
Scott_French added a subtask for T398246: Prepare PHP 8.3 production images: T398245: Prepare WMF PHP 8.3 packages for bullseye.
Mon, Jun 30, 8:23 PM · Patch-For-Review, serviceops
Scott_French added a parent task for T398245: Prepare WMF PHP 8.3 packages for bullseye: T398246: Prepare PHP 8.3 production images.
Mon, Jun 30, 8:23 PM · serviceops
Scott_French created T398246: Prepare PHP 8.3 production images.
Mon, Jun 30, 8:17 PM · Patch-For-Review, serviceops
Scott_French created T398245: Prepare WMF PHP 8.3 packages for bullseye.
Mon, Jun 30, 8:16 PM · serviceops
Scott_French added a comment to T394556: Clean up UcfirstOverrides.php following PHP 7.4 -> 8.1 transition.

Ah, thanks for highlighting that, @Zabe! It looks like that was merged yesterday, so if it's critical, we'll need to make sure it gets picked up before we re-run the script. Given the current status of the schema migration, do you know whether we expect content to exist in that table that requires renames?

Mon, Jun 30, 7:13 PM · MW-1.43-notes, MW-1.44-notes, MW-1.45-notes (1.45.0-wmf.9; 2025-07-08), MediaWiki-Engineering, serviceops
Scott_French updated the task description for T396903: Rename pages, images, and users to reflect migration to PHP 8.1 (Unicode 14) title-casing behavior.
Mon, Jun 30, 5:43 PM · User-notice, MediaWiki-Engineering, serviceops
Scott_French updated the task description for T396903: Rename pages, images, and users to reflect migration to PHP 8.1 (Unicode 14) title-casing behavior.
Mon, Jun 30, 4:32 PM · User-notice, MediaWiki-Engineering, serviceops
Scott_French added a comment to T394556: Clean up UcfirstOverrides.php following PHP 7.4 -> 8.1 transition.

I was doing the final prep for actually running the renames this morning, and it seems there was a user created on idwiktionary 3 days ago would now be renamed. I need to sort out how / whether to communicate with this user before proceeding.

Mon, Jun 30, 4:19 PM · MW-1.43-notes, MW-1.44-notes, MW-1.45-notes (1.45.0-wmf.9; 2025-07-08), MediaWiki-Engineering, serviceops
Scott_French added a project to T378128: Upgrade httpd images to bullseye or bookworm: collaboration-services.

I was chatting with @Jelto earlier today about migrating miscweb, and it sounds like it should be doable / preferable to migrate in two steps, similar to what we're doing with shellbox and mediawiki - i.e., switch to httpd-bookworm and deploy / verify, then switch back to httpd once the latter has been rebased on bookworm.

Mon, Jun 30, 2:23 PM · collaboration-services, Patch-For-Review, Infrastructure-Foundations, serviceops

Thu, Jun 26

Scott_French added a comment to T352245: Migrate the etcd main cluster to cfssl-based PKI.

Alright, I think https://gerrit.wikimedia.org/r/1164264 is the simplest option to achieve the specific behavior we want - i.e., reload rather than restart, and do so when any of the relevant resources change.

Thu, Jun 26, 9:31 PM · Patch-For-Review, serviceops
Scott_French added a comment to T378128: Upgrade httpd images to bullseye or bookworm.

As of 17:20 UTC, all mediawiki releases have now migrated to the bookworm-based webserver image.

Thu, Jun 26, 6:01 PM · collaboration-services, Patch-For-Review, Infrastructure-Foundations, serviceops
Scott_French updated the task description for T394556: Clean up UcfirstOverrides.php following PHP 7.4 -> 8.1 transition.
Thu, Jun 26, 3:52 PM · MW-1.43-notes, MW-1.44-notes, MW-1.45-notes (1.45.0-wmf.9; 2025-07-08), MediaWiki-Engineering, serviceops
Scott_French added a comment to T397685: helmfile/scap does not reliably bootstrap mediawiki.

@Clement_Goubert - Ah, thanks for the additional details!

Thu, Jun 26, 2:42 PM · MW-on-K8s, Release-Engineering-Team, Scap, serviceops
Scott_French added a comment to T352245: Migrate the etcd main cluster to cfssl-based PKI.

Thanks, @MoritzMuehlenhoff - that's an interesting idea!

Thu, Jun 26, 12:19 AM · Patch-For-Review, serviceops

Wed, Jun 25

Scott_French updated the task description for T397874: Assess switchover behavior for mw-wikifunctions.
Wed, Jun 25, 10:26 PM · serviceops, Datacenter-Switchover
Scott_French updated subscribers of T397685: helmfile/scap does not reliably bootstrap mediawiki.

I was chatting with @dancy earlier today about what might have caused this, and it's kind of a puzzling one.

Wed, Jun 25, 9:43 PM · MW-on-K8s, Release-Engineering-Team, Scap, serviceops
Scott_French renamed T397874: Assess switchover behavior for mw-wikifunctions from Update switchover behavior for mw-wikifunctions to Assess switchover behavior for mw-wikifunctions.
Wed, Jun 25, 8:13 PM · serviceops, Datacenter-Switchover
Scott_French closed T376237: Turn down unused swift-r[ow] discovery services as Resolved.

This is done now. Thanks for the reviews, all!

Wed, Jun 25, 8:06 PM · SRE-swift-storage, Datacenter-Switchover, serviceops
Scott_French created T397874: Assess switchover behavior for mw-wikifunctions.
Wed, Jun 25, 8:02 PM · serviceops, Datacenter-Switchover
Scott_French added a comment to T352245: Migrate the etcd main cluster to cfssl-based PKI.

Let's start with the good news: Everything that could be evaluated after migrating a single host (conf2006) seems to work as expected. We were able to confirm that conftool, etcdctl, MediaWiki, confd, and Liberica experience no connectivity issues as a result of the new certificates. Further, since conftool works, there is no reason to expect other python-etcd-based clients (e.g., navtiming, spicerack) will not.

Wed, Jun 25, 4:24 PM · Patch-For-Review, serviceops
Scott_French created P78683 (An Untitled Masterwork).
Wed, Jun 25, 2:01 PM

Tue, Jun 24

Scott_French added a comment to T378128: Upgrade httpd images to bullseye or bookworm.

As of ~ 17:30 UTC today, both mw-api-ext and mw-web are serving ~ 5% of traffic via the migration releases, which are in turn using the bookworm webserver image.

Tue, Jun 24, 11:40 PM · collaboration-services, Patch-For-Review, Infrastructure-Foundations, serviceops
Scott_French added a project to T397727: Can't upload video to Commons: "incorrect fragment shifting": UploadWizard.

@MBH - Thanks for the report. Let me see if I can find the right folks to investigate this further.

Tue, Jun 24, 4:21 PM · UploadWizard, Commons, MediaWiki-Uploading
Scott_French updated the task description for T376237: Turn down unused swift-r[ow] discovery services.
Tue, Jun 24, 3:35 PM · SRE-swift-storage, Datacenter-Switchover, serviceops
Scott_French updated the task description for T376237: Turn down unused swift-r[ow] discovery services.
Tue, Jun 24, 3:28 PM · SRE-swift-storage, Datacenter-Switchover, serviceops
Scott_French updated the task description for T376237: Turn down unused swift-r[ow] discovery services.
Tue, Jun 24, 12:44 AM · SRE-swift-storage, Datacenter-Switchover, serviceops

Mon, Jun 23

Scott_French updated the task description for T376237: Turn down unused swift-r[ow] discovery services.
Mon, Jun 23, 11:21 PM · SRE-swift-storage, Datacenter-Switchover, serviceops
Scott_French added a comment to T376237: Turn down unused swift-r[ow] discovery services.

@MatthewVernon - Ah, that's great! Yes, let's keep those pointed to failoid, then. I'll post a patch shortly to do the "manual equivalent" for the swift-ro services.

Mon, Jun 23, 10:20 PM · SRE-swift-storage, Datacenter-Switchover, serviceops
Scott_French updated the task description for T376237: Turn down unused swift-r[ow] discovery services.
Mon, Jun 23, 8:53 PM · SRE-swift-storage, Datacenter-Switchover, serviceops
Scott_French updated the task description for T378128: Upgrade httpd images to bullseye or bookworm.
Mon, Jun 23, 8:25 PM · collaboration-services, Patch-For-Review, Infrastructure-Foundations, serviceops
Scott_French renamed T352245: Migrate the etcd main cluster to cfssl-based PKI from Migrate etcd::tlsproxy Nginx certs and etcd itself to PKI to Migrate the etcd main cluster to cfssl-based PKI.
Mon, Jun 23, 7:16 PM · Patch-For-Review, serviceops
Scott_French updated the task description for T396903: Rename pages, images, and users to reflect migration to PHP 8.1 (Unicode 14) title-casing behavior.
Mon, Jun 23, 5:53 PM · User-notice, MediaWiki-Engineering, serviceops
Scott_French added a comment to T378128: Upgrade httpd images to bullseye or bookworm.

The webserver-bookworm image flavour is now live in mw-debug/next, passing httpbb checks and manual kicking-of-tires by me. No errors / issues surfaced in httpd container logs. None of this is surprising, given that apache 2.4.62 has been live on the mwdebug hosts for some time without issue.

Mon, Jun 23, 5:48 PM · collaboration-services, Patch-For-Review, Infrastructure-Foundations, serviceops

Sat, Jun 21

Scott_French added a comment to T352245: Migrate the etcd main cluster to cfssl-based PKI.

Revisiting this today, here's a revised plan for the nginx TLS proxy portion of the migration.

Sat, Jun 21, 12:22 AM · Patch-For-Review, serviceops

Wed, Jun 18

Scott_French added a comment to T378128: Upgrade httpd images to bullseye or bookworm.

After no issues were uncovered for shellbox-syntaxhighlight with ~ 24h on the new images, the remaining (5) shellbox instances have now been updated as well (staggered by datacenter by ~ 20m). Validating using the same graphs and logs as in T378128#10925040, no issues have been uncovered so far, though again I'll check in periodically throughout the day.

Wed, Jun 18, 6:26 PM · collaboration-services, Patch-For-Review, Infrastructure-Foundations, serviceops
Scott_French added a comment to T395696: Move ExternalStore config out of mediawiki config.

Alright, I'll let you take this from here, @Ladsgroup :)

Wed, Jun 18, 4:12 PM · conftool, DBA
Scott_French added a comment to T395696: Move ExternalStore config out of mediawiki config.

Thank you both for resolving that! Indeed, prior to [0] landing in 5.3.0, RO state on external sections was ignored by dbctl when generating the committed dbconfig - thus Amir's observation in T395696#10927592.

Wed, Jun 18, 2:38 PM · conftool, DBA

Tue, Jun 17

Scott_French updated the task description for T396903: Rename pages, images, and users to reflect migration to PHP 8.1 (Unicode 14) title-casing behavior.
Tue, Jun 17, 9:19 PM · User-notice, MediaWiki-Engineering, serviceops
Scott_French updated the task description for T396903: Rename pages, images, and users to reflect migration to PHP 8.1 (Unicode 14) title-casing behavior.
Tue, Jun 17, 9:14 PM · User-notice, MediaWiki-Engineering, serviceops
Scott_French added a comment to T394556: Clean up UcfirstOverrides.php following PHP 7.4 -> 8.1 transition.

A couple of updates:

Tue, Jun 17, 9:03 PM · MW-1.43-notes, MW-1.44-notes, MW-1.45-notes (1.45.0-wmf.9; 2025-07-08), MediaWiki-Engineering, serviceops
Scott_French added a comment to T395696: Move ExternalStore config out of mediawiki config.

@Ladsgroup - I was optimistically walking through some final checks ahead of releasing conftool, and I noticed that x3 is marked read-only:

Tue, Jun 17, 7:33 PM · conftool, DBA
Scott_French added a comment to T378128: Upgrade httpd images to bullseye or bookworm.

After about and hour of soak with 1 replica per DC on the new httpd images and no issues observed, I've now moved all of syntaxhighlight forward. I've been keeping an eye on general service health in grafana (eqiad, codfw), httpd container logs (manual tailing with kubectl), mediawiki exec-channel errors (logstash), and ShellboxError exceptions (logstash), and will be checking in periodically throughout the day.

Tue, Jun 17, 5:38 PM · collaboration-services, Patch-For-Review, Infrastructure-Foundations, serviceops

Mon, Jun 16

Scott_French updated the task description for T396903: Rename pages, images, and users to reflect migration to PHP 8.1 (Unicode 14) title-casing behavior.
Mon, Jun 16, 8:20 PM · User-notice, MediaWiki-Engineering, serviceops
Scott_French added a comment to T389786: Integrate mediawiki-dumps-legacy with the regular MW scap deployments.

After the switch to a CronJob, I was able to successfully apply a lingering image diff from today's UTC-afternoon backport window using scap. Thanks for driving that @brouberol!

Mon, Jun 16, 5:19 PM · Data-Platform-SRE (2025.06.13 - 2025.07.04), Patch-For-Review
Scott_French closed T390630: Alert when disk space utilization on sessionstore nodes is trending high as Resolved.

With the alert routing and severity changes now merged, I believe that wraps up the remaining work here. Thanks for the discussion, folks.

Mon, Jun 16, 2:37 PM · Cassandra, SRE-OnFire, Sustainability (Incident Followup)
Scott_French updated the task description for T390630: Alert when disk space utilization on sessionstore nodes is trending high.
Mon, Jun 16, 2:36 PM · Cassandra, SRE-OnFire, Sustainability (Incident Followup)

Fri, Jun 13

Scott_French renamed T396903: Rename pages, images, and users to reflect migration to PHP 8.1 (Unicode 14) title-casing behavior from Rename pages and images to reflect migration to PHP 8.1 (Unicode 14) to Rename pages and images to reflect migration to PHP 8.1 (Unicode 14) title-casing behavior.
Fri, Jun 13, 8:57 PM · User-notice, MediaWiki-Engineering, serviceops
Scott_French created T396903: Rename pages, images, and users to reflect migration to PHP 8.1 (Unicode 14) title-casing behavior.
Fri, Jun 13, 8:55 PM · User-notice, MediaWiki-Engineering, serviceops
Scott_French added a comment to T394556: Clean up UcfirstOverrides.php following PHP 7.4 -> 8.1 transition.

@tstarling - Thanks for reviewing the proposed renames and for confirming the mutating mode of the script still works as expected. I realize it has been a few years since it was last used, so that was definitely a concern.

Fri, Jun 13, 7:37 PM · MW-1.43-notes, MW-1.44-notes, MW-1.45-notes (1.45.0-wmf.9; 2025-07-08), MediaWiki-Engineering, serviceops
Scott_French claimed T390630: Alert when disk space utilization on sessionstore nodes is trending high.
Fri, Jun 13, 3:25 PM · Cassandra, SRE-OnFire, Sustainability (Incident Followup)
Scott_French updated subscribers of T390630: Alert when disk space utilization on sessionstore nodes is trending high.

Now that the new alert has been live for a couple of days without issuing false positives, I believe it makes sense to switch to task-severity.

Fri, Jun 13, 1:58 PM · Cassandra, SRE-OnFire, Sustainability (Incident Followup)

Thu, Jun 12

Scott_French added a comment to T378128: Upgrade httpd images to bullseye or bookworm.

Alright, we now have the ability to override the httpd image name easily. I'd propose we start with pilot on a single shellbox service in two steps (fraction of traffic -> all traffic), then expand to the remaining services, similar to what we did for the PHP 8.1 migration (although we can and should go much faster here).

Thu, Jun 12, 8:45 PM · collaboration-services, Patch-For-Review, Infrastructure-Foundations, serviceops
Scott_French added a comment to T395696: Move ExternalStore config out of mediawiki config.

The conftool 5.3.0 packages are now live on apt-staging, but have not yet been included in apt.wikimedia.org.

Thu, Jun 12, 7:56 PM · conftool, DBA
Scott_French added a comment to T378128: Upgrade httpd images to bullseye or bookworm.

Inverting the order and piloting on shellbox early on sounds good. The only downside to that is the necessary change to the chart, but that's really quite easy.

Thu, Jun 12, 1:07 PM · collaboration-services, Patch-For-Review, Infrastructure-Foundations, serviceops

Wed, Jun 11

Scott_French updated the task description for T378128: Upgrade httpd images to bullseye or bookworm.
Wed, Jun 11, 7:51 PM · collaboration-services, Patch-For-Review, Infrastructure-Foundations, serviceops
Scott_French added a comment to T378128: Upgrade httpd images to bullseye or bookworm.

Now that we can reuse some of the tools we created for the PHP 8.1 migration to pilot this easily, it makes sense to pick this back up and get it done.

Wed, Jun 11, 7:51 PM · collaboration-services, Patch-For-Review, Infrastructure-Foundations, serviceops
Scott_French closed T388260: Converge / update image version across shellbox service instances (cleanup), a subtask of T364249: New upstream release for Pygments (2.18.0), as Resolved.
Wed, Jun 11, 7:07 PM · User-bd808, MW-1.45-notes (1.45.0-wmf.5; 2025-06-10), MW-1.43-notes (1.43.0-wmf.6; 2024-05-21), User-TheDJ, Patch-For-Review, SyntaxHighlight
Scott_French closed T388260: Converge / update image version across shellbox service instances (cleanup), a subtask of T377038: Migrate production Shellbox variants to PHP 8.1, as Resolved.
Wed, Jun 11, 7:07 PM · Patch-For-Review, serviceops
Scott_French closed T388260: Converge / update image version across shellbox service instances (cleanup) as Resolved.

The remaining shellbox instances have been updated everywhere as of 18:52 UTC today. Looking at general service health and some of the use-case-specific logstash queries from T377038, all looks well.

Wed, Jun 11, 7:07 PM · serviceops
Scott_French added a comment to T381680: The ops-maint-gcal.js script is missing support for some vendors.

@elukey - Ah, I wonder if Google might have changed something. The 16384 number was based entirely on bisection with a small number of test events. It seemed to consistently be the "too large" threshold at the time, but something might have changed in the interim.

Wed, Jun 11, 12:53 PM · SRE, SRE-Unowned

Mon, Jun 9

Scott_French updated the task description for T394556: Clean up UcfirstOverrides.php following PHP 7.4 -> 8.1 transition.
Mon, Jun 9, 8:37 PM · MW-1.43-notes, MW-1.44-notes, MW-1.45-notes (1.45.0-wmf.9; 2025-07-08), MediaWiki-Engineering, serviceops
Scott_French added a comment to T390630: Alert when disk space utilization on sessionstore nodes is trending high.

The SessionStoreDiskSpaceRunwayTooLow alert is now live, although in warning severity, to limit the scope of potential noise if something odd happens early on with the maths (e.g., while the history of the recording rule builds up).

Mon, Jun 9, 6:59 PM · Cassandra, SRE-OnFire, Sustainability (Incident Followup)
Scott_French added a comment to T388260: Converge / update image version across shellbox service instances (cleanup).

The 2025-06-05-215815 image is live in shellbox-video as of ~ 17:10 UTC. No issues observed so far. I'll wait for https://gerrit.wikimedia.org/r/1154132 to be deployed (and soak for a bit) before moving ahead with the remaining shellbox instances.

Mon, Jun 9, 5:38 PM · serviceops

Fri, Jun 6

Scott_French updated the task description for T350565: Switch conftool to use the version 3 etcd datastore.
Fri, Jun 6, 5:54 PM · Patch-For-Review, conftool, Data-Persistence, Traffic, serviceops
Scott_French updated the task description for T350565: Switch conftool to use the version 3 etcd datastore.
Fri, Jun 6, 5:52 PM · Patch-For-Review, conftool, Data-Persistence, Traffic, serviceops
Scott_French added a comment to T388260: Converge / update image version across shellbox service instances (cleanup).

Scanning through the changes merged since the summary in the task description was collected, the only notable one I see was the bump to wikimedia/wikipeg 5.0.0. Production appears to be running 4.0.0 as of the 2025-01-07-141744 image. As long as that doesn't carry any notable risk, beyond what could sneak through the PEG parser tests for ShellParser, that seems fine?

Fri, Jun 6, 3:12 PM · serviceops
Scott_French added a comment to T388260: Converge / update image version across shellbox service instances (cleanup).

@bd808 - Thanks for flagging! Indeed, this fell by the wayside while dealing with other aspects of the PHP migration, and for lack of any urgent changes that needed to be deployed.

Fri, Jun 6, 3:06 PM · serviceops

Wed, Jun 4

Scott_French added a comment to T383760: dbctl: expose diff via API in a more structured way.

Thanks for the follow-up, all!

Wed, Jun 4, 1:11 AM · DBA, conftool

Tue, Jun 3

Scott_French added a comment to T395696: Move ExternalStore config out of mediawiki config.

Two items came to mind while reviewing https://gerrit.wikimedia.org/r/1152853, which will also need done to make use of this (in order):

  1. The patternProperties for readOnlyBySection will need updated in the dbconfig json-schema [0].
  2. The section flavor check [1] in DbConfig.compute_config will need updated, to consider external-flavored sections when populating readOnlyBySection.
Tue, Jun 3, 11:38 PM · conftool, DBA
Scott_French added a comment to T341553: Allow running one-off scripts manually.

@Zabe - There will be an additional announcement soon, but similar to the guidance around other not-yet-supported use cases like sql.php in this wikitech-l thread, the interim solution is likely to involve moving your mwscript usage to the active deployment host (i.e., deployment.eqiad.wmnet) instead of the soon-to-be-decommissioned mwmaint* hosts.

Tue, Jun 3, 10:04 PM · MW-on-K8s, serviceops
Scott_French added a comment to T394556: Clean up UcfirstOverrides.php following PHP 7.4 -> 8.1 transition.

Alright, I was able to run uppercaseTitlesForUnicodeTransition.php across all wikis in the default dry-run mode today.

Tue, Jun 3, 9:03 PM · MW-1.43-notes, MW-1.44-notes, MW-1.45-notes (1.45.0-wmf.9; 2025-07-08), MediaWiki-Engineering, serviceops
Scott_French added a comment to T388761: scap needs to be k8s-cluster aware.

Although changes to mediawiki-dumps-legacy will be needed before this feature can actually be put to use there (details in T389786#10881115), we were still able to "successfully" test this functionality today, and indeed it appears to work as expected.

Tue, Jun 3, 6:17 PM · Release-Engineering-Team, Dumps-Generation, Scap
Scott_French added a comment to T389786: Integrate mediawiki-dumps-legacy with the regular MW scap deployments.

Alas, as foretold in T389499#10671841, you cannot mutate the spec.template of a k8s Job object, regardless of whether it's suspended or not:

Tue, Jun 3, 5:57 PM · Data-Platform-SRE (2025.06.13 - 2025.07.04), Patch-For-Review
Scott_French closed T395521: Members of restricted cannot use --dblist as Resolved.

With https://gerrit.wikimedia.org/r/1152854 merged, I believe this problem should be fixed.

Tue, Jun 3, 4:10 PM · serviceops, MW-on-K8s