Failover plan for adding disk on boost01

Playbook/Checklist: adding disk space to boost01/mount

Failover to boost02

  • (Maybe/Probably) Suspend boost reporting job

  • Put site in maintenance mode(?)

  • Manual run of boost etl job

  • Switch Production k8s endpoints to boost02

    Note: switching endpoints before promoting the standby database seems like the best way to prevent any split-brain or data loss in the HA cluster, but it may result in some momentary ugly Airbrake errors. Obviously the goal is to switch/promote/stop the old master as close to simultaneously as we can manage.

    Q: Should this be a PR into infrastructure:production branch, or can we deploy a separate branch into Production (we do intend to switch back, after all)?
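
    A rough sketch of the endpoint switch, assuming these are selector-less Services backed by hand-managed Endpoints objects (the object names are the ones listed in the switch-back section; the IP and port are placeholders):

    ```sh
    # Repoint the Endpoints object at boost02; repeat for external-reporting-compute-database.
    kubectl patch endpoints external-reporting-database --type merge \
      -p '{"subsets":[{"addresses":[{"ip":"10.0.0.22"}],"ports":[{"port":5432}]}]}'
    ```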

  • Roll pgbouncer pods in k8s
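
    For example (the deployment name and label are assumptions):

    ```sh
    # Restart the pgbouncer pods so they re-resolve the new endpoints.
    kubectl rollout restart deployment/pgbouncer
    kubectl rollout status deployment/pgbouncer
    # On older kubectl without `rollout restart`, deleting the pods by label works too:
    # kubectl delete pod -l app=pgbouncer
    ```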

  • Switch over the boost01:5432 primary instance to boost02:5432 by running repmgr standby switchover on boost02
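
    Roughly, run as the postgres user on boost02 (the config file path is an assumption):

    ```sh
    # Dry-run first to surface SSH/permission problems, then perform the switchover.
    repmgr -f /etc/repmgr/10/repmgr.conf standby switchover --dry-run
    repmgr -f /etc/repmgr/10/repmgr.conf standby switchover
    repmgr -f /etc/repmgr/10/repmgr.conf cluster show
    ```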

  • Confirm logical replication subscription in session db (refresh or rebuild as needed)

    The logical subscription in benchprep_reporting_api_production is pointed at db02 and should, in theory, pick up where it left off when we promote the standby, but my confidence in that is limited.
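
    If it does not resume cleanly, a rough recovery sequence (the subscription name is a placeholder) would be:

    ```sh
    # Inspect current subscription state first.
    psql -d benchprep_reporting_api_production -c "SELECT subname, subenabled, subconninfo FROM pg_subscription;"
    # If replication has stalled, bounce the subscription and re-sync the table list.
    psql -d benchprep_reporting_api_production -c "ALTER SUBSCRIPTION reporting_sub DISABLE;"
    psql -d benchprep_reporting_api_production -c "ALTER SUBSCRIPTION reporting_sub ENABLE;"
    psql -d benchprep_reporting_api_production -c "ALTER SUBSCRIPTION reporting_sub REFRESH PUBLICATION;"
    ```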

Prepare Postgres on boost01

  • Order new 1.9TB SED SSD for boost01

  • Wait for IBM to install the disk

  • Stop postgres on boost01:6432

  • Copy /var/lib/pgsql/10/data/*.conf to /tmp/5432/

  • Copy /mount/pgsql/10/wmx_rails_api/*conf to /tmp/6432/
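
    A sketch of the stop-and-preserve steps above, run as root or the postgres user (the systemd unit name for the 6432 instance is a guess):

    ```sh
    # Stop the wmx_rails_api instance and stash both instances' config files.
    sudo systemctl stop postgresql-10-wmx_rails_api
    mkdir -p /tmp/5432 /tmp/6432
    cp /var/lib/pgsql/10/data/*.conf /tmp/5432/
    cp /mount/pgsql/10/wmx_rails_api/*conf /tmp/6432/
    ```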

  • Create new replica from base backup of db02 on /mount/pgsql/10/wmx_rails_api
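
    Something along these lines (the upstream host and replication user are assumptions):

    ```sh
    # The target directory must be empty; -R writes a recovery.conf so the new
    # instance comes up as a streaming standby of its upstream.
    sudo -u postgres rm -rf /mount/pgsql/10/wmx_rails_api/*
    sudo -u postgres pg_basebackup -h db02 -U replication \
      -D /mount/pgsql/10/wmx_rails_api -X stream -P -R
    ```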

  • Copy *.conf files from /tmp/6432/ to new data directory

  • Start postgres boost01:6432 as streaming replica
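
    Quick sanity check once it is up (the unit name is a guess):

    ```sh
    sudo systemctl start postgresql-10-wmx_rails_api
    psql -p 6432 -c "SELECT pg_is_in_recovery(), pg_last_wal_receive_lsn();"
    ```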

  • Create replica from base backup of boost02:5432 on /var/lib/pgsql/10/data

  • Copy *.conf files from /tmp/5432/ to /var/lib/pgsql/10/data

  • Start postgres boost01:5432 in standby mode
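
    Roughly the same dance for the default instance; on PostgreSQL 10, standby mode comes from the recovery.conf that pg_basebackup -R writes (a repmgr standby clone / standby register would be an equivalent route). Host, user, and unit name below are assumptions:

    ```sh
    sudo systemctl stop postgresql-10
    sudo -u postgres rm -rf /var/lib/pgsql/10/data/*
    sudo -u postgres pg_basebackup -h boost02 -p 5432 -U replication \
      -D /var/lib/pgsql/10/data -X stream -P -R
    sudo cp /tmp/5432/*.conf /var/lib/pgsql/10/data/
    sudo chown postgres: /var/lib/pgsql/10/data/*.conf
    sudo systemctl start postgresql-10
    psql -c "SELECT pg_is_in_recovery();"
    ```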

Failover back to boost01

  • Put site in maintenance mode(?)
  • Switch Production k8s endpoints to boost01
    • external-reporting-compute-database
    • external-reporting-database
  • Roll pgbouncer pods
  • Stop postgres on boost02:5432
  • Promote boost01:5432 from standby to master (see the sketch after this list)
  • Out of maintenance mode
  • Confirm FDW config and connection from boost01:5432/production_boost_reporting to wmx_rails_api_production on 6432 replica.
  • Refresh or rebuild logical replication subscription in session db
  • Re-enable boost etl cron job
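
A sketch of the promote step and FDW check referenced above (the repmgr config path and the foreign table used for the smoke test are assumptions):

```sh
# On boost01, once postgres on boost02:5432 is stopped, promote the standby.
repmgr -f /etc/repmgr/10/repmgr.conf standby promote
repmgr -f /etc/repmgr/10/repmgr.conf cluster show

# FDW smoke test from the reporting database; the foreign table name is a placeholder.
psql -d production_boost_reporting -c "SELECT srvname, srvoptions FROM pg_foreign_server;"
psql -d production_boost_reporting -c "SELECT count(*) FROM some_foreign_table;"
```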