Rafael Henchen activity https://gitlab.com/rhenchen.gitlab 2026-03-20T01:15:37Z tag:gitlab.com,2026-03-20:5224405402 Rafael Henchen deleted project branch duo-edit-20260320-011416 at GitLab.com / GitLab Infrastructure Team / db-migration 2026-03-20T01:15:37Z rhenchen.gitlab Rafael Henchen

Rafael Henchen (985e966e) at 20 Mar 01:15

tag:gitlab.com,2026-03-20:5224405253 Rafael Henchen pushed to project branch master at GitLab.com / GitLab Infrastructure Team / db-migration 2026-03-20T01:15:29Z rhenchen.gitlab Rafael Henchen

Rafael Henchen (2cc5859e) at 20 Mar 01:15

Merge branch 'duo-edit-20260320-011416' into 'master'

... and 1 more commit

tag:gitlab.com,2026-03-20:5224405239 Rafael Henchen accepted merge request !668: Remove patroni-ci-v17-14 through v17-19 nodes from gprd-ci inventory at GitLab.com / GitLab Infrastructure Team /... 2026-03-20T01:15:29Z rhenchen.gitlab Rafael Henchen

Removes the following nodes from dbre-toolkit/inventory/gprd-ci.yml:

  • patroni-ci-v17-14-db-gprd.c.gitlab-production.internal
  • patroni-ci-v17-15-db-gprd.c.gitlab-production.internal
  • patroni-ci-v17-16-db-gprd.c.gitlab-production.internal
  • patroni-ci-v17-17-db-gprd.c.gitlab-production.internal
  • patroni-ci-v17-18-db-gprd.c.gitlab-production.internal
  • patroni-ci-v17-19-db-gprd.c.gitlab-production.internal
tag:gitlab.com,2026-03-20:5224404054 Rafael Henchen opened merge request !668: Remove patroni-ci-v17-14 through v17-19 nodes from gprd-ci inventory at GitLab.com / GitLab Infrastructure Team / d... 2026-03-20T01:14:28Z rhenchen.gitlab Rafael Henchen

Removes the following nodes from dbre-toolkit/inventory/gprd-ci.yml:

  • patroni-ci-v17-14-db-gprd.c.gitlab-production.internal
  • patroni-ci-v17-15-db-gprd.c.gitlab-production.internal
  • patroni-ci-v17-16-db-gprd.c.gitlab-production.internal
  • patroni-ci-v17-17-db-gprd.c.gitlab-production.internal
  • patroni-ci-v17-18-db-gprd.c.gitlab-production.internal
  • patroni-ci-v17-19-db-gprd.c.gitlab-production.internal
tag:gitlab.com,2026-03-20:5224403847 Rafael Henchen pushed new project branch duo-edit-20260320-011416 at GitLab.com / GitLab Infrastructure Team / db-migration 2026-03-20T01:14:18Z rhenchen.gitlab Rafael Henchen

Rafael Henchen (985e966e) at 20 Mar 01:14

Remove patroni-ci-v17-14 through v17-19 nodes from gprd-ci inventory

tag:gitlab.com,2026-03-20:5224290623 Rafael Henchen commented on issue #21592 at GitLab.com / GitLab Infrastructure Team / Production 2026-03-20T00:08:07Z rhenchen.gitlab Rafael Henchen

The reservation for this node and cpu platform was working in the zones that failed. Only deployment seems to be failing.

Screenshot_2026-03-20_at_11.07.08_am

tag:gitlab.com,2026-03-19:5224103595 Rafael Henchen commented on issue #21592 at GitLab.com / GitLab Infrastructure Team / Production 2026-03-19T22:19:33Z rhenchen.gitlab Rafael Henchen

@alexander-sosna ok, so google's documentation on "default" values seems to not be working. I have calculated at #21592 (comment 3172729683) that for a 16TB disk the IOPS would be (6 × 16,384) + 3,000 = 101,304 IOPS and Throughput would be (1.5 × 16,384) + 140 = 24,716 MiB/s (which would be limited at 2,400 MiB/s).

IMO the ideal is to have:

  • 100,000 IOPS
  • 2,400 MiB/s of throughput
tag:gitlab.com,2026-03-19:5221556300 Rafael Henchen commented on issue #21592 at GitLab.com / GitLab Infrastructure Team / Production 2026-03-19T10:55:36Z rhenchen.gitlab Rafael Henchen

Resolved

We modified the CR plan just to remove Phase 3 which was launching nodes 14-19

So this CR objective is to only deploy 3x C4 nodes and perform the switchover of Primary into C4, that should be enough to provide high availability for the CI Primary node.

tag:gitlab.com,2026-03-19:5221551971 Rafael Henchen commented on issue #21592 at GitLab.com / GitLab Infrastructure Team / Production 2026-03-19T10:54:29Z rhenchen.gitlab Rafael Henchen

Resolved This should not be a blocker as we decided to reduce the number of nodes to the 3 nodes as c4-highmem-192

tag:gitlab.com,2026-03-19:5221549102 Rafael Henchen commented on issue #21592 at GitLab.com / GitLab Infrastructure Team / Production 2026-03-19T10:53:45Z rhenchen.gitlab Rafael Henchen

There's a pending request to increase the hyperdisk quota

Screenshot_2026-03-19_at_9.53.29_pm

tag:gitlab.com,2026-03-19:5221544225 Rafael Henchen commented on issue #21592 at GitLab.com / GitLab Infrastructure Team / Production 2026-03-19T10:52:32Z rhenchen.gitlab Rafael Henchen

Issue 3

We have released the 9 * c4-hihghmem-144 reservations. However, the C4 quota have not decreased. Which seems another issue with Google Cloud fleet provisioning.

Screenshot_2026-03-19_at_9.52.19_pm

tag:gitlab.com,2026-03-19:5221388953 Rafael Henchen commented on issue #21592 at GitLab.com / GitLab Infrastructure Team / Production 2026-03-19T10:16:45Z rhenchen.gitlab Rafael Henchen

The c4-hihghmem-144 is only available over Intel Granite Rapids.

So, we'll try to provision the 3 nodes as c4-highmem-192 which are available in Emerald Rapids.

tag:gitlab.com,2026-03-19:5221376751 Rafael Henchen commented on issue #21592 at GitLab.com / GitLab Infrastructure Team / Production 2026-03-19T10:13:43Z rhenchen.gitlab Rafael Henchen

Issue 2

We couldn't launch the 6th gen Intel Granite Rapids in zones B and D. It worked only in zone C. So apparently there's an issue with fleet provisioning by Google for Intel Granite Rapids.


│ Error: Error creating instance: googleapi: Error 400: C4 VM does not support minCpuPlatform Intel Granite Rapids., badRequest

│   with module.patroni-ci-v17.google_compute_instance.instance_with_attached_disk["11"],
│   on .terraform/modules/patroni-ci-v17/instance.tf line 339, in resource "google_compute_instance" "instance_with_attached_disk":
│  339: resource "google_compute_instance" "instance_with_attached_disk" {



│ Error: Error creating instance: googleapi: Error 400: C4 VM does not support minCpuPlatform Intel Granite Rapids., badRequest

│   with module.patroni-ci-v17.google_compute_instance.instance_with_attached_disk["12"],
│   on .terraform/modules/patroni-ci-v17/instance.tf line 339, in resource "google_compute_instance" "instance_with_attached_disk":
│  339: resource "google_compute_instance" "instance_with_attached_disk" {


Screenshot_2026-03-19_at_9.13.15_pm

tag:gitlab.com,2026-03-19:5221169965 Rafael Henchen commented on issue #21592 at GitLab.com / GitLab Infrastructure Team / Production 2026-03-19T09:27:30Z rhenchen.gitlab Rafael Henchen

Issue 1

During the CR execution we had issues with hyperdisk quota and decided to split the node deployment in 2

First MR to create nodes 11-13 which are the critical nodes to provide capacity for us to switchover the Primary node to C4 : https://ops.gitlab.net/gitlab-com/gl-infra/config-mgmt/-/merge_requests/13552

Second MR to create nodes 14-19 which are the Replica-only nodes: https://ops.gitlab.net/gitlab-com/gl-infra/config-mgmt/-/merge_requests/13553

tag:gitlab.com,2026-03-19:5220717000 Rafael Henchen deleted project branch duo-edit-20260319-045356 at GitLab.com / GitLab Infrastructure Team / db-migration 2026-03-19T07:23:18Z rhenchen.gitlab Rafael Henchen

Rafael Henchen (7ef43470) at 19 Mar 07:23

tag:gitlab.com,2026-03-19:5220716535 Rafael Henchen pushed to project branch master at GitLab.com / GitLab Infrastructure Team / db-migration 2026-03-19T07:23:06Z rhenchen.gitlab Rafael Henchen

Rafael Henchen (8484c93d) at 19 Mar 07:23

Merge branch 'duo-edit-20260319-045356' into 'master'

... and 2 more commits

tag:gitlab.com,2026-03-19:5220716530 Rafael Henchen accepted merge request !667: Update patroni-ci nodes to v17 with expanded node list at GitLab.com / GitLab Infrastructure Team / db-migration 2026-03-19T07:23:06Z rhenchen.gitlab Rafael Henchen

Updates the patroni-ci inventory to reference v17 nodes with an expanded list including:

  • Node 02
  • Nodes 10-19
  • Nodes 101-110

Total of 21 nodes replacing the previous 6 v16 nodes.