Pavel Shutsin activity

Pavel Shutsin commented on merge request !226274 at GitLab.org / GitLab

2026-03-18T18:16:50Z

@fmccawley could you please review again? I addressed your comments and tier-3 pipeline is green now.

Pavel Shutsin commented on merge request !227130 at GitLab.org / GitLab

2026-03-18T18:11:59Z

@ahegyi could you please review this MR as clickhouse and database maintainer?

Pavel Shutsin commented on merge request !227130 at GitLab.org / GitLab

2026-03-18T18:11:09Z

@ashvins could you please review this MR as backend ?

Pavel Shutsin commented on merge request !226274 at GitLab.org / GitLab

2026-03-18T17:36:49Z

done. I've added a test for guest user.

Pavel Shutsin commented on merge request !226274 at GitLab.org / GitLab

2026-03-18T17:36:40Z

done. I've added a test for guest user.

Pavel Shutsin pushed to project branch 589610-add-usage-events-ae at GitLab.org / GitLab

2026-03-18T16:47:00Z

Pavel Shutsin (2e262727) at 18 Mar 16:47

Introduce featuresCount

... and 877 more commits

Pavel Shutsin commented on merge request !226274 at GitLab.org / GitLab

2026-03-18T15:52:45Z

@rob.hunt yes analytics type will be available on both levels. Project and Group (and maybe Organizations and Instance one day 🤷 )

Pavel Shutsin commented on merge request !226274 at GitLab.org / GitLab

2026-03-18T14:42:37Z

@fmccawley

There isn't any check that enforces this

that's correct.

Could we add a guard to the mount_aggregation_engine method to enforce it?

Unfortunately, that's not really possible due to the various flexible ways authorization can be defined. There are cases where the parent object handles authorization, so the child doesn't need to. There might also be cases where the aggregation engine should be accessible without authorization. Given these considerations, I think updating the documentation is the best approach we can take at this moment. If you'd like, we can open a follow-up issue to discuss this further and avoid blocking this MR.

Pavel Shutsin commented on merge request !226274 at GitLab.org / GitLab

2026-03-18T14:42:37Z

No, there isn't. The field was marked as "Not for production use" and "Experimental" because it's a work-in-progress endpoint. There is a more stable alternative with the older architecture available under the aiUsageData endpoint, which is also experimental but known to be in use by clients. So I believe we are safe here.

Pavel Shutsin commented on merge request !226274 at GitLab.org / GitLab

2026-03-18T14:42:37Z

@fmccawley it's just to emphasize for other developers that both approaches are possible. authorize: :read_pro_ai_analytics is simpler while class call is more flexible. Both approaches are standard to our GraphQL structures.

Pavel Shutsin commented on merge request !226274 at GitLab.org / GitLab

2026-03-18T14:42:36Z

I noticed the rspec:undercoverage test is showing a warning https://gitlab.com/gitlab-org/gitlab/-/jobs/13532575386, this will hard fail when the tier 3 pipeline runs.

It should not, because tier-1 doesn't run full spec suite so coverage from request specs is not included. Anyway Adam already approved the MR so I just triggered new pipeline to see if there's any undercoverage issues.

Pavel Shutsin commented on issue #590777 at GitLab.org / GitLab

2026-03-18T14:42:00Z

That will require standardized way of deduplication.

Pavel Shutsin commented on issue #590777 at GitLab.org / GitLab

2026-03-18T14:41:28Z

We definitely should. And I believe it should be fully hidden inside AEs if possible.

Pavel Shutsin pushed to project branch expose-code-suggestions-ae-to-graphql at GitLab.org / GitLab

2026-03-18T14:23:47Z

Pavel Shutsin (fe29b9fc) at 18 Mar 14:23

remove disclamer

Pavel Shutsin commented on merge request !226274 at GitLab.org / GitLab

2026-03-18T14:02:27Z

@rob.hunt then we can update the label and keep it Experimental but not "Not for production use yet".

Pavel Shutsin pushed to project branch expose-code-suggestions-ae-to-graphql at GitLab.org / GitLab

2026-03-18T13:28:50Z

Pavel Shutsin (4c761a8a) at 18 Mar 13:28

Improve spec coverage

... and 256 more commits

Pavel Shutsin approved merge request !226902: Store Siphon configuration at GitLab.org / GitLab

2026-03-18T13:14:04Z

What does this MR do and why?

This MR adds Siphon configured tables and adds the Siphon-specific configuration. We also do testing so the Siphon configuration matches the PostgreSQL and ClickHouse schemas.

The goal of these configurations is to act as a SSoT for tables which are going to be Siphon-replicated on our GitLab environments (GDK, .com, Dedicated, Self-Managed, etc.)

Siphon Configuration Keys

Config	Description
`table`	The PostgreSQL source table name
`database`	The database where the table resides (e.g., `main`), this tells us which Siphon producer to use for this table
`ignored_columns`	List of columns to exclude from replication (sensitive/encrypted data)
`replication_targets`	Array of replication target configurations (for now we only register ClickHouse as the target)
`replication_targets[].name`	Name of the replication target (e.g., `clickhouse_main`)
`replication_targets[].target`	Target ClickHouse table name where the data will be replicated
`replication_targets[].priority`	Replication priority (numeric value), optional. During initial setup we want to replicate specific tables first (organizations, projects, namespaces)
`replication_targets[].dedup_by`	Columns used for deduplication when upserting data. This is an important config for `traversal_path` de-normalized tables. (https://docs.gitlab.com/development/database/clickhouse/clickhouse_table_design_with_siphon/#siphon-configuration-and-the-chicken-and-egg-problem)

Config rules have been implemented in the RSpec matcher:

If tables are 1:1 match: ensure primary keys are matching.
If CH table is de-normalized or optimized for a different access pattern (dedup_by), ensure that the PostgreSQL primary keys are suffix of the ClickHouse primary keys.

MR acceptance checklist

Evaluate this MR against the MR acceptance checklist. It helps you analyze changes to reduce risks in quality, performance, reliability, security, and maintainability.

Pavel Shutsin commented on merge request !226274 at GitLab.org / GitLab

2026-03-18T11:56:18Z

@jiaan 🤷 whenever we decide that the structure is stable. @rob.hunt suggested %19.0 somewhere. It's fine for me.

Pavel Shutsin commented on merge request !226711 at GitLab.org / GitLab

2026-03-18T11:53:00Z

@tkuah lets open a follow-up for this if you don't mind. I'm not 100% sure we should. For this specific case I believe it will look fine, but for generalized case dimensions can have arguments, so not everything will fit into Parameterized tests. I'd love to have common pattern established between all aggregation_engines/*_spec.rb I hope it will emerge later on when we have more engines => more usecases.