Pam Artiaga activity https://gitlab.com/partiaga 2026-03-10T11:42:48Z

tag:gitlab.com,2026-03-10:5187114122 Pam Artiaga commented on merge request !226439 at GitLab.org / GitLab 2026-03-10T11:42:48Z partiaga Pam Artiaga

I created an issue for the refactor: Remove unit_primitive references from ActiveCon... (#592907)

tag:gitlab.com,2026-03-10:5187081260 Pam Artiaga opened issue #592907: Remove unit_primitive references from ActiveContext classes at GitLab.org / GitLab 2026-03-10T11:35:26Z partiaga Pam Artiaga

tag:gitlab.com,2026-03-10:5187071556 Pam Artiaga commented on merge request !226439 at GitLab.org / GitLab 2026-03-10T11:33:08Z partiaga Pam Artiaga

cc @maddievn - sorry, I meant to post an explanation for this one but forgot 🤦

Explanation:

The unit_primitive parameter here is mainly to make the initialization call from ActiveContext::EmbeddingModel.generate_embeddings work.

But ideally, we should treat the unit_primitive in the Gitlab::Llm::Embeddings::ModelDefinition object as the source of truth. This would follow the approach in the rest of the LLM module, where the source of truth for the unit primitive is within the LLM classes.

In a later refactor, we can remove the unit_primitive references in ActiveContext classes except for the ModelSelector.
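
To illustrate the idea, here is a minimal sketch of what "definition as source of truth" could look like. Everything in it is hypothetical: the Sketch module, its class shapes, and the example values are stand-ins for illustration only, not the actual ActiveContext or Gitlab::Llm code.

```ruby
# Hypothetical sketch; names and values are illustrative only,
# not the actual GitLab implementation.
module Sketch
  # Stands in for an LLM-side model definition that owns the unit primitive.
  ModelDefinition = Struct.new(:name, :unit_primitive, keyword_init: true)

  # Stands in for an ActiveContext-side embedding model.
  class EmbeddingModel
    attr_reader :model_definition

    # unit_primitive is still accepted so that an existing initialization
    # call keeps working, but it is intentionally not stored.
    def initialize(model_definition:, unit_primitive: nil)
      @model_definition = model_definition
    end

    # The model definition is treated as the source of truth.
    def unit_primitive
      model_definition.unit_primitive
    end
  end
end

definition = Sketch::ModelDefinition.new(
  name: 'example-embedding-model',
  unit_primitive: 'example_unit_primitive'
)

# A caller may still pass a value, but the definition's value wins.
model = Sketch::EmbeddingModel.new(
  model_definition: definition,
  unit_primitive: 'stale_value'
)
puts model.unit_primitive
```

Once no caller passes unit_primitive any more, the keyword argument in a class shaped like this could simply be dropped, which is roughly what the later refactor would amount to.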

tag:gitlab.com,2026-03-10:5186056220 Pam Artiaga commented on epic #20110 at GitLab.org 2026-03-10T07:59:40Z partiaga Pam Artiaga

once we know how customers will be billed, we need to implement it. As with everything in AI-land, it's really hard to follow how things are implemented. Can we get someone with expertise on DAP billing to help out with what needs to be done?

@maddievn - There's an issue for implementing the billing in https://gitlab.com/gitlab-org/modelops/applied-ml/code-suggestions/ai-assist/-/work_items/1985+

I've written out what I know needs to be done + the unknowns, which are pretty major, and there may be more unknowns I'm not aware of. We definitely need someone who could guide us with this.

tag:gitlab.com,2026-03-10:5185863959 Pam Artiaga commented on merge request !226439 at GitLab.org / GitLab 2026-03-10T07:04:16Z partiaga Pam Artiaga

Order is relevant.

tag:gitlab.com,2026-03-10:5185854209 Pam Artiaga commented on merge request !226439 at GitLab.org / GitLab 2026-03-10T07:01:49Z partiaga Pam Artiaga

Note: I've updated the test setups for Collections::Code, References::Code, and Queries::Code so that these specs have no idea about the specific LLM class being used (Ai::ActiveContext::Embeddings::Code::VertexText vs ::Gitlab::Llm::Embeddings::CodeEmbeddings). The tests were getting a bit unwieldy and brittle with these classes having that knowledge 😅
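
As a rough illustration of the test setup idea (not the actual spec code), a class that receives its embeddings generator from outside can be exercised with any stand-in. The Sketch::CodeCollection class and the embeddings_generator parameter below are hypothetical names made up for this example.

```ruby
# Hypothetical sketch; names are illustrative, not the real GitLab classes.
module Sketch
  # Stands in for a class under test that needs embeddings but should not
  # know which concrete LLM class produces them.
  class CodeCollection
    def initialize(embeddings_generator:)
      @embeddings_generator = embeddings_generator
    end

    # Delegates to whatever generator was injected.
    def embed(contents)
      @embeddings_generator.call(contents)
    end
  end
end

# A test double: any callable that maps each input to a vector will do.
# Here each "embedding" is just the text length, to keep results predictable.
fake_generator = ->(contents) { contents.map { |text| [text.length.to_f] } }

collection = Sketch::CodeCollection.new(embeddings_generator: fake_generator)
result = collection.embed(['def a; end', 'x'])
```

With this shape, swapping one LLM class for another does not touch the test, because the test only depends on the callable it injects.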

cc @maddievn

tag:gitlab.com,2026-03-10:5185829324 Pam Artiaga pushed to project branch 1866-add-embeddings-endpoint at GitLab.org / ModelOps / AI Assisted (formerly Applied ML) / Code Suggestions / AI Gateway 2026-03-10T06:53:24Z partiaga Pam Artiaga

Pam Artiaga (4bb9d9a2) at 10 Mar 06:53

refactor: propagate bad request errors to client

tag:gitlab.com,2026-03-10:5185574993 Pam Artiaga commented on epic #20110 at GitLab.org 2026-03-10T05:13:35Z partiaga Pam Artiaga

Thanks for this summary @maddievn 🙇

I also posted a question in gitlab#536642 (comment 3144609929), about us possibly moving all Duo Self-hosted support (including Gitlab-operated models) to post-GA altogether.

A lot of the prep work we've done for Duo Self-hosted support is also the foundation for billing tracking (which imo should be a non-negotiable requirement for GA), so we are not throwing away completed work. 🙂

In any case, that question is something for everyone to weigh in on and I'm OK with it either way, but the outcome of that discussion would also influence the GA/post-GA list here.

tag:gitlab.com,2026-03-10:5185525696 Pam Artiaga pushed to project branch 590572-code-embeddings-llm-class at GitLab.org / GitLab 2026-03-10T04:50:41Z partiaga Pam Artiaga

Pam Artiaga (c41ef31c) at 10 Mar 04:50

Remove llm class references in ActiveContext tests

tag:gitlab.com,2026-03-10:5185481498 Pam Artiaga pushed to project branch 590572-code-embeddings-llm-class at GitLab.org / GitLab 2026-03-10T04:28:09Z partiaga Pam Artiaga

Pam Artiaga (29a95c1a) at 10 Mar 04:28

Resolve test failures in ActiveContext classes

tag:gitlab.com,2026-03-10:5185164085 Pam Artiaga commented on issue #536642 at GitLab.org / GitLab 2026-03-10T01:43:57Z partiaga Pam Artiaga

[ActiveContext SM]: UI for creating connection (#585318) could not be a blocker. This is just the frontend for self-hosted,

@arturoherrero - I thought we had settled on including Duo self-hosted support for GA? IMO, for GA, it would make sense to have the frontend ready for that, instead of asking customers to configure their vector stores through the console.

Anyway, this does bring up the same question as #592744 (closed) about what we should really include for GA. ⬇️ ⬇️ ⬇️


The GA requirements I listed above can be broken down into 3 groups: Billing, Duo Self-hosted support, and outstanding fixes/investigations.

Shall we go ahead and drop the Duo self-hosted support out of the requirements for GA? This does not change the timeline of when we can get it done, but it means we can go to GA earlier if we also make Semantic Code Search available outside of MCP.

WDYT @maddievn @arturoherrero @tgao3701908 @changzhengliu @mnohr

tag:gitlab.com,2026-03-10:5185160176 Pam Artiaga commented on merge request !226439 at GitLab.org / GitLab 2026-03-10T01:41:23Z partiaga Pam Artiaga

The order is important.

tag:gitlab.com,2026-03-10:5185159628 Pam Artiaga commented on merge request !226439 at GitLab.org / GitLab 2026-03-10T01:40:59Z partiaga Pam Artiaga

The order is important.

tag:gitlab.com,2026-03-10:5185143163 Pam Artiaga commented on issue #592744 at GitLab.org / GitLab 2026-03-10T01:32:13Z partiaga Pam Artiaga

Is the billing/licensing model for semantic search calls resolved?

No, I still have an outstanding question regarding how to bill embeddings requests for indexing for SM customers, see https://gitlab.com/gitlab-org/gitlab/-/work_items/586372#note_3141209402.

Does exposing via REST (outside of Duo) change the billing consideration?

It shouldn't.

tag:gitlab.com,2026-03-10:5185140003 Pam Artiaga commented on issue #592744 at GitLab.org / GitLab 2026-03-10T01:30:18Z partiaga Pam Artiaga

Clarification on General Availability blockers

cc @dgruzd @tgao3701908 @changzhengliu @maddievn @arturoherrero

Regarding this point:

Unblock compliance-restricted customers without waiting for MCP to reach GA

The reason that Semantic Code Search is still in Beta is not just because of the MCP Server. I have listed the requirements for GA in #536642 (comment 3133196028), and we have committed that the timeline for completion would correspond with MCP Server going to GA.

This means that even if we add Semantic Code Search to a REST API, we still need to complete the issues in #536642 (comment 3133196028) to reach GA and thus make it available for compliance-restricted customers.

(There may be some work items we can move to post-GA, but we can continue discussions in #536642 (comment 3133196028))

tag:gitlab.com,2026-03-09:5180935281 Pam Artiaga commented on merge request !4612 at GitLab.org / ModelOps / AI Assisted (formerly Applied ML) / Code Suggestions / AI Gateway 2026-03-09T07:14:46Z partiaga Pam Artiaga

Update:

I had to do a few rebases due to merge conflicts.

There is also a failure for ingest:dry-run, but that looks like a CI resource problem.

tag:gitlab.com,2026-03-09:5180912903 Pam Artiaga pushed to project branch 590572-code-embeddings-llm-class at GitLab.org / GitLab 2026-03-09T07:06:40Z partiaga Pam Artiaga

Pam Artiaga (473b02f6) at 09 Mar 07:06

Add tests and fix functionalities

tag:gitlab.com,2026-03-09:5180567953 Pam Artiaga pushed to project branch 590572-code-embeddings-llm-class at GitLab.org / GitLab 2026-03-09T04:31:59Z partiaga Pam Artiaga

Pam Artiaga (bb4c7100) at 09 Mar 04:31

Add MR url for feature flag config

tag:gitlab.com,2026-03-09:5180565814 Pam Artiaga pushed to project branch 1866-add-embeddings-endpoint at GitLab.org / ModelOps / AI Assisted (formerly Applied ML) / Code Suggestions / AI Gateway 2026-03-09T04:30:40Z partiaga Pam Artiaga

Pam Artiaga (bfc5a134) at 09 Mar 04:30

refactor: add handling for empty response data

... and 7 more commits

tag:gitlab.com,2026-03-09:5180563621 Pam Artiaga pushed to project branch 590572-code-embeddings-llm-class at GitLab.org / GitLab 2026-03-09T04:29:21Z partiaga Pam Artiaga

Pam Artiaga (e38fface) at 09 Mar 04:29

Introduce generic embeddings llm class