Fabrizio J. Piva activity https://gitlab.com/fpiva 2026-03-13T16:19:00Z tag:gitlab.com,2026-03-13:5202130554 Fabrizio J. Piva commented on merge request !4732 at GitLab.org / ModelOps / AI Assisted (formerly Applied ML) / Code Suggestions / AI Gateway 2026-03-13T16:11:36Z fpiva Fabrizio J. Piva

Hi @junminghuang 👋 I will be on PTO next week until 20/03, so if we get a maintainer review, I won't be able to drive it. If that happens, would you mind driving this MR until its gets merged?

If we don't get feedback, then we can assign somebody else when I am back.

I will be adding you in my PTO coverage issue just in case. Thanks!

tag:gitlab.com,2026-03-13:5201581533 Fabrizio J. Piva commented on merge request !2044 at GitLab.org / ModelOps / AI Model Validation and Research / AI Evaluation / CEF 2026-03-13T14:04:46Z fpiva Fabrizio J. Piva

Hi @tle_gitlab, feel free to use duo_chat.security_analyst.1 to test this pipeline, it contains 200 examples and I think we have solved the problem of LLM lack of context.

Do not hesitate to use it, curious to know your thoughts

tag:gitlab.com,2026-03-13:5200770063 Fabrizio J. Piva commented on merge request !2061 at GitLab.org / ModelOps / AI Model Validation and Research / AI Evaluation / CEF 2026-03-13T10:50:42Z fpiva Fabrizio J. Piva

Note: Unfortunately there is not way to fetch the dataset size for a certain dataset split via API call.

Before, I was using read_dataset, but this function does not support passing the dataset split name, which means that it will always return the full dataset size.

When working with splits, this is not useful anymore, so I had to do the counting myself.

tag:gitlab.com,2026-03-13:5200760097 Fabrizio J. Piva commented on merge request !2061 at GitLab.org / ModelOps / AI Model Validation and Research / AI Evaluation / CEF 2026-03-13T10:48:10Z fpiva Fabrizio J. Piva

Hi @tle_gitlab, I just updated one last time the MR to compute the dataset_size considering the dataset split. Before it was always returning the whole dataset size, which is incorrect. I think now it is ready for review 👍

tag:gitlab.com,2026-03-13:5200750579 Fabrizio J. Piva pushed to project branch fjp-correct-data-generation-pipeline-for-security-analyst-missing-file-content at GitLab.org / ModelOps / AI Model... 2026-03-13T10:45:49Z fpiva Fabrizio J. Piva

Fabrizio J. Piva (4b8394c6) at 13 Mar 10:45

Update dataset_size attribute computation to be compatible with splits

tag:gitlab.com,2026-03-13:5200379633 Fabrizio J. Piva commented on merge request !2061 at GitLab.org / ModelOps / AI Model Validation and Research / AI Evaluation / CEF 2026-03-13T09:19:57Z fpiva Fabrizio J. Piva

Thanks @tle_gitlab! I will mark this as solved then 👍

tag:gitlab.com,2026-03-12:5196019971 Fabrizio J. Piva commented on merge request !4732 at GitLab.org / ModelOps / AI Assisted (formerly Applied ML) / Code Suggestions / AI Gateway 2026-03-12T09:40:54Z fpiva Fabrizio J. Piva

Hi @erran 👋 would you have some time to provide a maintainer review on this? No worries at all if you are busy, just let us know and we can find someone else. Thanks!

tag:gitlab.com,2026-03-11:5193571418 Fabrizio J. Piva pushed to project branch fjp-correct-data-generation-pipeline-for-security-analyst-missing-file-content at GitLab.org / ModelOps / AI Model... 2026-03-11T17:19:47Z fpiva Fabrizio J. Piva

Fabrizio J. Piva (d568ddff) at 11 Mar 17:19

Fix linter

tag:gitlab.com,2026-03-11:5193569921 Fabrizio J. Piva pushed to project branch fjp-correct-data-generation-pipeline-for-security-analyst-missing-file-content at GitLab.org / ModelOps / AI Model... 2026-03-11T17:19:24Z fpiva Fabrizio J. Piva

Fabrizio J. Piva (d6cd63c6) at 11 Mar 17:19

Add support for dataset split specification

tag:gitlab.com,2026-03-11:5193529888 Fabrizio J. Piva commented on merge request !2061 at GitLab.org / ModelOps / AI Model Validation and Research / AI Evaluation / CEF 2026-03-11T17:08:55Z fpiva Fabrizio J. Piva

Done, thank you

tag:gitlab.com,2026-03-11:5193529080 Fabrizio J. Piva pushed to project branch fjp-correct-data-generation-pipeline-for-security-analyst-missing-file-content at GitLab.org / ModelOps / AI Model... 2026-03-11T17:08:42Z fpiva Fabrizio J. Piva

Fabrizio J. Piva (7416693b) at 11 Mar 17:08

Removed id in projects.get

tag:gitlab.com,2026-03-11:5193519848 Fabrizio J. Piva commented on merge request !2061 at GitLab.org / ModelOps / AI Model Validation and Research / AI Evaluation / CEF 2026-03-11T17:06:13Z fpiva Fabrizio J. Piva

Hi @tle_gitlab 👋 would you mind giving this MR a quick look? It is to correct what we discovered during our sync call

tag:gitlab.com,2026-03-11:5193517317 Fabrizio J. Piva commented on merge request !2061 at GitLab.org / ModelOps / AI Model Validation and Research / AI Evaluation / CEF 2026-03-11T17:05:40Z fpiva Fabrizio J. Piva

@tle_gitlab do you think this is necessary? 🤔

tag:gitlab.com,2026-03-11:5193509810 Fabrizio J. Piva commented on merge request !2061 at GitLab.org / ModelOps / AI Model Validation and Research / AI Evaluation / CEF 2026-03-11T17:03:51Z fpiva Fabrizio J. Piva

@GitLabDuo the example I gave you is as follows:

file = project.files.get(gt_file_name, ref="main")
file_bytes = file.decode()
ontent = yaml.safe_load(file_bytes.decode("utf-8"))

But that example was processing a yaml file. If we want to get the raw content, we just remove yaml.safe_load() wrapping and we would get the following:

file = project.files.get(gt_file_name, ref="main")
file_bytes = file.decode()
content = file_bytes.decode("utf-8")
tag:gitlab.com,2026-03-11:5193494089 Fabrizio J. Piva commented on merge request !2061 at GitLab.org / ModelOps / AI Model Validation and Research / AI Evaluation / CEF 2026-03-11T17:00:09Z fpiva Fabrizio J. Piva

@GitLabDuo please check this example: https://gitlab.com/gitlab-org/modelops/ai-model-validation-and-research/ai-evaluation/prompt-library/-/blob/main/cef/security_testing/data/extract.py#L282

I think what I did is correct

tag:gitlab.com,2026-03-11:5193479862 Fabrizio J. Piva commented on merge request !2061 at GitLab.org / ModelOps / AI Model Validation and Research / AI Evaluation / CEF 2026-03-11T16:56:43Z fpiva Fabrizio J. Piva

Thanks for the feedback, solved

tag:gitlab.com,2026-03-11:5193478883 Fabrizio J. Piva pushed to project branch fjp-correct-data-generation-pipeline-for-security-analyst-missing-file-content at GitLab.org / ModelOps / AI Model... 2026-03-11T16:56:27Z fpiva Fabrizio J. Piva

Fabrizio J. Piva (edd32868) at 11 Mar 16:56

Update check for GitLab Rest client