Generic feature extraction POC#2876
Conversation
Adel-Moumen left a comment
Do you have an example of a train.py integration of your new tokens loader?
I don't think that this script should be here. I think it should be dataset-dependent, similar to what we are doing for, say, librispeech_preparation.py.
This was borrowed from DASB; my older approach integrated it with preparation.
Maybe you should move this to a unit test. I think the extraction will require extensive tests to make sure the loading/saving process is correct.
I will create unit tests
@Adel-Moumen: Unit tests created in #2938
I have some private examples, but they are part of new work in progress that is not ready to be merged yet, as well as older incarnations of Tokotron. I would suggest choosing one existing recipe and integrating it.
Also, a quick question for @pplantinga: don't you think we should aim for a single backend? Given that we are trying to minimise the number of dependencies, I would find it better to stick to the best and most general-purpose solution (instead of having something too general). I believe most of them share similar pros and cons. In our context, I am not sure we really need something very state-of-the-art; I would prefer something easy to use, where we need only a low effort to maintain the integration. So maybe something like numpy or h5 is enough.
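To make the numpy-backend suggestion concrete, here is a minimal sketch of what a single-backend feature cache could look like, assuming a compressed `.npz` archive keyed by utterance ID. The function names (`save_features`, `load_features`) and the archive layout are illustrative assumptions, not the PR's actual API:

```python
# Minimal sketch (illustrative, not the PR's API) of a numpy-backed
# feature cache: one compressed .npz archive, one array per utterance ID.
import os
import tempfile

import numpy as np


def save_features(path, features):
    """Save a dict of {utterance_id: np.ndarray} as a compressed archive."""
    np.savez_compressed(path, **features)


def load_features(path):
    """Load the archive back into a {utterance_id: np.ndarray} dict."""
    with np.load(path) as data:
        return {key: data[key] for key in data.files}


# Round-trip check with fake features of different kinds and lengths.
feats = {
    "utt1": np.arange(120),            # stand-in for discrete tokens
    "utt2": np.random.randn(80, 256),  # stand-in for continuous embeddings
}
path = os.path.join(tempfile.mkdtemp(), "features.npz")
save_features(path, feats)
restored = load_features(path)
assert all(np.array_equal(feats[k], restored[k]) for k in feats)
```

A single-file archive like this keeps the dependency surface small (numpy only), at the cost of not supporting partial rewrites the way HDF5 does.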
See #2938 for a simplified H5-only version.
Closed in favour of #2985.
What does this PR do?
(Work in progress) A universal feature extractor that extracts arbitrary features from a dataset (discrete tokens, continuous representations, etc.) and saves them in arbitrary formats.
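The idea of decoupling "what to extract" from "how to save" can be sketched as follows. This is a hypothetical illustration of the design, not the PR's actual interface; `extract_features`, `extractor`, and `saver` are made-up names, and the mean/variance "features" are placeholders for real token or embedding extraction:

```python
# Hypothetical sketch of a universal extractor: any callable produces the
# features, and any callable persists them, so backends are swappable.
import numpy as np


def extract_features(dataset, extractor, saver):
    """Run `extractor` on each (uid, item) pair and persist via `saver`."""
    for uid, item in dataset:
        saver(uid, extractor(item))


# Example run with an in-memory "backend" and trivial stand-in features.
storage = {}
dataset = [("utt1", np.ones(16000)), ("utt2", np.zeros(8000))]
extract_features(
    dataset,
    extractor=lambda wav: np.array([wav.mean(), wav.var()]),
    saver=lambda uid, feats: storage.__setitem__(uid, feats),
)
# storage now maps each utterance ID to its feature array
```

Swapping the `saver` for an h5py- or numpy-backed writer would change the storage format without touching the extraction logic, which is the property the PR description aims for.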
Before submitting
PR review
Reviewer checklist