FocalCodec [NeurIPS 2025] by lucadellalib · Pull Request #3000 · speechbrain/speechbrain

lucadellalib · 2025-11-21T01:55:36Z

Add FocalCodec training recipe.

mravanelli

Great Job @lucadellalib! I did a quick code inspection and shared some comments about the docstrings.
Other Comments:

Regarding the extra dependency, please take a look at our policy here. This code should be compliant with that. @pplantinga can advise.
I'm not sure about having "metrics" as a local folder. I think we might need to reuse the same metrics in other recipes, for instance the streamable FocalCodec and the extension of FocalCodec to LibriLight. Maybe we can put it in SpeechBrain/metrics? Any advice, @pplantinga?

mravanelli · 2025-11-21T16:05:20Z

recipes/LibriTTS/focalcodec/train_decoder.py

+
+
+class Generation(sb.Brain):
+    def fit_batch(self, batch):


For every method, we typically add some short description about its functionality for better clarity (See for instance this). It is even more important here as some methods are not standard.
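As an illustration of the kind of short method description requested here, the following is a minimal sketch. The `Brain` base class is a hypothetical stand-in so the snippet runs without SpeechBrain installed, the method body is a placeholder rather than the recipe's actual training step, and the `Arguments` / `Returns` section headers are assumed to match the project's docstring convention.

```python
class Brain:
    """Hypothetical stand-in for sb.Brain so the sketch runs on its own."""

    def fit_batch(self, batch):
        raise NotImplementedError


class Generation(Brain):
    """Illustrative subclass; the body below is NOT the recipe's code."""

    def fit_batch(self, batch):
        """Runs one training step on a single batch.

        Arguments
        ---------
        batch : list of float
            Placeholder batch; the real recipe receives a padded batch object.

        Returns
        -------
        loss : float
            Placeholder loss (mean of squared values) for this batch.
        """
        # Placeholder computation standing in for forward pass + loss.
        return sum(x * x for x in batch) / len(batch)
```

Even a one-line summary plus the argument and return sections is usually enough for a reader to tell a non-standard override apart from the default behavior.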

mravanelli · 2025-11-21T16:06:23Z

recipes/LibriTTS/focalcodec/train_quantizer.py

+            return super()._fit_valid(valid_set, epoch, enable)
+
+    @torch.no_grad()
+    def evaluate_batch(self, batch, stage):


Make sure every method has a short description.

mravanelli · 2025-11-21T16:07:31Z

recipes/LibriTTS/focalcodec/utils.py

+
+
+def prepare_recipe(hparams, run_opts):
+    # Dataset preparation


Add docstring

mravanelli · 2025-11-21T16:08:43Z

recipes/LibriTTS/focalcodec/utils.py

+    audio_backend="soundfile",
+    **kwargs,
+):
+    """This function prepares the datasets to be used in the brain class.


Improve the docstring by explaining all the parameters. There are a lot of them in this case, but documenting them improves clarity and usability.
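One way to act on this comment is to document each parameter and then check mechanically that none was missed. The sketch below assumes a hypothetical function name (`prepare_datasets`) and hypothetical parameters (`data_folder`, `sample_rate`); only `audio_backend` and `**kwargs` appear in the diff above. The checker is a crude substring match, not a docstring parser.

```python
import inspect


def prepare_datasets(data_folder, sample_rate=16000, audio_backend="soundfile", **kwargs):
    """Prepares the datasets to be used in the brain class.

    Arguments
    ---------
    data_folder : str
        Path to the dataset root (hypothetical parameter).
    sample_rate : int
        Target sample rate in Hz (hypothetical parameter).
    audio_backend : str
        Backend used to read audio files.
    **kwargs : dict
        Extra options, ignored in this sketch.

    Returns
    -------
    config : dict
        The resolved settings (placeholder for the real datasets).
    """
    return {
        "data_folder": data_folder,
        "sample_rate": sample_rate,
        "audio_backend": audio_backend,
    }


def undocumented_params(func):
    """Returns the parameter names of `func` that its docstring never mentions.

    This is a simple substring check, so very short names may be
    reported as documented by accident.
    """
    doc = inspect.getdoc(func) or ""
    return [name for name in inspect.signature(func).parameters if name not in doc]
```

Calling `undocumented_params(prepare_datasets)` returns an empty list here; running it on a function without a docstring returns all of its parameter names.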

mravanelli · 2025-11-21T16:10:05Z

recipes/LibriTTS/focalcodec/utils.py

+    provides = ["sig"]
+
+    def audio_pipeline_train(wav):
+        original_sample_rate = sb.dataio.dataio.read_audio_info(wav).sample_rate


Make sure all the functions have a short docstring

mravanelli · 2025-11-21T16:16:46Z

recipes/LibriTTS/focalcodec/metrics/dwer.py

+
+
+class DWER(MetricStats):
+    def __init__(


Add docstring with a working example

mravanelli · 2025-11-21T16:17:41Z

recipes/LibriTTS/focalcodec/metrics/speaker_similarity.py

+
+
+class SpkSimWavLM(MetricStats):
+    def __init__(


Add a docstring with a working example (such that we can test all with our doc tests)
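A docstring example only helps the doc tests if it actually executes. The sketch below shows the pattern with a toy metric: `MeanScore` is a hypothetical class that mimics an append/summarize interface but deliberately does not subclass SpeechBrain's `MetricStats`, so it runs without any dependencies. The `run_example` helper uses the standard `doctest` module to execute the example embedded in the class docstring.

```python
import doctest


class MeanScore:
    """Toy metric that averages per-utterance scores.

    Hypothetical stand-in for a metric class, used only to show
    a docstring example that a doctest runner can execute.

    Example
    -------
    >>> metric = MeanScore()
    >>> metric.append(["utt1", "utt2"], [0.5, 1.0])
    >>> metric.summarize()
    0.75
    """

    def __init__(self):
        self.ids = []
        self.scores = []

    def append(self, ids, scores):
        """Stores the scores for a batch of utterance ids."""
        self.ids.extend(ids)
        self.scores.extend(scores)

    def summarize(self):
        """Returns the mean of all stored scores."""
        return sum(self.scores) / len(self.scores)


def run_example(cls):
    """Runs the doctest in the docstring of `cls`; returns (failed, attempted)."""
    parser = doctest.DocTestParser()
    test = parser.get_doctest(cls.__doc__, {cls.__name__: cls}, cls.__name__, None, 0)
    runner = doctest.DocTestRunner(verbose=False)
    result = runner.run(test)
    return result.failed, result.attempted
```

For metrics that wrap large pretrained models, a real docstring example would likely need a small or mocked model to stay fast; that choice is left open here.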

mravanelli · 2025-11-21T16:18:01Z

recipes/LibriTTS/focalcodec/metrics/utmos.py

+
+
+class UTMOS(MetricStats):
+    def __init__(self, sample_rate, model=None):


Add docstring with example

mravanelli · 2025-11-21T16:18:59Z

speechbrain/lobes/models/HifiGAN.py



+class HingeGLoss(nn.Module):
+    """Hinge Generator Loss


Add the example to the docstring

mravanelli · 2025-11-21T16:19:22Z

speechbrain/lobes/models/HifiGAN.py



+class HingeDLoss(nn.Module):
+    """Hinge Discriminator Loss


Add example
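For context on what such docstring examples might demonstrate, here is a framework-free sketch of the hinge GAN objective in its common formulation: the generator minimizes the negative mean discriminator score on generated samples, and the discriminator minimizes clamped margins on real and generated scores. This is an assumption about the general technique, not a copy of the `HingeGLoss` / `HingeDLoss` classes in the PR, which operate on tensors and may differ (some codec implementations use a clamped generator loss instead). The function names are hypothetical.

```python
def hinge_g_loss(fake_scores):
    """Hinge generator loss: minus the mean score on generated samples.

    Example
    -------
    >>> hinge_g_loss([1.0, 3.0])
    -2.0
    """
    return -sum(fake_scores) / len(fake_scores)


def hinge_d_loss(real_scores, fake_scores):
    """Hinge discriminator loss with a margin of 1 on both sides.

    Real scores are pushed above +1 and generated scores below -1.

    Example
    -------
    >>> hinge_d_loss([2.0, 0.0], [-2.0, 0.0])
    1.0
    """
    # max(0, 1 - s) penalizes real scores that fall below the margin.
    real_term = sum(max(0.0, 1.0 - s) for s in real_scores) / len(real_scores)
    # max(0, 1 + s) penalizes generated scores that rise above -1.
    fake_term = sum(max(0.0, 1.0 + s) for s in fake_scores) / len(fake_scores)
    return real_term + fake_term
```

Small hand-checkable inputs like these keep a docstring example cheap to run and easy to verify by eye.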

pplantinga · 2025-11-21T22:09:06Z

Regarding the extra dependency, please take a look at our policy here. This code should be compliant with that. @pplantinga can advise.

I think this PR follows the policy: for recipes, the only requirement is an extra_requirements.txt file listing all non-SpeechBrain dependencies. The one thing I might suggest is adding transformers to the extra requirements, as it has (mostly) moved to integrations now and is no longer a core dependency.

I'm not sure about having "metrics" as a local folder. I think we might need to reuse the same metrics in other recipes, for instance the streamable FocalCodec and the extension of FocalCodec to LibriLight. Maybe we can put it in SpeechBrain/metrics? Any advice, @pplantinga?

Perhaps we can leave it here for now, and move it if we do end up using it for other recipes. The principle of YAGNI (you ain't gonna need it) might apply here: let's keep it as straightforward as possible and not plan too far ahead.

mravanelli · 2025-11-22T14:45:32Z

Thank you for your comments @pplantinga! Do you have other comments or suggestions?

mravanelli · 2025-11-22T14:47:33Z

@Adel-Moumen, do you also have some comments and suggestions here?

mravanelli · 2025-11-22T19:34:13Z

I tested the recipe and the recipe tests. All seems to work properly.
A couple of small points:

We need to upload the logs to Dropbox. I will follow up on that privately.
In both YAML files, train-clean-360 and train-other-500 are commented out. I would suggest uncommenting them by default.

mravanelli · 2025-11-24T18:36:58Z

This PR LGTM now. I think we can go ahead and merge it, unless @pplantinga or @Adel-Moumen have further comments. Great Job @lucadellalib!

pplantinga

LGTM

lucadellalib added 2 commits November 20, 2025 20:49

Add FocalCodec recipe

b3bd5d1

Update

d070685

mravanelli requested changes Nov 21, 2025

mravanelli requested a review from pplantinga November 21, 2025 20:31

Improve docstrings

1a2c94b

Merge branch 'develop' into focalcodec

cbfec20

mravanelli requested a review from Adel-Moumen November 22, 2025 14:47

mravanelli assigned lucadellalib Nov 22, 2025

mravanelli added the enhancement New feature or request label Nov 22, 2025

lucadellalib and others added 4 commits November 23, 2025 15:37

Minor improvements

94c7fd8

Merge branch 'develop' into focalcodec

db01033

Merge branch 'develop' into focalcodec

937535b

Update readme

349b9bf

mravanelli approved these changes Nov 24, 2025

pplantinga approved these changes Nov 24, 2025

pplantinga added this to the v1.1.0 milestone Nov 24, 2025

pplantinga added the recipes Changes to recipes only (add/edit) label Nov 24, 2025

pplantinga merged commit 637f0a5 into speechbrain:develop Nov 24, 2025
5 checks passed


