Reforging LLaMA lobe (code from Samsung AI Center-Cambridge) by TParcollet · Pull Request #2850 · speechbrain/speechbrain

TParcollet · 2025-03-04T17:13:23Z

This PR refactor our LLaMA lobe to become generic enough and version agnostic. I cannot share the associated recipe for now, but it works ok. I still may have a few question open w.r.t checkpointing, but I think that this should be ok. I think that the current state of our LLaMA lobe is a real problem as it may hide the capacity of SB to adapt and to enable people to develop their custom code.

@poonehmousavi Multiwoz will need to be slightly refactor around this code I think. As a rule of thumb here are the potential changes:

BNB and PeFT are not expected to be on the lobe. These are external things that must be managed at the yaml level.
SB has its own adapter scheme, we should rely on this as it's more transparent than PeFT. This does not prevent function like preparation of kbit training to be applied in the training recipe, but not in the lobe. We can also use PeFT adapter class with our adapter method.

To-do:

refactor multiwoz
dive into the tests that touch llama2.py and change them.

speechbrain/lobes/models/huggingface_transformers/llama.py

TParcollet · 2025-03-17T11:47:30Z

@pplantinga do you have any idea why the doctest keep running on the llama file while I excluded it? This is frustrating ahah

Adel-Moumen · 2025-03-17T12:08:23Z

@pplantinga do you have any idea why the doctest keep running on the llama file while I excluded it? This is frustrating ahah

You need to add # doctest: +SKIP (see:

speechbrain/speechbrain/inference/encoders.py

Line 44 in 7724216

    
               >>> ssl_model.encode_file("samples/audio_samples/example_fr.wav") # doctest: +SKIP

)

TParcollet · 2025-03-17T18:45:36Z

@Adel-Moumen But like, isn't my change to the pytest file supposed to avoid testing llama.py? this sounds weird to me.

speechbrain/lobes/models/huggingface_transformers/llama.py

…peechbrain-released into llama_family_revamp

speechbrain/lobes/models/huggingface_transformers/llama.py

Adel-Moumen

LGTM

…rain#2850)

revamp llama

951141c

TParcollet requested review from Adel-Moumen, poonehmousavi and pplantinga March 4, 2025 17:13

TParcollet changed the title ~~Reforging LLaMA lobe~~ Reforging LLaMA lobe (code from Samsung AI Center-Cambridge) Mar 4, 2025

Update llama.py

d616ccb

Adel-Moumen reviewed Mar 4, 2025

View reviewed changes

speechbrain/lobes/models/huggingface_transformers/llama.py Outdated Show resolved Hide resolved

speechbrain/lobes/models/huggingface_transformers/llama.py Outdated Show resolved Hide resolved

speechbrain/lobes/models/huggingface_transformers/llama.py Outdated Show resolved Hide resolved

TParcollet added 2 commits March 17, 2025 11:15

remove llama from doctesting due to model access

559d1da

fix torch loading type

d6c80c4

TParcollet mentioned this pull request Mar 17, 2025

SpeechLLM (with LLaMA) and Conformer recipe for speech translation on CoVoST (Code from Samsung AI Center Cambridge) #2865

Merged

Adel-Moumen reviewed Mar 24, 2025

View reviewed changes

speechbrain/lobes/models/huggingface_transformers/llama.py Outdated Show resolved Hide resolved

TParcollet added 5 commits March 25, 2025 20:32

removing multiwoz waiting for a refactoring

da32ee1

Adel comment

07ac353

Merge branch 'develop' into llama_family_revamp

84af60c

remove llama2 from conftest

cc49aa1

Merge branch 'llama_family_revamp' of https://github.com/TParcollet/s…

61b0dd3

…peechbrain-released into llama_family_revamp

TParcollet added the ready to review Waiting on reviewer to provide feedback label Mar 25, 2025

remove recipe testing for Multiwoz

e80dd9e

Adel-Moumen reviewed Mar 28, 2025

View reviewed changes

TParcollet added 4 commits April 7, 2025 10:25

fix last changes

317f0b0

fix last changes

79cc2d3

better now

0d46e11

Merge branch 'develop' into llama_family_revamp

b6169bb

Adel-Moumen approved these changes Apr 7, 2025

View reviewed changes

Adel-Moumen merged commit 00b981c into speechbrain:develop Apr 7, 2025
5 checks passed

pplantinga pushed a commit to pplantinga/speechbrain that referenced this pull request Jun 2, 2025

Reforging LLaMA lobe (code from Samsung AI Center-Cambridge) (speechb…

59b5a56

…rain#2850)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reforging LLaMA lobe (code from Samsung AI Center-Cambridge)#2850

Reforging LLaMA lobe (code from Samsung AI Center-Cambridge)#2850
Adel-Moumen merged 14 commits intospeechbrain:developfrom
TParcollet:llama_family_revamp

TParcollet commented Mar 4, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

TParcollet commented Mar 17, 2025

Uh oh!

Adel-Moumen commented Mar 17, 2025 •

edited

Loading

Uh oh!

TParcollet commented Mar 17, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Adel-Moumen left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

TParcollet commented Mar 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

TParcollet commented Mar 17, 2025

Uh oh!

Adel-Moumen commented Mar 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

TParcollet commented Mar 17, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Adel-Moumen left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

TParcollet commented Mar 4, 2025 •

edited

Loading

Adel-Moumen commented Mar 17, 2025 •

edited

Loading