
FIX: Enable non-strict loading of state dicts #295

Conversation

BenjaminBossan
Member

What does this PR do?

Resolves #278

PyTorch allows loading state dicts with the strict=False argument to ignore missing keys. This is now also supported in optimum-quanto. Before this fix, a KeyError would be raised.

One context where this is important is for parameter-efficient fine-tuning adapters such as LoRA. There, we want to load only a small subset of parameters and leave the other model weights untouched. This requires non-strict loading.
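To illustrate the behavior this PR enables, here is a minimal sketch in plain PyTorch (not the optimum-quanto internals this PR touches; the model and keys are made up): with strict=False, the keys that are present get loaded and the rest are reported instead of raising.

```python
import torch
import torch.nn as nn

# A toy model with two layers, so the state dict has four keys:
# "0.weight", "0.bias", "1.weight", "1.bias"
base = nn.Sequential(nn.Linear(8, 8), nn.Linear(8, 2))

# Pretend this partial state dict came from a fine-tuned adapter:
# it only covers the second layer.
partial = {"1.weight": torch.zeros(2, 8), "1.bias": torch.zeros(2)}

# strict=True (the default) would raise because "0.weight"/"0.bias"
# are absent; strict=False loads what is there and reports the rest.
result = base.load_state_dict(partial, strict=False)
print(sorted(result.missing_keys))  # ['0.bias', '0.weight']
```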

Before submitting

  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link
    to it if that's the case.
  • Did you run all tests locally and make sure they pass? Only a subset.
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@BenjaminBossan
Member Author

@dacorvo I created the PR to fix non-strict loading. As the code changed a bit compared to what I had on the issue, and since I wanted to support int4 and int8, the changes are a bit different from what we discussed there. LMK if something should be changed or is still missing.

Apparently the commit is not "conventional", not sure what I missed there. Should I try to fix this via rebase or can this be fixed later via squash+merge?

@dacorvo
Collaborator

dacorvo commented Aug 26, 2024

Apparently the commit is not "conventional", not sure what I missed there. Should I try to fix this via rebase or can this be fixed later via squash+merge?

You can amend your commit to "fix: Enable non-strict loading of state dicts" to make it conventional and force-push.
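For reference, one minimal way to do this from the command line (assuming the fix is the most recent commit on the branch):

```shell
# Rewrite the latest commit message to follow the conventional-commit format
git commit --amend -m "fix: Enable non-strict loading of state dicts"

# Update the already-pushed PR branch; --force-with-lease refuses to
# overwrite commits someone else pushed in the meantime
git push --force-with-lease
```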

@BenjaminBossan force-pushed the fix-enable-non-strict-loading-of-state-dicts branch from a9ec7fa to 8b59252 on August 26, 2024 16:29
@BenjaminBossan
Member Author

You can amend your commit to "fix: Enable non-strict loading of state dicts" to make it conventional and force-push.

Thanks, done.

Collaborator

@dacorvo left a comment

Thank you very much for this neat pull request. Looking forward to seeing how you use quanto in peft!

@dacorvo merged commit f9b71f4 into huggingface:main on Aug 27, 2024
15 checks passed
@BenjaminBossan deleted the fix-enable-non-strict-loading-of-state-dicts branch on September 23, 2024 09:07
Development

Successfully merging this pull request may close these issues.

Non-strict loading of the state dict
2 participants