-
Notifications
You must be signed in to change notification settings - Fork 1.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Additional Information to prepare_model_for_kbit_training #2299
Comments
Hey @NilBiescas! I think this is a good idea since it might be surprising to find that the base model cannot be trained after using it with In addition it might also be noteworthy that this is not the only time the |
Yes, I agree. @NilBiescas would you like to create a PR to update the docstring accordingly? |
@BenjaminBossan Okey |
Resolved via #2305, thanks. |
Glad I made my first PR to HF ! |
Feature request
Add a comment in the docstring of prepare_model_for_kbit_training to inform that it sets requires_grad to false to all the base model parameters.
Motivation
As this function is used before training it might be nice to know that its actually freezing all the base model.
Your contribution
I could add a line commenting that the function freezes the base model.
The text was updated successfully, but these errors were encountered: