Support Rank Stabilized LoRA in the ModelConfig/LoraConfig #1877

JohnGiorgi · 2024-07-25T15:28:38Z

Tiny PR to support Rank Stabilized LoRA. To do this we:

Expose the argument in ModelConfig. It is copied straight from https://huggingface.co/docs/peft/en/package_reference/lora#peft.LoraConfig.use_rslora
Pass it through to LoraConfig in utils.py

Of course, a user could always do this outside TRL, but my motivation for enabling it this way was that I am working with a slightly modified version of the sft.py script, so it would be nice if I could enable Rank Stabilized LoRA with a single flag.

kashif · 2024-07-25T19:41:16Z

@JohnGiorgi do we need to pin peft to some version for this?

JohnGiorgi · 2024-07-29T13:13:38Z

@kashif Good question! Looks like yes... v0.8.0 based on digging through the release notes. Looks like TRL currently pins it to >=0.4.0. Is that a change TRL maintainers are okay with?

kashif · 2024-07-29T13:19:53Z

yes i believe we will need to pin peft to that for this PR, also could you kindly also add a test?

JohnGiorgi · 2024-07-30T14:42:48Z

@kashif Okay I bumped the peft dependency to >=0.8.0. I also added a test for get_peft_config (which looks to be untested), which also serves as a test to ensure use_rslora is getting passed from a ModelConfig to resulting LoraConfig

HuggingFaceDocBuilderDev · 2024-07-30T14:53:37Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

kashif · 2024-07-30T19:13:16Z

@JohnGiorgi i believe its failing for one of the older python with:

AttributeError: 'str' object has no attribute 'removeprefix'

JohnGiorgi · 2024-07-30T19:55:22Z

Fixed! Let me know if anything else is blocking this

trl/trainer/model_config.py

qgallouedec · 2024-08-06T15:36:39Z

setup.py

        "pytest",
        "pytest-xdist",
-        "accelerate",


already in required

qgallouedec · 2024-08-06T16:03:14Z

Merged, thank you @JohnGiorgi

JohnGiorgi · 2024-08-06T16:34:37Z

Merged, thank you @JohnGiorgi

Thank you!

feat: support RS-LoRA in the ModelConfig

b974ee5

kashif approved these changes Jul 28, 2024

View reviewed changes

JohnGiorgi added 2 commits July 30, 2024 10:01

build: bump minimum peft version to support rslora

11fc91f

test: add test for get_peft_config

2fb6fae

JohnGiorgi changed the title ~~feat: support RS-LoRA in the ModelConfig~~ Support Rank Stabilized LoRA in the ModelConfig/LoraConfig Jul 30, 2024

test: make test python 3.8 friendly

0a2c4d5

Merge branch 'main' into support-rs-lora

29ffca2

qgallouedec reviewed Aug 6, 2024

View reviewed changes

trl/trainer/model_config.py Outdated Show resolved Hide resolved

qgallouedec and others added 5 commits August 6, 2024 17:19

Merge branch 'main' into support-rs-lora

7fa2983

rm unused marker

2c1e46c

minor changes

58e979b

simplify, clarify doc

ec2f5b1

update deps (peft in test)

c123d21

qgallouedec reviewed Aug 6, 2024

View reviewed changes

setup.py

"pytest",

"pytest-xdist",

"accelerate",

Copy link

Member

qgallouedec Aug 6, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

already in required

qgallouedec added 2 commits August 6, 2024 15:40

re-ordering

fee255e

fix setup

6098e2f

qgallouedec merged commit b60ce79 into huggingface:main Aug 6, 2024
9 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support Rank Stabilized LoRA in the ModelConfig/LoraConfig #1877

Support Rank Stabilized LoRA in the ModelConfig/LoraConfig #1877

JohnGiorgi commented Jul 25, 2024

kashif commented Jul 25, 2024

JohnGiorgi commented Jul 29, 2024 •

edited

Loading

kashif commented Jul 29, 2024

JohnGiorgi commented Jul 30, 2024

HuggingFaceDocBuilderDev commented Jul 30, 2024

kashif commented Jul 30, 2024

JohnGiorgi commented Jul 30, 2024 •

edited

Loading

qgallouedec Aug 6, 2024

qgallouedec commented Aug 6, 2024

JohnGiorgi commented Aug 6, 2024

Support Rank Stabilized LoRA in the ModelConfig/LoraConfig #1877

Support Rank Stabilized LoRA in the ModelConfig/LoraConfig #1877

Conversation

JohnGiorgi commented Jul 25, 2024

kashif commented Jul 25, 2024

JohnGiorgi commented Jul 29, 2024 • edited Loading

kashif commented Jul 29, 2024

JohnGiorgi commented Jul 30, 2024

HuggingFaceDocBuilderDev commented Jul 30, 2024

kashif commented Jul 30, 2024

JohnGiorgi commented Jul 30, 2024 • edited Loading

qgallouedec Aug 6, 2024

Choose a reason for hiding this comment

qgallouedec commented Aug 6, 2024

JohnGiorgi commented Aug 6, 2024

JohnGiorgi commented Jul 29, 2024 •

edited

Loading

JohnGiorgi commented Jul 30, 2024 •

edited

Loading