Add PaliGemma LoRA #464

Merged
12 commits merged into main from paligemma-lora on Jun 26, 2024

Conversation

probicheaux
Collaborator

Description

Add a class that can perform inference using LoRAs.

Type of change

Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • This change requires a documentation update

How has this change been tested? Please provide a test case or example of how you tested the change.

Locally

Any specific deployment considerations

n/a

Docs

  • Docs updated? What were the changes:

@probicheaux
Collaborator Author

  1. PaliGemma needs transformers>=4.41.1, but requirements.cogvlm.txt and requirements.groundingdino.txt pin transformers low. Can we avoid that now?
  2. This change doesn't work with get_model because we don't know a priori if the PaliGemma model is a LoRA or not. How should I handle that? Put something in the model bucket and check for that? Right now, there's a file adapter_config.json that exists if and only if the model is a LoRA. Should I use that file to check which class to load in get_model?

@@ -1,4 +1,4 @@
-transformers>=4.36.0,<4.38.0
+transformers>=4.36.0
Collaborator

I remember pinning the version due to: #355
Not sure if newer versions of transformers solve the problem, but if they do, the lower bound should probably be bumped.


self.processor = AutoProcessor.from_pretrained(self.cache_dir)


if __name__ == "__main__":
Collaborator

I know that's not part of the change, but is that needed?

Collaborator Author

sorry, is what needed? the main? no, it was just for testing, we can remove

Collaborator

yeah, main

@@ -150,6 +161,36 @@ def download_model_artefacts_from_s3(self) -> None:
raise NotImplementedError()


class LoRAPaliGemma(PaliGemma):
Collaborator

Could you post a docs example and a description of the LoRA model?
I guess this class is needed to be able to load LoRA-fine-tuned models from the HF hub, since people are posting those, but what about our platform? PaliGemma is a RoboflowInferenceModel, but it seems that we don't load weights from our hosting, which may be an indication that this is a kind of "core" model?
Also, is an HF token always required, or could we rely on their auth?

Collaborator Author

I can write up something more detailed, but the short answer is this:

  1. LoRA is a technique to train a small "diff" from some base model A.
  2. This PR assumes that users will deploy the LoRA (the diff) to Roboflow, but in order to use it, they will need to download the base model A from Hugging Face (see the sketch after this list).
  3. This doesn't reduce the amount of data transferred for the first LoRA load, but it will significantly reduce data transfer on subsequent LoRA loads -- from 6GB for a fully fine-tuned model to only 28MB for a new LoRA.
  4. This will also reduce our storage needs, because we don't need to host 6GB for each fine-tune, just 28MB for each LoRA.
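
For illustration, a minimal sketch of that loading pattern using peft and transformers; the base checkpoint ID and adapter path below are placeholders, not the values the class actually uses:

```python
from peft import PeftModel
from transformers import AutoProcessor, PaliGemmaForConditionalGeneration

# Base model A (~6GB): downloaded once from the Hugging Face Hub and cached.
BASE_MODEL_ID = "google/paligemma-3b-pt-224"  # illustrative base checkpoint
ADAPTER_DIR = "/path/to/lora-weights"         # ~28MB adapter pulled from Roboflow hosting

base_model = PaliGemmaForConditionalGeneration.from_pretrained(BASE_MODEL_ID)
processor = AutoProcessor.from_pretrained(BASE_MODEL_ID)

# Attach the LoRA "diff" on top of the cached base weights; only this small
# adapter has to be transferred and stored per fine-tune.
model = PeftModel.from_pretrained(base_model, ADAPTER_DIR)
```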

Collaborator Author

So to be clear, we are loading weights from our hosting

Collaborator Author

> Also - is that always required to have HF token - or maybe we could rely on their auth?

I'm not sure I understand this question

Collaborator

From previous answers I see that HF tokens will be required, at least sometimes.

@capjamesg
Contributor

I have tested this implementation and successfully trained a model with LoRA.

probicheaux self-assigned this Jun 13, 2024
@PawelPeczek-Roboflow
Collaborator

Fine, as long as we resolve this #464 (comment) we are free to merge. I believe that would only take testing CogVLM and probably setting transformers>=4.41.1.

@PawelPeczek-Roboflow
Collaborator

Regarding question 2 from here:
get_model(...) internally calls get_model_type(...). It would be best if we could have that information returned by the API at that level.
If that's not feasible, relying on adapter_config.json is OK, but that would probably require having a single class for the LoRA and non-LoRA versions?
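
For reference, a minimal sketch of that fallback check, assuming the downloaded artefacts sit in a local cache directory (the helper name is illustrative, not the final implementation):

```python
import os


def is_lora_checkpoint(cache_dir: str) -> bool:
    """Return True when the cached artefacts are a LoRA adapter.

    PEFT writes adapter_config.json only for LoRA fine-tunes, so its presence
    distinguishes an adapter from a fully fine-tuned checkpoint.
    """
    return os.path.exists(os.path.join(cache_dir, "adapter_config.json"))
```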

@PawelPeczek-Roboflow
Collaborator

@probicheaux - how do we plan to move on with this?

@probicheaux
Collaborator Author

@PawelPeczek-Roboflow sorry, I've been super busy. Just fixed the get_model thing by pushing a new model_conversion param that adds peft to lora models. I also tested CogVLM in the new docker container (verifying transformers==4.41.2) and it works fine.

PawelPeczek-Roboflow merged commit 4a5e258 into main Jun 26, 2024
50 checks passed
PawelPeczek-Roboflow deleted the paligemma-lora branch June 26, 2024 06:16