feat: added lora, fixed inference #15
base: main
Conversation
P-H-B-D commented on Nov 5, 2024
- Re-added support for LoRA via the --use_lora flag and the associated --rank, --lora_alpha, and --target_modules flags.
- Fixed inference to return the image directly rather than wrapped in a list.
Left a few comments. As a general rule, avoid duplicating logic and try to reuse existing code as much as possible.
def get_lora_model(unet, rank=4, lora_alpha=4, target_modules=None):
    if target_modules is None:
        target_modules = [
            "to_q",
            "to_k",
            "to_v",
            "to_out.0",
            "conv1",
            "conv2",
            "conv_shortcut",
            "conv3",
            "conv4",
        ]

    config = LoraConfig(
        r=rank,
        lora_alpha=lora_alpha,
        target_modules=target_modules,
        lora_dropout=0.0,
        bias="none",
    )
    return get_peft_model(unet, config)
move this to sd3/model.py
if is_lora:
    inference_unet = unet.base_model.model
Can't you directly pass that as the argument to the inference function? A priori, the inference doesn't need to be aware of whether the model was trained with LoRA or not.
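For instance (a rough sketch only; run_inference and the other arguments here are placeholder names, not this repo's actual inference API), the unwrapping could happen at the call site so the inference code always receives a plain UNet2DConditionModel:

# Sketch: unwrap the PEFT wrapper where inference is invoked, so no
# is_lora flag ever reaches the inference function. `run_inference`,
# `vae`, `noise_scheduler`, and `actions` are assumed names.
plain_unet = unet.base_model.model if args.use_lora else unet
images = run_inference(plain_unet, vae, noise_scheduler, actions)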
    action_embedding_dim: int,
    skip_image_conditioning: bool = False,
    device: torch.device | None = None,
) -> tuple[UNet2DConditionModel, AutoencoderKL, torch.nn.Embedding, DDIMScheduler, CLIPTokenizer, CLIPTextModel]:
Can you run ruff format on that file?
)
torch.nn.init.normal_(action_embedding.weight, mean=0.0, std=0.02)

# DDIM scheduler allows for v-prediction and less sampling steps
# Load models with device placement
Comments should primarily provide additional context for code that's non-trivial to understand. In this case, the comment is simply describing what is done below, which is already explicit.
)

if not skip_image_conditioning:
    # This is to accommodate concatenating previous frames in the channels dimension
    # Modify UNet input channels
Same thing: the new comment describes what we're doing and not why.
The previous comment gave that additional context
from peft import LoraConfig, get_peft_model

def get_lora_model(unet, rank=4, lora_alpha=4, target_modules=None):
    if target_modules is None:
        # Default target modules for SD UNet
        target_modules = [
            "to_q",
            "to_k",
            "to_v",
            "to_out.0",
            "conv1",
            "conv2",
            "conv_shortcut",
            "conv3",
            "conv4",
        ]

    config = LoraConfig(
        r=rank,
        lora_alpha=lora_alpha,
        target_modules=target_modules,
        lora_dropout=0.0,
        bias="none",
    )
    return get_peft_model(unet, config)
this should go to sd3/model.py
# Save LoRA weights
unet.save_pretrained(os.path.join(args.output_dir, "lora"))
update the save function rather than saving it separately
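Roughly what that might look like (a sketch; save_checkpoint and its arguments are assumptions, not the project's actual helper):

import os

import torch

# Hypothetical: extend the existing save helper instead of adding a separate call.
def save_checkpoint(unet, action_embedding, output_dir, use_lora=False):
    if use_lora:
        # For a PEFT-wrapped UNet, save_pretrained writes only the LoRA adapter weights.
        unet.save_pretrained(os.path.join(output_dir, "lora"))
    else:
        unet.save_pretrained(os.path.join(output_dir, "unet"))
    torch.save(action_embedding.state_dict(), os.path.join(output_dir, "action_embedding.pt"))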
    lora_alpha=args.lora_alpha,
)
# Only train LoRA parameters
params_to_optimize = filter(lambda p: p.requires_grad, unet.parameters())
Did you make sure that only the LoRA parameters are marked with requires_grad? I'm not seeing this logic anywhere.
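One way to check (a sketch; get_peft_model freezes the base weights itself, so this is a sanity check rather than required logic):

# After wrapping with get_peft_model, only adapter weights (named lora_A/lora_B
# by PEFT) should still require gradients.
trainable = [name for name, p in unet.named_parameters() if p.requires_grad]
assert trainable and all("lora_" in name for name in trainable), trainable[:5]
unet.print_trainable_parameters()  # PeftModel helper: prints trainable vs. total parameter counts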
if args.skip_action_conditioning:
    optimizer = optimizer_cls(
        unet.parameters(),
        lr=args.learning_rate,
        betas=(args.adam_beta1, args.adam_beta2),
        weight_decay=args.adam_weight_decay,
        eps=args.adam_epsilon,
    )
else:
    optimizer = optimizer_cls(
        [
            {"params": params_to_optimize},
            {"params": action_embedding.parameters()},
        ],
        lr=args.learning_rate,
        betas=(args.adam_beta1, args.adam_beta2),
        weight_decay=args.adam_weight_decay,
        eps=args.adam_epsilon,
    )
This is copied from lines 624-639. Update the existing code rather than duplicating it.
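A possible shape for the merged version (a sketch using the variable names visible in the diff):

# Build the parameter groups once, then create a single optimizer. The
# requires_grad filter is a no-op for full fine-tuning and restricts the
# group to the adapter weights when LoRA is enabled.
param_groups = [{"params": [p for p in unet.parameters() if p.requires_grad]}]
if not args.skip_action_conditioning:
    param_groups.append({"params": action_embedding.parameters()})

optimizer = optimizer_cls(
    param_groups,
    lr=args.learning_rate,
    betas=(args.adam_beta1, args.adam_beta2),
    weight_decay=args.adam_weight_decay,
    eps=args.adam_epsilon,
)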
parser.add_argument(
    "--target_modules",
    type=str,
    nargs="+",
    default=None,
    help="List of module names to apply LoRA to",
)
Is this used anywhere? Also, this parameter is typically hardcoded.
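If the flag is dropped, the list can simply live next to the LoRA setup in sd3/model.py (sketch; the constant name is illustrative):

# Hardcoded attention/conv projections for the SD UNet, instead of a --target_modules CLI flag.
LORA_TARGET_MODULES = [
    "to_q", "to_k", "to_v", "to_out.0",
    "conv1", "conv2", "conv_shortcut", "conv3", "conv4",
]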