
Feature/jepa #409

Merged - 59 commits merged into main from feature/jepa on Aug 14, 2024

Conversation

@benjijamorris (Contributor) commented on Aug 2, 2024:

What does this PR do?

Adds Joint Embedding Predictive Architecture (JEPA) and Image World Model (draft) infrastructure.

Fixes #<issue_number>

Before submitting

  • Did you make sure the title is self-explanatory and the description concisely explains the PR?
  • Did you make sure your PR does only one thing, instead of bundling different changes together?
  • Did you list all the breaking changes introduced by this pull request?
  • Did you test your PR locally with the pytest command?
  • Did you run the pre-commit hooks with the pre-commit run -a command?

Did you have fun?

Make sure you had fun coding 🙃

@ritvikvasan (Member) previously approved these changes on Aug 14, 2024:

Looks good to me... Left some minor comments again.

```python
y = self.R.randint(0, self.num_patches[-2] - height + 1)
# add block to mask
if self.spatial_dims == 3:
    mask[:, y : y + height, x : x + width] = 1
```
@ritvikvasan (Member):
Do you not randomly sample dims in Z here?

@benjijamorris (Contributor, author):

No - I'm basing this off of V-JEPA (whose masks are the same over time), with the idea that 3D images have spatial redundancy similar to natural videos.

@ritvikvasan (Member):

gotcha
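The masking idea in the snippet above can be sketched roughly as follows - sample one 2D block and broadcast it across all Z slices, so every slice carries the identical mask. This is a minimal numpy sketch with hypothetical shapes, not the PR's actual code:

```python
import numpy as np

rng = np.random.RandomState(0)

def block_mask_3d(num_patches, height, width):
    """Sample one 2D block and repeat it over Z (V-JEPA-style: the
    mask is identical for every Z slice). Shapes are hypothetical."""
    z, h, w = num_patches
    mask = np.zeros((z, h, w), dtype=np.int64)
    y = rng.randint(0, h - height + 1)
    x = rng.randint(0, w - width + 1)
    # same spatial block for every Z slice - no sampling in Z
    mask[:, y : y + height, x : x + width] = 1
    return mask

mask = block_mask_3d((4, 8, 8), height=3, width=3)
# every Z slice carries the identical block
assert all((mask[i] == mask[0]).all() for i in range(4))
```

The point of the exchange above is that Z is deliberately not sampled: the block occupies the same (y, x) region in every slice.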

```python
target_masks = self.get_predict_masks(source.shape[0], device=source.device)

# mean across patches, no cls token to remove
source_embeddings = self.encoder(source)
```
@ritvikvasan (Member):

Is the encoder here a ViT? Does it handle the patchifying etc.? Is the source here the small mask?

@benjijamorris (Contributor, author):

Yes, the ViT encoder does the patchifying. source is an image. This takes all the patches from the source image and predicts the embeddings of all the patches from the target image.
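A toy sketch of that flow - encode the source patches, predict the target-patch embeddings, and compare against the target encoder's output in embedding space rather than pixel space. Plain numpy linear maps stand in for the real ViT encoder and predictor; all names and shapes here are hypothetical:

```python
import numpy as np

rng = np.random.RandomState(0)
n_patches, emb_dim = 16, 8

# stand-ins for the ViT encoder and the predictor head (hypothetical)
W_enc = rng.randn(emb_dim, emb_dim)
W_pred = rng.randn(emb_dim, emb_dim)

def encode(patches):
    # toy "encoder": one linear map per patch embedding
    return patches @ W_enc  # (n_patches, emb_dim)

source = rng.randn(n_patches, emb_dim)  # patchified source image
target = rng.randn(n_patches, emb_dim)  # patchified target image

# JEPA idea: predict target-patch *embeddings* from source-patch
# embeddings, and take the loss in embedding space
source_emb = encode(source)
pred_target_emb = source_emb @ W_pred
target_emb = encode(target)  # in practice typically a separate target encoder
loss = np.mean((pred_target_emb - target_emb) ** 2)
```

The design point is that the regression target is an embedding, not pixels, which is what distinguishes JEPA-style training from masked-pixel reconstruction.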

```python
def get_context_embeddings(self, x, mask):
    # mask context pre-embedding to prevent leakage of target information
    context_patches, _, _, _ = self.encoder.patchify(x, 0)
    context_patches = take_indexes(context_patches, mask)
```
@ritvikvasan (Member):

take_indexes seems to be repeating indices along a channel dimension - just want to check that this is expected.

@benjijamorris (Contributor, author):

Yes, this is just applying the indices across all channels of the tensor (tokens x batch x embedding_dim).
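That gather pattern can be illustrated in numpy: a (tokens, batch) index is broadcast across the embedding dimension so the same token reordering is applied to every channel. This is a hypothetical re-implementation for illustration, not the repo's take_indexes:

```python
import numpy as np

def take_indexes(sequences, indexes):
    """Gather tokens along dim 0 of a (tokens, batch, embedding_dim)
    tensor, repeating the (tokens, batch) index across the embedding
    dimension. Hypothetical sketch of the idea discussed above."""
    t, b, c = sequences.shape
    # repeat the index along the channel dimension before gathering
    idx = np.broadcast_to(indexes[:, :, None], (indexes.shape[0], b, c))
    return np.take_along_axis(sequences, idx, axis=0)

seq = np.arange(2 * 3 * 4).reshape(2, 3, 4)  # (tokens=2, batch=3, emb=4)
idx = np.array([[1, 1, 1], [0, 0, 0]])       # swap the two tokens per batch item
out = take_indexes(seq, idx)
```

Repeating the index over channels is expected: the index only selects tokens, and every channel of a selected token must move together.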

```python
if ckpt is not None:
    state_dict = torch.load(ckpt)["state_dict"]
    state_dict = {
        k.replace("backbone.", ""): v
        for k, v in state_dict.items()
    }
```
@ritvikvasan (Member):

Can you add a comment about why this is necessary?

@benjijamorris (Contributor, author):

I'm thinking this guy isn't ready for prime time yet
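For context on why such a remap is typically needed: when a checkpoint is saved from a wrapper module that stores the model as (for example) self.backbone, every parameter key gets a "backbone." prefix, and the bare model cannot load the state dict until the prefix is stripped. A pure-Python sketch with hypothetical keys:

```python
# Hypothetical checkpoint keys, as saved from a wrapper that holds the
# model under a `backbone` attribute - the prefix must be stripped
# before the bare model can load them.
ckpt_state = {
    "backbone.encoder.weight": "w0",
    "backbone.encoder.bias": "b0",
}

# same dict-comprehension pattern as the snippet above
state_dict = {k.replace("backbone.", ""): v for k, v in ckpt_state.items()}
```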

```python
)

self.mask_token = torch.nn.Parameter(torch.zeros(1, 1, emb_dim))
self.pos_embedding = torch.nn.Parameter(torch.zeros(np.prod(num_patches), 1, emb_dim))
```
@ritvikvasan (Member):

Maybe use the general pos embedding function?

@benjijamorris (Contributor, author):

updated!
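The "general pos embedding function" referred to above is not shown in this thread. As a point of comparison to the zero-initialized learned table in the snippet, a common alternative is a fixed sine-cosine positional embedding. This is a generic sketch, not necessarily the repo's helper:

```python
import numpy as np

def sincos_pos_embedding(n_positions, emb_dim):
    """Fixed 1D sine-cosine positional embedding (transformer-style).
    Returns (n_positions, 1, emb_dim) to match the layout used above.
    A generic sketch - the repo's actual helper may differ."""
    assert emb_dim % 2 == 0
    pos = np.arange(n_positions)[:, None]                              # (n, 1)
    freqs = 1.0 / (10000 ** (np.arange(emb_dim // 2) / (emb_dim // 2)))
    angles = pos * freqs[None, :]                                      # (n, d/2)
    emb = np.concatenate([np.sin(angles), np.cos(angles)], axis=1)     # (n, d)
    return emb[:, None, :]

# e.g. for a (2, 4, 4) patch grid flattened to 32 positions
pe = sincos_pos_embedding(int(np.prod((2, 4, 4))), 8)
```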

@ritvikvasan (Member):

General comment: I think having a quick lookup table of the different configs you are adding would be great. Something like this - https://github.com/MMV-Lab/mmv_im2im/blob/main/tutorials/example_by_use_case.md

@benjijamorris (Contributor, author):

Agreed - I'll do a separate PR with that for all the models.

@benjijamorris merged commit 26b77af into main on Aug 14, 2024 (4 of 6 checks passed).
@benjijamorris deleted the feature/jepa branch on August 14, 2024 at 21:26.