Implemented the loss NTK calculation #109
base: main
Conversation
Good job in this PR. The first draft looks pretty nice.
I have two major comments:
- In general, a test should cover as many aspects of the code as possible, and it should test the desired aspect as simply as possible. This includes relying as little as possible on existing methods: for example, creating some dummy test data is better practice than pulling in an existing data generator (see the sketch after this list).
- With your implementation, we would need to duplicate all observables for the loss NTK that we already have for the NTK. One can avoid this by having the option to use a recorder either for the loss NTK or for the regular NTK. This could be done with a single keyword at initialization.
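A minimal sketch of such dummy data, assuming a simple regression setup; all names and shapes here are illustrative, not taken from the PR:

import numpy as np

# Hypothetical dummy data for a loss-NTK test; a fixed seed keeps it deterministic.
rng = np.random.default_rng(42)
dummy_inputs = rng.normal(size=(10, 3))   # 10 points, 3 input features
dummy_targets = rng.normal(size=(10, 1))  # 1D network output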
# Check if we need a loss NTK computation and update the class accordingly
if any(
    [
        "loss_ntk" in self._selected_properties,
As far as I can see, right now we would have to implement the trace and all other properties again to use them with the loss NTK. I think it would be more reasonable to have one kwarg like use_loss_ntk with which all NTK calculations use the loss NTK, making the entire recorder a loss NTK recorder. With this, we could re-use all the properties we have already implemented (see the sketch below).
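A rough sketch of that idea, assuming the flag is passed at initialization; use_loss_ntk and all other names here are suggestions, not existing API:

import numpy as np

class Recorder:
    """Sketch: one recorder class serving both NTK flavours."""

    def __init__(self, use_loss_ntk: bool = False):
        # Single switch: every NTK-based observable operates on
        # whichever matrix _current_ntk returns.
        self.use_loss_ntk = use_loss_ntk

    def _current_ntk(self, ntk: np.ndarray, loss_ntk: np.ndarray) -> np.ndarray:
        return loss_ntk if self.use_loss_ntk else ntk

    def record_trace(self, ntk: np.ndarray, loss_ntk: np.ndarray) -> float:
        # Existing observables like the trace are reused unchanged.
        return float(np.trace(self._current_ntk(ntk, loss_ntk)))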
I agree that there's room for improvement, but I think if we introduce a flag like this here, we should maybe also discuss further changes to the recorder. We should talk about this in person or in a meeting, but I'd first like to make sure that the tests are working, because that's more urgent for the DPG, if that's fine.
The flag was introduced in commit 1dac434.
I have a few comments. If you go through and address them all, I can go back over it, but in general I like it and am happy to have it merged soon.
import os

os.environ["CUDA_VISIBLE_DEVICES"] = "-1"
Please remove this from the test.
# For LPNormLoss of order 2 and a 1D output network, the NTK and the loss NTK
# should be the same up to a factor of +1 or -1.
assert_array_almost_equal(
As this is an integration test, you will also want to check that the deployment has worked. You can check things like the shape of the stored values (see the sketch below).
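A hedged example of the kind of check meant here; the helper and its argument names are hypothetical, not the recorder's real API:

import numpy as np

def check_recorded_shapes(recorded_loss_ntk, n_samples: int):
    """Hypothetical helper: verify the stored loss NTK is one
    (n_samples, n_samples) matrix per recorded epoch."""
    arr = np.asarray(recorded_loss_ntk)
    assert arr.ndim == 3, "expected one matrix per recorded epoch"
    assert arr.shape[1:] == (n_samples, n_samples)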
Can you just name this test_loss_ntk? The naming of the tests should mirror the main Python package, just with test in front. All integration tests using the loss NTK should be in this one module.
import os

os.environ["CUDA_VISIBLE_DEVICES"] = "-1"
Please remove this from the tests.
Please rename this to be in line with the package.
)

@staticmethod
def _unshape_data(
Same here.
""" | ||
|
||
# Set the attributes | ||
self.ntk_batch_size = model.ntk_batch_size |
It might be better to make all of these arguments to this calculation (see the sketch below). Especially when we later move into the new measurement system, this will all need to be self-contained. Things like store_on_device are only pertinent to this calculator.
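A sketch of the suggested direction; the signature and default values are hypothetical, and the point is only that the calculation receives its configuration directly instead of reading it from attributes set elsewhere:

def compute_loss_ntk(self, dataset, ntk_batch_size: int = 10, store_on_device: bool = False):
    # Knobs that only matter to this calculator are passed in here,
    # so the method stays self-contained when it moves into the new
    # measurement system.
    ...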
Returns
-------
input: np.ndarray
Can you add some shape information here? It can be (batch * input size, ) or anything, but just some information about what I will get back (see the docstring sketch below). What you mean by unshape is also very unclear: is it flattening, is it reshaping? "Unshape" doesn't have a real meaning.
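Something along these lines would already help; the shapes and wording are illustrative:

Returns
-------
input : np.ndarray
    The input part of the flattened datapoint, restored to its
    original form, shape (batch_size, *input_shape).
target : np.ndarray
    The corresponding targets, shape (batch_size, *target_shape).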
batch_length, *input_shape[1:]
), datapoint[:, input_dimension:].reshape(batch_length, *target_shape[1:])

def _function_for_loss_ntk(self, params, datapoint) -> float:
I would prefer different naming here. Is it an apply function on flattened data? A loss function? What do you mean by subloss? The loss between two data points is just a loss. "Function for loss ntk" could be anything (see the sketch below).
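For instance, something like this would be more self-explanatory; the name and docstring are only a suggestion:

def _pointwise_loss(self, params, datapoint) -> float:
    """Loss of the model on a single flattened (input, target) pair."""
    ...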
If the notebook has not been cleared, can you clear it of outputs?