
Fix a bug of forward-mode AD when multi-output is needed #1925

Merged
merged 215 commits into from
Jan 3, 2025

Conversation

Jerry-Jzy (Contributor)

I found this bug in this example: https://github.com/lululxvi/deepxde/blob/master/examples/operator/stokes_aligned_pideeponet.py

When setting num_output=3 and multi_output_strategy="independent", the output shape will be (batch size, # of coordinates, 3).
Then the slice here is wrong; what we really want is to slice along the last dimension.
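A minimal sketch of the shape issue (illustrative shapes and names only, not deepxde code): with an output of shape (batch size, # of coordinates, 3), selecting one output component must slice the last axis, not the second one.

```python
import numpy as np

batch_size, num_points = 8, 100                 # assumed sizes for illustration
y = np.zeros((batch_size, num_points, 3))       # e.g. three outputs stacked on the last axis

wrong = y[:, 0:1]       # shape (8, 1, 3)  -- slices the coordinate axis
right = y[..., 0:1]     # shape (8, 100, 1) -- slices the output-component axis
print(wrong.shape, right.shape)
```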

lululxvi (Owner) commented Jan 3, 2025

Instead of doing reshape here, another solution is modifying dde.gradients to support the output shape of (batch 1, batch 2, dim). This might be a better solution.

Jerry-Jzy (Contributor, Author)

> Instead of doing reshape here, another solution is modifying dde.gradients to support the output shape of (batch 1, batch 2, dim). This might be a better solution.

I think the reshape in PDEOperatorCartesianProd is also a good solution, since the problem originally comes from the difference in output structure between PI-DeepONet and PINN. After unifying the output structure, dde.gradients remains a general-purpose module for gradient computation.
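A minimal sketch of that idea (illustrative shapes only, not the PR's exact code): flatten the two batch axes of the PI-DeepONet output so it has the same 2D layout that a PINN network produces before gradients are taken.

```python
import numpy as np

num_func, num_points, num_outputs = 4, 100, 3   # assumed sizes for illustration
y_deeponet = np.zeros((num_func, num_points, num_outputs))

# Merge the function batch and the point batch into one axis, giving the
# (total points, num_outputs) layout of a PINN output.
y_pinn_like = y_deeponet.reshape(-1, num_outputs)
print(y_pinn_like.shape)   # (400, 3)
```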

lululxvi (Owner) commented Jan 3, 2025

Can you test another problem? Change this line

to

return dy_t + dy_x + x[:, 0:1]

The reverse autodiff should run, but I am not sure about the current forward code.

@Jerry-Jzy
Copy link
Contributor Author

> Can you test another problem? Change this line
>
> to
>
> return dy_t + dy_x + x[:, 0:1]
>
> The reverse autodiff should run, but I am not sure about the current forward code.

The forward mode is not suitable for this case, and the reason is not the reshape. The forward code is loop-free, so the gradient has the same shape as the output; the reverse mode works here because a loop is used, so the gradient has the same shape as the input. That is exactly the difference between JVP and VJP. If we want this case to work, we need to repeat x num_func times.
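A small standalone illustration of that shape difference (using jax purely for the example; this is independent of deepxde and its backends):

```python
import jax
import jax.numpy as jnp

def f(x):  # toy map R^3 -> R^2
    return jnp.stack([x[0] * x[1], jnp.sin(x[2])])

x = jnp.array([1.0, 2.0, 3.0])

# Forward mode (JVP): push a tangent through f; the result has the OUTPUT shape.
_, jvp_out = jax.jvp(f, (x,), (jnp.ones_like(x),))
print(jvp_out.shape)  # (2,)

# Reverse mode (VJP): pull a cotangent back through f; the result has the INPUT shape.
y, vjp_fn = jax.vjp(f, x)
(vjp_out,) = vjp_fn(jnp.ones_like(y))
print(vjp_out.shape)  # (3,)
```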

lululxvi (Owner) commented Jan 3, 2025

That is why I am considering modifying dde.gradients to handle and keep the 3D shape; in this way we can use broadcasting, so no repeating of x is needed.

Jerry-Jzy (Contributor, Author)

> That is why I am considering modifying dde.gradients to keep the 3D shape; in this way we can use broadcasting, so no repeating of x is needed.

Although the 3D shape is kept, we still cannot use broadcasting, since (num_func, num_points) cannot broadcast with (num_points, 1).

lululxvi (Owner) commented Jan 3, 2025

(num_func, num_points, 1) can broadcast with (num_points, 1)
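A quick numpy check of the two broadcasting claims above (illustrative shapes only):

```python
import numpy as np

num_func, num_points = 4, 100
x_col = np.zeros((num_points, 1))           # e.g. x[:, 0:1]

y3d = np.zeros((num_func, num_points, 1))   # 3D output: broadcasting works
y2d = np.zeros((num_func, num_points))      # 2D output: broadcasting fails

print((y3d + x_col).shape)                  # (4, 100, 1)
try:
    y2d + x_col
except ValueError as e:
    print("2D case fails:", e)
```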

lululxvi (Owner) commented Jan 3, 2025

> return dy_t + dy_x + x[:, 0:1]
>
> The reverse autodiff should run, but I am not sure about the current forward code.

The key question now is whether the current reshape approach can solve this problem without asking users to repeat x. If not, then we need to find another way.

lululxvi (Owner) commented Jan 3, 2025

Is the current code working for single output? If so, let us make the code work for single output and just raise an error for multiple outputs. Then we will return to multi-output in another PR.

Jerry-Jzy (Contributor, Author)

> (num_func, num_points, 1) can broadcast with (num_points, 1)

If we uniformly reshape to 3D rather than 2D, that may solve the problem in this case.

Jerry-Jzy closed this Jan 3, 2025
lululxvi (Owner) commented Jan 3, 2025

> Is the current code working for single output? If so, let us make the code work for single output and just raise an error for multiple outputs. Then we will return to multi-output in another PR.

How about the single-output case?

Jerry-Jzy (Contributor, Author)

> Is the current code working for single output? If so, let us make the code work for single output and just raise an error for multiple outputs. Then we will return to multi-output in another PR.
>
> How about the single-output case?

Single output works fine.

lululxvi (Owner) commented Jan 3, 2025

Then clean up the code for single output only; we will merge the single-output code first. Or is the current code in the main branch already good?

Jerry-Jzy (Contributor, Author)

> Then clean up the code for single output only; we will merge the single-output code first. Or is the current code in the main branch already good?

I have rolled back the code

Jerry-Jzy reopened this Jan 3, 2025
lululxvi merged commit a3c677c into lululxvi:master Jan 3, 2025
14 checks passed