
Use DifferentiationInterface for AD in Implicit Solvers #2567

Draft · wants to merge 93 commits into master

Conversation

@jClugstor (Member) commented Jan 2, 2025

Checklist

  • Appropriate tests were added
  • Any code changes were done in a way that does not break public API
  • All documentation related to code changes was updated
  • The new code follows the
    contributor guidelines, in particular the SciML Style Guide and
    COLPRAC.
  • Any new documentation only uses public API

Additional context

This is at a point where we can do stuff like this:

using OrdinaryDiffEqCore
using OrdinaryDiffEqSDIRK
using SciMLBase # provides ODEProblem, solve, and SciMLBase.NoSpecialize
using ADTypes
using EnzymeCore

function lorenz!(du, u, p, t)
    du[1] = 10.0 * (u[2] - u[1])
    du[2] = u[1] * (28.0 - u[3]) - u[2]
    du[3] = u[1] * u[2] - (8 / 3) * u[3]
end

u0 = [1.0; 0.0; 0.0]
tspan = (0.0, 100.0)
prob = ODEProblem{true, SciMLBase.NoSpecialize}(lorenz!, u0, tspan)
sol = solve(prob, ImplicitEuler(autodiff=AutoEnzyme(function_annotation = EnzymeCore.Const)))

and it actually uses sparsity detection and greedy Jacobian coloring plus Enzyme to compute the Jacobians.

Some things I'm unsure about:

  • The current behavior is to use Jacobian coloring and SparseDiffTools by default. To keep that behavior, we have to wrap any ADType given in an AutoSparse unless it's already an AutoSparse. This means the ADType the user entered gets wrapped in an AutoSparse, which feels weird to me. Maybe there should be an option to directly use the ADType as entered, with wrapping in an AutoSparse as the default? I'm not sure. (A sketch of the wrapping follows this list.)
  • The biggest issue is that the sparsity detectors work with DI via operator overloading (both TracerSparsityDetector and SymbolicsSparsityDetector do), which is a problem when using AutoSpecialization because of the FunctionWrappers. The solution I found was to just unwrap the function in the preparation process. I'm not sure what performance implications this has, but I don't think it should matter much, since the preparation should run just once.
  • There are still pieces in here that use raw SparseDiffTools (build_J_W) that I haven't yet looked into converting to DI.
  • I may need to fix some of the versions.
  • There are some places that are getting sparse things where they're not expected.
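For reference, here is a minimal sketch of the AutoSparse wrapping mentioned in the first bullet (illustrative only, not the PR's actual code; it assumes the detector comes from SparseConnectivityTracer and the coloring algorithm from SparseMatrixColorings):

using ADTypes, SparseConnectivityTracer, SparseMatrixColorings

function wrap_in_autosparse(ad::ADTypes.AbstractADType)
    ad isa AutoSparse && return ad  # already sparse: use it as given
    return AutoSparse(ad;
        sparsity_detector = TracerSparsityDetector(),
        coloring_algorithm = GreedyColoringAlgorithm())
end

wrap_in_autosparse(AutoForwardDiff())  # returns an AutoSparse wrapping AutoForwardDiff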

@jClugstor (Member Author)

In order for this to be completely done we'll need a DI equivalent of the SparseDiffTools JacVec operator that gets stored in the caches for the W operators. I think I could just wrap the DI pushforward in an operator, but it would be better to have a long-term solution. In a recent Julia Slack thread (https://julialang.slack.com/archives/C6G240ENA/p1735254065747829) there were a couple of solutions.

Would a good way to do this be to make an extension in SciMLOperators for DifferentiationInterface that has something like a DI_pushforward operator, which basically wraps the pushforward function in a FunctionOperator?
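For concreteness, here is a rough sketch of the kind of wrapper being described (hypothetical, not the proposed extension: the name jacvec_operator is made up, and it assumes the five-argument in-place FunctionOperator convention op(w, v, u, p, t) plus the DI preparation/pushforward signatures used elsewhere in this PR):

using SciMLOperators, DifferentiationInterface, ADTypes
import ForwardDiff
const DI = DifferentiationInterface

function jacvec_operator(f!, u0, p, t0; backend = AutoForwardDiff())
    du0 = similar(u0)
    # prepare a pushforward of u -> f!(du, u, p, t), holding p and t as constants
    prep = DI.prepare_pushforward(f!, du0, backend, u0, (zero(u0),),
        DI.Constant(p), DI.Constant(t0))
    # in-place action: Jv .= J(u, p, t) * v, computed lazily through the pushforward
    jvp! = (Jv, v, u, p, t) -> DI.pushforward!(f!, du0, (Jv,), prep, backend,
        u, (v,), DI.Constant(p), DI.Constant(t))
    return FunctionOperator(jvp!, u0, du0)
end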

@ChrisRackauckas @oscardssmith @gdalle Any thoughts?

@ChrisRackauckas (Member)

Would a good way to do this be to make an extension in SciMLOperators for DifferentiationInterface that has something like a DI_pushforward operator, which basically wraps the pushforward function in a FunctionOperator?

Yes

@ChrisRackauckas (Member)

@avik-pal might already have one?

@gdalle commented Jan 3, 2025

Awesome work @jClugstor, thanks! Ping me when this is ready for a first round of DI-specific review.

This is at a point where we can do stuff like this [...] and it actually uses sparsity detection and greedy jacobian coloring plus Enzyme to compute the Jacobians.

Just to be clear, this wasn't possible before? So is this the first time that Enzyme can be used out-of-the-box to solve ODEs?

The current behavior is to use Jacobian coloring and SparseDiffTools by default. To keep that behavior, we have to wrap any ADType given in an AutoSparse unless it's already an AutoSparse. This means the ADType the user entered gets wrapped in an AutoSparse, which feels weird to me. Maybe there should be an option to directly use the ADType as entered, with wrapping in an AutoSparse as the default? I'm not sure.

Another option, which requires a bit more work (and is probably not worth it) would be to make SparseDiffTools compatible with the sparsity API of ADTypes v1. I think it might allow a more seamless upgrade. See e.g. JuliaDiff/SparseDiffTools.jl#298 for the detection aspect, and there should be a similar issue for the coloring aspect.

Speaking of SparseDiffTools, it still has an edge over DI when combined with FiniteDiff. The PR JuliaDiff/FiniteDiff.jl#191 could fix that, maybe @oscardssmith would be willing to take another look?

The biggest issue is that the sparsity detectors work with DI via operator overloading (both TracerSparsityDetector and SymbolicsSparsityDetector do), which is a problem when using AutoSpecialization because of the FunctionWrappers. The solution I found was to just unwrap the function in the preparation process. I'm not sure what performance implications this has, but I don't think it should matter much, since the preparation should run just once.

Agreed, preparation is a one-time cost so I don't think we should worry too much (at least in the prototype stage).

There are some places that are getting sparse things where it's not expected.

What do you mean by unexpected sparse things? SparseMatrixCSC instead of Matrix? Can you give an example?

In order for this to be completely done we'll need a DI equivalent of the SparseDiffTools JacVec operator that gets stored in the caches for the W operators. I think I could just wrap the DI pushforward in an operator, but it would be better to have a long-term solution. In a recent Julia Slack thread (https://julialang.slack.com/archives/C6G240ENA/p1735254065747829) there were a couple of solutions.
Would a good way to do this be to make an extension in SciMLOperators for DifferentiationInterface that has something like a DI_pushforward operator, which basically wraps the pushforward function in a FunctionOperator?

We may also want to involve @oschulz and his AutoDiffOperators package to avoid duplication of efforts?

As a side note, DifferentiationInterface only has two dependencies: ADTypes and LinearAlgebra. For packages that use it extensively, I think it's reasonable to make it a full dep instead of a weakdep.

@@ -7,6 +7,8 @@ version = "1.3.0"
ADTypes = "47edcb42-4c32-4615-8424-f2b9edc5f35b"
ArrayInterface = "4fba245c-0d91-5ea0-9b3e-6abc04ee57a9"
DiffEqBase = "2b5f629d-d688-5b77-993f-72d75c75574e"
DifferentiationInterface = "a0c0ee7d-e4b9-4e03-894e-1c5f64a51d63"
Enzyme = "7da242da-08ed-463a-9acd-ee780be4f1d9"

Does Enzyme need to become a dependency? This adds significant install overhead, but if AutoEnzyme is to be the new default AD then it makes sense

Member Author

Yeah, probably doesn't need to be a dependency unless we're committing to having it be the default.

@@ -25,6 +29,7 @@ ADTypes = "1.11"
ArrayInterface = "7"
DiffEqBase = "6"
DiffEqDevTools = "2.44.4"
DifferentiationInterface = "0.6.23"

Suggested change
DifferentiationInterface = "0.6.23"
DifferentiationInterface = "0.6.28"

the other deps are also missing compat bounds?


Suggested change
DifferentiationInterface = "0.6.23"
DifferentiationInterface = "0.6.31"

alg, autodiff = AutoForwardDiff(chunksize = cs))
function prepare_ADType(alg::AutoFiniteDiff, prob, u0, p, standardtag)
# If the autodiff alg is AutoFiniteDiff, prob.f.f isa FunctionWrappersWrapper,
# and fdtype is complex, fdtype needs to change to something not complex

Note that DI does not explicitly support complex numbers yet. What I mean by that is that we forward things to the backend as much as possible, so if the backend does support complex numbers then it will probably work, but there are no tests or hard API guarantees on that. See JuliaDiff/DifferentiationInterface.jl#646 for the discussion


Also note that some differentiation operators are not defined unambiguously for complex numbers (e.g. the derivative for complex input)

Contributor

Enzyme has an explicit variant of modes for complex numbers that it would probably be wise to similarly wrap here (by default, if a function returns a complex number, Enzyme will instead error with a warning about ambiguity): https://enzyme.mit.edu/julia/stable/api/#EnzymeCore.ReverseHolomorphic. @gdalle I'm not sure DI supports this yet? So perhaps that means you may need to just call Enzyme.jacobian / autodiff directly in that case.

@gdalle commented Jan 4, 2025

@jClugstor can you maybe specify where we will encounter complex numbers by filling the following table?

|                          | derivative | jacobian |
| ------------------------ | ---------- | -------- |
| complex inputs possible  | yes / no   | yes / no |
| complex outputs possible | yes / no   | yes / no |

When there are both complex inputs and complex outputs, that's where we run into trouble because we cannot represent derivatives as a single scalar. In that case, the differentiation operators are not clearly defined (the Jacobian matrix is basically twice as big as it should be) so we would need to figure out what convention the ODE solvers need (see https://discourse.julialang.org/t/taking-complex-autodiff-seriously-in-chainrules/39317).
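To spell out why the matrix doubles: writing $f = u + iv$ as a function of $z = x + iy$, the real Jacobian is

$$J_{\mathbb{R}} = \begin{pmatrix} \partial u / \partial x & \partial u / \partial y \\ \partial v / \partial x & \partial v / \partial y \end{pmatrix} \in \mathbb{R}^{2n \times 2n},$$

and it only collapses to a single complex $n \times n$ Jacobian when the Cauchy–Riemann equations $\partial u / \partial x = \partial v / \partial y$ and $\partial u / \partial y = -\partial v / \partial x$ hold, i.e. when $f$ is holomorphic.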

@wsmoses I understand your concern, but I find it encouraging that DI actually allowed Enzyme to be used here for the first time (or at least so I've been told). This makes me think that the right approach is to handle complex numbers properly in DI instead of introducing a special case for Enzyme?

Contributor

Sure, adding proper complex number support to DI would be great, but a three-line change here to use in-spec Complex support when there are already overloads for other ADTypes feels reasonable?

e.g. something like

function jacobian(f, x::AbstractArray{<:Complex}, integrator::WhatevertheTypeIs{<:AutoEnzyme})
  Enzyme.jacobian(ReverseHolomorphic, f, x)
end

From the discussion in JuliaDiff/DifferentiationInterface.jl#646, I think DI complex support is a much thornier issue. In particular, various tools have different conventions (e.g. JAX vs PyTorch pick different conjugates of what is propagated). So either DI needs to make a choice and shim/force all tools to use it (definitely doable), and then user code must be converted to that convention (e.g. a separate shim on the user side).

For example, suppose DI picked a different conjugate from ForwardDiff.jl. DI could write its shim once in ForwardDiff to convert, which is reasonable. But suppose one was defining a custom rule within ForwardDiff and the code called DI somewhere; now that user code needs to conditionally apply a different shim to conjugate, which feels kind of nasty to put everywhere (in contrast to a self-consistent assumption).

I suppose the other alternative is for DI to not pick a convention, but that again prevents users from using it, since it's not possible to know whether they get the correct value for them -- and worse, they won't know when they need to do a conversion or not.

Thus, if complex support is desired, a three line patch where things are explicitly supported seems okay (at least until the DI story is figured out)


I agree that for now, this change seems to do the job (although it raises the question of consistency with the other backends that are handled via DI). But what will happen if the function in question is not holomorphic? That's the thorniest part of the problem, and that's why I wanted to inquire a bit more as to what kind of functions we can expect. Perhaps @jClugstor or @ChrisRackauckas can tell us more?

In any case, I have started a discussion on Discourse to figure out the right conventions: https://discourse.julialang.org/t/choosing-a-convention-for-complex-numbers-in-differentiationinterface/124433


Also note that the Enzyme-specific fix only handles dense Jacobians, not sparse Jacobians (which are one of the main reasons to use DI in the first place)

Member Author

Sorry, I can't really tell you much about the complex number support, other than that previously only ForwardDiff or FiniteDiff were used, so when someone used an implicit solver on a complex problem, their conventions were used, I guess. Also, just to note: the code this comment is on is only making sure that the FiniteDiff fdtype isn't complex when the function is a function wrapper; it doesn't have to do with complex numbers in the solver in general.


The latest release of DI inches closer to support for complex numbers. I read a little about conventions for non-holomorphic differentiation and it was a mess, so as a starting point DI assumes that the function is holomorphic. If you want e.g. a Jacobian, it is pretty much the only convention that makes sense anyway, otherwise you end up with a $2n \times 2n$ matrix. I have added complex holomorphic test scenarios, and right now FiniteDiff works on them. I'll test Enzyme soon enough.

@avik-pal (Member) commented Jan 3, 2025

@avik-pal might already have one?

Add a dispatch to https://github.com/SciML/NonlinearSolve.jl/blob/master/lib/SciMLJacobianOperators/src/SciMLJacobianOperators.jl#L115

@jClugstor (Member Author)

@gdalle

Just to be clear, this wasn't possible before? So is this the first time that Enzyme can be used out-of-the-box to solve ODEs?

As far as I know, this is the first time Enzyme has been used for the implicit solvers, yes.

@jClugstor (Member Author)

@avik-pal I noticed that the constructors for your JacobianOperator take a NonlinearProblem but don't use it that much. Would it make sense to create a constructor JacobianOperator(f::AbstractSciMLFunction, ...) that does essentially the same thing and put it in SciMLOperators?

@avik-pal (Member) commented Jan 3, 2025

@avik-pal I noticed that the constructors for your JacobianOperator take a NonlinearProblem but don't use it that much. Would it make sense to create a constructor JacobianOperator(f::AbstractSciMLFunction, ...) that does essentially the same thing and put it in SciMLOperators?

The prepare_jvp and prepare_vjp functions assume a 2-argument (OOP) or 3-argument (IIP) function respectively; that won't hold for OrdinaryDiffEq.
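Illustrative signatures only (not code from either package), to make the mismatch concrete:

f_oop(u, p) = u .^ 2 .- p               # 2-arg out-of-place residual (NonlinearProblem style)
f_iip!(du, u, p) = (du .= u .^ 2 .- p)  # 3-arg in-place residual (NonlinearProblem style)
rhs!(du, u, p, t) = (du .= p .* u)      # ODE RHS: 4-arg in-place, with the extra t argument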

@oschulz commented Jan 4, 2025

We may also want to involve @oschulz and his AutoDiffOperators package to avoid duplication of efforts?

AutoDiffOperators is definitely open for any extensions/changes that would be necessary, and could also be moved from oschulz to a GitHub Julia org.

@jClugstor (Member Author)

AutoDiffOperators is definitely open for any extensions/changes that would be necessary, and could also be moved from oschulz to a GitHub Julia org.

@oschulz I think if we want to use it in OrdinaryDiffEq we should make sure that AutoDiffOperators adheres to the AbstractSciMLOperators interface, https://docs.sciml.ai/SciMLOperators/stable/interface/

@oschulz commented Jan 6, 2025

@oschulz I think if we want to use it in OrdinaryDiffEq we should make sure that AutoDiffOperators adheres to the AbstractSciMLOperators interface, https://docs.sciml.ai/SciMLOperators/stable/interface/

The way it is currently implemented, AutoDiffOperators is not opinionated there. To use LinearMaps.LinearMap as the operator type, you do

using AutoDiffOperators, ADTypes, LinearMaps
using DifferentiationInterface
import ForwardDiff, Enzyme

ad = AutoEnzyme()

f(x) = x.^2 + reverse(x)
x = rand(4)  # example input vector
y, J_op = with_jacobian(f, x, LinearMap, ad)

whereas

y, J_array = with_jacobian(f, x, Matrix, ad)

instantiates the full Jacobian in memory. So SciMLOperators could be supported via

y, J_op = with_jacobian(f, x, SciMLOperators.FunctionOperator, ad)

(or AbstractSciMLOperator instead of FunctionOperator or so).

This way, different operator "backends" can be implemented via Pkg extensions and the user can select which operator type they want.

@gdalle commented Jan 21, 2025

Hey @jClugstor, just a friendly ping, do you need a hand on this one?

@jClugstor (Member Author)

Not particularly, I've just been otherwise occupied the past few days.

I'm just going through all the tests and making sure that they pass one by one. Most of the difficulties have been related to making sure that the sparse AD works, for example making sure that when sparse AD is used, certain caches are constructed with sparse LU operators when needed, etc.

Next step is making sure the stats tests pass: previously, all of the function calls in the Jacobian/JacVec calculations were being counted. But obviously that doesn't hold when using DI, so we'll need a better way to count how many times the user-provided function has been called. I have some ideas already.

@gdalle commented Jan 22, 2025

Also note that counting function executions only makes sense for AD backends like FiniteDiff and ForwardDiff. When you use Enzyme, it doesn't actually call the function while differentiating it (that's the difference between source transformation and operator overloading). So I'm not sure how you plan to do the counting in that case?

@ChrisRackauckas (Member)

We count the number of times Enzyme is called.

@jClugstor (Member Author)

I got pretty close to getting the stats working, but I was off by one for some of them. I think the reason is that when the DI prep object is created while building the jac_config, using the sparsity detection and coloring tools, it calls the function with tracers as the inputs. So in the tests that counts as a call to the function. But it's not possible to increment the nf stat when building the Jacobian config, since we can't access the integrator at that point. So it's a little tricky.

Wrapping the user function so that a counter is incremented every time it's called would work for FiniteDiff and ForwardDiff, but it sounds like it won't work for Enzyme, Zygote, etc.

Maybe one way is to have those counter wrappers have custom AD rules, so that when they're differentiated, it just differentiates the function inside and increments the counter?
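One possible shape for such a wrapper, as a hypothetical sketch with made-up names (on its own this only helps backends that genuinely call the function, like FiniteDiff and ForwardDiff; Enzyme/Zygote would additionally need custom rules so the counter is incremented without being differentiated):

mutable struct CountedFunction{F}
    f::F
    ncalls::Int
end
CountedFunction(f) = CountedFunction(f, 0)

function (cf::CountedFunction)(du, u, p, t)
    cf.ncalls += 1  # bookkeeping only; a custom AD rule would skip differentiating this
    cf.f(du, u, p, t)
end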

@gdalle commented Jan 23, 2025

I think the reason is that when the DI prep object is created during the building of the jac_config, using the SparseMatrixColoring tools, it calls the function with tracers as the inputs.

Indeed that happens, but there might also be a call for reasons like getting the size and eltype of y = f(x). Do we really need to enforce a strict number of calls during preparation?

During execution too, the number of calls might be off by one, depending on the backend and how it handles things like value_and_jacobian

@jClugstor (Member Author)

I think for now it's reasonable to disregard the function calls from the Jacobian preparation. Ideally though I think we would like to make sure we count every function call. I think that can be for a later time. @oscardssmith does that sound ok?

@oscardssmith (Contributor)

sounds good.

@gdalle commented Jan 24, 2025

So what's next?

@jClugstor (Member Author)

Once I get the stats tests working again, it looks like this will be really close. Many of the failing CI tests are also failing on main.

But:

  • The Rosenbrock tests will need fixing
  • Resizing for the jac_configs needs to be fixed, that might get tricky
  • Many of the downstream integration tests will need to be fixed as well

@gdalle commented Jan 24, 2025

Resizing for the jac_configs needs to be fixed, that might get tricky

For this part specifically, I put a hidden preparation update function called prepare!_jacobian in DI, which can take an already existing prep object and work from it. At the moment this prep is simply discarded, but if you point me to the original code which does this config resizing, I will be able to port it to DI in order to save work with some backends.

@jClugstor (Member Author)

The relevant functions are here:

@gdalle commented Jan 24, 2025

What does it even mean to resize the Jacobian config for a sparse Jacobian? How do you anticipate the sparsity pattern for a bigger input variable 🤔 ?

@ChrisRackauckas (Member)

Think about a PDE semi-discretization in 1D: it's just a banded matrix where the bands are all the same, it's just the number of times you repeat the pattern. This is very common for example in adaptive meshing PDE solvers.
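A tiny illustration of that point (not PR code; it assumes a standard three-point stencil in 1D):

using SparseArrays

# tridiagonal sparsity pattern of a 1D three-point stencil with n grid points
stencil_pattern(n) = spdiagm(-1 => ones(Bool, n - 1), 0 => ones(Bool, n), 1 => ones(Bool, n - 1))

stencil_pattern(5)   # 5×5: lower band, diagonal, upper band
stencil_pattern(50)  # same bands, just repeated for the finer mesh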

@ChrisRackauckas (Member)

Here's a stop gap solution:

  1. Support resizing in dense cases
  2. Support resizing in auto-sparse cases by triggering a new sparsity detection on resize! of the cache
  3. Throw an informative error saying that support for sparse matrices will come in the future, point to an issue, and see who shows up.

I think if auto-sparse is fast enough, (3) might go away naturally. I know the Trixi case only really uses the SSP methods, so they would avoid this. Some of the finite element cases may be investigating (3), but we don't have full support already, so this basically all folds into the next OrdinaryDiffEq breaking v7 and we can just look forward, not backwards, on this. But I believe most finite element cases would appreciate (2) anyway?

The other thing that could be done is that the case of (3) would require the new sparsity pattern to be given with a special form of resize!. But we don't even have a public API way of doing that here (we have non-public ways...), so we can make the switch without such a thing existing.

@gdalle commented Jan 29, 2025

Within DI, the function prepare!_jacobian(f!, y, prep, backend, x) allows an existing preparation output prep to be resized and adapted to the new input x. By default, it re-runs the preparation from scratch (which, in the sparse case, includes sparsity detection). But we can override it at will if we have faster ways to resize. I have only done it for ForwardDiff so far, but FiniteDiff will soon follow (as soon as JuliaDiff/FiniteDiff.jl#191 is done). Here's how it looks:

https://github.com/JuliaDiff/DifferentiationInterface.jl/blob/5dfd7adec430c71a63f527a61962d5e5567e6702/DifferentiationInterface/ext/DifferentiationInterfaceForwardDiffExt/twoarg.jl#L372-L389
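A hedged usage sketch based on the signature quoted above (prepare!_jacobian is described as hidden rather than public API, so the exact call is illustrative):

using DifferentiationInterface, ADTypes
import ForwardDiff
const DI = DifferentiationInterface

f!(y, x) = (y .= x .^ 2)
backend = AutoForwardDiff()

x = rand(3); y = similar(x)
prep = DI.prepare_jacobian(f!, y, backend, x)
J = DI.jacobian(f!, y, prep, backend, x)      # 3×3 Jacobian

# after the problem is resized, update the existing prep instead of rebuilding from scratch
x2 = rand(5); y2 = similar(x2)
prep = DI.prepare!_jacobian(f!, y2, prep, backend, x2)
J2 = DI.jacobian(f!, y2, prep, backend, x2)   # 5×5 Jacobian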

Can you take it out for a spin, and tell me whether anything else is missing from DI in your opinion?

@jClugstor (Member Author)

I just finished fixing the resizing; it looks like it's working.

Up next: in-place DAEs aren't working with the tracing sparsity detection, due to the way the DAEResidualJacobianWrapper is set up.

@jClugstor (Member Author)

@gdalle I think this is ready for your review. Please let me know if there's anything related to DI here that I could be doing better. Thanks!

Comment on lines +89 to +91
color_alg = DiffEqBase.has_colorvec(prob.f) ?
SparseMatrixColorings.ConstantColoringAlgorithm(
sparsity, prob.f.colorvec) : SparseMatrixColorings.GreedyColoringAlgorithm()

Unless otherwise specified, this will be a column coloring (for forward-mode Jacobians only). I assume that's the only case you need?

Contributor

that's what we want to do by default. If the user has a super complicated case they can always request that themselves.

Comment on lines 95 to 96
autodiff = AutoSparse(
autodiff, sparsity_detector = sparsity_detector, coloring_algorithm = color_alg)

Reassigning autodiff to a differently-typed object might cause type instabilities

Contributor

I think SSA should deal with this (but it is always good to make sure with variable names)

grad_config = cache.grad_config

if autodiff_alg isa AutoFiniteDiff
grad_config = SciMLBase.@set grad_config.dir = diffdir(integrator)

Is grad_config the result of DI.prepare_derivative? If so, you cannot rely on its fields, they are internal implementation details. It's also dangerous to modify them after preparation.
Why can't you set the dir directly in the backend?

Member Author

Yes, it's from DI.prepare_derivative. If I recall correctly, I change the grad_config because when the derivative is calculated in one of the next lines, DI.derivative!(tf, linsolve_tmp, dT, grad_config, autodiff_alg, t), it will use the dir in the prep object and not the one in the backend.

Yeah, it's hacky, and it's not great since it ends up copying the prep object every time too.


Can't you change the backend before you call preparation?

Member Author

We don't have the information for diffdir when we create the preparation. The dir we need can actually change during the solve, so that makes this a bit tricky.


Not sure what to do here. Grabbing into the preparation result and manually changing it is invalid. Here it may not cause problems but that is not an API guarantee from DI's perspective.
Why does dir change during the solve?

Member Author

function diffdir(integrator::DiffEqBase.DEIntegrator)
    difference = maximum(abs, integrator.uprev) * sqrt(eps(typeof(integrator.t)))
    dir = integrator.tdir > zero(integrator.tdir) ?
          integrator.t > integrator.sol.prob.tspan[2] - difference ? -1 : 1 :
          integrator.t < integrator.sol.prob.tspan[2] + difference ? 1 : -1
end

I think this is basically checking to see if the finite difference step will go outside of the range of the tspan. If it does that's a problem for things like the interpolating adjoint sensitivity analysis, so we want the finitediff step to go in the opposite direction in that case.

f, fu_cache, autodiff, u, (fu,), DI.Constant(p), DI.Constant(t))
return @closure (vJ, v, u, p, t) -> begin
DI.pullback!(f, fu_cache, (reshape(vJ, size(u)),), di_prep, autodiff, u,
(reshape(v, size(fu_cache)),), DI.Constant(p), DI.Constant(t))

Beware that the output of reshape may have a different type than the fu you prepared with, thus making preparation invalid

Contributor

wouldn't this just be a bug in reshape?


Depends what you reshape. In general, you may end up with a ReshapedArray, that's the case I had in mind

julia> reshape(ones(8), 2, 4) |> typeof
Matrix{Float64} (alias for Array{Float64, 2})

julia> reshape(view(ones(10), 1:8), 2, 4) |> typeof
Base.ReshapedArray{Float64, 2, SubArray{Float64, 1, Vector{Float64}, Tuple{UnitRange{Int64}}, true}, Tuple{}}

Comment on lines 34 to 35
@test length(i.cache.nlsolver.cache.jac_config.pushforward_prep.xdual_tmp) == 5
@test length(i.cache.nlsolver.cache.jac_config.pushforward_prep.ydual_tmp) == 5

These are internal implementation details and may be renamed without notice. What are you trying to test?

Member Author

Basically I want to test to make sure that the jac_config was resized correctly.

Contributor

maybe just try running the jac?

@test length(i.cache.nlsolver.cache.jac_config.x1) == 5
@test length(i.cache.nlsolver.cache.jac_config.fx) == 5
@test length(i.cache.nlsolver.cache.jac_config.fx1) == 5
@test length(SparseMatrixColorings.column_colors(i.cache.nlsolver.cache.jac_config)) == 5

you can also use ncolors

(autodiff = AutoFiniteDiff(fdtype = Val{:central}()),),
(autodiff = AutoFiniteDiff(fdtype = Val{:complex}()),)]
@testset "$kwargs" for kwargs in [(autodiff = AutoForwardDiff(),)]
#(autodiff = AutoFiniteDiff(fdtype = Val{:forward}()),),

what's failing here?

Member Author

The number of function calls is not being counted correctly for FiniteDiff. The tests use a reference to count how many times the function is called, but the integrator stats object is calculated based on things like the number of chunks, the maximum color if it was sparse, whether the finite diff setting was forward or central, etc.

So there were some function calls that weren't being accounted for with the AutoFiniteDiff tests. Maybe from the preparation?


Yeah, we make no guarantees on the number of function calls during preparation.

Comment on lines 131 to 150
#L = StaticArrayInterface.known_length(typeof(u0))
#if L === nothing # dynamic sized
# If chunksize is zero, pick chunksize right at the start of solve and
# then do function barrier to infer the full solve
x = if prob.f.colorvec === nothing
length(u0)
else
maximum(prob.f.colorvec)
end
# x = if prob.f.colorvec === nothing
# length(u0)
# else
# maximum(prob.f.colorvec)
# end

# cs = ForwardDiff.pickchunksize(x)
# return remake(alg,
# autodiff = AutoForwardDiff(
# chunksize = cs, tag = tag))
#else # statically sized
# cs = pick_static_chunksize(Val{L}())
# cs = SciMLBase._unwrap_val(cs)
# return remake(
# alg, autodiff = AutoForwardDiff(chunksize = cs, tag = tag))
#end
Contributor

should this all get removed now?

Comment on lines 168 to 170
#function prepare_ADType(alg::DiffEqAutoAD, prob, u0, p, standardtag)

#end
Contributor

delete this?

@oscardssmith (Contributor)

Overall this looks pretty good to me.
