
BREAKING: Change expression types to DynamicExpressions.Expression (from DynamicExpressions.Node) #326

Merged
merged 209 commits into master on Oct 6, 2024

Conversation

MilesCranmer
Owner

@MilesCranmer MilesCranmer commented Jun 24, 2024

These new experimental Expression types store both the operators and the variable names inside the object itself, rather than the plain Node type, which only stores the tree structure and indices into the operator enum.
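
For reference, a rough sketch of the difference, loosely following the DynamicExpressions.jl README (the exact constructor keywords may differ between versions):

using DynamicExpressions

operators = OperatorEnum(; binary_operators=[+, -, *, /], unary_operators=[cos, exp])
variable_names = ["x1", "x2"]

# A plain `Node` tree only stores features, constants, and indices into the operator enum:
x1 = Node{Float64}(; feature=1)
x2 = Node{Float64}(; feature=2)
tree = x1 * cos(x2 - 3.2)

# An `Expression` bundles the tree together with the operators and variable names,
# so it can be evaluated and printed on its own:
ex = Expression(tree; operators, variable_names)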

This also adds ParametricExpression, for learning basis expressions whose constants vary depending on a per-row class:

using SymbolicRegression
using Random: MersenneTwister
using Zygote
using MLJBase: machine, fit!, predict

rng = MersenneTwister(0)
X = NamedTuple{(:x1, :x2, :x3, :x4, :x5)}(ntuple(_ -> randn(rng, Float32, 30), Val(5)))
X = (; X..., classes=rand(rng, 1:2, 30))
p1 = rand(rng, Float32, 2)
p2 = rand(rng, Float32, 2)

y = [
    2 * cos(X.x4[i] + p1[X.classes[i]]) + X.x1[i]^2 - p2[X.classes[i]] for
    i in eachindex(X.classes)
]

model = SRRegressor(;
    niterations=10,
    binary_operators=[+, *, /, -],
    unary_operators=[cos, exp],
    populations=10,
    expression_type=ParametricExpression,  # Subtype of `AbstractExpression`
    expression_options=(; max_parameters=2),
    autodiff_backend=:Zygote,
    parallelism=:multithreading,
)

mach = machine(model, X, y)
fit!(mach)
ypred = predict(mach, X)

so it basically learns $y = 2 \cos(x_4 + \alpha) + x_1^2 - \beta$ for parameters $\alpha$ and $\beta$, which can differ according to the classes column (here there are two classes, i.e., two types of behavior).

This ParametricExpression is just one implementation of AbstractExpression, but you can see how much more customizable things are now.
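
For example, after fitting, the discovered expressions can be inspected through MLJ's report; the field names below (`equations`, `equation_strings`, `best_idx`) follow the usual SRRegressor report layout, so check `report(mach)` if they differ:

using MLJBase: report

r = report(mach)
best = r.equations[r.best_idx]            # a `ParametricExpression` object
println(r.equation_strings[r.best_idx])   # human-readable form of the best expression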

Fixes #340. Fixes #337. Fixes #336.


TODO:

  • Allow passing a class feature to MLJ, which will receive special treatment.
  • Debug why some of the tests seem to get stuck and take 3x longer to finish than normal.
  • Consider documenting this, or just leaving it as an experimental undocumented feature until it stabilizes.
  • Add Enzyme backend.
  • Add example to docs.
  • Consider moving to Literate.jl for docs?
  • Fix ResourceMonitor weirdness

@atharvas

I was encountering some issues with constraint parsing in Options.jl; see the comment. I'm not sure why the test cases don't catch the issue.

@MilesCranmer
Owner Author

Going to punt StructuredExpression until later. @eelregit let me know if you are at all interested in this! StructuredExpression would let you evolve sub-expressions within a fixed functional form (a rough sketch of the idea is below). It seems there are a couple of missing methods needed for it to work, but hopefully it won't take too much effort. I'll have to pause on this side of things for now.
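
To illustrate the idea (purely hypothetical: StructuredExpression is not wired up in this PR, and the constructor keywords below, such as `structure`, are assumptions rather than a confirmed API), a fixed functional form where only the sub-expressions are evolved might look like:

using DynamicExpressions

operators = OperatorEnum(; binary_operators=[+, -, *, /], unary_operators=[cos, exp])
variable_names = ["x1", "x2", "x3"]

# Two evolvable sub-expressions...
f = parse_expression(:(x1 * x2); operators, variable_names)
g = parse_expression(:(x3 - 1.5); operators, variable_names)

# ...combined through a fixed structure y = f + cos(g):
my_structure(nt) = nt.f + cos(nt.g)
ex = StructuredExpression((; f, g); structure=my_structure, operators, variable_names)

On the SymbolicRegression.jl side, this would presumably be selected via `expression_type=StructuredExpression` plus appropriate `expression_options`, once the missing methods are in place.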

@MilesCranmer
Owner Author

It seems like garbage collection is going crazy in the tests, which is why they are so slow. The reason 1.6 and 1.8 are much faster is, I think, that DispatchDoctor.jl is turned off on those versions. So something about DispatchDoctor.jl is causing the GC to overwork itself... Possibly related to MilesCranmer/DispatchDoctor.jl#57 and MilesCranmer/DispatchDoctor.jl#58?

@MilesCranmer
Owner Author

MilesCranmer commented Oct 6, 2024

Fixed the performance regression in the unit tests with SymbolicML/DynamicExpressions.jl@74c8dc1.

Edit: it still seems to hang around a bit. Judging from the PProf outputs, it's definitely something to do with DispatchDoctor, so it won't affect actual runtime performance, just the testing. Probably fine to merge for now.

@MilesCranmer MilesCranmer merged commit 749cc34 into master Oct 6, 2024
21 checks passed