Skip to content

Combined method of RF and Symbolic regression models #273

Answered by MilesCranmer
kyoungyi asked this question in Q&A
Discussion options

You must be logged in to vote

1.1) Compared to the important features (with interaction effects) from the RF model, the most important features do not show up in the final pysr equations (of multiple runs with different random states). I am confused by this inconsistency. Do you have any input on this?

If they don't show up over multiple different runs, maybe those features aren't actually important? Computing feature importance with RF isn't a guarantee. Note that with the latest versions of PySR after 2022 (after Jay's paper was done) selecting important features isn't really important, because the crossover operation does this implicitly. So I would just give all the features you have and PySR should be okay to h…

Replies: 1 comment 5 replies

Comment options

You must be logged in to vote
5 replies
@MilesCranmer
Comment options

@kyoungyi
Comment options

@kyoungyi
Comment options

@zhuyi-bjut
Comment options

@kyoungyi
Comment options

Answer selected by kyoungyi
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
PySR PySR-related discussion
3 participants