Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

consider kwarg for mapping noble gases to a "nearby" element or (better) using different elemental featurizer #82

Closed
sgbaird opened this issue Jun 11, 2022 · 13 comments · Fixed by #117
Assignees
Labels
enhancement New feature or request hyperparameter Hyperparameters to consider optimizing

Comments

@sgbaird
Copy link
Member

sgbaird commented Jun 11, 2022

e.g. $He \rightarrow H\space\mathrm{or}\space Li$

Issue with generated structures having noble gases might be helped by:

@sgbaird sgbaird added enhancement New feature or request hyperparameter Hyperparameters to consider optimizing labels Jun 11, 2022
@sgbaird sgbaird changed the title [FEATURE REQUEST] kwarg for mapping noble gases to a "nearby" element consider kwarg for mapping noble gases to a "nearby" element Jun 11, 2022
@kjappelbaum
Copy link
Contributor

I do not know if you already investigated this, but sometimes it tends to help to not encode atomic numbers but something that is closely related to chemistry (and varying with a more consistent trend across the PTE) such as the electronegativity.

Maybe one could give the user the option to specify any Element property as the encoding of the "chemistry" of a given site. It could default to the atomic number but in this way, you can easily switch to X, electron_affinity, or whatever.

If you want me to make a PR for that, simply assign me to this issue.

@sgbaird
Copy link
Member Author

sgbaird commented Jun 11, 2022

That would be great! The modified Pettifor scale is one that I considered, mostly based on prior experience with ElMD (see e.g. mat_discover). I like the idea of using an Element property to keep it flexible.

@kjappelbaum
Copy link
Contributor

Since I ended up writing the same code in different packages, I started collecting it in element-coder.

@sgbaird sgbaird changed the title consider kwarg for mapping noble gases to a "nearby" element consider kwarg for mapping noble gases to a "nearby" element or (better) using different elemental featurizer Jun 18, 2022
@sgbaird
Copy link
Member Author

sgbaird commented Jun 18, 2022

@kjappelbaum is element-coder in a state that it can be incorporated into xtal2png?

@kjappelbaum
Copy link
Contributor

I think so - I use it already in one of my other projects. I was waiting for the fit but then got distracted by other things. Will give it a stab this WE!

@sgbaird
Copy link
Member Author

sgbaird commented Jun 18, 2022

Ok, cool! Not sure if you saw, fit is implemented now #91, though some of the decisions I made on how to set it up were somewhat arbitrary. Lmk if you think something should change with that approach.

Thank you! I appreciate you getting involved despite many other priorities I'm sure

@sgbaird
Copy link
Member Author

sgbaird commented Jun 18, 2022

See

atomic_numbers = [s.atomic_numbers for s in S]
in particular (small refactor about to pull into main in #112 (EDIT) #113)

@kjappelbaum
Copy link
Contributor

at least for regression, the encoding makes a huge difference: Here on a MOF dataset, showing top-100-in-top-100 (greater is better), contrasting the mod. Pettifor with electronegativity.

I'm not using the xtal2png for this plot but revised autocorrelation functions, but wouldn't be surprised to also see an impact here.
Screen Shot 2022-06-20 at 20 52 48

@sgbaird
Copy link
Member Author

sgbaird commented Jun 20, 2022

Awesome, looking forward to seeing how this impacts xtal2png. Thanks for sharing!

@sgbaird
Copy link
Member Author

sgbaird commented Jun 24, 2022

@kjappelbaum this weekend I'm planning to try doing a distributed hyperparameter optimization on my uni's HPC via RayTune. Getting RayTune and Slurm to play nicely together might take some time (my first time connecting the two), but my goal is to at least get a dummy run operating. Two hyperparameters that I think would be really important to have for the full run (which might not happen until next week) are the elemental encodings and lexicographic site sorting (#75).

@kjappelbaum
Copy link
Contributor

ok, the PR is anyhow close to being done. I'll finish it tomorrow morning

@kjappelbaum
Copy link
Contributor

But also looking forward to seeing the RayTune + Slurm code 👍🏽

@sgbaird
Copy link
Member Author

sgbaird commented Jun 24, 2022

ok, the PR is anyhow close to being done. I'll finish it tomorrow morning

Awesome, I just took a closer look at the changes and am seeing that now. Thank you!

But also looking forward to seeing the RayTune + Slurm code 👍🏽

Sounds good! I hope I don't disappoint haha

@sgbaird sgbaird linked a pull request Jul 8, 2022 that will close this issue
3 tasks
@sgbaird sgbaird closed this as completed Jul 8, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request hyperparameter Hyperparameters to consider optimizing
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants