Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Description
This PR adds the ability to export basic datasets to hdf5 files, currently, we extract the energies and gradients but more properties could be included in future.
Layout
Each molecule has its own group under its fixed hydrogen layer inchikey with the following datasets:
smiles
: The mapped explicit hydrogen smiles which can be used to construct the molecule via the openff-toolkit usingMolecule.from_mapped_smiles
atomic_numbers
: An array of the atomic numbers type int16charge
: The total charge on the molecule calculated by the openff-toolkit as the sum of formal charges, type int16specification
: Themethod:basis
used to compute the results type h5py string.energies
: An array of energies for the molecule in units ofhartree
in the same order as the conformations, soenergies[i]
corresponds toconformation[i]
type float64conformations
: An array of conformations for the molecule inbohr
type float64gradients
: An array of gradients in unitshartree / bohr
in the same order as the conformations, sogradients[i]
corresponds toconformation[i]
type float64.Todos
Notable points that this PR has either accomplished or will accomplish.
Questions
Status