-
Notifications
You must be signed in to change notification settings - Fork 27
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Processing Parameter and File Management #373
Comments
#333 is related to this - what goes in which file and in which format. |
Should wait on #371 |
Note related to NeXus file structure: a valid NeXus file should contain at least one NXentry group (https://manual.nexusformat.org/classes/base_classes/NXentry.html#nxentry) Then, in Ewoks, we often add one child NXprocess per processing step where we store the processing results and configuration. The resulting structure could then be something like this:
This is just for information purpose: I am not saying you should follow this. Perhaps it doesn't even fit your usecase. |
Could have one NXprocess entry per file, then a master file that links them all together |
Here are some links to existing codes I could find elsewhere - it looks impossible to be compatible with anyone else - but perhaps some inspiration... For diffractometer geometry and an example using "Nexus" for crystallography, then Ray Osborn has a lot of good stuff here: https://github.com/nexpy/nxrefine. I just had a look at his code and found:
Elsewhere in Dials they can dump to NXreflections: For grain orientations then Nexus has: Further digging on file formats brings me to reading distortion files from XDS, presumably in CBF format, for MX and I found this which is importing pycbf that I started in 2005. There is a data model for ISpyB but I don't think it helps us. |
Seen today: motor positions of calibration fit parameters (frelx, etc) need to match motors for data being processed. Everything fails when the wrong calibration is used. Today, we don't save any instrument metadata with parameters... Something to fix in the future perhaps? |
Following on from #318, this issue concerns two problems:
For file management, we think the best solution is to be as minimal as possible.
Currently, the dataset class keeps track of all the files that are created, with embedded absolute paths in the H5. This is bad for several reasons (portability mainly).
However, it's convenient because we don't need to define all the paths at the top of each analysis notebook.
For file management, we propose the following:
For keeping track of the processing parameters, we propose the following:
Filename templates seem to need:
PROCESSED_DATA/{sample}/{sample}{dataset}{version}peaks_table.h5
PROCESSED_DATA/{version}/{sample}/{sample}{dataset}_peaks_table.h5
Proposed structure of grains file:
The text was updated successfully, but these errors were encountered: