Add function to validate target data #197

annakrystalli · 2024-11-20T15:13:13Z

No description provided.

zkamvar · 2024-11-20T17:10:20Z

My opinion (which is widely considered trash, but without trash we would not appreciate the beauty of the natural world) is that we should first come up with a standard for the target data before writing any functionality. Importantly, it should follow these guiding principles:

1. well-defined: the time series data should have clear mappings to the task IDs in the hub (this is implicit for oracle data since it is derived from time series and should match the model output)
2. clear: these data should be easy for a hub administrator to write and store
3. general: someone without access to hubverse tools should still be able to read in these data (both time series and oracle outputs) and operate on them without requiring them to write code that is specific to one particular hub
4. format-agnostic: in line with model output data

For item 1, I'm specifically thinking about how we frame guidelines for hub administrators who are not very comfortable working with GitHub or code.

For item 2, I'm thinking about tooling that would need to be written in something like https://github.com/hubverse-org/hub-dashboard-predtimechart to generate data for predtimechart or another visualization.

I discussed potential solutions for this briefly in hubverse-org/hubDocs#208 (comment):

there still needs to be a bit more structure to having something that we can use to consistently read in the target data. I can see a few ways of addressing this:

1. mandate specific column and file names
2. add configurations to the admin schema that define the path to the time series and oracle output data, mapping column names to targets.
3. the same as 2, but having a specific targets.json spec.
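As a rough illustration of options 2 and 3, here is a minimal sketch of what a config-driven check could look like. Everything in it is an assumption made for illustration: the layout of the `targets.json` content, the key names (`task_id_columns`, `observation_column`), the example path, and the function `validate_target_columns` are hypothetical and are not part of any existing hubverse schema or package.

```python
import csv
import io
import json

# Hypothetical targets.json content (NOT an existing hubverse schema).
# It maps the time series file to the columns a validator should expect.
TARGETS_CONFIG = json.loads("""
{
  "time_series": {
    "path": "target-data/time-series.csv",
    "task_id_columns": ["location", "target_end_date"],
    "observation_column": "observation"
  }
}
""")


def validate_target_columns(csv_text, spec):
    """Return a list of problems found in a target data table (empty if none).

    Reading the file from spec["path"] is left out so the sketch stays
    self-contained; the CSV content is passed in as text instead.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    header = reader.fieldnames or []
    required = list(spec["task_id_columns"]) + [spec["observation_column"]]
    problems = [f"missing column: {name}" for name in required if name not in header]
    if problems:
        return problems
    for line_number, row in enumerate(reader, start=2):  # line 1 is the header
        try:
            float(row[spec["observation_column"]])
        except (TypeError, ValueError):
            problems.append(f"line {line_number}: non-numeric observation")
    return problems


good = "location,target_end_date,observation\nUS,2024-11-16,123\n"
bad = "location,date,observation\nUS,2024-11-16,123\n"
print(validate_target_columns(good, TARGETS_CONFIG["time_series"]))
print(validate_target_columns(bad, TARGETS_CONFIG["time_series"]))
```

The point of the sketch is that the validator itself contains no hub-specific names: the expected columns come entirely from the config, which is what would let the same code work across hubs.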

github-project-automation bot added this to hubverse Development overview Nov 20, 2024

github-project-automation bot moved this to Todo in hubverse Development overview Nov 20, 2024

zkamvar mentioned this issue Jan 8, 2025

[RFC] Target data file formats and organisation reichlab/decisions#10

Merged
