slacgismo · kperrynrel · Feb 20, 2025
diff --git a/ec2/README.md b/ec2/README.md
@@ -29,6 +29,10 @@ A new analysis task for insertion into the PV Validation Hub needs to contain ce
 - Data files - folder containing all csv files the analysis
 - Ground truth files - folder containing all results for each data file
 
+Below is a screenshot of an example folder containing all of the files and subfolders for an analysis insertion.
+
+![Example folder structure for an analysis insertion](file-structure-example.png)
+
 ### config.json
 
 Example JSON:
@@ -100,17 +104,46 @@ Example JSON:
 Required columns:
 
 ```csv
-system_id,name,azimuth,tilt,elevation,latitude,longitude,tracking,climate_type,dc_capacity
+system_id,name,latitude,longitude
+```
+
+Optional columns:
+
+```csv
+azimuth,tilt,elevation,tracking,dc_capacity
 ```
 
+Include as many optional columns as possible. For some data sets this may not be feasible because the underlying data is unavailable.
+
+
 ### file_metadata.csv
 
 Required columns:
 
 ```csv
-file_id,system_id,file_name,timezone,data_sampling_frequency,issue
+file_id,system_id,file_name
 ```
 
+Optional columns:
+
+```csv
+timezone,data_sampling_frequency,issue,data_type
+```
+
+Optional columns may vary based on the type of problem being solved, and are subject to change as needed.
+
+### ./file_data/ folder
+
+This folder contains all of the individual files that are fed into the runner to assess the associated algorithm. File names in this folder match the values in the `file_name` column of the `file_metadata.csv` file. Columns in these files can vary based on the type of inputs being assessed. A screenshot of an example file for the time shift problem is shown below.
+
+![Example input data file for the time shift problem](input-file-data.png)
+
+### ./validation_data/ folder
+
+This folder contains all of the files that contain the "ground-truth" results, to be assessed against the runner outputs. Files in this folder follow the same naming conventions as the files in the `./file_data/` folder, so that each file can be linked to its input data file counterpart. Data in these files will vary based on what target variable is being assessed. A screenshot of an example output file for the time shift problem is shown below.
+
+![Example ground-truth output file for the time shift problem](output-file-data.png)
+
 ### template.py (Marimo template with cli args input)
 
 Marimo python file will need to input data from `mo.cli_args()` method

diff --git a/ec2/file-structure-example.png b/ec2/file-structure-example.png
diff --git a/ec2/input-file-data.png b/ec2/input-file-data.png
diff --git a/ec2/output-file-data.png b/ec2/output-file-data.png
diff --git a/pv-validation-hub-client b/pv-validation-hub-client
+3 −11		.eslintrc.json
+1,559 −1,894		package-lock.json
+37 −38		package.json
+3 −3		postcss.config.mjs
+280 −286		src/app/(analyses)/analyses/analysis/page.tsx
+21 −25		src/app/(login)/login/page.tsx
+148 −160		src/app/(mysubmissions)/mysubmissions/private_report/page.tsx
+58 −65		src/app/(resources)/resources/page.tsx
+3 −1		src/app/globals.css
+48 −68		src/app/modules/analyses/customcard.tsx
+169 −165		src/app/modules/analyses/leaderboard/leaderboard.tsx
+211 −241		src/app/modules/analyses/upload/uploader.tsx
+49 −53		src/app/modules/resources/resourcecard.tsx
+38 −47		src/services/dashboard_service.tsx
+51 −61		src/services/submission_service.tsx
+33 −36		src/services/user_service.tsx
+0 −1		tsconfig.json
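
Review note on the `ec2/README.md` changes: the required columns and the file-name link between `file_metadata.csv`, `./file_data/`, and `./validation_data/` could be checked before an insertion is attempted. The following is only a minimal sketch and is not part of this PR. It assumes the system metadata file is named `system_metadata.csv` (the name is not visible in this diff), that each ground-truth file has exactly the same file name as its input file, and the helper names `missing_columns` and `check_task_folder` are hypothetical.

```python
import csv
from pathlib import Path

# Required columns as listed in the README diff.
REQUIRED_SYSTEM_COLUMNS = {"system_id", "name", "latitude", "longitude"}
REQUIRED_FILE_COLUMNS = {"file_id", "system_id", "file_name"}


def missing_columns(csv_path, required):
    """Return the required column names absent from the CSV header, sorted."""
    with open(csv_path, newline="") as handle:
        header = next(csv.reader(handle), [])
    return sorted(set(required) - set(header))


def check_task_folder(root):
    """Collect human-readable problems found in an analysis task folder."""
    root = Path(root)
    problems = []

    # Assumption: the system metadata file is named system_metadata.csv.
    for file_name, required in (
        ("system_metadata.csv", REQUIRED_SYSTEM_COLUMNS),
        ("file_metadata.csv", REQUIRED_FILE_COLUMNS),
    ):
        path = root / file_name
        if not path.is_file():
            problems.append(f"{file_name}: file not found")
            continue
        for column in missing_columns(path, required):
            problems.append(f"{file_name}: missing required column '{column}'")

    # Every file_name entry should exist in both data folders.
    # Assumption: ground-truth files reuse the input file's exact name.
    file_metadata = root / "file_metadata.csv"
    if file_metadata.is_file() and not missing_columns(file_metadata, {"file_name"}):
        with open(file_metadata, newline="") as handle:
            for row in csv.DictReader(handle):
                name = row["file_name"] or ""
                if not name:
                    continue  # skip rows with no file name
                for folder in ("file_data", "validation_data"):
                    if not (root / folder / name).is_file():
                        problems.append(f"{folder}/{name}: file not found")

    return problems
```

Calling `check_task_folder("path/to/task")` returns an empty list when nothing is missing, so the result can gate the insertion step. Optional columns are deliberately not checked, since the README states they vary by problem type.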