Skip to content

Commit

Permalink
Updated docs
Browse files Browse the repository at this point in the history
  • Loading branch information
ctuning-admin committed May 26, 2023
1 parent 90376a2 commit eca4465
Show file tree
Hide file tree
Showing 3 changed files with 50 additions and 21 deletions.
8 changes: 6 additions & 2 deletions cm-mlops/script/download-and-extract/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -110,9 +110,9 @@ ___
- Environment variables:
- *CM_DAE_EXTRACT_DOWNLOADED*: `yes`
- Workflow:
* `_remove-extracted`
* `_no-remove-extracted`
- Environment variables:
- *CM_DAE_REMOVE_EXTRACTED*: `yes`
- *CM_DAE_REMOVE_EXTRACTED*: `no`
- Workflow:
* `_url.#`
- Environment variables:
Expand Down Expand Up @@ -194,10 +194,14 @@ ___
* `CM_DAE_DOWNLOAD_CMD`
* `CM_DAE_DOWNLOAD_TOOL`
* `CM_DAE_EXTRACTED_CHECKSUM_CMD`
* `CM_DAE_EXTRACTED_FILENAME`
* `CM_DAE_EXTRACT_CMD`
* `CM_DAE_EXTRACT_TOOL`
* `CM_DAE_EXTRACT_TOOL_OPTIONS`
* `CM_DAE_FILENAME`
* `CM_DAE_FILE_DOWNLOADED_PATH`
* `CM_DAE_FILE_EXTRACTED_PATH`
* `CM_DAE_FINAL_ENV_NAME`
* `CM_DAE_GZIP`
___
### Maintainers
Expand Down
1 change: 1 addition & 0 deletions cm-mlops/script/get-dataset-imagenet-train/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -134,6 +134,7 @@ ___
1. ***Run "preprocess" function from [customize.py](https://github.com/mlcommons/ck/tree/master/cm-mlops/script/get-dataset-imagenet-train/customize.py)***
1. ***Read "prehook_deps" on other CM scripts from [meta](https://github.com/mlcommons/ck/tree/master/cm-mlops/script/get-dataset-imagenet-train/_cm.json)***
* download,torrent
* `if (CM_DATASET_IMAGENET_TRAIN_REQUIRE_TORRENT == yes)`
* CM names: `--adr.['download-torrent']...`
- CM script: [download-torrent](https://github.com/mlcommons/ck/tree/master/cm-mlops/script/download-torrent)
* download,extract,file,_extract
Expand Down
62 changes: 43 additions & 19 deletions cm-mlops/script/get-dataset-imagenet-val/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -109,39 +109,53 @@ ___
<details>
<summary>Click here to expand this section.</summary>

* `_2012`
- Environment variables:
- *CM_DATASET_VER*: `2012`
- Workflow:
* `_2012-1`
- Environment variables:
- *CM_DATASET_SIZE*: `1`
- Workflow:
* **`_2012-500`** (default)
- Environment variables:
- *CM_DATASET_SIZE*: `500`
* `_2012-500`
- Workflow:
* `_2012-full`
- Environment variables:
- *CM_DATASET_SIZE*: `50000`
- *CM_IMAGENET_FULL*: `yes`
- Workflow:
* `_full`
- Environment variables:
- *CM_DATASET_SIZE*: `50000`
- *CM_IMAGENET_FULL*: `yes`
- *CM_DAE_DOWNLOADED_FILENAME*: `ILSVRC2012_img_val.tar`
- *CM_DAE_DOWNLOADED_CHECKSUM*: `29b22e2961454d5413ddabcf34fc5622`
- Workflow:

</details>


* Group "**count**"
<details>
<summary>Click here to expand this section.</summary>

* `_size.#`
- Environment variables:
- *CM_DATASET_SIZE*: `#`
- Workflow:
* **`_size.500`** (default)
- Environment variables:
- *CM_DATASET_SIZE*: `500`
- *CM_DAE_URL*: `https://www.dropbox.com/s/57s11df6pts3z69/ILSVRC2012_img_val_500.tar`
- Workflow:

</details>


* Group "**dataset-version**"
<details>
<summary>Click here to expand this section.</summary>

* **`_2012`** (default)
- Environment variables:
- *CM_DATASET_VER*: `2012`
- Workflow:

</details>


#### Default variations

`_2012-500`
`_2012,_size.500`
#### Default environment

<details>
Expand All @@ -158,14 +172,22 @@ ___
<details>
<summary>Click here to expand this section.</summary>

1. Read "deps" on other CM scripts from [meta](https://github.com/mlcommons/ck/tree/master/cm-mlops/script/get-dataset-imagenet-val/_cm.json)
1. ***Read "deps" on other CM scripts from [meta](https://github.com/mlcommons/ck/tree/master/cm-mlops/script/get-dataset-imagenet-val/_cm.json)***
* detect,os
- CM script: [detect-os](https://github.com/mlcommons/ck/tree/master/cm-mlops/script/detect-os)
1. ***Run "preprocess" function from [customize.py](https://github.com/mlcommons/ck/tree/master/cm-mlops/script/get-dataset-imagenet-val/customize.py)***
1. Read "prehook_deps" on other CM scripts from [meta](https://github.com/mlcommons/ck/tree/master/cm-mlops/script/get-dataset-imagenet-val/_cm.json)
1. ***Read "prehook_deps" on other CM scripts from [meta](https://github.com/mlcommons/ck/tree/master/cm-mlops/script/get-dataset-imagenet-val/_cm.json)***
* download,torrent
* `if (CM_DATASET_IMAGENET_VAL_REQUIRE_TORRENT == yes)`
* CM names: `--adr.['download-torrent']...`
- CM script: [download-torrent](https://github.com/mlcommons/ck/tree/master/cm-mlops/script/download-torrent)
* download,extract,file,_extract
* `if (CM_DATASET_IMAGENET_VAL_REQUIRE_DAE == yes)`
- CM script: [download-and-extract](https://github.com/mlcommons/ck/tree/master/cm-mlops/script/download-and-extract)
1. ***Run native script if exists***
* [run.bat](https://github.com/mlcommons/ck/tree/master/cm-mlops/script/get-dataset-imagenet-val/run.bat)
* [run.sh](https://github.com/mlcommons/ck/tree/master/cm-mlops/script/get-dataset-imagenet-val/run.sh)
1. Read "posthook_deps" on other CM scripts from [meta](https://github.com/mlcommons/ck/tree/master/cm-mlops/script/get-dataset-imagenet-val/_cm.json)
1. Run "postrocess" function from customize.py
1. ***Run "postrocess" function from [customize.py](https://github.com/mlcommons/ck/tree/master/cm-mlops/script/get-dataset-imagenet-val/customize.py)***
1. Read "post_deps" on other CM scripts from [meta](https://github.com/mlcommons/ck/tree/master/cm-mlops/script/get-dataset-imagenet-val/_cm.json)
</details>

Expand All @@ -174,12 +196,14 @@ ___
#### New environment keys (filter)

* `CM_DATASET_IMAGENET_PATH`
* `CM_DATASET_IMAGENET_VAL_PATH`
* `CM_DATASET_PATH`
* `CM_DATASET_SIZE`
* `CM_DATASET_VER`
#### New environment keys auto-detected from customize

* `CM_DATASET_IMAGENET_PATH`
* `CM_DATASET_IMAGENET_VAL_PATH`
* `CM_DATASET_PATH`
___
### Maintainers
Expand Down

0 comments on commit eca4465

Please sign in to comment.