Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Local cache verification and update #38

Open
nocollier opened this issue Apr 8, 2024 · 1 comment
Open

Local cache verification and update #38

nocollier opened this issue Apr 8, 2024 · 1 comment
Labels
enhancement New feature or request

Comments

@nocollier
Copy link
Member

From the ESGF Compute Working Team: Especially in JupyterHub instances, it is common for teams to share a local cache of datasets on which they base their work. At some point (perhaps just before publishing a paper) they would like to update the datasets to their latest version. A few ideas and issues arise here:

  • We could do this automatically by walking through the cache and querying the dataset again and comparing versions.
  • Any updating should be loud and ask for confirmation, users need to understand what they are doing.
  • We should also look at checksums of the old files. It may be that a file has been changed locally to fix an error and an update would re-introduce that local fix.
  • This also carries the issue of one member of the group triggering an update that affects the analysis scripts written by others with no warning.
  • A first idea of how to address this would be that when to_dataset_dict is called, some local and perhaps hidden file should be saved detailing what was used. In this way when running the script again, we can check if there are differences and warn the user.
  • We probably need to do this anyway, as intake-esgf defaults to always downloading the most up to date data. So while the old version wouldn't be deleted from the cache, we would automatically (and silently) download and use the updated version.
@nocollier
Copy link
Member Author

Tagging those who I recall had input here in case I missed something. Will see what we can do here. @huard @aspinuso

@nocollier nocollier added the enhancement New feature or request label Apr 18, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant