Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve CSW-based harvesting #184

Merged
merged 13 commits into from
Jun 6, 2022

Conversation

ricardogsilva
Copy link
Collaborator

@ricardogsilva ricardogsilva commented May 30, 2022

This PR implements customization of the harvesting extension in order to be able to generate new EMC datasets from remote CSW-based sources.

Implementation uses a new CKAN plugin named HarvestingPlugin - this is due to bug ckan/ckanext-spatial#277 in ckanext-spatial which prevents having both custom validators and customizing harvesting in the same plugin.

The new HarvestingPlugin plugin implements the ISpatialHarvest interface that is provided by ckanext-spatial and then provides a custom get_package_dict() method which transforms the data_dict as created by the CSW harvester into the structure expected by the EMC datasets.

Note that the datasets that are harvested are explicitly not made public by default - they must go through the same manual inspection stage that normal EMC datasets go through. This means that an organization member with the 'org admin' role needs to manually mark each harvested dataset as public.

fixes #177

@ricardogsilva ricardogsilva marked this pull request as ready for review June 6, 2022 17:09
@ricardogsilva ricardogsilva merged commit 889f2a1 into kartoza:main Jun 6, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Improve harvesting from a CSW endpoint
1 participant