Skip to content

Latest commit

 

History

History
37 lines (27 loc) · 1.86 KB

CREATE_CKAN_DATASET.md

File metadata and controls

37 lines (27 loc) · 1.86 KB

Preqrequisites

Add dataset

  • Step 1: Create dataset

    • Title: Lexington . An example would be 'Lexington Code Enforcement Complaints'
    • Add a description
    • Add license: 'Open Data Commons Public Domain Dedication and Licence (PDDL)'
    • Visibility: public
  • Step 2: Add specific data resource to the dataset

  • Step 3

    • Add dataset metadata
  • Custom Step 4

    • At this point we've created a CKAN resource. Visit that resource and find the resource id. It's in the url like so: http://www.civicdata.com/en/dataset/lexington-code-enforcement-complaints/resource/<resource-id>
    • For the example code enforcement dataset it ends up being ad346da7-ce88-4c77-a0e1-10ff09bb0622
    • Edit the resource and in the link to file input, change the value to http://www.civicdata.com/datastore/dump/<resource-id>
    • This is the url that will appear on the resource page that links to the full csv dump of the dataset.

Configure datastore for newly created resource

Now the resource is ready to be automatically upserted to the transformation!

Debugging tips

  • It can be difficult to understand why/how/when requests to civicdata are failing.
  • An API Traffic Inspector called runscope records HTTP requests and responses so that you can view, compare, and replay them.
  • An HTTP request goes through runscope like so: Pentaho -> Runscope -> civicdata -> Runscope -> Pentaho
  • It's highly recommended!