Importing data

This document describes the schema supported by the application and how to build it from the a resourcemap collection.

Input data schema

This tool can import data using the following schema, where each table is stored in a CSV file with headers.

facilities.csv

Field	Type
id	String
name	String
lat	Float
lng	Float
location_id	String
facility_type	String
ownership	String
address	String
contact_name	String
contact_email	String
contact_phone	String
opening_hours:en	String
opening_hours:am	String
photo	String (url)
last update	String (ISO-8601 encoded)

Note: there should be a opening_hours:LOCALE column for each of the enabled locales of the application.

category_groups.csv (Services, Equipment, etc)

Category groups serves for indicating how the assigned categories for each facility according to facility_categories.csv should be grouped and shown to the user. The group and categories name are localized.

Field	Type
id	String
name:en	String
name:am	String

Note: there should be a name:LOCALE column for each of the enabled locales of the application.

categories.csv

Field	Type
id	String
category_group_id	String
name:en	String
name:am	String

Note: there should be a name:LOCALE column for each of the enabled locales of the application.

facility_categories.csv

Field	Type
facility_id	String
category_id	String

facility_types.csv

Field	Type
name	String
priority	Int

Note: The priority of a facility type can be used to decide which facilities will be displayed in lower zoom levels. If a facility's type doesn't have a corresponding entry in the facilities table it will be assigned the lowest priority.

locations.csv

Field	Type
id	String
name	String
parent_id	String

Importing CSV data

The import script assumes CSV files with the following names:

data
├── input
    ├── categories.csv
    ├── category_groups.csv
    ├── facilities.csv
    ├── facility_types.csv
    ├── locations.csv
    └── services.csv

To import CSV data run the following:

$ bin/import-dataset data/input

Normalizing SPA Census information

To import data from raw CSV exports of SPA results, store raw files as follows:

data
├── input
└── raw
    ├── ContactInfo.csv
    ├── Facility.csv
    ├── FacilityService.csv
    ├── FacilityType.csv
    ├── geoloc.csv
    ├── i18n.csv
    ├── MedicalService.csv
    ├── OrganizationUnit.csv
    └── ownership.csv

And then run the following scripts to generate the normalized input files in the data/input directory:

$ bin/normalize-spa-data data/raw data/input

SPA data internationalization

The i18n.csv file is not actually part of the SPA result. It's schema depends of the desired locales. There should be one column for each locale.

Field	Type
en	String
am	String

Each row will contain the equivalent text that appear across the spa raw data.

en	am
Tb test	የነቀርሳ ምርመራ

In the above sample when a the service english name "Tb test" will be translated to "የነቀርሳ ምርመራ" when generating the services.csv name:en and name:am columns.

Normalizing Resourcemap information

To import data from ResourceMap, store the files as follows:

data
├── input
└── raw
    ├── fields.json
    ├── sites.csv
    └── i18n.csv
public
    └── pictures
        ├── F00001   # F[spaplus_fac_code] folder
        |   └── front.jpg
        |   ...
        └── F03455
            └── A Front Photo.jpg

The sites.csv file can be downloaded from the ResourceMap export option. The fields.json file, which contains the metadata and field description of the collection, can be downloaded from http://resourcemap.instedd.org/en/collections/COLLECTION_ID/fields.json.

And then run the following scripts to generate the normalized input files in the data/input directory:

$ bin/normalize-resmap-data data/raw data/input

Note: currently ResmapNormalization supports the schema of Ethiopia MFR - Official Collection. The following fields are expected to exist:

facility_type (hierarchy)
general_services (select many)
Admin_health_hierarchy (hierarchy)
ownership (hierarchy)
pocname, facility__official_email, facility__official_phone_number (text)
spaplus_fac_code (text) to determine picture directory

After the normalization is done, you might want to tweak the generated facility_types.csv file to choose, for each type of facility, the size of the marker the in map. The higher the prioity the bigger the marker size.

Resourcemap data internationalization

The i18n.csv is a translation table. It's schema depends of the desired locales. There should be one column for each locale.

Field	Type
en	String
am	String

Each row will contain the equivalent text that appear across the spa raw data.

en	am
Tb test	የነቀርሳ ምርመራ

In the above sample when a the service english name "Tb test" will be translated to "የነቀርሳ ምርመራ" when generating the services.csv name:en and name:am columns.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

importing.md

importing.md

Importing data

Input data schema

facilities.csv

category_groups.csv (Services, Equipment, etc)

categories.csv

facility_categories.csv

facility_types.csv

locations.csv

Importing CSV data

Normalizing SPA Census information

SPA data internationalization

Normalizing Resourcemap information

Resourcemap data internationalization

Files

importing.md

Latest commit

History

importing.md

File metadata and controls

Importing data

Input data schema

facilities.csv

category_groups.csv (Services, Equipment, etc)

categories.csv

facility_categories.csv

facility_types.csv

locations.csv

Importing CSV data

Normalizing SPA Census information

SPA data internationalization

Normalizing Resourcemap information

Resourcemap data internationalization