Kompass Pipeline

Leonard Pabst edited this page Jul 27, 2017 · 4 revisions

The Kompass pipeline describes the import of structured data from Kompass. The import follows the guidelines of the Structured Data Import. All relevant jobs are listed below in their execution order. Note that this pipeline assumes Implisense has already been imported.

Normalization

  1. KompassParse
  2. KompassDataLakeImport

Duplicate Detection

  1. Deduplication with config file deduplication_kompass.xml

Data Merge

  1. Merging
  2. MasterConnecting
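
The execution order above can be written down as data so that a driver script runs the jobs in sequence. The following is only a minimal sketch: the stage names, job names, and config file name come from this page, while `execution_order`, `run_pipeline`, and the `submit` callback are hypothetical helpers and not part of the actual code base. How a job is really submitted (for example to a cluster) is left to the caller.

```python
from typing import Callable, Dict, List, Optional, Tuple

# Stage and job names as listed on this page, in execution order.
PIPELINE: List[Tuple[str, List[str]]] = [
    ("Normalization", ["KompassParse", "KompassDataLakeImport"]),
    ("Duplicate Detection", ["Deduplication"]),
    ("Data Merge", ["Merging", "MasterConnecting"]),
]

# Config files required by specific jobs, as named on this page.
JOB_CONFIG: Dict[str, str] = {"Deduplication": "deduplication_kompass.xml"}


def execution_order() -> List[str]:
    """Flatten the stages into a single ordered list of job names."""
    return [job for _, jobs in PIPELINE for job in jobs]


def run_pipeline(submit: Callable[[str, Optional[str]], None]) -> None:
    """Call the (hypothetical) submit callback once per job, in order.

    submit receives the job name and its config file, or None if the
    job has no config file listed on this page.
    """
    for job in execution_order():
        submit(job, JOB_CONFIG.get(job))


if __name__ == "__main__":
    # Dry run: print what would be submitted instead of running anything.
    run_pipeline(lambda job, config: print(job, config or ""))
```

Keeping the order in one list makes it harder to run a later job, such as Merging, before Deduplication has finished.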

Back to Structured Data Import