Skip to content

Releases: Wikidata/Wikidata-Toolkit

Wikidata Toolkit 0.3.0

30 Sep 19:21
Compare
Choose a tag to compare

The third release of the Wikidata Toolkit significantly improves the performance of v0.2.0, restores compatibility with current Wikidata file exports, and adds many new examples. Main changes are

  • Support for the new JSON format used in Wikidata dump files now
  • Much faster reading of data files (about 15min for the whole dump)
  • Extended example module with more, faster, easier-to-read programs
  • More convenient ways to construct data objects using the Datamodel class
  • Dump files are now downloaded to temporary files first to cope with aborted downloads
  • Additional helper methods and iterators for accessing data objects with less code

Artifacts are also released on Maven Central and it is strongly recommend to use this. To do this, insert the following dependency into your pom.xml:

<dependency>
    <groupId>org.wikidata.wdtk</groupId>
    <artifactId>wdtk-dumpfiles</artifactId>
    <version>0.3.0</version>
</dependency>

Similar configurations can be used for other modules (e.g., wdtk-rdf or wdtk-datamodel), but most dependencies will be resolved for you by Maven. Detailed instructions are given in the Wikidata Toolkit user documentation.

Wikidata Toolkit 0.2.0

06 Jun 21:58
Compare
Choose a tag to compare

The second release of the Wikidata Toolkit improves the dump processing capabilities of v0.1.0 and adds new serialization formats. Main changes are

  • Support for resolving site links
  • Simpler usage; less code needed for getting to work
  • New RDF serialization feature with many encoding options
  • Serialization support for external Wikibase JSON format
  • Some structural changes in the data model: snak groups

Artifacts are also released on Maven Central and it is strongly recommend to use this. To do this, insert the following dependency into your pom.xml:

<dependency>
    <groupId>org.wikidata.wdtk</groupId>
    <artifactId>wdtk-dumpfiles</artifactId>
    <version>0.2.0</version>
</dependency>

Similar configurations can be used for other modules (e.g., wdtk-rdf or wdtk-datamodel), but most dependencies will be resolved for you by Maven. Detailed instructions are given in the Wikidata Toolkit user documentation.

Wikidata Toolkit 0.1.0

27 Mar 16:54
Compare
Choose a tag to compare

This is the very first release of the Wikidata Toolkit. It provides basic functionality to download and process data dumps for Wikidata.org as provided by the Wikimedia Foundation. Using this code, it is already possible to implement a wide range of tools that proces the Wikidata content in a streaming fashion.

This version contains an initial implementation of the Wikibase datamodel that already captures many of the core features. Some changes to this model are expected for upcoming versions.

It is planned to release artifacts to Maven Central in due course, which will then be the preferred installation method.