Skip to content

Wikidata Toolkit 0.4.0

Compare
Choose a tag to compare
@mkroetzsch mkroetzsch released this 27 Feb 15:23
· 1229 commits to master since this release

The fourth release of the Wikidata Toolkit introduces support for a variety of new Wikidata features, restores compatibility with current Wikidata file exports, and adds a stand-alone command-line client that can be used to process data. Main changes to the library are:

  • Support statements on property documents
  • More robust JSON parsing: recover after errors to process remaining file
  • Improved JSON serialization + an example program showing how to do it
  • Standard (POJO) datamodel implementation now is Serializable
  • Deep copy functionality for changing between datamodel implementations (DatamodelConverter)
  • Support for filtering data during copying (e.g., to keep only some languages/properties/sites).
  • Dependency on JSON.org has been removed to use the faster Jackson library everywhere

Important compatibility changes:

  • The datatype of all values in GlobeCoordinateValue (latitude, longitude, precision) has changed from long (fixed precision number) to double (floating point number) to match the JSON.
  • The JSON serializer class org.wikidata.wdtk.datamodel.json.JsonSerializer has vanished. It is replaced by the org.wikidata.wdtk.datamodel.json.jackson.JsonSerializer (almost same interface).

Artifacts are also released on Maven Central and it is strongly recommend to use this. To do this, insert the following dependency into your pom.xml:

<dependency>
    <groupId>org.wikidata.wdtk</groupId>
    <artifactId>wdtk-dumpfiles</artifactId>
    <version>0.4.0</version>
</dependency>

Similar configurations can be used for other modules (e.g., wdtk-rdf or wdtk-datamodel), but most dependencies will be resolved for you by Maven. Detailed instructions are given in the Wikidata Toolkit user documentation.