Guidelines for LLOD aware services

This document recommends best practices for building Linguistic Linked Open Data (LLOD) aware web services. LLOD denotes the representation of linguistic resources in accordance with linked data principles. Under these principles, entities are identified by HTTP URIs that serve RDF-based information about the entity, including links to related entities. LLOD-aware services consume resources available as Linked Data as input, process them, and output an RDF resource that can in turn be published as Linked Data.

Recommendations

We recommend building LLOD-aware web services using RESTful interfaces. Each resource provided by a REST-service is given its own URI. HTTP methods operate on resources in the following way:

POST asks the service to create/index a new resource available as Linked Data, for instance by pointing the service to the resource's DataHub entry
PUT asks the service to update the index with the new version of a resource
GET asks the service to deliver the results of the algorithm it implements

We assume that each LLOD-aware service has a persistence layer in which snapshots of LD resources can be stored. This is because downloading a resource just in time, when the algorithm is actually invoked through a GET request, can take too long. Therefore, it is recommended to download and update relevant LD datasets asynchronously. There are multiple possible ways of providing RDF data to a service. For now, we assume that there is RDF-based metadata containing the location of an RDF dump in a http://www.w3.org/ns/dcat#accessURL field. This metadata should be the input to the service.
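As an illustration of the metadata lookup described above, the following is a minimal sketch that scans N-Triples metadata for http://www.w3.org/ns/dcat#accessURL statements. It relies on a simple regular expression rather than a full RDF parser, accepts the dump location either as an IRI or as a plain literal (an assumption, since the exact shape of the metadata is not specified here), and uses hypothetical example.org URLs. The function name find_access_urls is likewise an assumption made for this example.

```python
import re

DCAT_ACCESS_URL = "http://www.w3.org/ns/dcat#accessURL"

# Simplified patterns for one N-Triples statement per line.
# Assumption: the object is either an IRI or a plain literal without escapes.
_IRI_OBJECT = re.compile(r'^\s*<([^>]*)>\s+<([^>]*)>\s+<([^>]*)>\s*\.\s*$')
_LITERAL_OBJECT = re.compile(r'^\s*<([^>]*)>\s+<([^>]*)>\s+"([^"]*)"')


def find_access_urls(ntriples_text):
    """Return the objects of all dcat:accessURL statements, in document order."""
    urls = []
    for line in ntriples_text.splitlines():
        match = _IRI_OBJECT.match(line) or _LITERAL_OBJECT.match(line)
        if match and match.group(2) == DCAT_ACCESS_URL:
            urls.append(match.group(3))
    return urls


# Hypothetical metadata; the example.org URLs are placeholders.
metadata = (
    "<http://example.org/dataset/demo> "
    "<http://www.w3.org/ns/dcat#accessURL> "
    "<http://example.org/dumps/demo.nt> .\n"
    "<http://example.org/dataset/demo> "
    '<http://purl.org/dc/terms/title> "Demo" .\n'
)
dump_locations = find_access_urls(metadata)
```

A production service would more likely use an RDF library for parsing and would schedule the actual download of each location in a background job, in line with the asynchronous update recommended above.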

Results of the service should be returned in RDF/XML, Turtle or JSON-LD. Ideally, all three return types would be supported through content negotiation.
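Content negotiation over these three formats could look roughly like the sketch below. It maps the registered media types application/rdf+xml, text/turtle and application/ld+json to the serializations named above and picks the best match from an Accept header. The function name choose_media_type and the choice of Turtle as the fallback are assumptions made for this example; partial wildcards such as text/* are deliberately not handled.

```python
# Media types for the three recommended serializations.
SUPPORTED = {
    "application/rdf+xml": "RDF/XML",
    "text/turtle": "Turtle",
    "application/ld+json": "JSON-LD",
}
DEFAULT = "text/turtle"  # assumption: fallback when the client states no preference


def choose_media_type(accept_header):
    """Pick a supported media type for an Accept header, or None if none fits."""
    if not accept_header:
        return DEFAULT
    candidates = []
    for position, part in enumerate(accept_header.split(",")):
        fields = [field.strip() for field in part.split(";")]
        media_type = fields[0].lower()
        quality = 1.0
        for field in fields[1:]:
            if field.startswith("q="):
                try:
                    quality = float(field[2:])
                except ValueError:
                    quality = 0.0
        # Sort by descending quality, then by position in the header.
        candidates.append((-quality, position, media_type))
    for negative_quality, _, media_type in sorted(candidates):
        if negative_quality == 0:  # q=0 marks a type as not acceptable
            continue
        if media_type in SUPPORTED:
            return media_type
        if media_type == "*/*":
            return DEFAULT
    return None
```

When the function returns None, a service would typically answer with status code 406 (Not Acceptable); that behaviour is a common HTTP convention rather than something prescribed by these guidelines.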

An example service for linking terminological resources

Our example service, implemented as a proof of concept to illustrate the best practices described here, induces matches between two terminological resources in RDF and creates links between the associated concepts.

Terminologies are expected to use the lemon vocabulary for representing lexical resources in RDF. Entries are compared based on their lemon:writtenRep property. The concepts associated with entries via lemon:reference are regarded as equivalent if the written representations of their entries match. A link between the concepts is then created using skos:exactMatch.
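The matching step described above can be sketched as follows. This is not the service's actual implementation: it assumes that each terminology has already been reduced to (written representation, concept URI) pairs, that written representations are compared by exact string equality, and that language tags are ignored. The function name link_concepts and the example.org URIs are hypothetical.

```python
from collections import defaultdict

SKOS_EXACT_MATCH = "http://www.w3.org/2004/02/skos/core#exactMatch"


def link_concepts(source_entries, target_entries):
    """Link concepts whose entries share a written representation.

    Each argument is an iterable of (written_rep, concept_uri) pairs,
    i.e. the lemon:writtenRep of an entry and its lemon:reference.
    Returns a sorted list of (source, skos:exactMatch, target) triples.
    """
    # Index target concepts by written representation for fast lookup.
    targets_by_form = defaultdict(set)
    for written_rep, concept in target_entries:
        targets_by_form[written_rep].add(concept)

    links = set()
    for written_rep, concept in source_entries:
        for target_concept in targets_by_form.get(written_rep, ()):
            links.add((concept, SKOS_EXACT_MATCH, target_concept))
    return sorted(links)


# Hypothetical entries from two terminologies.
source = [("asylum", "http://example.org/a/asylum"),
          ("visa", "http://example.org/a/visa")]
target = [("asylum", "http://example.org/b/123"),
          ("border", "http://example.org/b/456")]
links = link_concepts(source, target)
```

Indexing one side by written representation keeps the comparison linear in the number of entries instead of comparing every pair of entries.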

Three basic operations are supported:

Adding a new resource to the database
Updating a resource in the database
Retrieving links between resources in the database

In line with the recommendation above, the linking service exposes a RESTful HTTP interface, which is described in the following.

Adding a new resource to the database

HTTP request

A new resource is added by sending an HTTP POST request to

http://sc-lider.techfak.uni-bielefeld.de/LinkingWebService/resource/{resourceURL}

{resourceURL} needs to point to RDF metadata giving the resource location as http://www.w3.org/ns/dcat#accessURL.
The only serialization format currently supported is N-Triples.

HTTP response

The service returns status code 202 for a valid request.
If the database already contains a resource for the given URL, the service returns status code 400.
If no valid resource could be found at the given URL, the service returns status code 422.
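The status codes listed above could be derived by a decision function along the following lines. This is only a sketch: the order in which the two error conditions are checked is an assumption, as are the function name status_for_add, the set-based index, and the has_valid_resource callback that stands in for fetching and validating the metadata.

```python
ACCEPTED = 202        # valid request, indexing is scheduled
BAD_REQUEST = 400     # resource URL is already in the database
UNPROCESSABLE = 422   # no valid resource found at the URL


def status_for_add(resource_url, indexed_urls, has_valid_resource):
    """Return the HTTP status code for a POST that adds resource_url.

    indexed_urls: collection of URLs already present in the database.
    has_valid_resource: callable returning True if valid RDF metadata
    can be obtained from the URL (hypothetical helper).
    """
    if resource_url in indexed_urls:
        return BAD_REQUEST
    if not has_valid_resource(resource_url):
        return UNPROCESSABLE
    return ACCEPTED


# Hypothetical state: one dataset is already indexed.
already_indexed = {"http://datahub.io/dataset/iate-rdf"}
code = status_for_add("http://datahub.io/dataset/emn",
                      already_indexed,
                      lambda url: True)
```

Status code 202 fits the asynchronous design: the request is accepted, while the download and indexing of the dump can complete later.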

Example call

Correspondingly, the call to ask the service to index the EMN dataset would be as follows:

POST http://sc-lider.techfak.uni-bielefeld.de/LinkingWebService/resource/http://datahub.io/dataset/emn

Note: Support for this operation is deactivated.

Updating a resource in the database

HTTP request

An existing resource is updated by sending an HTTP PUT request to

http://sc-lider.techfak.uni-bielefeld.de/LinkingWebService/resource/{resourceURL}

{resourceURL} needs to point to RDF metadata giving the resource location as http://www.w3.org/ns/dcat#accessURL.
The only serialization format currently supported is N-Triples.

HTTP response

The service returns status code 202 for a valid request.
If no valid resource could be found at the given URL, the service returns status code 422.

Example call

The call to ask the service to update the current version of the IATE dataset in the index is as follows:

PUT http://sc-lider.techfak.uni-bielefeld.de/LinkingWebService/resource/http://datahub.io/dataset/iate-rdf

Note: Support for this operation is deactivated.

Retrieving links between resources in the database

HTTP request

A linking for all concepts in a dataset is retrieved by sending an HTTP GET request to

http://sc-lider.techfak.uni-bielefeld.de/LinkingWebService/linking/dataset/{sourceURL}?target={targetURL}

A linking for a single concept is retrieved by sending an HTTP GET request to

http://sc-lider.techfak.uni-bielefeld.de/LinkingWebService/linking/concept/{sourceURL}?target={targetURL}

The optional parameter target can be specified in order to retrieve links to a single target resource. Otherwise, the entire database is searched for matches.
The RDF serialization format can be specified by setting the request's Accept header.
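A client could assemble such a request as in the sketch below, which only builds the request object and does not send it. The helper name build_linking_request and its parameters are assumptions made for this example. The source and target URLs are placed into the request URL unescaped, mirroring the URL patterns given above; whether the service also accepts percent-encoded values is not specified here.

```python
from urllib.request import Request

BASE = "http://sc-lider.techfak.uni-bielefeld.de/LinkingWebService"


def build_linking_request(source_url, target_url=None, scope="dataset",
                          accept="application/n-triples"):
    """Build (but do not send) a GET request for links.

    scope is "dataset" for all concepts of a dataset or "concept"
    for a single concept; target_url is the optional target parameter.
    """
    if scope not in ("dataset", "concept"):
        raise ValueError("scope must be 'dataset' or 'concept'")
    url = "{}/linking/{}/{}".format(BASE, scope, source_url)
    if target_url is not None:
        url += "?target={}".format(target_url)
    # The Accept header selects the RDF serialization of the response.
    return Request(url, headers={"Accept": accept}, method="GET")


request = build_linking_request(
    "https://datahub.io/dataset/emn",
    target_url="https://datahub.io/dataset/iate-rdf",
)
```

The resulting object can be passed to urllib.request.urlopen to perform the call.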

HTTP response

The set of generated links is returned as an N-Triples file, or in another RDF serialization format if the request specifies one.
If the given resource was not found in the index, the service returns status code 404.

Example call

The call to retrieve all the links between IATE and EMN would be as follows:

GET http://sc-lider.techfak.uni-bielefeld.de/LinkingWebService/linking/dataset/https://datahub.io/dataset/emn?target=https://datahub.io/dataset/iate-rdf

This call would return the following result:



european_migration_network:absconding                skos:exactMatch  iate:IATE-3544259 .
european_migration_network:accommodationcentre       skos:exactMatch  iate:IATE-878245 .
european_migration_network:acquisitionofcitizenship  skos:exactMatch  iate:IATE-3549121 .
european_migration_network:actofpersecution          skos:exactMatch  iate:IATE-3549123 .
european_migration_network:actorofprotection         skos:exactMatch  iate:IATE-3549124 .
…

where

european_migration_network: stands for http://ec.europa.eu/dgs/home-affairs/what-we-do/networks/european_migration_network/glossary/index_a_en.htm#
skos: stands for http://www.w3.org/2004/02/skos/core#
iate: stands for http://tbx2rdf.lider-project.eu/data/iate/
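For readers who want the full URIs behind the abbreviated result, the prefix expansion can be sketched as follows. The prefix table is taken from the list above; the function names expand and to_ntriples_line are assumptions made for this example.

```python
PREFIXES = {
    "european_migration_network": (
        "http://ec.europa.eu/dgs/home-affairs/what-we-do/networks/"
        "european_migration_network/glossary/index_a_en.htm#"
    ),
    "skos": "http://www.w3.org/2004/02/skos/core#",
    "iate": "http://tbx2rdf.lider-project.eu/data/iate/",
}


def expand(prefixed_name):
    """Expand a prefixed name such as 'skos:exactMatch' to a full URI."""
    prefix, separator, local_name = prefixed_name.partition(":")
    if not separator or prefix not in PREFIXES:
        raise ValueError("unknown prefix in {!r}".format(prefixed_name))
    return PREFIXES[prefix] + local_name


def to_ntriples_line(subject, predicate, obj):
    """Render one link with all three terms written as full IRIs."""
    return "<{}> <{}> <{}> .".format(
        expand(subject), expand(predicate), expand(obj))


line = to_ntriples_line("european_migration_network:absconding",
                        "skos:exactMatch",
                        "iate:IATE-3544259")
```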

Note: N-Triples is the only output format currently supported by our implementation.
