This best practice describes how to exploit Linguistic Linked Data resources. The suggested steps for exploitation comprise:

This document was published by the Best Practices for Multilingual Linked Open Data community group. It is not a W3C Standard nor is it on the W3C Standards Track.

There are a number of ways that one may participate in the development of this report:

Use Case

Let us consider the example of a company developing sentiment analysis and opinion mining software that has a working system for the English language and wants to port the system to also support German. The company wants to find a corpus that is annotated at the sentiment level and extract a first seed lexicon of German subjective expressions with their polarity (positive, negative, neutral).

Method

In order to exploit Linguistic Linked Data resources, the above mentioned methodology can be implemented as follows: If LIDER guidelines are followed during publication and metadata provision for resources and if the resource is registered at either Metashare, CLARIN VO, LRE Map or DataHub, LingHub will crawl the resource and index the resource with the appropriate metadata. Further, if de facto standards and vocabularies as recommended by LIDER are followed, then the same extraction patterns can be used to extract data from different datasets.

Use Case Revisited

Our company looking for a German lexicon would follow the above sketched methodology as follows: the company would obtain a seed lexicon of subjective phrases with their polarity as a result.