Linked data usually contains labels, comments, descriptions, etc. that depend on the natural language used. When linked data appears in a multilingual setting, it is a challenge to publish and consume it. This document presents a survey on the work done by the Best Practices for Multilingual Linked Open Data Group to identify a set of solutions and their benefits/drawbacks when applied in a given context.
This document was published by the Best Practices for Multilingual Linked Open Data comunity group. It is not a W3C Standard nor is it on the W3C Standards Track.
There are a number of ways that one may participate in the development of this report:
This report describes the work done by the Best Practices for Multilingual Linked Open Data.
The goal of the group is to identify common practices and patterns that can be applied to publish linked data in a multilingual context.
Descriptive URIs use ASCII characters that are combined to represent terms or abbreviations of terms in some natural language. It is usually done with terms in English or in other Latin-based languages, like French, Spanish, etc. where only a small fraction of their alphabets is outside ASCII characters.
http://example.org/Armenia
Some arguments in favour
Some drawbacks
Opaque URIs are resource identifiers which are not intended to represent terms in a natural language.
http://example.org/#I23AX45
Arguments in favor
Arguments against
This patterns consists of using unrestricted IRIs which can contain Unicode characters outside the ASCII repertoire.
http://օրինակ.օրգ/#Հայաստան
IRIs (Internationalized Resource Identifiers) are described in [[RFC3987]] and they are not restricted to ASCII characters.
Arguments in favor
Arguments against
IRIs where the domain part is restricted to ASCII characters while the path can use Unicode characters.
http://example.org#Հայաստան
Arguments in favor
Arguments against
/
or #
.
It may be difficult to see where the term is
(eg.: namespace is http://w3.org/html
and full name
is http://w3.org/htmldiv
).
One should define prefixes that end
with /
or #
http://hy.example.org/#Հայաստան
Arguments in favor
Arguments against
http://example.com/en/Armenia http://example.com/Armenia.en http://example.com/Armenia?lang=en
Arguments in favor
Arguments against
Return always the same triples without taking into account the
HTTP Accept-language
header.
Arguments in favor
Arguments against
The server attends the language preferences of the user agent,
presented in the Accept-language
header and
returns different data for each language preference.
Arguments in favor
Arguments against
The server attends the language preferences of the user agent,
presented in the Accept-language
header and
returns a 303 (see also) redirect to a resource with triples
in that language
Arguments in favor
Arguments against
Linked data datasets should provide labels for all resources: individuals, concepts and properties, not just the main entities.
A large amount of thanks goes out to the BPMLOD Community Group participants who worked through many of the technical issues on the mailing list and the telecons.
Thanks to the following individuals, in order of their first name, for their input on the report: ...here we should include the list of people that contributed