Given a term t in a language L, the URI is constructed as follows:
The term t is encoded using Unicode, and the NFC normalization procedure is applied to ensure a unique representation. Conventional unnormalized Unicode allows encoding a character such as "à" in either a composed or a decomposed form.
Octets that need escaping are percent-encoded, for example as %4D, with the respective octet value stored as two upper-case hexadecimal digits.
The base address http://www.lexvo.org/id/term/ as well as the ISO 639-3 code for the language L followed by the "/" character are prepended to this path segment to obtain a complete URI.
A term URI abiding by this specification refers to the term t in language L.
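The term URI construction can be sketched in Python as follows. This is a minimal illustration rather than the reference implementation: the function name term_uri is hypothetical, and the choice of UTF-8 as the octet encoding and of the RFC 3986 unreserved set as the characters left unescaped are assumptions not spelled out in the text above.

```python
import unicodedata
from urllib.parse import quote

TERM_BASE = "http://www.lexvo.org/id/term/"


def term_uri(term: str, lang: str) -> str:
    """Build a term URI for `term` in the language with ISO 639-3 code `lang`.

    Hypothetical helper: UTF-8 and the unreserved-character set are assumptions.
    """
    # NFC normalization gives composed and decomposed input the same form.
    normalized = unicodedata.normalize("NFC", term)
    # quote() percent-encodes every octet outside the unreserved set,
    # using two upper-case hexadecimal digits per octet.
    segment = quote(normalized, safe="", encoding="utf-8")
    # Prepend the base address, the language code, and "/".
    return f"{TERM_BASE}{lang}/{segment}"
```

Under these assumptions, the composed and the decomposed spelling of "à" yield the same URI, which is the point of the normalization step.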
A language URI consists of the base address http://www.lexvo.org/id/iso639-3/
followed by an ISO 639-3 language code that is not defined as a special code.
A language URI abiding by this specification refers to the language denoted by the language code according
to the ISO 639-3 standard.
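A language URI could be built along these lines. The function name language_uri is hypothetical, and the check covers only the three-letter form of the code plus the four ISO 639-3 special codes (mis, mul, und, zxx); it does not verify that the code is actually assigned in the standard.

```python
import re

LANGUAGE_BASE = "http://www.lexvo.org/id/iso639-3/"
# The four codes that ISO 639-3 defines as special codes.
SPECIAL_CODES = {"mis", "mul", "und", "zxx"}


def language_uri(code: str) -> str:
    """Build a language URI from an ISO 639-3 code (hypothetical helper)."""
    # Syntactic check only: three lower-case ASCII letters.
    if not re.fullmatch(r"[a-z]{3}", code):
        raise ValueError(f"not a well-formed ISO 639-3 code: {code!r}")
    if code in SPECIAL_CODES:
        raise ValueError(f"special code not allowed: {code!r}")
    return LANGUAGE_BASE + code
```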
A script URI consists of the base address http://www.lexvo.org/id/script/
followed by an ISO 15924 script code other than Zxxx, Zyyy, or Zzzz. A script URI abiding by this specification refers to the script denoted
by the code according to the ISO 15924 standard.
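The script URI rule can be sketched the same way. The name script_uri is hypothetical, and the validation is purely syntactic (one upper-case letter followed by three lower-case letters); whether the code is registered in ISO 15924 is not checked.

```python
import re

SCRIPT_BASE = "http://www.lexvo.org/id/script/"
# Codes explicitly excluded by the rule above.
EXCLUDED_SCRIPT_CODES = {"Zxxx", "Zyyy", "Zzzz"}


def script_uri(code: str) -> str:
    """Build a script URI from an ISO 15924 code (hypothetical helper)."""
    # Syntactic check only: four letters, the first one capitalized.
    if not re.fullmatch(r"[A-Z][a-z]{3}", code):
        raise ValueError(f"not a well-formed ISO 15924 code: {code!r}")
    if code in EXCLUDED_SCRIPT_CODES:
        raise ValueError(f"excluded script code: {code!r}")
    return SCRIPT_BASE + code
```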
A character URI consists of the base address http://www.lexvo.org/id/char/
followed by a Unicode code point in upper-case hexadecimal notation,
zero-padded to 4 digits if shorter than 4 digits and without additional zero-padding if longer.
A character URI abiding by this specification refers to the character denoted by the code point
according to the Unicode 5.0 standard.
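The padding rule for character URIs maps directly onto a format specifier with a minimum width of 4. The sketch below uses a hypothetical function name, char_uri, and accepts exactly one code point; it does not check that the code point is assigned in Unicode 5.0.

```python
CHAR_BASE = "http://www.lexvo.org/id/char/"


def char_uri(char: str) -> str:
    """Build a character URI for a single code point (hypothetical helper)."""
    if len(char) != 1:
        raise ValueError("expected exactly one Unicode code point")
    # 04X: upper-case hex, zero-padded to at least 4 digits, never truncated.
    return f"{CHAR_BASE}{ord(char):04X}"
```

For example, "A" (U+0041) is padded to 0041, while a code point beyond the Basic Multilingual Plane such as U+1D11E keeps its five digits unchanged.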
Lexvo.org © 2009 Gerard de Melo.