|
The EuroFIR
Value Documentation Thesauri |
Organisation of the EuroFIR Thesauri
The 2008 version of EuroFIR thesauri is published in
the EuroFIR Thesauri 2008 report (
pdf file, 4.39 MB, formatted for duplex printing).
The EuroFIR thesauri are held in the EuroFIR
eThesaurus, an online thesaurus manager. The thesauri are available to the user in several
formats designated by icons, and in different
views/displays (for download of a thesaurus display,
right-click the icon in the and select Save Target
As...):
|
Thesaurus |
Version |
Date |
Systematic display |
Alpha-numerical display |
Alphabetical display |
XML |
| Acquisition Type |
1.0 |
2007-11-04 |
 |
 |
 |
 |
| Component |
1.0 |
2008-03-12 |
 |
 |
 |
 |
| Value Type |
1.0 |
2007-11-04 |
 |
 |
 |
 |
| Unit |
1.0 |
2007-11-04 |
 |
 |
 |
 |
| Matrix Unit |
1.0 |
2007-11-04 |
 |
 |
 |
 |
| Method Type |
1.0 |
2007-11-04 |
 |
 |
 |
 |
| Method Indicator |
1.0 |
2007-11-04 |
 |
 |
 |
 |
|
Reference Type |
1.0 |
2007-11-04 |
 |
 |
 |
 |
To view the EuroFIR online eThesaurus, click
here
(username:
guest@polytec.dk, password: guest$Welcome).
|
The
Food
Description Thesaurus |
LanguaL
In food description the LanguaL thesaurus is used. Further information on the LanguaL thesaurus, its documentation and software can be found
on the EuroFIR LanguaL pages here, or on the LanguaL website.
|
ISO 3166:
Codes for representation of countries and their sub-division |
ISO 3166 Maintenance Agency (ISO 3166/MA) - ISO's focal point for country codes
The ISO 3166/MA maintains and update the ISO 3166 Standard on country codes. EuroFIR uses the short
country names from ISO 3166-1 (the so-called alpha-2 codes). The alpha-2 codes are made available by ISO at
no charge for internal use and non-commercial purposes.
The lists can be downloaded from
ISO 3166 code lists page in
English (XML, zipped) and
French
(XML, zipped).
|
ISO 639: Codes for representation of languages |
The ISO 639 Registration Authorities (ISO 639/RA)
The ISO 639 consists of two parts:
ISO 639-1:2002 Codes for the representation of names of
languages -- Part 1: Alpha-2 code / Codes pour la
représentation des noms de langue - Partie 1: Code
alpha-2, for which Infoterm has been designated the
Registration Authority (ISO
639-1/RA),
and
ISO ISO 639-2:1998 Codes for the representation of
names of languages - Part 2: Alpha-3 code / Codes pour
la représentation des noms de langue - Partie 2: Code
alpha-3, for which the Library of Congress (LoC)
functions as the Registration Authority (ISO
639-2/RA).
EuroFIR uses the ISO 639-1 short codes (the alpha-2
codes), which can be downloaded from the Library of
Congress'
ISO 639-2 website with in two character encodings (UTF-8)
and (ISO
8859-1).
To read the files, please note that one line of text
contains one entry. An alpha-3 (bibliographic) code, an
alpha-3 (terminologic) code (when given), an alpha-2
code (when given), an English name, and a French name of
a language are all separated by pipe (|) characters. If
one of these elements is not applicable to the entry,
the field is left empty, i.e., a pipe (|) character
immediately follows the preceding entry. The Line
terminator is the LF character.
Dialects of languages
It is often important to distinguish between dialects
of a language, e.g. British English and American
English. The current ISO 639 standards do not include
ways of indicating languages used in different
countries/regions. The future ISO 639-6, currently under
development, may be used to identify language variants
and dialects.
The current best practice is described in Internet
RFC 4646
and
RFC
4647 (Tags for the Identification of Languages),
describes the structure, content, construction, and
semantics of language tags in a faceted approach for use
in cases where it is desirable to indicate the language
used in an information object. It also describes how to
register values for use in language tags and the
creation of user-defined extensions for private
interchange. The language tag consists of a primary
sub-tag and a series of subsequent sub-tags, each of
which narrows or refines the range of languages
identified by the overall tag. It enables the user to
specify, in addition to the primary language, other
characteristics such as script, country, or variant.
It is considered an Internet Best Current Practices
for the Internet Community and gives guidance for the
use of ISO 639 codes.
RFC 4646 specifies use of a 2-character code from ISO
639-1 when it exists; when a language does not have a
2-character code assigned the 3-character code is used.
Although it states that the 3-character terminology code
is used in these cases where no 2-character code exists,
this situation will not occur, since the only
alternative codes in ISO 639-2 are for languages that
already have a 2-character code.
Some (simple) examples are
| Language |
Language tag |
|
|
|
| English |
en |
| British English |
en-GB |
| American English |
en-US |
| German |
de |
| German German |
de-DE |
| Swiss German |
de-CH |
| Austrian German |
de-AT |
For further information, see the full documentation
in
RFC 4646
and
RFC
4647.
NOTE
This website is under constant
development and the current layout presented as an
example of how the EuroFIR Technical Website could
appear. It is given as part of the EuroFIR Prototype
given in the EuroFIR Technical
specifications report.
The website and its pages are optimized for a screen
resolution of 1024*768 or higher.
|