EuroFIR Tech Web HomeLinks of IterestLogin

 

Maintained by

Danish Food Information

and

Polytec


Valid XHTML 1.0!

Valid CSS!

 
 
EuroFIR Thesauri


The EuroFIR thesauri:

The draft EuroFIR standard on food composition data make use of a series of thesauri (controlled vocabularies) in the description of foods, components, method types, analytical and calculation methods, units, etc.

The thesauri follows international standard on multilingual thesauri, which secures international development and use of the thesauri.

This page also contains links to the ISO 3166 Maintenance agency (ISO 3166/MA) - ISO's focal point for country codes - and ISO 639 Registration Authorities for the purpose of processing requests for alpha-2 and alpha-3 codes for ISO 639 Standard: Codes for the representation of names of languages-- Part 1 and 2. Current best practices for linking languages with countries/regions (RFC 4646 and RFC 4647) are also mentioned.


 The EuroFIR Value Documentation Thesauri

Organisation of the EuroFIR Thesauri

The 2008 version of EuroFIR thesauri is published in the EuroFIR Thesauri 2008 report ( pdf file, 4.39 MB, formatted for duplex printing).

The EuroFIR thesauri are held in the EuroFIR eThesaurus, an online thesaurus manager. The thesauri are available to the user in several formats designated by icons, and in different views/displays (for download of a thesaurus display, right-click the icon in the and select Save Target As...):
 

Thesaurus Version Date Systematic display Alpha-numerical display Alphabetical display XML
Acquisition Type 1.0 2007-11-04
Component 1.0 2008-03-12
Value Type 1.0 2007-11-04
Unit 1.0 2007-11-04
Matrix Unit 1.0 2007-11-04
Method Type 1.0 2007-11-04
Method Indicator 1.0 2007-11-04
Reference Type 1.0 2007-11-04

 

Systematic display (tree view); tab delimited file listing the thesaurus' descriptors systematically according to their place in the thesaurus hierarchy.
 
Alpha-numerical display; tab-delimited text file listing the thesaurus according to the alphanumerical descriptor code. The first line in the tab-delimited file is the root of the thesaurus and indicates the thesaurus name and version.
 
Alphabetical display; thesaurus descriptors and synonyms listed in alphabetical order for printed documentation.
 
Thesaurus in XML format

 

To view the EuroFIR online eThesaurus, click here (username: guest@polytec.dk, password: guest$Welcome).
 


 The Food Description Thesaurus

LanguaL

In food description the LanguaL thesaurus is used. Further information on the LanguaL thesaurus, its documentation and software can be found on the EuroFIR LanguaL pages here, or on the LanguaL website.

 


 ISO 3166: Codes for representation of countries and their sub-division

ISO 3166 Maintenance Agency (ISO 3166/MA) - ISO's focal point for country codes

The ISO 3166/MA maintains and update the ISO 3166 Standard on country codes. EuroFIR uses the short country names from ISO 3166-1 (the so-called alpha-2 codes). The alpha-2 codes are made available by ISO at no charge for internal use and non-commercial purposes.

The lists can be downloaded from ISO 3166 code lists page in English (XML, zipped) and French (XML, zipped).

 


 ISO 639: Codes for representation of languages

The ISO 639 Registration Authorities (ISO 639/RA)

The ISO 639 consists of two parts:

ISO 639-1:2002 Codes for the representation of names of languages -- Part 1: Alpha-2 code / Codes pour la représentation des noms de langue - Partie 1: Code alpha-2, for which Infoterm has been designated the Registration Authority (ISO 639-1/RA),

and

ISO ISO 639-2:1998 Codes for the representation of names of languages - Part 2: Alpha-3 code / Codes pour la représentation des noms de langue - Partie 2: Code alpha-3, for which the Library of Congress (LoC) functions as the Registration Authority (ISO 639-2/RA).

EuroFIR uses the ISO 639-1 short codes (the alpha-2 codes), which can be downloaded from the Library of Congress' ISO 639-2 website with in two character encodings (UTF-8) and (ISO 8859-1).

To read the files, please note that one line of text contains one entry. An alpha-3 (bibliographic) code, an alpha-3 (terminologic) code (when given), an alpha-2 code (when given), an English name, and a French name of a language are all separated by pipe (|) characters. If one of these elements is not applicable to the entry, the field is left empty, i.e., a pipe (|) character immediately follows the preceding entry. The Line terminator is the LF character.

Dialects of languages

It is often important to distinguish between dialects of a language, e.g. British English and American English. The current ISO 639 standards do not include ways of indicating languages used in different countries/regions. The future ISO 639-6, currently under development, may be used to identify language variants and dialects.

The current best practice is described in Internet RFC 4646 and RFC 4647 (Tags for the Identification of Languages), describes the structure, content, construction, and semantics of language tags in a faceted approach for use in cases where it is desirable to indicate the language used in an information object. It also describes how to register values for use in language tags and the creation of user-defined extensions for private interchange. The language tag consists of a primary sub-tag and a series of subsequent sub-tags, each of which narrows or refines the range of languages identified by the overall tag. It enables the user to specify, in addition to the primary language, other characteristics such as script, country, or variant.

It is considered an Internet Best Current Practices for the Internet Community and gives guidance for the use of ISO 639 codes.

RFC 4646 specifies use of a 2-character code from ISO 639-1 when it exists; when a language does not have a 2-character code assigned the 3-character code is used. Although it states that the 3-character terminology code is used in these cases where no 2-character code exists, this situation will not occur, since the only alternative codes in ISO 639-2 are for languages that already have a 2-character code.

Some (simple) examples are

 Language Language tag
       
 English en
 British English en-GB
 American English en-US
 German de
 German German de-DE
 Swiss German de-CH
 Austrian German de-AT

 

For further information, see the full documentation in RFC 4646 and RFC 4647


  

NOTE
This website is under constant development and the current layout presented as an example of how the EuroFIR Technical Website could appear. It is given as part of the EuroFIR Prototype given in the EuroFIR Technical specifications report. The website and its pages are optimized for a screen resolution of 1024*768 or higher.

 

 
 

home | legal notices | privacy
Copyright ©2008 Anders Mřller
This website is maintained for the EuroFIR Consortium under the EU 6th Framework Food Quality and Safety Programme (Contract No. FP6-513944).
Hosted by the Danish Food Information. Updated 2008-02-04.


 
 
 Latest News
EuroFIR thesauri version 1.0 published

2008-08-25
The EuroFIR thesauri version 1.0 has been published. See the EuroFIR thesauri pages.
 
New version of EuroFIR XML Food Data Transport Package

2008-08-25
Version 1.3 of the EuroFIR XML Food Data Transport Package.
See the System Development pages.
 
LanguaL 2008 lauched

2008-08-25
The new version of LanguaL 2008 has been published.
Find more information on the pages on LanguaL and FPI.
 
Success for the CEN project!!!

2008-08-06
The application for a CEN project on a Standard for Food Composition Data was submitted in April and has been accepted!
Please read this.
 
EuroFIR FoodComp 2008

2008-04-02
The EuroFIR Course on Production and Use of Food Composition Data in Nutrition, 6-17 October 2008, Bratislava, Slovakia.
More information and application form here.
 
*** Older news ***