From: Marc Cromme Date: Tue, 28 Nov 2006 14:18:26 +0000 (+0000) Subject: added entry on encoding directive X-Git-Tag: ZEBRA.2.0.8~72 X-Git-Url: http://git.indexdata.com/?a=commitdiff_plain;ds=sidebyside;h=2ce46f160259c9452405b68489c16654919cd16c;p=idzebra-moved-to-github.git added entry on encoding directive --- diff --git a/doc/field-structure.xml b/doc/field-structure.xml index 3a0a5f2..a1de6dd 100644 --- a/doc/field-structure.xml +++ b/doc/field-structure.xml @@ -1,5 +1,5 @@ - + Field Structure and Character Sets @@ -175,6 +175,29 @@ The contents of the character map files are structured as follows: + + encoding encoding-name + + + This directive must be at the very beginning of the file, and it + specifies the character encoding used in the entire file. If + omitted, the encoding ISO-8859-1 is assumed. + + + For example, one of the test files found at + test/rusmarc/tab/string.chr contains the following + encoding directive: + + encoding koi8-r + + and the test file + test/charmap/string.utf8.chr is encoded + in UTF-8: + + encoding utf-8 + + + lowercase value-set