X-Git-Url: http://git.indexdata.com/?p=yaz-moved-to-github.git;a=blobdiff_plain;f=doc%2Fyaz-iconv-man.xml;h=d24dff7a01b57702d79b58cb1b6b01b1908b7725;hp=dc0f3c4f73e9833f024887be1cd857685d36e3a6;hb=9a93f996a29da6535a581a7278019e72ee07d188;hpb=f0fff840ae259cab0b1c17297c4367172a7cd229 diff --git a/doc/yaz-iconv-man.xml b/doc/yaz-iconv-man.xml index dc0f3c4..d24dff7 100644 --- a/doc/yaz-iconv-man.xml +++ b/doc/yaz-iconv-man.xml @@ -1,8 +1,19 @@ - +[ + + %local; + + %entities; + + %idcommon; +]> + + YAZ + &version; + + yaz-iconv 1 @@ -10,7 +21,7 @@ yaz-iconv - YAZ Charcter set conversion utility + YAZ Character set conversion utility @@ -69,6 +80,106 @@ + + ENCODINGS + + The yaz-iconv command and the API as defined in + yaz/yaz-iconv.h is a wrapper for the + library system call iconv. But YAZ' iconv utility also implements + conversions on its own. The table below lists characters sets (or encodings). + that are supported by YAZ. Each character set is marked with either + encode or decode. If + an encoding is encode-enabled YAZ may convert to + to the designated encoding. If an encoding is decode-enabled, YAZ + may convert from the designated encoding. + + + + marc8 (encode, decode) + + + The MARC8 encoding as defined by + the Library of Congress. Most MARC21/USMARC records usees + this encoding. + + + + + marc8s (encode, decode) + + + Like MARC8 but with conversion prefers non-combined characters + in the Latin-1 plane over combined characters. + + + + + marc8lossy (encode) + + + Lossy encoding of MARC-8. + + + + + marc8lossless (encode) + + + Lossless encoding of MARC8. + + + + + utf8 (encode, decode) + + + The most commonly used UNICODE encoding on the Internet. + + + + + iso8859-1 (encode, decode) + + + ISO-8859-1, AKA Latin-1. + + + + + iso5426 (decode) + + + ISO 5426. Some MARC records (UNIMARC) uses this encoding. + + + + + iso5428:1984 (encode, decode) + + + ISO 5428:1984. + + + + + advancegreek (encode, decode) + + + An encoding for Greek used by some vendors (Advance). + + + + + danmarc (decode) + + + Danmarc (in danish) is + an encoding based on UNICODE which is used for DanMARC2 records. + + + + + EXAMPLES @@ -96,9 +207,17 @@ - -