X-Git-Url: http://git.indexdata.com/?p=yaz-moved-to-github.git;a=blobdiff_plain;f=doc%2Fyaz-iconv-man.xml;h=9952d2eb2e297e1bc9c3984b2db6c1da9839385e;hp=935fedbbf746795ee78fcf21a8498be75ff807b0;hb=66fded781f0dc83295c75dc35c4c6cc422307550;hpb=23403c6f31b26b0e819a47980c42f3fc8c57d84d
diff --git a/doc/yaz-iconv-man.xml b/doc/yaz-iconv-man.xml
index 935fedb..9952d2e 100644
--- a/doc/yaz-iconv-man.xml
+++ b/doc/yaz-iconv-man.xml
@@ -1,16 +1,31 @@
- - + + %local; + + %entities; + + %idcommon; +]> + + YAZ + &version; + Index Data + + yaz-iconv 1 + Commands - + yaz-iconv - YAZ Charcter set conversion utility + YAZ Character set conversion utility - + yaz-iconv
@@ -20,7 +35,7 @@
file - + DESCRIPTION yaz-iconv converts data in file in character
@@ -36,9 +51,9 @@
yaz-iconv reads from standard input. - + OPTIONS - + -ffrom]
@@ -68,15 +83,114 @@
+ ENCODINGS
+ + The yaz-iconv command and the API as defined in + yaz/yaz-iconv.h are wrappers for the + library system call iconv. However, YAZ's iconv utility also implements + conversions on its own. The table below lists the character sets (or encodings) + that are supported by YAZ. Each character set is marked with either + encode or decode. If + an encoding is encode-enabled, YAZ may convert + to the designated encoding. If an encoding is decode-enabled, YAZ + may convert from the designated encoding.
+ + + + marc8 (encode, decode) + + + The MARC8 encoding as defined by + the Library of Congress. Most MARC21/USMARC records use this encoding.
+ + + + + marc8s (encode, decode) + + + Like MARC8, but the conversion prefers non-combined characters + in the Latin-1 plane over combined characters.
+ + + + + marc8lossy (encode) + + + Lossy encoding of MARC-8.
+ + + + + marc8lossless (encode) + + + Lossless encoding of MARC8.
+ + + + + utf8 (encode, decode) + + + The most commonly used UNICODE encoding on the Internet.
+ + + + + iso8859-1 (encode, decode) + + + ISO-8859-1, AKA Latin-1.
+ + + + + iso5426 (decode) + + + ISO 5426.
Some MARC records (UNIMARC) use this encoding.
+ + + + + iso5428:1984 (encode, decode) + + + ISO 5428:1984.
+ + + + + advancegreek (encode, decode) + + + An encoding for Greek in use by some vendors (Advance).
+ + + + + danmarc (decode) + + + Danmarc (in Danish) is + an encoding based on UNICODE which is used for DanMARC2 records.
+ + + + + + EXAMPLES The following command converts from ISO-8859-1 (Latin-1) to UTF-8.
- yaz-iconv -f ISO-8859-1 -t UTF-8 -X <input.lst >output.lst
+ yaz-iconv -f ISO-8859-1 -t UTF-8 <input.lst >output.lst
- + FILES
@@ -96,15 +210,7 @@