X-Git-Url: http://git.indexdata.com/?p=yaz-moved-to-github.git;a=blobdiff_plain;f=doc%2Fyaz-iconv-man.xml;h=537e821799a075115607763dc1470d3254a312f7;hp=0dd2a496e18efb7994c52c667edfdb60414bdf43;hb=5541b773e8ee0e5c086946016c060f6f629bd410;hpb=621c7fe7d7da36267384afe2d13c0309db293856 diff --git a/doc/yaz-iconv-man.xml b/doc/yaz-iconv-man.xml index 0dd2a49..537e821 100644 --- a/doc/yaz-iconv-man.xml +++ b/doc/yaz-iconv-man.xml @@ -1,5 +1,5 @@ - %local; @@ -8,23 +8,24 @@ %idcommon; ]> - YAZ &version; + Index Data - + yaz-iconv 1 + Commands - + yaz-iconv - YAZ Charcter set conversion utility + YAZ Character set conversion utility - + yaz-iconv @@ -34,7 +35,7 @@ file - + DESCRIPTION yaz-iconv converts data in file in character @@ -50,9 +51,9 @@ yaz-iconv reads from standard input. - + OPTIONS - + -ffrom] @@ -82,6 +83,106 @@ + ENCODINGS + + The yaz-iconv command and the API as defined in + yaz/yaz-iconv.h is a wrapper for the + library system call iconv. But YAZ' iconv utility also implements + conversions on its own. The table below lists characters sets (or encodings). + that are supported by YAZ. Each character set is marked with either + encode or decode. If + an encoding is encode-enabled YAZ may convert to + to the designated encoding. If an encoding is decode-enabled, YAZ + may convert from the designated encoding. + + + + marc8 (encode, decode) + + + The MARC8 encoding as defined by + the Library of Congress. Most MARC21/USMARC records usees + this encoding. + + + + + marc8s (encode, decode) + + + Like MARC8 but with conversion prefers non-combined characters + in the Latin-1 plane over combined characters. + + + + + marc8lossy (encode) + + + Lossy encoding of MARC-8. + + + + + marc8lossless (encode) + + + Lossless encoding of MARC8. + + + + + utf8 (encode, decode) + + + The most commonly used UNICODE encoding on the Internet. + + + + + iso8859-1 (encode, decode) + + + ISO-8859-1, AKA Latin-1. + + + + + iso5426 (decode) + + + ISO 5426. Some MARC records (UNIMARC) uses this encoding. + + + + + iso5428:1984 (encode, decode) + + + ISO 5428:1984. + + + + + advancegreek (encode, decode) + + + An encoding for Greek used by some vendors (Advance). + + + + + danmarc (decode) + + + Danmarc (in danish) is + an encoding based on UNICODE which is used for DanMARC2 records. + + + + + + EXAMPLES The following command converts from ISO-8859-1 (Latin-1) to @@ -90,7 +191,7 @@ yaz-iconv -f ISO-8859-1 -t UTF-8 -X <input.lst >output.lst - + FILES