X-Git-Url: http://git.indexdata.com/?p=yaz-moved-to-github.git;a=blobdiff_plain;f=doc%2Fyaz-iconv-man.xml;h=37353a8e94e73544eed6c454de6b60f0d9498331;hp=4bf973b61857aa0855747ddd41de72560eddb9d8;hb=4c00ecb5fb64580b3f36cdbc04dc99855bc7f413;hpb=d940392c53c32ccf76fb287cc5b997b9e921a431 diff --git a/doc/yaz-iconv-man.xml b/doc/yaz-iconv-man.xml index 4bf973b..37353a8 100644 --- a/doc/yaz-iconv-man.xml +++ b/doc/yaz-iconv-man.xml @@ -1,5 +1,5 @@ - %local; @@ -12,18 +12,20 @@ YAZ &version; + Index Data - + yaz-iconv 1 + Commands - + yaz-iconv - YAZ Charcter set conversion utility + YAZ Character set conversion utility - + yaz-iconv @@ -33,7 +35,7 @@ file - + DESCRIPTION yaz-iconv converts data in file in character @@ -49,9 +51,9 @@ yaz-iconv reads from standard input. - + OPTIONS - + -ffrom] @@ -81,6 +83,105 @@ + ENCODINGS + + The yaz-iconv command and the API as defined in + yaz/yaz-iconv.h is a wrapper for the + library system call iconv. But YAZ' iconv utility also implements + conversions on its own. The table below lists characters sets (or encodings). + that are supported by YAZ. Each character set is marked with either + encode or decode. If + an encoding is encode-enabled YAZ may convert to + to the designated encoding. If an encoding is decode-enabled, YAZ + may convert from the designated encoding. + + + + marc8 (encode, decode) + + + The MARC8 encoding as defined by + the Library of Congress. Most MARC21/USMARC records use this encoding. + + + + + marc8s (encode, decode) + + + Like MARC8 but with conversion prefers non-combined characters + in the Latin-1 plane over combined characters. + + + + + marc8lossy (encode) + + + Lossy encoding of MARC-8. + + + + + marc8lossless (encode) + + + Lossless encoding of MARC8. + + + + + utf8 (encode, decode) + + + The most commonly used UNICODE encoding on the Internet. + + + + + iso8859-1 (encode, decode) + + + ISO-8859-1, AKA Latin-1. + + + + + iso5426 (decode) + + + ISO 5426. Some MARC records (UNIMARC) use this encoding. + + + + + iso5428:1984 (encode, decode) + + + ISO 5428:1984. + + + + + advancegreek (encode, decode) + + + An encoding for Greek in use by some vendors (Advance). + + + + + danmarc (decode) + + + Danmarc (in danish) is + an encoding based on UNICODE which is used for DanMARC2 records. + + + + + + EXAMPLES The following command converts from ISO-8859-1 (Latin-1) to @@ -89,7 +190,7 @@ yaz-iconv -f ISO-8859-1 -t UTF-8 -X <input.lst >output.lst - + FILES @@ -109,15 +210,7 @@