X-Git-Url: http://git.indexdata.com/?p=yaz-moved-to-github.git;a=blobdiff_plain;f=doc%2Fyaz-marcdump-man.xml;h=a44dde0a1f73569e31890420dd74c9345bc8d2e6;hp=d7f1e650aebe088c9ba0fba1ff8494fc35d47f05;hb=30af0a1ebbea75c3e757eb03d3f988c61fce8511;hpb=ad61957d649158cf2e1018da95fa428aa8999f2a diff --git a/doc/yaz-marcdump-man.xml b/doc/yaz-marcdump-man.xml index d7f1e65..a44dde0 100644 --- a/doc/yaz-marcdump-man.xml +++ b/doc/yaz-marcdump-man.xml @@ -1,45 +1,67 @@ - + %local; + + %entities; + + %idcommon; ]> - + + YAZ + &version; + Index Data + + yaz-marcdump 1 + Commands - + yaz-marcdump MARC record dump utility - + yaz-marcdump - - - - + + - + + + + + + file - + DESCRIPTION yaz-marcdump reads MARC records from one or more files. - It parses each record and supports output in line-format, - ISO2709, MARCXML, MarcXchange as well as Hex output. + It parses each record and supports output in line-format, + ISO2709, + MARCXML, + MARC-in-JSON, + MarcXchange + as well as Hex output. + + + This utility parses records in ISO2709 (raw MARC), line format, MARC-in-JSON + format as well as XML if that is structured as MARCXML/MarcXchange. - This utility parses records ISO2709(raw MARC) as well as XML - if that is structured as MARCXML/MarcXchange. + MARC-in-JSON encoding/decoding is supported in YAZ 5.0.5 and later. @@ -50,51 +72,38 @@ By default, each record is written to standard output in a line format with newline for each field, $x for each subfield x. - The output format may be changed with options -X, - -e, -I. + The output format may be changed with option -o. yaz-marcdump can also be requested to perform character set conversion of each record. - - OPTIONS - - - - -x - - Reads MARC records in MARCXML/MarcXchange format. Without - this option, yaz-marcdump reads records - in ISO2709 format. - - - - -X - - Writes MARC records in MARCXML. - This format is equivalent to YAZ_MARC_MARCXML in - yaz/marcdisp.h. - - + OPTIONS + - -e + -i format - Writes MARC records in MarcXchange format. 
- This format is equivalent to YAZ_MARC_XCHANGE in - yaz/marcdisp.h. + Specifies input format. Must be one of + marcxml, marc (ISO2709), + marcxchange (ISO25577), + line (line mode MARC), + turbomarc (Turbo MARC), + or json (MARC-in-JSON). - -I + -o format - Writes MARC records in ISO2709 format. - This format is equivalent to YAZ_MARC_ISO2709 in - yaz/marcdisp.h. + Specifies output format. Must be one of + marcxml, marc (ISO2709), + marcxchange (ISO25577), + line (line mode MARC), + turbomarc (Turbo MARC), + or json (MARC-in-JSON). @@ -104,6 +113,7 @@ Specify the character set of the input MARC record. Should be used in conjunction with option -t. + Refer to the yaz-iconv man page for supported character sets. @@ -113,6 +123,7 @@ Specify the character set of the output. Should be used in conjunction with option -f. + Refer to the yaz-iconv man page for supported character sets. @@ -124,7 +135,40 @@ pairs, where pos is an integer offset (0 - 23) for leader. Value is either a quoted string or an integer (character value in decimal). Pairs are comma separated. For example, to set leader at offset 9 - to a, use 9=a. + to a, use 9='a'. + + + + + -s prefix + + Writes each chunk of records to a separate file with the given prefix, + i.e. splits a record batch into files with at most + "chunk" ISO2709 records per file. By default chunk is 1 (one record + per file). See option -C. + + + + + -C chunksize + + Specifies chunk size; to be used in conjunction with option + -s. + + + + + -p + + Makes yaz-marcdump print the record number and input file offset + of each record read. + + + + + -n + + MARC output is omitted so that MARC input is only checked. @@ -136,6 +180,13 @@ + + -V + + Prints YAZ version. + + + @@ -145,16 +196,26 @@ MARC21/USMARC in UTF-8 encoding. Leader offset 9 is set to 'a'. Both input and output records are ISO2709 encoded. 
- yaz-marcdump -f MARC-8 -t UTF-8 -I -l 9=97 marc21.raw >marc21.utf8.raw + yaz-marcdump -f MARC-8 -t UTF-8 -o marc -l 9=97 marc21.raw >marc21.utf8.raw The same records may be converted to MARCXML instead in UTF-8: - yaz-marcdump -f MARC-8 -t UTF-8 -X marc21.raw >marcxml.xml + yaz-marcdump -f MARC-8 -t UTF-8 -o marcxml marc21.raw >marcxml.xml - + + + Turbo MARC is a compact XML notation with the same semantics as + MARCXML, but which allows for faster processing via XSLT. In order + to generate Turbo MARC records encoded in UTF-8 from MARC21 (ISO2709), one + could use: + + yaz-marcdump -f MARC8 -t UTF8 -o turbomarc -i marc marc21.raw >out.xml + + + FILES @@ -166,22 +227,23 @@ SEE ALSO - yaz(7) + + yaz + 7 + + + + + yaz-iconv + 1 +
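The -i and -o options introduced by this patch accept the same fixed set of format names. As a rough illustration (not part of the patch), the shell sketch below checks a pair of format names against that list and prints the corresponding yaz-marcdump command without running it. The function names is_marc_format and marcdump_cmd are hypothetical helpers, not part of YAZ; only the -i and -o flags and the format names come from the option descriptions in the patch.

```shell
# Format names accepted by -i and -o, as listed in the patched OPTIONS text.
# is_marc_format is a hypothetical helper, not something YAZ provides.
is_marc_format() {
  case "$1" in
    marcxml|marc|marcxchange|line|turbomarc|json) return 0 ;;
    *) return 1 ;;
  esac
}

# Hypothetical helper: validate the two format names and print the
# yaz-marcdump command line instead of executing it (a dry run).
marcdump_cmd() {
  in_fmt=$1
  out_fmt=$2
  file=$3
  is_marc_format "$in_fmt" || { echo "unknown input format: $in_fmt" >&2; return 1; }
  is_marc_format "$out_fmt" || { echo "unknown output format: $out_fmt" >&2; return 1; }
  printf 'yaz-marcdump -i %s -o %s %s\n' "$in_fmt" "$out_fmt" "$file"
}

# Prints: yaz-marcdump -i marc -o json marc21.raw
marcdump_cmd marc json marc21.raw
```

Printing the command rather than running it keeps the sketch usable on machines without YAZ installed; piping the printed line to sh would perform the actual conversion.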