X-Git-Url: http://git.indexdata.com/?p=yaz-moved-to-github.git;a=blobdiff_plain;f=doc%2Fyaz-marcdump-man.xml;h=1cdd528e1ba9326a001424c1ed260dd2aa0914ec;hp=b7e008d379d4443ec86ab94403784b4f94651999;hb=8bcd70cb460d7a11175e82487f498102ead66472;hpb=23403c6f31b26b0e819a47980c42f3fc8c57d84d diff --git a/doc/yaz-marcdump-man.xml b/doc/yaz-marcdump-man.xml index b7e008d..1cdd528 100644 --- a/doc/yaz-marcdump-man.xml +++ b/doc/yaz-marcdump-man.xml @@ -1,41 +1,67 @@ - - + + %local; + + %entities; + + %idcommon; +]> + + YAZ + &version; + Index Data + + yaz-marcdump 1 + Commands - + yaz-marcdump MARC record dump utility - + yaz-marcdump - - - - + + - + + + + + + + file - + DESCRIPTION yaz-marcdump reads MARC records from one or more files. - It parses each record and supports output in line-format, - ISO2709, MARCXML, MarcXchange as well as Hex output. + It parses each record and supports output in line-format, + ISO2709, + MARCXML, + MARC-in-JSON, + MarcXchange + as well as Hex output. - This utility parses records ISO2709(raw MARC) as well as XML - if that is structured as MARCXML/MarcXchange. + This utility parses records ISO2709(raw MARC), line format, MARC-in-JSON + format as well as XML if that is structured as MARCXML/MarcXchange. + + + MARC-in-JSON encoding/decoding is supported in YAZ 5.0.5 and later. @@ -46,69 +72,103 @@ By default, each record is written to standard output in a line format with newline for each field, $x for each subfield x. - The output format may be changed with options -X, - -e, -I. + The output format may be changed with option -o, yaz-marcdump can also be requested to perform character set conversion of each record. - + OPTIONS - + - -x + -i format - Reads MARC records in MARCXML/MarcXchange format. Without - this option, yaz-marcdump reads records - in ISO2709 format. + Specifies input format. Must be one of + marcxml, marc (ISO2709), + marcxchange (ISO25577), + line (line mode MARC), + turbomarc (Turbo MARC), + or json (MARC-in-JSON). - -X + -o format - Writes MARC records in MARCXML. - This format is equivalent to YAZ_MARC_MARCXML in - yaz/marcdisp.h. + Specifies output format. Must be one of + marcxml, marc (ISO2709), + marcxchange (ISO25577), + line (line mode MARC), + turbomarc (Turbo MARC), + or json (MARC-in-JSON). - -e + -f from - Writes MARC records in MarcXchange format. - This format is equivalent to YAZ_MARC_XCHANGE in - yaz/marcdisp.h. + Specify the character set from + of the input MARC record. + Should be used in conjunction with option -t. + Refer to the yaz-iconv man page for supported character sets. - -I + -t to - Writes MARC records in ISO2709 format. - This format is equivalent to YAZ_MARC_ISO2709 in - yaz/marcdisp.h. + Specify the character set of + of the output. + Should be used in conjunction with option -f. + Refer to the yaz-iconv man page for supported character sets. - -ffrom] + -l leaderspec - Specify the character set from - of the input MARC record. - Should be used in conjunction with option -t. + Specify a simple modification string for MARC leader. The + leaderspec is a list of pos=value + pairs, where pos is an integer offset (0 - 23) for leader. Value + is either a quoted string or an integer (character value in decimal). + Pairs are comma separated. For example, to set leader at offset 9 + to a, use 9='a'. - -tto] + -s prefix - Specify the character set of - of the output. - Should be used in conjunction with option -f. + Writes a chunk of records to a separate file with prefix given, + i.e. splits a record batch into files with only at most + "chunk" ISO2709 record per file. By default chunk is 1 (one record + per file). See option -C. + + + + + -C chunksize + + Specifies chunk size; to be used conjunction with option + -s. + + + + + -p + + Makes yaz-marcdump prints record number and input file offset + of each record read. + + + + + -n + + MARC output is omitted so that MARC input is only checkecd. @@ -120,24 +180,42 @@ + + -V + + Prints YAZ version. + + + EXAMPLES The following command converts MARC21/USMARC in MARC-8 encoding to - MARC21/USMARC in UTF-8 encoding. (Both input and output is in ISO2709). + MARC21/USMARC in UTF-8 encoding. Leader offset 9 is set to 'a'. + Both input and output records are ISO2709 encoded. - yaz-marcdump -f MARC-8 -t UTF-8 -I marc21.raw >marc21.utf8.raw + yaz-marcdump -f MARC-8 -t UTF-8 -o marc -l 9=97 marc21.raw >marc21.utf8.raw The same records may be converted to MARCXML instead in UTF-8: - yaz-marcdump -f MARC-8 -t UTF-8 -X marc21.raw >marcxml.xml + yaz-marcdump -f MARC-8 -t UTF-8 -o marcxml marc21.raw >marcxml.xml + + + + + Turbo MARC is a compact XML notation with same semantics as + MARCXML, but which allows for faster processing via XSLT. In order + to generate Turbo MARC records encoded in UTF-8 from MARC21 (ISO), one + could use: + + yaz-marcdump -f MARC8 -t UTF8 -o turbomarc -i marc marc21.raw >out.xml - + FILES @@ -149,7 +227,16 @@ SEE ALSO - yaz(7) + + yaz + 7 + + + + + yaz-iconv + 1 +