X-Git-Url: http://git.indexdata.com/?p=yaz-moved-to-github.git;a=blobdiff_plain;f=doc%2Fyaz-marcdump-man.xml;h=5a66ea3e2d95d4b818453134c7b8685412a282e7;hp=e73d9be2b0298df53dc7b10152c5a88179304016;hb=d9c0ce2694cbd7198c03b8e7d7c4e20a804f809f;hpb=1425def96bbdcd685ab8634c4e655766819abd6a

diff --git a/doc/yaz-marcdump-man.xml b/doc/yaz-marcdump-man.xml
index e73d9be..5a66ea3 100644
--- a/doc/yaz-marcdump-man.xml
+++ b/doc/yaz-marcdump-man.xml
@@ -1,9 +1,24 @@
-
-
+
+ %local;
+
+ %entities;
+
+ %idcommon;
+]>
+
+ YAZ
+ &version;
+ Index Data
+
+
 yaz-marcdump 1
+ Commands
@@ -14,29 +29,43 @@
 yaz-marcdump
-
-
-
-
-
+
+
-
+
+
+
+
+
+
+
 file
 DESCRIPTION
- yaz-marcdump reads ISO2709/MARC records from one or
+ yaz-marcdump reads MARC records from one or
 more files.
- It validates each record and supports output in line-format,
- MARCXML, OAIMARC as well as Hex output.
+ It parses each record and supports output in line-format,
+ ISO2709, MARCXML, MarcXchange as well as Hex output.
- By default, each record is printed to standard output in a line
+ This utility parses records ISO2709(raw MARC) as well as XML
+ if that is structured as MARCXML/MarcXchange.
+
+
+
+ As of YAZ 2.1.18, OAI-MARC is no longer supported.
+ OAI-MARC is deprecated. Use MARCXML instead.
+
+
+
+ By default, each record is written to standard output in a line
 format with newline for each field, $x for each subfield x.
+ The output format may be changed with option -o,
 yaz-marcdump can also be requested to perform
@@ -48,75 +77,106 @@
- -x
+ -i format
- Print MARC records in a simple XML format.
- This format is equivalent to YAZ_MARC_SIMPLEXML in
- yaz/marcdisp.h.
+ Specifies input format. Must be one of
+ marcxml, marc (ISO2709),
+ marcxchange (ISO25577),
+ line (line mode MARC),
+ or turbomarc (Turbo MARC).
- -X
+ -o format
- Print MARC records in MARCXML.
- This format is equivalent to YAZ_MARC_MARCXML in
- yaz/marcdisp.h.
+ Specifies output format.
Must be one of
+ marcxml, marc (ISO2709),
+ marcxchange (ISO25577),
+ line (line mode MARC),
+ or turbomarc (Turbo MARC).
- -e
+ -f from
- Print MARC records in MarcXchange format.
- This format is equivalent to YAZ_MARC_XCHANGE in
- yaz/marcdisp.h.
+ Specify the character set from
+ of the input MARC record.
+ Should be used in conjunction with option -t.
+ Refer to the yaz-iconv man page for supported character sets.
- -I
+ -t to
- Print MARC records in ISO2709 format.
- This format is equivalent to YAZ_MARC_ISO2709 in
- yaz/marcdisp.h.
+ Specify the character set of
+ of the output.
+ Should be used in conjunction with option -f.
+ Refer to the yaz-iconv man page for supported character sets.
- -O
+ -l leaderspec
- Print MARC records in OAIMARC. Another XML variant.
- This format is equivalent to YAZ_MARC_OAIMARC in
- yaz/marcdisp.h.
+ Specify a simple modification string for MARC leader. The
+ leaderspec is a list of pos=value
+ pairs, where pos is an integer offset (0 - 23) for leader. Value
+ is either a quoted string or an integer (character value in decimal).
+ Pairs are comma separated. For example, to set leader at offset 9
+ to a, use 9='a'.
-
+
- -ffrom]
+ -s prefix
- Specify the character set from
- of the input MARC record.
- Should be used in conjunction with option -t.
+ Writes a chunk of records to a separate file with prefix given,
+ i.e. splits a record batch into files with only at most
+ "chunk" ISO2709 record per file. By default chunk is 1 (one record
+ per file). See option -C.
- -tto]
+ -C chunksize
- Specify the character set of
- of the output.
- Should be used in conjunction with option -f.
+ Specifies chunk size; to be used in conjunction with option
+ -s.
+
+
+
+
+ -p
+
+ Makes yaz-marcdump print record number and input file offset
+ of each record read.
+
+
+
+
+ -n
+
+ MARC output is omitted so that MARC input is only checked.
 -v
- Print more information about the parsing process.
+ Writes more information about the parsing process.
 Useful if you have ill-formatted ISO2709 records as input.
+
+
+ -V
+
+ Prints YAZ version.
+
+
@@ -124,15 +184,26 @@
 EXAMPLES
 The following command converts MARC21/USMARC in MARC-8 encoding to
- MARC21/USMARC in UTF-8 encoding. (Both input and output is in ISO2709).
+ MARC21/USMARC in UTF-8 encoding. Leader offset 9 is set to 'a'.
+ Both input and output records are ISO2709 encoded.
- yaz-marcdump -f MARC-8 -t UTF-8 -I marc21.raw >marc21.utf8.raw
+ yaz-marcdump -f MARC-8 -t UTF-8 -o marc -l 9=97 marc21.raw >marc21.utf8.raw
 The same records may be converted to MARCXML instead in UTF-8:
- yaz-marcdump -f MARC-8 -t UTF-8 -X marc21.raw >marcxml.xml
+ yaz-marcdump -f MARC-8 -t UTF-8 -o marcxml marc21.raw >marcxml.xml
+
+
+
+
+ Turbo MARC is a compact XML notation with same semantics as
+ MARCXML, but which allows for faster processing via XSLT. In order
+ to generate Turbo MARC records encoded in UTF-8 from MARC21 (ISO), one
+ could use:
+
+ yaz-marcdump -f MARC8 -t UTF8 -o turbomarc -i marc marc21.raw >out.xml
@@ -147,7 +218,22 @@
 SEE ALSO
- yaz(7)
+
+ yaz
+ 7
+
+
+
+ yaz-iconv
+ 1
+
+
+
+ MARCXML.
+
+
+ ISO25577.
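
The leaderspec syntax that this patch documents for option -l (comma-separated pos=value pairs, offsets 0 to 23, value either a quoted string or a decimal character value) can be illustrated with a small Python sketch. This is not the parser used by yaz-marcdump: the function name `apply_leaderspec` and the sample leader are made up for illustration, and two behaviours are assumptions that the man page text does not settle. A quoted string longer than one character is assumed to overwrite consecutive positions, and a quoted value containing a comma is not handled.

```python
def apply_leaderspec(leader: str, spec: str) -> str:
    """Return a copy of a 24-character MARC leader with pos=value pairs applied.

    Illustrative only: this is not the parser used by yaz-marcdump.
    """
    if len(leader) != 24:
        raise ValueError("expected a 24-character leader")
    chars = list(leader)
    # Simplification: pairs are split on every comma, so a quoted value
    # that itself contains a comma is not handled.
    for pair in spec.split(","):
        pos_text, sep, value = pair.strip().partition("=")
        if not sep:
            raise ValueError(f"missing '=' in pair: {pair!r}")
        pos = int(pos_text)
        if len(value) >= 2 and value[0] == "'" and value[-1] == "'":
            text = value[1:-1]       # quoted string, e.g. 'a'
        else:
            text = chr(int(value))   # decimal character value, e.g. 97
        # Assumption: a multi-character string overwrites consecutive offsets.
        if pos < 0 or pos + len(text) > 24:
            raise ValueError(f"offset out of range in pair: {pair!r}")
        chars[pos:pos + len(text)] = text
    return "".join(chars)


if __name__ == "__main__":
    leader = "00714cam  2200205 a 4500"  # made-up sample leader, blank at offset 9
    print(apply_leaderspec(leader, "9='a'"))
    print(apply_leaderspec(leader, "9=97"))
```

Under these assumptions the two calls give the same result, which matches the patch: the option description uses 9='a', while the EXAMPLES command passes -l 9=97 to set the same leader position.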