X-Git-Url: http://git.indexdata.com/?p=yaz-moved-to-github.git;a=blobdiff_plain;f=doc%2Fyaz-icu-man.xml;h=0fc1f1af51436fc0b64e9be7e0464dd83395d153;hp=033defaaf3d40fb2dcedb3c1ecf3e39015680519;hb=053367cbe356fb3ce0def34b065dae589d700daf;hpb=e44e073beb75899cf954c94667b412f86009ef16 diff --git a/doc/yaz-icu-man.xml b/doc/yaz-icu-man.xml index 033defa..0fc1f1a 100644 --- a/doc/yaz-icu-man.xml +++ b/doc/yaz-icu-man.xml @@ -1,5 +1,5 @@ - %local; @@ -8,18 +8,19 @@ %idcommon; ]> - YAZ &version; + Index Data - + yaz-icu 1 + Commands - + yaz-icu YAZ ICU utility @@ -28,18 +29,30 @@ yaz-icu - commands -c config -p opt + -s -x + infile - + DESCRIPTION - yaz-icu is utility which demonstrates + yaz-icu is utility which demonstrates the ICU chain module of yaz. (yaz/icu.h). + + The utility can be used in two ways. It may read some text + using an XML configuration for configuring ICU and show text analysis. + This mode is triggered by option -c which specififies + the configuration to be used. The input file is read from standard + input or from a file if infile is specified. + + + The utility may also show ICU information. This is triggered by + option -p. + OPTIONS @@ -57,7 +70,7 @@ Specifies extra information to be printed about the ICU system. If type is c - then ICU converters are printed. + then ICU converters are printed. If type is l available locales are printed. If type is t @@ -66,7 +79,15 @@ - -x config + -s + + Specifies that output should include sort key as well. Note that + sort key differs between ICU versions. + + + + + -x Specifies that output should be XML based rather than "text" based. @@ -95,7 +116,7 @@ The following conversion elements are available: - + casemap @@ -116,7 +137,7 @@ Upper case using ICU function u_strToUpper. - + t @@ -130,7 +151,7 @@ Fold case using ICU function u_strFoldCase. - + @@ -143,18 +164,28 @@ using function icu_chain_token_display (yaz/icu.h). - + transform - Specifies an ICU transform rule. The rule attribute is the - custom transformation rule to be used. This is a text based format - which is offered by the ICU transform system. See - ICU Transforms for + Specifies an ICU transform rule using a transliterator + Identifier. + The rule attribute is the transliterator Identifier. + See ICU Transforms for + more information. + + + + + transliterate + + Specifies a rule-based transliterator. + The rule attribute is the custom transformation rule to be used. + See ICU Transforms for more information. - + tokenize @@ -175,7 +206,7 @@ Sentence. ICU: UBRK_SENTENCE. - + w @@ -200,9 +231,19 @@ - + + + join + + + Joins tokens into one string. The rule attribute is the joining + string, which may be empty. The join conversion element was added + in YAZ 4.2.49. + + + - + EXAMPLES @@ -218,6 +259,7 @@ +