<productnumber>&version;</productnumber>
<info><orgname>Index Data</orgname></info>
</refentryinfo>
-
+
<refmeta>
<refentrytitle>yaz-icu</refentrytitle>
<manvolnum>1</manvolnum>
<refmiscinfo class="manual">Commands</refmiscinfo>
</refmeta>
-
+
<refnamediv>
<refname>yaz-icu</refname>
<refpurpose>YAZ ICU utility</refpurpose>
<refsynopsisdiv>
<cmdsynopsis>
<command>yaz-icu</command>
- <arg choice="opt" rep="repeat">commands</arg>
<arg>-c <replaceable>config</replaceable></arg>
<arg>-p <replaceable>opt</replaceable></arg>
<arg>-s</arg>
<arg>-x</arg>
+ <arg choice="opt">infile</arg>
</cmdsynopsis>
</refsynopsisdiv>
-
+
<refsect1><title>DESCRIPTION</title>
<para>
- <command>yaz-icu</command> is utility which demonstrates
+ <command>yaz-icu</command> is utility which demonstrates
the ICU chain module of yaz. (<filename>yaz/icu.h</filename>).
</para>
+ <para>
+ The utility can be used in two ways. It may read some text
+ using an XML configuration for configuring ICU and show text analysis.
+ This mode is triggered by option <literal>-c</literal> which specififies
+ the configuration to be used. The input file is read from standard
+ input or from a file if <literal>infile</literal> is specified.
+ </para>
+ <para>
+ The utility may also show ICU information. This is triggered by
+ option <literal>-p</literal>.
+ </para>
</refsect1>
<refsect1><title>OPTIONS</title>
<listitem><para>
Specifies extra information to be printed about the ICU system.
If <replaceable>type</replaceable> is <literal>c</literal>
- then ICU converters are printed.
+ then ICU converters are printed.
If <replaceable>type</replaceable> is <literal>l</literal>
available locales are printed.
If <replaceable>type</replaceable> is <literal>t</literal>
</para>
<para>
The following conversion elements are available:
-
+
<variablelist>
<varlistentry>
<term>casemap</term>
<para>Upper case using ICU function u_strToUpper.</para>
</listitem>
</varlistentry>
-
+
<varlistentry>
<term>t</term>
<listitem>
<para>Fold case using ICU function u_strFoldCase.</para>
</listitem>
</varlistentry>
-
+
</variablelist>
</para></listitem>
</varlistentry>
using function icu_chain_token_display (<filename>yaz/icu.h</filename>).
</para></listitem>
</varlistentry>
-
+
<varlistentry>
<term>transform</term>
<listitem><para>
more information.
</para></listitem>
</varlistentry>
-
+
<varlistentry>
<term>transliterate</term>
<listitem><para>
more information.
</para></listitem>
</varlistentry>
-
+
<varlistentry>
<term>tokenize</term>
<listitem><para>
<para>Sentence. ICU: UBRK_SENTENCE.</para>
</listitem>
</varlistentry>
-
+
<varlistentry>
<term>w</term>
<listitem>
</variablelist>
</para></listitem>
</varlistentry>
-
+
+ <varlistentry>
+ <term>join</term>
+ <listitem>
+ <para>
+ Joins tokens into one string. The rule attribute is the joining
+ string, which may be empty. The join conversion element was added
+ in YAZ 4.2.49.
+ </para>
+ </listitem>
+ </varlistentry>
</variablelist>
-
+
</para>
</refsect1>
<refsect1><title>EXAMPLES</title>
<transform rule="[:Control:] Any-Remove"/>
<tokenize rule="w"/>
<transform rule="[[:WhiteSpace:][:Punctuation:]] Remove"/>
- <transliterate rule="xy > z"/>
+ <transliterate rule="xy > z;"/>
<display/>
<casemap rule="l"/>
</icu_chain>