<chapter id="architecture">
- <!-- $Id: architecture.xml,v 1.7 2006-04-24 12:53:03 marc Exp $ -->
+ <!-- $Id: architecture.xml,v 1.11 2006-06-30 14:36:12 marc Exp $ -->
<title>Overview of Zebra Architecture</title>
same main components, which are presented here.
</para>
<para>
- The virtual Debian package <literal>idzebra1.4</literal>
+ The virtual Debian package <literal>idzebra-2.0</literal>
installs all the necessary packages to start
working with Zebra - including utility programs, development libraries,
documentation and modules.
</variablelist>
</para>
<para>
- The Debian package <literal>libidzebra1.4</literal>
+ The Debian package <literal>libidzebra-2.0</literal>
contains all run-time libraries for Zebra, the
documentation in PDF and HTML is found in
- <literal>idzebra1.4-doc</literal>, and
- <literal>idzebra1.4-common</literal>
+ <literal>idzebra-2.0-doc</literal>, and
+ <literal>idzebra-2.0-common</literal>
includes common essential Zebra configuration files.
</para>
</sect2>
indexes according to the rules defined in the filter modules.
</para>
<para>
- The Debian package <literal>idzebra1.4-utils</literal> contains
+ The Debian package <literal>idzebra-2.0-utils</literal> contains
the <command>zebraidx</command> utility.
</para>
</sect2>
great Information Retrieval server application.
</para>
<para>
- The Debian package <literal>idzebra1.4-utils</literal> contains
+ The Debian package <literal>idzebra-2.0-utils</literal> contains
the <command>zebrasrv</command> utility.
</para>
</sect2>
as HTTP server, honoring
<ulink url="http://www.loc.gov/standards/sru/srw/">SRW</ulink>
SOAP requests, and
- <ulink url="http://www.loc.gov/standards/sru/">SRU</ulink>
+ <ulink url="&url.sru;">SRU</ulink>
REST requests. Moreover, it can
translate incoming
- <ulink url="http://www.loc.gov/standards/sru/cql/">CQL</ulink>
+ <ulink url="&url.cql;">CQL</ulink>
queries to
<ulink url="http://indexdata.com/yaz/doc/tools.tkl#PQF">PQF</ulink>
queries, if
</para>
<para>
The virtual Debian package
- <literal>libidzebra1.4-modules</literal> installs all base filter
+ <literal>libidzebra-2.0-modules</literal> installs all base filter
modules.
</para>
+
<sect3 id="componentmodulestext">
<title>TEXT Record Model and Filter Module</title>
<para>
Plain ASCII text filter. TODO: add information here.
- <!--
- <literal>text module missing as deb file<literal>
- -->
</para>
</sect3>
to the <filename>*.abs</filename> configuration file suffix.
</para>
<para>
- The <emphasis>grs.danbib</emphasis> filter is developed for
- DBC DanBib records.
- DanBib is the Danish Union Catalogue hosted by DBC
- (Danish Bibliographic Center). This filter is found in the
- Debian package
- <literal>libidzebra1.4-mod-grs-danbib</literal>.
- </para>
- <para>
The <emphasis>grs.marc</emphasis> and
<emphasis>grs.marcxml</emphasis> filters are suited to parse and
index binary and XML versions of traditional library MARC records
based on the ISO2709 standard. The Debian package for both
filters is
- <literal>libidzebra1.4-mod-grs-marc</literal>.
+ <literal>libidzebra-2.0-mod-grs-marc</literal>.
</para>
<para>
GRS TCL scriptable filters for extensive user configuration come
a general scriptable TCL filter called
<emphasis>grs.tcl</emphasis>
are both included in the
- <literal>libidzebra1.4-mod-grs-regx</literal> Debian package.
+ <literal>libidzebra-2.0-mod-grs-regx</literal> Debian package.
</para>
<para>
A general purpose SGML filter is called
<emphasis>grs.sgml</emphasis>. This filter is not yet packaged,
but planned to be in the
- <literal>libidzebra1.4-mod-grs-sgml</literal> Debian package.
+ <literal>libidzebra-2.0-mod-grs-sgml</literal> Debian package.
</para>
<para>
The Debian package
- <literal>libidzebra1.4-mod-grs-xml</literal> includes the
+ <literal>libidzebra-2.0-mod-grs-xml</literal> includes the
<emphasis>grs.xml</emphasis> filter which uses <ulink
url="http://expat.sourceforge.net/">Expat</ulink> to
parse records in XML and turn them into IDZebra's internal GRS node
<xref linkend="record-model-alvisxslt"/>.
</para>
<para>
- The Debian package <literal>libidzebra1.4-mod-alvis</literal>
+ The Debian package <literal>libidzebra-2.0-mod-alvis</literal>
contains the Alvis filter module.
</para>
</sect3>
+ <!--
<sect3 id="componentmodulessafari">
<title>SAFARI Record Model and Filter Module</title>
<para>
SAFARI filter module TODO: add information here.
- <!--
- <literal>safari module missing as deb file<literal>
- -->
</para>
</sect3>
+ -->
</sect2>
</sect1>
-<!--
- <sect1 id="architecture-querylanguage">
- <title>Query Languages</title>
-
- <para>
-
-http://www.loc.gov/z3950/agency/document.html
-
- PQF and BIB-1 stuff to be explained
- <ulink url="http://www.loc.gov/z3950/agency/defns/bib1.html">
- http://www.loc.gov/z3950/agency/defns/bib1.html</ulink>
-
- <ulink url="http://www.loc.gov/z3950/agency/bib1.html">
- http://www.loc.gov/z3950/agency/bib1.html</ulink>
-
- http://www.loc.gov/z3950/agency/markup/13.html
-
- </para>
- </sect1>
-
-
-These attribute types are recognized regardless of attribute set. Some are recognized for search, others for scan.
-
-Search
-
-Type Name Version
-7 Embedded Sort 1.1
-8 Term Set 1.1
-9 Rank weight 1.1
-9 Approx Limit 1.4
-10 Term Ref 1.4
-
-Embedded Sort
-
-The embedded sort is a way to specify sort within a query - thus removing the need to send a Sort Request separately. It is both faster and does not require clients that deal with the Sort Facility.
-
-The value after attribute type 7 is 1=ascending, 2=descending.. The attributes+term (APT) node is separate from the rest and must be @or'ed. The term associated with APT is the level .. 0=primary sort, 1=secondary sort etc.. Example:
-
-Search for water, sort by title (ascending):
-
- @or @attr 1=1016 water @attr 7=1 @attr 1=4 0
-
-Search for water, sort by title ascending, then date descending:
-
- @or @or @attr 1=1016 water @attr 7=1 @attr 1=4 0 @attr 7=2 @attr 1=30 1
-
-Term Set
-
-The Term Set feature is a facility that allows a search to store hitting terms in a "pseudo" resultset; thus a search (as usual) + a scan-like facility. Requires a client that can do named result sets since the search generates two result sets. The value for attribute 8 is the name of a result set (string). The terms in term set are returned as SUTRS records.
-
-Seach for u in title, right truncated.. Store result in result set named uset.
-
- @attr 5=1 @attr 1=4 @attr 8=uset u
-
-The model as one serious flaw.. We don't know the size of term set.
-
-Rank weight
-
-Rank weight is a way to pass a value to a ranking algorithm - so that one APT has one value - while another as a different one.
-
-Search for utah in title with weight 30 as well as any with weight 20.
-
- @attr 2=102 @or @attr 9=30 @attr 1=4 utah @attr 9=20 utah
-
-Approx Limit
-
-Newer Zebra versions normally estemiates hit count for every APT (leaf) in the query tree. These hit counts are returned as part of the searchResult-1 facility.
-
-By setting a limit for the APT we can make Zebra turn into approximate hit count when a certain hit count limit is reached. A value of zero means exact hit count.
-
-We are intersted in exact hit count for a, but for b we allow estimates for 1000 and higher..
-
- @and a @attr 9=1000 b
-
-This facility clashes with rank weight! Fortunately this is a Zebra 1.4 thing so we can change this without upsetting anybody!
-
-Term Ref
-
-Zebra supports the searchResult-1 facility.
-
-If attribute 10 is given, that specifies a subqueryId value returned as part of the search result. It is a way for a client to name an APT part of a query.
-
-Scan
-
-Type Name Version
-8 Result set narrow 1.3
-9 Approx Limit 1.4
-
-Result set narrow
-
-If attribute 8 is given for scan, the value is the name of a result set. Each hit count in scan is @and'ed with the result set given.
-
-Approx limit
-
-The approx (as for search) is a way to enable approx hit counts for scan hit counts. However, it does NOT appear to work at the moment.
-
-
- AdamDickmeiss - 19 Dec 2005
-
-
--->
-
</chapter>
- <!-- Keep this Emacs mode comment at the end of the file
-Local variables:
-mode: nxml
-End:
--->
-
+ <!-- Keep this comment at the end of the file
+ Local variables:
+ mode: sgml
+ sgml-omittag:t
+ sgml-shorttag:t
+ sgml-minimize-attributes:nil
+ sgml-always-quote-attributes:t
+ sgml-indent-step:1
+ sgml-indent-data:t
+ sgml-parent-document: "zebra.xml"
+ sgml-local-catalogs: nil
+ sgml-namecase-general:t
+ End:
+ -->