<chapter id="introduction">
- <!-- $Id: introduction.xml,v 1.51 2007-12-19 13:35:39 adam Exp $ -->
<title>Introduction</title>
<section id="overview">
<ulink url="http://indexdata.dk/zebra/">&zebra;</ulink>
is a high-performance, general-purpose structured text
indexing and retrieval engine. It reads records in a
- variety of input formats (eg. email, &acro.xml;, &acro.marc;) and provides access
+ variety of input formats (e.g. email, &acro.xml;, &acro.marc;) and provides access
to them through a powerful combination of boolean search
expressions and relevance-ranked free-text queries.
</para>
<entry>Predefined field types</entry>
<entry>user defined</entry>
<entry>Data fields can be indexed as phrase, as into word
- tokenized text, as numeric values, url's, dates, and raw binary
+ tokenized text, as numeric values, URLs, dates, and raw binary
data.</entry>
<entry><xref linkend="character-map-files"/> and
<xref linkend="querymodel-pqf-apt-mapping-structuretype"/>
<entry>Regular expression matching</entry>
<entry>available</entry>
<entry>Full regular expression matching and "approximate
- matching" (eg. spelling mistake corrections) are handled.</entry>
+ matching" (e.g. spelling mistake corrections) are handled.</entry>
<entry><xref linkend="querymodel-regular"/></entry>
</row>
<row>
</para>
</section>
- <section id="emilda-ils">
- <title>Emilda open source ILS</title>
+
+ <section id="kete-dom">
+ <title>Kete Open Source Digital Library and Archiving software</title>
<para>
- <ulink url="http://www.emilda.org/">Emilda</ulink>
- is a complete Integrated Library System, released under the
- GNU General Public License. It has a
- full featured Web-OPAC, allowing comprehensive system management
- from virtually any computer with an Internet connection, has
- template based layout allowing anyone to alter the visual
- appearance of Emilda, and is
- &acro.xml; based language for fast and easy portability to virtually any
- language.
- Currently, Emilda is used at three schools in Espoo, Finland.
+ <ulink url="http://kete.net.nz/">Kete</ulink> is a digital object
+ management repository, initially developed in
+ New Zealand. Initial development has
+ been a partnership between the Horowhenua Library Trust and
+ Katipo Communications Ltd. funded as part of the Community
+ Partnership Fund in 2006.
+ Kete is purpose built
+ software to enable communities to build their own digital
+ libraries, archives and repositories.
</para>
<para>
- As a surplus, 100% &acro.marc; compatibility has been achieved using the
- &zebra; Server from Index Data as backend server.
- </para>
+ It is based on Ruby-on-Rails and MySQL, and integrates the &zebra; server
+ and the &yaz; toolkit for indexing and retrieval of it's content.
+ Zebra is run as separate computer process from the Kete
+ application.
+ See
+ how Kete <ulink
+ url="http://kete.net.nz/documentation/topics/show/139-managing-zebra">manages
+ Zebra.</ulink>
+ </para>
+ <para>
+ Why does Kete wants to use Zebra?? Speed, Scalability and easy
+ integration with Koha. Read their
+ <ulink
+ url="http://kete.net.nz/blog/topics/show/44-who-what-why-when-answering-some-of-the-niggly-development-questions">detailed
+ reasoning here.</ulink>
+ </para>
</section>
<section id="reindex-ils">
</para>
<para>
More information can be found at
- <ulink url="http://www.dtv.dk/"/> and
+ <ulink url="http://www.dtic.dtu.dk/"/> and
<ulink url="http://dads.dtv.dk"/>
</para>
</section>
- <section id="infonet-eprints">
- <title>Infonet Eprints</title>
- <para>
- The InfoNet Eprints service from the
- <ulink url="http://www.dtv.dk/">
- Technical Knowledge Center of Denmark</ulink>
- provides access to documents stored in
- eprint/preprint servers and institutional research archives around
- the world. The service is based on Open Archives Initiative metadata
- harvesting of selected scientific archives around the world. These
- open archives offer free and unrestricted access to their contents.
- </para>
- <para>
- Infonet Eprints currently holds 1.4 million records from 16 archives.
- The online search facility is found at
- <ulink url="http://preprints.cvt.dk"/>.
- </para>
- </section>
-
- <section id="alvis-project">
- <title>Alvis</title>
- <para>
- The <ulink url="http://www.alvis.info/alvis/">Alvis</ulink> EU
- project run under the 6th Framework (IST-1-002068-STP)
- is building a semantic-based peer-to-peer search engine. A
- consortium of eleven partners from six different European
- Community countries plus Switzerland and China contribute
- with expertise in a broad range of specialties including network
- topologies, routing algorithms, linguistic analysis and
- bioinformatics.
- </para>
- <para>
- The &zebra; information retrieval indexing machine is used inside
- the Alvis framework to
- manage huge collections of natural language processed and
- enhanced &acro.xml; data, coming from a topic relevant web crawl.
- In this application, &zebra; swallows and manages 37GB of &acro.xml; data
- in about 4 hours, resulting in search times of fractions of
- seconds.
- </para>
- </section>
-
-
<section id="uls">
<title>ULS (Union List of Serials)</title>
<para>
</para>
</section>
- <section id="nli">
- <title>NLI-&acro.z3950; - a Natural Language Interface for Libraries</title>
- <para>
- Fernuniversität Hagen in Germany have developed a natural
- language interface for access to library databases.
- <!-- <ulink
- url="http://ki212.fernuni-hagen.de/nli/NLIintro.html"/> -->
- In order to evaluate this interface for recall and precision, they
- chose &zebra; as the basis for retrieval effectiveness. The &zebra;
- server contains a copy of the GIRT database, consisting of more
- than 76000 records in &acro.sgml; format (bibliographic records from
- social science), which are mapped to &acro.marc; for presentation.
- </para>
- <para>
- (GIRT is the German Indexing and Retrieval Testdatabase. It is a
- standard German-language test database for intelligent indexing
- and retrieval systems. See
- <ulink url="http://www.gesis.org/forschung/informationstechnologie/clef-delos.htm"/>)
- </para>
- <para>
- Evaluation will take place as part of the TREC/CLEF campaign 2003
- <ulink url="http://clef.iei.pi.cnr.it"/>.
- <!-- or <ulink url="http://www4.eurospider.ch/CLEF/"/> -->
- </para>
- <para>
- For more information, contact Johannes Leveling
- <email>Johannes.Leveling@FernUni-Hagen.De</email>
- </para>
- </section>
-
<section id="various-web-indexes">
<title>Various web indexes</title>
<para>
&zebra; has been used by a variety of institutions to construct
indexes of large web sites, typically in the region of tens of
millions of pages. In this role, it functions somewhat similarly
- to the engine of google or altavista, but for a selected intranet
+ to the engine of Google or AltaVista, but for a selected intranet
or a subset of the whole Web.
</para>
<para>
releases, bug fixes, etc.) and general discussion. You are welcome
to seek support there. Join by filling the form on the list home page.
</para>
- <para>
- Third, it's possible to buy a commercial support contract, with
- well defined service levels and response times, from Index Data.
- See
- <ulink url="&url.indexdata.support;"/>
- for details.
- </para>
</section>
</chapter>
<!-- Keep this comment at the end of the file