<ulink url="http://indexdata.dk/zebra/">&zebra;</ulink>
is a high-performance, general-purpose structured text
indexing and retrieval engine. It reads records in a
- variety of input formats (eg. email, &acro.xml;, &acro.marc;) and provides access
+ variety of input formats (e.g. email, &acro.xml;, &acro.marc;) and provides access
to them through a powerful combination of boolean search
expressions and relevance-ranked free-text queries.
</para>
<entry>Predefined field types</entry>
<entry>user defined</entry>
<entry>Data fields can be indexed as phrase, as into word
- tokenized text, as numeric values, url's, dates, and raw binary
+ tokenized text, as numeric values, URLs, dates, and raw binary
data.</entry>
<entry><xref linkend="character-map-files"/> and
<xref linkend="querymodel-pqf-apt-mapping-structuretype"/>
<entry>Regular expression matching</entry>
<entry>available</entry>
<entry>Full regular expression matching and "approximate
- matching" (eg. spelling mistake corrections) are handled.</entry>
+ matching" (e.g. spelling mistake corrections) are handled.</entry>
<entry><xref linkend="querymodel-regular"/></entry>
</row>
<row>
Why does Kete wants to use Zebra?? Speed, Scalability and easy
integration with Koha. Read their
<ulink
- url="http://kete.net.nz/blog/topics/show/44-who-what-why-when-answering-some-of-the-niggly-development-questions">detailled
+ url="http://kete.net.nz/blog/topics/show/44-who-what-why-when-answering-some-of-the-niggly-development-questions">detailed
reasoning here.</ulink>
</para>
</section>
- <section id="emilda-ils">
- <title>Emilda open source ILS</title>
- <para>
- <ulink url="http://www.emilda.org/">Emilda</ulink>
- is a complete Integrated Library System, released under the
- GNU General Public License. It has a
- full featured Web-OPAC, allowing comprehensive system management
- from virtually any computer with an Internet connection, has
- template based layout allowing anyone to alter the visual
- appearance of Emilda, and is
- &acro.xml; based language for fast and easy portability to virtually any
- language.
- Currently, Emilda is used at three schools in Espoo, Finland.
- </para>
- <para>
- As a surplus, 100% &acro.marc; compatibility has been achieved using the
- &zebra; Server from Index Data as backend server.
- </para>
- </section>
-
<section id="reindex-ils">
<title>ReIndex.Net web based ILS</title>
<para>
</para>
<para>
More information can be found at
- <ulink url="http://www.dtv.dk/"/> and
+ <ulink url="http://www.dtic.dtu.dk/"/> and
<ulink url="http://dads.dtv.dk"/>
</para>
</section>
- <section id="infonet-eprints">
- <title>Infonet Eprints</title>
- <para>
- The InfoNet Eprints service from the
- <ulink url="http://www.dtv.dk/">
- Technical Knowledge Center of Denmark</ulink>
- provides access to documents stored in
- eprint/preprint servers and institutional research archives around
- the world. The service is based on Open Archives Initiative metadata
- harvesting of selected scientific archives around the world. These
- open archives offer free and unrestricted access to their contents.
- </para>
- <para>
- Infonet Eprints currently holds 1.4 million records from 16 archives.
- The online search facility is found at
- <ulink url="http://preprints.cvt.dk"/>.
- </para>
- </section>
-
- <section id="alvis-project">
- <title>Alvis</title>
- <para>
- The <ulink url="http://www.alvis.info/alvis/">Alvis</ulink> EU
- project run under the 6th Framework (IST-1-002068-STP)
- is building a semantic-based peer-to-peer search engine. A
- consortium of eleven partners from six different European
- Community countries plus Switzerland and China contribute
- with expertise in a broad range of specialties including network
- topologies, routing algorithms, linguistic analysis and
- bioinformatics.
- </para>
- <para>
- The &zebra; information retrieval indexing machine is used inside
- the Alvis framework to
- manage huge collections of natural language processed and
- enhanced &acro.xml; data, coming from a topic relevant web crawl.
- In this application, &zebra; swallows and manages 37GB of &acro.xml; data
- in about 4 hours, resulting in search times of fractions of
- seconds.
- </para>
- </section>
-
-
<section id="uls">
<title>ULS (Union List of Serials)</title>
<para>
</para>
</section>
- <section id="nli">
- <title>NLI-&acro.z3950; - a Natural Language Interface for Libraries</title>
- <para>
- Fernuniversität Hagen in Germany have developed a natural
- language interface for access to library databases.
- <!-- <ulink
- url="http://ki212.fernuni-hagen.de/nli/NLIintro.html"/> -->
- In order to evaluate this interface for recall and precision, they
- chose &zebra; as the basis for retrieval effectiveness. The &zebra;
- server contains a copy of the GIRT database, consisting of more
- than 76000 records in &acro.sgml; format (bibliographic records from
- social science), which are mapped to &acro.marc; for presentation.
- </para>
- <para>
- (GIRT is the German Indexing and Retrieval Testdatabase. It is a
- standard German-language test database for intelligent indexing
- and retrieval systems. See
- <ulink url="http://www.gesis.org/forschung/informationstechnologie/clef-delos.htm"/>)
- </para>
- <para>
- Evaluation will take place as part of the TREC/CLEF campaign 2003
- <ulink url="http://clef.iei.pi.cnr.it"/>.
- <!-- or <ulink url="http://www4.eurospider.ch/CLEF/"/> -->
- </para>
- <para>
- For more information, contact Johannes Leveling
- <email>Johannes.Leveling@FernUni-Hagen.De</email>
- </para>
- </section>
-
<section id="various-web-indexes">
<title>Various web indexes</title>
<para>
&zebra; has been used by a variety of institutions to construct
indexes of large web sites, typically in the region of tens of
millions of pages. In this role, it functions somewhat similarly
- to the engine of google or altavista, but for a selected intranet
+ to the engine of Google or AltaVista, but for a selected intranet
or a subset of the whole Web.
</para>
<para>
releases, bug fixes, etc.) and general discussion. You are welcome
to seek support there. Join by filling the form on the list home page.
</para>
- <para>
- Third, it's possible to buy a commercial support contract, with
- well defined service levels and response times, from Index Data.
- See
- <ulink url="&url.indexdata.support;"/>
- for details.
- </para>
</section>
</chapter>
<!-- Keep this comment at the end of the file