X-Git-Url: http://git.indexdata.com/?a=blobdiff_plain;f=doc%2Fintroduction.xml;h=06f69600873187a848295c589291effb2483a347;hb=64047719fa2acfe64e40352bdc5fe302e136c995;hp=645c0aa635a7e0a4438e86bc5cf6f93a7114071b;hpb=4fe772289b1ab968655c27b144d08fc69c113fd9;p=idzebra-moved-to-github.git

diff --git a/doc/introduction.xml b/doc/introduction.xml
index 645c0aa..06f6960 100644
--- a/doc/introduction.xml
+++ b/doc/introduction.xml
@@ -1,5 +1,5 @@
-
+
 Introduction
@@ -9,27 +9,21 @@
 The Zebra
- system is a fielded free-text indexing and retrieval engine with a
- Z39.50 front-end. You can use our various toolkits or any commercial
- or free-ware Z39.50 client to access data stored in Zebra.
-
-
-
- FIXME - not a "first step" but a part of a complete system! -H
-
-
-
- The Zebra server is our first step towards the development of a fully
- configurable, open information system. Eventually, it will be paired
- off with a powerful Z39.50 client to support complex information
- management tasks within almost any application domain. We're making
- the server available now because it's no fun to be in the open
- information retrieval business all by yourself. We want to allow
- people with interesting data to make their things
- available in interesting ways, without having to start out
- by implementing yet another protocol stack from scratch.
-
-
+ server is a high-performance, general-purpose structured text
+ indexing and retrieval engine. It reads structured records in a
+ variety of input formats (eg. email, XML, MARC) and allows access
+ to them through exact boolean search expressions and
+ relevance-ranked free-text queries.
+
+
+
+ Zebra supports large databases (more than ten gigabytes of data,
+ tens of millions of records). It supports incremental, safe
+ database updates on live systems. You can access data stored in
+ Zebra using a variety of Index Data tools (eg. YAZ and PHP/YAZ) as
+ well as commercial and freeware Z39.50 clients and toolkits.
+
+
 This document is an introduction to the Zebra system. It will tell you how to compile the software, and how to prepare your first database.
@@ -53,7 +47,7 @@
 Features
- This is a list of some of the most important features of the
+ This is an overview of some of the most important features of the
 system.
@@ -107,7 +101,8 @@
 Can import the data into Zebras own storage, or just refer to
- external files (html pages).
+ external files (good for building indexes of "live"
+ collections).
@@ -139,7 +134,7 @@
- Protocol support:
+ Z39.50 protocol support:
@@ -147,7 +142,6 @@
 Protocol facilities: Init, Search, Retrieve, Delete, Browse and Sort.
- FIXME - Itemupdate. (Remove delete until that time, confuses people) -H
@@ -186,13 +180,6 @@
-
-
- Some variant support (not fully implemented yet).
- FIXME - Test if complete enough - is it worth mentioning at all -H
-
-
-
@@ -205,62 +192,44 @@
 These are some of the plans that we have for the software in the near and far future, approximately ordered after their relative importance.
- Items marked with an
- asterisk will be implemented before the
- last beta release.
- FIXME - What are the current plans?
-
-
- *Complete the support for variants.
- FIXME - who cares -H
-
-
-
-
-
- *Finalize the data element include facility
- to support multimedia data elements in records.
-
-
- Add more sophisticated relevance ranking mechanisms.
- Add support for soundex and stemming.
- Add relevance feedback support.
+ Improved support for XML in search and retrieval. Eventually,
+ the goal is for Zebra to pull double duty as a flexible
+ information retrieval engine and high-performance XML
+ repository.
- Complete EXPLAIN support.
+ Access to search engine through SOAP/RPC API to allow the
+ construction of applications without requiring Z39.50 tools.
- Add support for very large records by implementing segmentation and/or
- variant pieces.
+ Finalisation, documentation of the Zebra API. Consider
+ exposing the API through SOAP as well (allowing updates,
+ database management).
- Support the Item Update extended service of the protocol.
+ Improved free-text searching. We're first and foremost octet jockeys and
+ we're actively looking for organisations or people who'd like
+ to contribute experience in relevance ranking and text
+ searching.
-
-
- We want to add a management system that allows you to
- control your databases and configuration tables from a graphical
- interface.
-
-