X-Git-Url: http://git.indexdata.com/?a=blobdiff_plain;f=doc%2Fadministration.xml;h=299b1d06ccd402c0e07a0378740a5a57fd42cde7;hb=e809d64f640790b2695a659bd2eb0ebd4e3cf963;hp=b8c1f0428e154579100b4eea3b0d8a0677eecffe;hpb=79dbb0556936ee101483f693d38bd5e97c49689e;p=idzebra-moved-to-github.git diff --git a/doc/administration.xml b/doc/administration.xml index b8c1f04..299b1d0 100644 --- a/doc/administration.xml +++ b/doc/administration.xml @@ -1,7 +1,13 @@ - + Administrating Zebra - + + Unlike many simpler retrieval systems, Zebra supports safe, incremental updates to an existing index. @@ -79,7 +85,7 @@ Indexing is a per-record process, in which either insert/modify/delete will occur. Before a record is indexed search keys are extracted from whatever might be the layout the original record (sgml,html,text, etc..). - The Zebra system currently supports two fundamantal types of records: + The Zebra system currently supports two fundamental types of records: structured and simple text. To specify a particular extraction process, use either the command line option -t or specify a @@ -99,8 +105,8 @@ You can edit the configuration file with a normal text editor. - parameter names and values are seperated by colons in the file. Lines - starting with a hash sign (#) are + parameter names and values are separated by colons in the file. Lines + starting with a hash sign (#) are treated as comments. @@ -146,13 +152,17 @@ explained further in the following sections. + + group - .recordType[.name]: + .recordType[.name]: type @@ -186,6 +196,7 @@ Specifies the Z39.50 database name. + @@ -198,6 +209,7 @@ group of records. If you plan to update/delete this type of records later this should be specified as 1; otherwise it should be 0 (default), to save register space. + See . @@ -217,6 +229,7 @@ + register: register-location @@ -248,7 +261,7 @@ keyTmpDir: directory - Directory in which temporary files used during zebraidx' update + Directory in which temporary files used during zebraidx's update phase are stored. @@ -263,7 +276,7 @@ - profilePath: path + profilePath: path Specifies a path of profile specification files. @@ -302,13 +315,34 @@ Specifies a directory base for Zebra. All relative paths given (in profilePath, register, shadow) are based on this - directory. This setting is useful if if you Zebra server + directory. This setting is useful if your Zebra server is running in a different directory from where zebra.cfg is located. + + @@ -318,14 +352,15 @@ Locating Records - The default behaviour of the Zebra system is to reference the + The default behavior of the Zebra system is to reference the records from their original location, i.e. where they were found when you ran zebraidx. That is, when a client wishes to retrieve a record following a search operation, the files are accessed from the place where you originally put them - if you remove the files (without - running zebraidx again, the client - will receive a diagnostic message. + running zebraidx again, the server will return + diagnostic number 14 (``System error in presenting records'') to + the client. @@ -370,7 +405,7 @@ - profilePath: /usr/local/yaz + profilePath: /usr/local/idzebra/tab attset: bib1.att simple.recordType: text simple.database: textbase @@ -418,7 +453,7 @@ disk space than simpler indexing methods, but it makes it easier for you to keep the index in sync with a frequently changing set of data. If you combine this system with the safe update - facility (see below), you never have to take your server offline for + facility (see below), you never have to take your server off-line for maintenance or register updating purposes. @@ -428,9 +463,13 @@ in the configuration file. In addition, you should set storeKeys to 1, since the Zebra indexer must save additional information about the contents of each record - in order to modify the indices correctly at a later time. + in order to modify the indexes correctly at a later time. + + For example, to update records of group esdd located below @@ -466,13 +505,14 @@ and then run zebraidx with the update command. + Indexing with General Record IDs - When using this method you construct an (almost) arbritrary, internal + When using this method you construct an (almost) arbitrary, internal record key based on the contents of the record itself and other system information. If you have a group of records that explicitly associates an ID with each record, this method is convenient. For example, the @@ -635,19 +675,22 @@ each directory in the order specified and use the next specified directories as needed. The size is an integer followed by a qualifier - code, M for megabytes, + code, + b for bytes, k for kilobytes. + M for megabytes, + G for gigabytes. For instance, if you have allocated two disks for your register, and the first disk is mounted - on /d1 and has 200 Mb of free space and the - second, mounted on /d2 has 300 Mb, you could + on /d1 and has 2GB of free space and the + second, mounted on /d2 has 3.6 GB, you could put this entry in your configuration file: - register: /d1:200M /d2:300M + register: /d1:2G /d2:3600M @@ -658,7 +701,7 @@ your responsibility to ensure that enough space is available, and that other applications do not attempt to use the free space. In a large production system, it is recommended that you allocate one or more - filesystem exclusively to the Zebra register files. + file system exclusively to the Zebra register files. @@ -764,14 +807,13 @@ In order to make changes to the system take effect for the users, you'll have to submit a "commit" command after a (sequence of) update operation(s). - You can ask the indexer to commit the changes immediately - after the update operation: - $ zebraidx update /d1/records update /d2/more-records commit + $ zebraidx update /d1/records + $ zebraidx commit @@ -783,7 +825,7 @@ - $ zebraidx -g books update /d1/records update /d2/more-records + $ zebraidx -g books update /d1/records /d2/more-records $ zebraidx -g fun update /d3/fun-records $ zebraidx commit