X-Git-Url: http://git.indexdata.com/?p=idzebra-moved-to-github.git;a=blobdiff_plain;f=NEWS;h=8444ab9b7358be006ee31dd07c6f1888fb003892;hp=121079059d71f38b12f570cba17515e48485278a;hb=693a0db94b4b3ac9aee7722572a6b81a86a12e13;hpb=6ba9698e88c0283e40fa5980a1a6b551fff2d597 diff --git a/NEWS b/NEWS index 1210790..8444ab9 100644 --- a/NEWS +++ b/NEWS @@ -1,622 +1,357 @@ -Zebra uses string attributes for indexing internally. Using set+numeric -use attribute can still be used. But that is a search-only conversion which -inspects .att-set files as indicated using attset-directives in zebra.cfg. -attset references are no longer required, but when used they deserve -as "check" for that the index names used are also present in .att. - -Implemented bug #588: allrecords search. Using @attr 1=allrecords "" -matches all records in a database. - -Zebra reads (non-standard) attribute 10 for RPN queries. The value, when -specified, is returned as subqueryId in SearchResult-1. It un-specified -no subqueryId is part of SearchResult-1. - -Added 'dbperms' setting to zebra.cfg. Names file with lines of -format username: db1+...dbn, listing databases that user has access to. -Database-specific access control only happens for searching, at present. - -Added zebra_set_approx_limit for a ZebraHandle (session). Results -will be approximate if hit count is greater than the limit specified. - -Added support for term hit counts. This was not in place for earlier -1.4 versions, but is present in the 1.3 series. Bug #124. - -Implemented the 'equivalent' directive for .chr-files. - -Added zebra_get_bfs to get BFile handle for session. - -Added 'melm' directive to absyn format to simplify config files -for MARC-style databases. See tab/marc21.abs for an example. - -Added bath-compliant definition file marc21.abs for use with MARC databases. - -Added autoconf utility idzebra.m4 which defines --with-idzebra . - -Added mechanism to ignore leading articles when doing full-field indexing, -based on the character map files. See the manual for further discussion. - -Fixed bug in record management. Releasing blocks could result in -partial read. - -Fixed bug in isam:b. A tree split could result in a lost item. - -Remove isamd. It's not been in use for a long time and isamb is better -in most cases. - -Change SYSNO to be zint. Change pointers in isamc and isamb to zint. -Change block number in bfile/cfile to zint. zint is a long integer -(64-bit). This change practially removes register limits for Zebra. - -Implement int-list encoding for ISAMs. - -Added facility to make attibutes in grs.regx and grs.tcl filter using the -data command with argument -attribute . The content of data is -the value of the attribute. This command should be used inside a -begin element , end element section. - -Update zebra.nsi to NSIS 2. - -Added a new 'cut' directive to charmaps (.chr files) which specifies that -only characters after the cutting char should be indexed. - -Update Perl internals so that it matches the current Zebra API. -The recordGroup structure is no longer available. A group of resources -can still be referenced by setting groupName=>.. in various methods. - -Maximum number of records to be sorted in a result set can be -specified by setting "sortmax". Default is 1000. - -Allow use of string use attributes for regular attribute sets. The -name matches the name given in the attribute set file. All strings -starting with / are considered X-Path as usual. - -Fixed bug in grs.regx. filter . 'end element' could pop off top tag -element for XML tree. It may only pop off if -record is given. - -Added grs.danbib filter - for Danish Bibliographic Centre. - -Rename CHANGELOG to NEWS. - -For text filter, return only header if elementSetName=H . elementSetName=R -returns contents only. Other elementSetName returns both header+content. - -Added test for charmap and rusmarc. - -Added feature charmaps (.chr) so that characters may be specified in -\LXXXX HEX notation. - -Fixed problem with encoding directive for charmap(.chr) files. - -Allow Remote insert/delete/replace/update with record, recordIdNumber -(sysno) and/or recordIdOpaque(user supplied record Id). If both -IDs are omitted internal record ID match is assumed (recordId: - in -zebra cfg). - ---- 1.3.15 2004/01/15 - -Fix bug. X-Path attribute expressions with spaces in them now works. - -Fix base address for MARC output. - ---- 1.3.14 2003/11/29 - -Fix bug with shadow and result set handling. - -Implement MARCXML to ISO2709 conversion. - ---- 1.3.13 2003/09/26 - -Add missing examples for Windows install. - -Fix bug in regx filter to make it "greedy" again. This bug appeared -in version 1.3.12. - -Fix a few tests. - ---- 1.3.12 2003/09/08 - -Fix XML error handling. Stop XML parse immediately if XML parse error -occur (i.e. produce one error only). - -Zebra ignores "unsupported use attribute" for individual databases -when search multiple databases (unless all databases fail). - -New filter grs.marcxml which works like grs.marc but produces MARCXML. - -Added support for database deletion. It is possible to create/drop -a database from zebraidx utility. Note: only for isam:b. - -Write zebrasrv.pid to lockdir. - -Bug fix: result sets were not recovered correctly. Had to -add ODR handle for zebra_search_RPN in order to make it work. - -Fixed a bug in regx filters that didn't do anchors (^) correctly. - -Fixed a bug in searches with X-Path searches sometimes giving "extra" -hits. - -Zebra server checks for zebrasrv.pid and refuses to start if it is already -locked by another (running) zebrasrv. +--- 2.0.46 2011/03/23 -Fixed a bug with text being chunked in pieces for the grs.xml filter. +Fix bug in dictionary component of Zebra that could lead to +inconsistent register assert for large deleteions. ---- 1.3.11 2003/04/25 +Improve estimate hits for phrase searches. -xelm code updates. xelm works regardless state of 'xpath enable/disable' -Avoid -L/usr/lib since that is already default library path. +--- 2.0.45 2010/12/22 -Allow multiple updates within one transaction. +Add Debian package for Ubuntu maverick. -Fixed a bug with >2GB files (overflow in integer expression). +Update to newer Libxml2/Libxslt for Windows package. ---- 1.3.10 2003/04/01 +Escape-sequences are disabled for RPN-queries with truncation=105/106. +These, unlike regular expressions, do not support \-sequences . -Fix linker error for Perl module. +--- 2.0.44 2010/06/29 -Fix bug in and operation which in some cases could result in "extra" -hits. Bug was introduced in 1.3.5. +grs1: avoid term_len < 0 for .abs operator 'range' -Fix bug in handling of schema conversion when producing numeric tags. +ICU: fix use of un-init variable. ---- 1.3.9 2003/03/27 +More reasonable approx hits for truncated searches. -Zvrank updates. +Updates WRT Odr_int (YAZ 4). -Add missing files doc/zvrank.txt and doc/marc_indexing.xml. +--- 2.0.43 2009/11/05 ---- 1.3.8 2003/03/26 +Fix hits counts for searches when @attr 12=n is in use. This bug +was introduced in 2.0.40. -Zvrank: an experimental ranking algorithm. See doc/zvrank.txt and -source in index/zvrank.c. Enable this by using rank: zvrank in zebra.cfg. -Contributed by Johannes Leveling +--- 2.0.42 2009/10/08 -livrank: another experimental ranking algorithm. Source in livcode.c. -Enable this by using rank: livrank in zebra.cfg and use -DLIV_CODE=1 -for CFLAGS. -Contributed by Pete Mallinson, University of Liverpool. +Proximity modifications. The optimized proximity now deals with more +cases, ie all prox of the form @prox 0 d 1 1,2,3 k 2 .. +That's exlusive==0, any distance, ordered (true), relation (<, =, <=), +known unit word. -Advanced MARC indexing. See doc/marc_indexing.xml - Oleg Kolobov +Fix bug WRT inconsistent registers, bug #3062. -Perl API updates and fixes. - Peter Popovics +Fix install doc rule so it's compatible with Automake 1.11. -Fixed 'zebraidx delete'. +--- 2.0.41 2009/08/24 -Implemented 'zebraidx clean'. +Fix typos and other errors in the documentation. Thanks to Galen +Charlton. -64-bit offsets for register files on WIN32 (no 2 GB limit). +Fix DOM filter so that full record ID (z:id) if given is used verbatim +rather than truncated until first blank - in other words use memcpy +rather than sscanf. For PI fix record attributes parsing which was +completely broken. -Fixed a few memory leaks WRT sorting. +--- 2.0.40 2009/07/13 ---- 1.3.7 2003/02/27 +Fixed problem in result set handling and segment handling. -Fixed error handling : error code was not properly returned. +--- 2.0.39 2009/07/09 -Support Truncation 104 (CCL). +Improved speed of record fetching by using a different block sizes +for recd{A,B}. ---- 1.3.6 2003/02/25 +Added zlib-compression of recd-storage (record data). This is enabled +by using recordCompression: zlib in zebra.cfg. -Added missing source files for perl extension. +Dictionary delete more agressive (dict_del_string). ---- 1.3.5 2003/02/23 +--- 2.0.38 2009/06/19 -Implemented xelm directive. +Improved estimated hits for AND operation (bug #2907) -Updated for newer version of YAZ (introduction of string schema). +Fixed problem with register area growing after "drop DB". (bug #2913). -Directory examples/zthes now part of distribution (was missing -in previous release). +--- 2.0.37 2009/05/18 -New .abs directive, systag, that control where to put retrieval -information. The directive takes two arguments: system tag, element name. -System tag is one of : rank, sysno, size. +Example with marc21 is now installed and part of packages. ---- 1.3.4 2002/11/26 +Fixed XSL for OAI example. -Perl Filter and Perl API. By Peter Popovics. +Fixed problem with generic proximity handling. -For zebra.cfg, if no profilePath is specified, directory - (prefix)/share/idzebra/tab -is used. +Added new ranking handler: rank-2. This is a combination of rank-1 and +static rank.. The staticrank is inverse (lowest value comes first, etc..). +rank-2 thus substracts that from score. This allows a system to specify +a "base" rank (=staticrank). -Zebra Examples in examples . Zebra tests in test. +--- 2.0.36 2009/02/17 -Bug fix: sort index was not properly modified on -record updates/deletes. +Added implementation-codes directive for .mar files. -Fix handling of character entities for sgml filter. +Avoid error for missing recordId when zebraidx' option -s is in use. -Move data1 to Zebra (used to be part of YAZ). +Fixed problem with some disk blocks not beeing freed correctly when +'drop database' is used. ---- 1.3.3 2002/10/05 +--- 2.0.34 2008/11/06 -Fix character encoding of scan response terms. +Facets can now be performed on sort registers (:s), not just regular indexes +(:w, :p) etc.. The performance appers to be much better than before. -Fix character decoding of scan request terms. +Added a facility to embed metadata for records generated by the DOM XML +filter. The meta data step is triggered by a 'process-meta' section in the +retrieve section of the dom config, e.g. + + + + +The meta section substitutes metadata for all elements called 'meta' in +namespace http://indexdata.com/zebra-2.0 . The meta element must have +exactly one attribute, name, which specifies the special +element set name which is equivalent to the suffix to zebra:: for +non-embedded special retrievals. This allows snippets, facets etc to be +included. -Fix ESpec handling (requires YAZ 1.9.1) +Fixed bug #2202: incorrect sorting order when searching multiple databases. -Fix searches for complete fields. +--- 2.0.32 2008/06/06 ---- 1.3.2 2002/09/09 +Fixed OAI example. -When name zebra is used in a filename or directory 'idzebra' is used -instead to avoid confusion with GNU zebra (routing software). +--- 2.0.30 2008/05/14 -Zebra server stops with a fatal error if config file cannot be read. +Changed service name for Windows. Bug #978. -New config setting, followLinks, that controls whether update of files -should follow symbolic. Set it to 1 (for enable) or 0 (to disable). -By default symbolic links are followed. +Fixed the alwaysmatches including _ALLRECORDS for ICU term indexing. Bug #2139. -Fix MARC transfer . MARC fields had wrong data for multiple fields. +--- 2.0.28 2008/04/03 -XML record reader moved from YAZ to Zebra, to make YAZ less -dependant on external libraries. +Handle right-truncation for ICU normalized terms. -Zebra uses yaz_iconv which is mini iconv library supporting UTF-8, -UCS4, ISO-8859-1. This means that Zebra does UNICODE even -on systems that doesn't offer iconv. +Updated OAI-PMH example. -XML record reader supports external system entities. +--- 2.0.26 2008/01/28 ---- 1.3.1 2002/08/20 +Fixes for searches on Chinese text. -New .abs-directive "xpath" that takes one argument: "enable" -or "disable" to enable and disable XPath -indexing. If no "xpath" -direcive is found in .abs-file , XPath-indexing is disabled to ensure -backwards compatibility. For missing .abs-files XPath-indexing is -enabled so that such records are searchable. +Fixes for scan using ICU indexing terms. -Zebra warns about missing .abs-file only once (for each type). +--- 2.0.24 2008/01/18 -Fixed a bug in file update where already-inserted files could -be treated as "new". +Fixed some memory violations for ICU enabled indexing that could be +triggered by long phrases. ---- 1.3.0 2002/08/05 +Enable ICU on Windows. Thanks to Tumer Garip for makefile patches. -Zebra license changed to GNU GPL. +Fixed bug #2002: Zebra crashes during merge using ICU indexing. -XPath-like queries used when RPN string attributes are used, eg. - @attr 1=/portal/title sometitle - @attr 1=/portal/title[@xml:lang=da] danishtitle - @attr 1=/portal/title/@xml:lang da - @attr 1=//title sometitle +--- 2.0.22 2007/12/20 -Zebra uses UTF-8 internally: -1) New setting "encoding" for zebra.cfg that specifies encoding for -OCTET terms in queries and record encoding for most transfer syntaxes -(except those that use International Strings, such as GRS-1). -2) The encoding of International strings is UTF-8 by default. It -may be changed by character set negotiation. If character set -negotiation is in effect and if records are selected for conversion -these'll be converted to the selected character set - thus overriding -the encoding setting in zebra.cfg. -3) New directive "encoding" in .abs-files. This specifies the external -character encoding for files indexed by zebra. However, if records -themselves have an XML header that specifies and encoding that'll be used -instead. +Handle encoding errors for strings passed to ICU. (assert was +thrown earlier). -XML filter (-t grs.xml). +Added zebraidx command, adelete, which attempts to delete records. +It's like 'delete', but does not fail if a record does not exist. -Multiple registers. New setting in resource 'root' that holds base -directory for register(s). A group a databases may be put in separate -register in directory root/reg by using db name 'reg/db1' ... 'reg/dbN'. +--- 2.0.20 2007/12/19 ---- 1.1.1 2002/03/21 +Added experimental support for faceted results. This is implemented +as a special retrieval on a result set and is triggered using element +set name zebra::facet::f1:t1,f2,t2,.. which will return terms and +hit counts for field f1, type t1, field f2, type t2, .. The returned +data is packed as XML or SUTRS depending on record syntax. -Fixes for Digital Unix +Added support for Unicode-based indexing using ICU. This allows +locale specific scanning, sorting. The implementation is based on +the ICU utility part of YAZ 3.0.16 and later. See the manual as well +as the examples/marcxml for an example of the use of ICU. -Implemented hits per term using USR:SearchResult-1. +--- 2.0.18 2007/09/19 -New Zebra API. Locking system re-implemented. +Fixed bug with scan and staticrank enabled. Bug introduced in 2.0.16. ---- 1.1.stable 2002/02/20 +--- 2.0.16 2007/09/12 -Rank weight can be controlled with attribute type 9. Default -value is 34. Recommended values between 1-36. +Scan now returns a displayTerm which closer to the original word/phrase +from original record. ---- 1.1 2001/10/25 +Added snippet support. Element set name zebra::snippet will make +Zebra return an XML record with snippets (terms around matching terms). -Updated for YAZ version 1.8. +Fixed bug #1142: Non-indexed but listed attributes issues diagnostic. -Added support for termsets - a result set of terms matching -a given query. For @attr 8= creates termset named . +Fixed bug #1131: Missing value-of data in DOM filter. -Added support for raw retrieval. Element Set Name R forces the -text filter which returns the record in its original form. +Fixed bug #1049: zebra.cfg lines with leading space are ignored. -Added numerical sort - triggered by structure=numeric (4=109). +Fixed bug #1128: sortmax not honored. -Remote record import using Z39.50 Extended Services and Segments. +Fixed bug #1121: Crash for some searches with customized string.chr. -Fixed bug where updating a database with user-defined attributes -could corrupt the register (bad storeKeys). +--- 2.0.14 2007/05/09 -Multi-threaded version. +Zebra uses the YAZ 3 API. Note that this changes the Zebra API +as well. The register layout is the same however. -Fixed bug regarding proximity. +Fixed bug #1114: scan within set may use excessive CPU. -Documentation updates. +Fixed memory leak that occurred in scan. -Fixed bug in record retrieval module that occured on 64-bit OSF -architectures. +Fixed memory leak that occurred for each deleted record. ---- 1.0.1 2000/2/10 +Optimize updates of records where content is almost identical to previous +version of record. This makes updating of the internal explain database +faster too. -Fixed bug in makefile for WIN32. +For RPN queries the index type (w,p,..) may be specified verbatim as +structure attribute with string value, e.g. @attr 4=w . -Fixed bug in configure script - used bash-specific features. +Changed record update API . It is now handled by function +zebra_record_update which does insert/replace/delete/update of records . +This function replaces zebra_record_{insert,delete} and +zebra_admin_exchange_record. The DOM filter uses this feature and the +@type attribute in record element specifies the action +insert/replace/delete/update. ---- 1.0 1999/12/10 +Added support for multi-record updates (Bug #944). Based on patch from +Hans-Werner Hilse. -Added support for multiple records in one file for filter grs.sgml. +--- 2.0.12 2007/03/07 -Changed record index structure. New layout is incompatible with -previous releases. Added setting "recordcompression" to control -compression of records. Possible values are "none" (no -compression) and bzip2 (compression using libbz2). +Fixed bug with indexing of attributes for rec.grs-class of filters. +If X-Path was enabled xelm a/@b would be ignored. -Added XML transfer syntax support for retrieval of structured records. -Schema in CompSpec is recognised in retrieval of structured records. +Fixed bug in register system where 'no more space' messages was produced +even though there was plenty of space. -Changed Tcl record filter so that it attemps to read .tflt. If -that fails, the filter reads the file .flt (regx style filter). +Fixed bug #884: Entity declarations in input are lost at retrieval time. -Implemented new Tcl record filter - use grs.tcl. to enable it. -Zebra's configure script automatically attempts to locate Tcl. For -manual Tcl configuration use option --with-tclconfig= to specify -where Tcl's library files are located. +Implemented new filter 'dom'. See test/xslt/dom-config*xml for examples. +This, like alvis, performs indexing and retrieval using XSLT. But Unlike +alvis, it allows multiple XSLT steps to be performed and does ISO2709 +reading. Bug #843. -Implemented "compression" of Dictionary and ISAM system. Dictionary -format HAS changed. +--- 2.0.10 2007/01/24 -Added "tagsysno" directive to zebra.cfg to control under which tag the -system ID is placed. Use tagsysno: 0 to disable Zebra's system number -entirely. +Staticrank indexing is now an index register type defined in default.idx +via directive 'staticrank'. The 'staticrank' directive for grs is no longer +supported (was only implemented for Zebra 2.0.8). -Added "tagrank" as above. +For searches, allow truncmax value to be controlled with attribute 13. +If given, that overrides the value of 'truncmax'. -Changed file naming scheme for register files from .mf. to --.mf. +For truncations being limited (abort at truncmax terms), zebrasrv returns +SearchResponse with resultSetStatus=subset. -Implemented "position"-flag for register type (as defined in -default.idx). When set to zero no position (or seqence number) is -saved in register for each word occurrence, thus saving some register -space. +For estimated hit counts, zebrasrv returns SearchResponse with +resultSetStatus=estimate. -Implemented database mapping. Using mapdb one can specify a database -to be mapped to one or more physical databases. Usage: -mapdb .. +--- 2.0.8 2007/01/15 -Added SOIF-filter. Thanks to Peter Valkenburg. +For searches, do not truncate more than at most 'truncmax' terms in one +single term with right-truncation/regular attribute. By default, 'truncmax', +is 10000. -For the regx-filter "end element -record" may trigger a mark-of-record -if outer level is reached. +Bug #781: Easier tracking of result sets. We only do this when mkstemp +is available (most moderun Unixes supports this). The PID is included +in filename : tempdir/zrs_PID_XXXXXX. -Tag sets may be typed in the reference to it. From the .abs-file the -"tagset" directive takes a third optional integer type for the tag set -referenced. From a .tag-file the "include" directive takes a third -optional type as well. The old "type" directive in the tag set itself -is still recognized but acts as the default type for the tag set. - -Zebra supports the specification of arbitrary attributes sets, schemas -and tag sets, because of the change in YAZ' OID management system. +Added support for specification of staticrank for grs-class of filters. +This is enabled by using 'staticrank indexname' in .abs. Contents of +indexname (elm ... indexname:w) is then used as value for staticrank. -Fixed bug in Sort that caused it NOT to use character mapping as it -should. +Implemented sorting via the ISAMB system. To enable, use sortindex:i in +zebra.cfg. -Zebra now uses GNU configure to generate Makefile(s). +Added special retrieval support for sort keys. These keys can also be +fetched using zebra::index:field:s -Added un-optimised support for left and left/right truncation attributes. +Added support for specification of approximative limits for whole query. +This is specified as attribute type 12. Semantics is the same as +estimatehits in zebra.cfg. -Added support for relational operators on text when using RPN queries. +--- 2.0.6 2006/11/28 -Added support for sort specifications in RPN queries. Type 7 specifies -'sort' where value 1=ascending, value 2=descending. The use attribute -specifies the field criteria as usual. The term specifies priority -where 0=first, 1=second, ... +Fixed bug #736: Updates gets slower. -Changed the way use attributes are specified in the recordId -specification. +Fixed bug #735: Delete fails to process rest of file after a record that +does not exist, -Maximum number of databases in one Zebra register increased. +Added RPM package for Zebra 2. -New setting, databasePath, which specifies that first directory during -update traversal is the database name (instead of a fixed one). +Improved indexing speed for large documents. -New setting, explainDatabase, which specifies that databases are -EXPLAIN aware. +Added support for special utility retrieval zebra:: which offers retrieval +of fundamental properties for record, such as indexed terms, raw record, +and system record ID. -Modified Zebra so that it works with ASN.1 compiled code for YAZ. +--- 2.0.4 2006/10/16 -Implemented EXPLAIN database maintenance. Zebra automatically -generate - and update CategoryList, TargetInfo, DatabaseInfo, -AttributeSetInfo and AttributeDetails records at this stage. The -records may be transferred as GRS-1, SUTRS or Explain. +Fixed bug #47: Commit needs to check for roll-back. -Fixed register spec so that colon isn't treated as size separator -unless followed by [0-9+-] in order to allow DOS drive specifications. +Fixed bug #672: Trailing characters in password are ignored -Fixed two bugs in ISAMC system. +Added extra presence check for tcl.h, because some systems have +tclConfig.sh installed even though Tcl C headers are missing. -Changed the way Zebra keeps its maintenance information about attribute -sets, available attributes, etc.. Records in "SGML" notation using an -EXPLAIN schema is now used when appropriate. +Optimized melm performance. -Bug fix: Index didn't handle update/insert/delete of the same record -(i.e. same recordId) in one run (one invocation of zebraidx). Only the -first occurence of a record is considered. +Do not use sync(2) during commit (but rely on sync'd individual files). -Most searches now return correct number of hits. +Fixed bug in ISAMB's utility function decode_ptr. With gcc -O3 on some +platforms the src argument was not updated. Problem was compiler +optimization due to strict aliasing rules. -New modular ranking system. Interested programmers are encouraged to -inspect rank1.c and improve the algorithm. +--- 2.0.2 2006/09/22 -Bug fix: Lock files weren't removed as they should on NT. +Bug fixes: #638, #647, #657, #669. -Implemented Z39.50 Sort. Zebra's sort handler uses use attributes to -specify a "sort register". Refer to the gils sample records which refer -to index type "s" which is specified as "sort" in the default.idx file. -Each sort criteria can either be Ascending or Descending and at most -three sort elements can be specified. +Implement skip of 0 hit counts in scan. Useful for scan operations +limited by a result set. -Bug fix: Character mapping didn't work for text files. +Honor position attribute, i.e. allow first-in-field search. To +enable this, "firstinfield 1" must be given for an index in +default.idx. Enabled in tab/default.idx for w. At this stage +first-in field is supported for phrase/and-list/or-list searches ---- 1.0b1 1998/1/29 +Common stream reader interface for record filters (struct ZebraRecStream). -Simple ranked searches now return correct number of hits. +Debian package fix: packages idzebra-2.0 + libidzebra-2.0-modules did +not depend properly on sub packages. -The test option (-s) only makes a read-lock on the index as well -as using read-only operations anywhere. +--- 2.0.0 2006/08/14 -Moved towards generic character mapping. Configuration file default.idx -specifies character map files for register types w, p, u, etc. +New record filter (record type) 'alvis' which uses XSLT transformations +to drive both indexing as well as retrieval. See example configuration in the +'example/alvis-oai' directory. -Implemented "begin variant" for the sgml.regx - filter. +'isamb' is now the default ISAM system. In Zebra 1.3, the default ISAM was +'isamc'. The type used can still be configured with the 'isam' setting +in 'zebra.cfg'. -Fixed a few memory leaks. +Index structure is now 64-bit based, also on 32 bit systems. +There are no more 2GB register file limits. -Added support for C++, headers uses extern "C" for public definitions. +Extended search result tuning. Approximate limit for terms can be enabled +and specified with attribute 11. The (approx or exact) hit count is returned +as part of the search response as in 1.3 series. The subqueryID of a search +term hit count can be specified with attribute 10. -Bug fix: The show records facility (-s) only displayed information for -the first record in a file (and not for every record in the file). - -Added option "-f " to limit the logging of record operations. After - records has been processed no logging is performed (unless errors -occur). - -Bug fix: the compressed ISAM system didn't handle update operations -correctly. - -Added setting, "maxResultSetSize", to hold the number of records to -save in a result set. - -Bug fix: Complete phrase did't work for search operations. - -Bug fix: temporary result sets weren't deleted. - -Reduced disk space for saved keys (storeKeys = 1). - -Added optional, physical ANY (key replication) - -Implemented proximity operator in search. - -Bug fix: the path name buffers used by file match traversal routines -have been extended to support long file names. - -New C(ompressed) ISAM system. To enable it, specify "isam: c" in the -configuration file. The resulting register without "storeKeys" is about -half the size, and the memory used by zebraidx during phase 2 (merge) is -reduced to a minimum. - -Reworked the way Regexp-2 queries with error tolerance are handled and -specified. The documentation has been updated accordingly. - -Bug fix: Zebrasrv didn't search correctly when queries contained masking -characters. This bug was introduced in 1.0a8. - -Zebrasrv now tag records with the proper database name. - -New settings, memMax and keyTmpDir. - -Changed name of setting lockDir (previously called lockPath) and -setTmpDir (previously called tempSetPath). - -Generalized and changed record type specifications. In short, there are: - text plain SUTRS - grs.sgml structured, "SGML-like" syntax - grs.regx. structured, Regular expression filter - grs.marc. Reads *MARC records in the ISO2709 format. - is the name of an abstract syntax file. -Bug fix: Result sets weren't sorted in operations involving boolean -operations with "ranked" operands. - ---- 1.0a8 1996/6/6 - -Added national character-handling subsystem. - -Various fixes. - -Small modifications to input filters and profiles. - -Added support for SOIF syntax (with private OID). - ---- 1.0a7 1996/5/16 - -Fixed buffer-size problem in indexing. - -Added compression to temporary files for updating. - -Added phrase registers. - -Added dynamic mapping of search attribute to multiple termlists (ANY). - -Scan support in multiple databases/registers. - -Configuration settings are case-insensitive and single dash (-) -characters are ignored in comparisons. - -The index processing ignores empty files - warning given. - -New option to zebraidx (-V) displays version information. - ---- 1.0a6 1996/2/24 - -Fixed problem in file-update system. - -Fixed problem in shadow system; register was sometimes corrupted after -a commit operation. - ---- 1.0a5 1996/2/10 - -Fixed problems in the ISAM subsystem. Caused difficulties when updating -existing registers. - -Fixed small problem in SUTRS-filter. A newline was sometimes inserted before -the rank and record number. - -Fixed bug in the isam subsystem - caused a malfunction when accessing -words which occurred more than 10000 times. - -Distribution should now include YAZ (Z39.50 protocol stack) to simplify -installation. - -Server can now run under inetd. Use option -i, and -w to -set working directory to desired location. - -New zebraidx command: clean - removes temporary shadow files. - -Fixed bug in ISAM system. Occurred rarely during register updates. - -Logging during index merge phase is improved. The remaining running -time is estimated. - -Temporary files generated by zebraidx are removed after each run. - -Bug fix: Dictionary didn't handle 8-bit characters correctly; was obvious -when doing scan operations in dictionaries with European characters. - ---- 1.0a4 1996/01/11 - -A whole slew of updates, to make the first publicized release. Get the doc -and check it out. - ---- 1.0a3 1995/12/06 - -Memory-problems in ISAM fixed. More blocktypes added to the default setup -to increase performance on larger databases. - -Various minor changes in data management system. +Zebra uses string attributes for indexing internally. Using set+numeric +use attribute can still be used. This is a search-only conversion which +inspects '*.att'-set files as indicated using attset-directives in 'zebra.cfg'. +'attset' references are no longer required, but when used they deserve +as "check" for that the index names used are also present in '*.att'. ---- 1.0a2 1995/12/05 +Zebra record filters (record type handlers) may be built as loadable +modules (.so's) on Unix. In particular the Zebra 2.0 Debian package uses +separate packages for each of them. This also means that zebra programs +such as zebraidx is no longer depending on Tcl/other.. -A couple of portability-problems resolved. +Documentation updates, especially on query structure and syntax, SRU, +XSLT support, alvis filter module, and many added examples. -Changed some malloc() to xmalloc(). +Improved logging of the 'zebrasrv' and 'zebraidx' binaries. ---- 1.0a1 1995/11/28 +Improved debian package structure. -First release. +--- 1.3.16 2004/08/16