pazpar2-moved-to-github.git
17 years agoRemoved bug with improper utf-8 query encoding.
Jakub Skoczen [Wed, 23 May 2007 09:18:10 +0000 (09:18 +0000)]
Removed bug with improper utf-8 query encoding.

17 years agouse <form accept-charset='UTF-8'
Marc Cromme [Wed, 23 May 2007 09:08:10 +0000 (09:08 +0000)]
use  <form accept-charset='UTF-8'

17 years agoprotect against a missing ICU development environment with #ifdef HAVE_ICU, so that the build...
Marc Cromme [Wed, 23 May 2007 06:42:25 +0000 (06:42 +0000)]
protect against a missing ICU development environment with #ifdef HAVE_ICU, so that the build process can continue on platforms that lack ICU
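
A minimal sketch of the guard described above, assuming HAVE_ICU is a macro defined by the configure step when the ICU headers are found. The function name normalizer_backend() and the strings it returns are hypothetical and only illustrate the pattern; they are not taken from the pazpar2 sources.

```c
/* HAVE_ICU is assumed to come from the build system (e.g. a generated
   config.h). Without it, no ICU header is touched and the file still
   compiles. */
#ifdef HAVE_ICU
#include <unicode/utypes.h>
#endif

/* Hypothetical helper: reports which normalization backend was compiled in. */
static const char *normalizer_backend(void)
{
#ifdef HAVE_ICU
    return "icu";
#else
    return "7bit";
#endif
}
```

Callers use normalizer_backend() unconditionally; only the body differs between builds, so the rest of the code needs no #ifdef of its own.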

17 years agofinished test ICU stand-alone program for benchmarking of ICU tokenization and norma...
Marc Cromme [Tue, 22 May 2007 21:20:10 +0000 (21:20 +0000)]
finished test ICU stand-alone program for benchmarking of ICU tokenization and normalization. Works quite well: benchmarking on the James English Bible from Project Gutenberg (4.5 MB of plain text consisting of 870,000 individual tokens) took 3.5 seconds on a laptop. More testing/benchmarking is needed.

17 years agostarted stand-alone ICU test
Marc Cromme [Tue, 22 May 2007 08:26:59 +0000 (08:26 +0000)]
started stand-alone ICU test

17 years agoAdded a few frees here and there
Adam Dickmeiss [Tue, 22 May 2007 07:51:45 +0000 (07:51 +0000)]
Added a few frees here and there

17 years agoICU chain XML configuration up and running, used in unit test as well.
Marc Cromme [Mon, 21 May 2007 10:14:08 +0000 (10:14 +0000)]
ICU chain XML configuration up and running, used in unit test as well.

17 years agoCounters and timers cd.
Jakub Skoczen [Mon, 21 May 2007 10:10:41 +0000 (10:10 +0000)]
Counters and timers cd.

17 years agoNo longer relevant.
Jakub Skoczen [Mon, 21 May 2007 09:23:14 +0000 (09:23 +0000)]
No longer relevant.

17 years agoCorrected timers and counters.
Jakub Skoczen [Mon, 21 May 2007 09:07:43 +0000 (09:07 +0000)]
Corrected timers and counters.

17 years agoCreating symlinks for the pz2.js during the configure process
Jakub Skoczen [Mon, 21 May 2007 08:21:33 +0000 (08:21 +0000)]
Creating symlinks for the pz2.js during the configure process

17 years agoICU chain working correctly with tokenizer, normalizer and casemap operations, and...
Marc Cromme [Sun, 20 May 2007 19:00:17 +0000 (19:00 +0000)]
ICU chain working correctly with tokenizer, normalizer and casemap operations, and data extraction in normalform, sortkey form and display form all possible. New unit test added

17 years agoInitialize ID setting earlier -- Should alleviate need for explicitly setting id
Sebastian Hammer [Fri, 18 May 2007 19:52:52 +0000 (19:52 +0000)]
Initialize ID setting earlier -- Should alleviate need for explicitly setting id

17 years agoLast touch :).
Jakub Skoczen [Fri, 18 May 2007 17:16:05 +0000 (17:16 +0000)]
Last touch :).

17 years agoAdded simple stylesheet to the example.
Jakub Skoczen [Fri, 18 May 2007 15:16:18 +0000 (15:16 +0000)]
Added simple stylesheet to the example.

17 years agoCleaning styles.
Jakub Skoczen [Fri, 18 May 2007 13:00:14 +0000 (13:00 +0000)]
Cleaning styles.

17 years agoHandling situation when the location node is empty.
Jakub Skoczen [Fri, 18 May 2007 12:38:48 +0000 (12:38 +0000)]
Handling situation when the location node is empty.

17 years agoRemoving obsolete stuff.
Jakub Skoczen [Fri, 18 May 2007 11:44:44 +0000 (11:44 +0000)]
Removing obsolete stuff.

17 years agoAdded simple example of the pz2.js usage.
Jakub Skoczen [Fri, 18 May 2007 11:36:39 +0000 (11:36 +0000)]
Added simple example of the pz2.js usage.

17 years agoTemporary fix to make the target filter work.
Jakub Skoczen [Thu, 17 May 2007 22:56:41 +0000 (22:56 +0000)]
Temporary fix to make the target filter work.

17 years agoBug in pzHttpRequest->get
Jakub Skoczen [Thu, 17 May 2007 21:00:09 +0000 (21:00 +0000)]
Bug in pzHttpRequest->get

17 years agoRemoved a bug which caused a malfunction of XMLHttpRequest in some browsers.
Jakub Skoczen [Wed, 16 May 2007 20:54:17 +0000 (20:54 +0000)]
Removed a bug which caused a malfunction of XMLHttpRequest in some browsers.

17 years agoICU chain passes directives display, norm, sort, and normalize. Directives tokenize...
Marc Cromme [Wed, 16 May 2007 19:50:01 +0000 (19:50 +0000)]
ICU chain passes directives display, norm, sort, and normalize. Directives tokenize and charmap need more work yet.

17 years agocorrected ICU normalizer functions such that unit tests run without segfault.
Marc Cromme [Wed, 16 May 2007 19:12:00 +0000 (19:12 +0000)]
corrected ICU normalizer functions such that unit tests run without segfault.
icu_buf_utf16_copy function corrected to set utf16_len right

17 years agopz:cclmap:* settings were not recognized by settings/init command
Sebastian Hammer [Wed, 16 May 2007 17:16:21 +0000 (17:16 +0000)]
pz:cclmap:* settings were not recognized by settings/init command

17 years agoUse lynx if that's an alternative to wget.
Adam Dickmeiss [Wed, 16 May 2007 13:07:18 +0000 (13:07 +0000)]
Use lynx if that's an alternative to wget.

17 years agotemporarily commented faulty transliterator test out
Marc Cromme [Wed, 16 May 2007 12:39:49 +0000 (12:39 +0000)]
temporarily commented faulty transliterator test out
progress on ICU chain test, but need to fix transliterator test first

17 years agoSkip test if wget is not found
Adam Dickmeiss [Wed, 16 May 2007 09:37:34 +0000 (09:37 +0000)]
Skip test if wget is not found

17 years agoFire test against z3950.indexdata.com/marc instead.
Adam Dickmeiss [Wed, 16 May 2007 08:31:17 +0000 (08:31 +0000)]
Fire test against z3950.indexdata.com/marc instead.

17 years agopz2.js:
Jakub Skoczen [Wed, 16 May 2007 07:53:31 +0000 (07:53 +0000)]
pz2.js:
        removed jquery dependency
        added xsl stylesheet support for detailed record view
        merged with pzQuery.js
        added pzHttpRequest class

client.js:
        updated to use the new library

17 years agoFix check for yaz-ztest. Fixed make distcheck.
Adam Dickmeiss [Tue, 15 May 2007 21:40:57 +0000 (21:40 +0000)]
Fix check for yaz-ztest. Fixed make distcheck.

17 years agoUse -l for pazpar2.
Adam Dickmeiss [Tue, 15 May 2007 21:28:36 +0000 (21:28 +0000)]
Use -l for pazpar2.

17 years agoAdded pazpar2 option -l to specify logfile. Removed usage msg
Adam Dickmeiss [Tue, 15 May 2007 21:27:55 +0000 (21:27 +0000)]
Added pazpar2 option -l to specify logfile. Removed usage msg
and removed description for no longer supported options.

17 years agoRegression test, test_http.sh, moved to sub directory test. The test
Adam Dickmeiss [Tue, 15 May 2007 15:50:47 +0000 (15:50 +0000)]
Regression test, test_http.sh, moved to sub directory test. The test
makes a session, tries stat, search and show on a local yaz-ztest.
Routine make_sessionid modified to return deterministic session ID.
If that is considered a problem, an option or configuration setting must be
added to Pazpar2 so this can be tuned.

17 years agocontinuing work on ICU chain of command pattern, not finished yet
Marc Cromme [Tue, 15 May 2007 15:11:42 +0000 (15:11 +0000)]
continuing work on ICU chain of command pattern, not finished yet

17 years agoBegin work on PP2 WS HTTP test.
Adam Dickmeiss [Tue, 15 May 2007 08:56:03 +0000 (08:56 +0000)]
Begin work on PP2 WS HTTP test.

17 years agoSimplify: use wrbuf_cstr to get a NUL-terminated string out. xfree works
Adam Dickmeiss [Tue, 15 May 2007 08:52:35 +0000 (08:52 +0000)]
Simplify: use wrbuf_cstr to get a NUL-terminated string out. xfree works
fine on a NULL ptr.
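
The simplification relies on free-style functions accepting a NULL pointer. The sketch below shows the same idea with the standard free(), which is defined as a no-op for NULL; the commit itself uses YAZ's xfree, and struct result and result_clear() are hypothetical names used only for illustration.

```c
#include <stdlib.h>

/* Hypothetical record with two optional, heap-allocated strings. */
struct result {
    char *title;
    char *author;
};

/* No "if (r->title)" checks are needed: free(NULL) does nothing. */
static void result_clear(struct result *r)
{
    free(r->title);
    free(r->author);
    r->title = NULL;
    r->author = NULL;
}
```

Resetting the members to NULL afterwards makes a second call to result_clear() harmless as well.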

17 years agoExit when address is already in use (HTTP binding).
Adam Dickmeiss [Tue, 15 May 2007 08:51:49 +0000 (08:51 +0000)]
Exit when address is already in use (HTTP binding).

17 years agoICU chain of normalizers and tokenizers half-way implemented
Marc Cromme [Mon, 14 May 2007 13:51:24 +0000 (13:51 +0000)]
ICU chain of normalizers and tokenizers half-way implemented

17 years agoMinor changes to allow logging out in the client.
Jakub Skoczen [Mon, 14 May 2007 12:57:43 +0000 (12:57 +0000)]
Minor changes to allow logging out in the client.

17 years agoinitial version of ICU chain XML config test file
Marc Cromme [Mon, 14 May 2007 10:07:48 +0000 (10:07 +0000)]
initial version of ICU chain XML config test file

17 years agoremoved dead code from this file
Marc Cromme [Mon, 14 May 2007 08:01:39 +0000 (08:01 +0000)]
removed dead code from this file

17 years agofree-ing memory to avoid memory leakage in test program
Marc Cromme [Fri, 11 May 2007 22:59:36 +0000 (22:59 +0000)]
free-ing memory to avoid memory leakage in test program

17 years agochecked in very nice ICU normalization examples
Marc Cromme [Fri, 11 May 2007 22:23:33 +0000 (22:23 +0000)]
checked in very nice ICU normalization examples

17 years agoIgnore targets with no name associated -- this is one way to eliminate
Sebastian Hammer [Fri, 11 May 2007 16:57:42 +0000 (16:57 +0000)]
Ignore targets with no name associated -- this is one way to eliminate
'ghost' targets without settings -- at least a name must be set before
a search can proceed.

17 years agoAdded icu_buf_utf8_copy() and icu_buf_utf16_copy() functions.
Marc Cromme [Fri, 11 May 2007 10:38:42 +0000 (10:38 +0000)]
Added icu_buf_utf8_copy() and icu_buf_utf16_copy() functions.
Finished wrapping ICU transliterator in new icu_normalizator object including constructor, destructor, and normalize work functions. Needs more testing, though.

17 years agoconstructor and destructor wrappers for ICU transliterator services added
Marc Cromme [Fri, 11 May 2007 09:35:50 +0000 (09:35 +0000)]
constructor and destructor wrappers for ICU transliterator services added

17 years agonon-compiling tests temporarily removed with #if 0 ... #endif
Marc Cromme [Fri, 11 May 2007 08:41:07 +0000 (08:41 +0000)]
non-compiling tests temporarily removed with #if 0  ...  #endif

17 years agoadded first examples of ICU transliterator token normalization
Marc Cromme [Fri, 11 May 2007 08:27:29 +0000 (08:27 +0000)]
added first examples of ICU transliterator token normalization

17 years agotest_icu_I18N.c
Adam Dickmeiss [Fri, 11 May 2007 06:48:32 +0000 (06:48 +0000)]
test_icu_I18N.c

17 years agostarted ICU transliterator integration for more complex normalization rules than...
Marc Cromme [Thu, 10 May 2007 12:11:42 +0000 (12:11 +0000)]
started ICU transliterator integration for more complex normalization rules than lowercasing

17 years agoDanish tokenization unit test added, counting error in tokenizer corrected
Marc Cromme [Thu, 10 May 2007 11:53:47 +0000 (11:53 +0000)]
Danish tokenization unit test added, counting error in tokenizer corrected

17 years agoFactor relevance charset normalization out to a separate implementation
Adam Dickmeiss [Thu, 10 May 2007 11:46:09 +0000 (11:46 +0000)]
Factor relevance charset normalization out to a separate implementation
in charsets.c.

17 years agofixed tokenization counting error, added more English tokenization
Marc Cromme [Thu, 10 May 2007 10:29:58 +0000 (10:29 +0000)]
fixed tokenization counting error, added more English tokenization
unit tests

17 years agoReplacing trie with linear search using linked list. The trie is
Adam Dickmeiss [Thu, 10 May 2007 09:26:19 +0000 (09:26 +0000)]
Replacing trie with linear search using linked list. The trie is
both overkill and does not handle null-terminated strings. This change
is one step towards a configurable character set system (which may
use ICU as driver).

17 years agoChanged string chop right; problem is that a pointer could point to
Adam Dickmeiss [Thu, 10 May 2007 09:24:32 +0000 (09:24 +0000)]
Changed string chop right; problem is that a pointer could point to
one element before the start of an array (only one element after is
portable).
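
A minimal sketch of a right-chop that avoids the problem, assuming the goal is to strip trailing whitespace. A loop that walks a pointer backwards with a condition such as p >= s can leave p one element before the array, which C does not permit; counting down a length never forms that pointer. The function chop_right() is hypothetical and is not the pazpar2 routine.

```c
#include <ctype.h>
#include <string.h>

/* Strip trailing whitespace in place. Only indexes in [0, len) are ever
   read, so no pointer before the start of s is computed. */
static char *chop_right(char *s)
{
    size_t len = strlen(s);

    while (len > 0 && isspace((unsigned char) s[len - 1]))
        len--;
    s[len] = '\0';
    return s;
}
```

The empty string and an all-whitespace string both end with len at 0, so the loop stops without touching anything outside the buffer.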

17 years agoICU tokenizer works now
Marc Cromme [Wed, 9 May 2007 14:01:21 +0000 (14:01 +0000)]
ICU tokenizer works now

17 years agoUpdated query handling.
Jakub Skoczen [Wed, 9 May 2007 11:54:04 +0000 (11:54 +0000)]
Updated query handling.

17 years agoremoved now superfluous experimental file. useful content moved to icu_I18N.c
Marc Cromme [Mon, 7 May 2007 13:10:00 +0000 (13:10 +0000)]
removed now superfluous experimental file. useful content moved to icu_I18N.c

17 years agoremove now unnecessary ICU bug experimenting file, useful content moved into icu_I18N.c
Marc Cromme [Mon, 7 May 2007 13:08:26 +0000 (13:08 +0000)]
remove now unnecessary ICU bug experimenting file, useful content moved into icu_I18N.c

17 years agopretty-formatted all ICU code and removed dead code sections
Marc Cromme [Mon, 7 May 2007 12:52:04 +0000 (12:52 +0000)]
pretty-formatted all ICU code and removed dead code sections

17 years agoupdated ICU casemap wrappers to use dynamic buffers, all ICU tests succeed
Marc Cromme [Mon, 7 May 2007 12:18:34 +0000 (12:18 +0000)]
updated ICU casemap wrappers to use dynamic buffers, all ICU tests succeed

17 years agomoved working ICU sorting into YAZ unittest test_icu_I18N.c
Marc Cromme [Mon, 7 May 2007 09:31:36 +0000 (09:31 +0000)]
moved working ICU sorting into YAZ unittest test_icu_I18N.c
commented casemapping out for the time being, need to integrate with new dynamic ICU buffers

17 years agoupdated error reporting to only report when strings are actually
Marc Cromme [Mon, 7 May 2007 08:42:45 +0000 (08:42 +0000)]
updated error reporting to only report when strings are actually
sorted wrongly

17 years agocorrected error handling in UErrorCode icu_utf16_from_utf8() to
Marc Cromme [Mon, 7 May 2007 08:15:34 +0000 (08:15 +0000)]
corrected error handling in UErrorCode icu_utf16_from_utf8() to
mirror the error handling in UErrorCode icu_utf16_from_utf8_cstr();

17 years agounnecessary comments and print statements removed
Marc Cromme [Mon, 7 May 2007 08:02:03 +0000 (08:02 +0000)]
unnecessary comments and print statements removed

17 years agoICU sorting works correctly now. Had forgotten to pass on the correct length of the...
Marc Cromme [Mon, 7 May 2007 07:58:31 +0000 (07:58 +0000)]
ICU sorting works correctly now. Had forgotten to pass on the correct length of the destination buffer. Dynamic destination buffer resizing works as well.
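
The pattern behind this fix (pass the real capacity of the destination buffer, then grow and retry when the reported key length does not fit) can be sketched without ICU. The example below uses strxfrm() from the C standard library as a stand-in for the ICU sort key call; struct keybuf and make_sort_key() are hypothetical names, not the pazpar2 wrappers.

```c
#include <stdlib.h>
#include <string.h>

/* Hypothetical growable destination buffer. Start with {NULL, 0}. */
struct keybuf {
    char *data;
    size_t capacity;
};

/* Build a sort key for src. Returns 0 on success, -1 if allocation fails. */
static int make_sort_key(struct keybuf *buf, const char *src)
{
    /* strxfrm reports the length it needs, excluding the terminator.
       The capacity passed must be the real size of the destination. */
    size_t needed = strxfrm(buf->data, src, buf->capacity);

    if (needed >= buf->capacity) {
        /* Key did not fit: grow to the reported size and transform again. */
        size_t new_capacity = needed + 1;
        char *grown = realloc(buf->data, new_capacity);

        if (!grown)
            return -1;
        buf->data = grown;
        buf->capacity = new_capacity;
        strxfrm(buf->data, src, buf->capacity);
    }
    return 0;
}
```

Two keys produced this way are compared with strcmp(); the caller releases buf->data with free() when done.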

17 years agobuffer stuff ok now, correct resizing
Marc Cromme [Thu, 3 May 2007 11:53:12 +0000 (11:53 +0000)]
buffer stuff ok now, correct resizing
added printout of sort keys to see what goes wrong here ..

17 years agochanged error handling, which had nasty side effects
Marc Cromme [Thu, 3 May 2007 11:35:33 +0000 (11:35 +0000)]
changed error handling, which had nasty side effects

17 years agotweaking .. using dynamically allocated buffers. Now sorting fails again, but using stati...
Marc Cromme [Thu, 3 May 2007 09:36:33 +0000 (09:36 +0000)]
tweaking .. using dynamically allocated buffers. Now sorting fails again, but using static buffers as in icu_bug.c it works ..

17 years agoMinor changes to make it easier for the server-side script to init session.
Jakub Skoczen [Wed, 2 May 2007 19:32:13 +0000 (19:32 +0000)]
Minor changes to make it easier for the server-side script to init session.

17 years agoadded ICU experiment which sorts correctly given all locales tried.
Marc Cromme [Wed, 2 May 2007 14:03:03 +0000 (14:03 +0000)]
added ICU experiment which sorts correctly given all locales tried.

17 years agotweaking, still no good results with Danish sorting
Marc Cromme [Wed, 2 May 2007 14:01:36 +0000 (14:01 +0000)]
tweaking, still no good results with Danish sorting

17 years agoAdded some more locales which fail. Something is very rotten in the kingdom of Denmar...
Marc Cromme [Tue, 1 May 2007 13:27:32 +0000 (13:27 +0000)]
Added some more locales which fail. Something is very rotten in the kingdom of Denmark! Need to find out what goes wrong ...

17 years agoAdded sorting test for ICU - only used in test_icu_I18N.c so far.
Marc Cromme [Tue, 1 May 2007 13:16:09 +0000 (13:16 +0000)]
Added sorting test for ICU - only used in test_icu_I18N.c so far.
English and German sorting tests perform fine (including German special characters), but sorting of Danish special characters fails. Very suspect. Needs more investigation! See test_icu_I18N_sortmap() in test_icu_I18N.c for details

17 years agomoved ICU helper function declarations from icu_I18N.h header file to icu_I18N.c...
Marc Cromme [Tue, 1 May 2007 08:17:05 +0000 (08:17 +0000)]
moved ICU helper function declarations from icu_I18N.h header file to icu_I18N.c source file

17 years agocleaned ICU case folding/mapping tests
Marc Cromme [Tue, 1 May 2007 08:10:26 +0000 (08:10 +0000)]
cleaned ICU case folding/mapping tests

17 years agoFixed compilation of test test_icu_I18N (syntax error).
Adam Dickmeiss [Tue, 1 May 2007 07:58:43 +0000 (07:58 +0000)]
Fixed compilation of test test_icu_I18N (syntax error).

17 years agoHandle situation where IDF becomes 0 because all records contain a term (occurs
Sebastian Hammer [Tue, 1 May 2007 05:04:53 +0000 (05:04 +0000)]
Handle situation where IDF becomes 0 because all records contain a term (occurs
frequently when records result from a search).

This actually suggests that there may be a better technique than IDF for balancing
our TF, but I'll be darned if I know what it is.
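
To see why the weight collapses, take the textbook definition of inverse document frequency. This is an assumption: the commit does not state the exact formula pazpar2 uses. Here N is the number of records in the result set and n_t the number of those records that contain term t.

```latex
\mathrm{idf}(t) = \log\frac{N}{n_t},
\qquad
n_t = N \;\Rightarrow\; \mathrm{idf}(t) = \log 1 = 0,
\qquad
\mathrm{tf}(t,d)\cdot\mathrm{idf}(t) = 0 .
```

When the records come from a search for t, nearly every record contains t, so n_t is close to N and the term contributes little or nothing to the ranking regardless of its term frequency.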

17 years agoHandle records with null-value for string sortkey
Sebastian Hammer [Tue, 1 May 2007 05:02:54 +0000 (05:02 +0000)]
Handle records with null-value for string sortkey

17 years agoAdded Paratext to demo
Sebastian Hammer [Mon, 30 Apr 2007 14:29:48 +0000 (14:29 +0000)]
Added Paratext to demo

17 years agoAdded new resources. Configuration changes
Sebastian Hammer [Mon, 30 Apr 2007 14:29:12 +0000 (14:29 +0000)]
Added new resources. Configuration changes

17 years agoVarious display changes to MK demo
Sebastian Hammer [Mon, 30 Apr 2007 14:28:09 +0000 (14:28 +0000)]
Various display changes to MK demo

17 years agochecked in test for ICU uppercase, lowercase, title and foldcase char mapping
Marc Cromme [Mon, 30 Apr 2007 13:56:52 +0000 (13:56 +0000)]
checked in test for ICU uppercase, lowercase, title and foldcase char mapping

17 years agoHardcoded motd in index.html. Cleared in onFormSubmit. This should be read from a...
Sebastian Hammer [Mon, 30 Apr 2007 03:35:57 +0000 (03:35 +0000)]
Hardcoded motd in index.html. Cleared in onFormSubmit. This should be read from a file
or something, but what?

17 years agoadded autoconf checks on ICU internationalization libs and established first test...
Marc Cromme [Fri, 27 Apr 2007 14:31:14 +0000 (14:31 +0000)]
added autoconf checks on ICU internationalization libs and established first test framework on ICU. Tests are still empty, to be added later.

17 years agoFactored char normalization code out to normalize7bit.[hc].
Marc Cromme [Fri, 27 Apr 2007 12:17:04 +0000 (12:17 +0000)]
Factored char normalization code out to normalize7bit.[hc].

Removed unnecessary tests and asserts
on service conf_metadata, as those are now
satisfied at service creation time (in config.c, in function
conf_metadata_assign()) once at startup, and not every time a new
record is to be parsed.

Removed unneeded logging when  service->merge == Metadata_merge_no .

Replaced manual record allocation with call to record constructor.

Replaced loop to find conf_metadata field_id with call to
conf_service_metadata_field_id().

Replaced manual record_metadata allocation with call to
record_metadata_create(se->nmem).

All these changes have been made with safety guards following this idiom:
#if 0
  //oldcode
#else
  //newcode
#endif
so that they can be reverted easily if need be. The #if guards will of
course disappear once we have confirmed that the code behaves exactly the
same as before.

17 years agoadded struct record_metadata * record_metadata_create(NMEM nmem)
Marc Cromme [Fri, 27 Apr 2007 11:56:41 +0000 (11:56 +0000)]
added struct record_metadata * record_metadata_create(NMEM nmem)

17 years agoin conf_metadata_assign() enforcing that merge_range is always
Marc Cromme [Fri, 27 Apr 2007 10:27:35 +0000 (10:27 +0000)]
in conf_metadata_assign() enforcing that merge_range is always
type_year _and_ enforcing that type_year is always range_merge, thus
setting constraints correctly at the beginning of a server session's setting
processing.
Removing these now unnecessary tests from ingest_record(), where
they would produce warning logs nobody can do anything about at that
time, and where they are run for each ingested record.
Less code, fewer if statements executed, no unnecessary warning logs.

17 years agoenforcing that years type metadata is always range merged
Marc Cromme [Fri, 27 Apr 2007 09:38:13 +0000 (09:38 +0000)]
enforcing that years type metadata is always range merged

17 years agostarted factoring 7 bit ascii normalizing functions out of logic.c into normalizati...
Marc Cromme [Thu, 26 Apr 2007 21:33:32 +0000 (21:33 +0000)]
started factoring 7 bit ascii normalizing functions out of logic.c into normalization7bit.[hc]
later to be replaced by ICU normalization
still many bits need to be identified and moved

17 years agoadded setting sortkey on clusters, still needs nice wrapping in cluster interfaces
Marc Cromme [Thu, 26 Apr 2007 21:31:05 +0000 (21:31 +0000)]
added setting sortkey on clusters, still needs nice wrapping in cluster interfaces

17 years agousing more of the record functions, adding record_metadata creator
Marc Cromme [Thu, 26 Apr 2007 12:12:19 +0000 (12:12 +0000)]
using more of the record functions, adding record_metadata creator

17 years agousing record creator functions in production code in ingest_record()
Marc Cromme [Thu, 26 Apr 2007 11:41:26 +0000 (11:41 +0000)]
using record creator functions in production code in ingest_record()

17 years agorenamed local variables to more expressive names
Marc Cromme [Thu, 26 Apr 2007 11:03:54 +0000 (11:03 +0000)]
renamed local variables to more expressive names

17 years agorenaming position to field_id
Marc Cromme [Thu, 26 Apr 2007 10:19:05 +0000 (10:19 +0000)]
renaming position to field_id

17 years agosimplified to one title string sorting only, still crashes
Marc Cromme [Wed, 25 Apr 2007 13:57:49 +0000 (13:57 +0000)]
simplified to one title string sorting only, still crashes

17 years agocame to the point where I can start testing sorting of records in the recordslist.
Marc Cromme [Wed, 25 Apr 2007 13:28:55 +0000 (13:28 +0000)]
came to the point where I can start testing sorting of records in the recordslist.
But there is a nasty segfault in reclist_sort(list, sort_parms) which needs to be sorted out before proceeding - probably a global parameter referenced someplace. Just checking in to save the work so far, continuing tomorrow

17 years agoadded test_reclists to CVS ignores
Marc Cromme [Wed, 25 Apr 2007 13:16:13 +0000 (13:16 +0000)]
added test_reclists to CVS ignores

17 years agoforgot to commit Makefile.am, which contains instructions for new test_reclists.c
Marc Cromme [Wed, 25 Apr 2007 13:14:46 +0000 (13:14 +0000)]
forgot to commit Makefile.am, which contains instructions for new test_reclists.c