X-Git-Url: http://git.indexdata.com/?a=blobdiff_plain;ds=sidebyside;f=doc%2Fpazpar2_conf.xml;h=57427cae7b4350530859df5539f510e63bdd53db;hb=23a2402edc299e4ec53b5deabce34fc306cbe848;hp=8aac27a81dca40fd1e28a5875b42c0a5ccae8004;hpb=99b5b8f8856a3fe9c438d49b5206cec2e8ca514a;p=pazpar2-moved-to-github.git diff --git a/doc/pazpar2_conf.xml b/doc/pazpar2_conf.xml index 8aac27a..57427ca 100644 --- a/doc/pazpar2_conf.xml +++ b/doc/pazpar2_conf.xml @@ -163,7 +163,7 @@ - service + service This nested element controls the behavior of Pazpar2 with @@ -262,13 +262,35 @@ Specifies that this element is to be used to help rank records against the user's query (when ranking is - requested). The value is an integer, used as a - multiplier against the basic TF*IDF score. A value of - 1 is the base, higher values give additional - weight to + requested). + The valus is of the form + + M [F N] + + where M is an integer, used as a + weight against the basic TF*IDF score. A value of + 1 is the base, higher values give additional weight to elements of this type. The default is '0', which excludes this element from the rank calculation. + + F is a CCL field and N is the multipler for terms + that matches those part of the CCL field in search. + The F+N combo allows the system to use a different + multipler for a certain field. For example, a rank value of + "1 au 3" gives a multipler of 3 for + all terms part of the au(thor) terms and 1 for everything else. + + + For Pazpar2 1.6.13 and later, the rank may also defined + "per-document", by the normalization stylesheet. + + + The per field rank was introduced in Pazpar2 1.6.15. Earlier + releases only allowed a rank value M (simple integer). + + See for more + about ranking. @@ -300,6 +322,11 @@ all elements), or 'no' (don't merge; this is the default); + + Pazpar 1.6.24 also offers a new value for merge, 'first', which + is like 'all' but only takes all from first database that returns + the particular metadata field. + @@ -339,6 +366,44 @@ + limitcluster + + + Allow a limit on merged metadata. The value of this attribute + is the name of actual metadata content to be used for matching + (most often same name as metadata name). + + + + Requires Pazpar2 1.6.23 or later. + + + + + + + limitmap + + + Specifies a default limitmap for this field. This is to avoid mass + configuring of targets. However it is important to review/do this on a per + target since it is usually target-specific. See limitmap for format. + + + + + + facetmap + + + Specifies a default facetmap for this field. This is to avoid mass + configuring of targets. However it is important to review/do this on a per + target since it is usually target-specific. See facetmap for format. + + + + + setting @@ -521,7 +586,138 @@ + + + ccldirective + + + Customizes the CCL parsing (interpretation of query parameter + in search). + The name and value of the CCL directive is gigen by attributes + 'name' and 'value' respectively. Refer to possible list of names + in the + + YAZ manual + . + + + + + + rank + + + Customizes the ranking (relevance) algorithm. Also known as + rank tweaks. The rank element + accepts the following attributes - all being optional: + + + + cluster + + + Attribute 'cluster' is a boolean + that controls whether Pazpar2 should boost ranking for merged + records. Is 'yes' by default. A value of 'no' will make + Pazpar2 average ranking of each record in a cluster. + + + + + debug + + + Attribute 'debug' is a boolean + that controls whether Pazpar2 should include details + about ranking for each document in the show command's + response. Enable by using value "yes", disable by using + value "no" (default). + + + + + follow + + + Attribute 'follow' is a a floating point number greater than + or equal to 0. A positive number will boost weight for terms + that occur close to each other (proximity, distance). + A value of 1, will double the weight if two terms are in + proximity distance of 1 (next to each other). The default + value of 'follow' is 0 (order will not affect weight). + + + + + lead + + + Attribute 'lead' is a floating point number. + It controls if term weight should be reduced by position + from start in a metadata field. A positive value of 'lead' + will reduce weight as it apperas further away from the lead + of the field. Default value is 0 (no reduction of weight by + position). + + + + + length + + + Attribute 'length' determines how/if term weight should be + divided by lenght of metadata field. A value of "linear" + divide by length. A value of "log" will divide by log2(length). + A value of "none" will leave term weight as is (no division). + Default value is "linear". + + + + + + Refer to to see how + these tweaks are used in computation of score. + + + Customization of ranking algorithm was introduced with + Pazpar2 1.6.18. The semantics of some of the fields changed + in versions up to 1.6.22. + + + + + sort-default + + + Specifies the default sort criteria (default 'relevance'), + which previous was hard-coded as default criteria in search. + This is a fix/work-around to avoid re-searching when using + target-based sorting. In order for this to work efficient, + the search must also have the sort critera parameter; otherwise + pazpar2 will do re-searching on search criteria changes, if + changed between search and show command. + + + This configuration was added in pazpar2 1.6.20. + + + + + settings @@ -583,7 +779,7 @@ type="year" merge="range" termlist="yes"/> - + @@ -605,7 +801,7 @@ The XML configuration may be partitioned into multiple files by using the include element which takes a single attribute, - src. The of the src attribute is + src. The src attribute is regular Shell like glob-pattern. For example, @@ -1078,6 +1274,35 @@ + pz:extendrecs + + + If a show command goes to the boundary of a result set for a + database - depends on sorting - and pz:extendrecs is set to a positive + value. then Pazpar2 wait for show to fetch pz:extendrecs more + records. This setting is best used if a database does native + sorting, because the result set otherwise may be completely + re-sorted during extended fetch. + The default value of pz:extendrecs is 0 (no extended fetch). + + + + The pz:extendrecs setting appeared in Pazpar2 version 1.6.26. + But the bahavior changed with the release of Pazpar2 1.6.29. + + + + + + pz:presentchunk + + + Controls the chunk size in present requests. Pazpar2 will + make (maxrecs / chunk) request(s). The default is 20. + + + + pz:id @@ -1114,7 +1339,7 @@ This setting enables - SRU/SOLR + SRU/Solr support. It has four possible settings. 'get', enables SRU access through GET requests. 'post' enables SRU/POST @@ -1123,7 +1348,7 @@ the protocol. - A value of 'solr' anables SOLR client support. This is supported + A value of 'solr' enables Solr client support. This is supported for Pazpar version 1.5.0 and later. @@ -1135,7 +1360,7 @@ This allows SRU version to be specified. If unset Pazpar2 will the default of YAZ (currently 1.2). Should be set - to 1.1 or 1.2. For SOLR, the current supported/tested version is 1.4 + to 1.1 or 1.2. For Solr, the current supported/tested version is 1.4 and 3.x. @@ -1220,7 +1445,6 @@ - pz:block_timeout @@ -1230,6 +1454,29 @@ + + pz:termlist_term_count + + + Specifies number of facet terms to be requested from the target. + The default is unspecified e.g. server-decided. Also see pz:facetmap. + + + + + pz:termlist_term_factor + + + Specifies whether to use a factor for pazpar2 generated facets (1) or not (0). + When mixing locallly generated (by the downloaded (pz:maxrecs) samples) + facet with native (target-generated) facets, the later will dominated the dominate the facet list + since they are generated based on the complete result set. + By scaling up the facet count using the ratio between total hit count and the sample size, + the total facet count can be approximated and thus better compared with native facets. + This is not enabled by default. + + + pz:facetmap:name @@ -1241,7 +1488,7 @@ - At this point only SOLR targets have been tested with this + At this point only Solr targets have been tested with this facility. @@ -1268,6 +1515,12 @@ by a field a metadata field (default is to use the name of the limitmap itself). + + For Pazpar2 version 1.6.23 and later the limitmap may include multiple + specifications, separated by , (comma). + For example: + ccl:title,local:ltitle,rpn:@attr 1=4. + The limitmap facility is supported for Pazpar2 version 1.6.0.