doc/odr.xml

   1  <chapter id="odr"><title>The ODR Module</title>
   2
   3   <sect1 id="odr.introduction"><title>Introduction</title>
   4
   5    <para>
   6      &odr; is the BER-encoding/decoding subsystem of &yaz;. Care as been taken
   7     to isolate &odr; from the rest of the package - specifically from the
   8     transport interface. &odr; may be used in any context where basic
   9     ASN.1/BER representations are used.
  10    </para>
  11
  12    <para>
  13     If you are only interested in writing a Z39.50 implementation based on
  14     the PDUs that are already provided with &yaz;, you only need to concern
  15     yourself with the section on managing ODR streams
  16     (<xref linkend="odr.use"/>). Only if you need to
  17     implement ASN.1 beyond that which has been provided, should you
  18     worry about the second half of the documentation
  19     (<xref linkend="odr.programming"/>).
  20     If you use one of the higher-level interfaces, you can skip this
  21     section entirely.
  22    </para>
  23
  24    <para>
  25     This is important, so we'll repeat it for emphasis: <emphasis>You do
  26      not need to read <xref linkend="odr.programming"/>
  27      to implement Z39.50 with &yaz;.</emphasis>
  28    </para>
  29
  30    <para>
  31     If you need a part of the protocol that isn't already in &yaz;, you
  32     should contact the authors before going to work on it yourself: We
  33     might already be working on it. Conversely, if you implement a useful
  34     part of the protocol before us, we'd be happy to include it in a
  35     future release.
  36    </para>
  37
  38   </sect1>
  39   <sect1 id="odr.use"><title>Using ODR</title>
  40
  41    <sect2 id="odr.streams"><title>ODR Streams</title>
  42
  43     <para>
  44      Conceptually, the ODR stream is the source of encoded data in the
  45      decoding mode; when encoding, it is the receptacle for the encoded
  46      data. Before you can use an ODR stream it must be allocated. This is
  47      done with the function
  48     </para>
  49
  50     <synopsis>
  51      ODR odr_createmem(int direction);
  52     </synopsis>
  53
  54     <para>
  55      The <function>odr_createmem()</function> function takes as argument one
  56      of three manifest constants: <literal>ODR_ENCODE</literal>,
  57      <literal>ODR_DECODE</literal>, or <literal>ODR_PRINT</literal>.
  58      An &odr; stream can be in only one mode - it is not possible to change
  59      its mode once it's selected. Typically, your program will allocate
  60      at least two ODR streams - one for decoding, and one for encoding.
  61     </para>
  62
  63     <para>
  64      When you're done with the stream, you can use
  65     </para>
  66
  67     <synopsis>
  68      void odr_destroy(ODR o);
  69     </synopsis>
  70
  71     <para>
  72      to release the resources allocated for the stream.
  73     </para>
  74    </sect2>
  75
  76    <sect2 id="odr.memory.management"><title id="memory">Memory Management</title>
  77
  78     <para>
  79      Two forms of memory management take place in the &odr; system. The first
  80      one, which has to do with allocating little bits of memory (sometimes
  81      quite large bits of memory, actually) when a protocol package is
  82      decoded, and turned into a complex of interlinked structures. This
  83      section deals with this system, and how you can use it for your own
  84      purposes. The next section deals with the memory management which is
  85      required when encoding data - to make sure that a large enough buffer is
  86      available to hold the fully encoded PDU.
  87     </para>
  88
  89     <para>
  90      The &odr; module has its own memory management system, which is
  91      used whenever memory is required. Specifically, it is used to allocate
  92      space for data when decoding incoming PDUs. You can use the memory
  93      system for your own purposes, by using the function
  94     </para>
  95
  96     <synopsis>
  97      void *odr_malloc(ODR o, size_t size);
  98     </synopsis>
  99
 100     <para>
 101      You can't use the normal <function>free(2)</function> routine to free
 102      memory allocated by this function, and &odr; doesn't provide a parallel
 103      function. Instead, you can call
 104     </para>
 105
 106     <synopsis>
 107      void odr_reset(ODR o);
 108     </synopsis>
 109
 110     <para>
 111      when you are done with the
 112      memory: Everything allocated since the last call to
 113      <function>odr_reset()</function> is released.
 114      The <function>odr_reset()</function> call is also required to clear
 115      up an error condition on a stream.
 116     </para>
 117
 118     <para>
 119      The function
 120     </para>
 121
 122     <synopsis>
 123      size_t odr_total(ODR o);
 124     </synopsis>
 125
 126     <para>
 127      returns the number of bytes allocated on the stream since the last call to
 128      <function>odr_reset()</function>.
 129     </para>
 130
 131     <para>
 132      The memory subsystem of &odr; is fairly efficient at allocating and
 133      releasing little bits of memory. Rather than managing the individual,
 134      small bits of space, the system maintains a free-list of larger chunks
 135      of memory, which are handed out in small bits. This scheme is
 136      generally known as a <emphasis>nibble memory</emphasis> system.
 137      It is very useful for maintaining short-lived constructions such
 138      as protocol PDUs.
 139     </para>
 140
 141     <para>
 142      If you want to retain a bit of memory beyond the next call to
 143      <function>odr_reset()</function>, you can use the function
 144     </para>
 145
 146     <synopsis>
 147      ODR_MEM odr_extract_mem(ODR o);
 148     </synopsis>
 149
 150     <para>
 151      This function will give you control of the memory recently allocated
 152      on the ODR stream. The memory will live (past calls to
 153      <function>odr_reset()</function>), until you call the function
 154     </para>
 155
 156     <synopsis>
 157      void odr_release_mem(ODR_MEM p);
 158     </synopsis>
 159
 160     <para>
 161      The opaque <literal>ODR_MEM</literal> handle has no other purpose than
 162      referencing the memory block for you until you want to release it.
 163     </para>
 164
 165     <para>
 166      You can use <function>odr_extract_mem()</function> repeatedly between
 167      allocating data, to retain individual control of separate chunks of data.
 168     </para>
 169
 170    </sect2>
 171    <sect2 id="odr.encoding.and.decoding"><title>Encoding and Decoding Data</title>
 172
 173     <para>
 174      When encoding data, the ODR stream will write the encoded octet string
 175      in an internal buffer. To retrieve the data, use the function
 176     </para>
 177
 178     <synopsis>
 179      char *odr_getbuf(ODR o, int *len, int *size);
 180     </synopsis>
 181
 182     <para>
 183      The integer pointed to by len is set to the length of the encoded
 184      data, and a pointer to that data is returned. <literal>*size</literal>
 185      is set to the size of the buffer (unless <literal>size</literal> is null,
 186      signaling that you are not interested in the size). The next call to
 187      a primitive function using the same &odr; stream will overwrite the
 188      data, unless a different buffer has been supplied using the call
 189     </para>
 190
 191     <synopsis>
 192      void odr_setbuf(ODR o, char *buf, int len, int can_grow);
 193     </synopsis>
 194
 195     <para>
 196      which sets the encoding (or decoding) buffer used by
 197      <literal>o</literal> to <literal>buf</literal>, using the length
 198      <literal>len</literal>.
 199      Before a call to an encoding function, you can use
 200      <function>odr_setbuf()</function> to provide the stream with an encoding
 201      buffer of sufficient size (length). The <literal>can_grow</literal>
 202      parameter tells the encoding &odr; stream whether it is allowed to use
 203      <function>realloc(2)</function> to increase the size of the buffer when
 204      necessary. The default condition of a new encoding stream is equivalent
 205      to the results of calling
 206     </para>
 207
 208     <synopsis>
 209      odr_setbuf(stream, 0, 0, 1);
 210     </synopsis>
 211
 212     <para>
 213      In this case, the stream will allocate and reallocate memory as
 214      necessary. The stream reallocates memory by repeatedly doubling the
 215      size of the buffer - the result is that the buffer will typically
 216      reach its maximum, working size with only a small number of reallocation
 217      operations. The memory is freed by the stream when the latter is destroyed,
 218      unless it was assigned by the user with the <literal>can_grow</literal>
 219      parameter set to zero (in this case, you are expected to retain
 220      control of the memory yourself).
 221     </para>
 222
 223     <para>
 224      To assume full control of an encoded buffer, you must first call
 225      <function>odr_getbuf()</function> to fetch the buffer and its length.
 226      Next, you should call <function>odr_setbuf()</function> to provide a
 227      different buffer (or a null pointer) to the stream. In the simplest
 228      case, you will reuse the same buffer over and over again, and you
 229      will just need to call <function>odr_getbuf()</function> after each
 230      encoding operation to get the length and address of the buffer.
 231      Note that the stream may reallocate the buffer during an encoding
 232      operation, so it is necessary to retrieve the correct address after
 233      each encoding operation.
 234     </para>
 235
 236     <para>
 237      It is important to realize that the ODR stream will not release this
 238      memory when you call <function>odr_reset()</function>: It will
 239      merely update its internal pointers to prepare for the encoding of a
 240      new data value.
 241      When the stream is released by the <function>odr_destroy()</function>
 242      function, the memory given to it by <function>odr_setbuf</function> will
 243      be released <emphasis>only</emphasis> if the <literal>can_grow</literal>
 244      parameter to <function>odr_setbuf()</function> was nonzero. The
 245      <literal>can_grow</literal> parameter, in other words, is a way of
 246      signaling who is to own the buffer, you or the ODR stream. If you never call
 247      <function>odr_setbuf()</function> on your encoding stream, which is
 248      typically the case, the buffer allocated by the stream will belong to
 249      the stream by default.
 250     </para>
 251
 252     <para>
 253      When you wish to decode data, you should first call
 254      <function>odr_setbuf()</function>, to tell the decoding stream
 255      where to find the encoded data, and how long the buffer is
 256      (the <literal>can_grow</literal> parameter is ignored by a decoding
 257      stream). After this, you can call the function corresponding to the
 258      data you wish to decode (eg, <function>odr_integer()</function> odr
 259      <function>z_APDU()</function>).
 260     </para>
 261
 262     <example id="example.odr.encoding.and.decoding.functions">
 263      <title>Encoding and decoding functions</title>
 264      <synopsis>
 265       int odr_integer(ODR o, Odr_int **p, int optional, const char *name);
 266
 267       int z_APDU(ODR o, Z_APDU **p, int optional, const char *name);
 268      </synopsis>
 269     </example>
 270
 271     <para>
 272      If the data is absent (or doesn't match the tag corresponding to
 273      the type), the return value will be either 0 or 1 depending on the
 274      <literal>optional</literal> flag. If <literal>optional</literal>
 275      is 0 and the data is absent, an error flag will be raised in the
 276      stream, and you'll need to call <function>odr_reset()</function> before
 277      you can use the stream again. If <literal>optional</literal> is
 278      nonzero, the pointer <emphasis>pointed</emphasis> to/ by
 279      <literal>p</literal> will be set to the null value, and the function
 280      will return 1.
 281      The <literal>name</literal> argument is used to pretty-print the
 282      tag in question. It may be set to <literal>NULL</literal> if
 283      pretty-printing is not desired.
 284     </para>
 285
 286     <para>
 287      If the data value is found where it's expected, the pointer
 288      <emphasis>pointed to</emphasis> by the <literal>p</literal> argument
 289      will be set to point to the decoded type.
 290      The space for the type will be allocated and owned by the &odr;
 291      stream, and it will live until you call
 292      <function>odr_reset()</function> on the stream. You cannot use
 293      <function>free(2)</function> to release the memory.
 294      You can decode several data elements (by repeated calls to
 295      <function>odr_setbuf()</function> and your decoding function), and
 296      new memory will be allocated each time. When you do call
 297      <function>odr_reset()</function>, everything decoded since the
 298      last call to <function>odr_reset()</function> will be released.
 299     </para>
 300
 301     <example id="example.odr.encoding.of.integer">
 302      <title>Encoding and decoding of an integer</title>
 303      <para>
 304       The use of the double indirection can be a little confusing at first
 305       (its purpose will become clear later on, hopefully),
 306       so an example is in order. We'll encode an integer value, and
 307       immediately decode it again using a different stream. A useless, but
 308       informative operation.
 309      </para>
 310      <programlisting><![CDATA[
 311 void do_nothing_useful(Odr_int value)
 312 {
 313     ODR encode, decode;
 314     Odr_int *valp, *resvalp;
 315     char *bufferp;
 316     int len;
 317
 318     /* allocate streams */
 319     if (!(encode = odr_createmem(ODR_ENCODE)))
 320         return;
 321     if (!(decode = odr_createmem(ODR_DECODE)))
 322         return;
 323
 324     valp = &value;
 325     if (odr_integer(encode, &valp, 0, 0) == 0)
 326     {
 327         printf("encoding went bad\n");
 328         return;
 329     }
 330     bufferp = odr_getbuf(encode, &len, 0);
 331     printf("length of encoded data is %d\n", len);
 332
 333     /* now let's decode the thing again */
 334     odr_setbuf(decode, bufferp, len, 0);
 335     if (odr_integer(decode, &resvalp, 0, 0) == 0)
 336     {
 337         printf("decoding went bad\n");
 338         return;
 339     }
 340     /* ODR_INT_PRINTF format for printf (such as %d) */
 341     printf("the value is " ODR_INT_PRINTF "\n", *resvalp);
 342
 343     /* clean up */
 344     odr_destroy(encode);
 345     odr_destroy(decode);
 346 }
 347 ]]>
 348      </programlisting>
 349      <para>
 350       This looks like a lot of work, offhand. In practice, the &odr; streams
 351       will typically be allocated once, in the beginning of your program
 352       (or at the beginning of a new network session), and the encoding
 353       and decoding will only take place in a few, isolated places in your
 354       program, so the overhead is quite manageable.
 355      </para>
 356     </example>
 357
 358    </sect2>
 359
 360    <sect2 id="odr.printing"><title>Printing</title>
 361     <para>
 362      When an ODR stream is created of type <literal>ODR_PRINT</literal>
 363      the ODR module will print the contents of a PDU in a readable format.
 364      By default output is written to the <literal>stderr</literal> stream.
 365      This behavior can be changed, however, by calling the function
 366      <synopsis>
 367       odr_setprint(ODR o, FILE *file);
 368      </synopsis>
 369      before encoders or decoders are being invoked.
 370      It is also possible to direct the output to a buffer (of indeed
 371      another file), by using the more generic mechanism:
 372      <synopsis>
 373       void odr_set_stream(ODR o, void *handle,
 374                          void (*stream_write)(ODR o, void *handle, int type,
 375                                               const char *buf, int len),
 376                          void (*stream_close)(void *handle));
 377      </synopsis>
 378      Here the user provides an opaque handle and two handlers,
 379      <replaceable>stream_write</replaceable> for writing,
 380      and <replaceable>stream_close</replaceable> which is supposed
 381      to close/free resources associated with handle.
 382      The <replaceable>stream_close</replaceable> handler is optional and
 383      if NULL for the function is provided, it will not be invoked.
 384      The <replaceable>stream_write</replaceable> takes the ODR handle
 385      as parameter, the user defined handle, a type
 386      <literal>ODR_OCTETSTRING</literal>, <literal>ODR_VISIBLESTRING</literal>
 387      which indicates the type of contents is being written.
 388     </para>
 389     <para>
 390      Another utility useful for diagnostics (error handling) or as
 391      part of the printing facilities is:
 392      <synopsis>
 393       const char **odr_get_element_path(ODR o);
 394      </synopsis>
 395      which returns a list of current elements that ODR deals with at the
 396      moment. For the returned array, say <literal>ar</literal>,
 397      <literal>ar[0]</literal> is the top level element,
 398      <literal>ar[n]</literal> is the last. The last element has the
 399      property that <literal>ar[n+1] == NULL</literal>.
 400     </para>
 401     <example id="example.odr.element.path.record">
 402      <title>Element Path for record</title>
 403      <para>
 404       For a database record part of a PresentResponse the
 405       array returned by <function>odr_get_element</function>
 406       is <literal>presentResponse</literal>, <literal>databaseOrSurDiagnostics</literal>, <literal>?</literal>, <literal>record</literal>, <literal>?</literal>, <literal>databaseRecord</literal> . The question mark appears due to
 407       unnamed constructions.
 408      </para>
 409      </example>
 410    </sect2>
 411    <sect2 id="odr.diagnostics"><title>Diagnostics</title>
 412
 413     <para>
 414      The encoding/decoding functions all return 0 when an error occurs.
 415      Until you call <function>odr_reset()</function>, you cannot use the
 416      stream again, and any function called will immediately return 0.
 417     </para>
 418
 419     <para>
 420      To provide information to the programmer or administrator, the function
 421     </para>
 422
 423     <synopsis>
 424      void odr_perror(ODR o, char *message);
 425     </synopsis>
 426
 427     <para>
 428      is provided, which prints the <literal>message</literal> argument to
 429      <literal>stderr</literal> along with an error message from the stream.
 430     </para>
 431
 432     <para>
 433      You can also use the function
 434     </para>
 435
 436     <synopsis>
 437      int odr_geterror(ODR o);
 438     </synopsis>
 439
 440     <para>
 441      to get the current error number from the screen. The number will be
 442      one of these constants:
 443     </para>
 444
 445     <table frame="top" id="odr.error.codes">
 446      <title>ODR Error codes</title>
 447      <tgroup cols="2">
 448       <thead>
 449        <row>
 450         <entry>code</entry>
 451         <entry>Description</entry>
 452        </row>
 453       </thead>
 454       <tbody>
 455        <row>
 456         <entry>OMEMORY</entry><entry>Memory allocation failed.</entry>
 457        </row>
 458
 459        <row>
 460         <entry>OSYSERR</entry><entry>A system- or library call has failed.
 461          The standard diagnostic variable <literal>errno</literal> should be
 462          examined to determine the actual error.</entry>
 463        </row>
 464
 465        <row>
 466         <entry>OSPACE</entry><entry>No more space for encoding.
 467          This will only occur when the user has explicitly provided a
 468          buffer for an encoding stream without allowing the system to
 469          allocate more space.</entry>
 470        </row>
 471
 472        <row>
 473         <entry>OREQUIRED</entry><entry>This is a common protocol error; A
 474          required data element was missing during encoding or decoding.</entry>
 475        </row>
 476
 477        <row>
 478         <entry>OUNEXPECTED</entry><entry>An unexpected data element was
 479          found during decoding.</entry>
 480        </row>
 481
 482        <row><entry>OOTHER</entry><entry>Other error. This is typically an
 483          indication of misuse of the &odr; system by the programmer, and also
 484          that the diagnostic system isn't as good as it should be, yet.</entry>
 485        </row>
 486       </tbody>
 487      </tgroup>
 488     </table>
 489
 490     <para>
 491      The character string array
 492     </para>
 493
 494     <synopsis>
 495      char *odr_errlist[]
 496     </synopsis>
 497
 498     <para>
 499      can be indexed by the error code to obtain a human-readable
 500      representation of the problem.
 501     </para>
 502
 503    </sect2>
 504    <sect2 id="odr.summary.and.synopsis">
 505     <title>Summary and Synopsis</title>
 506
 507     <synopsis>
 508      #include &lt;yaz/odr.h>
 509
 510      ODR odr_createmem(int direction);
 511
 512      void odr_destroy(ODR o);
 513
 514      void odr_reset(ODR o);
 515
 516      char *odr_getbuf(ODR o, int *len, int *size);
 517
 518      void odr_setbuf(ODR o, char *buf, int len, int can_grow);
 519
 520      void *odr_malloc(ODR o, int size);
 521
 522      NMEM odr_extract_mem(ODR o);
 523
 524      int odr_geterror(ODR o);
 525
 526      void odr_perror(ODR o, const char *message);
 527
 528      extern char *odr_errlist[];
 529     </synopsis>
 530
 531    </sect2>
 532   </sect1>
 533
 534   <sect1 id="odr.programming"><title>Programming with ODR</title>
 535
 536    <para>
 537     The API of &odr; is designed to reflect the structure of ASN.1, rather
 538     than BER itself. Future releases may be able to represent data in
 539     other external forms.
 540    </para>
 541
 542    <tip>
 543     <para>
 544      There is an ASN.1 tutorial available at
 545      <ulink url="&url.asn.1.tutorial;">this site</ulink>.
 546      This site also has standards for ASN.1 (X.680) and BER (X.690)
 547      <ulink url="&url.asn.1.standards;">online</ulink>.
 548     </para>
 549    </tip>
 550
 551    <para>
 552     The ODR interface is based loosely on that of the Sun Microsystems
 553     XDR routines.
 554     Specifically, each function which corresponds to an ASN.1 primitive
 555     type has a dual function. Depending on the settings of the ODR
 556     stream which is supplied as a parameter, the function may be used
 557     either to encode or decode data. The functions that can be built
 558     using these primitive functions, to represent more complex data types,
 559     share this quality. The result is that you only have to enter the
 560     definition for a type once - and you have the functionality of encoding,
 561     decoding (and pretty-printing) all in one unit.
 562     The resulting C source code is quite compact, and is a pretty
 563     straightforward representation of the source ASN.1 specification.
 564    </para>
 565
 566    <para>
 567     In many cases, the model of the XDR functions works quite well in this
 568     role.
 569     In others, it is less elegant. Most of the hassle comes from the optional
 570     SEQUENCE members which don't exist in XDR.
 571    </para>
 572
 573    <sect2 id="odr.primitive.asn1.types">
 574     <title>The Primitive ASN.1 Types</title>
 575
 576     <para>
 577      ASN.1 defines a number of primitive types (many of which correspond
 578      roughly to primitive types in structured programming languages, such as C).
 579     </para>
 580
 581     <sect3 id="odr.integer"><title>INTEGER</title>
 582
 583      <para>
 584       The &odr; function for encoding or decoding (or printing) the ASN.1
 585       INTEGER type looks like this:
 586      </para>
 587
 588      <synopsis>
 589       int odr_integer(ODR o, Odr_int **p, int optional, const char *name);
 590      </synopsis>
 591
 592      <para>
 593       The <literal>Odr_int</literal> is just a simple integer.
 594      </para>
 595
 596      <para>
 597       This form is typical of the primitive &odr; functions. They are named
 598       after the type of data that they encode or decode. They take an &odr;
 599       stream, an indirect reference to the type in question, and an
 600       <literal>optional</literal> flag (corresponding to the OPTIONAL keyword
 601       of ASN.1) as parameters. They all return an integer value of either one
 602       or zero.
 603       When you use the primitive functions to construct encoders for complex
 604       types of your own, you should follow this model as well. This
 605       ensures that your new types can be reused as elements in yet more
 606       complex types.
 607      </para>
 608
 609      <para>
 610       The <literal>o</literal> parameter should obviously refer to a properly
 611       initialized &odr; stream of the right type (encoding/decoding/printing)
 612       for the operation that you wish to perform.
 613      </para>
 614
 615      <para>
 616       When encoding or printing, the function first looks at
 617       <literal>* p</literal>. If <literal>* p</literal> (the pointer pointed
 618       to by <literal>p</literal>) is a null pointer, this is taken to mean that
 619       the data element is absent. If the <literal>optional</literal> parameter
 620       is nonzero, the function will return one (signifying success) without
 621       any further processing. If the <literal>optional</literal> is zero, an
 622       internal error flag is set in the &odr; stream, and the function will
 623       return 0. No further operations can be carried out on the stream without
 624       a call to the function <function>odr_reset()</function>.
 625      </para>
 626
 627      <para>
 628       If <literal>*p</literal> is not a null pointer, it is expected to
 629       point to an instance of the data type. The data will be subjected to
 630       the encoding rules, and the result will be placed in the buffer held
 631       by the &odr; stream.
 632      </para>
 633
 634      <para>
 635       The other ASN.1 primitives have similar functions that operate in
 636       similar manners:
 637      </para>
 638     </sect3>
 639     <sect3 id="odr.boolean"><title>BOOLEAN</title>
 640
 641      <synopsis>
 642 int odr_bool(ODR o, Odr_bool **p, int optional, const char *name);
 643      </synopsis>
 644
 645     </sect3>
 646     <sect3 id="odr.real"><title>REAL</title>
 647
 648      <para>
 649       Not defined.
 650      </para>
 651
 652     </sect3>
 653     <sect3 id="odr.null"><title>NULL</title>
 654
 655      <synopsis>
 656 int odr_null(ODR o, Odr_null **p, int optional, const char *name);
 657      </synopsis>
 658
 659      <para>
 660       In this case, the value of **p is not important. If <literal>*p</literal>
 661       is different from the null pointer, the null value is present, otherwise
 662       it's absent.
 663      </para>
 664
 665     </sect3>
 666     <sect3 id="odr.octet.string"><title>OCTET STRING</title>
 667
 668      <synopsis>
 669 typedef struct odr_oct
 670 {
 671     unsigned char *buf;
 672     int len;
 673     int size;
 674 } Odr_oct;
 675
 676 int odr_octetstring(ODR o, Odr_oct **p, int optional,
 677                     const char *name);
 678      </synopsis>
 679
 680      <para>
 681       The <literal>buf</literal> field should point to the character array
 682       that holds the octetstring. The <literal>len</literal> field holds the
 683       actual length, while the <literal>size</literal> field gives the size
 684       of the allocated array (not of interest to you, in most cases).
 685       The character array need not be null terminated.
 686      </para>
 687
 688      <para>
 689       To make things a little easier, an alternative is given for string
 690       types that are not expected to contain embedded NULL characters (eg.
 691       VisibleString):
 692      </para>
 693
 694      <synopsis>
 695       int odr_cstring(ODR o, char **p, int optional, const char *name);
 696      </synopsis>
 697
 698      <para>
 699       Which encoded or decodes between OCTETSTRING representations and
 700       null-terminates C strings.
 701      </para>
 702
 703      <para>
 704       Functions are provided for the derived string types, eg:
 705      </para>
 706
 707      <synopsis>
 708 int odr_visiblestring(ODR o, char **p, int optional,
 709                       const char *name);
 710      </synopsis>
 711
 712     </sect3>
 713     <sect3 id="odr.bit.string"><title>BIT STRING</title>
 714
 715      <synopsis>
 716 int odr_bitstring(ODR o, Odr_bitmask **p, int optional,
 717                   const char *name);
 718      </synopsis>
 719
 720      <para>
 721       The opaque type <literal>Odr_bitmask</literal> is only suitable for
 722       holding relatively brief bit strings, eg. for options fields, etc.
 723       The constant <literal>ODR_BITMASK_SIZE</literal> multiplied by 8
 724       gives the maximum possible number of bits.
 725      </para>
 726
 727      <para>
 728       A set of macros are provided for manipulating the
 729       <literal>Odr_bitmask</literal> type:
 730      </para>
 731
 732      <synopsis>
 733 void ODR_MASK_ZERO(Odr_bitmask *b);
 734
 735 void ODR_MASK_SET(Odr_bitmask *b, int bitno);
 736
 737 void ODR_MASK_CLEAR(Odr_bitmask *b, int bitno);
 738
 739 int ODR_MASK_GET(Odr_bitmask *b, int bitno);
 740      </synopsis>
 741
 742      <para>
 743       The functions are modeled after the manipulation functions that
 744       accompany the <literal>fd_set</literal> type used by the
 745       <function>select(2)</function> call.
 746       <literal>ODR_MASK_ZERO</literal> should always be called first on a
 747       new bitmask, to initialize the bits to zero.
 748      </para>
 749     </sect3>
 750
 751     <sect3 id="odr.object.identifier"><title>OBJECT IDENTIFIER</title>
 752
 753      <synopsis>
 754 int odr_oid(ODR o, Odr_oid **p, int optional, const char *name);
 755      </synopsis>
 756
 757      <para>
 758       The C OID representation is simply an array of integers, terminated by
 759       the value -1 (the <literal>Odr_oid</literal> type is synonymous with
 760       the <literal>short</literal> type).
 761       We suggest that you use the OID database module (see
 762       <xref linkend="tools.oid.database"/>) to handle object identifiers
 763       in your application.
 764      </para>
 765
 766     </sect3>
 767    </sect2>
 768    <sect2 id="odr.tagging.primitive.types"><title>Tagging Primitive Types</title> <!-- tag.prim -->
 769
 770     <para>
 771      The simplest way of tagging a type is to use the
 772      <function>odr_implicit_tag()</function> or
 773      <function>odr_explicit_tag()</function> macros:
 774     </para>
 775
 776     <synopsis>
 777 int odr_implicit_tag(ODR o, Odr_fun fun, int class, int tag,
 778                      int optional, const char *name);
 779
 780 int odr_explicit_tag(ODR o, Odr_fun fun, int class, int tag,
 781                      int optional, const char *name);
 782     </synopsis>
 783
 784     <para>
 785      To create a type derived from the integer type by implicit tagging, you
 786      might write:
 787     </para>
 788
 789     <screen>
 790      MyInt ::= [210] IMPLICIT INTEGER
 791     </screen>
 792
 793     <para>
 794      In the &odr; system, this would be written like:
 795     </para>
 796
 797     <screen>
 798 int myInt(ODR o, Odr_int **p, int optional, const char *name)
 799 {
 800     return odr_implicit_tag(o, odr_integer, p,
 801                             ODR_CONTEXT, 210, optional, name);
 802 }
 803     </screen>
 804
 805     <para>
 806      The function <function>myInt()</function> can then be used like any of
 807      the primitive functions provided by &odr;. Note that the behavior of
 808      <function>odr_explicit_tag()</function>
 809      and <function>odr_implicit_tag()</function> macros
 810      act exactly the same as the functions they are applied to - they
 811      respond to error conditions, etc, in the same manner - they
 812      simply have three extra parameters. The class parameter may
 813      take one of the values: <literal>ODR_CONTEXT</literal>,
 814      <literal>ODR_PRIVATE</literal>, <literal>ODR_UNIVERSAL</literal>, or
 815      <literal>/ODR_APPLICATION</literal>.
 816     </para>
 817
 818    </sect2>
 819    <sect2 id="odr.constructed.types"><title>Constructed Types</title>
 820
 821     <para>
 822      Constructed types are created by combining primitive types. The
 823       &odr; system only implements the SEQUENCE and SEQUENCE OF constructions
 824      (although adding the rest of the container types should be simple
 825      enough, if the need arises).
 826     </para>
 827
 828     <para>
 829      For implementing SEQUENCEs, the functions
 830     </para>
 831
 832     <synopsis>
 833 int odr_sequence_begin(ODR o, void *p, int size, const char *name);
 834 int odr_sequence_end(ODR o);
 835     </synopsis>
 836
 837     <para>
 838      are provided.
 839     </para>
 840
 841     <para>
 842      The <function>odr_sequence_begin()</function> function should be
 843      called in the beginning of a function that implements a SEQUENCE type.
 844      Its parameters are the &odr; stream, a pointer (to a pointer to the type
 845      you're implementing), and the <literal>size</literal> of the type
 846      (typically a C structure). On encoding, it returns 1 if
 847      <literal>* p</literal> is a null pointer. The <literal>size</literal>
 848      parameter is ignored. On decoding, it returns 1 if the type is found in
 849      the data stream. <literal>size</literal> bytes of memory are allocated,
 850      and <literal>*p</literal> is set to point to this space.
 851      <function>odr_sequence_end()</function> is called at the end of the
 852      complex function. Assume that a type is defined like this:
 853     </para>
 854
 855     <screen>
 856 MySequence ::= SEQUENCE {
 857      intval INTEGER,
 858      boolval BOOLEAN OPTIONAL
 859 }
 860     </screen>
 861
 862     <para>
 863      The corresponding &odr; encoder/decoder function and the associated data
 864      structures could be written like this:
 865     </para>
 866
 867     <screen>
 868 typedef struct MySequence
 869 {
 870     Odr_int *intval;
 871     Odr_bool *boolval;
 872 } MySequence;
 873
 874 int mySequence(ODR o, MySequence **p, int optional, const char *name)
 875 {
 876     if (odr_sequence_begin(o, p, sizeof(**p), name) == 0)
 877         return optional &amp;&amp; odr_ok(o);
 878     return
 879         odr_integer(o, &amp;(*p)->intval, 0, "intval") &amp;&amp;
 880         odr_bool(o, &amp;(*p)->boolval, 1, "boolval") &amp;&amp;
 881         odr_sequence_end(o);
 882 }
 883
 884     </screen>
 885
 886     <para>
 887      Note the 1 in the call to <function>odr_bool()</function>, to mark
 888      that the sequence member is optional.
 889      If either of the member types had been tagged, the macros
 890      <function>odr_implicit_tag()</function> or
 891      <function>odr_explicit_tag()</function>
 892      could have been used.
 893      The new function can be used exactly like the standard functions provided
 894      with &odr;. It will encode, decode or pretty-print a data value of the
 895      <literal>MySequence</literal> type. We like to name types with an
 896      initial capital, as done in ASN.1 definitions, and to name the
 897      corresponding function with the first character of the name in lower case.
 898      You could, of course, name your structures, types, and functions any way
 899      you please - as long as you're consistent, and your code is easily readable.
 900      <literal>odr_ok</literal> is just that - a predicate that returns the
 901      state of the stream. It is used to ensure that the behavior of the new
 902      type is compatible with the interface of the primitive types.
 903     </para>
 904
 905    </sect2>
 906    <sect2 id="odr.tagging.constructed.types">
 907     <title>Tagging Constructed Types</title>
 908
 909     <note>
 910      <para>
 911       See <xref linkend="odr.tagging.primitive.types"/> for information on how to tag
 912       the primitive types, as well as types that are already defined.
 913      </para>
 914     </note>
 915
 916     <sect3 id="odr.implicit.tagging">
 917      <title>Implicit Tagging</title>
 918
 919      <para>
 920       Assume the type above had been defined as
 921      </para>
 922
 923      <screen>
 924 MySequence ::= [10] IMPLICIT SEQUENCE {
 925       intval INTEGER,
 926       boolval BOOLEAN OPTIONAL
 927 }
 928      </screen>
 929
 930      <para>
 931       You would implement this in &odr; by calling the function
 932      </para>
 933
 934      <synopsis>
 935 int odr_implicit_settag(ODR o, int class, int tag);
 936      </synopsis>
 937
 938      <para>
 939       which overrides the tag of the type immediately following it. The
 940       macro <function>odr_implicit_tag()</function> works by calling
 941       <function>odr_implicit_settag()</function> immediately
 942       before calling the function pointer argument.
 943       Your type function could look like this:
 944      </para>
 945
 946      <screen>
 947 int mySequence(ODR o, MySequence **p, int optional, const char *name)
 948 {
 949     if (odr_implicit_settag(o, ODR_CONTEXT, 10) == 0 ||
 950         odr_sequence_begin(o, p, sizeof(**p), name) == 0)
 951         return optional &amp;&amp; odr_ok(o);
 952     return
 953         odr_integer(o, &amp;(*p)->intval, 0, "intval") &amp;&amp;
 954         odr_bool(o, &amp;(*p)->boolval, 1, "boolval") &amp;&amp;
 955         odr_sequence_end(o);
 956 }
 957      </screen>
 958
 959      <para>
 960       The definition of the structure <literal>MySequence</literal> would be
 961       the same.
 962      </para>
 963     </sect3>
 964
 965     <sect3 id="odr.explicit.tagging"><title>Explicit Tagging</title>
 966
 967      <para>
 968       Explicit tagging of constructed types is a little more complicated,
 969       since you are in effect adding a level of construction to the data.
 970      </para>
 971
 972      <para>
 973       Assume the definition:
 974      </para>
 975
 976      <screen>
 977 MySequence ::= [10] IMPLICIT SEQUENCE {
 978    intval INTEGER,
 979    boolval BOOLEAN OPTIONAL
 980 }
 981      </screen>
 982
 983      <para>
 984       Since the new type has an extra level of construction, two new functions
 985       are needed to encapsulate the base type:
 986      </para>
 987
 988      <synopsis>
 989 int odr_constructed_begin(ODR o, void *p, int class, int tag,
 990                           const char *name);
 991
 992 int odr_constructed_end(ODR o);
 993      </synopsis>
 994
 995      <para>
 996       Assume that the IMPLICIT in the type definition above were replaced
 997       with EXPLICIT (or that the IMPLICIT keyword were simply deleted, which
 998       would be equivalent). The structure definition would look the same,
 999       but the function would look like this:
1000      </para>
1001
1002      <screen>
1003 int mySequence(ODR o, MySequence **p, int optional, const char *name)
1004 {
1005     if (odr_constructed_begin(o, p, ODR_CONTEXT, 10, name) == 0)
1006         return optional &amp;&amp; odr_ok(o);
1007     if (o->direction == ODR_DECODE)
1008         *p = odr_malloc(o, sizeof(**p));
1009     if (odr_sequence_begin(o, p, sizeof(**p), 0) == 0)
1010     {
1011         *p = 0; /* this is almost certainly a protocol error */
1012         return 0;
1013     }
1014     return
1015         odr_integer(o, &amp;(*p)->intval, 0, "intval") &amp;&amp;
1016         odr_bool(o, &amp;(*p)->boolval, 1, "boolval") &amp;&amp;
1017         odr_sequence_end(o) &amp;&amp;
1018         odr_constructed_end(o);
1019 }
1020      </screen>
1021
1022      <para>
1023       Notice that the interface here gets kind of nasty. The reason is
1024       simple: Explicitly tagged, constructed types are fairly rare in
1025       the protocols that we care about, so the
1026       esthetic annoyance (not to mention the dangers of a cluttered
1027       interface) is less than the time that would be required to develop a
1028       better interface. Nevertheless, it is far from satisfying, and it's a
1029       point that will be worked on in the future. One option for you would
1030       be to simply apply the <function>odr_explicit_tag()</function> macro to
1031       the first function, and not
1032       have to worry about <function>odr_constructed_*</function> yourself.
1033       Incidentally, as you might have guessed, the
1034       <function>odr_sequence_</function> functions are themselves
1035       implemented using the <function>/odr_constructed_</function> functions.
1036      </para>
1037
1038     </sect3>
1039    </sect2>
1040    <sect2 id="odr.sequence.of"><title>SEQUENCE OF</title>
1041
1042     <para>
1043      To handle sequences (arrays) of a specific type, the function
1044     </para>
1045
1046     <synopsis>
1047 int odr_sequence_of(ODR o, int (*fun)(ODR o, void *p, int optional),
1048                     void *p, int *num, const char *name);
1049     </synopsis>
1050
1051     <para>
1052      The <literal>fun</literal> parameter is a pointer to the decoder/encoder
1053      function of the type. <literal>p</literal> is a pointer to an array of
1054      pointers to your type. <literal>num</literal> is the number of elements
1055      in the array.
1056     </para>
1057
1058     <para>
1059      Assume a type
1060     </para>
1061
1062     <screen>
1063 MyArray ::= SEQUENCE OF INTEGER
1064     </screen>
1065
1066     <para>
1067      The C representation might be
1068     </para>
1069
1070     <screen>
1071 typedef struct MyArray
1072 {
1073     int num_elements;
1074     Odr_int **elements;
1075 } MyArray;
1076     </screen>
1077
1078     <para>
1079      And the function might look like
1080     </para>
1081
1082     <screen>
1083 int myArray(ODR o, MyArray **p, int optional, const char *name)
1084 {
1085     if (o->direction == ODR_DECODE)
1086         *p = odr_malloc(o, sizeof(**p));
1087     if (odr_sequence_of(o, odr_integer, &amp;(*p)->elements,
1088         &amp;(*p)->num_elements, name))
1089         return 1;
1090     *p = 0;
1091         return optional &amp;&amp; odr_ok(o);
1092 }
1093     </screen>
1094
1095    </sect2>
1096    <sect2 id="odr.choice.types"><title>CHOICE Types</title>
1097
1098     <para>
1099      The choice type is used fairly often in some ASN.1 definitions, so
1100      some work has gone into streamlining its interface.
1101     </para>
1102
1103     <para>
1104      CHOICE types are handled by the function:
1105     </para>
1106
1107     <synopsis>
1108 int odr_choice(ODR o, Odr_arm arm[], void *p, void *whichp,
1109                const char *name);
1110     </synopsis>
1111
1112     <para>
1113      The <literal>arm</literal> array is used to describe each of the possible
1114      types that the CHOICE type may assume. Internally in your application,
1115      the CHOICE type is represented as a discriminated union. That is, a
1116      C union accompanied by an integer (or enum) identifying the active
1117      'arm' of the union.
1118      <literal>whichp</literal> is a pointer to the union discriminator.
1119      When encoding, it is examined to determine the current type.
1120      When decoding, it is set to reference the type that was found in
1121      the input stream.
1122     </para>
1123
1124     <para>
1125      The Odr_arm type is defined thus:
1126     </para>
1127
1128     <screen>
1129 typedef struct odr_arm
1130 {
1131     int tagmode;
1132     int class;
1133     int tag;
1134     int which;
1135     Odr_fun fun;
1136     char *name;
1137 } Odr_arm;
1138     </screen>
1139
1140     <para>
1141      The interpretation of the fields are:
1142     </para>
1143
1144     <variablelist>
1145      <varlistentry><term>tagmode</term>
1146       <listitem><para>Either <literal>ODR_IMPLICIT</literal>,
1147         <literal>ODR_EXPLICIT</literal>, or <literal>ODR_NONE</literal> (-1)
1148         to mark no tagging.</para></listitem>
1149      </varlistentry>
1150
1151      <varlistentry><term>which</term>
1152       <listitem><para>The value of the discriminator that corresponds to
1153         this CHOICE element. Typically, it will be a #defined constant, or
1154         an enum member.</para></listitem>
1155      </varlistentry>
1156
1157      <varlistentry><term>fun</term>
1158       <listitem><para>A pointer to a function that implements the type of
1159         the CHOICE member. It may be either a standard &odr; type or a type
1160         defined by yourself.</para></listitem>
1161      </varlistentry>
1162
1163      <varlistentry><term>name</term>
1164       <listitem><para>Name of tag.</para></listitem>
1165      </varlistentry>
1166     </variablelist>
1167
1168     <para>
1169      A handy way to prepare the array for use by the
1170      <function>odr_choice()</function> function is to
1171      define it as a static, initialized array in the beginning of your
1172      decoding/encoding function. Assume the type definition:
1173     </para>
1174
1175     <screen>
1176 MyChoice ::= CHOICE {
1177     untagged INTEGER,
1178     tagged   [99] IMPLICIT INTEGER,
1179     other    BOOLEAN
1180 }
1181     </screen>
1182
1183     <para>
1184      Your C type might look like
1185     </para>
1186
1187     <screen>
1188 typedef struct MyChoice
1189 {
1190     enum
1191     {
1192         MyChoice_untagged,
1193         MyChoice_tagged,
1194         MyChoice_other
1195     } which;
1196     union
1197     {
1198         Odr_int *untagged;
1199         Odr_int *tagged;
1200         Odr_bool *other;
1201     } u;
1202 };
1203     </screen>
1204
1205     <para>
1206      And your function could look like this:
1207     </para>
1208
1209     <screen>
1210 int myChoice(ODR o, MyChoice **p, int optional, const char *name)
1211 {
1212     static Odr_arm arm[] =
1213     {
1214       {-1, -1, -1, MyChoice_untagged, odr_integer, "untagged"},
1215       {ODR_IMPLICIT, ODR_CONTEXT, 99, MyChoice_tagged, odr_integer,
1216       "tagged"},
1217       {-1, -1, -1, MyChoice_other, odr_boolean, "other"},
1218       {-1, -1, -1, -1, 0}
1219     };
1220
1221     if (o->direction == ODR_DECODE)
1222         *p = odr_malloc(o, sizeof(**p);
1223     else if (!*p)
1224         return optional &amp;&amp; odr_ok(o);
1225
1226     if (odr_choice(o, arm, &amp;(*p)->u, &amp;(*p)->which), name)
1227         return 1;
1228     *p = 0;
1229         return optional &amp;&amp; odr_ok(o);
1230 }
1231     </screen>
1232
1233     <para>
1234      In some cases (say, a non-optional choice which is a member of a
1235      sequence), you can "embed" the union and its discriminator in the
1236      structure belonging to the enclosing type, and you won't need to
1237      fiddle with memory allocation to create a separate structure to
1238      wrap the discriminator and union.
1239     </para>
1240
1241     <para>
1242      The corresponding function is somewhat nicer in the Sun XDR interface.
1243      Most of the complexity of this interface comes from the possibility of
1244      declaring sequence elements (including CHOICEs) optional.
1245     </para>
1246
1247     <para>
1248      The ASN.1 specifications naturally requires that each member of a
1249      CHOICE have a distinct tag, so they can be told apart on decoding.
1250      Sometimes it can be useful to define a CHOICE that has multiple types
1251      that share the same tag. You'll need some other mechanism, perhaps
1252      keyed to the context of the CHOICE type. In effect, we would like to
1253      introduce a level of context-sensitiveness to our ASN.1 specification.
1254      When encoding an internal representation, we have no problem, as long
1255      as each CHOICE member has a distinct discriminator value. For
1256      decoding, we need a way to tell the choice function to look for a
1257      specific arm of the table. The function
1258     </para>
1259
1260     <synopsis>
1261 void odr_choice_bias(ODR o, int what);
1262     </synopsis>
1263
1264     <para>
1265      provides this functionality. When called, it leaves a notice for the next
1266      call to <function>odr_choice()</function> to be called on the decoding
1267      stream <literal>o</literal> that only the <literal>arm</literal> entry with
1268      a <literal>which</literal> field equal to <literal>what</literal>
1269      should be tried.
1270     </para>
1271
1272     <para>
1273      The most important application (perhaps the only one, really) is in
1274      the definition of application-specific EXTERNAL encoders/decoders
1275      which will automatically decode an ANY member given the direct or
1276      indirect reference.
1277     </para>
1278
1279    </sect2>
1280   </sect1>
1281
1282   <sect1 id="odr.debugging"><title>Debugging</title>
1283
1284    <para>
1285     The protocol modules are suffering somewhat from a lack of diagnostic
1286     tools at the moment. Specifically ways to pretty-print PDUs that
1287     aren't recognized by the system. We'll include something to this end
1288     in a not-too-distant release. In the meantime, what we do when we get
1289     packages we don't understand is to compile the ODR module with
1290     <literal>ODR_DEBUG</literal> defined. This causes the module to dump tracing
1291     information as it processes data units. With this output and the
1292     protocol specification (Z39.50), it is generally fairly easy to see
1293     what goes wrong.
1294    </para>
1295   </sect1>
1296  </chapter>
1297  <!-- Keep this comment at the end of the file
1298  Local variables:
1299  mode: sgml
1300  sgml-omittag:t
1301  sgml-shorttag:t
1302  sgml-minimize-attributes:nil
1303  sgml-always-quote-attributes:t
1304  sgml-indent-step:1
1305  sgml-indent-data:t
1306  sgml-parent-document: "yaz.xml"
1307  sgml-local-catalogs: nil
1308  sgml-namecase-general:t
1309  End:
1310  -->