X-Git-Url: http://git.indexdata.com/?a=blobdiff_plain;ds=sidebyside;f=src%2Fcodetables-iso5426.xml;h=54dbb1f1967bf1add7e656f25276ba740b426fa2;hb=1ecf21b0324959e62628807d5df217d7a761730b;hp=d61546f5a94a91273000ee297d893f76bb784438;hpb=ecc1ee5b177fdc11ed531dfe13b5c3aae52b2843;p=yaz-moved-to-github.git
diff --git a/src/codetables-iso5426.xml b/src/codetables-iso5426.xml
index d61546f..54dbb1f 100644
--- a/src/codetables-iso5426.xml
+++ b/src/codetables-iso5426.xml
@@ -7,21 +7,13 @@
contains the ISO5426 code (in hex) for the character as coming from the G1
graphic set, the third column contains the UCS/Unicode 16-bit code (in
hex), the fourth column contains the UTF-8 code (in hex) for the UCS
- characters, the fifth column contains a representation of the character (where possible),
+ characters, the fifth column contains a representation of the character (where possible),
the sixth column contains the MARC character name, followed
by the UCS name. If the MARC name is the same as or very similar to the
- UCS name, only the UCS name is given. For some tables alternate encodings
- in Unicode and UTF-8 are given. When that occurs the alternate Unicode and
+ UCS name, only the UCS name is given. For some tables alternate encodings
+ in Unicode and UTF-8 are given. When that occurs the alternate Unicode and
alternate UTF-8 columns follow the character name.
-
1D
001D
@@ -615,7 +607,6 @@ BRACKET
SPACING TILDE / TILDE
-
See also Zeichentabelle MAB2 (ISO 5426-1983), http://www.gymel.com/charsets/MAB2.html
@@ -641,14 +632,12 @@ BRACKET
C2A1
INVERTED EXCLAMATION MARK
-
A2
201E
E2809E
LOW DOUBLE COMMA QUOTATION MARK
-
A3
00A3
@@ -661,19 +650,18 @@ BRACKET
24
DOLLAR SIGN
-
A5
00A5
C2A5
YEN SIGN
-
+
A6
2020
E280A0
DAGGER
-
+
A7
00A7
@@ -686,31 +674,30 @@ BRACKET
E280A0
PRIME
-
A9
2018
E28098
SINGLE TURNED COMMA QUOTATION MARK
-
+
AA
201C
E2809C
DOUBLE TURNED COMMA QUOTATION MARK
-
+
AB
00AB
E280A0
LEFT-POINTING DOUBLE ANGLE QUOTATION MARK (LEFT POINTING GUILLEMET)
-
+
AC
266D
E299AD
MUSIC FLAT SIGN (FLAT)
-
+
AD
00A9
@@ -729,17 +716,12 @@ BRACKET
C2AE
PATENT MARK / REGISTERED SIGN
-
-
-
-
B0
02BB
CABB
AYN / MODIFIER LETTER TURNED COMMA
-
B1
02BC
@@ -747,7 +729,6 @@ BRACKET
CABE
ALIF / MODIFIER LETTER APOSTROPHE
-
B2
201A
@@ -772,26 +753,26 @@ BRACKET
2033
E280B3
DOUBLE PRIME
-
+
B9
2019
E2809D
RIGHT SINGLE QUOTATION MARK (SINGLE COMMA QUOTATION MARK)
-
+
BA
201D
E2809D
RIGHT DOUBLE QUOTATION MARK (DOUBLE COMMA QUOTATION MARK)
-
+
BB
00BB
C2BB
RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK (RIGHT POINTING GUILLEMET)
-
-
+
+
BC
266F
E299AF
@@ -807,7 +788,7 @@ BRACKET
BE
02BA
CABA
- HARD SIGN, DOUBLE PRIME / MODIFIER LETTER DOUBLE PRIME
+ HARD SIGN, DOUBLE PRIME / MODIFIER LETTER DOUBLE PRIME
BF
@@ -815,7 +796,6 @@ BRACKET
C2BF
INVERTED QUESTION MARK
-
true
C0
@@ -878,7 +858,7 @@ BRACKET
http://www.unicode.org/faq/char_combmark.html#18
true
C8
- 034F0308
+ 0308
CC88
U+034F COMBINING GRAPHEME JOINER (CGJ) / tréma
@@ -908,7 +888,7 @@ BRACKET
CC
0313
CC93
- HIGH COMMA, CENTERED / COMBINING COMMA ABOVE (Psili)
+ HIGH COMMA, CENTERED / COMBINING COMMA ABOVE (Psili)
true
@@ -931,14 +911,13 @@ BRACKET
CC8C
HACEK / COMBINING CARON
-
true
D0
0327
CCA7
CEDILLA / COMBINING CEDILLA
-
+
true
D1
@@ -952,7 +931,7 @@ BRACKET
0326
CCA6
LEFT HOOK (COMMA BELOW) / COMBINING COMMA BELOW
-
+
true
D3
@@ -1002,7 +981,6 @@ BRACKET
CCB3
DOUBLE UNDERSCORE / COMBINING DOUBLE LOW LINE
-
true
DA
@@ -1026,7 +1004,7 @@ BRACKET
FE22
EFB8A2
DOUBLE TILDE, FIRST HALF / COMBINING DOUBLE TILDE
-
+
true
DE
@@ -1035,18 +1013,18 @@ BRACKET
FE21
EFB8A1
LIGATURE, SECOND HALF / COMBINING LIGATURE RIGHT HALF
- The Ligature that spans two characters
- is constructed of two halves in MARC-8: EB
- (Ligature, first half) and EC (Ligature, second
- half). The preferred Unicode/UTF-8 mapping is to
+ The Ligature that spans two characters
+ is constructed of two halves in MARC-8: EB
+ (Ligature, first half) and EC (Ligature, second
+ half). The preferred Unicode/UTF-8 mapping is to
the single character Ligature that spans two characters,
U+0361. The single character Ligature is encoded
- following the second of the two characters to be spanned.
- The two half Ligatures in Unicode, to which the
- Ligature has been mapped since 1996, are indicted
- in the mapping as alternatives, but their use is not
- recommended. It is expected that font support for
- the single character Ligature mark will be more
+ following the second of the two characters to be spanned.
+ The two half Ligatures in Unicode, to which the
+ Ligature has been mapped since 1996, are indicted
+ in the mapping as alternatives, but their use is not
+ recommended. It is expected that font support for
+ the single character Ligature mark will be more
easily obtained than for the two halves.
@@ -1057,24 +1035,22 @@ BRACKET
FE23
EFB8A3
DOUBLE TILDE, SECOND HALF / COMBINING DOUBLE TILDE RIGHT HALF
- The Double Tilde that spans two characters is
- constructed of two halves in MARC-8: FA (Double
- Tilde, first half) and FB (Double Tilde, second
- half). The preferred Unicode/UTF-8 mapping
- is to the single character Double Tilde that
- spans two characters, U+0360. The single
- character Double Tilde is encoded following
- the second of the two characters to be spanned.
- The two half Double Tildes in Unicode, to
- which the MARC8 Double Tilde has been
- mapped since 1996, are indicted in the
- mapping as alternatives, but their use is not
- recommended. It is expected that font support
- for the single character Double Tilde mark will
+ The Double Tilde that spans two characters is
+ constructed of two halves in MARC-8: FA (Double
+ Tilde, first half) and FB (Double Tilde, second
+ half). The preferred Unicode/UTF-8 mapping
+ is to the single character Double Tilde that
+ spans two characters, U+0360. The single
+ character Double Tilde is encoded following
+ the second of the two characters to be spanned.
+ The two half Double Tildes in Unicode, to
+ which the MARC8 Double Tilde has been
+ mapped since 1996, are indicted in the
+ mapping as alternatives, but their use is not
+ recommended. It is expected that font support
+ for the single character Double Tilde mark will
be more easily obtained than for the two halves.
-
-
E1
@@ -1100,7 +1076,7 @@ BRACKET
E8
0141
C581
- UPPERCASE POLISH L / LATIN CAPITAL LETTER L WITH STROKE
+ UPPERCASE POLISH L / LATIN CAPITAL LETTER L WITH STROKE
E9
@@ -1152,7 +1128,7 @@ BRACKET
0133
C4B3
LATIN SMALL LIGATURE IJ (LATIN SMALL LETTER I J)
-
+
F8