home bbs files messages ]

Just a sample of the Echomail archive

Cooperative anarchy at its finest, still active today. Darkrealms is the Zone 1 Hub.

   GOLDED      GoldED Public Release discussion.      2,690 messages   

[   << oldest   |   < older   |   list   |   newer >   |   newest >>   ]

   Message 537 of 2,690   
   Michiel van der Vlist to andrew clarke   
   UTF-8   
   06 Jun 15 16:23:46   
   
   Hello andrew,   
      
   On Saturday June 06 2015 21:15, you wrote to Nicholas Boel:   
      
    ac> It supports writing UTF-8 messages, provided a UTF-8 capable external   
    ac> editor is used, as you say, but that's about it.   
      
   No, to write a message in UTF-8, you do not need to use an external editor.   
   What you need is the proper translation table from your local set to UTF-8.   
      
   It is reading message in UTF-8 that is problematic.   
      
      
   Here is my translation table from CP850 to UTF-8:   
   850_utf.chs   
      
      
   === Cut ===   
    file 850_utf8.chs   
      
    This file is a charset conversion module in text form.   
      
    This module Converts IBM CP850 characters to UTF-8 characters.   
      
    By Michiel van der Vlist, 2:280/5555   
      
    This is a modified version that translates the currency symbol € (Alt 207)   
    (hex CF) into the Euro sign.   
      
    Format: ID, version, level,   
            from charset, to charset,   
            128 entries: first, second and optional third byte   
            "END"   
    Lines beginning with a ";" or a ";" after the entries are comments   
      
      
    cedilla = ,   ; dieresis = ..       ; acute = '   
    grave = `     ; circumflex = ^      ; ring = o   
    tilde = ~     ; caron = v   
    All of these are above the character, apart from the cedilla which is below.   
      
    \ is the escape character: \0 means decimal zero,   
    \dnnn where nnn is a decimal number is the ordinal value of the character   
    \xnn where nn is a hexadecimal number   
    e.g.: \d32 is the ASCII space character   
    Two \\ is the character "\" itself.   
      
   4                 ; ID number   
   1                 ; version number   
      
   2 4               ; level number   
      
   CP850             ; from set   
   UTF-8             ; to set   
                    ; dec hx description   
      
   \xC3 \x87         ; 128 80 latin capital letter c with cedilla   
   \xC3 \xBC         ; 129 81 latin small letter u with diaeresis   
   \xC3 \xA9         ; 130 82 latin small letter e with acute   
   \xC3 \xA2         ; 131 83 latin small letter a with circumflex   
   \xC3 \xA4         ; 132 84 latin small letter a with diaeresis   
   \xC3 \xA0         ; 133 85 latin small letter a with grave   
   \xC3 \xA5         ; 134 86 latin small letter a with ring above   
   \xC3 \xA7         ; 135 87 latin small letter c with cedilla   
   \xC3 \xAA         ; 136 88 latin small letter e with circumflex   
   \xC3 \xAB         ; 137 89 latin small letter e with diaeresis   
   \xC3 \xA8         ; 138 8A latin small letter e with grave   
   \xC3 \xAF         ; 139 8B latin small letter i with diaeresis   
   \xC3 \xAE         ; 140 8C latin small letter i with circumflex   
   \xC3 \xAC         ; 141 8D latin small letter i with grave   
   \xC3 \x84         ; 142 8E latin capital letter a with diaeresis   
   \xC3 \x85         ; 143 8F latin capital letter a with ring above   
   \xC3 \x89         ; 144 90 latin capital letter e with acute   
   \xC3 \xA6         ; 145 91 latin small letter ae   
   \xC3 \x86         ; 146 92 latin capital letter ae   
   \xC3 \xB4         ; 147 93 latin small letter o with circumflex   
   \xC3 \xB6         ; 148 94 latin small letter o with diaeresis   
   \xC3 \xB2         ; 149 95 latin small letter o with grave   
   \xC3 \xBB         ; 150 96 latin small letter u with circumflex   
   \xC3 \xB9         ; 151 97 latin small letter u with grave   
   \xC3 \xBF         ; 152 98 latin small letter y with diaeresis   
   \xC3 \x96         ; 153 99 latin capital letter o with diaeresis   
   \xC3 \x9C         ; 154 9A latin capital letter u with diaeresis   
   \xC3 \xB8         ; 155 9B latin small letter o with stroke   
   \xC2 \xA3         ; 156 9C pound sign   
   \xC3 \x98         ; 157 9D latin capital letter o with stroke   
   \xC3 \x97         ; 158 9E multiplication sign   
   \xC6 \x92         ; 159 9F dutch guilder sign (ibm437 159) (f with hook)   
   \xC3 \xA1         ; 160 A0 latin small letter a with acute   
   \xC3 \xAD         ; 161 A1 latin small letter i with acute   
   \xC3 \xB3         ; 162 A2 latin small letter o with acute   
   \xC3 \xBA         ; 163 A3 latin small letter u with acute   
   \xC3 \xB1         ; 164 A4 latin small letter n with tilde   
   \xC3 \x91         ; 165 A5 latin capital letter n with tilde   
   \xC2 \xB8         ; 166 A6 feminine ordinal indicator   
   \xC2 \xBA         ; 167 A7 masculine ordinal indicator   
   \xC2 \xBF         ; 168 A8 inverted question mark   
   \xC2 \xAE         ; 169 A9 registered sign   
   \xC2 \xAC         ; 170 AA not sign   
   \xC2 \xBD         ; 171 AB vulgar fraction one half   
   \xC2 \xBC         ; 172 AC vulgar fraction one quarter   
   \xC2 \xA1         ; 173 AD inverted exclamation mark   
   \xC2 \xAB         ; 174 AE left-pointing double angle quotation mark   
   \xC2 \xBB         ; 175 AF right-pointing double angle quotation mark   
   \xE2 \x96 \x91    ; 176 B0 light shade   
   \xE2 \x96 \x92    ; 177 B1 medium shade   
   \xE2 \x96 \x93    ; 178 B2 dark shade   
   \xE2 \x94 \x82    ; 179 B3 box drawings light vertical   
   \xE2 \x94 \xA4    ; 180 B4 box drawings light vertical and left   
   \xC3 \x81         ; 181 B5 latin capital letter a with acute   
   \xC3 \x82         ; 182 B6 latin capital letter a with circumflex   
   \xC3 \x80         ; 183 B7 latin capital letter a with grave   
   \xC2 \xA9         ; 184 B8 copyright sign   
   \xE2 \x95 \xA3    ; 185 B9 box drawings heavy vertical and left   
   \xE2 \x95 \x91    ; 186 BA box drawings heavy vertical   
   \xE2 \x95 \x97    ; 187 BB box drawings heavy down and left   
   \xE2 \x95 \x9D    ; 188 BC box drawings heavy up and left   
   \xC2 \xA2         ; 189 BD cent sign   
   \xC2 \xA5         ; 190 BE yen sign   
   \xE2 \x94 \x90    ; 191 BF box drawings light down and left   
   \xE2 \x94 \x94    ; 192 C0 box drawings light up and right   
   \xE2 \x94 \xB4    ; 193 C1 box drawings light up and horizontal   
   \xE2 \x94 \xAC    ; 194 C2 box drawings light down and horizontal   
   \xE2 \x94 \x9C    ; 195 C3 box drawings light vertical and right   
   \xE2 \x94 \x80    ; 196 C4 box drawings light horizontal   
   \xE2 \x94 \xBC    ; 197 C5 box drawings light vertical and horizontal   
   \xC3 \xA3         ; 198 C6 latin small letter a with tilde   
   \xC3 \x83         ; 199 C7 latin capital letter a with tilde   
   \xE2 \x95 \x9A    ; 200 C8 box drawings heavy up and right   
   \xE2 \x95 \x94    ; 201 C9 box drawings heavy down and right   
   \xE2 \x95 \xA9    ; 202 CA box drawings heavy up and horizontal   
   \xE2 \x95 \xA6    ; 203 CB box drawings heavy down and horizontal   
   \xE2 \x95 \xA0    ; 204 CC box drawings heavy vertical and right   
   \xE2 \x95 \x90    ; 205 CD box drawings heavy horizontal   
   \xE2 \x95 \xAC    ; 206 CE box drawings heavy vertical and horizontal   
   \xC2 \xA4         ; 207 CF currency sign   
   \xE2 \x82 \xAC    ; 207 CF Euro sign.   
   \xC3 \xB0         ; 208 D0 latin small letter eth (icelandic)   
   \xC3 \x90         ; 209 D1 latin capital letter eth (icelandic)   
   \xC3 \x8A         ; 210 D2 latin capital letter e with circumflex   
   \xC3 \x8B         ; 211 D3 latin capital letter e with diaeresis   
   \xC3 \x88         ; 212 D4 latin capital letter e with grave   
   \xC4 \xB1         ; 213 D5 latin small letter i dotless   
   \xC3 \x8D         ; 214 D6 latin capital letter i with acute   
   \xC3 \x8E         ; 215 D7 latin capital letter i with circumflex   
   \xC3 \x8F         ; 216 D8 latin capital letter i with diaeresis   
   \xE2 \x94 \x98    ; 217 D9 box drawings light up and left   
   \xE2 \x94 \x8C    ; 218 DA box drawings light down and right   
   \xE2 \x96 \x88    ; 219 DB full block   
   \xE2 \x96 \x84    ; 220 DC lower half block   
   \xC2 \xA6         ; 221 DD broken bar   
   \xC3 \x8C         ; 222 DE latin capital letter i with grave   
   \xE2 \x96 \x80    ; 223 DF upper half block   
   \xC3 \x93         ; 224 E0 latin capital letter o with acute   
   \xC3 \x9F         ; 225 E1 latin small letter sharp s (german)   
   \xC3 \x94         ; 226 E2 latin capital letter o with circumflex   
   \xC3 \x92         ; 227 E3 latin capital letter o with grave   
   \xC3 \xB5         ; 228 E4 latin small letter o with tilde   
   \xC3 \x95         ; 229 E5 latin capital letter o with tilde   
   \xC2 \xB5         ; 230 E6 greek small letter mu   
   \xC3 \x9E         ; 231 E7 latin capital letter thorn (icelandic)   
   \xC3 \xBE         ; 232 E8 latin small letter thorn (icelandic)   
   \xC3 \x9A         ; 233 E9 latin capital letter u with acute   
   \xC3 \x9B         ; 234 EA latin capital letter u with circumflex   
   \xC3 \x99         ; 235 EB latin capital letter u with grave   
   \xC3 \xBD         ; 236 EC latin small letter y with acute   
   \xC3 \x9D         ; 237 ED latin capital letter y with acute   
   \xC3 \x8F         ; 238 EE em dash   
   \xC2 \xB4         ; 239 EF acute accent   
   \xC2 \xAD         ; 240 F0 soft hyphen   
   \xC2 \xB1         ; 241 F1 plus-minus sign   
   \xE2 \x80 \x97    ; 242 F2 left right double arrow   
   \xC2 \xBE         ; 243 F3 vulgar fraction three quarters   
   \xC2 \xB6         ; 244 F4 pilcrow sign   
   \xC2 \xA7         ; 245 F5 section sign   
   \xC3 \xB7         ; 246 F6 division sign   
   \xC2 \xB8         ; 247 F7 ogonek   
   \xC2 \xB0         ; 248 F8 degree sign   
   \xC2 \xA8         ; 249 F9 diaeresis   
   \xC2 \xB7         ; 250 FA middle dot   
   \xC2 \xB9         ; 251 FB superscript one   
   \xC2 \xB3         ; 252 FC superscript three   
   \xC2 \xB2         ; 253 FD superscript two   
   \xE2 \x96 \xA0    ; 254 FE black square   
   \xC2 \xA0         ; 255 FF no-break space   
   END   
   === Cut ===   
      
      
   Cheers, Michiel   
      
   --- GoldED+/W32-MSVC 1.1.5-b20130111   
    * Origin: http://www.vlist.eu (2:280/5555)   

[   << oldest   |   < older   |   list   |   newer >   |   newest >>   ]


(c) 1994,  bbs@darkrealms.ca