Unicode has been developed to describe all possible characters of all languages plus a lot of symbols with one unique number for each character/symbol. Unicode as defined by the Unicode organization has become a universal standard: ISO/IEC 10646, describing the 'Universal Multiple-Octet Coded Character Set' (UCS).
It is not always possible to transfer a Unicode character to another computer reliably. For that reason a special encoding scheme has been developed, UTF-8, which stands for UCS Transformation Format 8.
On this page you will find an overview of the UTF-8 encoding scheme.
This page is encoded as Windows-1252. Your browser should support this character set. If not, then the literal characters of the table below will be displayed incorrectly. That's no problem if you are only interested in the conversion algorithms for UTF-8 on this page.
Explanation of the table
| ch | dec | hx | U-hex | U-dec | UTF-dec | UTF-hx | lit | Unicode name | PostScript name |
| ™ | 153 | 99 | 00F4 | 244 | 195.180 | C3B4 | ô | TRADE MARK SIGN | trademark |
The meaning of the columns is a follows:
ch: the specified character as literal, which is probably not displayed correctly
dec: the decimal ASCII value of the character
hx: the hexadecimal value of the character
U-hex: the Unicode value in hexadecimal
U-dec: the Unicode value in decimal
UTF-dec: the UTF8-encoded bytes as decimal numbers
UTF-hx: the UTF8-encoded bytes as hexadecimal numbers
lit: the UTF8-encoded characters as literals, which are probably not displayed correctly in the absolute sense, but are displayed 'as seen' by your browser
Unicode name: the full Unicode name of the character
PostScript name: the PostScript name of the character if this name exists
Let us take for example the trademark sign, which looks something like a higher positioned TM.
On a Macintosh you can produce this sign by taking character position number 170 decimal. On a Windows computer this is position 153 decimal. Unicode is the same for all users and in this scheme the trademark sign can be found at position 2122 hexadecimal, which is the same as 8842 decimal.
On a webpage you could try to encode this character like ™ but not each and every browser is able to reproduce many of those 'entities'. If your reader has a version 4 browser or better, the best thing you can do is encode the trademark sign with a numerical Unicode entity like ™. In a special META-tag your page has to be defined as a UTF-8 page. This is explained in detail on the page with entity tips.
If you write an email with for instance Microsoft Outlook Express and let the emailer encode your letter as UTF-8, then Outlook Express converts the trademark sign to a UTF-8 code. The result is in this case a combination of two characters with numerical values 195 and 180. What characters you will see on your screen without UTF-8 decoding, depends on your platform. A Macintosh user would see a square root symbol, followed by a Yen sign. The Windows viewer will see a letter A with a tilde, followed by an acute accent.
How did the encoding program get these numbers?
UTF-8 encoding
The proper way to convert between UCS-4 and UTF-8 is to use bitmask (and, or) and bitshift operations. But if you would like to convert only a couple of characters by hand or if your program development environment (scripting language) does not support bit operations, then integer division and multiplication can be used as follows.
From Unicode UCS-4 to UTF-8:
Start with the Unicode number expressed as a decimal number and call this ud.
If ud <128 (7F hex) then UTF-8 is 1 byte long, the value of ud.
If ud >=128 and <=2047 (7FF hex) then UTF-8 is 2 bytes long.
byte 1 = 192 + (ud div 64)
byte 2 = 128 + (ud mod 64)
If ud >=2048 and <=65535 (FFFF hex) then UTF-8 is 3 bytes long.
byte 1 = 224 + (ud div 4096)
byte 2 = 128 + ((ud div 64) mod 64)
byte 3 = 128 + (ud mod 64)
If ud >=65536 and <=2097151 (1FFFFF hex) then UTF-8 is 4 bytes long.
byte 1 = 240 + (ud div 262144)
byte 2 = 128 + ((ud div 4096) mod 64)
byte 3 = 128 + ((ud div 64) mod 64)
byte 4 = 128 + (ud mod 64)
If ud >=2097152 and <=67108863 (3FFFFFF hex) then UTF-8 is 5 bytes long.
byte 1 = 248 + (ud div 16777216)
byte 2 = 128 + ((ud div 262144) mod 64)
byte 3 = 128 + ((ud div 4096) mod 64)
byte 4 = 128 + ((ud div 64) mod 64)
byte 5 = 128 + (ud mod 64)
If ud >=67108864 and <=2147483647 (7FFFFFFF hex) then UTF-8 is 6 bytes long.
byte 1 = 252 + (ud div 1073741824)
byte 2 = 128 + ((ud div 16777216) mod 64)
byte 3 = 128 + ((ud div 262144) mod 64)
byte 4 = 128 + ((ud div 4096) mod 64)
byte 5 = 128 + ((ud div 64) mod 64)
byte 6 = 128 + (ud mod 64)
The operation div means integer division and mod means the rest after integer division.
For positive numbers a div b = int(a/b) and a mod b = (a/b-int(a/b))*b.
UTF-8 sequences of 5 bytes and longer are at the moment not supported by the regular browsers.
The highest character position defined in Unicode 3.2 is number 10FFFF hex (1114111 dec) in a 'private use' area. The highest character with an actual glyph is number E007F hex (917631 dec), the CANCEL TAG character. In Unicode 6.1 there are still no characters defined above 200000 hex.
Please note that at the moment UTF-8 is only defined for number series from 1 to 4 bytes long. What will happen when the Unicode region above 200000 hex is filled, is not known. It is possible that UTF-8 will be extended to 6 byte series, but this is far from certain. That means that the algorithm given above should throw an error if ud >=2097152.
From UTF-8 to Unicode UCS-4:
Let's take a UTF-8 byte sequence. The first byte in a new sequence will tell us how long the sequence is. Let's call the subsequent decimal bytes z y x w v u.
If z is between and including 0 - 127, then there is 1 byte z. The decimal Unicode value ud = the value of z.
If z is between and including 192 - 223, then there are 2 bytes z y; ud = (z-192)*64 + (y-128)
If z is between and including 224 - 239, then there are 3 bytes z y x; ud = (z-224)*4096 + (y-128)*64 + (x-128)
If z is between and including 240 - 247, then there are 4 bytes z y x w; ud = (z-240)*262144 + (y-128)*4096 + (x-128)*64 + (w-128)
If z is between and including 248 - 251, then there are 5 bytes z y x w v; ud = (z-248)*16777216 + (y-128)*262144 + (x-128)*4096 + (w-128)*64 + (v-128)
If z is 252 or 253, then there are 6 bytes z y x w v u; ud = (z-252)*1073741824 + (y-128)*16777216 + (x-128)*262144 + (w-128)*4096 + (v-128)*64 + (u-128)
If z = 254 or 255 then there is something wrong!
Please note that at the moment UTF-8 is only defined for number series from 1 to 4 bytes long. What will happen when the Unicode region above 200000 hex is filled, is not known. It is possible that UTF-8 will be extended to 6 byte series, but this is far from certain. That means that the algorithm given above should throw an error if z >=248.
Example: take the decimal Unicode designation 8482 (decimal), which is for the trademark sign. This number is larger than 2048, so we get three numbers.
The first number is 224 + (8482 div 4096) = 224 + 2 = 226.
The second number is 128 + (8482 div 64) mod 64) = 128 + (132 mod 64) = 128 + 4 = 132.
The third number is 128 + (8482 mod 64) = 128 + 34 = 162.
Now the other way round. We see the numbers 226, 132 and 162. What is the decimal Unicode value?
In this case: (226-224)*4096+(132-128)*64+(162-128) = 8482.
And the conversion between hexadecimal and decimal? Come on, this is not a math tutorial! In case you don't know, use a calculator.
References
More information about the UTF-8 encoding can be found in:
Request for Comments No. 3629, UTF-8, a transformation format of ISO 10646.
The page you are reading now is encoded in the standard Windows Roman encoding, 'code page 1252'. The unicode definition can be found at:
Windows code page 1252, Unicode encodings
Our alternative page (utf8tblMac.html) is encoded in the Apple Roman Unicode encoding.
This encoding scheme can also be found at the unicode organization:
Apple Roman Unicode encoding.
That document describes the latest Apple character set, as used by the Apple Mac OS Text Encoding Converter software version 1.5 and above.
A remark about that encoding: code position 0xDB is now used for the EURO SIGN, but a couple of years ago this position was used for the CURRENCY SIGN, as originally defined.
And now where you have been waiting for, the complete UTF-8 table for 1-byte Unicode characters from 128 decimal to 255.
This table follows the Windows 1252 encoding scheme.
| ch | dec | hx | U-hex | U-dec | UTF-dec | UTF-hx | lit | Unicode name | PostScript name |
| € | 128 | 80 | 00C4 | 196 | 195.132 | C384 | Ä | EURO SIGN | Euro |
| 129 | 81 | 00C5 | 197 | 195.133 | C385 | Ã… | not defined | ||
| ‚ | 130 | 82 | 00C7 | 199 | 195.135 | C387 | Ç | SINGLE LOW-9 QUOTATION MARK | quotesinglbase |
| ƒ | 131 | 83 | 00C9 | 201 | 195.137 | C389 | É | LATIN SMALL LETTER F WITH HOOK | florin |
| „ | 132 | 84 | 00D1 | 209 | 195.145 | C391 | Ñ | DOUBLE LOW-9 QUOTATION MARK | quotedblbase |
| … | 133 | 85 | 00D6 | 214 | 195.150 | C396 | Ö | HORIZONTAL ELLIPSIS | ellipsis |
| † | 134 | 86 | 00DC | 220 | 195.156 | C39C | Ü | DAGGER | dagger |
| ‡ | 135 | 87 | 00E1 | 225 | 195.161 | C3A1 | á | DOUBLE DAGGER | daggerdbl |
| ˆ | 136 | 88 | 00E0 | 224 | 195.160 | C3A0 | Ã | MODIFIER LETTER CIRCUMFLEX ACCENT | circumflex |
| ‰ | 137 | 89 | 00E2 | 226 | 195.162 | C3A2 | â | PER MILLE SIGN | perthousand |
| Š | 138 | 8A | 00E4 | 228 | 195.164 | C3A4 | ä | LATIN CAPITAL LETTER S WITH CARON | Scaron |
| ‹ | 139 | 8B | 00E3 | 227 | 195.163 | C3A3 | ã | SINGLE LEFT-POINTING ANGLE QUOTATION MARK | guilsinglleft |
| Œ | 140 | 8C | 00E5 | 229 | 195.165 | C3A5 | Ã¥ | LATIN CAPITAL LIGATURE OE | OE |
| 141 | 8D | 00E7 | 231 | 195.167 | C3A7 | ç | not defined | ||
| Ž | 142 | 8E | 00E9 | 233 | 195.169 | C3A9 | é | LATIN CAPITAL LETTER Z WITH CARON | Zcaron |
| 143 | 8F | 00E8 | 232 | 195.168 | C3A8 | è | not defined | ||
| 144 | 90 | 00EA | 234 | 195.170 | C3AA | ê | not defined | ||
| ‘ | 145 | 91 | 00EB | 235 | 195.171 | C3AB | ë | LEFT SINGLE QUOTATION MARK | quoteleft |
| ’ | 146 | 92 | 00ED | 237 | 195.173 | C3AD | Ã | RIGHT SINGLE QUOTATION MARK | quoteright |
| “ | 147 | 93 | 00EC | 236 | 195.172 | C3AC | ì | LEFT DOUBLE QUOTATION MARK | quotedblleft |
| ” | 148 | 94 | 00EE | 238 | 195.174 | C3AE | î | RIGHT DOUBLE QUOTATION MARK | quotedblright |
| • | 149 | 95 | 00EF | 239 | 195.175 | C3AF | ï | BULLET | bullet |
| – | 150 | 96 | 00F1 | 241 | 195.177 | C3B1 | ñ | EN DASH | endash |
| — | 151 | 97 | 00F3 | 243 | 195.179 | C3B3 | ó | EM DASH | emdash |
| ˜ | 152 | 98 | 00F2 | 242 | 195.178 | C3B2 | ò | SMALL TILDE | tilde |
| ™ | 153 | 99 | 00F4 | 244 | 195.180 | C3B4 | ô | TRADE MARK SIGN | trademark |
| š | 154 | 9A | 00F6 | 246 | 195.182 | C3B6 | ö | LATIN SMALL LETTER S WITH CARON | scaron |
| › | 155 | 9B | 00F5 | 245 | 195.181 | C3B5 | õ | SINGLE RIGHT-POINTING ANGLE QUOTATION MARK | guilsinglright |
| œ | 156 | 9C | 00FA | 250 | 195.186 | C3BA | ú | LATIN SMALL LIGATURE OE | oe |
| 157 | 9D | 00F9 | 249 | 195.185 | C3B9 | ù | not defined | ||
| ž | 158 | 9E | 00FB | 251 | 195.187 | C3BB | û | LATIN SMALL LETTER Z WITH CARON | zcaron |
| Ÿ | 159 | 9F | 00FC | 252 | 195.188 | C3BC | ü | LATIN CAPITAL LETTER Y WITH DIAERESIS | Ydieresis |
| ch | dec | hx | U-hex | U-dec | UTF-dec | UTF-hx | lit | Unicode name | PostScript name |
| 160 | A0 | 2020 | 8224 | 226.128.160 | E280A0 | †| NO-BREAK SPACE | nobreakspace | |
| ¡ | 161 | A1 | 00B0 | 176 | 194.176 | C2B0 | ° | INVERTED EXCLAMATION MARK | exclamdown |
| ¢ | 162 | A2 | 00A2 | 162 | 194.162 | C2A2 | ¢ | CENT SIGN | cent |
| £ | 163 | A3 | 00A3 | 163 | 194.163 | C2A3 | £ | POUND SIGN | sterling |
| ¤ | 164 | A4 | 00A7 | 167 | 194.167 | C2A7 | § | CURRENCY SIGN | currency |
| ¥ | 165 | A5 | 2022 | 8226 | 226.128.162 | E280A2 | • | YEN SIGN | yen |
| ¦ | 166 | A6 | 00B6 | 182 | 194.182 | C2B6 | ¶ | BROKEN BAR | brokenbar |
| § | 167 | A7 | 00DF | 223 | 195.159 | C39F | ß | SECTION SIGN | section |
| ¨ | 168 | A8 | 00AE | 174 | 194.174 | C2AE | ® | DIAERESIS | dieresis |
| © | 169 | A9 | 00A9 | 169 | 194.169 | C2A9 | © | COPYRIGHT SIGN | copyright |
| ª | 170 | AA | 2122 | 8482 | 226.132.162 | E284A2 | â„¢ | FEMININE ORDINAL INDICATOR | ordfeminine |
| « | 171 | AB | 00B4 | 180 | 194.180 | C2B4 | ´ | LEFT-POINTING DOUBLE ANGLE QUOTATION MARK | guillemotleft |
| ¬ | 172 | AC | 00A8 | 168 | 194.168 | C2A8 | ¨ | NOT SIGN | logicalnot |
| | 173 | AD | 2260 | 8800 | 226.137.160 | E289A0 | ≠| SOFT HYPHEN | hyphen |
| ® | 174 | AE | 00C6 | 198 | 195.134 | C386 | Æ | REGISTERED SIGN | registered |
| ¯ | 175 | AF | 00D8 | 216 | 195.152 | C398 | Ø | MACRON | macron |
| ° | 176 | B0 | 221E | 8734 | 226.136.158 | E2889E | ∞ | DEGREE SIGN | degree |
| ± | 177 | B1 | 00B1 | 177 | 194.177 | C2B1 | ± | PLUS-MINUS SIGN | plusminus |
| ² | 178 | B2 | 2264 | 8804 | 226.137.164 | E289A4 | ≤ | SUPERSCRIPT TWO | twosuperior |
| ³ | 179 | B3 | 2265 | 8805 | 226.137.165 | E289A5 | ≥ | SUPERSCRIPT THREE | threesuperior |
| ´ | 180 | B4 | 00A5 | 165 | 194.165 | C2A5 | Â¥ | ACUTE ACCENT | acute |
| µ | 181 | B5 | 00B5 | 181 | 194.181 | C2B5 | µ | MICRO SIGN | mu |
| ¶ | 182 | B6 | 2202 | 8706 | 226.136.130 | E28882 | ∂ | PILCROW SIGN | paragraph |
| · | 183 | B7 | 2211 | 8721 | 226.136.145 | E28891 | ∑ | MIDDLE DOT | periodcentered |
| ¸ | 184 | B8 | 220F | 8719 | 226.136.143 | E2888F | ∠| CEDILLA | cedilla |
| ¹ | 185 | B9 | 03C0 | 960 | 207.128 | CF80 | Ï€ | SUPERSCRIPT ONE | onesuperior |
| º | 186 | BA | 222B | 8747 | 226.136.171 | E288AB | ∫ | MASCULINE ORDINAL INDICATOR | ordmasculine |
| » | 187 | BB | 00AA | 170 | 194.170 | C2AA | ª | RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK | guillemotright |
| ¼ | 188 | BC | 00BA | 186 | 194.186 | C2BA | º | VULGAR FRACTION ONE QUARTER | onequarter |
| ½ | 189 | BD | 03A9 | 937 | 206.169 | CEA9 | Ω | VULGAR FRACTION ONE HALF | onehalf |
| ¾ | 190 | BE | 00E6 | 230 | 195.166 | C3A6 | æ | VULGAR FRACTION THREE QUARTERS | threequarters |
| ¿ | 191 | BF | 00F8 | 248 | 195.184 | C3B8 | ø | INVERTED QUESTION MARK | questiondown |
| ch | dec | hx | U-hex | U-dec | UTF-dec | UTF-hx | lit | Unicode name | PostScript name |
| À | 192 | C0 | 00BF | 191 | 194.191 | C2BF | ¿ | LATIN CAPITAL LETTER A WITH GRAVE | Agrave |
| Á | 193 | C1 | 00A1 | 161 | 194.161 | C2A1 | ¡ | LATIN CAPITAL LETTER A WITH ACUTE | Aacute |
|  | 194 | C2 | 00AC | 172 | 194.172 | C2AC | ¬ | LATIN CAPITAL LETTER A WITH CIRCUMFLEX | Acircumflex |
| à | 195 | C3 | 221A | 8730 | 226.136.154 | E2889A | √ | LATIN CAPITAL LETTER A WITH TILDE | Atilde |
| Ä | 196 | C4 | 0192 | 402 | 198.146 | C692 | Æ’ | LATIN CAPITAL LETTER A WITH DIAERESIS | Adieresis |
| Š| 197 | C5 | 2248 | 8776 | 226.137.136 | E28988 | ≈ | LATIN CAPITAL LETTER A WITH RING ABOVE | Aring |
| Æ | 198 | C6 | 2206 | 8710 | 226.136.134 | E28886 | ∆ | LATIN CAPITAL LETTER AE | AE |
| Ç | 199 | C7 | 00AB | 171 | 194.171 | C2AB | « | LATIN CAPITAL LETTER C WITH CEDILLA | Ccedilla |
| È | 200 | C8 | 00BB | 187 | 194.187 | C2BB | » | LATIN CAPITAL LETTER E WITH GRAVE | Egrave |
| É | 201 | C9 | 2026 | 8230 | 226.128.166 | E280A6 | … | LATIN CAPITAL LETTER E WITH ACUTE | Eacute |
| Ê | 202 | CA | 00A0 | 160 | 194.160 | C2A0 | Â | LATIN CAPITAL LETTER E WITH CIRCUMFLEX | Ecircumflex |
| Ë | 203 | CB | 00C0 | 192 | 195.128 | C380 | À | LATIN CAPITAL LETTER E WITH DIAERESIS | Edieresis |
| Ì | 204 | CC | 00C3 | 195 | 195.131 | C383 | Ã | LATIN CAPITAL LETTER I WITH GRAVE | Igrave |
| Í | 205 | CD | 00D5 | 213 | 195.149 | C395 | Õ | LATIN CAPITAL LETTER I WITH ACUTE | Iacute |
| Î | 206 | CE | 0152 | 338 | 197.146 | C592 | Å’ | LATIN CAPITAL LETTER I WITH CIRCUMFLEX | Icircumflex |
| Ï | 207 | CF | 0153 | 339 | 197.147 | C593 | Å“ | LATIN CAPITAL LETTER I WITH DIAERESIS | Idieresis |
| Р| 208 | D0 | 2013 | 8211 | 226.128.147 | E28093 | – | LATIN CAPITAL LETTER ETH | Eth |
| Ñ | 209 | D1 | 2014 | 8212 | 226.128.148 | E28094 | — | LATIN CAPITAL LETTER N WITH TILDE | Ntilde |
| Ò | 210 | D2 | 201C | 8220 | 226.128.156 | E2809C | “ | LATIN CAPITAL LETTER O WITH GRAVE | Ograve |
| Ó | 211 | D3 | 201D | 8221 | 226.128.157 | E2809D | †| LATIN CAPITAL LETTER O WITH ACUTE | Oacute |
| Ô | 212 | D4 | 2018 | 8216 | 226.128.152 | E28098 | ‘ | LATIN CAPITAL LETTER O WITH CIRCUMFLEX | Ocircumflex |
| Õ | 213 | D5 | 2019 | 8217 | 226.128.153 | E28099 | ’ | LATIN CAPITAL LETTER O WITH TILDE | Otilde |
| Ö | 214 | D6 | 00F7 | 247 | 195.183 | C3B7 | ÷ | LATIN CAPITAL LETTER O WITH DIAERESIS | Odieresis |
| × | 215 | D7 | 25CA | 9674 | 226.151.138 | E2978A | â—Š | MULTIPLICATION SIGN | multiply |
| Ø | 216 | D8 | 00FF | 255 | 195.191 | C3BF | ÿ | LATIN CAPITAL LETTER O WITH STROKE | Oslash |
| ٠| 217 | D9 | 0178 | 376 | 197.184 | C5B8 | Ÿ | LATIN CAPITAL LETTER U WITH GRAVE | Ugrave |
| Ú | 218 | DA | 2044 | 8260 | 226.129.132 | E28184 | â„ | LATIN CAPITAL LETTER U WITH ACUTE | Uacute |
| Û | 219 | DB | 20AC | 8364 | 226.130.172 | E282AC | € | LATIN CAPITAL LETTER U WITH CIRCUMFLEX | Ucircumflex |
| Ü | 220 | DC | 2039 | 8249 | 226.128.185 | E280B9 | ‹ | LATIN CAPITAL LETTER U WITH DIAERESIS | Udieresis |
| Ý | 221 | DD | 203A | 8250 | 226.128.186 | E280BA | › | LATIN CAPITAL LETTER Y WITH ACUTE | Yacute |
| Þ | 222 | DE | FB01 | 64257 | 239.172.129 | EFAC81 | ï¬ | LATIN CAPITAL LETTER THORN | Thorn |
| ß | 223 | DF | FB02 | 64258 | 239.172.130 | EFAC82 | fl | LATIN SMALL LETTER SHARP S | germandbls |
| ch | dec | hx | U-hex | U-dec | UTF-dec | UTF-hx | lit | Unicode name | PostScript name |
| à | 224 | E0 | 2021 | 8225 | 226.128.161 | E280A1 | ‡ | LATIN SMALL LETTER A WITH GRAVE | agrave |
| á | 225 | E1 | 00B7 | 183 | 194.183 | C2B7 | · | LATIN SMALL LETTER A WITH ACUTE | aacute |
| â | 226 | E2 | 201A | 8218 | 226.128.154 | E2809A | ‚ | LATIN SMALL LETTER A WITH CIRCUMFLEX | acircumflex |
| ã | 227 | E3 | 201E | 8222 | 226.128.158 | E2809E | „ | LATIN SMALL LETTER A WITH TILDE | atilde |
| ä | 228 | E4 | 2030 | 8240 | 226.128.176 | E280B0 | ‰ | LATIN SMALL LETTER A WITH DIAERESIS | adieresis |
| å | 229 | E5 | 00C2 | 194 | 195.130 | C382 | Â | LATIN SMALL LETTER A WITH RING ABOVE | aring |
| æ | 230 | E6 | 00CA | 202 | 195.138 | C38A | Ê | LATIN SMALL LETTER AE | ae |
| ç | 231 | E7 | 00C1 | 193 | 195.129 | C381 | Ã | LATIN SMALL LETTER C WITH CEDILLA | ccedilla |
| è | 232 | E8 | 00CB | 203 | 195.139 | C38B | Ë | LATIN SMALL LETTER E WITH GRAVE | egrave |
| é | 233 | E9 | 00C8 | 200 | 195.136 | C388 | È | LATIN SMALL LETTER E WITH ACUTE | eacute |
| ê | 234 | EA | 00CD | 205 | 195.141 | C38D | Ã | LATIN SMALL LETTER E WITH CIRCUMFLEX | ecircumflex |
| ë | 235 | EB | 00CE | 206 | 195.142 | C38E | ÃŽ | LATIN SMALL LETTER E WITH DIAERESIS | edieresis |
| ì | 236 | EC | 00CF | 207 | 195.143 | C38F | Ã | LATIN SMALL LETTER I WITH GRAVE | igrave |
| í | 237 | ED | 00CC | 204 | 195.140 | C38C | ÃŒ | LATIN SMALL LETTER I WITH ACUTE | iacute |
| î | 238 | EE | 00D3 | 211 | 195.147 | C393 | Ó | LATIN SMALL LETTER I WITH CIRCUMFLEX | icircumflex |
| ï | 239 | EF | 00D4 | 212 | 195.148 | C394 | Ô | LATIN SMALL LETTER I WITH DIAERESIS | idieresis |
| ð | 240 | F0 | F8FF | 63743 | 239.163.191 | EFA3BF |  | LATIN SMALL LETTER ETH | eth |
| ñ | 241 | F1 | 00D2 | 210 | 195.146 | C392 | Ã’ | LATIN SMALL LETTER N WITH TILDE | ntilde |
| ò | 242 | F2 | 00DA | 218 | 195.154 | C39A | Ú | LATIN SMALL LETTER O WITH GRAVE | ograve |
| ó | 243 | F3 | 00DB | 219 | 195.155 | C39B | Û | LATIN SMALL LETTER O WITH ACUTE | oacute |
| ô | 244 | F4 | 00D9 | 217 | 195.153 | C399 | Ù | LATIN SMALL LETTER O WITH CIRCUMFLEX | ocircumflex |
| õ | 245 | F5 | 0131 | 305 | 196.177 | C4B1 | ı | LATIN SMALL LETTER O WITH TILDE | otilde |
| ö | 246 | F6 | 02C6 | 710 | 203.134 | CB86 | ˆ | LATIN SMALL LETTER O WITH DIAERESIS | odieresis |
| ÷ | 247 | F7 | 02DC | 732 | 203.156 | CB9C | Ëœ | DIVISION SIGN | divide |
| ø | 248 | F8 | 00AF | 175 | 194.175 | C2AF | ¯ | LATIN SMALL LETTER O WITH STROKE | oslash |
| ù | 249 | F9 | 02D8 | 728 | 203.152 | CB98 | ˘ | LATIN SMALL LETTER U WITH GRAVE | ugrave |
| ú | 250 | FA | 02D9 | 729 | 203.153 | CB99 | Ë™ | LATIN SMALL LETTER U WITH ACUTE | uacute |
| û | 251 | FB | 02DA | 730 | 203.154 | CB9A | Ëš | LATIN SMALL LETTER U WITH CIRCUMFLEX | ucircumflex |
| ü | 252 | FC | 00B8 | 184 | 194.184 | C2B8 | ¸ | LATIN SMALL LETTER U WITH DIAERESIS | udieresis |
| ý | 253 | FD | 02DD | 733 | 203.157 | CB9D | Ë | LATIN SMALL LETTER Y WITH ACUTE | yacute |
| þ | 254 | FE | 02DB | 731 | 203.155 | CB9B | Ë› | LATIN SMALL LETTER THORN | thorn |
| ÿ | 255 | FF | 02C7 | 711 | 203.135 | CB87 | ˇ | LATIN SMALL LETTER Y WITH DIAERESIS | ydieresis |
The following characters can normally not be displayed on a Macintosh. These are the Windows characters that can be found in the Windows code page cp1252, for which no equivalent characters can be found in the MacRoman encoding below a numerical character value of 256.
Modern Macintosh browsers however, are able to display them in another encoding scheme, especially when the user has installed the full Apple Language Kits, found on the Mac OS 9+ CD.
| ch | dec | hx | U-hex | U-dec | UTF-dec | UTF-hx | lit | Unicode name | PostScript name |
| Š | 138 | 8A | 0160 | 352 | 197.160 | C5A0 | Å | LATIN CAPITAL LETTER S WITH CARON | Scaron |
| š | 154 | 9A | 0161 | 353 | 197.161 | C5A1 | Å¡ | LATIN SMALL LETTER S WITH CARON | scaron |
| ¦ | 166 | A6 | A6 | 166 | 194.166 | C2A6 | ¦ | BROKEN BAR | brokenbar |
| ² | 178 | B2 | B2 | 178 | 194.178 | C2B2 | ² | SUPERSCRIPT TWO | twosuperior |
| ³ | 179 | B3 | B3 | 179 | 194.179 | C2B3 | ³ | SUPERSCRIPT THREE | threesuperior |
| ¹ | 185 | B9 | B9 | 185 | 194.185 | C2B9 | ¹ | SUPERSCRIPT ONE | onesuperior |
| ¼ | 188 | BC | BC | 188 | 194.188 | C2BC | ¼ | VULGAR FRACTION ONE QUARTER | onequarter |
| ½ | 189 | BD | BD | 189 | 194.189 | C2BD | ½ | VULGAR FRACTION ONE HALF | onehalf |
| ¾ | 190 | BE | BE | 190 | 194.190 | C2BE | ¾ | VULGAR FRACTION THREE QUARTERS | threequarters |
| Ð | 208 | D0 | D0 | 208 | 195.144 | C390 | Ã | LATIN CAPITAL LETTER ETH | Eth |
| × | 215 | D7 | D7 | 215 | 195.151 | C397 | × | MULTIPLICATION SIGN | multiply |
| Ý | 221 | DD | DD | 221 | 195.157 | C39D | Ã | LATIN CAPITAL LETTER Y WITH ACUTE | Yacute |
| Þ | 222 | DE | DE | 222 | 195.158 | C39E | Þ | LATIN CAPITAL LETTER THORN | Thorn |
| ð | 240 | F0 | F0 | 240 | 195.176 | C3B0 | ð | LATIN SMALL LETTER ETH | eth |
| ý | 253 | FD | FD | 253 | 195.189 | C3BD | ý | LATIN SMALL LETTER Y WITH ACUTE | yacute |
| þ | 254 | FE | FE | 254 | 195.190 | C3BE | þ | LATIN SMALL LETTER THORN | thorn |
The following characters can normally not be displayed on a Windows page with character encoding Windows-1252 in effect. These are the Macintosh characters that can be found in the MacRoman code page, for which no equivalent characters can be found in the Windows-1252 encoding below a numerical character value of 256.
Modern Windows and other browsers however, are able to display them in another encoding scheme.
In the following table we have encoded the characters in the first column 'ch' with numerical Unicode entities (see for the values the column 'U-hex' or 'U-dec'). The numbers in the colums 'dec' and 'hex' are the positions in the MacRoman encoding where these characters can be found.
| ch | dec | hx | U-hex | U-dec | UTF-dec | UTF-hx | lit | Unicode name | PostScript name |
| ≠ | 173 | AD | 2260 | 8800 | 226.137.160 | E289A0 | ≠| NOT EQUAL TO | notequal |
| ∞ | 176 | B0 | 221E | 8734 | 226.136.158 | E2889E | ∞ | INFINITY | infinity |
| ≤ | 178 | B2 | 2264 | 8804 | 226.137.164 | E289A4 | ≤ | LESS-THAN OR EQUAL TO | lessequal |
| ≥ | 179 | B3 | 2265 | 8805 | 226.137.165 | E289A5 | ≥ | GREATER-THAN OR EQUAL TO | greaterequal |
| ∂ | 182 | B6 | 2202 | 8706 | 226.136.130 | E28882 | ∂ | PARTIAL DIFFERENTIAL | partialdiff |
| ∑ | 183 | B7 | 2211 | 8721 | 226.136.145 | E28891 | ∑ | N-ARY SUMMATION | summation |
| ∏ | 184 | B8 | 220F | 8719 | 226.136.143 | E2888F | ∠| N-ARY PRODUCT | product |
| π | 185 | B9 | 03C0 | 960 | 207.128 | CF80 | Ï€ | GREEK SMALL LETTER PI | pi |
| ∫ | 186 | BA | 222B | 8747 | 226.136.171 | E288AB | ∫ | INTEGRAL | integral |
| Ω | 189 | BD | 03A9 | 937 | 206.169 | CEA9 | Ω | GREEK CAPITAL LETTER OMEGA | Omega |
| √ | 195 | C3 | 221A | 8730 | 226.136.154 | E2889A | √ | SQUARE ROOT | radical |
| ≈ | 197 | C5 | 2248 | 8776 | 226.137.136 | E28988 | ≈ | ALMOST EQUAL TO | approxequal |
| ∆ | 198 | C6 | 2206 | 8710 | 226.136.134 | E28886 | ∆ | INCREMENT | Delta |
| ◊ | 215 | D7 | 25CA | 9674 | 226.151.138 | E2978A | â—Š | LOZENGE | lozenge |
| ⁄ | 218 | DA | 2044 | 8260 | 226.129.132 | E28184 | â„ | FRACTION SLASH | fraction |
| fi | 222 | DE | FB01 | 64257 | 239.172.129 | EFAC81 | ï¬ | LATIN SMALL LIGATURE FI | fi |
| fl | 223 | DF | FB02 | 64258 | 239.172.130 | EFAC82 | fl | LATIN SMALL LIGATURE FL | fl |
| | 240 | F0 | F8FF | 63743 | 239.163.191 | EFA3BF |  | Apple logo | apple |
| ı | 245 | F5 | 0131 | 305 | 196.177 | C4B1 | ı | LATIN SMALL LETTER DOTLESS I | dotlessi |
| ˘ | 249 | F9 | 02D8 | 728 | 203.152 | CB98 | ˘ | BREVE | breve |
| ˙ | 250 | FA | 02D9 | 729 | 203.153 | CB99 | Ë™ | DOT ABOVE | dotaccent |
| ˚ | 251 | FB | 02DA | 730 | 203.154 | CB9A | Ëš | RING ABOVE | ring |
| ˝ | 253 | FD | 02DD | 733 | 203.157 | CB9D | Ë | DOUBLE ACUTE ACCENT | hungarumlaut |
| ˛ | 254 | FE | 02DB | 731 | 203.155 | CB9B | Ë› | OGONEK | ogonek |
| ˇ | 255 | FF | 02C7 | 711 | 203.135 | CB87 | ˇ | CARON | caron |
© Oscar van Vlijmen, June 2000
URL of this page: http://home.telfort.nl/~t876506/utf8tbl.html
Last update: 2012-07-17