Unicode UTF-8 encoding

Go to an alternative page with a Macintosh encoding
Go to the page with explanations and tips of 'HTML entities'
Go to the table with all HTML entities
Go to a page with tables of Macintosh & Windows standard encodings and Symbol encodings in ASCII.
Go to/Back to the index.

Unicode has been developed to describe all possible characters of all languages plus a lot of symbols with one unique number for each character/symbol. Unicode as defined by the Unicode organization has become a universal standard: ISO/IEC 10646, describing the 'Universal Multiple-Octet Coded Character Set' (UCS).
It is not always possible to transfer a Unicode character to another computer reliably. For that reason a special encoding scheme has been developed, UTF-8, which stands for UCS Transformation Format 8.
On this page you will find an overview of the UTF-8 encoding scheme.

This page is encoded as Windows-1252. Your browser should support this character set. If not, then the literal characters of the table below will be displayed incorrectly. That's no problem if you are only interested in the conversion algorithms for UTF-8 on this page.

Explanation of the table
chdechxU-hexU-decUTF-decUTF-hxlitUnicode namePostScript name
1539900F4244195.180C3B4ôTRADE MARK SIGNtrademark

The meaning of the columns is a follows:
ch: the specified character as literal, which is probably not displayed correctly
dec: the decimal ASCII value of the character
hx: the hexadecimal value of the character
U-hex: the Unicode value in hexadecimal
U-dec: the Unicode value in decimal
UTF-dec: the UTF8-encoded bytes as decimal numbers
UTF-hx: the UTF8-encoded bytes as hexadecimal numbers
lit: the UTF8-encoded characters as literals, which are probably not displayed correctly in the absolute sense, but are displayed 'as seen' by your browser
Unicode name: the full Unicode name of the character
PostScript name: the PostScript name of the character if this name exists

Let us take for example the trademark sign, which looks something like a higher positioned TM.
On a Macintosh you can produce this sign by taking character position number 170 decimal. On a Windows computer this is position 153 decimal. Unicode is the same for all users and in this scheme the trademark sign can be found at position 2122 hexadecimal, which is the same as 8842 decimal.
On a webpage you could try to encode this character like ™ but not each and every browser is able to reproduce many of those 'entities'. If your reader has a version 4 browser or better, the best thing you can do is encode the trademark sign with a numerical Unicode entity like ™. In a special META-tag your page has to be defined as a UTF-8 page. This is explained in detail on the page with entity tips.
If you write an email with for instance Microsoft Outlook Express and let the emailer encode your letter as UTF-8, then Outlook Express converts the trademark sign to a UTF-8 code. The result is in this case a combination of two characters with numerical values 195 and 180. What characters you will see on your screen without UTF-8 decoding, depends on your platform. A Macintosh user would see a square root symbol, followed by a Yen sign. The Windows viewer will see a letter A with a tilde, followed by an acute accent.
How did the encoding program get these numbers?

UTF-8 encoding
The proper way to convert between UCS-4 and UTF-8 is to use bitmask (and, or) and bitshift operations. But if you would like to convert only a couple of characters by hand or if your program development environment (scripting language) does not support bit operations, then integer division and multiplication can be used as follows.

From Unicode UCS-4 to UTF-8:
Start with the Unicode number expressed as a decimal number and call this ud.

If ud <128 (7F hex) then UTF-8 is 1 byte long, the value of ud.

If ud >=128 and <=2047 (7FF hex) then UTF-8 is 2 bytes long.
   byte 1 = 192 + (ud div 64)
   byte 2 = 128 + (ud mod 64)

If ud >=2048 and <=65535 (FFFF hex) then UTF-8 is 3 bytes long.
   byte 1 = 224 + (ud div 4096)
   byte 2 = 128 + ((ud div 64) mod 64)
   byte 3 = 128 + (ud mod 64)

If ud >=65536 and <=2097151 (1FFFFF hex) then UTF-8 is 4 bytes long.
   byte 1 = 240 + (ud div 262144)
   byte 2 = 128 + ((ud div 4096) mod 64)
   byte 3 = 128 + ((ud div 64) mod 64)
   byte 4 = 128 + (ud mod 64)

If ud >=2097152 and <=67108863 (3FFFFFF hex) then UTF-8 is 5 bytes long.
   byte 1 = 248 + (ud div 16777216)
   byte 2 = 128 + ((ud div 262144) mod 64)
   byte 3 = 128 + ((ud div 4096) mod 64)
   byte 4 = 128 + ((ud div 64) mod 64)
   byte 5 = 128 + (ud mod 64)

If ud >=67108864 and <=2147483647 (7FFFFFFF hex) then UTF-8 is 6 bytes long.
   byte 1 = 252 + (ud div 1073741824)
   byte 2 = 128 + ((ud div 16777216) mod 64)
   byte 3 = 128 + ((ud div 262144) mod 64)
   byte 4 = 128 + ((ud div 4096) mod 64)
   byte 5 = 128 + ((ud div 64) mod 64)
   byte 6 = 128 + (ud mod 64)

The operation div means integer division and mod means the rest after integer division.
For positive numbers a div b = int(a/b) and a mod b = (a/b-int(a/b))*b.
UTF-8 sequences of 5 bytes and longer are at the moment not supported by the regular browsers.
The highest character position defined in Unicode 3.2 is number 10FFFF hex (1114111 dec) in a 'private use' area. The highest character with an actual glyph is number E007F hex (917631 dec), the CANCEL TAG character. In Unicode 6.1 there are still no characters defined above 200000 hex.
Please note that at the moment UTF-8 is only defined for number series from 1 to 4 bytes long. What will happen when the Unicode region above 200000 hex is filled, is not known. It is possible that UTF-8 will be extended to 6 byte series, but this is far from certain. That means that the algorithm given above should throw an error if ud >=2097152.

From UTF-8 to Unicode UCS-4:
Let's take a UTF-8 byte sequence. The first byte in a new sequence will tell us how long the sequence is. Let's call the subsequent decimal bytes z y x w v u.

If z is between and including 0 - 127, then there is 1 byte z. The decimal Unicode value ud = the value of z.

If z is between and including 192 - 223, then there are 2 bytes z y; ud = (z-192)*64 + (y-128)

If z is between and including 224 - 239, then there are 3 bytes z y x; ud = (z-224)*4096 + (y-128)*64 + (x-128)

If z is between and including 240 - 247, then there are 4 bytes z y x w; ud = (z-240)*262144 + (y-128)*4096 + (x-128)*64 + (w-128)

If z is between and including 248 - 251, then there are 5 bytes z y x w v; ud = (z-248)*16777216 + (y-128)*262144 + (x-128)*4096 + (w-128)*64 + (v-128)

If z is 252 or 253, then there are 6 bytes z y x w v u; ud = (z-252)*1073741824 + (y-128)*16777216 + (x-128)*262144 + (w-128)*4096 + (v-128)*64 + (u-128)

If z = 254 or 255 then there is something wrong!
Please note that at the moment UTF-8 is only defined for number series from 1 to 4 bytes long. What will happen when the Unicode region above 200000 hex is filled, is not known. It is possible that UTF-8 will be extended to 6 byte series, but this is far from certain. That means that the algorithm given above should throw an error if z >=248.

Example: take the decimal Unicode designation 8482 (decimal), which is for the trademark sign. This number is larger than 2048, so we get three numbers.
The first number is 224 + (8482 div 4096) = 224 + 2 = 226.
The second number is 128 + (8482 div 64) mod 64) = 128 + (132 mod 64) = 128 + 4 = 132.
The third number is 128 + (8482 mod 64) = 128 + 34 = 162.
Now the other way round. We see the numbers 226, 132 and 162. What is the decimal Unicode value?
In this case: (226-224)*4096+(132-128)*64+(162-128) = 8482.
And the conversion between hexadecimal and decimal? Come on, this is not a math tutorial! In case you don't know, use a calculator.

References
More information about the UTF-8 encoding can be found in:
Request for Comments No. 3629, UTF-8, a transformation format of ISO 10646.
The page you are reading now is encoded in the standard Windows Roman encoding, 'code page 1252'. The unicode definition can be found at:
Windows code page 1252, Unicode encodings
Our alternative page (utf8tblMac.html) is encoded in the Apple Roman Unicode encoding.
This encoding scheme can also be found at the unicode organization:
Apple Roman Unicode encoding.
That document describes the latest Apple character set, as used by the Apple Mac OS Text Encoding Converter software version 1.5 and above.
A remark about that encoding: code position 0xDB is now used for the EURO SIGN, but a couple of years ago this position was used for the CURRENCY SIGN, as originally defined.

And now where you have been waiting for, the complete UTF-8 table for 1-byte Unicode characters from 128 decimal to 255.
This table follows the Windows 1252 encoding scheme.

chdechxU-hexU-decUTF-decUTF-hxlitUnicode namePostScript name
1288000C4196195.132C384ÄEURO SIGNEuro
1298100C5197195.133C385Ã…not defined
1308200C7199195.135C387ÇSINGLE LOW-9 QUOTATION MARKquotesinglbase
ƒ1318300C9201195.137C389ÉLATIN SMALL LETTER F WITH HOOKflorin
1328400D1209195.145C391ÑDOUBLE LOW-9 QUOTATION MARKquotedblbase
1338500D6214195.150C396ÖHORIZONTAL ELLIPSISellipsis
1348600DC220195.156C39CÜDAGGERdagger
1358700E1225195.161C3A1áDOUBLE DAGGERdaggerdbl
ˆ1368800E0224195.160C3A0àMODIFIER LETTER CIRCUMFLEX ACCENTcircumflex
1378900E2226195.162C3A2âPER MILLE SIGNperthousand
Š1388A00E4228195.164C3A4äLATIN CAPITAL LETTER S WITH CARONScaron
1398B00E3227195.163C3A3ãSINGLE LEFT-POINTING ANGLE QUOTATION MARKguilsinglleft
Œ1408C00E5229195.165C3A5Ã¥LATIN CAPITAL LIGATURE OEOE
1418D00E7231195.167C3A7çnot defined
Ž1428E00E9233195.169C3A9éLATIN CAPITAL LETTER Z WITH CARONZcaron
1438F00E8232195.168C3A8ènot defined
1449000EA234195.170C3AAênot defined
1459100EB235195.171C3ABëLEFT SINGLE QUOTATION MARKquoteleft
1469200ED237195.173C3ADíRIGHT SINGLE QUOTATION MARKquoteright
1479300EC236195.172C3ACìLEFT DOUBLE QUOTATION MARKquotedblleft
1489400EE238195.174C3AEîRIGHT DOUBLE QUOTATION MARKquotedblright
1499500EF239195.175C3AFïBULLETbullet
1509600F1241195.177C3B1ñEN DASHendash
1519700F3243195.179C3B3óEM DASHemdash
˜1529800F2242195.178C3B2òSMALL TILDEtilde
1539900F4244195.180C3B4ôTRADE MARK SIGNtrademark
š1549A00F6246195.182C3B6öLATIN SMALL LETTER S WITH CARONscaron
1559B00F5245195.181C3B5õSINGLE RIGHT-POINTING ANGLE QUOTATION MARKguilsinglright
œ1569C00FA250195.186C3BAúLATIN SMALL LIGATURE OEoe
1579D00F9249195.185C3B9ùnot defined
ž1589E00FB251195.187C3BBûLATIN SMALL LETTER Z WITH CARONzcaron
Ÿ1599F00FC252195.188C3BCüLATIN CAPITAL LETTER Y WITH DIAERESISYdieresis
chdechxU-hexU-decUTF-decUTF-hxlitUnicode namePostScript name
 160A020208224226.128.160E280A0†NO-BREAK SPACEnobreakspace
¡161A100B0176194.176C2B0°INVERTED EXCLAMATION MARKexclamdown
¢162A200A2162194.162C2A2¢CENT SIGNcent
£163A300A3163194.163C2A3£POUND SIGNsterling
¤164A400A7167194.167C2A7§CURRENCY SIGNcurrency
¥165A520228226226.128.162E280A2•YEN SIGNyen
¦166A600B6182194.182C2B6¶BROKEN BARbrokenbar
§167A700DF223195.159C39FßSECTION SIGNsection
¨168A800AE174194.174C2AE®DIAERESISdieresis
©169A900A9169194.169C2A9©COPYRIGHT SIGNcopyright
ª170AA21228482226.132.162E284A2â„¢FEMININE ORDINAL INDICATORordfeminine
«171AB00B4180194.180C2B4´LEFT-POINTING DOUBLE ANGLE QUOTATION MARKguillemotleft
¬172AC00A8168194.168C2A8¨NOT SIGNlogicalnot
­173AD22608800226.137.160E289A0≠SOFT HYPHENhyphen
®174AE00C6198195.134C386ÆREGISTERED SIGNregistered
¯175AF00D8216195.152C398ØMACRONmacron
°176B0221E8734226.136.158E2889E∞DEGREE SIGNdegree
±177B100B1177194.177C2B1±PLUS-MINUS SIGNplusminus
²178B222648804226.137.164E289A4≤SUPERSCRIPT TWOtwosuperior
³179B322658805226.137.165E289A5≥SUPERSCRIPT THREEthreesuperior
´180B400A5165194.165C2A5Â¥ACUTE ACCENTacute
µ181B500B5181194.181C2B5µMICRO SIGNmu
182B622028706226.136.130E28882∂PILCROW SIGNparagraph
·183B722118721226.136.145E28891∑MIDDLE DOTperiodcentered
¸184B8220F8719226.136.143E2888FâˆCEDILLAcedilla
¹185B903C0960207.128CF80Ï€SUPERSCRIPT ONEonesuperior
º186BA222B8747226.136.171E288AB∫MASCULINE ORDINAL INDICATORordmasculine
»187BB00AA170194.170C2AAªRIGHT-POINTING DOUBLE ANGLE QUOTATION MARKguillemotright
¼188BC00BA186194.186C2BAºVULGAR FRACTION ONE QUARTERonequarter
½189BD03A9937206.169CEA9ΩVULGAR FRACTION ONE HALFonehalf
¾190BE00E6230195.166C3A6æVULGAR FRACTION THREE QUARTERSthreequarters
¿191BF00F8248195.184C3B8øINVERTED QUESTION MARKquestiondown
chdechxU-hexU-decUTF-decUTF-hxlitUnicode namePostScript name
À192C000BF191194.191C2BF¿LATIN CAPITAL LETTER A WITH GRAVEAgrave
Á193C100A1161194.161C2A1¡LATIN CAPITAL LETTER A WITH ACUTEAacute
Â194C200AC172194.172C2AC¬LATIN CAPITAL LETTER A WITH CIRCUMFLEXAcircumflex
Ã195C3221A8730226.136.154E2889A√LATIN CAPITAL LETTER A WITH TILDEAtilde
Ä196C40192402198.146C692Æ’LATIN CAPITAL LETTER A WITH DIAERESISAdieresis
Å197C522488776226.137.136E28988≈LATIN CAPITAL LETTER A WITH RING ABOVEAring
Æ198C622068710226.136.134E28886∆LATIN CAPITAL LETTER AEAE
Ç199C700AB171194.171C2AB«LATIN CAPITAL LETTER C WITH CEDILLACcedilla
È200C800BB187194.187C2BB»LATIN CAPITAL LETTER E WITH GRAVEEgrave
É201C920268230226.128.166E280A6…LATIN CAPITAL LETTER E WITH ACUTEEacute
Ê202CA00A0160194.160C2A0 LATIN CAPITAL LETTER E WITH CIRCUMFLEXEcircumflex
Ë203CB00C0192195.128C380ÀLATIN CAPITAL LETTER E WITH DIAERESISEdieresis
Ì204CC00C3195195.131C383ÃLATIN CAPITAL LETTER I WITH GRAVEIgrave
Í205CD00D5213195.149C395ÕLATIN CAPITAL LETTER I WITH ACUTEIacute
Î206CE0152338197.146C592Å’LATIN CAPITAL LETTER I WITH CIRCUMFLEXIcircumflex
Ï207CF0153339197.147C593Å“LATIN CAPITAL LETTER I WITH DIAERESISIdieresis
Ð208D020138211226.128.147E28093–LATIN CAPITAL LETTER ETHEth
Ñ209D120148212226.128.148E28094—LATIN CAPITAL LETTER N WITH TILDENtilde
Ò210D2201C8220226.128.156E2809C“LATIN CAPITAL LETTER O WITH GRAVEOgrave
Ó211D3201D8221226.128.157E2809Dâ€LATIN CAPITAL LETTER O WITH ACUTEOacute
Ô212D420188216226.128.152E28098‘LATIN CAPITAL LETTER O WITH CIRCUMFLEXOcircumflex
Õ213D520198217226.128.153E28099’LATIN CAPITAL LETTER O WITH TILDEOtilde
Ö214D600F7247195.183C3B7÷LATIN CAPITAL LETTER O WITH DIAERESISOdieresis
×215D725CA9674226.151.138E2978Aâ—ŠMULTIPLICATION SIGNmultiply
Ø216D800FF255195.191C3BFÿLATIN CAPITAL LETTER O WITH STROKEOslash
Ù217D90178376197.184C5B8ŸLATIN CAPITAL LETTER U WITH GRAVEUgrave
Ú218DA20448260226.129.132E28184â„LATIN CAPITAL LETTER U WITH ACUTEUacute
Û219DB20AC8364226.130.172E282AC€LATIN CAPITAL LETTER U WITH CIRCUMFLEXUcircumflex
Ü220DC20398249226.128.185E280B9‹LATIN CAPITAL LETTER U WITH DIAERESISUdieresis
Ý221DD203A8250226.128.186E280BA›LATIN CAPITAL LETTER Y WITH ACUTEYacute
Þ222DEFB0164257239.172.129EFAC81ï¬LATIN CAPITAL LETTER THORNThorn
ß223DFFB0264258239.172.130EFAC82flLATIN SMALL LETTER SHARP Sgermandbls
chdechxU-hexU-decUTF-decUTF-hxlitUnicode namePostScript name
à224E020218225226.128.161E280A1‡LATIN SMALL LETTER A WITH GRAVEagrave
á225E100B7183194.183C2B7·LATIN SMALL LETTER A WITH ACUTEaacute
â226E2201A8218226.128.154E2809A‚LATIN SMALL LETTER A WITH CIRCUMFLEXacircumflex
ã227E3201E8222226.128.158E2809E„LATIN SMALL LETTER A WITH TILDEatilde
ä228E420308240226.128.176E280B0‰LATIN SMALL LETTER A WITH DIAERESISadieresis
å229E500C2194195.130C382ÂLATIN SMALL LETTER A WITH RING ABOVEaring
æ230E600CA202195.138C38AÊLATIN SMALL LETTER AEae
ç231E700C1193195.129C381ÃLATIN SMALL LETTER C WITH CEDILLAccedilla
è232E800CB203195.139C38BËLATIN SMALL LETTER E WITH GRAVEegrave
é233E900C8200195.136C388ÈLATIN SMALL LETTER E WITH ACUTEeacute
ê234EA00CD205195.141C38DÃLATIN SMALL LETTER E WITH CIRCUMFLEXecircumflex
ë235EB00CE206195.142C38EÃŽLATIN SMALL LETTER E WITH DIAERESISedieresis
ì236EC00CF207195.143C38FÃLATIN SMALL LETTER I WITH GRAVEigrave
í237ED00CC204195.140C38CÃŒLATIN SMALL LETTER I WITH ACUTEiacute
î238EE00D3211195.147C393ÓLATIN SMALL LETTER I WITH CIRCUMFLEXicircumflex
ï239EF00D4212195.148C394ÔLATIN SMALL LETTER I WITH DIAERESISidieresis
ð240F0F8FF63743239.163.191EFA3BFLATIN SMALL LETTER ETHeth
ñ241F100D2210195.146C392Ã’LATIN SMALL LETTER N WITH TILDEntilde
ò242F200DA218195.154C39AÚLATIN SMALL LETTER O WITH GRAVEograve
ó243F300DB219195.155C39BÛLATIN SMALL LETTER O WITH ACUTEoacute
ô244F400D9217195.153C399ÙLATIN SMALL LETTER O WITH CIRCUMFLEXocircumflex
õ245F50131305196.177C4B1ıLATIN SMALL LETTER O WITH TILDEotilde
ö246F602C6710203.134CB86ˆLATIN SMALL LETTER O WITH DIAERESISodieresis
÷247F702DC732203.156CB9CËœDIVISION SIGNdivide
ø248F800AF175194.175C2AF¯LATIN SMALL LETTER O WITH STROKEoslash
ù249F902D8728203.152CB98˘LATIN SMALL LETTER U WITH GRAVEugrave
ú250FA02D9729203.153CB99Ë™LATIN SMALL LETTER U WITH ACUTEuacute
û251FB02DA730203.154CB9AËšLATIN SMALL LETTER U WITH CIRCUMFLEXucircumflex
ü252FC00B8184194.184C2B8¸LATIN SMALL LETTER U WITH DIAERESISudieresis
ý253FD02DD733203.157CB9DËLATIN SMALL LETTER Y WITH ACUTEyacute
þ254FE02DB731203.155CB9BË›LATIN SMALL LETTER THORNthorn
ÿ255FF02C7711203.135CB87ˇLATIN SMALL LETTER Y WITH DIAERESISydieresis

The following characters can normally not be displayed on a Macintosh. These are the Windows characters that can be found in the Windows code page cp1252, for which no equivalent characters can be found in the MacRoman encoding below a numerical character value of 256.
Modern Macintosh browsers however, are able to display them in another encoding scheme, especially when the user has installed the full Apple Language Kits, found on the Mac OS 9+ CD.

chdechxU-hexU-decUTF-decUTF-hxlitUnicode namePostScript name
Š1388A0160352197.160C5A0Å LATIN CAPITAL LETTER S WITH CARONScaron
š1549A0161353197.161C5A1Å¡LATIN SMALL LETTER S WITH CARONscaron
¦166A6A6166194.166C2A6¦BROKEN BARbrokenbar
²178B2B2178194.178C2B2²SUPERSCRIPT TWOtwosuperior
³179B3B3179194.179C2B3³SUPERSCRIPT THREEthreesuperior
¹185B9B9185194.185C2B9¹SUPERSCRIPT ONEonesuperior
¼188BCBC188194.188C2BC¼VULGAR FRACTION ONE QUARTERonequarter
½189BDBD189194.189C2BD½VULGAR FRACTION ONE HALFonehalf
¾190BEBE190194.190C2BE¾VULGAR FRACTION THREE QUARTERSthreequarters
Ð208D0D0208195.144C390ÃLATIN CAPITAL LETTER ETHEth
×215D7D7215195.151C397×MULTIPLICATION SIGNmultiply
Ý221DDDD221195.157C39DÃLATIN CAPITAL LETTER Y WITH ACUTEYacute
Þ222DEDE222195.158C39EÞLATIN CAPITAL LETTER THORNThorn
ð240F0F0240195.176C3B0ðLATIN SMALL LETTER ETHeth
ý253FDFD253195.189C3BDýLATIN SMALL LETTER Y WITH ACUTEyacute
þ254FEFE254195.190C3BEþLATIN SMALL LETTER THORNthorn

The following characters can normally not be displayed on a Windows page with character encoding Windows-1252 in effect. These are the Macintosh characters that can be found in the MacRoman code page, for which no equivalent characters can be found in the Windows-1252 encoding below a numerical character value of 256.
Modern Windows and other browsers however, are able to display them in another encoding scheme.
In the following table we have encoded the characters in the first column 'ch' with numerical Unicode entities (see for the values the column 'U-hex' or 'U-dec'). The numbers in the colums 'dec' and 'hex' are the positions in the MacRoman encoding where these characters can be found.

chdechxU-hexU-decUTF-decUTF-hxlitUnicode namePostScript name
173AD22608800226.137.160E289A0≠NOT EQUAL TOnotequal
176B0221E8734226.136.158E2889E∞INFINITYinfinity
178B222648804226.137.164E289A4≤LESS-THAN OR EQUAL TOlessequal
179B322658805226.137.165E289A5≥GREATER-THAN OR EQUAL TOgreaterequal
182B622028706226.136.130E28882∂PARTIAL DIFFERENTIALpartialdiff
183B722118721226.136.145E28891∑N-ARY SUMMATIONsummation
184B8220F8719226.136.143E2888FâˆN-ARY PRODUCTproduct
π185B903C0960207.128CF80Ï€GREEK SMALL LETTER PIpi
186BA222B8747226.136.171E288AB∫INTEGRALintegral
Ω189BD03A9937206.169CEA9ΩGREEK CAPITAL LETTER OMEGAOmega
195C3221A8730226.136.154E2889A√SQUARE ROOTradical
197C522488776226.137.136E28988≈ALMOST EQUAL TOapproxequal
198C622068710226.136.134E28886∆INCREMENTDelta
215D725CA9674226.151.138E2978Aâ—ŠLOZENGElozenge
218DA20448260226.129.132E28184â„FRACTION SLASHfraction
222DEFB0164257239.172.129EFAC81ï¬LATIN SMALL LIGATURE FIfi
223DFFB0264258239.172.130EFAC82flLATIN SMALL LIGATURE FLfl
240F0F8FF63743239.163.191EFA3BFApple logoapple
ı245F50131305196.177C4B1ıLATIN SMALL LETTER DOTLESS Idotlessi
˘249F902D8728203.152CB98˘BREVEbreve
˙250FA02D9729203.153CB99Ë™DOT ABOVEdotaccent
˚251FB02DA730203.154CB9AËšRING ABOVEring
˝253FD02DD733203.157CB9DËDOUBLE ACUTE ACCENThungarumlaut
˛254FE02DB731203.155CB9BË›OGONEKogonek
ˇ255FF02C7711203.135CB87ˇCARONcaron

© Oscar van Vlijmen, June 2000
URL of this page: http://home.telfort.nl/~t876506/utf8tbl.html
Last update: 2012-07-17