Description Code Entity name
=================================== ============ ==============
quotation mark " --> " " --> "
ampersand & --> & & --> &
less-than sign < --> < < --> <
greater-than sign > --> > > --> >
Description Char Code Entity name
=================================== ==== ============ ==============
euro sign � &128; -->
undefined � &129; -->
single low-9 quotation mark � &130; -->
latin small letter f with hook � &131; -->
double low-9 quotation mark � &132; -->
horizontal ellipsis � &133; -->
dagger � &134; -->
double dagger � &135; -->
modifier letter circumflex accent � &136; -->
per mille sign � &137; -->
latin capital letter s with caron � &138; -->
single left-pointing angle quote mark � &139; -->
latin capital ligature oe � &140; -->
undefined � &141; -->
latin capital letter z with caron � &142; -->
undefined � &143; -->
undefined � &144; -->
left single quotation mark � &145; -->
right single quotation mark � &146; -->
left double quotation mark � &147; -->
right double quotation mark � &148; -->
bullet � &149; -->
en dash � &150; -->
em dash � &151; -->
small tilde � &152; -->
trade mark sign � &153; -->
latin small letter s with caron � &154; -->
single right-pointing angle quote mark � &155; -->
latin small ligature oe � &156; -->
undefined � &157; -->
latin small letter z with caron � &158; -->
latin capital letter y with diaeresis � &159; -->
non-breaking space �   --> -->
inverted exclamation � ¡ --> ¡ ¡ --> ¡
cent sign � ¢ --> ¢ ¢ --> ¢
pound sterling � £ --> £ £ --> £
general currency sign � ¤ --> ¤ ¤ --> ¤
yen sign � ¥ --> ¥ ¥ --> ¥
broken vertical bar � ¦ --> ¦ ¦ --> ¦
Non-standard &brkbar; --> &brkbar;
section sign � § --> § § --> §
umlaut (dieresis) � ¨ --> ¨ ¨ --> ¨
Non-standard ¨ --> ¨
copyright � © --> © © --> ©
feminine ordinal � ª --> ª ª --> ª
left angle quote, guillemotleft � « --> « « --> «
not sign � ¬ --> ¬ ¬ --> ¬
soft hyphen � ­ --> ­ -->
registered trademark � ® --> ® ® --> ®
macron accent � ¯ --> ¯ ¯ --> ¯
Non-standard &hibar; --> &hibar;
degree sign � ° --> ° ° --> °
plus or minus � ± --> ± ± --> ±
superscript two � ² --> ² ² --> ²
superscript three � ³ --> ³ ³ --> ³
acute accent � ´ --> ´ ´ --> ´
micro sign � µ --> µ µ --> µ
paragraph sign � ¶ --> ¶ ¶ --> ¶
middle dot � · --> · · --> ·
cedilla � ¸ --> ¸ ¸ --> ¸
superscript one � ¹ --> ¹ ¹ --> ¹
masculine ordinal � º --> º º --> º
right angle quote, guillemotright � » --> » » --> »
fraction one-fourth � ¼ --> ¼ ¼ --> ¼
fraction one-half � ½ --> ½ ½ --> ½
fraction three-fourths � ¾ --> ¾ ¾ --> ¾
inverted question mark � ¿ --> ¿ ¿ --> ¿
capital A, grave accent � À --> À À --> À
capital A, acute accent � Á --> Á Á --> Á
capital A, circumflex accent � Â --> Â Â --> Â
capital A, tilde � Ã --> Ã Ã --> Ã
capital A, dieresis or umlaut mark � Ä --> Ä Ä --> Ä
capital A, ring � Å --> Å Å --> Å
capital AE diphthong (ligature) � Æ --> Æ Æ --> Æ
capital C, cedilla � Ç --> Ç Ç --> Ç
capital E, grave accent � È --> È È --> È
capital E, acute accent � É --> É É --> É
capital E, circumflex accent � Ê --> Ê Ê --> Ê
capital E, dieresis or umlaut mark � Ë --> Ë Ë --> Ë
capital I, grave accent � Ì --> Ì Ì --> Ì
capital I, acute accent � Í --> Í Í --> Í
capital I, circumflex accent � Î --> Î Î --> Î
capital I, dieresis or umlaut mark � Ï --> Ï Ï --> Ï
capital Eth, Icelandic � Ð --> Ð Ð --> Ð
Non-standard Đ --> Đ
capital N, tilde � Ñ --> Ñ Ñ --> Ñ
capital O, grave accent � Ò --> Ò Ò --> Ò
capital O, acute accent � Ó --> Ó Ó --> Ó
capital O, circumflex accent � Ô --> Ô Ô --> Ô
capital O, tilde � Õ --> Õ Õ --> Õ
capital O, dieresis or umlaut mark � Ö --> Ö Ö --> Ö
multiply sign � × --> × × --> ×
capital O, slash � Ø --> Ø Ø --> Ø
capital U, grave accent � Ù --> Ù Ù --> Ù
capital U, acute accent � Ú --> Ú Ú --> Ú
capital U, circumflex accent � Û --> Û Û --> Û
capital U, dieresis or umlaut mark � Ü --> Ü Ü --> Ü
capital Y, acute accent � Ý --> Ý Ý --> Ý
capital THORN, Icelandic � Þ --> Þ Þ --> Þ
small sharp s, German (sz ligature) � ß --> ß ß --> ß
small a, grave accent � à --> à à --> à
small a, acute accent � á --> á á --> á
small a, circumflex accent � â --> â â --> â
small a, tilde � ã --> ã ã --> ã
small a, dieresis or umlaut mark � ä --> ä ä --> ä
small a, ring � å --> å å --> å
small ae diphthong (ligature) � æ --> æ æ --> æ
small c, cedilla � ç --> ç ç --> ç
small e, grave accent � è --> è è --> è
small e, acute accent � é --> é é --> é
small e, circumflex accent � ê --> ê ê --> ê
small e, dieresis or umlaut mark � ë --> ë ë --> ë
small i, grave accent � ì --> ì ì --> ì
small i, acute accent � í --> í í --> í
small i, circumflex accent � î --> î î --> î
small i, dieresis or umlaut mark � ï --> ï ï --> ï
small eth, Icelandic � ð --> ð ð --> ð
small n, tilde � ñ --> ñ ñ --> ñ
small o, grave accent � ò --> ò ò --> ò
small o, acute accent � ó --> ó ó --> ó
small o, circumflex accent � ô --> ô ô --> ô
small o, tilde � õ --> õ õ --> õ
small o, dieresis or umlaut mark � ö --> ö ö --> ö
division sign � ÷ --> ÷ ÷ --> ÷
small o, slash � ø --> ø ø --> ø
small u, grave accent � ù --> ù ù --> ù
small u, acute accent � ú --> ú ú --> ú
small u, circumflex accent � û --> û û --> û
small u, dieresis or umlaut mark � ü --> ü ü --> ü
small y, acute accent � ý --> ý ý --> ý
small thorn, Icelandic � þ --> þ þ --> þ
small y, dieresis or umlaut mark � ÿ --> ÿ ÿ --> ÿ
-->
"what your browser does with it"
-->
"what your browser does with it"
÷ -->
÷" in the 4th column, this means your browser
doesn't know about the entity name "divide" and just puts it
literally.
This table grew out of an overview of the "ISO
Latin-1 Character Set" overview related to the Hyper-G Text Format
(HTF).
The entity names &brkbar; and Đ
seem to be unique to HTF.
The entity name &hibar; has been supported by X Mosaic
but seems to be replaced with ¯.
The entity names ¨ and ¨ should
be equivalent.
The standards stuff:
The
HTML 2.0 Standard
includes a section on
Character Entity Sets
and an overview on the
HTML Coded Character Set
(The entity names are derived from ISO 8879).
Or have a look at the
Latin-1 Character Entities
as listed in an draft for the
HTML 3.0 specification.
The
Appendix II
of CERN's
HTML+ Discussion Document
contains a
table
(in PostScript format) of the proposed character entities for HTML+ and their
corresponding character codes for Unicode and the Adobe Latin-1 & Symbol
character sets.
Please note that there is nothing wrong with using characters of ISO Latin-1 above 127: the normal transmission protocol for the WWW, HTTP/1.0, uses the 8bit ISO latin-1 as default encoding. (Thanks to Roman Czyborra for pointing this out!)
Other information:
server/ddx/sun/Compose.list.