[Mirrored from: http://www.w3.org/pub/WWW/TR/WD-entities on April 15, 1997]
This draft is work under review by the W3C HTML Working Group, for potential incorporation in an upcoming version of the HTML specification, code named Cougar. Please remember this is subject to change at any time, and may be updated, replaced or obsoleted by other documents. It is inappropriate to use W3C Working Drafts as reference material or to cite them as other than "work in progress".
A list of current W3C Working Drafts can be found at http://www.w3.org/pub/WWW/TR. This is work in progress and does not imply endorsement by, or the consensus of, either W3C or members of the HTML working group. Further information about Cougar is available at http://www.w3.org/pub/WWW/MarkUp/Cougar/.
Please send detailed comments to www-html-editor@w3.org. We cannot garantee a personal response, but summaries will be maintained off the Cougar page. Public discussion on HTML features takes place on www-html@w3.org. To subscribe send a message to www-html-request@w3.org with subscribe in the subject.
This specification extends HTML to support additional named entities for all characters in ISO 8859-1, all characters representable by glyphs in the Adobe Symbol font, entities required for Internationalisation - in particular for bidirectional text - and characters from that portion of CP-1250 which lie outside the character repertoire of ISO 8859-1.
All new entities in this section produce characters which can already be represented by all conforming HTML 2.0 User Agents (UAs), but currently in a less convenient form. For example they allow typing ÷ instead of ÷ to obtain a division sign (÷). The new named entities conform to the Proposed Entities section of the HTML 2.0 specification.
Adding support for these entities to a UA merely entails recognising the entity names and converting them to characters which are within the repertoire of ISO-8859-1.
Support for these entities may be checked with a test file, entitytest-8859.html
The list of entities is given in Appendix @@ or as a text file, HTMLlat1 for use in SGML systems.
This section adds new named entities for all characters representable by glyphs in the widely available Adobe Symbol font as described in table E.11 of [Adobe90].
Some glyphs in that font are not represented here, because they do not correspond to characters. For example, the 'top portion of a large integral sign' is one third of a composite glyph which is used to represent an integral sign; there is however an entity to represent a complete integral sign. Some glyphs are not represented here because, although they do correspond to characters, those characters are already represented within the repertoire of ISO 8859-1. For example (!, #, ©).
Entities in this section refers to characters which are outside the repertoire of ISO 8859-1 but within the repertoire of ISO-10646. These characters may be ustd in an HTML document in one of three ways:
Adding support for these entities to a UA may be achieved by supporting full ISO-10646 or by other means. Display of glyphs for these characters may be obtained by being able to display the relevant ISO-10646 characters or by other means, such as internally mapping the listed entities, NCRs and characters to the appropriate position in some font which contains the requisite glyphs.
Note: This entity set contains all the letters used in modern Greek. However, it does not include Greek punctuation, precomposed accented characters nor the non-spacing accents (tonos, dialytika) required to compose them. There are no archaic letters, Coptic-unique letters, or precomposed letters for Polytonic Greek.
The entities defined here are not intended for the representation of modern Greek text and would not be an efficient representation; rather, they are intended for occasional Greek letters used in technical and mathermatical works.
The list of entities is given in Appendix @@+1 or as a text file, HTMLsymbol for use in SGML systems.
This section adds named entities for escaping markup-significant characters (these are the same as those in HTML 2.0 and 3.2), for denoting spaces and dashes, and for disambiguating bidirectional text (these are the same as those in RFC 2070).
Entities are also added forthe remaining characters occuring in CP-1252 which do not occur in the HTMLlat1 or HTMLsymbol entity sets. These all occur in the 128 to 159 range within the cp-1252 charset. These entities permit the characters to be denoted in a platform-independent manner.
Note: some authoring tools on the MS Windows platform incorrectly generate numeric character references (NCRs) in the range € to Ÿ which are illegal Unicode characters. NCRs always refer to the 'Document Character Set' which, for HTML, is always ISO-10646 (which has the same characters as Unicode). NCRs do not refer to code positions in the charset used to transmit the document. Using the entity names defined here avoids such errors by refering the characters to their Unicode values.
The list of entities is given in Appendix @@+2 or as a text file, HTMLspecial for use in SGML systems.
Adding support for these entities to a UA may be achieved by supporting full ISO-10646 or by other means. Display of glyphs for these characters may be obtained by being able to display the relevant ISO-10646 characters or by other means, such as internally mapping the listed entities, NCRs and characters to the appropriate position in some font which contains the requisite glyphs.
This entity set is identical to that in the Proposed Entities section of the HTML 2.0 specification
<!-- Portions © International Organization for Standardization 1986 Permission to copy in any form is granted for use with conforming SGML systems and applications as defined in ISO 8879, provided this notice is included in all copies. --> <!-- Character entity set. Typical invocation: <!ENTITY % HTMLlat1 PUBLIC "-//W3C//ENTITIES Full Latin 1//EN//HTML"> %HTMLlat1; --> <!ENTITY nbsp CDATA " " -- no-break space --> <!ENTITY iexcl CDATA "¡" -- inverted exclamation mark --> <!ENTITY cent CDATA "¢" -- cent sign --> <!ENTITY pound CDATA "£" -- pound sterling sign --> <!ENTITY curren CDATA "¤" -- general currency sign --> <!ENTITY yen CDATA "¥" -- yen sign --> <!ENTITY brvbar CDATA "¦" -- broken (vertical) bar --> <!ENTITY sect CDATA "§" -- section sign --> <!ENTITY uml CDATA "¨" -- umlaut (dieresis) --> <!ENTITY copy CDATA "©" -- copyright sign --> <!ENTITY ordf CDATA "ª" -- ordinal indicator, feminine --> <!ENTITY laquo CDATA "«" -- angle quotation mark, left --> <!ENTITY not CDATA "¬" -- not sign --> <!ENTITY shy CDATA "­" -- soft hyphen --> <!ENTITY reg CDATA "®" -- registered sign --> <!ENTITY macr CDATA "¯" -- macron --> <!ENTITY deg CDATA "°" -- degree sign --> <!ENTITY plusmn CDATA "±" -- plus-or-minus sign --> <!ENTITY sup2 CDATA "²" -- superscript two --> <!ENTITY sup3 CDATA "³" -- superscript three --> <!ENTITY acute CDATA "´" -- acute accent --> <!ENTITY micro CDATA "µ" -- micro sign --> <!ENTITY para CDATA "¶" -- pilcrow (paragraph sign) --> <!ENTITY middot CDATA "·" -- middle dot --> <!ENTITY cedil CDATA "¸" -- cedilla --> <!ENTITY sup1 CDATA "¹" -- superscript one --> <!ENTITY ordm CDATA "º" -- ordinal indicator, masculine --> <!ENTITY raquo CDATA "»" -- angle quotation mark, right --> <!ENTITY frac14 CDATA "¼" -- fraction one-quarter --> <!ENTITY frac12 CDATA "½" -- fraction one-half --> <!ENTITY frac34 CDATA "¾" -- fraction three-quarters --> <!ENTITY iquest CDATA "¿" -- inverted question mark --> <!ENTITY Agrave CDATA "À" -- capital A, grave accent --> <!ENTITY Aacute CDATA "Á" -- capital A, acute accent --> <!ENTITY Acirc CDATA "Â" -- capital A, circumflex accent --> <!ENTITY Atilde CDATA "Ã" -- capital A, tilde --> <!ENTITY Auml CDATA "Ä" -- capital A, dieresis or umlaut mark --> <!ENTITY Aring CDATA "Å" -- capital A, ring --> <!ENTITY AElig CDATA "Æ" -- capital AE diphthong (ligature) --> <!ENTITY Ccedil CDATA "Ç" -- capital C, cedilla --> <!ENTITY Egrave CDATA "È" -- capital E, grave accent --> <!ENTITY Eacute CDATA "É" -- capital E, acute accent --> <!ENTITY Ecirc CDATA "Ê" -- capital E, circumflex accent --> <!ENTITY Euml CDATA "Ë" -- capital E, dieresis or umlaut mark --> <!ENTITY Igrave CDATA "Ì" -- capital I, grave accent --> <!ENTITY Iacute CDATA "&4amp;#205;" -- capital I, acute accent --> <!ENTITY Icirc CDATA "Î" -- capital I, circumflex accent --> <!ENTITY Iuml CDATA "Ï" -- capital I, dieresis or umlaut mark --> <!ENTITY ETH CDATA "Ð" -- capital Eth, Icelandic --> <!ENTITY Ntilde CDATA "Ñ" -- capital N, tilde --> <!ENTITY Ograve CDATA "Ò" -- capital O, grave accent --> <!ENTITY Oacute CDATA "Ó" -- capital O, acute accent --> <!ENTITY Ocirc CDATA "Ô" -- capital O, circumflex accent --> <!ENTITY Otilde CDATA "Õ" -- capital O, tilde --> <!ENTITY Ouml CDATA "Ö" -- capital O, dieresis or umlaut mark --> <!ENTITY times CDATA "×" -- multiply sign --> <!ENTITY Oslash CDATA "Ø" -- capital O, slash --> <!ENTITY Ugrave CDATA "Ù" -- capital U, grave accent --> <!ENTITY Uacute CDATA "Ú" -- capital U, acute accent --> <!ENTITY Ucirc CDATA "Û" -- capital U, circumflex accent --> <!ENTITY Uuml CDATA "Ü" -- capital U, dieresis or umlaut mark --> <!ENTITY Yacute CDATA "Ý" -- capital Y, acute accent --> <!ENTITY THORN CDATA "Þ" -- capital THORN, Icelandic --> <!ENTITY szlig CDATA "ß" -- small sharp s, German (sz ligature) --> <!ENTITY agrave CDATA "à" -- small a, grave accent --> <!ENTITY aacute CDATA "á" -- small a, acute accent --> <!ENTITY acirc CDATA "â" -- small a, circumflex accent --> <!ENTITY atilde CDATA "ã" -- small a, tilde --> <!ENTITY auml CDATA "ä" -- small a, dieresis or umlaut mark --> <!ENTITY aring CDATA "å" -- small a, ring --> <!ENTITY aelig CDATA "æ" -- small ae diphthong (ligature) --> <!ENTITY ccedil CDATA "ç" -- small c, cedilla --> <!ENTITY egrave CDATA "è" -- small e, grave accent --> <!ENTITY eacute CDATA "é" -- small e, acute accent --> <!ENTITY ecirc CDATA "ê" -- small e, circumflex accent --> <!ENTITY euml CDATA "ë" -- small e, dieresis or umlaut mark --> <!ENTITY igrave CDATA "ì" -- small i, grave accent --> <!ENTITY iacute CDATA "í" -- small i, acute accent --> <!ENTITY icirc CDATA "î" -- small i, circumflex accent --> <!ENTITY iuml CDATA "ï" -- small i, dieresis or umlaut mark --> <!ENTITY eth CDATA "ð" -- small eth, Icelandic --> <!ENTITY ntilde CDATA "ñ" -- small n, tilde --> <!ENTITY ograve CDATA "ò" -- small o, grave accent --> <!ENTITY oacute CDATA "ó" -- small o, acute accent --> <!ENTITY ocirc CDATA "ô" -- small o, circumflex accent --> <!ENTITY otilde CDATA "õ" -- small o, tilde --> <!ENTITY ouml CDATA "ö" -- small o, dieresis or umlaut mark --> <!ENTITY divide CDATA "÷" -- divide sign --> <!ENTITY oslash CDATA "ø" -- small o, slash --> <!ENTITY ugrave CDATA "ù" -- small u, grave accent --> <!ENTITY uacute CDATA "ú" -- small u, acute accent --> <!ENTITY ucirc CDATA "û" -- small u, circumflex accent --> <!ENTITY uuml CDATA "ü" -- small u, dieresis or umlaut mark --> <!ENTITY yacute CDATA "ý" -- small y, acute accent --> <!ENTITY thorn CDATA "þ" -- small thorn, Icelandic --> <!ENTITY yuml CDATA "ÿ" -- small y, dieresis or umlaut mark -->
<!-- Mathematical, Greek and Symbolic characters for HTML --> <!-- Character entity set. Typical invocation: <!ENTITY % HTMLsymbol PUBLIC "-//W3C//ENTITIES Symbolic//EN//HTML"> %HTMLsymbol; --> <!-- Portions International Organization for Standardization 1986: Permission to copy in any form is granted for use with conforming SGML systems and applications as defined in ISO 8879, provided this notice is included in all copies. --> <!-- Relevant ISO entity set is given unless names are newly introduced. New names (ie, not in ISO 8879 list) do not clash with any existing ISO 8879 entity names. Unicode character numbers are given for each character, in hex, and are identical for Unicode 1.1 and Unicode 2.0. CDATA values are decimal conversions of the Unicode values. Names are Unicode 2.0 names. --> <!-- Latin Extended-B --> <!ENTITY fnof CDATA "À" -- latin small f with hook, =function, =florin, U0192 ISOtech --> <!-- Greek --> <!ENTITY Alpha CDATA "Α" -- greek capital letter alpha, U0391 --> <!ENTITY Beta CDATA "Β" -- greek capital letter beta, U0392 --> <!ENTITY Gamma CDATA "Γ" -- greek capital letter gamma, U0393 ISOgrk3 --> <!ENTITY Delta CDATA "Δ" -- greek capital letter delta, U0394 ISOgrk3 --> <!ENTITY Epsilon CDATA "Ε" -- greek capital letter epsilon, U0395 --> <!ENTITY Zeta CDATA "Ζ" -- greek capital letter zeta, U0396 --> <!ENTITY Eta CDATA "Η" -- greek capital letter eta, U0397 --> <!ENTITY Theta CDATA "Θ" -- greek capital letter theta, U0398 ISOgrk3 --> <!ENTITY Iota CDATA "Ι" -- greek capital letter iota, U0399 --> <!ENTITY Kappa CDATA "Κ" -- greek capital letter kappa, U039A --> <!ENTITY Lambda CDATA "Λ" -- greek capital letter lambda, U039B ISOgrk3 --> <!ENTITY Mu CDATA "Μ" -- greek capital letter mu, U039C --> <!ENTITY Nu CDATA "Ν" -- greek capital letter nu, U039D --> <!ENTITY Xi CDATA "Ξ" -- greek capital letter xi, U039E ISOgrk3 --> <!ENTITY Omicron CDATA "Ο" -- greek capital letter omicron, U039F --> <!ENTITY Pi CDATA "Π" -- greek capital letter pi, U03A0 ISOgrk3 --> <!ENTITY Rho CDATA "Ρ" -- greek capital letter rho, U03A1 --> <!-- (there is no Sigmaf, and no U03A2 character either) --> <!ENTITY Sigma CDATA "Σ" -- greek capital letter sigma, U03A3 ISOgrk3 --> <!ENTITY Tau CDATA "Τ" -- greek capital letter tau, U03A4 --> <!ENTITY Upsi CDATA "Υ" -- greek capital letter upsilon, U03A5 ISOgrk3 --> <!ENTITY Phi CDATA "Φ" -- greek capital letter phi, U03A6 ISOgrk3 --> <!ENTITY Chi CDATA "Χ" -- greek capital letter chi, U03A7 --> <!ENTITY Psi CDATA "Ψ" -- greek capital letter psi, U03A8 ISOgrk3 --> <!ENTITY Omega CDATA "Ω" -- greek capital letter omega, U03A9 ISOgrk3 --> <!ENTITY alpha CDATA "α" -- greek small letter alpha, U03B1 ISOgrk3 --> <!ENTITY beta CDATA "β" -- greek small letter beta, U03B2 ISOgrk3 --> <!ENTITY gamma CDATA "γ" -- greek small letter gamma, U03B3 ISOgrk3 --> <!ENTITY delta CDATA "δ" -- greek small letter delta, U03B4 ISOgrk3 --> <!ENTITY epsi CDATA "ε" -- greek small letter epsilon, U03B5 ISOgrk3 --> <!-- why not 'epsilon' ?? how to remember which three are truncated?? --> <!ENTITY zeta CDATA "ζ" -- greek small letter zeta, U03B6 ISOgrk3 --> <!ENTITY eta CDATA "η" -- greek small letter eta, U03B7 ISOgrk3 --> <!ENTITY theta CDATA "θ" -- greek small letter theta, U03B8 ISOgrk3 --> <!ENTITY iota CDATA "ι" -- greek small letter iota, U03B9 ISOgrk3 --> <!ENTITY kappa CDATA "κ" -- greek small letter kappa, U03BA ISOgrk3 --> <!ENTITY lambda CDATA "λ" -- greek small letter lambda, U03BB ISOgrk3 --> <!ENTITY mu CDATA "μ" -- greek small letter mu, U03BC ISOgrk3 --> <!ENTITY nu CDATA "ν" -- greek small letter nu, U03BD ISOgrk3 --> <!ENTITY xi CDATA "ξ" -- greek small letter xi, U03BE ISOgrk3 --> <!ENTITY omicron CDATA "ο" -- greek small letter omicron, U03BF NEW --> <!ENTITY pi CDATA "π" -- greek small letter pi, U03C0 ISOgrk3 --> <!ENTITY rho CDATA "ρ" -- greek small letter rho, U03C1 ISOgrk3 --> <!ENTITY sigmaf CDATA "ς" -- greek small letter final sigma, U03C2 ISOgrk3 --> <!ENTITY sigma CDATA "σ" -- greek small letter sigma, U03C3 ISOgrk3 --> <!ENTITY tau CDATA "τ" -- greek small letter tau, U03C4 ISOgrk3 --> <!ENTITY upsi CDATA "υ" -- greek small letter upsilon, U03C5 ISOgrk3 --> <!-- why not 'upsilon' --> <!ENTITY phi CDATA "φ" -- greek small letter phi, U03C6 ISOgrk3 --> <!ENTITY chi CDATA "χ" -- greek small letter chi, U03C7 ISOgrk3 --> <!ENTITY psi CDATA "ψ" -- greek small letter psi, U03C8 ISOgrk3 --> <!ENTITY omega CDATA "ω" -- greek small letter omega, U03C9 ISOgrk3 --> <!ENTITY theta CDATA "ϑ" -- greek small letter theta symbol, U03D1 NEW --> <!ENTITY upsih CDATA "ϒ" -- greek upsilon with hook symbol, U03D2 NEW --> <!ENTITY piv CDATA "ϖ" -- greek pi symbol, U03D6 ISOgrk3 --> <!-- General Punctuation --> <!ENTITY bull CDATA "•" -- bullet, =black small circle, U2022 ISOpub --> <!-- bullet is NOT the same as bullet operator, U2219 --> <!ENTITY hellip CDATA "…" -- horizontal ellipsis, =three dot leader, U2026 ISOpub --> <!ENTITY prime CDATA "′" -- prime, =minutes, =feet, U2032 ISOtech --> <!ENTITY Prime CDATA "″" -- double prime, =seconds, =inches, U2033 ISOtech --> <!ENTITY oline CDATA "‾" -- overline, =spacing overscore, U203E NEW --> <!ENTITY frasl CDATA "⁄" -- fraction slash, U2044 NEW --> <!-- Letterlike Symbols --> <!ENTITY weierp CDATA "℘" -- script capital P, =power set, =Weierstrass p, U2118 ISOamso --> <!ENTITY image CDATA "ℑ" -- blackletter capital I, =imaginary part, U2111 ISOamso --> <!ENTITY real CDATA "ℜ" -- blackletter capital R, =real part symbol, U211C ISOamso --> <!ENTITY trade CDATA "™" -- trade mark sign, U2122 ISOnum --> <!ENTITY alefsym CDATA "ℵ" -- alef symbol, =first transfinite cardinal, U2135 NEW --> <!-- alef symbol is NOT the same as hebrew letter alef, U05D0 although the same glyph could be used to depict both characters --> <!-- Arrows --> <!ENTITY larr CDATA "←" -- leftwards arrow, U2190 ISOnum --> <!ENTITY uarr CDATA "↑" -- upwards arrow, U2191 ISOnum--> <!ENTITY rarr CDATA "→" -- rightwards arrow, U2192 ISOnum --> <!ENTITY darr CDATA "↓" -- downwards arrow, U2193 ISOnum --> <!ENTITY harr CDATA "↔" -- left right arrow, U2194 ISOamsa --> <!ENTITY crarr CDATA "↵" -- downwards arrow with corner leftwards, =carriage return, U21B5 NEW --> <!ENTITY lArr CDATA "⇐" -- leftwards double arrow, U21D0 ISOtech --> <!-- Unicode does not say that lArr is the same as the 'is implied by' arrow but also does not have any other character for that function. So ? lArr can be used for 'is implied by' as ISOtech suggests --> <!ENTITY uArr CDATA "⇑" -- upwards double arrow, U21D1 ISOamsa --> <!ENTITY rArr CDATA "⇒" -- rightwards double arrow, U21D2 ISOtech --> <!-- Unicode does not say this is the 'implies' character but does not have another character with this function so ? rArr can be used for 'implies' as ISOtech suggests --> <!ENTITY dArr CDATA "⇓" -- downwards double arrow, U21D3 ISOamsa --> <!ENTITY hArr CDATA "⇔" -- left right double arrow, U21D4 ISOamsa --> <!-- Mathematical Operators --> <!ENTITY forall CDATA "∀" -- for all, U2200 ISOtech --> <!ENTITY part CDATA "∂" -- partial differential, U2202 ISOtech --> <!ENTITY exist CDATA "∃" -- there exists, U2203 ISOtech --> <!ENTITY empty CDATA "∅" -- empty set, =null set, =diameter, U2205 ISOamso --> <!ENTITY nabla CDATA "∇" -- nabla, =backward difference, U2207 ISOtech --> <!ENTITY isin CDATA "∈" -- element of, U2208 ISOtech --> <!ENTITY notin CDATA "∉" -- not an element of, U2209 ISOtech --> <!ENTITY ni CDATA "∋" -- contains as member, U220B ISOtech --> <!-- should there be a more memorable name than 'ni'? --> <!ENTITY prod CDATA "∏" -- n-ary product, =product sign, U220F ISOamsb --> <!-- prod is NOT the same character as U03A0 'greek capital letter pi' though the same glyph might be used for both --> <!ENTITY sum CDATA "−" -- n-ary sumation, U2211 ISOamsb --> <!-- sum is NOT the same character as U03A3 'greek capital letter sigma' though the same glyph might be used for both --> <!ENTITY minus CDATA "−" -- minus sign, U2212 ISOtech --> <!ENTITY lowast CDATA "∗" -- asterisk operator, U2217 ISOtech --> <!ENTITY radic CDATA "√" -- square root, =radical sign, U221A ISOtech --> <!ENTITY prop CDATA "∝" -- proportional to, U221D ISOtech --> <!ENTITY infin CDATA "∞" -- infinity, U221E ISOtech --> <!ENTITY ang CDATA "∠" -- angle, U2220 ISOamso --> <!ENTITY and CDATA "⊥" -- logical and, =wedge, U2227 ISOtech --> <!ENTITY or CDATA "⊦" -- logical or, =vee, U2228 ISOtech --> <!ENTITY cap CDATA "∩" -- intersection, =cap, U2229 ISOtech --> <!ENTITY cup CDATA "∪" -- union, =cup, U222A ISOtech --> <!ENTITY int CDATA "∫" -- integral, U222B ISOtech --> <!ENTITY there4 CDATA "∴" -- therefore, U2234 ISOtech --> <!ENTITY sim CDATA "∼" -- tilde operator, =varies with, =similar to, U223C ISOtech --> <!-- tilde operator is NOT the same character as the tilde, U007E, although the same glyph might be used to represent both --> <!ENTITY cong CDATA "≅" -- approximately equal to, U2245 ISOtech --> <!ENTITY asymp CDATA "≅" -- almost equal to, =asymptotic to, U2248 ISOamsr --> <!ENTITY ne CDATA "≠" -- not equal to, U2260 ISOtech --> <!ENTITY equiv CDATA "≡" -- identical to, U2261 ISOtech --> <!ENTITY le CDATA "≤" -- less-than or equal to, U2264 ISOtech --> <!ENTITY ge CDATA "≥" -- greater-than or equal to, U2265 ISOtech --> <!ENTITY sub CDATA "⊂" -- subset of, U2282 ISOtech --> <!ENTITY sup CDATA "⊃" -- superset of, U2283 ISOtech --> <!-- note that nsup, 'not a superset of, U2283' is not covered by the Symbol font encoding and is not included. Should it be, for symmetry? It is in ISOamsn --> <!ENTITY nsub CDATA "⊄" -- not a subset of, U2284 ISOamsn --> <!ENTITY sube CDATA "⊆" -- subset of or equal to, U2286 ISOtech --> <!ENTITY supe CDATA "⊇" -- superset of or equal to, U2287 ISOtech --> <!ENTITY oplus CDATA "⊕" -- circled plus, =direct sum, U2295 ISOamsb --> <!ENTITY otimes CDATA "⊗" -- circled times, =vector product, U2297 ISOamsb --> <!ENTITY perp CDATA "⊥" -- up tack, =orthogonal to, =perpendicular, U22A5 ISOtech --> <!ENTITY sdot CDATA "⋅" -- dot operator, U22C5 ISOamsb --> <!-- dot operator is NOT the same character as U00B7 middle dot --> <!-- Miscellaneous Technical --> <!ENTITY lceil CDATA "⌈" -- left ceiling, =apl upstile, U2308, ISOamsc --> <!ENTITY rceil CDATA "⌉" -- right ceiling, U2309, ISOamsc --> <!ENTITY lfloor CDATA "⌊" -- left floor, =apl downstile, U230A, ISOamsc --> <!ENTITY rfloor CDATA "⌋" -- right floor, U230B, ISOamsc --> <!ENTITY lang CDATA "〈" -- left-pointing angle bracket, =bra, U2329 ISOtech --> <!-- lang is NOT the same character as U003C 'less than' or U2039 'single left-pointing angle quotation mark' --> <!ENTITY rang CDATA "〉" -- right-pointing angle bracket, =ket, U232A ISOtech --> <!-- rang is NOT the same character as U003E 'greater than' or U203A 'single right-pointing angle quotation mark' --> <!-- Geometric Shapes --> <!ENTITY loz CDATA "◊" -- lozenge, U25CA ISOpub --> <!-- Miscellaneous Symbols --> <!ENTITY spades CDATA "♠" -- black spade suit, U2660 ISOpub --> <!-- black here seems to mean filled as opposed to hollow --> <!ENTITY clubs CDATA "♣" -- black club suit, =shamrock, U2663 ISOpub --> <!ENTITY hearts CDATA "♥" -- black heart suit, =valentine, U2665 ISOpub --> <!ENTITY diams CDATA "♦" -- black diamond suit, U2666 ISOpub -->
<!-- Special characters for HTML --> <!-- Character entity set. Typical invocation: <!ENTITY % HTMLspecial PUBLIC "-//W3C//ENTITIES Special//EN//HTML"> %HTMLspecial; --> <!-- Portions International Organization for Standardization 1986: Permission to copy in any form is granted for use with conforming SGML systems and applications as defined in ISO 8879, provided this notice is included in all copies. --> <!-- Relevant ISO entity set is given unless names are newly introduced. New names (ie, not in ISO 8879 list) do not clash with any existing ISO 8879 entity names. Unicode character numbers are given for each character, in hex, and are identical for Unicode 1.1 and Unicode 2.0. CDATA values are decimal conversions of the Unicode values. Names are Unicode 2.0 names. --> <!-- C0 Controls and Basic Latin --> <!ENTITY quot CDATA """ -- quotation mark, =apl quote, U0022 ISOnum --> <!ENTITY amp CDATA "&" -- ampersand, U0026 ISOnum --> <!ENTITY lt CDATA "<" -- less-than sign, U003C ISOnum --> <!ENTITY gt CDATA ">" -- greater-than sign, U003E ISOnum --> <!-- Latin Extended-A --> <!ENTITY OElig CDATA "Œ" -- latin capital ligature oe, U0152 ISOlat2 --> <!ENTITY oelig CDATA "œ" -- latin small ligature oe, U0153 ISOlat2 --> <!-- ligature is a misnomer, this is a separate character in some languages --> <!ENTITY Scaron CDATA "Š" -- latin capital letter s with caron, U0160 ISOlat2 --> <!ENTITY scaron CDATA "š" -- latin small letter s with caron, U0161 ISOlat2 --> <!ENTITY Yuml CDATA "Ÿ" -- latin capital letter y with diaeresis, U0178 ISOlat2 --> <!-- Spacing Modifier Letters --> <!ENTITY circ CDATA "ˆ" -- modifier letter circumflex accent, U02C6 ISOpub --> <!ENTITY tilde CDATA "˜" -- small tilde, U02DC ISOdia --> <!-- General Punctuation --> <!ENTITY ensp CDATA " " -- en space, U2002 ISOpub --> <!ENTITY emsp CDATA " " -- em space, U2003 ISOpub --> <!ENTITY thinsp CDATA " " -- thin space, U2009 ISOpub --> <!ENTITY zwnj CDATA "‌" -- zero width non-joiner, U200C NEW RFC 2070 --> <!ENTITY zwj CDATA "‍" -- zero width joiner, U200D NEW RFC 2070 --> <!ENTITY lrm CDATA "‎" -- left-to-right mark, U200E NEW RFC 2070 --> <!ENTITY rlm CDATA "‏" -- right-to-left mark, U200F NEW RFC 2070 --> <!ENTITY ndash CDATA "–" -- en dash, U2013 ISOpub --> <!ENTITY mdash CDATA "—" -- em dash, U2014 ISOpub --> <!ENTITY lsquo CDATA "‘" -- left single quotation mark, U2018 ISOnum --> <!ENTITY rsquo CDATA "’" -- right single quotation mark, U2019 ISOnum --> <!ENTITY sbquo CDATA "‚" -- single low-9 quotation mark, U201A NEW --> <!ENTITY ldquo CDATA "“" -- left double quotation mark, U201C ISOnum --> <!ENTITY rdquo CDATA "”" -- right double quotation mark, U201D ISOnum --> <!ENTITY bdquo CDATA "„" -- double low-9 quotation mark, U201E NEW --> <!ENTITY dagger CDATA "†" -- dagger, U2020 ISOpub --> <!ENTITY Dagger CDATA "‡" -- double dagger, U2021 ISOpub --> <!ENTITY permil CDATA "‰" -- per mille sign, U2030 ISOtech --> <!ENTITY lsaquo CDATA "‹" -- single left-pointing angle quotation mark, U2039 ISO proposed --> <!-- lsaquo is proposed but not yet ISO standardised --> <!ENTITY rsaquo CDATA "›" -- single right-pointing angle quotation mark, U203A ISO proposed --> <!-- rsaquo is proposed but not yet ISO standardised -->