From 15c91c8548b99e4ebfdc24c35497ff2f6e13bb35 Mon Sep 17 00:00:00 2001 From: Norman Walsh Date: Thu, 14 Apr 2005 13:36:01 +0000 Subject: [PATCH] Removed; we're using the W3C entities --- xmlcharent/spec/wd-docbook-xmlcharent.xml | 1012 --------------------- 1 file changed, 1012 deletions(-) delete mode 100644 xmlcharent/spec/wd-docbook-xmlcharent.xml diff --git a/xmlcharent/spec/wd-docbook-xmlcharent.xml b/xmlcharent/spec/wd-docbook-xmlcharent.xml deleted file mode 100644 index df43eec67..000000000 --- a/xmlcharent/spec/wd-docbook-xmlcharent.xml +++ /dev/null @@ -1,1012 +0,0 @@ - - - -%glyphs; - - - - - - - - - - - - - - - - - - - - - - -]> -
- - -wd-docbook-xmlcharent -&version; - -XML -HTML -PDF - -http://www.oasis-open.org/docbook/specs - -XML Character Entities - - - - NormanWalsh - - Sun - Sun Microsystems, Inc. -
Norman.Walsh@Sun.COM
-
-
-
- -13 June 2002 - -20012002 -The Organization for the Advancement of Structured Information -Standards [OASIS]. All Rights Reserved. - - - - -This &standard; defines XML encodings of the 19 standard -character entity sets defined in Non-normative Annex D of . - - - -Status - -This is a working draft constructed by the editor. It is not -an official committee work product and may not reflect the consensus -opinion of the committee. - - - -Please send comments on this specification to the -docbook@lists.oasis-open.org list. To subscribe, send -an email message to -docbook-request@lists.oasis-open.org with the word -subscribe as the body of the -message. - - - - - - -
- -This &standard; defines XML encodings of the standard SGML -character entity sets. - -Non-normative Annex D of defines 19 -standard SGML character entity sets: - -Added Latin 1 -Added Latin 2 -Greek Letters -Monotoniko Greek -Russian Cyrillic -Non-Russian Cyrillic -Numeric and Special Graphic -Diacritical Marks -Publishing -Box and Line Drawing -General Technical -Greek Symbols -Alternative Greek Symbols -Added Math Symbols: Ordinary -Added Math Symbols: Binary Operators -Added Math Symbols: Relations -Added Math Symbols: Negated Relations -Added Math Symbols: Arrow Relations -Added Math Symbols: Delimiters -. The SGML declarations for these entities use the -specific character data (SDATA) entity type that is -not supported in XML, so alternative XML declarations are necessary. - - -In XML, the specific character data of most entities can be expressed -as a character. - -
XML Character Entity Sets - -The character entity sets defined by this &standard; are -summarized in through . - -In order to use these entities in a document, they must be declared. -Entities can be declared in the external subset or the internal subset, -as described in . An example document, with the -declaration in the internal subset, is shown in . - - - -Declaring and Using the ISO Latin 1 Character Entity Set -<!DOCTYPE doc [ -<!ENTITY % iso-lat1 PUBLIC "ISO 8879:1986//ENTITIES Added Latin 1//EN//XML" - "&baseuri;/iso-lat1.ent"> -%iso-lat1; -]> -<doc> -<p>This document declares the ISO Latin 1 Character Entity Set, providing -access to the ISO Latin 1 entities, such as "&eacute;" and "&copy;".</p> -</doc> - - - -Non-validating XML Parsers may choose not to process externally -declared entities. This &standard; does not alter the semantics of XML -processors. If a processor does not see the declaration for an entity, -it will not be able to report the correct replacement text for that -entity. - - -
Multi-Character Replacements - -The replacement text of some entities includes more than a single -Unicode character. Some characters are composed with the -combining reverse solidus overlay (20E5) -and some are composed with a variation selector -(FE00, FE01, …). - - -
- -
Duplicate Entities - -Historically, the inodot entity is multiply -defined in iso-lat2.ent and iso-amso.ent. If both entity sets are -included, some parsers will warn about redefinition of this entity. -The warning can be ignored. - -
- -
Entities with no Mapping - -There are a small number of entities that have no - representation. These entities are all mapped to -the Unicode character FFFD, the -replacement character. - - - - - - - - - Entity Name - Entity Set - Description - - - - - fjlig - iso-pub.ent - Small fj ligature - - - gnap - iso-amsn.ent - Greater, not approximate - - - jnodot - iso-amso.ent - Small j, no dot - - - lnap - iso-amsn.ent - Less, not approximate - - - lpargt - iso-amsc.ent - Greater than, left arc - - - nsmid - iso-amsn.ent - Negated short mid - - - prnE - iso-amsn.ent - Precedes, not double equals - - - rpargt - iso-amsc.ent - Right paren, greater than - - - scnE - iso-amsn.ent - Succeeds, not double equals - - - smid - iso-amsr.ent - shortmid r - - - vsubnE - iso-amsn.ent - Subset not double equals, variant - - - - - -Users needing these characters will have to rely on the private use -area or other non-portable mechanisms to access them. - -
- -
Entities with Substituted Mappings - -There are a few more for which there is no specific - representation but where a reasonable -substitution has been used: - - - - - - - - - Entity Name - Entity Set - Substitution - Description - - - - - bepsi - iso-amsr.ent - 220D - Back epsilon: such that - - - ges - iso-amsr.ent - 2265 - Greater-or-equal, slanted - - - gvnE - iso-amsn.ent - 2269 - Gt, vert, not double equals - - - iff - iso-tech.ent - 21D4 - If and only if - - - les - iso-amsr.ent - 2264 - Less-than-or-equal, slanted - - - lozf - iso-pub.ent - 2726 - Lozenge, filled - - - lvnE - iso-amsn.ent - 2268 - Less, vert, not double equals - - - nge - iso-amsn.ent - 2271 - Neither greater-than nor equal to - - - nle - iso-amsn.ent - 2270 - Not less-than-or-equal - - - npre - iso-amsn.ent - 22E0 - Not precedes, equals - - - nsce - iso-amsn.ent - 22E1 - Not succeeds, equals - - - nspar - iso-amsn.ent - 2226 - Not short parallel - - - pre - iso-amsr.ent - 227C - Precedes, equals - - - spar - iso-amsr.ent - 2225 - Short parallel - - - ssetmn - iso-amsb.ent - 2216 - Small set minus (reverse solidus) - - - star - iso-pub.ent - 22C6 - Star operator - - - starf - iso-pub.ent - 2605 - Black star - - - thkap - iso-amsr.ent - 2248 - Thick approximate - - - thksim - iso-amsr.ent - 223C - Thick similar - - - vsubne - iso-amsn.ent - 228A - Subset, not equals, variant - - - vsupnE - iso-amsn.ent - 228B - Subset not double equals, variant - - - vsupne - iso-amsn.ent - 228B - Superset, not equals, variant - - - xhArr - iso-amsa.ent - 2194 - Long left and right double arr - - - xharr - iso-amsa.ent - 2194 - Long left and right arr - - - xlArr - iso-amsa.ent - 21D0 - Long left double arrow - - - xrArr - iso-amsa.ent - 21D2 - Long right double arr - - - ssmile - iso-amsr.ent - 2323 - Small smile - - - sfrown - iso-amsr.ent - 2322 - Small frown - - - - - -Users needing alternate glyphs for these characters will have to -rely on redefining them to use the private use area or other -non-portable mechanisms to access them. - -
- -
- -
XML Character Elements - -Named XML entities (except for the five -predefined entities) cannot -be used if they are not declared. Entity declaration requires either an external -or an internal subset. Some classes of applications forbid the occurrence of -markup declarations in documents. For these documents, named character entities are -inaccessible. - -In this section, we introduce an XML vocabulary with the semantics of -character entity reference. This &standard; defines the semantics of elements -and attributes declared in the &charns; -namespace. - -This namespace contains exactly one element, char. -The char element has two attributes, -entity and -name. They are mutually exclusive. - -The entity attribute identifies -characters by their character entity names. (The set of valid names -is the closed set of names associated with character entity sets defined -by this &standard;.) Case is significant in entity names. - -The name attribute identifies -characters by their Unicode character names. (The set of valid names -is the set of character names published in the -specification, or any later version of that specification.) Case is -insignificant in character names. - -The definition of this namespace is -shown in figure . - -
-The RELAX NG Definition of the <literal>&charns;</literal> Namespace - - - - - - -
- - -shows a sample document using this mechanism. - - -Declaring and Using the ISO Latin 1 Character Entity Set -<doc xmlns:e="&charns;"> -<p>This document uses the character names element to access -character entities, such as "<e:char name="eacute"/>" and -"<e:char name="COPYRIGHT SIGN"/>".</p> -</doc> - - -The character names element is limited to contexts where elements may -occur. In particular, elements may not occur in XML attribute values. Note, -however, that internationalization requirements such as bidirectional language -support and Ruby already require structure in arbitrary contexts. It is probably -an error to use attributes for human-readable content. - -
- -Added Latin 1 - -Identifiers for this entity set: - - -Public identifier: ISO 8879:1986//ENTITIES Added Latin 1//EN//XML -System identifier: &baseuri;/iso-lat1.ent - - - -The following character entities are defined in this entity set: - - -&iso-lat1-table; - - - -Added Latin 2 - -Identifiers for this entity set: - - -Public identifier: ISO 8879:1986//ENTITIES Added Latin 2//EN//XML -System identifier: &baseuri;/iso-lat2.ent - - -The following character entities are defined in this entity set: - - -&iso-lat2-table; - - - -Greek Letters - -Identifiers for this entity set: - - -Public identifier: ISO 8879:1986//ENTITIES Greek Letters//EN//XML -System identifier: &baseuri;/iso-grk1.ent - - -The following character entities are defined in this entity set: - - -&iso-grk1-table; - - - -Monotoniko Greek - -Identifiers for this entity set: - - -Public identifier: ISO 8879:1986//ENTITIES Monotoniko Greek//EN//XML -System identifier: &baseuri;/iso-grk2.ent - - -The following character entities are defined in this entity set: - - -&iso-grk2-table; - - - -Russian Cyrillic - -Identifiers for this entity set: - - -Public identifier: ISO 8879:1986//ENTITIES Russian Cyrillic//EN//XML -System identifier: &baseuri;/iso-cyr1.ent - - -The following character entities are defined in this entity set: - - -&iso-cyr1-table; - - - -Non-Russian Cyrillic - -Identifiers for this entity set: - - -Public identifier: ISO 8879:1986//ENTITIES Non-Russian Cyrillic//EN//XML -System identifier: &baseuri;/iso-cyr2.ent - - -The following character entities are defined in this entity set: - - -&iso-cyr2-table; - - - -Numeric and Special Graphic - -Identifiers for this entity set: - - -Public identifier: ISO 8879:1986//ENTITIES Numeric and Special Graphic//EN//XML -System identifier: &baseuri;/iso-num.ent - - -The following character entities are defined in this entity set: - - -&iso-num-table; - - - -Diacritical Marks - -Identifiers for this entity set: - - -Public identifier: ISO 8879:1986//ENTITIES Diacritical Marks//EN//XML -System identifier: &baseuri;/iso-dia.ent - - -The following character entities are defined in this entity set: - - -&iso-dia-table; - - - -Publishing - -Identifiers for this entity set: - - -Public identifier: ISO 8879:1986//ENTITIES Publishing//EN//XML -System identifier: &baseuri;/iso-pub.ent - - -The following character entities are defined in this entity set: - - -&iso-pub-table; - - - -Box and Line Drawing - -Identifiers for this entity set: - - -Public identifier: ISO 8879:1986//ENTITIES Box and Line Drawing//EN//XML -System identifier: &baseuri;/iso-box.ent - - -The following character entities are defined in this entity set: - - -&iso-box-table; - - - -General Technical - -Identifiers for this entity set: - - -Public identifier: ISO 8879:1986//ENTITIES General Technical//EN//XML -System identifier: &baseuri;/iso-tech.ent - - -The following character entities are defined in this entity set: - - -&iso-tech-table; - - - -Greek Symbols - -Identifiers for this entity set: - - -Public identifier: ISO 8879:1986//ENTITIES Greek Symbols//EN//XML -System identifier: &baseuri;/iso-grk3.ent - - -The following character entities are defined in this entity set: - - -&iso-grk3-table; - - - -Alternative Greek Symbols - -Identifiers for this entity set: - - -Public identifier: ISO 8879:1986//ENTITIES Alternative Greek Symbols//EN//XML -System identifier: &baseuri;/iso-grk4.ent - - -The following character entities are defined in this entity set: - - -&iso-grk4-table; - - - -Added Math Symbols: Ordinary - -Identifiers for this entity set: - - -Public identifier: ISO 8879:1986//ENTITIES Added Math Symbols: Ordinary//EN//XML -System identifier: &baseuri;/iso-amso.ent - - -The following character entities are defined in this entity set: - - -&iso-amso-table; - - - -Added Math Symbols: Binary Operators - -Identifiers for this entity set: - - -Public identifier: ISO 8879:1986//ENTITIES Added Math Symbols: Binary Operators//EN//XML -System identifier: &baseuri;/iso-amsb.ent - - -The following character entities are defined in this entity set: - - -&iso-amsb-table; - - - -Added Math Symbols: Relations - -Identifiers for this entity set: - - -Public identifier: ISO 8879:1986//ENTITIES Added Math Symbols: Relations//EN//XML -System identifier: &baseuri;/iso-amsr.ent - - -The following character entities are defined in this entity set: - - -&iso-amsr-table; - - - -Added Math Symbols: Negated Relations - -Identifiers for this entity set: - - -Public identifier: ISO 8879:1986//ENTITIES Added Math Symbols: Negated Relations//EN//XML -System identifier: &baseuri;/iso-amsn.ent - - -The following character entities are defined in this entity set: - - -&iso-amsn-table; - - - -Added Math Symbols: Arrow Relations - -Identifiers for this entity set: - - -Public identifier: ISO 8879:1986//ENTITIES Added Math Symbols: Arrow Relations//EN//XML -System identifier: &baseuri;/iso-amsa.ent - - -The following character entities are defined in this entity set: - - -&iso-amsa-table; - - - -Added Math Symbols: Delimiters - -Identifiers for this entity set: - - -Public identifier: ISO 8879:1986//ENTITIES Added Math Symbols: Delimiters//EN//XML -System identifier: &baseuri;/iso-amsc.ent - - -The following character entities are defined in this entity set: - - -&iso-amsc-table; - - - -Unicode Glyphs - -The Unicode reference glyphs in this document are examples -only. Some characters have more than one Unicode representation and -different Unicode characters may be appropriate in different -contexts. The glyph images offer only one of many possible -representations for the specified character. - - -Most of the glyphs this reference are from the TmsPF Roman font -by -Production -First Software. A few glyphs are from -Everson Mono. - - -Unicode support requires much more than a simple character to glyph -mapping; for more information on Unicode, consult -The -Unicode Standard, Version 2.0 and - -Unicode -Technical Report #8, which describes Unicode Version 2.1. - - - - -OASIS DocBook Technical Committee - -The following individuals were members of the committee during -the formulation of this &standard;: - - -Dennis Evans -Patricia Gee-Best -Paul Grosso -Dick Hamilton -Nancy (Paisner) Harrison -Sabine Ocker -Michael Sabrio -Michael Smith -Tim Teebken (prospective) -Norman Walsh (Chair, Editor) - - - - - -Notices - -Copyright © The Organization for the Advancement of -Structured Information Standards [OASIS] 2001, 2002. All Rights -Reserved. - -OASIS takes no position regarding the validity -or scope of any intellectual property or other rights -that might be claimed to pertain to the implementation -or use of the technology described in this document -or the extent to which any license under such rights -might or might not be available; neither does it represent -that it has made any effort to identify any such rights. -Information on OASIS's procedures with respect to rights -in OASIS specifications can be found at the OASIS website. -Copies of claims of rights made available for publication -and any assurances of licenses to be made available, -or the result of an attempt made to obtain a general -license or permission for the use of such proprietary -rights by implementors or users of this specification, -can be obtained from the OASIS Executive Director. - -OASIS invites any interested party to bring to -its attention any copyrights, patents or patent applications, -or other proprietary rights which may cover technology -that may be required to implement this specification. -Please address the information to the OASIS Executive -Director. - -This document and translations of it may be copied -and furnished to others, and derivative works that comment -on or otherwise explain it or assist in its implementation -may be prepared, copied, published and distributed, -in whole or in part, without restriction of any kind, -provided that the above copyright notice and this paragraph -are included on all such copies and derivative works. -However, this document itself may not be modified in -any way, such as by removing the copyright notice or -references to OASIS, except as needed for the purpose -of developing OASIS specifications, in which case the -procedures for copyrights defined in the OASIS Intellectual -Property Rights document must be followed, or as required -to translate it into languages other than English. - -The limited permissions granted above are perpetual -and will not be revoked by OASIS or its successors or -assigns. - -This document and the information contained herein -is provided on an AS IS basis and OASIS DISCLAIMS -ALL WARRANTIES, EXPRESS OR IMPLIED, INCLUDING BUT NOT -LIMITED TO ANY WARRANTY THAT THE USE OF THE INFORMATION -HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED WARRANTIES -OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. - -OASIS has been notified of intellectual property -rights claimed in regard to some or all of the contents -of this specification. For more information consult -the online list of claimed rights. - - - - -Intellectual Property Rights - -For information on wether any patents have been disclosed that may be -essential to implementing this specification, and any offers of patent -licensing terms, please refer to the Intellectual Property Rights section -of the DocBook web page -() - - - - - -Revision History - - - - Working Draft - 13 Jun 2002 - - - - Working Draft - 19 Mar 2002 - - - - Working Draft - 19 Nov 2001 - - - - - - -References - -Normative - - - - - - - - - -Non-Normative - -gaijiMartin Dürst, -editor. -Missing -Characters and Glyphs. -World Wide Web Consortium, 2002. - - - - - - -
-- 2.40.0