XML Catalogs

Working Draft 16 Jul 2001

This version:
Working Draft: 16 Jul 2001
Previous versions:
Working Draft: 10 Jul 2001
Working Draft: 09 Jul 2001
Working Draft: 12 Jun 2001
Working Draft: 08 Jun 2001
Working Draft: 25 May 2001
Archives: 24 May 2001 and beyond
Editor:
Norman Walsh <Norman.Walsh@Sun.COM>

This document and translations of it may be copied and furnished to others, and derivative works that comment on or otherwise explain it or assist in its implementation may be prepared, copied, published and distributed, in whole or in part, without restriction of any kind, provided that the above copyright notice and this paragraph are included on all such copies and derivative works. However, this document itself may not be modified in any way, such as by removing the copyright notice or references to OASIS, except as needed for the purpose of developing OASIS specifications, in which case the procedures for copyrights defined in the OASIS Intellectual Property Rights document must be followed, or as required to translate it into languages other than English.

The limited permissions granted above are perpetual and will not be revoked by OASIS or its successors or assigns.

This document and the information contained herein is provided on an "AS IS" basis and OASIS DISCLAIMS ALL WARRANTIES, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF THE INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.


Abstract

The requirement that all external identifiers in XML documents must provide a system identifier has unquestionably been of tremendous short-term benefit to the XML community. It has allowed a whole generation of tools to be developed without the added complexity of explicit entity management.

However, the interoperability of XML documents has been impeded in several ways by the lack of entity management facilities:

  1. External identifiers may require resources that are not always available. For example, a system identifier that points to a resource on another machine may be inaccessible if a network connection is not available.

  2. External identifiers may require protocols that are not accessible to all of the vendors' tools on a single computer system. An external identifier that is addressed with the ftp: protocol, for example, is not accessible to a tool that does not support that protocol.

  3. It is often convenient to access resources using system identifiers that point to local resources. Exchanging documents that refer to local resources with other systems is problematic at best and impossible at worst.



The problems involved with sharing documents, or packages of documents, across multiple systems are large and complex. While there are many important issues involved and a complete solution is beyond the current scope, the OASIS membership agrees upon the enclosed set of conventions to address a useful subset of the complete problem. To address these issues, this Standard defines an entity catalog that maps both external identifiers and arbitrary URI references to URI references.

Status of this Document

This is a working draft constructed by the editor. It is not an official committee work product and may not reflect the consensus opinion of the committee.

Table of Contents

1 Introduction
2 Terminology
3 An Entity Catalog
4 Using Catalogs
4.1 External Identifier Entries
4.1.1 The prefer attribute
4.2 URI Entries
4.3 Rewrite Entries
4.4 An XML Catalog Example
5 Catalog Entry Files
5.1 Document Control of Catalog Entry Files
5.2 Catalog Circularities
6 XML Catalog Entries
6.1 Common Attributes
6.2 Public Identifier Normalization
6.3 System Identifier and URI Normalization
6.4 URN "Unwrapping"
6.5 Catalog Elements
6.5.1 The catalog Entry
6.5.2 The group Entry
6.5.3 The public Entry
6.5.4 The system Element
6.5.5 The rewriteSystem Element
6.5.6 The delegatePublic Element
6.5.7 The delegateSystem Element
6.5.8 The uri Element
6.5.9 The rewriteURI Element
6.5.10 The delegateURI Element
6.5.11 The nextCatalog Element
7 Catalog Resolution Semantics
7.1 External Identifier Resolution
7.1.1 Input to the Resolver
7.1.2 Resolution of External Identifiers
7.2 URI Resolution
7.2.1 Input to the Resolver
7.2.2 Resolution of URI references
8 Resource Failures

Appendixes

A A W3C XML Schema for the XML Catalog (Non-Normative)
B A TREX Grammar for the XML Catalog (Non-Normative)
C A RELAX Grammar for the XML Catalog (Non-Normative)
D A DTD for the XML Catalog (Non-Normative)
E Support for TR9401 Catalog Semantics (Non-Normative)
E.1 The doctype Element
E.2 The document Element
E.3 The dtddecl Element
E.4 The entity Element
E.5 The linktype Element
E.6 The notation Element
E.7 The sgmldecl Element
F OASIS Entity Resolution Committee (Non-Normative)
References

1. Introduction

In order to make optimal use of the information about an XML external resource, there needs to be some interoperable way to map the information in an XML external identifier into a URI reference for the desired resource.

This Standard defines an entity catalog that handles two simple cases:

  1. Mapping an external entity's public identifier and/or system identifier to a URI reference.

  2. Mapping the URI reference of a resource (a namespace name, stylesheet, image, etc.) to another URI reference.

Though it does not handle all issues that a combination of a complete entity manager and storage manager addresses, it simplifies both the use of multiple products in a great majority of cases and the task of processing documents on different systems.

2. Terminology

The key words must, must not, required, shall, shall not, should, should not, recommended, may, and optional in this Standard are to be interpreted as described in [RFC 2119]. Note that for reasons of style, these words are not capitalized in this document.

The terms URI and URI reference are to be interpreted as described in [RFC 2396].

The logical input to a catalog processor is an external identifier (some combination of public and system identifiers) or a URI reference. The logical output of the catalog processor is a URI reference. (This Standard does not attempt to define an API for catalog processors so the logical interfaces and the practical interfaces may differ.)

A catalog is a logical structure that contains "mapping" information. A catalog may be physically contained in one or more catalog entry files. A catalog entry file is a document that contains a set of catalog entries.

3. An Entity Catalog

This Standard defines a format for an application-independent entity catalog that maps external identifiers and URI references to (other) URI references. This catalog is expressed in terms of [XML 1.0 (Second Edition)] and [Namespaces].

The principal task of a catalog processor is to find entries in the catalog that match the input provided and return the associated URI reference as the output. The first such match is always used, and there is no requirement for the catalog processor to search for additional matches.

This catalog is used by an application's entity manager. This Standard does not dictate when an entity manager should access this catalog; for example, an application may attempt other mapping algorithms before or (if the catalog fails to produce a successful mapping) after accessing this catalog.

The catalog is effectively an ordered list of (one or more) catalog entry files. It is up to the application to determine the ordered list of catalog entry files to be used as the logical catalog. (This Standard uses the term "catalog entry file" to refer to one component of a logical catalog even though a catalog entry file can be any kind of storage object or entity including--but not limited to--a table in a database, some object identified by a URI reference, or some dynamically generated set of catalog entries.)

Each entry in the catalog associates a URI reference with information about an external reference that appears in an XML document. For example, the following are possible catalog entries that associate a URI reference with a public identifier:

<public publicId="ISO 8879:1986//ENTITIES Added Latin 1//EN"
        uri="iso-lat1.gml"/>
<public publicId="-//USA/AAP//DTD BK-1//EN"
        uri="aapbook.dtd"/>
<public publicId="-//Example, Inc.//DTD Report//EN"
        uri="http://www.example.com/dtds/report.dtd"/>

This Standard defines the following catalog entry types: catalog, delegatePublic, delegateSystem, delegateURI, group, nextCatalog public, rewriteSystem, rewriteURI, system, and uri.

The namespace name defined by this Standard is "urn:oasis:names:tc:entity:xmlns:xml:catalog".

This Standard reserves all elements and attributes from its namespace for current and future use. In addition, unqualified attributes on elements in its namespace, other than the attributes explicitly described in this Standard, are reserved for future use.

To provide for possible future extension and other applications of this catalog, its format allows for "other information" indicated by elements and attributes from namespaces other than the one defined by this Standard.

4. Using Catalogs

A catalog can be used in two different, independent ways: (1) it can be used to locate the replacement text for an external entity, or (2) it can be used to locate an alternate URI reference for a resource. Although these functions are similar in nature, they are distinct and exercise two different sets of entries in the catalog.

In either case, the following entries in the catalog are interpreted as follows:

  1. The catalog entry is the root of a catalog entry file. All other entries occur within this catalog element.

  2. The group entry is simply a wrapper on which prefer (Section 4.1.1) and xml:base (Section 6.1) can occur. It has no other effect on the entries that it contains. When examining entries in the catalog sequentially, the presence of a group entry does not effect the order in which they are examined.

  3. The nextCatalog entry indicates that an entity manager must use the associated URI reference to locate an additional catalog entry file to be processed after the current catalog entry file.

The nextCatalog entry can be used to insert a new catalog entry file into the current list of catalog entry files. The catalog attribute on a nextCatalog entry is used to locate another catalog entry file that is inserted into the catalog entry file list after the current catalog entry file. Multiple nextCatalog entries are allowed, and the referenced catalog entry files are inserted into the existing working catalog entry file list in the order in which they occur in the current catalog entry file (document order).

Catalog entry files identified by nextCatalog entries will only be examined after all other entries in the current catalog entry file have been considered and none of them provide a match for the current input.

4.1. External Identifier Entries

External identifiers ([Production 75] of [XML 1.0 (Second Edition)]) identify the external subset, entities, and notations of an XML document.

For the purposes of resolving external identifiers, the following entries are considered:

  • The public entry indicates that an entity manager must use the associated URI reference to locate the replacement text for an entity with the specified public identifier.

  • The system entry indicates that an entity manager must use the associated URI reference to locate the replacement text for an entity with the specified system identifier.

  • The rewriteSystem entry indicates that an entity manager must rewrite the specified system identifier by replacing the matching prefix with the associated rewrite prefix. The resulting string must be used to locate the replacement text.

  • The delegatePublic entry indicates that external identifiers with a public identifier that starts with the specified string must be resolved by considering the catalog specified by the associated URI reference.

  • The delegateSystem entry indicates that external identifiers with a system identifier that starts with the specified string must be resolved by considering the catalog specified by the associated URI reference.

Although system identifiers are assumed to be "URI reference[s]...meant to be dereferenced to obtain input for the XML processor to construct the entity's replacement text", in some circumstances (such as when the document was generated on another system, when the document was generated in another location on the same system, or when some files referenced by system identifiers have moved since the document was generated), the specified system identifiers are not always the best identifiers for the replacement text. For this or other reasons, it may be desirable to prefer the public identifier over the system identifier in determining the entity's replacement text. Therefore, this Standard defines two modes for searching the catalog: "prefer system identifier" mode and "prefer public identifier" mode.

  1. If system identifiers are preferred, a system identifier was provided, and there is no matching system or rewriteSystem type entry, then the system identifier given in the external identifier is used as the URI reference to locate the entity's replacement text regardless of any public identifier given in the external identifier.

    This Standard does not specify what happens if a preferred system identifier does not identify an accessible storage object; an application may look up the public identifier and/or entity name to find another URI reference, or it may simply report an error. An application should at least have the option of issuing a warning if the system identifier fails in this mode.

  2. If public identifiers are preferred, a public identifier was specified, and there is no matching system or rewriteSystem type entry, the system identifier given in the external identifier is used as the URI reference to locate the entity's replacement text only if no mapping can be found in the catalog for the public identifier.

4.1.1. The prefer attribute

The prefer attribute can be used on catalog and group entry types to indicate, for the enclosed set of catalog entries, if system or public entry matches are preferred.

Each occurrence of a prefer attribute specifies the search strategy mode for entries contained within the catalog or group element on which it occurs. A public or delegatePublic entry encountered when prefer is "public" will be considered for possible matching whether or not the external identifier has an explicit system identifier. A public or delegatePublic entry encountered when prefer is "system" will be ignored during lookups for which the external identifier has an explicit system identifier. No other entry types are affected by the prefer attribute. The initial search strategy in force at the beginning of each catalog entry file depends on the preference as determined by the application.

An application must provide some way (e.g., a runtime argument, environment variable, preference switch) that allows the user to specify which of these modes to use in the absence of any occurrence of the prefer attribute on the catalog entry.

When doing a catalog lookup, an entity manager generally uses whatever is available from among the entity declaration's system identifier and public identifier to find catalog entries that match the given information. A match in one catalog entry file will take precedence over any match in a later catalog entry file (and, in fact, the entity manager need not process subsequent catalog entry files once a match has occurred).

4.2. URI Entries

Other URI references, for example namespace names, stylesheets, included files, graphics, and hypertext references, simply identify other resources. The input to a resolver that locates resources is simply the original URI reference.

For the purposes of resolving URI references, the following entries are considered:

  • The uri entry indicates that an entity manager must use the associated URI reference to locate the resource.

  • The rewriteURI entry indicates that an entity manager must rewrite the specified URI reference by replacing the matching prefix with the associated rewrite prefix. The resulting string must be used to locate the resource.

  • The delegateURI entry indicates that a URI reference that starts with the specified string must be resolved by considering the catalog specified by the associated URI reference.

As when resolving URI references, a match in one catalog entry file will take precedence over any match in a later catalog entry file (and, in fact, the entity manager need not process subsequent catalog entry files once a match has occurred).

4.3. Rewrite Entries

Rewrite entries are provided as a convenience for performing redirection of a whole set of entities with a single catalog entry. Typical uses are website mirroring and dealing with fragment identifiers.

If the entire website at http://example.com/ has been mirrored onto your local system in file:///share/mirrors/example/, it is likely that you want any system identifier reference to the website to be redirected to your local system.

One way of doing this would be to create a system entry for every relevant identifier. If there are many entities on the website, this may be tedious. Instead, a single rewrite can be used:

<rewriteSystem systemIdStartString="http://www.example.com/"
               rewritePrefix="file:///share/mirrors/example/"/>

Similarly, if you have a large number of references to a single document using many different fragment identifiers, it may be tedious to construct uri entries for every URI reference if the base document moves. Again, a single rewrite can be used instead:

<rewriteURI uriStartString="http://www.example.com/old-location/database.xml"
            rewritePrefix="http://www.example.com/new-location/database.xml"/>

4.4. An XML Catalog Example

The catalog files in Example 1 and Example 2 are complete examples of XML Catalog files. These files are located at http://www.oasis-open.org/committees/entity/release/1.0/.

Example 1. A DocBook XML Catalog File: docbook.xml.

<!DOCTYPE catalog
  PUBLIC "-//OASIS//DTD Entity Resolution XML Catalog V1.0//EN"
         "http://www.oasis-open.org/committees/entity/release/1.0/catalog.dtd">
<catalog xmlns="urn:oasis:names:tc:entity:xmlns:xml:catalog"
         prefer="public">

  <group xml:base="http://www.oasis-open.org/docbook/xml/4.1.2/">
    <public publicId="-//OASIS//DTD DocBook XML V4.1.2//EN"
            uri="docbookx.dtd"/>
    <public publicId="-//OASIS//ENTITIES DocBook XML Notations V4.1.2//EN"
            uri="dbnotnx.mod"/>
    <public publicId="-//OASIS//ENTITIES DocBook XML Character Entities V4.1.2//EN"
            uri="dbcentx.mod"/>
    <public publicId="-//OASIS//ELEMENTS DocBook XML Information Pool V4.1.2//EN"
            uri="dbpoolx.mod"/>
    <public publicId="-//OASIS//ELEMENTS DocBook XML Document Hierarchy V4.1.2//EN"
            uri="dbhierx.mod"/>
    <public publicId="-//OASIS//ENTITIES DocBook XML Additional General Entities V4.1.2//EN"
            uri="dbgenent.mod"/>
    <public publicId="-//OASIS//DTD DocBook XML CALS Table Model V4.1.2//EN"
            uri="calstblx.dtd"/>
  </group>

  <public publicId="-//OASIS//DTD DocBook MathML Module V1.0//EN"
      uri="http://www.oasis-open.org/docbook/xml/mathml/1.0/dbmathml.dtd"/>

  <nextCatalog catalog="stylesheets.xml"/>

</catalog>

Example 2. A Stylesheet XML Catalog File: stylesheet.xml.

<!DOCTYPE catalog
  PUBLIC "-//OASIS//DTD Entity Resolution XML Catalog V1.0//EN"
         "http://www.oasis-open.org/committees/entity/release/1.0/catalog.dtd">
<catalog xmlns="urn:oasis:names:tc:entity:xmlns:xml:catalog"
         prefer="public">

  <!-- Circumvent relative URI in spec.xsl that doesn't work online -->
  <uri name="http://www.oasis-open.org/committes/tr.xsl"
   uri="http://www.oasis-open.org/committes/entity/stylesheets/base/tr.xsl"/>
</catalog>

Together, these two catalog files provide sufficient resolution information to parse and format the XML source for this Standard using the XSL Stylesheet at http://www.oasis-open.org/committees/entity/stylesheets/spec.xsl.

5. Catalog Entry Files

Applications conforming to this Standard must provide some (implementation dependent) mechanism to establish the initial list of catalog entry files. This may be a preferences dialog, an environment variable, an application properties file, or any other appropriate mechanism.

5.1. Document Control of Catalog Entry Files

If a document contains external identifiers or URI references, it may be useful for the document to identify a catalog or catalogs that are likely to aid in the resolution of those references.

For example, XML documents stored on the www.example.com server may wish to indicate that http://www.example.com/catalog is a useful public catalog to use when parsing them.

This Standard defines the processing instruction "<?oasis-xml-catalog?>" for this purpose. The <?oasis-xml-catalog?> processing instruction has a single pseudo-attribute, catalog, that identifies a single catalog entry file.

If a document contains one or more <?oasis-xml-catalog?> processing instruction(s), the catalog(s) identified must be used during resolution of external identifiers and URI references within that document.

Catalogs referenced by the processing instruction are added to the end of any system- or user-defined catalog entry file list.

For example, in

<?xml version="1.0"?>
<?oasis-xml-catalog catalog="http://example.com/catalog.xml"?>
<!DOCTYPE doc PUBLIC "-//Example//DTD Document V1.0//EN"
                     "http://www.example.com/schema/doc.dtd">
...

The URI "http://example.com/catalog.xml" is added to the end of the of the list of catalog entry files used for resolution within this document.

The following constraints apply:

  • The <?oasis-xml-catalog?> processing instruction must appear in the prologue before the document type declaration.

    It is an error for the processing instruction to occur in the internal subset or after the document type declaration. Processors should recover from the error by ignoring any processing instructions that occur after the start of the document type declaration.

    Likewise, it is an error for the processing instruction to occur after other processing instructions that contain URI references, such as stylesheet processing instructions ([XML Stylesheets]). Applications should recover by ignoring catalogs that occur after the URI reference for the purposes of resolving that reference.

  • If more than one catalog processing instruction is present, each catalog specified is added to the end of the catalog entry file list.

  • If the catalog is specified with a relative URI, it is relative to the base URI of the document that contains the processing instruction.

  • The URI that identifies the catalog entry file must not be subject to catalog resolution.

Catalog-aware applications should support the <?oasis-xml-catalog?> processing instruction. If the processing instruction is supported, they must provide a facility which allows a user to request that all <?oasis-xml-catalog?> processing instructions be ignored.

5.2. Catalog Circularities

It is possible for catalog references to be circular. Given the dynamic nature of resources on the internet, it is not possible for implementations to detect without the possibility of error whether or not a circular reference actually occurs.

Implementations should detect circularity where possible. If a circularity is detected, it must be treated as an error. Applications may recover from this error by indicating to the calling application that no match was found.

6. XML Catalog Entries

Each catalog entry file consists of some number of catalog entries. Catalog entries can be identified by the namespace name defined by this Standard.

Elements and attributes from other namespaces are are allowed, but they must be ignored for the purposes of resolution defined by this Standard. If an element is ignored, all of its descendants must also be ignored, regardless of their namespace.

6.1. Common Attributes

There are two attributes common to most elements: id and xml:base. The id is provided so that individual entries can be uniquely identified; it has no impact on the semantics of the catalog as defined by this Standard. The xml:base attribute changes the base URI for the entry on which it occurs (and all entries contained within it, unless further modified by another xml:base attribute). The semantics of xml:base are normatively defined in [XML Base].

All of the attributes defined by this Standard are in the per-element-type partition. Use of global attributes, for example, <cat:group cat:id="groupId"> instead of <cat:group id="groupId"> is forbidden.

6.2. Public Identifier Normalization

In order to accurately and interoperably compare public identifiers, catalog processors must perform normalization on public identifiers in both the catalog and the input passed to them.

All strings of white space in public identifiers must be normalized to single space characters (#x20), and leading and trailing white space must be removed.

6.3. System Identifier and URI Normalization

In order to accurately and interoperably compare system identifiers and URI references, catalog processors must perform normalization. The normalization described in this section must be performed on system identifiers and URI references passed as input to the resolver and on strings in the catalog that are compared to them.

URI references require encoding and escaping of certain characters. The disallowed characters include all non-ASCII characters, plus the excluded characters listed in Section 2.4 of [RFC 2396], except for the number sign (#) and percent sign (%) characters and the square bracket characters re-allowed in [RFC 2732]. These characters are summarized in Table 1.

Table 1. Excluded US-ASCII Characters

Hex Value Character Hex Value Character Hex Value Character
00 NUL 0F SI 1E RS
01 SOH 10 DLE 1F US
02 STX 11 DC1 20 (space)
03 ETX 12 DC2 22 "
04 EOT 13 DC3 3C <
05 ENQ 14 DC4 3E >
06 ACK 15 NAK 5C \
07 BEL 16 SYN 5E ^
08 BS 17 ETB 60 `
09 HT 18 CAN 7B {
0A LF 19 EM 7C |
0B VT 1A SUB 7D }
0C FF 1B ESC 7F  
0D CR 1C FS    
0E SO 1D GS    

Disallowed characters must be escaped as follows:

  1. Each disallowed character is converted to UTF-8 [RFC 2279] as one or more bytes.

  2. Any octets corresponding to a disallowed character are escaped with the URI escaping mechanism (that is, converted to %HH, where HH is the hexadecimal notation of the byte value).

  3. The original character is replaced by the resulting character sequence.

Note that this normalization process is idempotent: repeated normalization does not change a normalized URI reference.

6.4. URN "Unwrapping"

This Standard requires processors to implement special treatment of URNs in the publicid URN Namespace (that is, URNs that begin "urn:publicid:").

URNs of this form must, in some contexts, be "unwrapped" by the Catalog processor. This unwrapping translates the URN form of the public identifier back into the standard ISO 8879 form for the purposes of subsequent catalog processing.

Unwrapping a urn:publicid: URN is accomplished by transcribing characters in the URN according to the following table after discarding the leading urn:publicid: string:

URN Characters Public Identifier Characters
+ " " (space)
: //
; ::
%2B +
%3A :
%2F /
%3B ;
%27 '
%3F ?
%23 #
%25 %

For example, the following URN in the publicid namespace:

urn:publicid:-:OASIS:DTD+DocBook+XML+V4.1.2:EN

Represents the public identifier:

-//OASIS//DTD DocBook XML V4.1.2//EN

6.5. Catalog Elements

The root element of a catalog entry file is catalog. There are ten other element types: group, public, system, rewriteSyste delegatePublic, delegateSystem, rewriteURI, delegateURI, uri, and nextCatalog. Each of these element types is described in one of the following sections.

6.5.1. The catalog Entry

Each XML Catalog entry file consists of a single catalog element. This element may set the global prefer value and global base URI. It is otherwise just a container for the other elements.

<catalog
  id = id
  prefer = "system" | "public"
  xml:base = uri-reference>
  <!-- Content: (group | public | system | uri | delegatePublic | delegateSystem | delegateURI | nextCatalog)+ -->
</catalog>

6.5.2. The group Entry

The group element is a convenience wrapper for specifying a prefer setting or base URI for a set of catalog entries. It has no semantics other than scoping these settings.

<group
  id = id
  prefer = "system" | "public"
  xml:base = uri-reference>
  <!-- Content: (public | system | uri | delegatePublic | delegateSystem | delegateURI | nextCatalog)+ -->
</group>

6.5.3. The public Entry

The public element associates a URI reference with a public identifier.

<public
  id = id
  publicId = public-identifier
  uri = uri-reference
  xml:base = uri-reference />

A public entry matches a public identifier if the normalized value (Section 6.2) of the public identifier is lexically identical to the normalized value of the publicId attribute of the entry.

If the value of the uri attribute is relative, it must be made absolute with respect to the base URI currently in effect. The URI reference should not include a fragment identifier.

6.5.4. The system Element

The system element associates a URI reference with a system identifier.

<system
  id = id
  systemId = string
  uri = uri-reference
  xml:base = uri-reference />

A system entry matches a system identifier if the normalized value (Section 6.3) of the system identifier is lexically identical to the normalized value of the systemId attribute of the entry.

If the value of the uri attribute is relative, it must be made absolute with respect to the base URI currently in effect. The URI reference should not include a fragment identifier.

6.5.5. The rewriteSystem Element

The rewriteSystem element rewrites the beginning of a system identifier.

<rewriteSystem
  id = id
  systemIdStartString = string
  rewritePrefix = uri-reference />

A rewriteSystem entry matches a system identifier if the normalized value (Section 6.3) of the system identifier begins precisely with the normalized value of the systemIdStartString attribute of the entry.

If the value of the rewritePrefix attribute is relative, it must be made absolute with respect to the base URI currently in effect.

Rewriting removes the matching prefix from the supplied system identifier and replaces it with the value of the rewritePrefix attribute.

Given the following catalog fragment:

<rewriteSystem systemIdStartString="http://www.oasis-open.org/docbook/"
          rewritePrefix="file:///sourceforge/docbook/docbook/"/>
<rewriteSystem systemIdStartString="http://www.oasis-open.org/"
          rewritePrefix="file:///share/doctypes/oasis/"/>
<rewriteSystem systemIdStartString="http://www.oasis-open.org/committees/"
          rewritePrefix="file:///projects/oasis/"/>

The first two entries match the system identifier "http://www.oasis-open.org/docbook/xml/4.1.2/docbookx.dtd", but the third does not. The rewritten system identifier in this case is: "file:///sourceforge/docbook/docbook/xml/4.1.2/docbookx.dtd". (The first matching entry is used.)

6.5.6. The delegatePublic Element

The delegatePublic element associates an alternate catalog with a partial public identifier.

<delegatePublic
  id = id
  publicIdStartString = public-identifier-prefix
  catalog = uri-reference
  xml:base = uri-reference />

A delegatePublic entry matches a public identifier if the normalized value (Section 6.2) of the public identifier begins precisely with the normalized value of the publicIdStartString attribute of the entry.

Given the following catalog fragment:

<delegatePublic publicIdStartString="-//OASIS//"
          catalog="http://www.oasis-open.org/catalog"/>
<delegatePublic publicIdStartString="-//OASIS//DTD DocBook "
          catalog="http://www.oasis-open.org/docbook/catalog"/>
<delegatePublic publicIdStartString="-//OASIS//DTD XML Catalog //"
          catalog="http://www.oasis-open.org/committees/entity/catalog"/>

The first two entries match the public identifier "-//OASIS//DTD DocBook V4.1.2//EN", but the third does not.

If the value of the catalog attribute is relative, it must be made absolute with respect to the base URI currently in effect.

6.5.7. The delegateSystem Element

The delegateSystem element associates an alternate catalog with a partial system identifier.

<delegateSystem
  id = id
  systemIdStartString = string
  catalog = uri-reference
  xml:base = uri-reference />

A delegateSystem entry matches a system identifier if the normalized value (Section 6.3) of the system identifier begins precisely with the normalized value of the systemIdStartString attribute of the entry.

Given the following catalog fragment:

<delegateSystem systemIdStartString="http://www.oasis-open.org/"
          catalog="http://www.oasis-open.org/catalog"/>
<delegateSystem systemIdStartString="http://www.oasis-open.org/docbook/"
          catalog="http://www.oasis-open.org/docbook/catalog"/>
<delegatePublic publicIdStartString="http://www.oasis-open.org/committees/"
          catalog="http://www.oasis-open.org/committees/catalog"/>

The first two entries match the system identifier "http://www.oasis-open.org/docbook/xml/4.1.2/docbookx.dtd", but the third does not.

If the value of the catalog attribute is relative, it must be made absolute with respect to the base URI currently in effect.

6.5.8. The uri Element

The uri element associates an alternate URI reference with a URI reference.

<uri
  id = id
  name = string
  uri = uri-reference
  xml:base = uri-reference />

A uri entry matches a URI reference if the normalized value (Section 6.3) of the URI reference is lexically identical to the normalized value of the name attribute of the entry.

Given the following catalog fragment:

<uri name="http://www.oasis-open.org/committees/docbook/#membership"
          uri="file:///projects/oasis/docbook/website/#membership"/>
<uri name="http://www.oasis-open.org/committees/docbook/"
          uri="file:///projects/oasis/docbook/website/"/>

The second entry matches the URI reference "http://www.oasis-open.org/committees/docbook/", but the first does not.

If the value of the uri attribute is relative, it must be made absolute with respect to the base URI currently in effect.

6.5.9. The rewriteURI Element

The rewriteURI element rewrites the beginning of a URI reference.

<rewriteURI
  id = id
  uriStartString = string
  rewritePrefix = uri-reference />

A rewriteURI entry matches a URI reference if the normalized value (Section 6.3) of the URI reference begins precisely with the normalized value of the uriStartString attribute of the entry.

If the value of the rewritePrefix attribute is relative, it must be made absolute with respect to the base URI currently in effect.

Rewriting removes the matching prefix from the supplied URI reference and replaces it with the value of the value of the rewritePrefix attribute.

6.5.10. The delegateURI Element

The delegateURI element associates an alternate catalog with a partial URI reference.

<delegateURI
  id = id
  uriStartString = string
  catalog = uri-reference
  xml:base = uri-reference />

A delegateURI entry matches a URI reference if the normalized value (Section 6.3) of the URI reference begins precisely with the normalized value of the uriStartString attribute of the entry.

If the value of the catalog attribute is relative, it must be made absolute with respect to the base URI currently in effect.

6.5.11. The nextCatalog Element

The nextCatalog elements indicate additional catalog entry file(s) to be considered during the process of resolution.

<nextCatalog
  id = id
  catalog = uri-reference
  xml:base = uri-reference />

If the value of the catalog attribute is relative, it must be made absolute with respect to the base URI currently in effect.

Catalogs loaded due to a nextCatalog directive have an initial base URI that is dependent on the location of the loaded catalog entry file. No xml:base information is inherited from the originating catalog.

7. Catalog Resolution Semantics

This section describes how catalog resolution is performed.

Resolution begins with a list of catalog entry files and either an external identifier or a URI reference.

7.1. External Identifier Resolution

This section describes how catalog entries are used to resolve external identifiers.

7.1.1. Input to the Resolver

An external identifier will have at least one and perhaps both of the following:

  1. a public identifier

  2. a system identifier

If the public identifier is a URN in the publicid namespace ([urn:publicid]), it is converted into another public identifier by "unwrapping" the URN (Section 6.4). This may be done, for example, so that a URN can be specified as the public identifier and a URL as the system identifier, in the absence of widely deployed URN-resolution facilities.

If the system identifier is a URN in the publicid namespace, it is converted into a public identifier by "unwrapping" the URN. In this case, one of the following must apply:

  1. No public identifier was provided. Resolution continues as if the public identifier constructed by unwrapping the URN was supplied as the original public identifier and no system identifier was provided.

  2. The normalized public identifier provided is lexically identical to the public identifier constructed by unwrapping the URN. Resolution continues as if the system identifier had not been supplied.

  3. The normalized public identifier provided is different from the public identifier constructed by unwrapping the URN. This is an error. Applications may recover from this error by discarding the system identifier and proceeding with the original public identifier.

7.1.2. Resolution of External Identifiers

Resolution follows the steps listed below, proceeding to each subsequent step if and only if no other action is indicated.

  1. Resolution begins in the first catalog entry file in the current catalog entry file list.

  2. If a system identifier is provided, and at least one matching system entry exists, the (absolutized) value of the uri attribute of the first matching system entry is returned.

  3. If a system identifier is provided, and at least one matching rewriteSystem entry exists, rewriting is performed.

    Rewriting removes the matching prefix and replaces it with the rewrite prefix identified by the first matching entry. The rewritten string is returned.

  4. If a system identifier is provided, and one or more delegateSystem entries match, delegation is performed.

    If delegation is to be performed, a new catalog entry file list is generated from the set of all matching delegateSystem entries. The (absolutized) value of the catalog attribute of each matching delegateSystem entry is inserted into the new catalog entry file list such that the delegate entry with the longest matching systemIdStartString is first on the list, the entry with the second longest match is second, etc.

    These are the only catalog entry files on the list, the current list is not considered for the purpose of delegation. If delegation fails to find a match, resolution for this entity does not resume with the current list. (A subsequent resolution attempt for a different entity begins with the original list; in other words the catalog entry file list used for delegation is distinct and unrelated to the "normal" catalog entry file list.)

    Catalog resolution restarts using exclusively the catalog entry files in this new list and the given system identifier; any originally given public identifier is ignored during the remainder of the resolution of this external identifier: return to step 1.

  5. If a public identifier is provided, and at least one matching public entry exists, the (absolutized) value of the uri attribute of the first matching public entry is returned. If a system identifier is also provided as part of the input to this catalog lookup, only public entries that occur where the prefer setting is public are considered for matching.

  6. If a public identifier is provided, and one or more delegatePublic entries match, delegation is performed. If a system identifier is also provided as part of the input to this catalog lookup, only delegatePublic entries that occur where the prefer setting is public are considered for matching.

    If delegation is to be performed, a new catalog entry file list is generated from the set of all matching delegatePublic entries. The value of the catalog attribute of each matching delegatePublic entry is inserted into the new catalog entry file list such that the delegate entry with the longest matching publicIdStartString is first on the list, the entry with the second longest match is second, etc.

    These are the only catalog entry files on the list, the current list is not considered for the purpose of delegation. If delegation fails to find a match, resolution for this entity does not resume with the current list. (A subsequent resolution attempt for a different entity begins with the original list; in other words the catalog entry file list used for delegation is distinct and unrelated to the "normal" catalog entry file list.)

    Catalog resolution restarts using exclusively the catalog entry files in this new list and the given public identifier; any originally given system identifier is ignored during the remainder of the resolution of this external identifier: return to step 1.

  7. If the current catalog entry file contains one or more nextCatalog entries, the catalog entry files referenced by each nextCatalog entry's "catalog" attribute are inserted, in the order that they appear in this catalog entry file, onto the current catalog entry file list, immediately after the current catalog entry file.

  8. If there are one or more catalog entry files remaining on the current catalog entry file list, load the next catalog entry file and continue resolution efforts: return to step 2.

  9. Indicate to the calling application that no match was found.

7.2. URI Resolution

This section describes how catalog entries are used to resolve URI references.

7.2.1. Input to the Resolver

URI reference resolution always begins with a single URI reference.

If the URI reference is a URN in the publicid namespace ([urn:publicid]), it is converted into a public identifier by "unwrapping" the URN (Section 6.4). Resolution continues by following the semantics of external identifier resolution (Section 7.1) as if the public identifier constructed by unwrapping the URN had been provided and no system identifier had been provided. Otherwise, resolution of the URI reference proceeds according to the steps below.

7.2.2. Resolution of URI references

Resolution of a generic URI reference follows the steps listed below, proceeding to each subsequent step if and only if no other action is indicated.

  1. Resolution begins in the first catalog entry file in the current catalog list.

  2. If at least one matching uri entry exists, the (absolutized) value of the uri attribute of the first matching uri entry is returned.

  3. If at least one matching rewriteURI entry exists, rewriting is performed.

    Rewriting removes the matching prefix and replaces it with the rewrite prefix identified by the first matching entry. The rewritten string is returned.

  4. If one or more delegateURI entries match, delegation is performed.

    If delegation is to be performed, a new catalog entry file list is generated from the set of all matching delegateURI entries. The (absolutized) value of the catalog attribute of each matching delegateURI entry is inserted into the new catalog entry file list such that the delegate entry with the longest matching uriStartString is first on the list, the entry with the second longest match is second, etc.

    These are the only catalog entry files on the list, the current list is not considered for the purpose of delegation. If delegation fails to find a match, resolution for this entity does not resume with the current list. (A subsequent resolution attempt for a different entity begins with the original list; in other words the catalog entry file list used for delegation is distinct and unrelated to the "normal" catalog entry file list.)

    Catalog resolution restarts using exclusively the catalog entry files in this new list and the given URI reference: return to step 1.

  5. If the current catalog entry file contains one or more nextCatalog entries, the catalog entry files referenced by each nextCatalog entry's "catalog" attribute are inserted, in the order that they appear in this catalog entry file, onto the current catalog entry file list, immediately after the current catalog entry file.

  6. If there are one or more catalog entry files remaining on the current catalog entry file list, load the next catalog entry file and continue resolution efforts: return to step 2.

  7. Indicate to the calling application that no match was found.

8. Resource Failures

The catalog processor is sometimes required to load a catalog entry file. This may occur at the beginning of processing, when dealing with the initial list of catalog entry files, or during subsequent processing of a nextCatalog entry or one of the delegate entries.

If the processor attempts to load a resource and fails (because the resource does not exist or is not reachable, for example), it must recover by ignoring the catalog entry file that failed and proceeding.

Similarly, if the resource retrieved is not an understandable catalog (because it is not in a format that the processor recognizes, or it purports to be XML but is not well-formed, or for any other reason), the processor must recover by responding as if the resource could not be loaded.

In order for a resource to be considered an XML Catalog, the following conditions must hold:

  1. The resource retrieved must be well-formed [XML 1.0 (Second Edition)] consistent with [Namespaces].

  2. The root element must be catalog.

  3. The namespace name of the root element must be urn:oasis:names:tc:entity:xmlns:xml:catalog.

It is not an error for catalog processors to accept other forms of catalog documents, but their identification and specification is outside the scope of this Standard.

A. A W3C XML Schema for the XML Catalog (Non-Normative)

This is a [W3C XML Schema] grammar for the OASIS XML Catalog Standard.

<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE xs:schema SYSTEM "http://www.w3.org/2001/XMLSchema.dtd" [
<!ENTITY % schemaAttrs "
    xmlns:xml   CDATA   #IMPLIED
    xmlns:er    CDATA   #IMPLIED
">
]>
<xs:schema xmlns:xs='http://www.w3.org/2001/XMLSchema'
           xmlns:er='urn:oasis:names:tc:entity:xmlns:xml:catalog'
           xmlns:xml='http://www.w3.org/XML/1998/namespace'
  xmlns:x="http://nwalsh.com/extensions"
           targetNamespace='urn:oasis:names:tc:entity:xmlns:xml:catalog'
           elementFormDefault='qualified'>

  <!-- $Id: catalog.xsd,v 1.8 2001/06/12 14:26:00 ndw Exp $ -->

  <xs:simpleType name='pubIdChars'>
    <!-- A string of the characters defined as pubIdChar in production 13
         of the Second Edition of the XML 1.0 Recommendation -->
    <xs:restriction base='xs:string'/>
  </xs:simpleType>

  <xs:simpleType name='publicIdentifier'>
    <xs:restriction base='xs:string'/>
  </xs:simpleType>

  <xs:simpleType name='partialPublicIdentifier'>
    <xs:restriction base='er:pubIdChars'/>
  </xs:simpleType>

  <xs:simpleType name='systemOrPublic'>
    <xs:restriction base='xs:string'>
      <xs:enumeration value='system'/>
      <xs:enumeration value='public'/>
    </xs:restriction>
  </xs:simpleType>

  <!-- The global attribute xml:base is not explicitly declared; -->
  <!-- it is allowed by the anyAttribute declarations. -->

  <xs:complexType name='catalog'>
    <xs:choice minOccurs='1' maxOccurs='unbounded'>
      <xs:element ref='er:public'/>
      <xs:element ref='er:system'/>
      <xs:element ref='er:uri'/>
      <xs:element ref='er:rewriteSystem'/>
      <xs:element ref='er:rewriteURI'/>
      <xs:element ref='er:delegatePublic'/>
      <xs:element ref='er:delegateSystem'/>
      <xs:element ref='er:delegateURI'/>
      <xs:element ref='er:nextCatalog'/>
      <xs:element ref='er:group'/>
      <xs:any namespace='##other' processContents='skip'/>
    </xs:choice>
    <xs:attribute name='id' type='xs:ID'/>
    <xs:attribute name='prefer' type='er:systemOrPublic'/>
    <xs:anyAttribute namespace="##other" processContents="lax"/>
  </xs:complexType>

  <xs:complexType name='public'>
    <xs:complexContent>
      <xs:restriction base="xs:anyType">
        <xs:attribute name="publicId" type="er:publicIdentifier"
                       use="required"/>
        <xs:attribute name="uri" type="xs:anyURI" use="required"/>
        <xs:attribute name='id' type='xs:ID'/>
        <xs:anyAttribute namespace="##other" processContents="lax"/>
      </xs:restriction>
    </xs:complexContent>
  </xs:complexType>

  <xs:complexType name='system'>
    <xs:complexContent>
      <xs:restriction base="xs:anyType">
        <xs:attribute name="systemId" type="xs:string"
                       use="required"/>
        <xs:attribute name="uri" type="xs:anyURI" use="required"/>
        <xs:attribute name='id' type='xs:ID'/>
        <xs:anyAttribute namespace="##other" processContents="lax"/>
      </xs:restriction>
    </xs:complexContent>
  </xs:complexType>

  <xs:complexType name='uri'>
    <xs:complexContent>
      <xs:restriction base="xs:anyType">
        <xs:attribute name="name" type="xs:anyURI"
                       use="required"/>
        <xs:attribute name="uri" type="xs:anyURI" use="required"/>
        <xs:attribute name='id' type='xs:ID'/>
        <xs:anyAttribute namespace="##other" processContents="lax"/>
      </xs:restriction>
    </xs:complexContent>
  </xs:complexType>

  <xs:complexType name='rewriteSystem'>
    <xs:complexContent>
      <xs:restriction base="xs:anyType">
        <xs:attribute name="systemIdStartString"
                       type="xs:string"
                       use="required"/>
        <xs:attribute name="rewritePrefix" type="xs:string" use="required"/>
        <xs:attribute name='id' type='xs:ID'/>
        <xs:anyAttribute namespace="##other" processContents="lax"/>
      </xs:restriction>
    </xs:complexContent>
  </xs:complexType>

  <xs:complexType name='rewriteURI'>
    <xs:complexContent>
      <xs:restriction base="xs:anyType">
        <xs:attribute name="uriIdStartString"
                       type="xs:string"
                       use="required"/>
        <xs:attribute name="rewritePrefix" type="xs:string" use="required"/>
        <xs:attribute name='id' type='xs:ID'/>
        <xs:anyAttribute namespace="##other" processContents="lax"/>
      </xs:restriction>
    </xs:complexContent>
  </xs:complexType>

  <xs:complexType name='delegatePublic'>
    <xs:complexContent>
      <xs:restriction base="xs:anyType">
        <xs:attribute name="publicIdStartString"
                       type="er:partialPublicIdentifier"
                       use="required"/>
        <xs:attribute name="catalog" type="xs:anyURI" use="required"/>
        <xs:attribute name='id' type='xs:ID'/>
        <xs:anyAttribute namespace="##other" processContents="lax"/>
      </xs:restriction>
    </xs:complexContent>
  </xs:complexType>

  <xs:complexType name='delegateSystem'>
    <xs:complexContent>
      <xs:restriction base="xs:anyType">
        <xs:attribute name="systemIdStartString"
                       type="xs:string"
                       use="required"/>
        <xs:attribute name="catalog" type="xs:anyURI" use="required"/>
        <xs:attribute name='id' type='xs:ID'/>
        <xs:anyAttribute namespace="##other" processContents="lax"/>
      </xs:restriction>
    </xs:complexContent>
  </xs:complexType>

  <xs:complexType name='delegateURI'>
    <xs:complexContent>
      <xs:restriction base="xs:anyType">
        <xs:attribute name="uriStartString"
                       type="xs:string"
                       use="required"/>
        <xs:attribute name="catalog" type="xs:anyURI" use="required"/>
        <xs:attribute name='id' type='xs:ID'/>
        <xs:anyAttribute namespace="##other" processContents="lax"/>
      </xs:restriction>
    </xs:complexContent>
  </xs:complexType>

  <xs:complexType name='nextCatalog'>
    <xs:complexContent>
      <xs:restriction base="xs:anyType">
        <xs:attribute name="catalog" type="xs:anyURI" use="required"/>
        <xs:attribute name='id' type='xs:ID'/>
        <xs:anyAttribute namespace="##other" processContents="lax"/>
      </xs:restriction>
    </xs:complexContent>
  </xs:complexType>

  <xs:complexType name='group'>
    <xs:choice minOccurs='1' maxOccurs='unbounded'>
      <xs:element ref='er:public'/>
      <xs:element ref='er:system'/>
      <xs:element ref='er:uri'/>
      <xs:element ref='er:rewriteSystem'/>
      <xs:element ref='er:rewriteURI'/>
      <xs:element ref='er:delegatePublic'/>
      <xs:element ref='er:delegateSystem'/>
      <xs:element ref='er:delegateURI'/>
      <xs:element ref='er:nextCatalog'/>
      <xs:any namespace='##other' processContents='skip'/>
    </xs:choice>
    <xs:attribute name='prefer' type='er:systemOrPublic'/>
    <xs:attribute name='id' type='xs:ID'/>
    <xs:anyAttribute namespace="##other" processContents="lax"/>
  </xs:complexType>

  <xs:element name="catalog" type="er:catalog"/>
  <xs:element name="public" type="er:public"/>
  <xs:element name="system" type="er:system"/>
  <xs:element name="uri" type="er:uri"/>
  <xs:element name="rewriteSystem" type="er:rewriteSystem"/>
  <xs:element name="rewriteURI" type="er:rewriteURI"/>
  <xs:element name="delegatePublic" type="er:delegatePublic"/>
  <xs:element name="delegateSystem" type="er:delegateSystem"/>
  <xs:element name="delegateURI" type="er:delegateURI"/>
  <xs:element name="nextCatalog" type="er:nextCatalog"/>
  <xs:element name="group" type="er:group"/>

</xs:schema>

B. A TREX Grammar for the XML Catalog (Non-Normative)

This is a [TREX] grammar for the OASIS XML Catalog Standard.

<?xml version="1.0"?>
<grammar xmlns="http://www.thaiopensource.com/trex"
         ns="urn:oasis:names:tc:entity:xmlns:xml:catalog">

  <!-- $Id: catalog.trex,v 1.7 2001/06/12 14:26:00 ndw Exp $ -->

  <start>
    <choice>
      <ref name="Catalog"/>
    </choice>
  </start>

  <define name="OptionalAttributes">
    <optional>
      <attribute name="xml:base"/>
    </optional>
    <optional>
      <attribute name="id"/>
    </optional>
    <zeroOrMore>
      <attribute>
        <not>
          <choice>
            <nsName/>
            <nsName ns=""/>
          </choice>
        </not>
      </attribute>
    </zeroOrMore>
  </define>

  <define name="PreferAttribute">
    <attribute name="prefer">
      <choice>
        <string>system</string>
        <string>public</string>
      </choice>
    </attribute>
  </define>

  <define name="Catalog">
    <element name="catalog">
      <ref name="OptionalAttributes"/>
      <optional>
        <ref name="PreferAttribute"/>
      </optional>
      <oneOrMore>
        <choice>
          <ref name="Group"/>
          <ref name="Public"/>
          <ref name="System"/>
          <ref name="Uri"/>
          <ref name="RewriteSystem"/>
          <ref name="RewriteURI"/>
          <ref name="DelegatePublic"/>
          <ref name="DelegateSystem"/>
          <ref name="DelegateURI"/>
          <ref name="NextCatalog"/>
          <ref name="AnyOtherElement"/>
        </choice>
      </oneOrMore>
    </element>
  </define>

  <define name="Group">
    <element name="group">
      <ref name="OptionalAttributes"/>
      <optional>
        <ref name="PreferAttribute"/>
      </optional>
      <oneOrMore>
        <choice>
          <ref name="Public"/>
          <ref name="System"/>
          <ref name="Uri"/>
          <ref name="RewriteSystem"/>
          <ref name="RewriteURI"/>
          <ref name="DelegatePublic"/>
          <ref name="DelegateSystem"/>
          <ref name="DelegateURI"/>
          <ref name="NextCatalog"/>
          <ref name="AnyOtherElement"/>
        </choice>
      </oneOrMore>
    </element>
  </define>

  <define name="Public">
    <element name="public">
      <attribute name="publicId"/>
      <attribute name="uri"/>
      <ref name="OptionalAttributes"/>
      <empty/>
    </element>
  </define>

  <define name="System">
    <element name="system">
      <attribute name="systemId"/>
      <attribute name="uri"/>
      <ref name="OptionalAttributes"/>
      <empty/>
    </element>
  </define>

  <define name="Uri">
    <element name="uri">
      <attribute name="name"/>
      <attribute name="uri"/>
      <ref name="OptionalAttributes"/>
      <empty/>
    </element>
  </define>

  <define name="RewriteSystem">
    <element name="rewriteSystem">
      <attribute name="systemIdStartString"/>
      <attribute name="rewritePrefix"/>
      <ref name="OptionalAttributes"/>
      <empty/>
    </element>
  </define>

  <define name="RewriteURI">
    <element name="rewriteURI">
      <attribute name="uriStartString"/>
      <attribute name="rewritePrefix"/>
      <ref name="OptionalAttributes"/>
      <empty/>
    </element>
  </define>

  <define name="DelegatePublic">
    <element name="delegatePublic">
      <attribute name="publicIdStartString"/>
      <attribute name="catalog"/>
      <ref name="OptionalAttributes"/>
      <empty/>
    </element>
  </define>

  <define name="DelegateSystem">
    <element name="delegateSystem">
      <attribute name="systemIdStartString"/>
      <attribute name="catalog"/>
      <ref name="OptionalAttributes"/>
      <empty/>
    </element>
  </define>

  <define name="DelegateURI">
    <element name="delegateURI">
      <attribute name="uriStartString"/>
      <attribute name="catalog"/>
      <ref name="OptionalAttributes"/>
      <empty/>
    </element>
  </define>

  <define name="NextCatalog">
    <element name="nextCatalog">
      <attribute name="catalog"/>
      <ref name="OptionalAttributes"/>
      <empty/>
    </element>
  </define>

  <define name="AnyOtherElement">
    <choice>
      <element>
        <not>
          <choice>
            <nsName/>
            <nsName ns=""/>
          </choice>
        </not>
        <zeroOrMore>
          <attribute>
            <anyName/>
          </attribute>
        </zeroOrMore>
        <ref name="AnyContent"/>
      </element>
      <anyString/>
    </choice>
  </define>

  <define name="AnyContent">
    <mixed>
      <zeroOrMore>
        <element>
          <anyName/>
          <zeroOrMore>
            <attribute>
              <anyName/>
            </attribute>
          </zeroOrMore>
          <zeroOrMore>
            <ref name="AnyContent"/>
          </zeroOrMore>
        </element>
      </zeroOrMore>
    </mixed>
  </define>
</grammar>

C. A RELAX Grammar for the XML Catalog (Non-Normative)

This is a [RELAX] grammar for the OASIS XML Catalog Standard.

<?xml version='1.0'?>
<!DOCTYPE module SYSTEM "/share/doctypes/RELAX/relaxCore.dtd">

<module moduleVersion="0.1"
    relaxCoreVersion="1.0"
    targetNamespace="urn:oasis:names:tc:entity:xmlns:xml:catalog"
    xmlns="http://www.xml.gr.jp/xmlns/relaxCore">

  <!-- $Id: catalog.rlx,v 1.7 2001/06/12 14:26:00 ndw Exp $ -->

  <interface>
    <export label="catalog"/>
  </interface>

  <attPool role='optional.attributes'>
    <attribute name='id' type='ID'/>
    <attribute name='xml:base' type='string'/>
  </attPool>

  <attPool role='prefer.attribute'>
    <attribute name='prefer' type='NMTOKEN'>
      <enumeration value='system'/>
      <enumeration value='public'/>
    </attribute>
  </attPool>

  <elementRule role="catalog">
    <choice occurs="+">
      <ref label="public"/>
      <ref label="system"/>
      <ref label="uri"/>
      <ref label="rewriteSystem"/>
      <ref label="rewriteURI"/>
      <ref label="delegatePublic"/>
      <ref label="delegateSystem"/>
      <ref label="delegateURI"/>
      <ref label="nextCatalog"/>
      <ref label="group"/>
    </choice>
  </elementRule>

  <elementRule role="public">
    <empty/>
  </elementRule>

  <elementRule role="system">
    <empty/>
  </elementRule>

  <elementRule role="uri">
    <empty/>
  </elementRule>

  <elementRule role="rewriteSystem">
    <empty/>
  </elementRule>

  <elementRule role="rewriteURI">
    <empty/>
  </elementRule>

  <elementRule role="delegatePublic">
    <empty/>
  </elementRule>

  <elementRule role="delegateSystem">
    <empty/>
  </elementRule>

  <elementRule role="delegateURI">
    <empty/>
  </elementRule>

  <elementRule role="nextCatalog">
    <empty/>
  </elementRule>

  <elementRule role="group">
    <choice occurs="+">
      <ref label="public"/>
      <ref label="system"/>
      <ref label="uri"/>
      <ref label="rewriteSystem"/>
      <ref label="rewriteURI"/>
      <ref label="delegatePublic"/>
      <ref label="delegateSystem"/>
      <ref label="delegateURI"/>
      <ref label="nextCatalog"/>
    </choice>
  </elementRule>

  <tag name="catalog">
    <ref role="optional.attributes"/>
    <ref role="prefer.attribute"/>
  </tag>

  <tag name="group">
    <ref role="optional.attributes"/>
    <ref role="prefer.attribute"/>
  </tag>

  <tag name="public">
    <ref role="optional.attributes"/>
    <attribute name="publicId" type='string' required='true'/>
    <attribute name="uri" type='string' required='true'/>
  </tag>

  <tag name="system">
    <ref role="optional.attributes"/>
    <attribute name="systemId" type='string' required='true'/>
    <attribute name="uri" type='string' required='true'/>
  </tag>

  <tag name="uri">
    <ref role="optional.attributes"/>
    <attribute name="name" type='string' required='true'/>
    <attribute name="uri" type='string' required='true'/>
  </tag>

  <tag name="rewriteSystem">
    <ref role="optional.attributes"/>
    <attribute name="systemIdStartString" type='string' required='true'/>
    <attribute name="rewritePrefix" type='string' required='true'/>
  </tag>

  <tag name="rewriteURI">
    <ref role="optional.attributes"/>
    <attribute name="systemIdStartString" type='string' required='true'/>
    <attribute name="rewritePrefix" type='string' required='true'/>
  </tag>

  <tag name="delegatePublic">
    <ref role="optional.attributes"/>
    <attribute name="publicIdStartString" type='string' required='true'/>
    <attribute name="catalog" type='string' required='true'/>
  </tag>

  <tag name="delegateSystem">
    <ref role="optional.attributes"/>
    <attribute name="systemIdStartString" type='string' required='true'/>
    <attribute name="catalog" type='string' required='true'/>
  </tag>

  <tag name="delegateURI">
    <ref role="optional.attributes"/>
    <attribute name="uriStartString" type='string' required='true'/>
    <attribute name="catalog" type='string' required='true'/>
  </tag>

  <tag name="nextCatalog">
    <ref role="optional.attributes"/>
    <attribute name="catalog" type='string' required='true'/>
  </tag>
</module>

D. A DTD for the XML Catalog (Non-Normative)

This is an [XML 1.0 (Second Edition)] DTD grammar for the OASIS XML Catalog Standard.

The syntax for a catalog entry file is partially[1] defined by this Document Type Definition:

<!-- $Id: catalog.dtd,v 1.9 2001/06/12 14:26:00 ndw Exp $ -->

<!ENTITY % pubIdChars "CDATA">
<!ENTITY % publicIdentifier "%pubIdChars;">
<!ENTITY % partialPublicIdentifier "%pubIdChars;">
<!ENTITY % uriReference "CDATA">
<!ENTITY % string "CDATA">
<!ENTITY % systemOrPublic "(system|public)">

<!ENTITY % p "">
<!ENTITY % s "">
<!ENTITY % nsdecl "xmlns%s;">

<!ENTITY % catalog "%p;catalog">
<!ENTITY % public "%p;public">
<!ENTITY % system "%p;system">
<!ENTITY % uri "%p;uri">
<!ENTITY % rewriteSystem "%p;rewriteSystem">
<!ENTITY % rewriteURI "%p;rewriteURI">
<!ENTITY % delegatePublic "%p;delegatePublic">
<!ENTITY % delegateSystem "%p;delegateSystem">
<!ENTITY % delegateURI "%p;delegateURI">
<!ENTITY % nextCatalog "%p;nextCatalog">
<!ENTITY % group "%p;group">

<!ENTITY % local.catalog.attribs "">

<!ELEMENT %catalog; (%public;|%system;|%uri;
                     |%rewriteSystem;|%rewriteURI;
                     |%delegatePublic;|%delegateSystem;|%delegateURI;
                     |%nextCatalog;|%group;)+>
<!ATTLIST %catalog;
    %nsdecl;    %uriReference;      #FIXED
        'urn:oasis:names:tc:entity:xmlns:xml:catalog'
    prefer      %systemOrPublic;    #IMPLIED
    xml:base    %uriReference;      #IMPLIED
    %local.catalog.attribs;
>

<!ELEMENT %public; EMPTY>
<!ATTLIST %public;
    id      ID          #IMPLIED
    publicId    %publicIdentifier;  #REQUIRED
    uri     %uriReference;      #REQUIRED
    xml:base    %uriReference;      #IMPLIED
>

<!ELEMENT %system; EMPTY>
<!ATTLIST %system;
    id      ID          #IMPLIED
    systemId    %string;        #REQUIRED
    uri     %uriReference;      #REQUIRED
    xml:base    %uriReference;      #IMPLIED
>

<!ELEMENT %uri; EMPTY>
<!ATTLIST %uri;
    id      ID          #IMPLIED
    name        %string;        #REQUIRED
    uri     %uriReference;      #REQUIRED
    xml:base    %uriReference;      #IMPLIED
>

<!ELEMENT %rewriteSystem; EMPTY>
<!ATTLIST %rewriteSystem;
    id      ID          #IMPLIED
    systemIdStartString %string;    #REQUIRED
    rewritePrefix       %string;        #REQUIRED
>

<!ELEMENT %rewriteURI; EMPTY>
<!ATTLIST %rewriteURI;
    id      ID          #IMPLIED
    uriStartString  %string;        #REQUIRED
    rewritePrefix   %string;        #REQUIRED
>

<!ELEMENT %delegatePublic; EMPTY>
<!ATTLIST %delegatePublic;
    id      ID          #IMPLIED
    publicIdStartString %partialPublicIdentifier;   #REQUIRED
    catalog     %uriReference;      #REQUIRED
    xml:base    %uriReference;      #IMPLIED
>

<!ELEMENT %delegateSystem; EMPTY>
<!ATTLIST %delegateSystem;
    id      ID          #IMPLIED
    systemIdStartString %string;    #REQUIRED
    catalog     %uriReference;      #REQUIRED
    xml:base    %uriReference;      #IMPLIED
>

<!ELEMENT %delegateURI; EMPTY>
<!ATTLIST %delegateURI;
    id      ID          #IMPLIED
    uriStartString  %string;        #REQUIRED
    catalog     %uriReference;      #REQUIRED
    xml:base    %uriReference;      #IMPLIED
>

<!ELEMENT %nextCatalog; EMPTY>
<!ATTLIST %nextCatalog;
    id      ID          #IMPLIED
    catalog     %uriReference;      #REQUIRED
    xml:base    %uriReference;      #IMPLIED
>

<!ELEMENT %group; (%public;|%system;|%uri;
                   |%rewriteSystem;|%rewriteURI;
                   |%delegatePublic;|%delegateSystem;|%delegateURI;
                   |%nextCatalog;)+>
<!ATTLIST %group;
    id      ID          #IMPLIED
    prefer      %systemOrPublic;    #IMPLIED
    xml:base    %uriReference;      #IMPLIED
>

E. Support for TR9401 Catalog Semantics (Non-Normative)

This Standard defines a subset of the catalog entry types described in [TR 9401] that are applicable to XML. For implementors wishing to provide full TR9401 support, this appendix defines the elements that should be used for the remaining TR9401 catalog entry types.

The elements described in this appendix provide full TR9401 semantics in the XML Catalog format. These are implemented as extension elements in the namespace: "urn:oasis:names:tc:entity:xmlns:tr9401:catalog".

For a complete description of the semantics of these elements see [TR 9401].

E.1. The doctype Element

The doctype element associates a DTD with a document element.

<doctype
  id = id
  name = name
  uri = uri-reference
  xml:base = uri-reference />

E.2. The document Element

The document element identifies a default document.

<document
  id = id
  uri = uri-reference
  xml:base = uri-reference />

E.3. The dtddecl Element

The dtddecl element associates an SGML declaration with a public identifier.

<dtddecl
  id = id
  publicId = public-identifier
  uri = uri-reference
  xml:base = uri-reference />

E.4. The entity Element

The entity element associates a document with an entity name.

<entity
  id = id
  name = name
  uri = uri-reference
  xml:base = uri-reference />

E.5. The linktype Element

The linktype element associates an external subset with a linktype declaration name.

<linktype
  id = id
  name = name
  uri = uri-reference
  xml:base = uri-reference />

E.6. The notation Element

The notation element associates a URI reference with a notation name.

<notation
  id = id
  name = name
  uri = uri-reference
  xml:base = uri-reference />

E.7. The sgmldecl Element

The sgmldecl element provides the location of a default SGML declaration.

<sgmldecl
  id = id
  uri = uri-reference
  xml:base = uri-reference />

F. OASIS Entity Resolution Committee (Non-Normative)

Tony Coates, Reuters
John Cowan, Reuters Health
Paul Grosso, Arbortext
David Leland, Elsevier Science London
Normand Montour, LGS
Norman Walsh, Sun Microsystems (Editor)
Lauren Wood, SoftQuad (Chair)

References

Normative

IETF (Internet Engineering Task Force). RFC 2119: Key words for use in RFCs to Indicate Requirement Levels. S. Bradner. 1997.

Tim Bray, Jean Paoli, and C. M. Sperberg-McQueen, Eve Maler, editors. Extensible Markup Language (XML) 1.0 Second Edition. World Wide Web Consortium, 2000.

Tim Bray, Dave Hollander, and Andrew Layman, editors. Namespaces in XML. World Wide Web Consortium, 1999.

Jonathan Marsh, editor. XML Base. World Wide Web Consortium, 2000.

IETF (Internet Engineering Task Force). Internet Draft: A URN Namespace for Public Identifiers. N. Walsh, J. Cowan, P. Grosso, 2001.

Non-Normative

IETF (Internet Engineering Task Force). RFC 2396: Uniform Resource Identifiers (URI): Generic Syntax. T. Berners-Lee, R. Fielding, L. Masinter. 1998.

Paul V. Biron and Ashok Malhotra, editors. XML Schema Part 2: Datatypes. World Wide Web Consortium, 2000.

James Clark, editor. OASIS TREX Technical Committee. OASIS. 2001.

James Clark, editor. Associating Style Sheets with XML Documents Version 1.0. World Wide Web Consortium. 1999.

IETF (Internet Engineering Task Force). RFC 2732: Format for Literal IPv6 Addresses in URL's. R. Hinden, B. Carpenter, L. Masinter. 1999.

ISO/IEC DTR 22250-1, Document Description and Processing Languages -- Regular Language Description for XML (RELAX) -- Part 1: RELAX Core, ISO/IEC, 2000.

Henry S. Thompson, David Beech, Murray Maloney, et al. editors. XML Schema Part 1: Structures. World Wide Web Consortium, 2000.

IETF (Internet Engineering Task Force). RFC 2279: UTF-8, a transformation format of ISO 10646. F. Yergeau. 1998.



[1] Any catalog file which is valid according to this DTD is valid according to this Standard. However, catalog files which make use of extension elements or attributes may be valid according to this Standard but invalid according to this DTD, due to the limits of DTD validation with respect to namespaces.