[Mirrored from: http://www.hightext.com/IHC96/lmd7.htm]

Lois Delcambre <- Structured Map -> Structured Maps: Modeling Explicit Semantics over a Universe of Information

David Maier Lois Delcambre Lougie Anderson Radhika Reddy Lois M. L. Delcambre+
David Maier+
, Radhika Reddy+
, and Lougie Anderson++
lmd@cse.ogi.edu, maier@cse.ogi.edu, reddy@cse.ogi.edu, lougie@sequent.com

Oregon Graduate Institute Sequent Computer Systems +Oregon Graduate Institute
Portland, Oregon 97291-1000

++Sequent Computer Systems, Inc.
Beaverton, Oregon 97006

Abstract

Data Access <- Structured Map -> Topic Map The overwhelming accessibility to data, on a global scale, does not necessarily translate to widespread utility of data. We often find that we are drowning in data, with few tools to help manage relevant data for our various activities. This paper presents Structured Maps, an additional modeling construct at a level above available information sources, to provide structured and managed access to data. Structured Maps are based on Topic Navigation Maps, defined by the SGML community to provide multi-document indices and glossaries.

<- Intranet -> <- SCEL -> <- Sequent Corporate Electronic Library -> Application Universe <- Data Structures -> <- Information Universe -> <- Structured Map -> <- HTML -> A Structured Map is a modeling construct that provides a layer of typed entities and relationships where the entities can have typed references to information elements in the Information Universe. The type structure introduces semantics so that we know what sort of entities are being tracked and why various references have been made. Structured Maps can be placed over loosely structured data, e.g., document collections, with references at various levels of granularity. Structured Maps directly support new, customized, and even personalized use of the information. In this paper, we define Structured Maps and present several examples adapted from the Sequent Corporate Electronic Library (SCEL), an intranet resource currently implemented in HTML.

1. Introduction

<- WWW -> <- World Wide Web -> The amount of information represented electronically has been expanding since the beginning of the computer age. But our ability to access information has recently expanded to a global scale, through the Internet, the World Wide Web (WWW), and other technologies. Recent reports estimate the doubling rate of the WWW as around 6 months [1] .

<- WWW -> Data Mining One serious challenge presented by this volume of accessible information is organizing relevant information so that it is easily available for specific uses. As we encounter information, we often wish to remember it, e.g., with a bookmark on the WWW, or annotate it, e.g., with our opinion. We may wish to collect various information references together. As an example, a marketing analyst might collect references to web pages about relevant product descriptions as part of a marketing research project. We may wish to distinguish references to information according to various criteria: the references that support a particular point of view and those that argue against it.

<- Structured Map -> In general, we see this problem as a need to provide explicit, semantically tagged guidance to a given universe of information. We present in this paper the concept of Structured Maps to provide various organized, customized supplements to existing Information Universes. The definition of Structured Maps is adapted from Topic Navigation Maps [2], defined by the SGML community.

1.1 An Analogy

<- Road Map -> <- Structured Map -> Structured Maps share certain properties with conventional road maps. Consider the following excerpt from a road map [3], shown in Figure 1. The legend defines several types of entities, each with an iconic representation. Circles, in various sizes, represent cities and indicate their populations. An airplane silhouette represents an airport. Roads are indicated by various line styles, corresponding to the type of roads shown on the map. Other types of entities can appear on the map, as shown in the legend.

			This is an electronic version of this paper
			and the Rand McNally road map excerpt
			has not been encoded electronically, as of
			this printing.  Please imagine any conventional
			road map with its accompanying legend in
			order to follow the text.

<- Road Map -> Excerpt From a Road Map: Map from the Road Atlas 93 1992 by Rand McNally, R.L. 96-S-158 (with permission)

Figure 1

The graphical part of the map serves as a window onto some portion of the world; the information content of the map is clearly less than the total detail of the corresponding world slice. The information selected for representation in the map, for the most part, corresponds to real-world instances of the entity types shown in the legend. In most cases, the icon or symbol on the map is labeled, e.g., with city name, airport name, highway number, road name, etc. A road map describes a variety of potential destinations: the non-road objects, such as schools, cities, parks, etc. And the roads provide various navigational paths among the non- road objects. Note that all of the objects (represented iconically) are displayed visually and they are usually correctly positioned in geographic space according to the stated scale of the map. A conventional map uses a graphical language to describe entities in the world and some of their relationships.

Such a map may be more or less complete, depending on whether it represents all of the entities of the given type contained in the region it covers. Note also that the same collection of entity types could be used to model different slices of the world. Conversely, the same slice of the world could be modeled with a different collection of entity types, for another purpose. For example, we might want a map that indicates car rental locations, filling stations and auto repair shops.

Database Schema Information Universe <- Information Universe -> Structured maps can be viewed as an analogue of a conventional map, where the world being modeled is the space of online information, which we term the Information Universe. The counterpart of the legend is a definition, using a database-like schema, of the possible entity types and relationships between them. The main part of a Structured Map, corresponding to the graphical part of the road map, consists of instances of these entity and relationship types, where the entity instances have labels. Each entity instance is also explicitly connected to the item(s) in the Information universe it represents. This aspect is in contrast to a conventional map, where the connection between an entity instance and its referent is implicit via similarity in geometry.

<- Entity-Relationship Model -> <- Relationship -> <- Structured Map -> <- Entity -> These ideas are illustrated by the Structured Map depicted in Figure 2. The definition of the map is shown as an Entity-Relationship model [4], in OMT notation [5], at the top of the figure. The middle part of the figure represents entity instances, represented as rectangles with rounded corners as in OMT notation and instances of relationships, represented by labeled lines between entity instances. The entity instances that appear in the Structured Map conform to the types that appear in the Structured Map Definition just as the symbols that appear on a road map are generally listed in the legend. The bottom layer of Figure 2 represents the Information Universe, consisting of one or more information sources. The lines from the middle layer of Figure 2 to the bottom layer represent the correspondence between entity instances and items in this universe.

<- Information Universe -> <- Structured Map -> <- Topic Map -> For this example, entity types are Painter and Painting. The documents in the Information Universe are various books and articles about European artists. The only relationship type has the roles painter-of and painted-by, for the two ends of the relationship, with the obvious semantics. We see a connection between the artist, Gentile Bellini, and a paragraph in an encyclopedia that describes him. Similarly, we see a connection from "Miracles of the Relic of the Cross" to a paragraph in a guide book about art that describes the painting. Note this representation of a Structured Map is somewhat simplified, to support the analogy with road maps. A complete definition is given in Section 2. Also, Figure 2 is meant to show the various parts of a Structured Map; it is not the form in which a Structured Map is shown to users. (This example is adapted from one illustrating Topic Navigation Maps [6].) We envision that Structured Maps might be rendered in various ways. A road map, on the other hand, exists in a conventional, graphical rendering. The model behind the road map is partly explicit (in the symbols that appear in the legend) and partly implicit.

<- Structured Map -> Simple Structured Map

Figure 2

<- Information Universe -> <- Structured Map -> As with conventional maps, the same "legend" (entity and relationship types in the Structured Map Definition) can be used for Structured Maps of different regions of the Information Universe, with the relevant entity instances and relationship instances of interest appearing in the Structured Map Instance. Likewise, it is possible to construct several Structured Maps over the same Information Universe, but with different entity and relationship types, just as there can be different kinds of conventional maps of the same region. As with conventional maps, a Structured Map can be more or less complete or accurate. There is no guarantee per se that entity instances in a Structured Map are linked to all occurrences of relevant items in the underlying documents. Also, whereas an entity instance on a conventional map typically represents a single real world occurrence, an entity instance in a structured map may be connected to multiple items in the Information Universe, and different kinds of connections may be distinguished. For painters, we might distinguish information items that provide biographical information from items that simply mention the painter. In contrast to conventional maps, which are typically of uniform scale, entity instances in a single Structured Map can be connected to multiple items in the underlying universe and they can be of different granularities. For example, an entity might link to an entire document, a section, a paragraph, a diagram, a caption or even a single word or phrase.

<- Information Universe -> <- Structured Map -> Structured Maps provide an abstraction of the universe of information and a basis for semantically rich navigation in that universe much like a road map provides an abstraction of the real world and a basis for navigation by car in the world slice that is represented by the map. On a road map, we can travel from a particular city to a particular park by selecting from among the available roads that interconnect them. In an analogous manner, we could navigate from a city to its artists (if the Structured Map included a City entity type and the Born-In relationship type) and then from the artist to his or her works of art. At each step along the way we can stop and see the available detail about the information objects that are referenced. On a car trip, we can stop in any city and see the sights, many of which are not shown on the map. For the Structured Map, we could, for example, browse the guide book document when we arrive at the "Miracles of the Relic of the Cross" instance of the Painting type during navigation. The guide book clearly has much more information than is represented in the Structured Map.

<- Entity-Relationship Diagram -> <- Relationship -> <- Structured Map -> <- Entity -> One significant difference between roads (on a map) and relationships (in a Structured Map) is that roads correspond to actual physical roads but relationship instances are represented electronically. In effect, the designer of a Structured Map can create new "road types" and build new "roads" simply by defining and populating a Structured Map. We observe that although roads provide a navigational path to non-road objects (much like relationships provide navigational paths to entities), the legend of a map is not drawn like an Entity- Relationship diagram (ERD). Rather the road-type and the non-road-type icons are simply listed in the legend of a road map. Another difference is that all entity types shown in a Structured Map Definition have a single, database-style attribute, often called Title. Titles appear on a road map (as a label beside the icon) but the legend of a road map does not generally indicate the possibility of the Title attribute. Finally, note that in order to browse a Structured Map, we require some sort of display or visualization of the underlying information. For a road map, we are accustomed to a conventional, two-dimensional graphical display with iconic representation.

<- Structured Map -> This paper presents Structured Maps as a means to introduce explicit semantic structures over heterogeneous information sources. We see Structured Maps as distinct from and complementary to search engines that provide text searching and index capabilities. A full text search, for example, might identify every place where the string "Bellini" appears in a set of documents. Such a search is complete in that all places where "Bellini" is found will be indexed. A search is normally syntactic, lacking the ability to recognize places where the characters "B-e-l-l-i-n-i" appear but are not describing the painter of interest. Finally, a search is usually flat or homogeneous in the sense that all text can be searched and all references will point to the string "Bellini", in context.

<- Reference -> <- Structured Map -> Structured Maps, on the other hand, hold explicit references. They may or may not be complete. The references may be more accurate than those found by a search, assuming that only the actual descriptions of Bellini are referenced by the Structured Map. Structured Maps can also reference information items that discuss the painter but where the string "Bellini" does not actually appear. The items referenced from a Structured Map may be of any granularity and may have its own (local) semantics. A Structured Map can reference a chapter, a section, a paragraph, etc., as appropriate.

<- Structured Map -> It is possible that a Structured Map could be populated with the results of a search [6]. The resulting Structured Map could then be edited, by a knowledgeable person, to drop insignificant or nonsense references or establish additional references or relationships, not found by a search.

<- WWW -> <- Hyperlink -> <- Structured Map -> <- HTML -> We also see Structured Maps as distinct from links in the WWW. A Structured Map introduces entity types. In the WWW, there is no direct analog of entity types nor entity instances that are described separately from the HTML pages. Another distinction is that all references in a Structured Map, in relationship instances and in references-valued attributes, are typed. This means that we know the reason why the link is present, e.g., to connect to works painted by the painter vs. works that inspired the painter.

<- Database -> <- Information Universe -> <- Structured Map -> Structured Maps exhibit some characteristics of a conventional database but also offer additional capabilities, based on the three-level model. The basic premise of this research is to bring the modeling and querying capabilities of a database to a three-level model where the Information Universe includes loosely-structured documents with their own internal/semantic structure.

1.2 Organization

<- Structured Map -> Section 2 of this paper defines Structured Maps and presents the SGML/HyTime foundation for Structured Maps, the Topic Navigation Map Architecture. Section 3 describes our implementation of Structured Maps along with a discussion of issues that affect an implementation. Section 4 includes examples from a large-scale, corporate electronic library . Section 5 evaluates this work by comparing it with related topics in the database and digital library community. Section 6 concludes with a discussion of the contributions of this work and our current research plans.

<- Structured Map -> 2. Structured Maps

<- Structured Map -> We present the general features of Structured Maps in this section. Figure 3 shows a more detailed version of the example in Figure 2. The primary difference between these two figures is in the references from the entity instances to the information elements in the universe. In the definition of the Structured Map (the ERD part), each entity can have zero or more named, reference-valued attributes and each such attribute can be single- or multi- valued. The simplifying assumptions of Figure 2 were that each entity type had only one, single-valued reference attribute because icons on a road map generally correspond to a single object in the real world.

Anchor Role <- Reference -> <- Structured Map -> In Figure 3, the Painter entity type has two, distinct, reference-valued attributes: one to reference biographical information and the other to point to places where the artist is mentioned. The two attributes demonstrate the semantic distinctions that can be highlighted with a Structured Map. We can distinguish the role that the referenced information fills by using distinct reference-valued attributes. Figure 3 also demonstrates that individual references can be at various levels of granularity. Bellini has a "biography" reference that refers to an entire book "The Life and Art of Gentile Bellini" and a "mention" reference to a single paragraph in guide.sgm. Finally, Figure 3 demonstrates that an information element can be referenced by several reference-valued attributes.

<- Structured Map -> Three-level Model A Structured Map has a three-level model, as shown in Figures 2 and 3: the Structured Map Definition, the Structured Map Instance (i.e., the populated instance of the Structured Map) analogous to the map, and the underlying universe of information.

<- Entity-Relationship Diagram -> <- Information Universe -> Structured Map Structured Map Definition - The Structured Map Definition follows the normal conventions of an ERD, except that it is limited to entity types and relationships types, e.g., there are no generalization or aggregation links. Each entity type includes an attribute definition that will hold the user-visible title or name for an entity instance. This attribute can be viewed as the label beside the icon on a road map. Each entity type also can define one or more reference- valued attributes. Such attributes hold, as values, addresses of information elements in the Information Universe, and can be single- or multi-valued. The envelope icon shown in the middle layer of Figure 3 is used to represent the address(es) that appear on the reference- valued attributes (in that envelopes have addresses on them). Relationship types can be defined among entity types.

<- Structured Map -> Elaborated Structured Map

Figure 3

<- Database -> <- Structured Map -> Structured Map Instance - The Structured Map is populated much like a conventional database instance that conforms to its schema. A Structured Map is much less concerned about the domain definitions or representations for (referenced) information compared to a database. The attribute values are either a simple title (often a character string) or references to arbitrarily represented information.

<- DTD -> <- Document Type Definition -> <- Information Universe -> <- Element -> Information Universe Elements - An information element within a given information source in the Information Universe must be identifiable, addressable, and renderable. As an example, consider an SGML document [7, 8] where each structural element, such as title, abstract, section, or paragraph, is identified through the appropriate tags. Each element is addressable through the SGML ID attribute , provided that an ID attribute is permitted for that element type, as prescribed in the Document Type Definition (DTD) section for that element type. The ID attribute in SGML is reserved to hold the unique identifier (within this SGML document) for this element instance, e.g., this paragraph. Other SGML attributes in this same document may hold this identifier, e.g., to reference the paragraph. The SGML attribute holding the reference is declared to be of type IDREF. Each element is renderable through a variety of SGML-based tools that allow "style sheet" specification to describe the display [9]. SGML ID and IDREFs are not the only possible scheme for referencing document elements. One could use other schemes, such as byte ranges in files or object identifiers in a document database, including the addressing modes of HyTime [10].

<- DTD -> <- Entity-Relationship Diagram -> <- Relationship -> <- Structured Map -> <- Topic Relation -> <- Entity -> <- CApH -> <- Conventions for the Application of HyTime -> The modeling capability of a Structured Map is fairly elementary compared to most ERD models. But we are currently guided by the definition of the Topic Navigation Map, defined as part of the working group on the Conventions for the Application of HyTime (CApH). A Topic Navigation Map is represented as an SGML document [2]. The Topic Navigation Map uses the terms Topic, Topic Relation, Topic Title, and anchor role for the analogous terms of Entity, Relationship, Title, and reference-valued attribute in Structured Maps. As an example, the Topic Navigation Map SGML document that corresponds to the Topic Navigation Map of Figure 3 is shown in Figure 4. The Document Type Definition (DTD) for the Topic Navigation Map declares the desired Topic and Topic Relation types (i.e., the Entity and Relationship types, in Structured Map terminology) for the application. The content model for the document instance of the Topic Navigation Map consists of a disjunction of all Topic and Topic Relation types. This means that any number of Topic and Topic Relation instances can appear in any order in the document instance.

<- Topic Map -> Topic Navigation Maps depend on the use of several HyTime constructs, particularly those for linking and addressing. HyTime, the Hypermedia Time-Based Structuring Language, is an ISO standard (ISO 10744:1992) [10] that extends the semantics of SGML but is expressed in ordinary SGML syntax. The independent link (ilink) construct of HyTime allows links to be created distinct from the referenced elements; that is, the ilink exists separately from the information elements that are being linked. Both topic instances and topic relation instances are implemented as ilinks. Our motivations for following the definition of Topic Navigation Maps are:

<- DTD -> <- Entity-Relationship Model -> <- Relationship -> <- Structured Map -> <- Topic Map -> <- Entity -> 1. Topic Navigation Maps are currently being proposed as an ISO standard to provide multi-document indices, glossaries and table of contents [11]. This opens up the possibility that Structured Maps will have an ISO standard interchange format.

2. Topic Navigation Maps, although developed outside of the database community, use a basic Entity-Relationship model at the core. The Entity-Relationship model has proven to be widely understandable and useful for information representation and access, using database technology.

3. Topic Navigation Maps are defined using SGML [7] and HyTime [10]. This means that they provide a model to describe the structure and semantics of complex documents, a foundation currently lacking in conventional database models.

4. Topic Navigation Maps use the DTD to describe the structure of the document instance, analogous to the database schema and instance.

<- Topic Map -> <- Topic Navigation Maps -> Topic Navigation Maps differ from databases in several important ways. First, there are few constraints on attribute values. As an example, a topic title is viewed as (SGML) content and thus can consist of anything expressible in the declared notation for content. Also, the current definition of Topic Navigation Maps (implicitly) allows a reference on a reference- valued attribute as well as a reference on a topic relation instance to refer to any SGML ID (including an ID for a topic instance, a topic relation instance, or an information item from the universe of information). For example, a Topic instance can refer to other Topic instances or even to Topic Relation instances on a reference-valued attribute. Second, a Topic Relation instance is not constrained to connect topic instances of a prescribed type; they can connect any type of entity instances. As a silly but legal example, painted-bypainter-of could connect two painters.

<- Structured Map -> <- CApH -> <- Topic Navigation Maps -> The CApH Committee is currently considering extensions to the current definition of Topic Navigation Maps to provide more structured modeling [11], at least an option. We are currently following a database-style view of the Structured Map model although we leave open the possibility that such rigid structure may not be specified in every Structured Map Definition. Finally, we note that Topic Relations may be of any degree, binary, ternary or higher. One current implementation of Topic Navigation Maps supports only binary relations because they are easy to display for the user [2]. Structured Maps support n-ary relationships, without attributes.

Database Model <- Relationship -> <- Structured Map -> <- Entity -> As a database model, Structured Maps can be viewed in several ways. One (overly) simplified view would be to consider it as an Entity-Relationship model where attribute values can be any sort of information item. This view ignores the use of addresses on the reference-valued attributes and de-emphasizes the use of information items in situ. A closely related view is to consider the Structured Map model as an Entity-Relationship model with pointers or reference-valued attributes. We see the indirection provided by addressing as a key feature of Structured Maps. We include it as an explicit part of the conceptual model, as evidenced by our description of Structured Maps as a three-level model. We are particularly interested in the semantics of a query language for Structured Maps. Structured Maps can also be viewed as a binary model where entity instances are represented (only) as an identifier (e.g., an OID) and both relationships and reference-valued attributes are binary relations. Finally, note that the title for an entity could equivalently be represented as a reference-valued attribute where the title(s) are supplied in an additional document in the Information Universe. This approach of referencing an additional document can be used to provide annotation or other documentation for entities in a Structured Map.

Independent Link In the Topic Navigation Map of Figure 4, each Topic Type is declared as a HyTime ilink through the %topic parameter entity; each Topic instance has a Topic title (shown in the content model for each Topic declaration). Each Topic Type declares the names of its anchor roles through the value of the anchrole (SGML) attribute. By convention, the first entry in the anchrole attribute value is the Topic type followed by the names of the anchor roles. The #AGG following an anchor role name means that the anchor role is multi-valued. As an example, the Painter topic type in Figure 4 has an anchor role for "biography" and an anchor role for "mentioned". Each Topic instance has a corresponding set of zero or more addresses for each anchor role. As an example, the Painter instance for Gentile Bellini (with SGML ID = painter-GentileBellini) provides one SGML ID for each anchor role through the values of the linkends (SGML) attribute. This one SGML element (with this ID) consists of the list of addresses. Thus, adr-GentileBellini-mention is the SGML ID for the element that contains the list of addresses, each of which references an information item in the Information Universe. In HyTime, the document identifier is given (in the docorsub SGML attribute). The referenced information element in this case is the information element with the stated ID in the referenced SGML document in the underlying universe of information where Bellini is mentioned. As an example, the Painter instance for Gentile Bellini has the address for the entire "book" SGML document on the linkend for the "biography" anchor role and the ID n9 in the "art" SGML document as well as the ID n39 in the "guide" SGML document on the linkend for the "mentioned" anchor role. The way in which information references can appear is defined as part of the various HyTime addressing modes, including address by name, address by location, etc.

The details of HyTime are beyond the scope of this paper. Topic Navigation Maps are currently being proposed as an independent ISO standard because of their general utility. As declared in the Committee Draft submitted to ISO, a Topic Navigation Map provides a mechanism to define tables of contents, subject indexes, glossaries, and multi-lingual thesauri for a single document or for a set of documents [11]. The focus of a Topic Navigation Map is to highlight individual topics that appear in a set of documents and also to establish relationships among topics. As suggested by the name, a Topic Navigation Map is intended to support navigation, as with a general index or glossary in a conventional book, over multiple documents.

----tm.sgm

...... (standard SGML declaration) ........

<!ELEMENT document - O
(painter|painting|painted_bypainter_of)+><<!ATTLIST document
                id         ID        #IMPLIED
                HyTime     NAME      HyDoc
                boslevel   NUMBER    #IMPLIED ><<!ELEMENT  painter - -
(TITLE)+ ><<!ATTLIST  painter
                       %topic;
                       anchrole  CDATA  #FIXED "painter mentioned #AGG=
 biography #AGG"><<!ELEMENT  painting - - (TITLE)* ><<!ATTLIST  painting
                       %topic;
                       anchrole  CDATA  #FIXED "painting mentioned
#AGG"><<!ELEMENT  painted_bypainter_of - O EMPTY ><<!ATTLIST 
painted_bypainter_of
                       %topicrelation;
                       anchrole  CDATA  #FIXED "painted_by
painter_of">><<document>
<painter id=painter-BartolomeoVeneto mnemonic=BartolomeoVeneto
universe=all
linkends="painter-BartolomeoVeneto adr-BartolomeoVeneto-mention
adr-BartolomeoVeneto-
biography"><<title>Bartolomeo Veneto</title></painter><<nameloc
id=adr-BartolomeoVeneto-mention><nmlist docorsub=art>n7</nameloc><<nameloc id=adr-BartolomeoVeneto-biography><nmlist>null</nameloc>

Figure 4

<- Structured Map -> <- Topic Navigation Maps -> Structured Maps provide an alternative way to express Topic Navigation Maps using the language semantics and technology of database systems. Key features of Topic Navigation Maps that are preserved in Structured Maps are:

  1. <- Structured Map -> <- CApH -> <- Topic Navigation Maps -> the information represented in the Structured Map is expressible using SGML with HyTime and CApH [2] constructs, as prescribed in the forthcoming ISO standard for Topic Navigation Maps,
  2. <- Structured Map -> the Structured Map declares entity and relationship types (in the schema),
  3. <- Structured Map -> the instance of the Structured Map establishes the (typed and titled) entities and relationship instances (as a database extension),
  4. the entity instances include the references for the reference-valued attributes, and
  5. <- Structured Map -> the addressing mechanism and the interpretation of addresses are both expressed separately from the definition of the Structured Map; the Structured Map simply holds addresses (with appropriate metadata fields to describe the type of address).

<- Structured Map -> One of the major advantages of Structured Maps, based on the database semantics, is the query capability over the database extension. The challenges of Structured Maps, from a database point of view, are to define the semantics of the structural model as well as the query language for the three-level system and to implement the referenced information elements in a general-purpose manner.

<- Structured Map -> 3. Implementation of Structured Maps

<- Structured Map -> Our first prototype was designed to show the feasibility of representing the Topic Navigation Map definition and instance in a relational database. Figure 5 gives an overview of the first prototype. We have demonstrated the transfer of information from an SGML Topic Navigation Map document to a database and, in the reverse direction, the transfer of Structured Map instance information back into SGML documents.

<- Information Universe -> <- Topic Relation -> <- Topic Navigation Maps -> This first prototype used a commercial product, called EnLIGHTeN [6], as the baseline for the exercise. EnLIGHTeN supports a browser for Topic Navigation Maps over SGML documents. At the heart of the EnLIGHTeN systems is a proprietary HyTime engine called HyMinder [12]. With EnLIGHTeN, the Topic Navigation Map SGML document plus all of the referenced SGML documents in the underlying Information Universe are parsed by an SGML parser and then stored in HyMinder . HyMinder validates the HyTime constructs and then supports their semantics during runtime. As an example, HyMinder supports bi-directional navigation across each Topic or Topic Relation ilink and resolves all addresses that appear on the reference-valued attributes.

In our first prototype, we parsed a Topic Navigation Map SGML document from EnLIGHTeN and populated an Informix [13] relational database with both the Topic Navigation Map definition and instance information. We defined a generic schema for this purpose, e.g., with relations for entity types and entity instances. The referenced (SGML) documents were not placed in the Informix database, although the addresses on the referenced-valued attributes were included. Only the information represented in the top two levels of Figure 3 was placed in the database.

<- Information Universe -> We issued SQL queries against the Informix database and were able to freely join across relationships. This facility is useful for query answers that consist only of entity titles. Sometimes a query answer included addresses (from the reference-valued attributes) in the form of HyTime addresses referencing the SGML documents from the underlying Information Universe. But there was no direct support in our prototype for interpreting, dereferencing, or displaying the contents of these addresses. We could do matching for equality on addresses but we were not able to navigate to or otherwise interpret the referenced information elements in the underlying SGML documents. (This limitation is in direct contrast to EnLIGHTeN, which provides a fully integrated Topic Navigation Map browser with a built-in SGML browser for the underlying documents, based on the centralized representation of the Topic Navigation Map and the documents in HyMinder.)

3.1 The CARTE System

<- Information Universe -> <- Structured Map -> <- HTML -> We considered using HyMinder for a more sophisticated prototype, but opted instead to further explore the use of conventional database technology because HyMinder uses a proprietary data structure and does not focus on query processing and schema management . Our second prototype, called CARTE 1.0 (for Context-Assisted Restructuring via Topical Extensions), also uses Informix as the repository for Structured Map Definition and Instance information. Like its predecessor, CARTE does not store the documents from the underlying Information Universe. One difference in CARTE, compared to our first prototype, is that we used HTML pages as the underlying information sources in the Information Universe rather than SGML documents. We used URL's and the NAME HTML attribute to formulate addresses for the reference-valued attributes in the Structured Map.

<- Structured Map -> <- HTML -> Figure 6 presents a screen image of the CARTE system. CARTE provides a browser for the database containing a Structured Map. We use the multiple frame capability of Netscape 2.0 to present three different, synchronized frames. Within CARTE, the upper-left frame shows the schema (in OMT notation). The right frame in CARTE displays the instance information from the Structured Map. In Figure 6, the instance frame is showing all of the available entity types, in list form. This is the initial instance screen contents, when the user begins viewing a Structured Map, and has not yet selected an entity type or navigated to a particular entity instance. When the user clicks on one of the entity types, the instance screen shown on the right side of Figure 7 is presented. The entity type is shown followed by the titles of all entity instances of that type. The user can click in this frame to navigate to an individual entity. Such a selection results in the instance screen shown in Figure 8, with one entity title listed with all its available relationships (to navigate from one entity to another) and reference- valued attributes (to navigate to an underlying HTML page). The example shown here is adapted from one developed for EnLIGHTeN [6].

CARTE Screen

Figure 6

<- Structured Map -> <- HTML -> When a user navigates across a relationship, the user sees the entity instance screen for the target instance. When a user navigates to any of the addresses appearing on any of the reference-valued attributes, another Netscape process is invoked to view the underlying, referenced HTML page, as shown in Figure 9. At any time while the second Netscape browser is operational, the user can return to the CARTE interface and proceed with other navigational steps. At present, we do not support inverse links for the reference-valued attributes. That is, it is not possible to navigate from the underlying HTML page "upwards" into the Structured Map from an information element whose address appears in one or more reference-valued attributes. This implementation of CARTE uses the underlying HTML page in situ. The underlying HTML pages have no knowledge of the Structured Map.

<- HTML -> The lower-left frame in the main CARTE screen is intended to give the user a sense of "you are here" during navigation by describing the current context. The context frame, appends an informational message to a scrolling list each time the user takes a navigational step. Thus, the context frame shows the progression, for example, from an entity instance, across a relationship instance, to another entity instance, down to an underlying HTML page, and so forth.

CARTE Screen

Figure 7

<- Structured Map -> CARTE provides a navigational browser for Structured Maps. The Informix database for CARTE contains the Structured Map Definition and Instance as well as the addresses appearing on the reference-valued attributes. Each user action taken in the CARTE interface results in an SQL query being issued to the Informix database, followed by the appropriate presentation of information in the CARTE frames. The three CARTE frames are always synchronized with the latest user action.

CARTE Screen

Figure 8

3.2 Discussion

<- Information Universe -> <- Structured Map -> A key feature of Structured Maps is the delegation of responsibility for identifying and rendering information from the underlying universe. Structured Maps are distinct from hypermedia database systems [14,15] because the hyperdocument created by a Structured Map is layered over existing information, in situ. It does not require modifications of base documents, such as the insertion of URLs, to create connections among those documents. The bridge between the underlying Information Universe and the Structured Map is the addressing mechanism. Figure 10 summarizes the features of the Topic Navigation Map specification, the EnLIGHTeN product, and our two prototypes. Each aspect is listed in the left column of Figure 10 and discussed below.

<- Structured Map -> Any implementation of Structured Maps must deal with the addresses of identified information elements for two different purposes. First, when populating a Structured Map, it is necessary to select information elements of interest and place their addresses on the selected reference-valued attribute for the proper entity instance. Second, for browsing a Structured Map, any traversal of a reference-valued attribute must present the address for interpretation and rendering of the information element. Tools for authoring and browsing may take various approaches for establishing and interpreting addresses.

<- Structured Map -> To populate a reference-valued attribute (during Map creation or modification), one possibility is to generate the references using an automatic indexing technique (that might be adjusted by a knowledgeable human user). Another possibility is to mark information elements in the underlying universe with the entity type, entity instance title, and reference- valued attribute name. This approach would allow the establishment of an address on the proper reference-valued attribute for the proper instance; it has been implemented in EnLIGHTeN [6]. Yet another possibility would be to support visual display of both the Structured Map and the information sources with an easy point and click identification of referenced information elements. Finally, it is always possible to create Structured Maps by hand, including the placement of addresses for reference-valued attributes. The support for map creation of the various systems is shown on line 1 of Figure 10.

<- HTML -> CARTE Screen - Second Netscape Window (to view underlying HTML)

Figure 9

<- Structured Map -> A closely related issue, relevant to any browsing or navigational capability is: How will the information be viewed? It is not immediately obvious how various users or application domains might want to see Structured Map Definitions and Instances. For CARTE, we present the schema (i.e., Structured Map Definition) explicitly in the user interface. Perhaps because of our history with working with databases, it seems quite natural for us to show the navigation paths through the schema as well as the instance of the Structured Map. Note that EnLIGHTeN currently does not show the schema to the user. There is also the issue of how to present instance information for the Structured Map Instance. In CARTE, we generate lists of possible traversals, that are appropriate at each navigational step. For an entity instance, we list all available relationship traversals, by type, and all addresses that appear in reference- valued attributes, by attribute name. In general, we can imagine a set of tools that allow an interface designer to freely configure the way in which Structured Map Instance information is displayed. The second line of Figure 10 summarizes the current display choices of the various systems.

SYSTEM:
ASPECT:
Topic Navigation Map Specification EnLIGHTeN First Informix Prototype CARTE 1.0
1 support for map creation unspecified markup in information elements with automatic indexing none by hand (using SQL CREATE TABLE & Insert statements) 2 choice of display for the Map unspecified HTML pages generated for output none - other than relational schema frames generated for Netscape based on Informix data
3 addresses supported any HyTime address SGML IDREF/ID plus docorsub reference to other documents SGML IDREF/ID plus docorsub reference to other documents URLs plus the SGML NAME attribute
4 identification mechanism any identifiable element, supported by the addressing scheme any element instance in the SGML document with an ID attribute, identified by start and end tags any element instance in the SGML document with an ID attribute, identified by start and end tags any element, in the HTML tag set, that has an NAME attribute
5 rendering mechanism unspecified early version: built-in, proprietary SGML browser. current version: all screens generated in HTML format; uses WWW browser. none WWW browser (Netscape 2.0)
6 heterogeneity of underlying information source(s) supported any type supported by HyTime addressing SGML SGML HTML
7 tight/loose integration of underlying information sources unspecified tight everything stored in HyMinder with all links maintained loose -- SGML documents outside of the Informix DB loose -- HTML pages outside of the Informix DB

Choices Concerning Implementation Issues

Figure 10

<- Information Universe -> Lines 3 through 5 of Figure 10 indicate the current choice of the various prototypes regarding the method used to address, identify, and render the information elements from the Information Universe. Perhaps the most important point is that the various HyTime addressing modes are standard, by virtue of the fact that HyTime is a standard (ISO 10744:1992) . This standardization enables the delegation of the identification and rendering of information elements via shared addresses. Note also that SGML implicitly provides a way to identify information elements, through markup, and a way to address them by name, through the SGML ID attribute.

<- Structured Map -> <- CORBA -> The final two lines of Figure 10 deal with the heterogeneity of underlying information and the type of integration between the Structured Map and the underlying information sources. The heterogeneity of information is unlimited, at the conceptual level. Any type of information that can be meaningfully identified, addressed and rendered can participate in a Structured Map. The challenge of heterogeneity has to do with the availability of an addressing mechanism and the connection, at runtime, with the independent technology responsible for the rendering. This second challenge might be addressed through various interoperation models such as CORBA [16 ] and COM/OLE [17].

<- WWW -> <- Entity-Relationship Model -> <- Information Universe -> <- Relationship -> <- Structured Map -> <- Entity -> Finally, the issue of loose or tight integration of the Information Universe with the Structured Map presents all of the classic tradeoffs between a tight or loose federation. A tightly- integrated, centrally-managed repository can offer the advantages of conventional database technology such as concurrency control during update, optimization for access, query optimization, etc. Also, bi-directional links can be more easily maintained, with their integrity ensured. But the scaleability of such systems may be limited. A loose integration offers the advantage of autonomy for the underlying information sources. Such a choice is particularly appropriate for an environment where we have access to information that we do not own, e.g., on the WWW. The use of Structured Maps in this environment provides a rich mechanism for foreground information . A Structured Map can be viewed as a complex bookmark, with the structuring capability of an Entity-Relationship model.

<- Digital Library -> <- Structured Map -> 4. Structured Maps in Digital Libraries

<- SCEL -> <- Sequent Corporate Electronic Library -> <- Structured Map -> While we have defined a particular formulation of Structured Maps, similar constructs are already in use in digital libraries. In this section, we highlight portions of a particular digital library, the Sequent Corporate Electronic Library (SCEL), that resemble a Structured Map. These portions of SCEL are currently hard-coded into Web pages. One of our research goals is to provide a more automated and flexible means to construct this portion of a corporate digital library.

<- SCEL -> Highest Table of Contents Level Screen in SCEL (sketch of actual page) Figure 11

<- Intranet -> <- SCEL -> <- Sequent Corporate Electronic Library -> <- HTML -> SCEL is an intranet-based system that provides access to a rich and varied set of corporate information resources to over 2,500 Sequent employees. The system has been operational for about 18 months and the scope and utility of the resource has grown steadily. SCEL is also used to manage routine requests for services through the forms interface of HTML. SCEL is entirely implemented in HTML, relying on Web browsers to provide a uniform, easy-to-use, easy-to-learn interface. But there are a number of striking analogs of Structured Maps present in SCEL.

<- SCEL -> One of the top-level pages in SCEL is shown in Figure 11. This screen presents a number of navigational choices; all of the labels shown here are clickable to proceed to more information. SCEL users can view this screen as a visually-displayed table of contents, with entries for Sequent, Suppliers, Offerings, Partners, Channels, and Market. There are also subentries for Sequent, consisting of History and Values, Internal Processes, Organization Charts, Library Employee Services, and Education. The Market entry is further subdivided into Customers and Competitors.

If we click on Internal Processes we see the page shown in Figure 12.

Internal Processes -- How do we do it?

* Admin Handbook On-Line
This handbook provides administrative assistants with the most up-to-date information to assist them in performing their duties.

* Admin Services
Business Cards, name Tags and Name Plates, Pagers, Phone Cards, Scheduling Conference Calls, Scheduling Video Conferences

* American Operations
AO Field IS, Complex Issue Escalation, Courtesy Visits, customer Visits, Executive Briefing Centers, Field Office Guidelines, Major Account Plan

* Beaverton Facilities Operations
Admin Services, Card Keys, Dry Cleaning and Laundry Services, Facilities Maintenance, Move and Furniture Services, Recycling, Security

* Benchmarking Guidelines

<- Data -> <- Database -> * Corporate Marketing
Complex Systems Consultancy Service, Customer Information Database, ImagePIX Program, Ordering Literature, Photo Library, Poster Program, Price Book, Sales Proposal Builder, System Competency Center Task Creation

* Global Marketing Home Page
Guide to Partner Management

<- SCEL -> Second Level Table of Contents in SCEL (sketch of actual page) Figure 12

This page shows a classical nested table of contents structure, with levels 3 and 4 shown. Levels 1 and 2 of the table of contents is shown in Figure 11. The listed entries in Figure 12 include "Admin Handbook On-Line" and "Admin Services". Each has a number of subentries such as "Business Cards" and "Name Tags" for "Admin Services". Each of these entries leads to either more detailed levels of the table of contents or directly to information sources of various kinds.

<- SCEL -> <- Structured Map -> We see this nested table of contents metaphor as a powerful, recurring structure intended to organize access to information. One possible Structured Map Definition capturing the same information is shown in Figure 13. This structure has also been implemented in CARTE. The schema in Figure 13 is simple but it represents the generic nature of a nested hierarchy used as a table-of-contents. The entity title is used to contain the table-of-contents entry name. The has-subentries relationship captures the hierarchical structure of table-of-contents entries, and the reference-valued attribute leads off to the referenced document elements. Although this example uses just one reference-valued attribute, it is possible to have more than one, for different purposes. This structure is actually a bit more general than what is present in SCEL. For example, a table-of-contents entry could lead to a set of references, and these references can be subelements of a document. Such a Structured Map could be visualized in multiple ways, such as the generic interface in Figure 6 or a more specialized interface as used by SCEL. Sophisticated displays as used by SCEL would require additional styling and layout tools.

<- Structured Map -> There is another view of Figure 11 that suggest a different metaphor. With the exception of the entries inside the box labeled Sequent, Figure 11 represents different entity types as found in an ERD. There are even some relationships suggested, such as Channel between Offerings and Market. Figure 11 represents an iconic display for a Structured Map Definition. It also demonstrates several choices for visualization. The Sequent entries are placed inside the icon in list form. You could even imagine a scrolling list inside an icon. The other entity types are seen by clicking on the icon. When defined as a Structured Map, the information organization and navigation can reflect the rich structure, perhaps navigating from partners who are also customers or partners or suppliers who also serve as channels. Figure 9 thus suggests some of the potential for introducing entity and relationship types and instances over an underlying set of information sources.

<- SCEL -> <- Structured Map -> Note that Figure 11 presents a high-level picture of the value chain for Sequent. A value chain represents a view of the organization highlighting the suppliers and customers, at any level. The value chain was chosen as the highest level view of Sequent, in part, because it provides an easy way to place documents (into SCEL) and an easy way to find them. We are currently exploring the possibility of representing the value chain, in a similar form, at lower levels in the Sequent organization and reflecting it in a Structured Map.

5. Related Work and Evaluation

<- Entity-Relationship Model -> <- Structured Map -> <- Topic Map -> <- Topic Navigation Maps -> The most closely related work is clearly the definition of Topic Navigation Maps, based on HyTime and SGML. The Topic Navigation Map provides the conceptual foundation for Structured Maps. The contribution of the Topic Navigation Map is the emerging ISO standard, to provide a precise syntax and an interchange format for Structured Maps. The semantics of Topic Navigation Maps derive from SGML and HyTime in that the choice of certain SGML or HyTime elements imply certain semantics. The use of database technology to support Structured Maps brings a long tradition of ERD modeling, clear semantics for collections of instances, a well-defined query languages with query optimization techniques. It also opens up the possibility of managing the updates to a Structured Map using DBMS support.

Federated Database <- Structured Map -> One related area of work in the database field is federated databases, where autonomous databases can be viewed conceptually as a single database with a single (integrated) schema. [18,19]. In the case where each participant underlying information sources is a database with a schema and where the Structured Map Definition corresponds to a global schema, then a Structured Map is a federated database. Our work focuses on three aspects of Structured Maps that are outside of the main focus of federated database research: (1) the introduction of more than one Structured Map, where the entities and relationships type have not necessarily been captured in that form elsewhere, (2) the use of Structured Maps over loosely structured information sources such as documents, spreadsheets or video, and (3) the use of Structured Maps over in situ information where an addressing mechanism bridges the two different implementation environments.

<- Entity-Relationship Model -> Information Management Intelligent Integration of Information Architecture <- Relationship -> <- Structured Map -> <- Entity -> Another area related to this work is the Intelligent Integration of Information (I3) Architecture with intelligent mediators introduced to facilitate interaction of various information sources and services [20]. The I3 program is much more ambitious in its goals and in its techniques, employing intelligent agents to analyze information sources, for example. Structured Maps adopt a simplified Entity-Relationship model, with the focus on explicit representation of information and connections, much like traditional database systems. Structured Maps could conceivably be exploited in the I3 architecture because of the additional semantics they provide. Finally, Structured Maps focus on autonomous systems for information management, but with a simple, well-defined interface to identify, address, and render information.

Database View <- Structured Map -> Structured Maps provide a type of database view, a materialized view [21]. They differ from conventional views because they may introduce entities and relationships that are not present explicitly in any of the underlying information sources. Another difference is that a Structured Map presents a "view" in the form of a database. So it is somewhat similar to database-structured query answers [22].

Within the digital library research community, e.g., [23, 24], there is little focus explicitly on the conceptual model for information. Much work has focused on various aspects of searching such as the user interface and the performance. Some work has focused on spatial modeling, representation and searching, e.g., [25, 26, 27].

<- DTD -> <- Information Universe -> <- SGML Database -> <- Structured Map -> Another related area of research is efforts to store SGML documents directly in a database, capturing the DTD structure explicitly in the schema, e.g., [28]. Such work is complementary to this research; we do not consider the modeling or the representation of SGML documents. An SGML database could offer an appropriate repository for the SGML items in the Information Universe and could support a centralized approach to Structured Maps. SGML database do not explicitly set forth a three-level model such as Structured Maps. Another research effort deals with non-SGML structured text in a digital library [29]] . 6. Conclusions and Future Work

<- Entity-Relationship Diagram -> <- Structured Map -> <- Topic Map -> Structured Maps provide a new level of modeling that can be introduced over various information sources. The form of the model, an ERD, is familiar and ubiquitous. This similarity suggests that the utility of the model and the application of conventional database tools and technology are both likely to be high. We focus in our work on the implications of the three-layer model. Work in progress is considering the semantics of a query language for Structured Maps. A particular issue is how to provide support for queries that span the Structured Map and the underlying document content. Other challenges include the interaction among multiple Structured Maps as well as the interface between a Structured Map and an underlying database schema. Both of these topics are related to schema integration but may differ in the details because of the simplicity of Topic Maps, essentially without conventional attributes and without concern for the representational details of the underlying information. This avoidance of representational focus (and the accompanying reduction in representational conflict) is facilitated by the delegation of the responsibility for identification, addressing and rendering.

<- Information Universe -> <- Topic Map -> Topic Navigation Maps are defined as an SGML document which serves as a standard data interchange format. But there is no specification of any tools or uses of Topic Navigation Maps in the standard. There are no operational semantics nor any prescribed usage model for Topic Navigation Maps. EnLIGHTeN has pursued one usage model: browsing a Topic Navigation Map along with browsing the information sources in the Information Universe in one integrated tool. In this research we are pursuing a slightly different usage model where the interpretation and browsing of information sources is in separate tools and where database-style queries are supported, in addition to browsing.

DDL <- Information Universe -> <- Querying -> <- Structured Map -> <- Topic Map -> The language associated with Structured Maps is quite non-traditional. The foundation for Structured Maps, the Topic Navigation Map SGML document, provides a standard syntax for representing the DDL, instances, and addresses for reference-valued attributes, albeit buried in the cumbersome syntax of SGML. But it does provide a standard data interchange format that benefits from SGML generality, such as the ability to declare notations for content as well as for the markup. We choose to express the definition and instances of a Structured Map using a conventional database, i.e. without an explicit lexical language, for the moment. Another key difference between Structured Maps and conventional databases languages is the delegation of address interpretation. From a language point of view, we envision Structured Maps using embedded languages for addressing and even for querying the information Universe, much like SGML supports arbitrary notations and HyTime supports addressing in an external syntax. Structured Maps basically include addresses that are bound outside of the system.

<- WWW -> As a final comment, there are a number of aspects of Structured Maps that are important but outside of our current focus. These include: details of address syntax, semantics, and interpretation, implementation-level connections with underlying information sources, update and the consistency of links, etc. We do not focus on these issues, in part, because there is evidence that such work is in progress, elsewhere, e.g., through object interoperation models and various database-oriented implementations of WWW servers and other work on interoperability, e.g., [30, 31].

7. Acknowledgements

Catherine Hamon <- Michel Biezunski -> <- Steven R. Newcomb -> High Text TechnoTeacher The authors gratefully acknowledge the techical support, through personal interaction and use of technical products, from Michel Biezunski and Catherine Hamon of High Text, S.A.R.L. and Steven Newcomb of TechnoTeacher, Inc. Both companies graciously made their products, EnLIGHTeN and HyMinder, available for use in this research.

<- Reference -> 8. References

This research was funded by a grant from the National Science Foundation, NSF Award Number: IRI- 9502084.

<- SCEL -> <- Sequent Corporate Electronic Library -> <- HTML -> The examples are taken from the Sequent Corporate Electronic Library, SCEL, a large-scale, production intranet library for Sequent employees implemented using HTML and Netscape.

SGML terminology conflicts with database terminology, for the terms entity, attribute, and others. Here "attribute" is used in the SGML sense, but database terminology prevails in the bulk of the paper.

<- Structured Map -> The anchor role corresponds to reference-valued attributes in Structured Maps.

<- HTML -> Product development plans for HyMinder and for EnLIGHTeN include support for incremental parsing of documents. Also, EnLIGHTeN supports both HTML and plain ASCII text as input (as the underlying information sources) but they are currently translated into SGML for internal storage and processing.

<- Topic Map -> HyMinder focuses on the difficult problem of enforcing SGML and HyTime syntax and semantics. We note that Topic Navigation Maps use a small subset of SGML and HyTime features.

<- FSI -> Work is also in progress to generalize and standardized references to files through the Formal System Identifiers and Storage Managers (FSISM) draft document submitted to ISO.

A term coined by Bill Scherlis of CMU for local, supplementary information, such as bookmarks.

<- WWW -> [1] Network Wizards, http://www.nw.com/zone/WWW/report.html. Internet Domain Survey, January 1996.

<- Michel Biezunski -> <- Topic Map -> [2] M. Biezunski. Modeling hyperdocuments using the Topic Map Architecture. In International HyTime Conference, Vancouver, August 1995. Available under HyTime Conferences at http://www.techno.com.

[3] Map from the Road Atlas 93 1992 by Rand McNally, R.L. 96-S-158. (with permission) Rand McNally & Company.

<- Data -> <- Database -> [4] P. P. Chen. The entity-relationship model: Toward a unified view of data. In ACM Transactions on Database Systems, volume 1, pages 9-36, 1976.

<- Object -> [5] J. Rumbaugh, M. Blaha, W. Premerlani, F. Eddy, and W. Lorensen. Object-Oriented Modeling and Design. Prentice Hall, Englewood Cliffs, New Jersey 07632, 1991.

High Text [6] High Text, S.A.R.L., http://www.hightext.com. High Text Home Page.

High Text TechnoTeacher [7] Modern SGML, Explicit Foundations, Create New Possibilities. High Text, S.A.R.L., 5 rue d'Alsace, 75010 Paris France and Techno Teacher, Inc., 3800 Monroe Avenue, Pittsford, New York, 14534-1330 USA.

[8] M. Bryan. SGML An Author's Guide to the Standard Generalized Markup Language. Addison-Wesley, 1988.

<- DSSSL -> [9] http://www.jclark.com/dsssl, DSSSL Home Page.

High Text TechnoTeacher [10] Overview of the HyTime Standard. High Text, S.A.R.L., 5 rue d'Alsace, 75010 Paris France and Techno Teacher Inc., 3800 Monroe Avenue, Pittsford, New York, 14534- 1330 USA.

[11] Information Processing -- SGML Applications -- Topic Navigation Maps, WG8 N 1806, ISO/IEC JTC1/SC18/WG8, Document Procesing and Relation Communication -- Document Description and Processing Languages, (committee draft).

TechnoTeacher [12] TechnoTeacher, Inc., http://www.techno.com. TechnoTeacher Home Page.

[13] Informix Software, Inc., http://www.informix.com. Informix Home Page.

[14] H. Maurer. Hyper-G: Advancing the ideas of the World-Wide-Web. Available at http://www.chemie.fu-berlin.de/outerspace/doc/hyper-g-abs.html.

<- Data -> <- Database -> [15] R. Sacks-Davis, T. Arnold-Moore, and J. Zobel. Database Systems for Structured Documents. In Proceeding International Symposium on Advanced Database Technologies and their Integration (ADTT'94), Nara, Japan, October 1994.

<- Object -> [16] R. Ofali, D. Harkey, J. Edwards. The Essential Distributed Objects Survival Guide, John Wiley & Sons, 1996, Part 2, pages 43-216.

<- Object -> [17] R. Ofali, D. Harkey, J. Edwards. The Essential Distributed Objects Survival Guide, John Wiley & Sons, 1996, Part 2, pages 283-296.

[18] A. P. Sheth and J. A. Larson. Federated database systems for managing distributed, heterogeneous, and autonomous databases. In ACM Computing Surveys, volume 22, pages 183-237. Association for Computing Machinery, Inc., September 1990.

<- Data -> [19] J. L. Zhao, A. Segev, and A. Chatterjee. A universal relation approach to federated database management. In Proceedings Eleventh International Conference on Data Engineering, pages 261-270, Taipei, Taiwan, March 1995. IEEE Computer Society Press.

[20] S. Adali and V. S. Subrahmanian. Amalgamating knowledge bases, II: Distributed mediators. In International Journal of Intelligent Cooperative Information Systems, December 1994.

<- Data -> [21] J. L. Lu, G. Moerkotte, J. Schue, and V. S. Subrahmanian. Efficient maintenance of materialized mediated views. In Proceedings of the 1995 ACM SIGMOD, International Conference on Management of Data, pages 340-351, San Jose, California, June 1995.

[22] K. Dittrich, W. Gotthard, and P. C. Lockemann. DAMOKLES - A database system for software engineering environments. In Proceedings IFIP Workshop on Advanced Programming Environments, Lecture Notes in Computer Science, 1986, Springer-Verlag.

[23] Digital Libraries, Communications of the ACM, April, 1995, volume 38, number 4, pages 22-86, collection of papers on various aspects of digital libraries.

<- Digital Library -> [24] Digital Library Initiative, IEEE Computer Magazine, May, 1996, volume 29, number 5, pages 22-76, collection of papers on various aspects of digital libraries.

<- Digital Library -> [25] T. R. Smith and J. Frew, "Alexandria Digital Library", pages 61-62 in [24].

Zone [26] C. Kacmar and D. Jue, "The Information Zone System", pages 46-47 in [24].

<- Digital Library -> <- Reference -> [27] T. R. Smith, "A Digital Library for Geographically Referenced Materials", pages 54-60 in [25].

[28] V. Christophides, S. Abiteboul, S. Cluet, and M. Scholl, "From Structured Documents to Novel Query Facilities", Proc. of the 1994 ACM SIGMOD Conference, Minneapolis, MN, ACM SIGMOD Record, volume 23, number 2, June 1994, pages 313-324.

[29] S. Chawathe, H. Garcia-Molina, J. Hammer, K. Ireland, Y. Papakonstantinou, and J. Widom. The Tsimmis project: Integration of heterogeneous information sources. In Proceedings 100th Anniversary Meeting, Information Processing Society of Japan, pages 7-18, Tokyo, Japan, October 1994.

[30] B. Schatz, W. Mischo, T. Cole, J. Hardin, A. Bishop, and H. Chen, "Federating Diverse Collections of Scientific Literature, pages 28-36 in [25].

<- Digital Library -> <- Object -> [31] A. Paepcke, S. Cousins, H. Garcia-Molina, S. Hassan, S. Ketchpel, M. Roscheisen, T. Winograd, "Using Distributed Objects for Digital Library Interoperability", pages 61-68 in [25].