SEARCH
Advanced Search
ABOUT
Site Map
CP RSS Channel
Contact Us
Sponsoring CP
About Our Sponsors
NEWS
Cover Stories
Articles & Papers
Press Releases
CORE STANDARDS
XML
SGML
Schemas
XSL/XSLT/XPath
XLink
XML Query
CSS
SVG
TECHNOLOGY REPORTS
XML Applications
General Apps
Government Apps
Academic Apps
EVENTS
LIBRARY
Introductions
FAQs
Bibliography
Technology and Society
Semantics
Tech Topics
Software
Related Standards
Historic
|
| SGML/XML Bibliography Part 7, S - Z |
[CR: 19961009]
Sabasteanski, Anna. "Use of the Electronic Manuscript Standard at the New England Journal of Medicine." EPSIG News 2/1 (March 1989) 1-2. ISSN: 1042-3737. Author's affiliation: Medical Publishing Group, Massachusetts Medical Society.
The New England Journal of Medicine is a key publication of the Medical Publishing Group, Massachusetts Medical Society, and it plays part in the Society's adoption of SGML-based publishing technologies. Canonical database files using SGML encoding are used to produce different versions of the journal for CDROM, paper, and online access. The AAP's SGML DTD is the basis for the information structuring in the knowledgebase.
See the entry for Elsevier Scientific Publishers for other information, or the bibliographic entry for the Elsevier DTD documentation.
[CR: 19960812]
Sacks-Davis, R.; Arnold-Moore, T.; Zobel, J. "Database Systems for Structured Documents.." IEICE Transactions on Information and Systems E78-D/11 (November 1995) 1335-1342 (with 26 references). Authors' affiliation: Collaborative Information Technology Research Institute (CITRI), Carlton, Victoria, Australia Home Page Contact: Ron Sacks-Davis.
"Abstract: Documents stored in a database system can have complex internal structure described by languages such as SGML. How to take advantage of this structure presents challenges for database system implementers. We classify the types of queries that need to be supported by SGML conformant database systems. We then describe several data models that have been proposed for representing documents in a database system and discuss the support these models provide for SGML. Finally we consider query evaluation."
For further information on SGML-related research at RMIT/CITRI, see the main entry for RMIT - MDS.
Sacks-Davis, Ron; Arnold-Moore, Timothy; Zobel, Justin. Database Systems for Structured Documents. Technical Report. [Prepared for] International Symposium on Advanced Database Technologies and their Integration (ADTI'94), 1994.. Nara, Japan: [PUBLISHER?], 1994. 13 pages, 33 references.
Said, Carolyn; McManus, Neil. "SGML Standard Will Star in Boston Seybold Show." MacWeek 7/15 (April 12, 1993) 1,124. ISSN: 0736-7260.
Note on the prominence of SGML publishing technologies at the Seybold Seminars '93 trade show.
[CR: 19970817]
Salminen, Airi; Kauppinen, Katri; Lehtovaara, Merja. "Towards a Methodology for Document Analysis." Pages 644-655 (with 24 references) in Structured Information/Standards for Document Architectures. Edited by Elisabeth Logan and Marvin Pollard. = Journal of the American Society for Information Science, Special Issue. Volume 48, Number 7 (July 1997). New York: John Wiley & Sons Inc., 1997. ISSN: 0002-8231. Authors' affiliation: Departmenf of Computer Science and Information Systems, University of Jyväskylä, P.O. Box 35, FIN-40351, Jyväskylä, Finland. Email: airi@cs.jyu.fi.
Abstract: "A great deal of the collective knowledge of organizations is stored in documents. To be able to use documents effectively, the information structure in the documents should be carefully planned. International standards, for example SGML, have been developed for defining document structures. The definition method however is not enough. For defining effective document standards for an organization, a profound document analysis is needed. In the analysis, current documents and document management practices should be studied and described before developing new document structures and document management practices. The development of a methodology for document analysis is going on in a project studying legislative documents produced in the Finnish government and parliament. The article describes the first results of the project. As the document structure definition method, SGML is used in the project. The analysis method is developed and extended from an object-oriented method. The article introduces the main phases of the analysis: Domain definition, object modeling, state modeling, and content modeling. The application of the methodology in the case project and the data gathering methods used are also described."
See the main document entry for the complete list of articles and contributors, as well as other bibliographic information.
[CR: 19951110]
Salminen, Airi; Tompa, Framk Wm. "PAT Expressions: An Algebra for Text Search." Acta Linguistica Hungarica 41/1-4 (1992-1993) 277-306 (with 25 references). Authors' affiliation: [Salminen]Department of Computer Science, University of Jyväskylä; [Tompa] Department of Computer Science, University of Waterloo.
Summary: Text search operations are used to locate and retrieve needed information from some text collection. In traditional information retrieval, text search is a means for identifying relevant documents. By specifying selection criteria for the text of a document, the reader can choose a subset of a given set of documents. If the text collection is defined not as a set of documents, but more generally as a structure containing some parts, then text search involves the specification of those parts of interest to the reader.
The structure of the documents may be determined by the search system, by the author, by the text installer, or by the reader. In the PAT (TM) system, text search operations are expressions that efficiently combine traditional search capabilities with some new, powerful facilities. PAT contains means for lexical search, proximity search, contextual search and Boolean search. It also contains more rare operation types, including position and frequency search. Furthermore, a novel feature in PAT is the capability by which a reader can define structures for a text and use these structures in subsequent operations. One of the goals of this paper is to introduce the powerful search capabilities of PAT expressions.
Text search is usually considered so simple that only a rough description of the operations is given. For example, when word search is discussed, we are seldom told what is meant by a 'word'. The reader has to find out through experimentation how many words are contained in the strings 'Jean-Marie' and 'O'Hara'. However, a careless description of search operations may lead to search errors or unnecessarily long retrieval sessions. A second goal of the paper, therefore, is to introduce a mechanism for precise specification of text search semantics.
Text search using PAT is typically simple and straightforward. However, because of the powerful definition capabilities included in PAT, explaining and understanding the semantics of some operations may be difficult. As a side-effect of our systematic specification of PAT, we have identified some features of PAT expressions that cause problems and thus would benefit from further development. From this we see that precise specification also serves as a means for evaluation and offers a means for comparing text search systems. As is common in information retrieval systems, a PAT search is applied to indexed text. Indexing is usually described from the point of view of implementation, for example, by giving an algorithm for the indexing. However, since the way text is indexed affects search behaviour, our systematic approach to precise description must include mechanisms that accommodate indexing definition capabilities." [adapted from the Introduction]
The authors describe the query capabilities of the PAT system, dividing PAT expressions into six classes, and supplying a discussion of the syntax and semantics for each class. PAT indexing can be specified by productions as a view of PAT text.
Available on the Internet: ; [or mirror copy].
[CR: 19970106]
Sampson, Craig R. "SASOUT: A Context Based Table Model." Pages 235-264 in SGML '96 Conference Proceedings. Celebrating a Decade of SGML. SGML '96 Conference, Boston, MA, November 18-21, 1996. Sponsored by The Graphic Communications Association (GCA). [Edited by] Conference Co-Chairs: B. Tommie Usdin and Deborah A. Lapeyre. Alexandria, VA: GCA, 1996. Extent: 711 pages. Author's affiliation: SAS Institute Inc., SAS Campus Drive, Cary, North Carolina, USA 27513; Tel: (919) 677-8000 x-7417; FAX: (919) 677-4444; Email: sascrs@unx.sas.com; WWW: http://www.sas.com.
Abstract: "The SASOUT table model was developed to support the tabular documentation needs of the Publications Division of SAS Institute Inc. SASOUT instances contain sufficient meta information to allow them to be presented in both hard and soft copy. The meta data also makes possible non-traditional and interactive online presentations of the tabular data.
In 1995, research on tables produced by SAS software and on the tables previously used in our documentation resulted in our identification of four table types: simple, intersection, drill-down, and show-all. Imaging these tables on paper, as in the past, presented no significant problems even with SGML source data. However, we anticipated problems presenting our tables in soft copy after experimenting with the capabilities of the CALS table model, which was supported by our SGML software tools.
The CALS model does not support markup for indicating relationships between cells in a table nor directly support row header formatting. These relationships are not critical for producing hard copy, but are very important to our interactive online presentations. Header formatting is important for both hard copy and online presentations from a single source.
The SASOUT table model was developed to provide a means of marking up our tabular data while preserving its characteristics. The markup supports row headers and cell relationships in addition to all CALS features, such as column heads, spanning rows and columns, and alignment of data. The SASOUT model also supports behavior characteristics that allow the specification of online presentation methods.
This paper describes our table types, our platform presentation requirements, extensions we added to the CALS model, and the processing we designed to meet our formatting requirements so far.
The SASOUT DTD is freely available and we look forward to vendors providing support for it and other table DTD's that provide the means to fully identify tabular data."
The document is also available online in SGML format: see the download instructions from Craig Sampson, which contain the associated GCAPAPER DTD. URLs for the paper are: ftp://ftp.sas.com/incoming/sasout.tar.Z, (UNIX tar compressed) or ftp://ftp.sas.com/incoming/sasout.zip (.ZIP format); [UNIX format mirror copy] and [ .ZIP format mirror copy]. The SASOUT table DTD has been made available publicly by Craig Sampson on the Usenet News forum comp.text.sgml (CTS): see the local document. A related presentation describing the implementation of SGML by the Publications Division of SAS Institute was given at SGML '96 by Leonard P Olszewski, "Modular DTD Development and Maintenance at SAS Institute: Implementing an Efficient SGML System Using Software Engineering Principles."
Note: The above presentation was part of the "SGML User" track at SGML '96. The SGML '96 Conference Proceedings volume containing the full text of the paper may be obtained from GCA.
[CR: 19961226]
Samuels, Eloise. "Case Study: Key Learnings from Converting Complex Technical Documents to SGML." Pages 57-64 in SGML '96 Conference Proceedings. Celebrating a Decade of SGML. SGML '96 Conference, Boston, MA, November 18-21, 1996. Sponsored by The Graphic Communications Association (GCA). [Edited by] Conference Co-Chairs: B. Tommie Usdin and Deborah A. Lapeyre. Alexandria, VA: GCA, 1996. Extent: 711 pages. Author's affiliation: Senior Information Design Manager, Bellcore, 6 Corporate Place, PYA-1N158, NJ 08854, USA; Tel: 908-699-6853; FAX: 908-336-2605; Email: ews@rangers.lso.bellcore.com.
Abstract: "Unlocking the benefits of information in your documents may mean that you invest more than human resources, money and equipment. What about the pre-process planning time? Investment in Standard Generalized Markup Language (SGML) does not guarantee you immediate return and does not happen at the drop of a hat without a major management investment in pre-planning.
Because most corporations information management's top goal is to produce a much richer information environment, we must make a management commitment to a document analysis process. The process should identify what information in these documents is important enough to migrate to a rich electronic format such as SGML. It may seem obvious that the way to maximize your information is to break it into intelligible chunks in a data base. However, to get those chunks of information into a format that is acceptable by most applications is not a simple process. When that is complete, next comes the targeted conversion by document type.
For Learning Support the goal was to establish an information database that yielded benefits in the area of:
- document creation,
- document updating and revising,,
- database review and validation,
- information reuse, and,
- on-line full-text retrieval and distribution of information.,
The objective was to convert annually some 300,000 pages of technical documents containing complex tables and graphics from several different authoring environments into an industry standard Document Type Definition (DTD), called the Telecommunications Industry Markup (TIM DTD).
This industry standard format, Telecommunication's Industry Markup Document Type Definition (TIMDTD) is an explicit and neutral form of markup. The BCCs, in conjunction with the Telecommunications Industry Forum consisting of representatives from telecommunication vendors, such as Ericson, Siemens, AT&T, and Northern Telecom have unanimously endorsed it as their standard list of SGML markup tag definitions.
This paper identifies key learnings grasped from project management of the SGML Implementation Plan the Learning Support organization at Bellcore. Key outcomes determined were:
- Document analysis was critical to the success of the [project]
- The DTD writer's interpretation of the data and its structure required an iterative process with document developers and users. DTDs will change.
- It was important for acceptance to maintain the document developers view of the textual layout and format of the data while enforcing structure.
- Management's buy-in was needed at all points in the process
- Not everyone will be on board the train at the same time."
For more information on the TIM DTD as part of the TCIF/IPI (Telecommunications Industry Forum Information Products Interchange) standard, see the main entry in the SGML/XML Web Page.
Note: The above presentation was part of the "SGML Newcomer" track at SGML '96. The SGML '96 Conference Proceedings volume containing the full text of the paper may be obtained from GCA.
[CR: 19951113]
Sandoval, Victor. SGML: un outil pour la gestion électronique de documents. Techniques de l'information. Paris: Hermes, 1994. Extent: 174 pages, bibliographie, index. ISBN: 2866014405.
[CR: 19971008]
Schatz, Bruce; Mischo, William H.; Cole, Timothy W.; Hardin, Joseph B; Bishop, Ann P. "Federating Diverse Collections of Scientific Literature." IEEE Computer 29/5 (May 1996) 28-36 (with 12 references). ISSN: . Authors' affiliation: Grainger Engineering Library and Information Center, Illinois University, Urbana, IL, USA.
"Abstract: The Digital Library Initiative (DLI) project at the University of Illinois at Urbana-Champaign is developing the information infrastructure to effectively search technical documents on the Internet. The authors are constructing a large testbed of scientific literature, evaluating its effectiveness under significant use, and researching enhanced search technology. They are building repositories (organized collections) of indexed multiple-source collections and federating (merging and mapping) them by searching the material via multiple views of a single virtual collection. Developing widely usable Web technology is also a key goal. Improving Web search beyond full-text retrieval will require using document structure in the short term and document semantics in the long term. Their testbed efforts concentrate on journal articles from the scientific literature, with structure specified by the Standard Generalized Markup Language (SGML). Research efforts extract semantics from documents using the scalable technology of concept spaces based on context frequency. They then merge these efforts with traditional library indexing to provide a single Internet interface to indexes of multiple repositories."
Available online in HTML format: http://computer.org/computer/dli/r500280/r50028.htm; [archive copy, text only].
Scheller, Angela. "Document Standards: Availability and Products." Computer Networks and ISDN Systems 16/1-2 (September 1988) 138-142. ISSN: 0169-7552. CODEN: CNISE9.
Abstract: With the growth in the spread of computer networks the demand by users for document interchange features is becoming increasingly apparent. The prerequirement for the realization of document interchange in a heterogeneous computer environment are internationally accepted standards for the description of documents. Already in early 1986, the Standard Generalized Markup Language SGML was published as an international standard for the structuring of documents. The publication of the Office Document Architecture ODA is expected in the course of 1988. The final text is already available. ODA was originally developed for the pure office environment, whereas the concept for SGML addressed the author/publisher environment. This fact is mirrored in the current pilot projects testing the standards: the manufacturers of office and word-processing systems mainly work with ODA, whereas in the technical scientific and publishing sectors SGML is often implemented. Users requiring an interface both to the office sector as well as to the publishing sector will therefore be confronted with the problems related to working with two different, only partially compatible standards.
Scheller, Angela. "Experience with SGML in the Real World: DAPHNE, a System Integrating Computer Graphics Metafiles into SGML Documents." In Document Exchange: The Use of SGML in the UK Academic and Research Community. Workshop Proceedings 5-7 March 1990. Edited by Anne Mumford. Advisory Group on Computer Graphics, 1990.
Abstract: DAPHNE is a document processing system implemented to support joint editing within the German Research Network DFN. It is based on two international standards in the area of document and graphics processing, the Standard Generalized Markup Language SGML and the Computer Graphics Metafile CGM. This paper presents the functionality offered by DAPHNE today as well as plans for future extensions. It also describes the experience gained with a distributed environment of commercial products for processing SGML documents in general and DAPHNE documents in particular.
Schettini, Stephen; Alschuler, Liora. "SGML is Here to Stay. Coding Documents with Standard Generalized Markup Language Lets You Manipulate and Format Text in Limitless Ways." Publish ? (June 1994) 71-78.
[CR: 19970212]
Schietekat, Raf. "DSSSL: The Promise FOSI Did Not Fulfill." In: Proceedings of the 3rd Annual Conference on the Practical Use of SGML. "A Decade of Power." Third Annual [Belux] Conference on the Practical Use of SGML. Business Faculty, Sint-Lendriksborre 6, Brussels, Belgium. October 31, 1996. Sponsored by SGML Belux (Belgian-Luxembourg Chapter of the International SGML Users' Group). Leuven, Belgium: Belux, 1996. Author's affiliation: Fotek NV, Entrepotstraat 3, B-9100 Sint-Niklaas, Belgium. Email: raf@fotek.com.
Abstract: "SGML (Standard Generalized Markup Language) is designed to encode information at the content level, abstracting away from formatting issues. In a well-designed SGML application, font details are not part of the SGML document, and contents may be rearranged or automatically generated. In this light, for professional purposes HTML (HyperText Markup Language) is better considered to be a presentation language for contents that have been stored separately (e.g. in a dedicated SGML environment) than a reliable repository data storage format in itself. This now probably well understood, at least in the SGML community. HTML is tied to a particular, very limited DTD (even ignoring the reality of emerging Microsoft and Netscape dialects), and requires independently specified semantics (i.e. how the Web browser has to interpret the HTML tags)."
"In general, SGML documents will be typeset using informal descriptions of the style semantics for the various elements that occur in a document instance, which requires good communication between the document publisher and the document typesetter. Once a system is set up to process a particular kind of SGML input, that style specification is generally not portable.
"The new ISO standard DSSSL (Document Style Semantics and Specification Language) aims to become the standard way of linking up SGML information containers with their graphical representation, as part of a suite of complementary ISO standards: SGML, HyTime, SPDL (Standard Page Description Language, based in part on the PostScript language from Adobe). This paper will elaborate on the details of how DSSSL achieves this, how it fits into a complete document production process, how detailed the specification is and what it leaves open, what other
functionality is available, and why it was worth waiting for."
Available online in HTML format: "DSSSL, the promise FOSI did not fulfill", by Raf Schietekat; [mirror copy]. For further information on the conference, see: (1) the description in the conference announcement and call for papers, and (2) the full program listing, or (3) the main conference entry in the SGML/XML Web Page.
[CR: 19971125]
Schiller, Jörg. "SGML and Development Documentation." Page(s) 159-160 in SGML '97 Conference Proceedings. SGML Europe '97. "The Next Decade - Pushing the Envelope." Princesa Sofia Intercontinental, Barcelona, Spain. 11-15 May, 1997. Sponsored by Graphic Communications Association (GCA) and SGML Open. Conference Chair: Pamela L. Gennusa (Director, Database Publishing Systems Ltd). Alexandria, VA: Graphic Communications Association (GCA), 1997. Extent: 342 pages, CDROM. Author's affiliation: Project Manager, debis Systemhaus GEI, Ulm, Germany' Email: jschiller@gei-ulm.daimler-benz.com.
Abstract: "Increasing requirements in development documentation for automotive manufacturers such as the number of world-wide development sites (leading to the support of different languages); the speed of the development cycle; and the number of variants of products (re-use of base documentation), has led to the definition of exchange formats for information. This paper examines how SGML technology can be a good solution for problems in this area." [from the published program]
"The development of ECU (electronic control unit) for cars is a highly parallel and complex job. Requirements from different parts of an automotive manufacturer have to be fulfilled. The interfaces between correlated persons are not defined in a way, that an exchange of information is done easily. Business process reengineering activities discovered a big potential for enhancements by using SGML technology as an overall exchange format in these areas.
"Several projects were started, to implement a new process model. This article describes our experiences in projects we realized since the beginning of 1995. The biggest project deals with diagnosis data that is needed to describe parameters to communicate with ECUs. Today you can get many informations about the actual state of internal and external variables of ECUs (for example a coolant temperature). These informations are used to guide a diagnosis process to determine erroneous behaviour of components of a car. The diagnosis data is used in different parts of the company (development, production, service). Even companies that deliver ECUs can be involved in this process. We started a case study to determine the best format for the description of structures. As a result we decided to take SGML. The Document Type Structure (DTD) is presented to the ASAM/ASAP consortium for standardization. This consortium represents the German automotive industry, suppliers and tool companies.
"The system is now in use by 10 to 15 users and will grow in 1997 to 30 to 50 users. There is a process of standardization of diagnosis data in the moment in a consortium called ASAM/ASAP. Our DTD is a proposal to that committee and we think it will be fixed till the end of 1997. Our experience with the technologie SGML are quite good. We can transport the concept very easily to the users.
Note: The electronic conference proceedings in hypertext were produced by Inso Corporation (DynaText) and by High Text (EnLIGHTeN). Information about the SGML Europe '97 Conference may be found in the main database entry.
[CR: 19971125]
Schmitt-Rennekamp, Walter. "Digital Documentation Trends for Aircraft Maintenance." Page(s) 153-154 in SGML '97 Conference Proceedings. SGML Europe '97. "The Next Decade - Pushing the Envelope." Princesa Sofia Intercontinental, Barcelona, Spain. 11-15 May, 1997. Sponsored by Graphic Communications Association (GCA) and SGML Open. Conference Chair: Pamela L. Gennusa (Director, Database Publishing Systems Ltd). Alexandria, VA: Graphic Communications Association (GCA), 1997. Extent: 342 pages, CDROM. Author's affiliation: Senior Consultant, Aircraft Maintenance and Engineering Documentation, Lufthansa Systems GmbH, Hamburg, Germany.
Abstract: "The aviation industry has a long tradition for information interchange standardization. The first generation of on-line documentation was a paper document duplicate based on SGML. In the future, documentation has to move from the document paradigm to an information paradigm. Then the user will get an 'Information Web' and exactly the information he is looking for. This presentation looks at the challenges and trends in aircraft maintenance documentation."
"In aircraft maintenance and operations documentation structure and form are well defined by the ATA SPEC 100 specification. It was a good foundation bringing that documentation into electronic form using SGML. SGML is today the foundation for ATA SPEC 2100, the aviation standard for electronic document interchange. [...] Tagged information at a well defined granularity makes incremental revisions easy. Taking advantage of the progress in electronic networks, an on-line document update will be possible and leads to totally new worksharing concepts between aircraft manufacturers product support organization and airline engineering."
Note: The electronic conference proceedings in hypertext were produced by Inso Corporation (DynaText) and by High Text (EnLIGHTeN). Information about the SGML Europe '97 Conference may be found in the main database entry.
[CR: 19961108]
Schopen, Michael (Dr. med). "Die logische Struktur der ICD-10 (Systematik) und ihre Beschreibung mit SGML [The Logical Structure of ICD-10 (Tabular List) and its Description with SGML]." Informatik, Biometrie und Epidemiologie in Medizin und Biologie 26/2 (1995) 121-133 (with bibliography and 4 pages of figures. Author's affiliation: Deutsches Institut für medizinische Dokumentation und Information (DIMDI) [German Institute for Medical Documentation and Information].
"Zusammenfassung: Das Deutsche Institut für medizinische Dokumentation und Information, DIMDI, ist beauftragt, die amtliche deutschsprachige Ausgabe der ICD-10 herauszugeben. Ausgehend von den Anforderungen an die maschinenlesbare Version der ICD-10 (Systematik) wird die Standard Generalized Markup Language (SGML) vorgestellt als ein Formalismus, mit dem die logische Struktur der Klassifikation beschrieben werden kann. Der im DIMDI für die ICD-10 verfolgte Ansatz - Reduktion auf die logische Struktur und Verzicht auf Layoutinformation - macht die maschinenlesbare Fassung unabhängig von spezifischer Hard- und Software und hät sie offen für unterschiedlichste Anwendungen. Die vorliegende Arbeit beschreibt in SGML die Grobstruktur der ICD-10-Kapitel sowie den Aufbau der Krankheitsklassen und einiger ihrer Elemente. An einem Beispiel wird gezeigt, wie die SGML- Daten für spezielle Anwendungen der Klassifikation restrukturiert werden können.
Abstract: The German Institute for Medical Documentation and Information, DIMDI, has been authorized to publish the official German language edition of ICD-10. [Internationale Klassifikation der Krankheiten = International Classification of Diseases, ICD.] Based on the requirements for a machine-readable version of ICD-10 (Tabular List), SGML - the Standard Generalized Markup Language - is introduced as a formalism to describe the logical structure of this classification. By specifying the mere logical structure and abandoning layout information, DIMDI's concept for ICD-10 makes the machine-readable version independent of any hardware or software and keeps it open to a broad range of applications. This paper uses SGML to describe the structure of ICD-10 - the chapters, the disease categories and some of their elements. An example is given how to rearrange SGML data for specific applications of the classification.
Die Arbeit erläutert den Ansatz des DIMDI, die ICD-10-Daten SGML-basiert zu bearbeiten.
Available in Postscript format on the Internet: ftp://193.174.240.221/pub/klassi/icdsgml.zip. Mirror copies: original from DIMDI; alternate - edited Postscript that worked locally. For more information, see the database entry for DIMDI.
[CR: 19961210]
Schouten, Han. "Documents in Databases." SGML Users' Group Newsletter 15 (January 1990) 8-11. ISSN: 0952-8008. Author's affiliation: Research Center for Technical and Physical Engineering in Agriculture (TFDL), The Netherlands.
The article explains "why documents should be stored in databases rather than in sequential files." [Because:] "Database technology provides us direct access to facts stored in a database. Here too the application-independent logical structure of information determines how we can get access to and process stored facts. The verification of manipulating such information according to its logical structure is, unless explicitly prescribed, not sequence-specific. Therefore, the storage of documents in databases seems to be the correct answer to our requirements of interaction with respect to document processing in the office environment." [extracted]
This article should be read in conjunction with a second article by Han Schouten, "Draft Tender Re: 'Documents in Databases'", also in number 15 of SGML Users' Group Newsletter.
[CR: 19961210]
Schouten, Han. "Draft Tender Re: 'Documents in Databases'." SGML Users' Group Newsletter 15 (January 1990) 12-14. ISSN: 0952-8008. Author's affiliation: Research Center for Technical and Physical Engineering in Agriculture (TFDL), The Netherlands.
A major draft proposal for SGML DSIG sponsored development of a prototype document processing environment in which documents are stored as databases. The environment would support SGML, but also other SGML-related standards like DSSSL -- "as an alternative for the sequential access strategy characteristic of standard SGML." Details on the objectives, tasks, funding, deliverables, rights and duties of participants, project management, (etc.) are described. Proposed tasks include specification of a gross system architecture, definition of modelling techniques, building and verifying semantic equivalence of all models with SGML and DSSSL, facilities for loading SGML DTDs, facilities to unload DTDs without loss of information, creation of a DTD editor, creation of a structured document editor, building of retrieval facilities, building a document formatter.
This document, as a draft tender, is to be read in conjunction with the companion article in issue 15 of the SGML Users' Group Newsletter, "Documents in Databases," also by Han Schouten.
[CR: 19961210]
Schouten, Han. "Meeting of the [SGML] Database SIG." SGML Users' Group Newsletter 15 (January 1990) 11-12. ISSN: 0952-8008. Author's affiliation: Ministry of Agriculture and Fisheries, Research Center for Technical and Physical Engineering in Agriculture (TFDL), Expert Center for Information Technology (ECIT), the Netherlands.
The article is a report on the meeting of the SGML Database SIG on October 26, 1989 at Alphen aan de Rijn, Netherlands. Presentations included: (1) Han Schouten, "The Storage of Documents in Databases at the Ministry of Agriculture and Fisheries"; (2) François Chahuneau -- experiences with the implementation of a document database for production of the Journal of the EEC, in nine languages, with an emphasis upon support for version management; (3) Ian Williams' presentation "An architecture for hypertext object management" -- this presentation focused on GUIDE, IDEX and OWL's hypermedia products in relation to SGML. OWL is researching SGML applications for information retrieval, object indexing and maintenance of database links; (4) Other meeting participants included Lou Burnard, Frank Dros, Harry Gaylord, Jurgen de Jonghe, Jan Maasdam, Hans Mabelis, Jon Maslin, Koen Mulder, and Gert van der Steen.
[CR: 19961210]
Schouten, Han. "SGML*CASE: The Storage of Documents in Databases." SGML Users' Group Bulletin 4/1 (1989) 1-14 (with 5 references). ISSN: 0269-2538. Author's affiliation: Ministry of Agriculture and Fisheries, Research Center for Technical and Physical Engineering in Agriculture (TFDL), Expert Center for Information Technology (ECIT); POB 356, Mansholtlaan 12, 6700 AJ Wagenhingen, The Netherlands. TEL: +31-8370-19143; FAX: +31-8370-11312.
Abstact: "Despite recent achievements in text editing, desktop publishing, and the hypermedia approach toward information processing, the developments in document processing remain in arrears when compared to data processing. This is highly remarkable, since today 99 percent of all information is still archived as documents on paper."
"Here we analyse the possible causes for this apparent backlog in document processing and the damage it inflicts on office automation. Hitherto the logical structure, the layout, and the presentation of documents have often been insufficiently distinguished. Documents are typically stored and accessed as sequential files. These characteristics strongly remind us of most data processing environments of about twenty years ago. Then, file structures were mainly application-dependent and files could only be processed in batch, because the possibilities for accessing their contents directly were absent. The information systems of those days featured all the bad qualities that most document processing systems feature today; many types of conversion from one application-dependent form to another, loss of information with these conversions, and the practical impossibility of managing stored information as a corporate resource. Conversely, many document processing applications such as document editing, hypermedia applications, and the integrated processing of data and text also require direct access to individual elements of stored documents."
"The logical consequence seems, therefore, to be to devise some application- and device-independent, directly accessible, storage facility for documents and to stimulate developments similar to those that caused data processing to become the success it is today. Building on the results of our analysis, we have made an attempt to store documents in a database and, consequently, have direct access to their structure and contents, maintain information integrity and optimally integrate data and text. A conceptual schema for the storage of documents is proposed here. The obvious advantages of the model are discussed, as well as the topics which remain to be investigated."
See the main entry for the SGML Database Special Interest Group (SGML DSIG/DBSIG) for further information. Note: The volume editor for SGML Users' Group Bulletin 4/1 is David W. Penfold (Edgerton Publishing Services, Huddersfield, UK).
[CR: 19970312]
Schouten, Han. "A Utility for the Combined Use of SGML and Ventura ®." SGML Users' Group Bulletin 3/2 (1988) 27-36 (with 4 appendices). ISSN: 0269-2538. Author's affiliation: TFDL/ECIT [Ministry of Agriculture and Fisheries, Research Center for Technical and Physical Engineering in Agriculture (TFDL), Expert Center for Information Technology (ECIT), The Netherlands].
The author explains a strategy used at the Dutch Ministry of Agriculture and Fisheries for converting SGML documents into Ventura documents for printing. The appendices contain examples of the SGML source code, the conversion scripts, and the corresponding representations in Ventura format.
Note: The volume editor for SGML Users' Group Bulletin 3/2 is Anders Berglund (ISO Central Secretariat, 1 Rue de Varambé, CH-1211 Geneva 20, Switzerland).
[CR: 19971125]
Schreier, Richard A. "Supporting SGML in Document Management Systems." Page(s) 95-101 in SGML '97 Conference Proceedings. SGML Europe '97. "The Next Decade - Pushing the Envelope." Princesa Sofia Intercontinental, Barcelona, Spain. 11-15 May, 1997. Sponsored by Graphic Communications Association (GCA) and SGML Open. Conference Chair: Pamela L. Gennusa (Director, Database Publishing Systems Ltd). Alexandria, VA: Graphic Communications Association (GCA), 1997. Extent: 342 pages, CDROM. Author's affiliation: Director of Professional Services, Microstar Software Ltd, Nepean, Ontario, Canada; Email: ras@microstar.com.
Abstract: "Most Document Management System architectures can be categorized by the ability to handle and organize information of different kinds. Supporting information based on the Standard Generalized Markup Language (SGML) involves unique requirements that bear on the tasks of managing structured documents."
"This report overviews approaches to support SGML documents in a number of Document Management System architectures that were candidates to be used in an actual publishing system supporting the publishing and re-purposing of shared information for technical manuals. This publishing system supports content- and presentation-oriented SGML documents for a supplier of military equipment to a Canadian Department of National Defence (DND) Project Office."
This paper was originally prepared under a slightly different title by G. Ken Holman (Crane Softwrights Ltd.), formerly the Chief Technology Officer of Microstar Software Ltd. Slides for the related paper are among the collection of slide show presentations from Microstar.
Note: The electronic conference proceedings in hypertext were produced by Inso Corporation (DynaText) and by High Text (EnLIGHTeN). Information about the SGML Europe '97 Conference may be found in the main database entry.
[CR: 19980413]
Schroeder, Bethany. "HL7 Focuses on XML in New Orleans." XML Files: The XML Magazine
Issue 04 (March 17, 1998) 14.
A brief note on the January 1998 meeting of HL7's SGML/XML SIG, with updated on DICOM, KONA, and other health industry standards efforts.
Available online
[CR: 19971206]
Schroff, Thomas; Brüggemann-Klein, Anne. "Grammar-Compatible Stylesheets." Pages 51-88 (with 11 references) in Principles of Document Processing. Proceedings of the Third International Workshop. PODP '96, Third International Workshop. Palo Alto, California. September 23, 1996.. Edited by Charles Nicholas (Department of Computer Science and Electrical Engineering, UMBC, Baltimore, MD) and Derick Wood (Department of Computer Science, HKUST, Clear Water Bay, Kowloon, HONG KONG). Lecture notes in artificial intelligence. Lecture notes in computer science, 1293. Berlin / London: Springer-Verlag, 1997. ISBN: 354063620X. Authors' affiliation: [Schroff]: Technische Universität München; [Brüggemann-Klein]: Technische Universität München.
Abstract: "Stylesheets have been used to convert the document type of SGML documents. With a stylesheet, a document conforming to a source grammar can be transformed into a document conforming to a target grammar. The paper discusses the following problem: given a stylesheet, a source and a target SGML grammar, is it decidable whether or not all documents conforming to the source grammar are transformed into documents conforming to the target grammar? Using context free extended context free grammars we give a decision procedure for this problem."
Seaman, David "Campus Publishing in Standardized Electronic Formats -- HTML and TEI." Pages xxx-xxx in Filling the Pipeline and Paying the Piper: Proceedings of the Fourth Symposium [November 5-7, 1994, the Washington Vista Hotel, Washington, DC]. Edited by Anne Okerson, Symposium co-sponsored by the Association of Research Libraries and the Association of American University Presses in collaboration with the University of Virginia Library, the Johns Hopkins University Press, and the American Physical Society. Washington, D.C.: Association of Research Libraries, Office of Scientific & Academic Publishing, 1995. ISBN: 0918006252. Author's affiliation: David Seaman is the Director of the University of Virginia Library's Electronic Text Center.
"Introduction: In the past year, HyperText Markup Language (HTML) has done more to popularize the notion of Standard Generalized Markup Language than any single preceding use of SGML. Used on the World Wide Web through a graphical client such as Netscape or NCSA Mosaic, HTML documents and their associated image, sound, and digital video files result in sophisticated network publications and services. And even when viewed through the plain text (VT100) client Lynx, HTML files can still be exciting clusters of interlinked documents.
In common with Internet users all over the world, the University of Virginia Library now uses and produces HTML documents; unlike most other academic institutions, however, we came to HTML with practical experience in another, more sophisticated, form of SGML -- that of the Text Encoding Initiative Guidelines. For two years the Electronic Text Center has been using the TEI Guidelines, through several drafts, to tag and distribute hundreds of electronic texts. The purpose of this paper is both to explain how we are using these various forms of SGML mark-up to publish a variety of documents, and to sound a cautionary note about the wholesale use of HTML as a primary authoring language."
An online version ia also available at URL in HTML format, and in (only partially-linked) mirror copy here (May 1995). An abstract of the paper by Mary Mallery is available here.
[CR: 19971018]
Seaman, David. "The Electronic Archive of Early American Fiction (1775-1850)." Page 150 in ACH-ALLC '97. The 1997 Joint International Conference of the Association for Computers and the Humanities and the Association for Literary and Linguistic Computing. Conference Abstracts. ACH-ALLC '97. Queen's University at Kingston, Ontario, Canada. June 3 - 7, 1997. Compiled by Greg Lessard and Michael Levison. Ontario, Canada: Queen's University, 1997. ISBN: 0-88911-760-8. Author's affiliation: University of Virginia, Email: .
[Extract:] "This 125,000-page project takes the University of Virginia Library into a level of archival-quality text and image production rarely seen in rare books archives. In preparing for this project we have tackled issues of funding, production-level digital equipment and practices, partnerships with commercial publishers to disseminate the results, and large-scale storage issues. This paper will outline the project, explain the workflow, equipment, and text and image standards that we think appropriate for creating data of long-term viability, and explore the lessons we are learning (and expect to learn) regarding the economics of undertaking a cost-recovery process. The project will combine high-quality color page images of all 125,000 pages (including covers and spines) with TEI-encoded text versions, allowing scholars all over the world a rare sense of the physical reality of the volumes being studied as well as providing a fully-searchable SGML database."
Abstract available online in HTML format: "The Electronic Archive of Early American Fiction (1775-1850)", by David Seaman; [archive copy]. See the Early American Fiction Home Page, or the main SGML/XML Web Page database entry for The Electronic Archive of Early American Fiction (UVA).
Additional information on the ACH-ALLC '97 Conference is available in the SGML/XML Web Page main conference entry, or [August 1997] via the Queen's University WWW server.
[CR: 19961217]
Seaman, David. "The Electronic Text Center and On-line Archive of Electronic Texts." Pages 55-57 in Elektronisches Publizieren und Bibliotheken, die Herausforderung neuer Partnerschaften [Electronic Publishing and Libraries, The Challenges of New Partnerships]. [Conference] 'Elektronisches Publizieren und Bibliotheken'. Bielefeld, Germany. February 5-7, 1996. Frankfurt am Main, Germany: Vittorio Klostermann, 1996. Author's affiliation: University of Virginia, Electronic Text Center.
Abstract: "The Electronic Text Center is both a physical space within the university library, open to all University of Virginia members and also an on-line collection of many thousands of Internet-accessible texts and images. It is important to us that we perform two tasks simultaneously in order to build our digital library: we are both creating a set of electronic resources, and also creating a user community for it, by training our users to become effective consumers and producers of electronic texts and images. Since 1992, the Etext Center has made available hardware and software for the creation and analysis of electronic texts; it provides training for these new tools and techniques; it acts as a focal point for HTML and SGML development in the humanities at Virginia; and it provides a place in which to use those texts that are not yet accessible on the Internet."
The document is available online: in HTML format; [mirror copy]. See also the main entry for the UVA Electronic Text Center.
Seaman, David M. "From Margin to Mainstream: Creating a Broad-Based Humanities Computing User Community at the University of Virginia." Pages 213-214 [partial abstract] in Colloque International "Consensus ex Machina?". Abstracts International Joint Conference of the ALLC (Association for Linguistic and Literary Computing) and ACH (Association for Computers and the Humanities), Sorbonne, Paris, 19-23 avril 1994. Paris: Laboratoire "Lexicométrie et textes politiques" (INaLF, CNRS), and Ecole Normale Supérieure de Fontenay - Saint Cloud, 1994. 244 pages. Author Affiliation: University of Virginia.
[CR: 19950716]
Seaman, David. "Gate-Keeping A Garden of Etext Delights: Electronic Texts and the Humanities at the University of Virginia Library." Pages 63-67 in Gateways, Gatekeepers, and Roles in the Information Omniverse: Proceedings of the Third Symposium. Third [ARL] Symposium, Washington Vista Hotel, Washington, DC, November 13-15.1933. edited by Ann Okerson and Dru Mogge. Washington, D.C.: ssociation of Research Libraries, 1994. Authors' affiliation: University of Virginia.
The paper discusses the use of SGML by the Electronic Text Center. "All the electronic texts are encoded with Standard Generalized Markup Language (SGML). The large-scale electronic text databases -- the OED, the Chadwyck-Healey items - come fully marked up, and increasingly we are seeing producers of individual titles (such as Oxford University Press) also offering them in SGML form. The SGML markup not only means that texts can be added together in conglomerations but also that the data, with all its structural and typographic information, is not inherently wedded to a piece of software. It is, in a real sense, data that will outlive the software we currently use to explore and present it."
Available online from the UVA WWW server.
Seaman, David M. "'A Library and Apparatus of Every Kind': The Electronic Text Center at the University of Virginia." Information Technology and Libraries 13/1 (March 1994) 15-19. 1 reference. Author affiliation: Coordinator of Electronic Texts, University of Virginia Library, Charlottesville, VA.
Abstract: The Electronic Text Center at the University of Virginia combines an online archive of thousands of SGML-encoded electronic texts, all available through a single piece of search software, with a library-based center housing hardware and software suitable for the creation and analysis of text. Through ongoing training sessions and support of individual teaching and research projects, the Center is now building a diverse and expanding user community locally, and providing a potential model for similar enterprises at other institutions.
[CR: 19950716]
Seaman, David. "The University of Virginia's Electronic Texe Center: An Interview with David Seaman." Virginia Librarian 39/2 (April/May/June) 6-10 (with sidebar: "Standard Generalized Markup Language"). Author's affiliation: David Seaman is Director of the Electronic Text Center, Alderman Library, University of Virginia, Charlottesville, Virginia.
"We are also concerned to maintain our on-line data in a standard tagged format-known as SGML, or Standard Generalized Mark-up Language-that will ensure that the electronic texts, with all their typographic, spacial, and structural instructions, will outlive the software we currently use to search and display them. . .The texts in our on-line collection are marked up with SGML tags that use letters and phrases within angled brackets to convey such information as structural divisions-title page, main body of text, scene, stanza, page, paragraph, etc. and typographical elements- changes in typeface, special characters, etc. . ."
Available online: http://www.lib.virginia.edu/etext/articles/VirgLib/virglib.html from the UVA WWW server.
[CR: 19971024]
Seaman, David. "The User Community as Responsibility and Resource: Building a Sustainable Digital Library." D-Lib Magazine ( ). ISSN: 1082-9873. Author's affiliation: Electronic Text Center, University of Virginia.
Summary: "Since opening as a full-time service in 1992, the Electronic Text Center at the University of Virginia Library has pursued twin missions with equal seriousness of purpose: (1) to create an on-line archive of SGML texts; (2) to build a community of humanists adept at the creation and use of online full-text resources. . . this article will focus on the integral place that our user community has in shaping the work of our library-based Etext Center."
See the Web site for The Electronic Text Center.
The article is available online in HTML format; local archive copy. Note that the July/August 1997 double issue of D-Lib Magazine (Amy Friedlander, editor) contains several articles referencing the use of SGML encoding in digital library research.
[CR: 19971201]
Selber, Stuart A. "First Commentary. The OHCO Model of Text: Merits and Concerns." Journal of Computer Documentation 21/3 (August 1997) 26-31 (with 21 references). ISSN: 0731-1001. Author's affiliation: Technical Communication and Rhetoric Program, Department of English, Box 43091, Texas Tech University, Lubbock, Texas 79409-3091; Email: selber@ttu.edu; WWW: http://english.ttu.edu/faculty/selber/vitae.html.
Abstract: "The author discusses the ordered hierarchy of content object (OHCO) model for text representation on the computer. [I have a concern about the OHCO model of text...] Although the model has explanatory power computationally, the way it defines what a text is, what a writer is, and what a reader is may serve to diminish, in potentially damaging ways, what is involved in the processes and practices of technical communication. But before discussing his concerns with the OHCO model of text, the author considers some of its merits, because he would not dismiss the model as invaluable to students and professionals. At times he found the model quite compelling, particularly in its focus on how text can be both productively and unproductively represented in online information space. [In fact, I plan on including this article in a graduate-level course I teach in technology and discourse.]"
The article is a response (commentary) on the publication of DeRose (et al), "What is Text, Really?" reprinted from Journal of Computing in Higher Education 1/2 (Winter 1990) 3-26.
This article appeared with four others in a special issue of JCD which focused upon 'the OHCO model of text [ordered hierarchy of content objects]'. The Journal of Computer Documentation (JCD) is a quarterly publication of the Association for Computing Machinery, Special Interest Group on Systems Documentation [SIGDOC], published by the Association for Computing Machinery. Editor in Chief: Tony R. Girill, Lawrence Livermore National Laboratory and University of California.
[CR: 19960716]
Sengupta, Arijit. "Demand More from Your SGML Database! Bringing SQL Under the SGML Limelight." <TAG>: The SGML Newsletter 9/4 (April 1996) 1-7, with 11 references. ISSN: 1067-9197. Authors' affiliation: PhD candidate at Indiana University, Department of Computer Science.
"Abstract: Have you ever been frustrated by how inadequate SGML databases are in terms of searching or querying your documents? With the current state of the art, you will easily be able to search for a word, phrase, or keywords in the whole document. Some systems let you perform approximate searches or regular expression searches. Even fewer systems let you search for keywords or phrases in certain SGML regions. However, there is much more information already in SGML documents that one can utilize cleverly to design a proper SGML database system. The current trend of modeling SGML documents with object-oriented and object-relational databases has certainly brought SGML closer to a complex object database model, but much research and development remains to be done in this area. This article introduces the popular relational database query language SQL (Structured Query Language) and its applicability in the SGML domain.The capability of this query language to express complex queries with a not-so-complex syntax gives relational databases that support SQL an advantage over other similar systems. The ability to use SQL or an SQL-like query language with SGML has the potential of giving much more power to SGML repositories. This article shows how we can pose complex document-related questions easily with SQL. SQL-capable systems will let you solve problems that would otherwise seem impossible, or at least, tedious."
The author believes that SQL ought to be implemented more completely in SGML systems, as it supplies a widely accepted and powerful language for expressing queries -- many of which are difficult to express in current SGML systems.
Available online in postscript format; [mirror copy]
Sengupta, Arijit. Design and Implementation of a Database Environment for the Manipulation of Structured Documents 1993.. Extent: 30 references.
"Abstract: A method for implementing a structured document database system is presented. The present-day systems dealing with structured or tagged documents have not been able to produce capabilities that even simple database systems possess - the ability to query the database based on the various properties of the database. Research in this area also has not been able to produce query languages and visual query interfaces similar to those that exist in the relational domain. The goal for the present research is to develop a complete database system for structured documents having data definition, manipulation and querying capabilities similar to those in the relational world. Only structured documents tagged with the SGML [13] have been considered, in which detailed and complete information about the document structuring can be obtained from the Document Type Definition (DTD). Special systems that have been considered, used and evaluated are PAT (Open Text 5.0) [22], sgmls 1.1, Exodus [28], Shore [6] for purposes of data structures, parsing, data storage and retrieval, etc. Special considerations have been given to three special cases of data for experimentation purposes: (a) the Oxford English Dictionary (OED) database, (b) the Chadwyck Healy English Poetry full-text database, and (c) an experimental movie database." [from the online text]
Available online at URL http://www.cs.indiana.edu/hyplan/asengupt/thesis/oral/oral.html. [further details are requested from the author]
[CR: 19961226]
Sengupta, Arijit. "Standardizing the Querying Process with SGML The SQL DTD." Pages 323-338 in SGML '96 Conference Proceedings. Celebrating a Decade of SGML. SGML '96 Conference, Boston, MA, November 18-21, 1996. Sponsored by The Graphic Communications Association (GCA). [Edited by] Conference Co-Chairs: B. Tommie Usdin and Deborah A. Lapeyre. Alexandria, VA: GCA, 1996. Extent: 711 pages. Author's affiliation: Indiana University, Computer Science Department, Lindley Hall 215, Bloomington, IN 47405, USA; Tel: 812 855 4318; Fax: 812 855 4829; Email: asengupt@indiana.edu; WWW: http://www.cs.indiana.edu/hyplan/asengupt.html.
Abstract: "One of the most exciting applications of SGML which has emerged in the recent years is its use in document databases. The structural information embedded in SGML documents makes it possible to query SGML documents and extract information in an automatic manner; however, this querying process has not been standardized. As a result, different SGML database implementations use their own query language syntax, thus making the migration from one system to another a difficult process. In the relational database domains, however, the query language SQL has been a standard for over ten years and is universally used in most relational database systems. Although originally designed for relational databases, SQL is quite powerful for specifying complex queries in a relatively easy-to-understand syntax. With a small set of extensions to take advantage of the hierarchical structure of SGML, SQL can be easily adapted for use with SGML document databases (TAG-496).
The powerful 'generalized' nature of SGML makes it easy to implement SQL as an SGML DTD, so that queries can be expressed as document instances of the SQL DTD. Current SGML authors and users can write queries expressed in this DTD without learning a different language or using a separate editor. Moreover, because of the portable nature of SGML, these queries can be used in any SGML database system and can be converted to regular SQL for use in a relational or Object-Relational/Object-Oriented database system, if necessary. Databases that support the SQL DTD can also store the queries without any extra effort, and subsequently query them for inferring optimization parameters.
This paper presents a representative DTD for the SQL query language, with extensions for use with hierarchically structured documents. It also compares this language with languages proposed and implemented, including SDQL - the query language in the DSSSL standard (DSSSL95). This paper explains the advantages of using this language as a query language in document database systems and the necessity for standardizing the querying process in document databases. Finally, it discusses some implementation issues and complexity measures."
Note: The above presentation was part of the "SGML Expert" track at SGML '96. The SGML '96 Conference Proceedings volume containing the full text of the paper may be obtained from GCA.
Available in postscript format, or SGML format; [mirror copy, postscript]
[CR: 19970817]
Sengupta, Arijit; Dillon, Andrew. "Extending SGML to Accommodate Database Functions: A Methodological Overview." Pages 629-637 (with 27 references) in Structured Information/Standards for Document Architectures. Edited by Elisabeth Logan and Marvin Pollard. = Journal of the American Society for Information Science, Special Issue. Volume 48, Number 7 (July 1997). New York: John Wiley & Sons Inc., 1997. ISSN: 0002-8231. Authors' affiliation: [Sengupta:] Computer Science, Indiana University, Bloomington, IN; Email: asengupt@indiana.edu, WWW: http://www.cs.indiana.edu/hyplan/asengupt.html; [Dillon:] School of Library and Information Science, Indiana University, Bloomington, IN; Email: adillon@indiana.edu, WWW: http://www-slis.lib.indiana.edu/adillon/adillon.html.
Abstract: "A method for augmenting an SGML document repository with database functionality is presented. SGML (ISO 8879,1986) has been widely accepted as a standard language for writing text with added structural information that gives the text greater applicability. Recently there has been a trend to use this structural information as meta-data in databases. The complex structure of documents, however, makes it difficult to directly map the structural information in documents to database structures. In particular, the flat nature of relational databases makes it extremely difficult to model documents that are inherently hierarchical in nature. Consequently, documents are modeled in object-oriented databases (Abiteboul, Cluet, & Milo, 1993), and object-relational databases (Hoist, 1995), in which SGML documents are mapped into the corresponding database models and are later reconstructed as necessary. However, this mapping strategy is not natural and can potentially cause loss of information in the original SGML documents. Moreover, interfaces for building queries for current document databases are mostly built on form-based query techniques and do not use the 'look and feel' of the documents. This article introduces an implementation method for a complex-object modeling technique specifically for SGML documents and describes interface techniques tailored for text databases. Some of the concepts for a Structured Document Database Management System (SDDBMS) specifically designed for SIL documents are described. A small survey of some current products is also presented to demonstrate the need for such a system."
A Postscript version of the article is available online (also, online abstract); [local archive copy].
See the main document entry for the complete list of articles and contributors, as well as other bibliographic information.
[CR: 19970627]
Sengupta, Arijit Dillon, Andrew. Query By Templates: A Generalized Approach for Visual Query Formulation for Text Dominated Databases. Technical Report. To appear in the Proceedings of the Conference on Advanced Digital Libraries (ADL'97), Library of Congress, Washington, D.C. May 7-9 1997. []: [], May 1997. Extent: approximately 13 pages.
"Abstract: With the advent of the World Wide Web (WWW), the concept of document databases is becoming more popular. This makes the idea of a globally distributed digital document library realizable. The standard encoding format for the WWW is HTML (HyperText Markup Language), which embeds some structural information in otherwise text-dominated documents. HTML can be viewed as a special instance of SGML (Standard Generalized Markup Language), a very powerful document encoding language capable of describing may different types of languages and formats. The current work is based on designing query languages, processing and visualizing mechanisms for structured documents in general, and SGML documents in particular. We are using the World Wide Web as a platform for this querying mechanism, especially because of its popularity and world-wide availability. However, because of the wide range of users, these systems need to be easy to use. In particular, it is important that users can easily search for information from the database without prior knowledge of the internal structure of the database. This paper outlines a visual query constructing technique for application in databases containing hierarchically structured documents. In this paper, we describe the visual component of this query language, which is essentially a generalization of the Query By Example (QBE) language for relational databases. We call this method ``Query By Templates(QBT)''. Further, we describe the basic properties and usefulness this visual query technique, and show how queries on structured document databases can be performed using this method. We also describe an implementation of QBT on the Web using the Java{TM} programming language."
Available online in postscript format; [mirror copy].
[CR: 19951122]
Severson, Eric. The Art of SGML Conversion: Eating Your Vegetables and Enjoying Dessert. Avalanche Development Corporation/Interleaf, January 1995. 34K (computer file), ca. 15 pages. Author's affiliation: Executive Vice President, Avalanche Development Corporation [Interleaf]; email: eric@avalanche.com; Tel: (303) 449-5032.
"SGML conversions have a reputation for being worthwhile but not necessarily lots of fun. Much like the problem of having to eat your vegetables before you get dessert.
"SGML conversion typically involves building a bridge between the world of hardcopy and word processing documents (where logical structure is perceived visually by the reader) and "intelligent" documents (where logical structure is explicitly encoded). The whole point of SGML conversions is that they necessarily involve information enrichment, adding more than was originally there.
"This white paper explores the issues involved in moving to SGML and offers advice for making the process as effective and painless as possible. It demonstrates how the steps in the SGML conversion process are directly related to the benefits you get once conversion is complete."
Available on the Internet from the Interleaf/Avalanche WWW server: "The Art of SGML Conversion" [mirror copy November 1995]. Apparently also to be available as an SGML Open White Paper, #4001-II.
Severson, Eric. How SGML and HTML Really Fit Together: A Case for the A Scalable HTML Avalanche Development Corporation/Interleaf, January 1995. 24K (computer file), ca. 8 pages. Author's affiliation: Avalanche Development Corporation/Interleaf; email: eric@avalanche.com; Tel: (303) 449-5032.
**Note: Version 2 (April 1995) is available from this WWW server.
This (white) paper was distributed on Newswire, and is available as item 143.1995-01-09 in the Newswire archives, or here. Discussion of the paper took place on the sgml-internet discussion list.
[CR: 19971227]
Severson, Eric. "The Proper Role of SGML and XML in an Enterprise I/T and Intranet Strategy." Pages 513-518 in SGML/XML '97 Conference Proceedings. SGML/XML '97. "SGML is Alive, Growing, Evolving!" The Washington Sheraton Hotel, Washington, D.C., USA. December 7 - 12, 1997. Sponsored by the Graphic Communications Association (GCA) and Co-sponsored by SGML Open. Conference Chairs: Tommie Usdin (Chair, Mulberry Technologies), Debbie Lapeyre (Co-Chair, Mulberry Technologies); Michael Sperberg-McQueen (Co-Chair, University of Illinois). Alexandria, VA: Graphic Communications Association (GCA), 1997. Extent: 691 pages, CDROM; print volume contains author and title indexes, keyword and acronym lists. Author's affiliation: [Eric Severson]: IBM Global Services.
Abstract: "Up to now SGML has tended to be used primarily in technical publishing applications, usually at a departmental level. However, with today's focus on web-based enterprise information management, and the recent introduction of XML, many more opportunities for SGML have become apparent. This whitepaper surveys the current state of the information industry, from both a business and technical point of view, and shows how SGML and XML technology can and should be positioned within an organization's overall I/T and intranet strategy."
This paper was delivered as part of the "Business Management" track in the SGML/XML '97 Conference.
Note: The SGML/XML '97 conference proceedings volume is available from the Graphic Communications Association, 100 Daingerfield Road, Alexandria, VA 22314-2888; Tel: +1 (703) 519-8160; FAX: +1 (703) 548-2867. Complete information about the conference (e.g., program listing, tutorials, show guide, DTDs, conference reports) is provided in the dedicated conference section of the SGML/XML Web Page and via the GCA Web server. The electronic proceedings on CDROM was produced courtesy of Jouve Data Management (Jouve PubUser).
[CR: 19960904]
Severson, Eric; Bingham, Harvey (editors). Table Interoperability: Issues for the CALS Table Model. SGML Open Technical Research Paper 9501:1995. Coraopolis, PA: SGML Open, November 21 1995. Extent: approximately 25 pages. Authors' affiliation: [Eric Severson] Co-chair, Table Interchange Subcommittee, SGML Open; [Harvey Bingham] Co-Chair, Table Interchange Subcommittee, SGML Open; also: Interleaf.
"Abstract: To help address the existing interoperability issues when using tabular material ("tables") in SGML implementations, SGML Open's Technical Committee formed a Table Interchange subcommittee to research these issues.
"Because the CALS table model has proliferated widely, it was chosen as the initial starting point. Although it has evolved to the point of a de facto standard, the specification leaves a large number of semantics open to interpretation which in turn has made interoperability difficult to achieve. As its first major task, the Committee therefore set out to identify and document ambiguities in the CALS table model specifications, identify and document related interoperability issues between SGML Open vendor products, and lay the groundwork for developing a proposed clarification of the standard that will minimize ambiguity and maximize interoperability."
"This paper summarizes the results of this initial work, identifies the sources of current interoperability issues for the CALS model, and summarizes the most common set of practices currently followed by SGML Open vendors."
Available in HTML format: SGML Open - TRP 9501:1995 - "TABLE INTEROPERABILITY: Issues for the CALS Table Model" [mirror copy, December 28, 1995]. Also available from the FTP server at Exoterica Corporation in compressed Postscript format ftp://ftp.exoterica.com/sgmlopen/9501/9501.ps.Z, [mirror copy] or in other formats (files: 9501pack.tar.Z, 9501pack.zip, 9501ps.zip). Document revisions: Technical Research Paper 9501:1995; Committee Draft: 1995 May 10; Committee Draft: 1995 August 5; Final Draft Technical Research Paper: 1995 September 15; Final Technical Research Paper: 1995 November 21.
[CR: 1995]
Sévigny, Martin. Conception et réalisation d'une interface-utilisateurs pour l'interrogation de bases de documents structurés. . Travail dirigé présenté à la Faculté des études supérieures de l'Université de Montréal. Quebec: Faculté des études supérieures de l'Université de Montréal,, 1996. Advisor: Yves Marcoux. Affiliation: École de bibliothéconomie et des sciences de l'information (EBSI), de l'Université de Montréal, Québec, Canada. .
See the summary for the thesis in French: Conception et réalisation d'une interface-utilisateurs pour l'interrogation de bases de documents structurés. Travail dirigé présenté à la Faculté des études supérieures de l'Université de Montréal, par Martin Sévigny. Pour l'obtention de la Maîtrise en bibliothéconomie et en sciences de l'information de l'EBSI.
[CR: 19970531]
Sévigny, Martin; Marcoux, Yves. "Conception et réalisation d'une interface-utilisateurs pour l'interrogation de bases de documents structurés [The Creation and Evaluation of a Human-computer Interface for Information Retrieval in Structured-document Bases", in French]." Revue canadienne des sciences de l'information et de bibliothéconomie [Canadian Journal of Information and Library Science] 21/3-4 (September-December 1996) 59-77 (with 23 references). ISSN: . Authors' affiliation: École de bibliothéconomie et des sciences de l'information (EBSI), de l'Université de Montréal, Québec, Canada. WWW [http://www3.sympatico.ca/msevigny/]: ; WWW [Marcoux]: http://tornade.ere.umontreal.ca/~marcoux/.
Abstract: "The creation of electronic information in the form of structured documents is steadily gaining popularity. It is thus necessary to develop information retrieval tools fitted to this type of document. In this article, we present the results of a research project aimed at identifying human-computer interface elements that can support information retrieval in structured document bases. The research included a review of the literature and of existing systems, as well as the design, development, and user testing of a prototype information retrieval system for SGML (ISO 8879) document bases. We make five recommendations for the design of structured-retrieval systems."
See the summary for the thesis in French: Conception et réalisation d'une interface-utilisateurs pour l'interrogation de bases de documents structurés. Travail dirigé présenté à la Faculté des études supérieures de l'Université de Montréal, par Martin Sévigny. Pour l'obtention de la Maîtrise en bibliothéconomie et en sciences de l'information de l'EBSI.
[CR: 19960313]
Seybold, Jonathan. "Remembering Yuri Rubinsky (1952-1996) [In memoriam: Yuri Rubinsky]." Seybold Report on Publishing Systems 25/11 February 29, 1996 20. ISSN: 0736-7260.
The full-page article includes a photograph of Yuri Rubinsky. He was familiar to Seybold readers mostly in connection with role as cofounder of SoftQuad, Inc. The article summarizes the highlights of Yuri's career in the publishing industry, including annotation of his major publications. It also includes eulogy from several friends and colleagues (Jonathan Seybold, Charles Goldfarb, Tim Berners-Lee, Chet Ensign, Tim Bray, Tommie Usdin, Elaine Brennan, and Pam Gennusa. For other memorial tributes to Yuri, see the collection elsewhere in this database.
[CR: 19971008]
[Seybold Publications Staff]. "Inso Adds Math to DynaWeb. IEEE Uses it to Go Live with Online Digital Library." Seybold Report on Internet Publishing 2/2 (October 1997) 28. ISSN: 1090-4808.
"Inso recently beefed up the math support in DynaWeb, its SGML-to-Web publishing system, by enabling Web browsers to display mathematical equations stored in DVI (TEX) format."
Representing and rendering math has always been a special challenge. The DynaWeb technology described in the article is featured in the IEEE (Institute of Electrical and Electronics Engineers, Inc.) Computer Society Digital Library (CSDL). The online database introduction explains: "For those interested in the publication technology, we have created a database of SGML files and linked images. These files are converted and displayed as HTML on the fly. This allows subscribers to manipulate and view the content -- including math -- with standard web browsers without any helper applications or plug-ins." According to the Seybold article, the online IEEE collection is an "SGML-encoded text library [which] offers the equivalent of 35,000 periodical pages and more than 250,000 images. With the new [DynaBase] 3.1 software, the Web database automatically converts DVI math equations into GIF images on the fly as articles are served to visitors...DynaWeb is the first commercial product to generate GIFs from DVI math equations on the fly."
Note: DynaWeb has been chosen by SOFTBANK NetForums as part of a dynamic Web
publishing solution. See the press release, [archive copy].
[CR: 19971008]
[Seybold Publications Staff]. "Microsoft, Inso, ArborText Propose Style Sheet Language for XML." Seybold Report on Internet Publishing 2/2 (October 1997) 19. ISSN: 1090-4808.
The article describes the proposal for a style sheet language XSL (Extensible Style Language), comparing it to CSS and DSSSL. See the database entry for Extensible Stylesheet Language (XSL) for further description.
[CR: 19970329]
[Seybold Publications Staff]. "Netscape Replies on XML [editorial]." Seybold Report on Internet Publishing 1/5 (January 1997) 2. ISSN: 1090-4808.
The article summarizes a response from Netscape Communications Corporation to the Seybold offices regarding Netscape's disposition on XML (Extensible Markup Language). Netscape clarifies that it is not currently [ca. January 1997] working on SGML or XML for two reasons: (a) customers have spoken in favor of HTML over SGML, so Netscape believes these customers are better served by improving HTML functionality than by adopting XML; (b) XML can theoretically be supported indirectly via XML-compliant Netscape plugins: "...it is possible and quite straightforward to incorporate SGML-based layout engines into the Netscape Navigator and Netscape Communicator environments using inline plug-ins or Java. This makes it possible for the 40 million users of Netscape Navigator to access XML information today."
According to the article, Alex Edelstein, Netscape's Group Product Manager, also expressed the opinion that the need to deal with user-defined tags and styles (as XML does) could be met via JavaScript: "...JavaScript is rapidly being accepted as a great means to provide user-defined flexibility. User-defined variables can be created dynamically and passed between client and server in this way."
Editorially, the article expresses doubt as to whether Netscape is "up to speed on publishing issues," at least, with respect to Seybold's readership: "that product managers at Netscape think HTML somehow will work as a way to define all the Web documents that we need, or that SGML's main benefit is better layout, illustrates just how far behind Netscape will be should Microsoft decide to leverage its expertise in SGML and XML."
See more on the Extensible Markup Language in the SGML/XML Web Page main entry.
[CR: 19971121]
[Seybold Publications Staff]. "Seybold San Francisco '97: PDF and XML Emerge. [Alternate title: 'Shaping the Future: PDF, XML and the Men of the Hour, Gates and Jobs'." Seybold Report on Publishing Systems 27/5 (November 17, 1997) 1, 3-38. ISSN: 0736-7260.
"By most calculations, the two areas of sharpest focus at Seybold San Francisco 97 were PDF, which increasingly is moving into a role as the format for production workflow, and XML, which is picking up support as a standard for tagging documents intended for use in multiple-media environments. Sections in this issue of Seybold Report on Publishing Systems covering SGML/XML include: "Asset Management, SGML and Database Publishing" (pages 29-38), beginning with: "The boundaries between asset management and document management are starting to blur. So are the boundaries between SGML publishing tools and database-publishing tools." The section "Publishing with SGML" (pages 33-36) provides an update on ArborText's Willow and XML support, Chrystal's Astoria 3.0, Datalogics' Documentum-to-Frame solution, I4I S4-Desktop, PIT's Target 2000, and XyVision's support for FrameMaker+SGML.
[CR: 19971120]
[Seybold Publications Staff]. "XML, Collaborative Tools Shine at Seybold San Francisco '97 [alt. title: 'XML, Content Management Take Center Stage at SSF '97']." Seybold Report on Internet Publishing 2/3 (November 1997) 1, 3-19. ISSN: 1090-4808.
Abstract: This "Special Report" feature article in Seybold Report on Internet Publishing describes the rapidly-changing world of Internet tools and standards that bear on Web publishing, and particularly, the role of XML within the W3C's suite of Internet recommendations. "Seybold San Francisco '97 will be remembered as the first major conference and trade show where XML entered the mainstream vocabulary. It was the buzz of the conference and a draw on the show floor. The demo of XML support in Internet Explorer 4 was one of the highlights of Bill Gates's keynote address on Wednesday."
[CR: 19971120]
[Seybold Publications Staff]. "XML Comes into the Limelight." Seybold Report on Internet Publishing 2/3 (November 1997) 4-5. ISSN: 1090-4808.
Summary: The article describes the growing support for XML, as evidenced in the SSF presentations by Bill Gates (Microsoft), John Warnock (Adobe), and John Gage (Sun Microsystems). Bill Gates is is quoted as saying "XML is important because you won't be able to afford to author for all of the screen form factors and interface techniques." Steve DeRose (chief scientist at Inso) is quoted as saying that the 'quiet revolution' (SGML now emerging in XML) "is no longer quiet, but boisterous, productive, and growing at Web speed."
This is a subsumed article in the longer feature article of the Special Report in this issue: "XML, Collaborative Tools Shine at Seybold San Francisco '97."
[CR: 19961213]
[Seybold Staff]. "W3C Publishes Draft of Simplified SGML. XML Allows User-definable Tags." Seybold Report on Publishing Systems 26/6 (November 30, 1996) 41. ISSN: 0736-7260.
"On the tenth anniversary of the adoption of SGML as an ISO standard, a band of SGML experts announced they have drafted a simplified subset of the language they hope will spur the use of SGML on the Internet. The new language, Extensible Markup Language, or XML, was prepared by a World Wide Web Consortium working group consisting of about 80 members, primarily representing vendors. The announcement was made at SGML '96, being held in Boston this week. The first published draft is available on the Web at http://www.w3.org/pub/WWW/TR/WD-xml-961114.html. XML, like SGML, is a meta-language for describing the markup of different types of documents. It is simpler than SGML, reducing a 500-page reference to 26 pages. Unlike HTML, which has a fixed (albeit changing) set of tags, XML lets you define your own tags and attributes." [extracted] See the main entry for XML in the SGML/XML Web Page for additional information.
[CR: 19961113]
Seybold Publications. Seybold San Francisco '96. Part III: Color Publishing, Page Composition and Hardware. Seybold Special Report, Volume 5, Number 4. Media, PA: Seybold Publications, October 28, 1996. ISSN: 1069-7217.
The Seybold San Francisco '96 Conference was held at the Moscone Convention Center, San Francisco, September 13-17, 1996. The Seybold Special Report Series (3 parts) covered the conference. Part III of the Seybold Special Report 5/4 includes a section entitled "Page Layout Software, SGML Systems and Other Aids to Publishing (pages 31-39). Featured SGML software systems include: (1) "Document management for ArborText Adept" [The 'Willow Initiative' which places software between the editor and the document manager for the purpose of managing small document 'objects']; (2) "Autographics tackles library automation"; (3) "I4I offers on-the-fly SGML concersion" [delivering SGML documents to users lacking SGML software]; (4) Microstar pursues 'Mainstream SGML'" [marketing initiative with Documentum, InfoAccess and Adobe to help SGML penetrate business environments by "making it simple for authors to create and maintain SGML-savvy documents"]; (5) "Passage Systems shows custom system" [PassageNet]; (6) Xyvision sees market in telecommunication" [TEDD and TIM DTDs]. In Parts I and II of the SSF '96 report, coverage is given to HTML/SGML products for Web publishing (SoftQuad HIP and HoTMetaL; Electronic Book Technologies' DynaText (Matterhorn), DynaBase, and DynaWeb 3.0); the other volume titles are: Part I: Overview of the Show and Publishing on the Internet, and Part II: Output Technology and Workflow Developments.
Seybold Publications. Seybold Seminars Boston '95 [March 28-31, 1995. Hynes Convention Center, Boston, MA]. Part I: Electronic Delivery, SGML Issues, Catalogs and Output. Seybold Special Report, Volume 3, Number 8. Media, PA: Seybold Publications, April 21, 1995. ISSN: 1069-7217.
SGML was a major theme at Seybold Seminars once again, and details are available in the two Special Report issues. Part II is less relevant, being dedicated to images and color (Volume 3, Number 9: Seybold Seminars Boston '95 [March 28-31, 1995. Hynes Convention Center, Boston, MA]. Part II: Managing Color; Image Input, Editing and Output; Page Makeup, Etc.). The issue title for Part I includes "SGML", which is becoming more popular in light of widespread acquaintance with HTML. The volume Table of Contents for Part I (much abbreviated) is: I. Introduction. Electronic Publishing: Moving Past Fear and Greed to Commercial Realities (3-7); II. Publishing on the Internet: Strategies and Tools (8-28); A. Tools for Creating Web Pages; B. Other Electronic Delivery Tools [EBT Deals with Phoenix]; III. New Tools for Managing and Writing SGML Documents (29-39); A. SGML-Based Document Management Tools; B. SGML Authoring Tools; IV. Catalog Production Systems (40-43); V. Imagesetters, Platesetters and Digital Presses (44-66).
The section "Tools for Creating Web Pages" includes the following major presentations: (1) InContext's Spider [Web authoring program based upon InContext 2 SGML editor]; (2) NaviSoft [HTML and Webs authoring]; (3) Archetype [HTML viewer supporting multiple views]; (4) SoftPress' Uniqorn; (5) SoftQuad HoTMetaL [support for HTML 3.0] and SoftQuad Panorama [SGML browsing over the Internet]; (6) Electronic Book Technologies [DynaBase, SGML to HTML conversion on the fly]; (7) Open Text [WWW Indexing].
The section "Other Electronic Delivery Tools" includes a significant story under the title "EBT Deals With Phoenix" (pages 21-23; see also the graphic of the virtual digital library on page 7). Phoenix Publishing Systems Inc., a spinoff company from Phoenix Technologies, which produces documentation for some 40% of PC shipped worldwide, has contracted with EBT [Electronic Book Technologies] to create online virtual "digital libraries" storing PC documentation. Virdox (the virtual documentation information system) supports advanced concepts in document versioning, including multilingual versions and multi-vendor versions. Other stories under "Other Electronic Delivery Tools" are: (1) "Frame Adds Olias [SGML browswe], drops R&D"; (2) "Ntergaid's HyperWriter 4.2" [with SGML import facilities]; (3) "Open Text Latitude for delivery, retrieval" [Release 5 of PAT used for managing the display of a broad range of text formats, incorporating Panorama for SGML display].
Under "SGML-Based Document Management Tools" this Seybold Special Report includes description, evaluation, and references for the following products: Auto-Graphics [Smart Editor]; Berger-Levrault/AIS [SGML/Store]; CTMG (or Active Systems) [ActiveSearch]; Documentum [Enterprise Document Management System]; EBT - Electronic Book Technologies [DynaBase]; Ferntree [Structured Information Manager]; Frame [Frame SGML Toolkit]; InfoDesign [WorkSmart]; IDI - Information Dimensions [Basis SGMLServer]; Interleaf [Relational Document Manager]; Odesta [LiveLink]; Texcel [Information Manager]; XSoft [Astoria]; XyVision [Parlance Document Manager].
The section "SGML Authoring Tools" includes reviews of four products in particular: (1) ArborText plans Internet Addition; (2) Frame improves style, attributes handling; (3) Microstar improves on [SGML Author for] Word; (4) WordPerfect prepares SGML Edition.
Seybold Publications. Seybold San Francisco '94 [September 13-16, 1994]. Part I: Electronic Document Delivery and Output Issues. Seybold Special Report, Volume 3, Number 2. Media, PA: Seybold Publications, October 10, 1994. ISSN: 1069-7217.
The abbreviated Table of Contents: Introduction: Publishing on the Net Sparks Industry Resurgence (1-7); Electronic Document Delivery (8-29); Internet and Online Publishing (10-15); Tools for Internet Publishing (16-20); Fonts for Electronic Documents (20-21); Delivering Documents Through Digital Media (22-27); Digital Ad Delivery: Ready to Move Ahead (27-29); Output Issues (30-67).
The issue contains an in-depth discussion of the implication of HTML for the advance of SGML. There is a short presentation "HTML: Becoming an SGML Application" (14-15). SGML tools for the Web are treated in discussion of three products: "EBT's DynaWeb Server" (16-17); "HaL Browser Shows SGML, HTML" (17); "IDI Adds Web Service to BasisPlus" (17-18). HTML authoring tools are treated in: "Tools for Making HTML" [Nice TagWizard; SoftQuad HoTMetaL; Free Tools] (20).
The section "Software for Delivering Document Collections" (24-27) includes discussion of SGML's role in sevral products: Bellcore's SuperBook [SGML import]; Folio support for SGML [SGML to flat-file conversion]; IBM's upgraded BookManager [migration to SGML support from underlying GML]; Sun Microsystems [replacing PostScript-based Answerbook documentation reader with SGML-based documentation using Electronic Book Technologies' DynaText and developer toolkit].
Seybold Publications. Seybold San Francisco '94 [September 13-16, 1994]. Part III: Composition, Font Issues, Platforms, SGML and Other Topics. Seybold Special Report, Volume 3, Number 4. Media, PA: Seybold Publications, October 31, 1994. ISSN: 1069-7217.
The abbreviated Table of Contents: Introduction: Text Composition, Page Layout, Font Issues, and Newspaper Systems (3); Composition Systems and Software (4-12); Newspapers and Magazines (13-20); Xtensions and Additions (21-25); Fonts: New Technology Keeps the Fires Burning (26-30)
SGML Coming Into the Mainstream (31-37); SGML Tools: Microsoft Into the Act (32-35); Other Authoring Tools (35-38); Other Document Conversion Services and Tools (37); The Great Platform Debate Continues (38-50).
The special coverage of SGML publishing tools (pages 31-37) includes a major discuss of Microsoft's SGML Author for Word, including companion products for Author. The companion products include Avalanche SureStyle [conversion of text with direct formatting into SGML constructs, additional processing of tables, cross-refrences, OLE embedding information] and SoftQuad Enactor [cleans up SGML errors in Author, including support for SGML constructs not implemented in Author]. Other SGML products reviewed include: SoftQuad Enabler [SGML support for Quark Express], SoftQuad Explorer [SGML browser], SoftQuad HoTMetaL [HTML editor], Arbor |