From: ftp://ftp.ifi.uio.no/pub/SGML/bibliography Date: 1999-10-13 STANDARD GENERALIZED MARKUP LANGUAGE ISO 8879:1986 (SGML) ANNOTATED BIBLIOGRAPHY AND LIST OF RESOURCES Version 2.0 Revised January 1992, (c) Robin Cover CONTENTS: 1. Introductory and General SGML Bibliography 2. SGML Manuals: Commentary and Indices for ISO 8879 3. SGML Applications and Implementations 4. ISO Standards Publications Germane to SGML 5. Serial Publications Dedicated to SGML 6. SGML Supporting Organizations and SIGs 7. SGML Electronic Discussion Forums and Online Support Centers 8. Entry-level SGML Parsers and Related SGML Software 9. Further Bibliography and Resource Guides for SGML 10. Postscript ==================================================== INTRODUCTORY AND GENERAL SGML BIBLIOGRAPHY ==================================================== Bibliographic items <6>, <10>, <13>, <24>, <29>, <39> and <42> in this section are marked with an asterisk ("*") to indicate their relative importance and as primary introductions to SGML. Other titles in section 1 represent general studies, but likewise may be reckoned as essential reading. <1> Adler, Sharon C. "DSSSL- Document Style Semantics and Specification Language. 1/8 (January 1989) 8-9. An overview of the standard by the editor of DSSSL. For brief description of the goals of DSSSL, see the entry below on this Draft International Standard (ISO/IEC DIS 10179), <69>. <2> Amsler, Robert A.; Tompa, Frank W. "An SGML-Based Standard for English Monolingual Dictionaries." In Fourth Annual Conference of the UW Centre for the New Oxford English Dictionary: Information in Text. Proceedings of the Conference. Conference held in Waterloo, Ontario, Canada, 26-28 October 1988. Pages 61-79. Waterloo, Ontario: University of Waterloo, 1988. The 'Dictionary Encoding Initiative' referenced is loosely affiliated with the international Text Encoding Initiative; both projects seek to epploy SGML. For SGML used in dictionary markup, see also <41> below. Several of the Waterloo Annual Conference volumes contain articles germane to descriptively- tagged and SGML-tagged text. For further details on the Waterloo Centre, see <19> below. <3> Ballanti, Anna; Cork, Deborah; Dam, Lex van; Jonghe, Jurgen de; Herwijnen, Eric van; Nijdam, Marco; Samarin, Alexandre; Shave, Tony. "Text Processing at CERN. Part 1: Overview." SGML Users' Group Bulletin 3/2 (1988) 39-54. <4> Barnard, David T.; Fraser, Cheryl A.; Logan, George M. "Generalized Markup for Literary Texts." Literary and Linguistic Computing 3/1 (1988) 26-31. Abstract: Encoding literary texts for analysis, electronic transmission, or publication requires the marking of various substantive, structural and formal features. The development of a standard comprehensive markup language for these purposes is a desideratum. This paper offers a set of requirements for such a language, reviews related work, and describes a newly-created standard based on the Standard Generalized Markup Language. <5> Barnard, David T.; Hayter, Ron; Karababa, Maria; Logan, George M.; McFadden, John. "SGML Based Markup for Literary Texts: Two Problems and Some Solutions." Computers and the Humanities 22/4 (1988) 265-276. ISSN: 0010-4817. (Revision of Technical Report 204, Queen's University Department of Computing and Information Science, 1988, ISSN 0836-0227). Abstract: There is wide agreement on the need for a markup standard for encoding literary texts. The Standard Generalized Markup Language (SGML) seems to provide the best basis for such a standard. But two problems inhibit the acceptance of SGML for this purpose. (1) Computer-assisted textual studies often require the maintenance of multiple views of a document's structure but SGML is not designed to accommodate such views. (2) An SGML-based standard would appear to entail the keyboarding of more markup than researchers are accustomed to, or are likely to accept. We discuss five ways of reducing the burden of markup. We conclude that the problem of maintaining multiple views can be surmounted, though with some difficulty, and that the markup required for an SGML-based standard can be reduced to a level comparable to that of other markup schemes currently in use. <6> *Barron, David. "Why Use SGML?" Electronic Publishing: Origination, Dissemination and Design (EPOdd) 2/1 (April 1989) 3-24. CODEN: EPODEU; ISSN 0894-3982. Abstract: The Standard Generalised Markup Language (SGML) is a recently-adopted International Standard (ISO 8879). The paper presents some background material on markup systems, gives a brief account of SGML, and attempts to clarify the precise nature and purpose of SGML, which are widely misunderstood. It then goes on to explore the reasons why SGML should (or should not) be used in preference to older-established systems. A summary of the article is also printed in "Why Use SGML," SGML Users' Group Newsletter 13 (August 1989) 10. <7> Bryan, Martin. "Creating Informative Document Models." SGML Users' Group Newsletter 20 (September 1991) 12-17. <8> Burnard, Lou. "What is SGML and How Does it Help?" Pp. 65-79 in Modelling Historical Data: Towards a Standard for Encoding and Exchanging Machine-Readable Texts, 1991. See volume information sub the editor, Daniel Greenstein, <21> below. A revised copy of the article in tagged electronic format is available from the TEI-L LISTSERVer (listserv@uicvm on BITNET) as EDW25 LDOC, October 1, 1991. <9> Chamberlin, Donald Dean; Goldfarb, Charles F. "Graphic Applications of the Standard Generalized Markup Language (SGML)." Computers and Graphics 11/4 (1987) 343-358. ISSN: 0097-8493. Abstract: The Standard Generalized Markup Language (SGML) is a language for representing document structure. This paper discusses ways in which the SGML language might be used to represent graphic as well as textual contents of a document. By using SGML markup for both graphics and text, a document processing application can achieve a more uniform treatment and tighter coupling between these two types of materials. <10> *Coombs, James; Renear, Allen; DeRose, Steven J. "Markup Systems and the Future of Scholarly Text Processing." CACM 30/11 (1987) 933-947. ISSN: 0001-0782. Cf. CACM 31/7 (July 1988) 810-811. Abstract: The authors argue that many word processing systems distract authors from their tasks of research and composition, toward concern with typographic and other tasks. Emphasis on "WYSIWYG", while helpful for display, has ignored a more fundamental concern: representing document structure. Four main types of markup are analyzed: Punctuational (spaces, punctuation,...), presentational (layout, font choice,...), procedural (formatting commands), and descriptive (mnemonic labels for document elements). Only some ancient manuscripts have no markup. Any form of markup can be formatted for display, but descriptive markup is privileged because it reflects the underlying structure. ISO SGML is a descriptive markup standard, but most benefits are available even before a standard is widely accepted. A descriptively marked-up document is not tied to formatting or printing capabilities. It is maintainable, for the typographic realization of any type of element can be changed in a single operation, with guaranteed consistency. It can be understood even with no markup formatting software: compare "
" to ".sk 3 a; .in +10 -10; .ls 0; .cp 2". It is relatively portable across views, applications and systems. Descriptive markup also minimizes cognitive demands: the author need only recall (or recognize in a menu) a mnemonic for the desired element, rather than also deciding how it is currently to appear, and recalling how to obtain that appearance. Most of this extra work is thrown away before final copy; descriptive markup allows authors to focus on authorship. (abstract supplied by Steve DeRose) <11> Cover, Robin; Duncan, Nicholas; Barnard, David. "The Progress of SGML (Standard Generalized Markup Language): Extracts from a Comprehensive Bibliography." Literary and Linguistic Computing 6/3 (1991) 200-212. ISSN: 0268-1145. The article includes introductory essay sections delineating the fundamental conceptions of SGML, its broad application, and the advantages it brings to academia, industry and government sectors. For a description of the complete bibliographic database itself, see further under section 9, item <133> below. <12> Cruz, Gil C.; Judd, Thomas J. "The Role of a Descriptive Markup Language in the Creation of Interactive Multimedia Documents for Customized Electronic Delivery. In Electronic Publishing '90: Proceedings of the International Conference on Electronic Publishing, Document Manipulation and Typography (Gaithersburg, Maryland, September 1990). Pages 249-262. The Cambridge Series on Electronic Publishing. Cambridge: Cambridge University Press, 1990. <13> *DeRose, Steven J.; Durand, David G.; Mylonas, Elli; Renear, Allen H. "What is Text, Really?" Journal of Computing in Higher Education 1/2 (Winter 1990) 3-26. ISSN: 1042-1726. Abstract: "The way in which text is represented on a computer affects the kinds of uses to which it can be put by its creator and by subsequent users. The electronic document model currently in use is impoverished and restrictive. The authors agree that text is best represented as an ordered hierarchy of content object[s] (OHCO), because that is what text really is. This model conforms with emerging standards such as SGML and contains within it advantages for the writer, publisher, and researcher. The authors then describe how the hierarchical model can allow future use and reuse of the document as a database, hypertext or network." <14> Ellison, Paul A. "SGML and Related Information Standards." Pp. 17-28 (1-12) in Document Exchange: The Use of SGML in the UK Academic and Research Community. Workshop Proceedings 5-7 March 1990 (see <28>below). Abstract: "This paper explains the position of four ISO 'standards' (only one agreed standard, one draft standard and two draft proposals) in the area of text and office information processing. Those standards are SGML (Standard Generalized Markup Language), the 'Fonts' standard (Font Architecture and Interchange Format), DSSSL (Document Style Semantics and Specification Language), and SPDL (Standard Page Description Language). . . In addition, the paper relates these standards to ODA (Office Document Architecture) and places SGML and ODA in their own contexts." <15> Fahmy, Eanass; Barnard, David T. "Adding Hypertext Links to an Archive of Documents." Canadian Journal of Information Science 15/3 (September 1990) 26-41. Abstract: Texts are characterized by various types of linkages, within themselves and with other documents, which may be either explicit or implicit. When texts are available in machine-readable form, the ability to trace linkages should become much easier, and more complex tracing of linkages should be possible. Hypertext is an electronic document paradigm whose distinguishing feature is machine support for the building and tracing of intra- and inter- document links; a document is viewed as a collection of nodes connected by directed links. A limitation of many hypertext systems is that all links must be created explicitly by the user. This is impractical in many situations, and it is unnecessary if the link structure is inherent in the documents themselves. The work described in our paper is motivated by the perceived need to extend the hypertext paradigm so that links can be derived from a collection of documents. We explore how a rich set of links connecting documents in a text archive can be programmatically generated, and present a set of link types that are useful, specifiable and computable. The documents in the archive are encoded using the Standard Generalized Markup Language, which views a document as a hierarchical organization of document elements. The archive, therefore, consists of a forest of document trees. <16> Gaspart, Jean-Pierre. "Use of the SGML Parser at the Office for Official Publications of the European Communities (OPOCE)." SGML Users' Group Bulletin 2/1 (1987) 29-36. <17> Gennusa, Pamela L. "Advantages of an SGML Implementation for Management of an Electronic Text Database." SGML Users' Group Bulletin 2/2 (1987) 73-86. <18> Goldfarb, Charles F. "A Generalized Approach to Document Markup." Proceedings of the ACM SIGPLAN SIGOA Symposium on Text Manipulation. = SIGPLAN Notices 16/6 (1981) 68-73. Conference proceedings containing this paper also available as SIGOA Newsletter 2/1-2 (Spring/Summer 1981). <19> Gonnet, Gaston. "Examples of PAT Applied to the Oxford English Dictionary." Technical Report OED-87-02. University of Waterloo Centre for the New Oxford English Dictionary. July, 1987. PAT and associated text processing tools are built around descriptively-marked text, even if not specifically SGML text. Compare also "PAT, GOEDEL, LECTOR and More: Text-dominated Database Software, " pp. 83-84 in: Tools for Humanists, 1989. A Guidebook to the Software and Hardware Fair Held in Conjunction with the Dynamic Text 6-9 June 1989 Toronto. Toronto, Ontario: Centre for Computing in the Humanities, 1989. The article describes several software tools developed at the Waterloo Centre, including TRUC (an editor for SGML or SGML-style tagged text). TRUC supports multiple views of a tagged document, based upon use of style-sheets. The University of Waterloo has pioneered several important research efforts in the study of machine-readable lexical databases, machine transduction and generation of descriptively marked-up electronic texts (SGML-style markup). The Centre has also developed software to search, interactively display and format text structured with descriptive markup. These tools were developed for the New Oxford English Dictionary Project with the long range goal of application to other texts. A Newsletter is issued by the Centre describing ongoing research, publications, software enhancements, work of visiting scholars, conferences and other events. Persons interested in the Centre's research and publications may write for a current document list (e.g., especially the several publications and technical reports by Darrell R. Raymond, Donna L. Berg, Gaston H. Gonnet, Timothy J. Benbow, Heather J. Fawcett, Rick Kazman, Frank Wm. Tompa, George V. J. Townsend. See <20>, <33> and <41> in this bibliography. Address: Electronic Text Research; Centre for the New Oxford English Dictionary; Davis Centre; University of Waterloo; Waterloo, Ontario; Canada N2L 3G1 TEL: (1 519) 885- 1211 extension 6183; Email (Internet):newoed@waterloo.edu. <20> Gonnet, Gaston; Tompa, Frank W. "Mind Your Grammar: A New Approach to Modelling Text." Technical Report OED-87-01. University of Waterloo Centre for the New Oxford English Dictionary. February, 1987. Abstract: Beginning to create the New Oxford English Dictionary database has resulted in the realization that databases for reference texts are unlike those for conventional enterprises. While the traditional approaches to database design and development are sound, the particular techniques used for commercial databases have been repeatedly found to be inappropriate for text-dominated databases, such as the New OED. In the same way that the relational model was developed based on experiences gained from earlier database approaches, the grammar-based model presented here builds on the traditional foundations of computer science, and particularly database theory and practice. This new model uses grammars as schemas and "parsed strings" as instances. Operators on the parsed strings are defined, resulting in a "p-string algebra" that can be used for manipulation and view definition. The model is representation-independent and the operators are non-navigational, so that efficient implementations may be developed for unknown future hardware and operating systems. Several approaches to storage structures and efficient processing algorithms for representative hardware configurations have been investigated. <21> Greenstein, Daniel I. (editor). Modelling Historical Data: Towards a Standard for Encoding and Exchanging Machine-Readable Texts. Halbgraue Reihe zur Historischen Fachinformatik, Serie A, Historische Quellenkunden (edited by Manfred Thaller). Band 11. Published for the Max-Planck-Institut für Geschiche, by Scripta Mercaturae Verlag (St. Katharinen), 1991. iv + 223 pages. ISBN: 3-928134-45-0. A collection of fourteen essays on various aspects of conceptual modelling and development of standardized encoding methods for representing knowledge in historical texts. The contributions are by Manfred Thaller, Lou Burnard, Daniel I. Greenstein, Hannes D. Galter, Ingo H. Kropač, Donald A. Spaeth, Hans Jørgen Marker, Thomas Werner, Jan Oldervoll, and Kevin Schurer. The essays reflect interaction with and critique of encoding methods which emerged from the TEI phase I efforts as documented in TEI-P1; see <52> below. <22> Guittet, Christian. "Appendix -- Introduction to SGML. Extract from FORMEX. Published by the EEC Office of Official Publications." SGML Users' Group Bulletin 1/1 (1986) 26-57. <23> Heath, Jim; Welsch, Larry. "Difficulties in Parsing SGML." In Proceedings of the ACM Conference on Document Processing Systems, Santa Fe (5-9 December 1988). Pages 71-77. New York: Association for Computing Machinery, 1988. See similarly, by the same authors, "Difficulties in Parsing: Suggestions to Improve SGML," 10 (July 1989) 8-10. <24> *Joloboff, Vania. "Document Representation: Concepts and Standards." In Structured Documents. Edited by Jacques André, Richard Furuta, and Vincent Quint. Cambridge Series on Electronic Publishing. Pages 75-105. Cambridge: Cambridge University Press, 1989. This article examines the problem of document representation in computer systems for printing, editing or interchange among heterogeneous systems. After a discussion of the various possibilities for defining documentation representation formalisms, it considers a number of standard representations typical of their class: page description languages, SGML, Interscript, ODA. Several other articles in the volume are of direct or marginal relevance to SGML as a metalanguage for document-structuring. <25> Macleod, Ian A. "Storage and Retrieval of Structured Documents." Information Processing and Management 26/2 (1990) 197-208. Abstract: There have been a number of important related activities which suggest the need for a new model for text. ISO standards for document description have been recently developed. These standards view documents as hierarchical objects and it is likely that languages such as SGML will become widely used in the near future for document markup. As structured documents become available, so there will be a need to evolve tools to take advantage of structural knowledge. The goal of the work described here is to develop such tools. A conceptual model for bibliographic data has been designed. The model is known as Maestro (Management Environment for Structured Text Retrieval and Organization). It supports structured documents and provides a query language to retrieve and link information contained in these structures. In this paper an overview of Maestro is presented together with an outline of the basic implementation. <26> Mamrak, Sandra A.; Kaelbling, Michael J.; Nicholas, C.K.; Share, M. "Chameleon: A System for Solving the Data-Translation Problem." IEEE Transactions on Software Engineering 15/9 (September 1989) 1090-1108. ISSN: 0098-5589. Abstract: "There is a need for widespread exchange of electronic documents in domains as diverse as book publishing, automated offices, factories, and research laboratories. The variety of data representations, and the subsequent need for data translation, is a major obstacle to this exchange. This paper describes a comprehensive data translation system with the following characteristics: 1) it is derived from a formal model of the translation task; 2) it supports the building of translation tools; 3) it supports the use of translation tools; and 4) it is accessible to its targeted end-users. A software architecture to achieve the translation capability is fully implemented. Translators have been generated using the architecture, both by the original software developers and by industrial associates who have installed the architecture at their own sites." Further note: A subset of the OSU Chameleon Project's more recent SGML translation tools is to be made freely available to the academic community in 1992. See <44> and "Free Data Translation Software," EPSIG News 4/3 (September 1991) 8-9; cf. "Integrated Chameleon Architecture Plans for 91-92 Year," SGML Users' Group Newsletter 19 (April 1991) 12-13. <27> Moline, Judi; Benigni, Dan; Baronas, Jean (eds). Proceedings of the Hypertext Standardization Workshop [January 16-18, 1990 National Institute of Standards and Technology, Gaithersburg, MD]. NIST Sepcial Publication 500-178, March, 1990. CODEN: NSPUE2. Several papers in this proceedings volume reference SGML, HyTime and SMDL as potentially valuable in creating hypertext/hypermedia standards. Reports from the workshop's Data Interchange Group and User Requirements Discussion Groups likewise identified SGML or SGML-like GIs as having probable priority in emerging standards formulations. <28> Mumford, Anne (editor). Document Exchange: The Use of SGML in the UK Academic and Research Community. Workshop Proceedings 5- 7 March 1990. Advisory Group on Computer Graphics, 1990. This proceedings volume contains several important contributions on SGML (submitted by Anne Mumford, Paul Ellison, Martin Bryan, Angella Scheller, David Duce and Ruth Kidd, Tim Niblett, Lou Burnard, John Larmouth, Paul Bacsich and Paul Lefrere, Malcolm Clark, and Kathleen Crennell). The volume is available from the organizer: Ann M. Mumford, Computer Centre, Loughborough University, Loughborough LE11 3TU, UNITED KINGDOM; TEL: 44 509 222312; FAX: 44 392 211603. See a full list of contributors and presentation-titles in "Document Exchange in UK Universities," SGML Users' Group Newsletter 17 (August 1990) 10. <29> *Naggum, Erik. "Answers to Frequently-Asked-Questions (FAQs) - for the UseNet Newsgroup comp.text.sgml." A draft version (0.0, 1991-12-15) is available via anonymous FTP as ftp.ifi.uio.no:SGML/FAQ.0.0 (ftp.ifi.uio.no has the IP address 129.240.88.1 as of December 1991). The latest version of the FAQ document may be fetched at any time from this public disk region, generously sponsored by The University of Oslo, Department of Informatics with oversight by Erik Naggum. The FAQ will also be found on servers which archive collections of FAQs. Suggestions for additional questions (or answers) to be included in the FAQ may be directed to the author: Erik Naggum; Naggum Software; Boks 1570, Vika; 0118 OSLO, NORWAY; Email: erik@naggum.no OR enag@ifi.uio.no on the Internet. <30> Price, Lynne A. "Graphic Representation of Content Models." 10 (July 1989) 12-16. The article demonstrates the use of tree structures and (more extensively) FSAs to represent SGML content models. FSAs are useful in revealing ambiguity (seemingly equivalent models). The article is derived from the author's tutorial session at the ACM Conference on Document Processing Systems, Santa Fe, New Mexico (5-9 December 1988). <31> Price, Lynne A. "Using SGML and TeX for User Documentation." In TEXniques No. 7: Proceedings, TeX User's Group 1988 Annual Meeting (21-24 August 1988, Montreal). Pages 203-210. TeX User's Group, 1988. Abstract: The Standard Generalized Markup Language (SGML), defined in International Standard (ISO) 8879, is a notation for representing documents and making their inherent structure explicit. The open-ended list of SGML applications includes document interchange, formatting or typesetting, loading databases for information retrieval, stylistic or linguistic analysis, and computer-aided translation. The combination of SGML and TeX is a natural one. This paper reviews the philosophy of SGML and then describes a particular environment where SGML and TeX are used together, giving specific examples of how processing is shared between the SGML application and TeX macros. <32> Price, Lynne A.; Schneider, Joe. "Evolution of an SGML Parser Generator." In Proceedings of the ACM Conference on Document Processing Systems, Santa Fe, 5-9 December 1988. Pages 51-60. New York: Association for Computing Machinery, 1988. Abstract: The Standard Generalized Markup Language (SGML) is a notation for describing classes of structured documents and for coding documents belonging to described classes. An advantage of SGML and other grammar-based document representations is the ability to perform multiple applications on a single document source file. This paper describes the evolution of a software development tool for implementing such applications. It explains the original design as well as enhancements made during the system's first eighteen months. Although not statistically significant, data on the use of the enhanced features are presented. The experience described is relevant to other software engineering tools for text processing. <33> Raymond, Darrell R. "Lector - An Interactive Formatter for Tagged Text." Technical Report OED-90-02. University of Waterloo Centre for the New Oxford English Dictionary and Text Research, August, 1990. 26 pages, 13 figures. Abstract: Lector is an X.11 aplication that provides highly interactive text formatting. Unlike text previewers, Lector handles descriptively marked-up text, supports multiple styles, and interacts well with other programs, including other invocations of Lector. Appropriate selection of texts and styles enables Lector to act as a text previewer, database browser, code prettyprinter, menu utility, and iconic interface. Lector's implementation revolves around a set of tradeoffs involving efficiency, simplicity and generality. The result demonstrated the utility of generalized text display tools. Note: for further details on the Waterloo Centre, see <19>. <34> Rubinsky, Yuri. "Standards for Hypertext Interchange." SGML Users' Group Newsletter 15 (January 1990) 14-15. For more on SGML applied to hypertext/hypermedia, see <109>, <72> and: (1) Yuri Rubinsky, "Standards for Hypertext Interchange Need Not Come out of Thin Air," 11 (October 1989) 4-5; (2) Yuri Rubinsky, "Comments on an SGML Application for Hyper- and Multi- Media Interchange: Informal Report from the GCA Hypertext/Hypermedia Standards Forum," 11 (October 1989) 5-6. <35> Scheller, Angela. "Document Standards: Availability and Products." Computer Networks and ISDN Systems 16/1-2 (September 1988) 138-142. CODEN: CNISE9. ISSN: 0169-7552. Abstract: With the growth in the spread of computer networks the demand by users for document interchange features is becoming increasingly apparent. The prerequirement for the realization of document interchange in a heterogeneous computer environment are internationally accepted standards for the description of documents. Already in early 1986, the Standard Generalized Markup Language SGML was published as an international standard for the structuring of documents. The publication of the Office Document Architecture ODA is expected in the course of 1988. The final text is already available. ODA was originally developed for the pure office environment, whereas the concept for SGML addressed the author/publisher environment. This fact is mirrored in the current pilot projects testing the standards: the manufacturers of office and word-processing systems mainly work with ODA, whereas in the technical scientific and publishing sectors SGML is often implemented. Users requiring an interface both to the office sector as well as to the publishing sector will therefore be confronted with the problems related to working with two different, only partially compatible standards. <36> SGML Users' Group. "A Brief History of the Development of SGML." 3-June-1989. 2 pages. Printed as a separate document, and in the SGML Users' Group Newsletter 14 (October 1989) 6-7, and (being free of copyright restrictions) elsewhere, (1) The SGML Handbook, <49>, Appendix A: pp. 567-570; (2) The SGML Source Guide, <134>. <37> Smith, Joan M. "The Standard Generalized Markup Language (SGML) for Humanities Publishing." Literary and Linguistic Computing 2/3 (1987) 171-175. ISSN: 0268-1145. Abstract: a new methodology, and the core of which is generic coding, has been developed within the International Organization for Standardization (ISO). This is known as the Standard Generalized Markup Language (SGML). Using SGML, the elements of a document are marked up as to their role, be it a paragraph, an abstract, a note, or whatever; the style of presentation is a separate issue and is not addressed by SGML. These elements can form part of a data base, which can be updated at will. So there is the notion of data base publishing. The Standard Generalized Markup Language is presented as a tool for full-text data base publishing, where the options for output are open, an example being given as a marked up document. Its value for all aspects of humanities publishing is addressed: whether for scholarly papers intended for a journal, books, specialist publications, dictionaries, or biographies, indeed whatever is input to an electronic medium with the intention of being imaged subsequently in some form; whether alone, in part, or in combination with other text. SGML represents an advance in publishing methodology, taking advantage of developing technology. It can be exploited as such in an academic environment to give an added dimension to research publications. <38> Smith, Joan M. "Standard Generalized Markup Language and Related Standards." Computing Communications 12/2 (April 1989) 80-84. <39> *SoftQuad, Inc. The SGML Primer. SoftQuad's Quick Reference Guide to the Essentials of the Standard: The SGML Needed for Reading a DTD and Marked-up Documents and Discussing them Reasonably. Version 2.0. Toronto: SoftQuad Inc., May 1991. 36 pages. Available from SoftQuad Inc.; 56 Aberfoyle Crescent, Suite 810; Toronto, Ontario; Canada M8X 2W4; TEL: +1 (416) 239- 4801; FAX: +1 (416) 239-7105. <40> Software Exoterica. "Understanding the SGML Declaration." Release 2.0. Ottawa: Software Exoterica Corporation, February 19, 1991. Technical Report ECM03-0291. iv + 34 pages. Available from: Software Exoterica Corporation; 383 Parkdale Avenue, Suite 406; Ottawa, Ontario; CANADA K1Y 4R4; TEL: +1 (613) 722-1700; TEL: +1 (800) 565-XGML; FAX: +1 (613) 722-5706. <41> Tompa, Frank W. "What is (Tagged) Text?" In Dictionaries in the Electronic Age: Proceedings of the Fifth Annual Conference of the UW Centre for the New Oxford English Dictionary (18-19 September 1989, St. Catherine's College, Oxford). Volume 2. Pages 81-93. Waterloo, Ontario: UW Centre for the New OED, 1989. Note: for further details on the Waterloo Centre, see <19>. <42> *"Use of SGML Markup." Chapter 2 (pp. 9-38) in Guidelines for the Encoding and Interchange of Machine-Readable Texts (Text Encoding Initiative, Draft Version 1.0). See <52> in section 3 below. <43> Vooren, Ludo van. "Implementing SGML: Where Do You Start?" 13 (February 1990) 5-7. This contribution proposes implementing SGML in several stages: Document Analysis, Process Design, Document Type Declaration Writing, Document Preparation. Also published in SGML Users' Group Newsletter 17 (August 1990) 5-7. <44> Walter, Mark. "OSU's Chameleon Architecture: A Grammatical Approach to Translation and DTDs." Seybold Report on Publishing Systems 20/7 (December 24, 1990) 17-23. Describes the approach taken by the Chameleon Research Group at the Department of Computer and Information Science at Ohio State University in building SGML translators and DTDs. See more on Chameleon sub <26>. <45> Warmer, Jos; Egmond, Sylvia van. "The Implementation of the Amsterdam SGML Parser." Electronic Publishing: Origination, Dissemination and Design (EPOdd) 2/2 (July 1989) 65-90. ISSN: 0894-3982. Abstract: The Standard Generalized Markup Language (SGML) is an ISO Standard that specifies a language for document representation. This paper gives a short introduction to SGML and describes the (Vrije Universiteit) Amsterdam SGML Parser and the problems we encountered in implementing the Standard. These problems include the interpretation of the Standard in places where it is ambiguous and the technical problems in parsing SGML documents. Note: the parser is available electronically via anonymous-FTP; see <123>. <46> Wu, Gilbert. SGML Theory and Practice. British Library Research Paper 68. British Library Research and Development Department, 1989. ISSN: 0269-9257 No. 68; ISBN 0-7123-3211-1. 93 pages. =========================================================== SGML MANUALS: COMMENTARY AND INDICES FOR ISO 8879 =========================================================== *All five volumes in this section are available for purchase from the GCA collection "Information Technologies Publications and Resources," as well as from the named publishers or distributors. GCA addresses are given in entry <105>. Publications <48> and <51> are also available from the SGML Users' Group Bookstore and may be ordered from the SGMLUG Secretary; see <106>. <47> Bingham, Harvey W. SGML Syntax Summary. Cambridge, MA: Interleaf, 2-June-1988. 46 pages. The document supplies cross- reference information which is not given or optimally accessible in the ISO 8879 standard itself. The syntax summary covers the primary ISO document (8879), Amendment 1 (Fall 1987) and Amendment 1, Corrections (May 1988). Copies of the syntax summary were mailed to subscribers of with issue 1/4 (1988). Updates are (were?) available from Interleaf. <48> Bryan, Martin. SGML: An Author's Guide to the Standard Generalized Markup Language. Wokingham/Reading/New York: Addison-Wesley, 1988. ISBN: 0-201-17535-5 (pbk); LC CALL NO: QA76.73.S44 B79 1988. 380 pages. A highly detailed and useful manual explaining and illustrating features of ISO 8879. The book: (1) shows how to analyse the inherent structure of a document; (2) illustrates a wide variety of markup tags; (3) shows how to design your own tag set; (4) is copiously illustrated with practical examples; (5) covers the full range of SGML features. Technical and non-technical authors, publishers, typesetters and users of desktop publishing systems will find this book a valuable tutorial on the use of SGML and a comprehensive reference to the standard. It assumes no prior knowledge of computing or typography on the part of its readers. <49> Goldfarb, Charles F. The SGML Handbook. Edited and with a foreword by Yuri Rubinsky. Oxford: Oxford University Press, 1990. ISBN: 0-19-853737-1. 688 pages. This volume contains the full annotated text of ISO 8879 (with amendments), authored by IBM Senior Systems Analyst and acknowledged "father of SGML," Charles Goldfarb. The book was itself produced from SGML input using a DTD which is a variation of the "ISO.general" sample DTD included in the annexes to ISO 8879. The SGML Handbook includes: (1) the up-to-date amended full text of ISO 8879, extensively annotated, cross-referenced, and indexed (2) a detailed structured overview of SGML, covering every concept (3) additional tutorial and reference material (4) a unique "push- button access system" that provides paper hypertext links between the standard, annotations, overview, and tutorials. <50> Herwijnen, Eric van. Practical SGML. Dordrecht/Hingham, MA: Wolters Kluwer Academic Publishers. 200 pages. ISBN: 0-7923- 0635-X. The book is designed as a "practical SGML survival-kit for SGML users (especially authors) rather than developers," and itself constitutes an experiment in SGML publishing. The book provides a practical and painless introduction to the essentials of SGML, and an overview of some SGML applications. See the reviews by (1) Carol Van Ess-Dykema in Computational Linguistics 17/1 (March 1991) 110-116, and (2) Deborah A. Lapeyre in 16 (October 1990) 12-14. <51> Smith, Joan M.; Stutely, Robert S. SGML: The Users' Guide to ISO 8879. Chichester/New York: Ellis Horwood/Halsted, 1988. 173 pages. ISBN: 0-7458-0221-4 (Ellis Horwood) and ISBN: 0-470- 21126-1 (Halsted). LC CALL NO: QA76.73.S44 S44 1988. The book (1) supplies a list of some 200 syntax productions, in numerical and alphabetical sequence; (2) gives a combined abbreviation list; (3) includes highly useful subject indices to ISO 8879 and its annexes (4) supplies graphic representations for the ISO 8879 character entities; (5) lists SGML keywords and reserved names. An overview of the book may be found in the SGML Users' Group Newsletter 9 (August 1988) 9. ================================================ SGML APPLICATIONS AND IMPLEMENTATIONS ================================================ <52> ACH/ACL/ALLC. Guidelines for the Encoding and Interchange of Machine Readable Texts. Edited by C. Michael Sperberg-McQueen and Lou Burnard. TEI-P1, Version 1.1 October 1990. xx + 289 pages. This volume represents the results of work in Phase I of the International Text Encoding Initiative, sponsored by ACH/ACL/ALLC and several advisory associations. The publication describes and illustrates mechanisms (some experimental) for SGML markup of many kinds of documents, especially for humanities fields (literary, linguistic, historical, philosophical studies). The TEI encoding standard is an SGML application. Contact the editors: in the US, Michael Sperberg- McQueen; BITNET: u35395@uicvm; Computer Center (M/C 135); University of Illinois at Chicago; Box 6998; Chicago, IL 60680; TEL: (312) 996-2981; in the UK, Lou Burnard; JANET: lou@vax.ox.ac.uk; Oxford University Computing Services; 13 Banbury Road; Oxford OX2 6NN; TEL: (44) 865-273238. Version note: Draft Version TEI-P2 is scheduled to appear January 1992, and Version 3 is to be produced at the end of the current TEI project, April-June, 1992. Summary descriptions of the TEI effort may be found in several publications, among which the following may be mentioned: (1) Susan Hockey (Chair, TEI Steering Committee), "The ACH-ACL-ALLC Text Encoding Initiative: An Overview." TEI Technical Report No. TEIJ16. June, 1991. Available as 'TEIJ16 DOC' from the public LISTSERVer at UICVM: see <114> below; or contact Wendy Plotkin, Email: u49127@uicvm.uic.edu, OR u49127@uicvm.cc.uic.edu OR u49127@UICVM.BITNET; Postal: Wendy Plotkin, TEI-Rm 168 UIC Computer Center; Computer Center (M/C 135); University of Illinois at Chicago; Box 6998; Chicago, IL 60680; (2) Lou Burnard, "An Introduction to the Text Encoding Initiative." Pp. 81-91 in Modelling Historical Data: Towards a Standard for Encoding and Exchanging Machine-Readable Texts: see sub <21> above. Anyone interested in a full description of the TEI and its publications should request the TEI document title lists: (3) Document Number TEI A0 ("Text Encoding Initiative, Current Documents"), listing some 200 technical reports and working papers, and (4) Document Number TEI SC R14 ("Talks and Papers on the Text Encoding Initiative: A Cumulative List"). <53> American National Standard for Electronic Manuscript Preparation and Markup. (ANSI/NISO Z39.59-1988). Published for NISO (National Information Standards Organization) by Transaction Publishers (New Brunswick, NJ), 1991. xv +167 pages. ISBN: 0- 88738-945-7. ISSN: 1041-5653. An earlier form of the document was published simply as: the Standard for Electronic Manuscript Preparation and Markup. (ANSI/NISO Z39.59-1988). 1987, 1988. This document was developed over several years as the "AAP Standard," was later promoted to by EPSIG/AAP as "the Electronic Manuscript Standard" or simply as the "Standard," and is now a NISO publication. The AAP/EPSIG application is SGML-conforming, and provides a suggested tagset for authors and publishers. The standard is said to "represent the first industry wide application of SGML (Standard Generalized Markup Language, ISO 8879). The standard defines the format syntax of the application of SGML publication of books and journals. The standard achieves two goals. First, it establishes a standard way to identify and tag parts of an electronic manuscript so that computers can distinguish between these parts. Second, it provides a logical way to represent special characters, symbols, and tabulator material, using only the ASCII character set usually found on a standard keyboard." The standard is available for $75 (75 US dollars) from Transaction Publishers, Rutgers--The State University, New Brunswick, NJ 08903, TEL: (1 908) 932-2280. Discounts are available for purchase of multiple copies. Equally, the volume may be ordered from EPSIG: see <107>. <54> Association of American Publishers. Author's Guide to Electronic Manuscript Preparation and Markup. 2nd edition, November 1987. Reprinted 1989. ISBN: 1-55653-086-2. Available from EPSIG. <55> Association of American Publishers. The Markup of Mathematical Formulas. 2nd edition, 1987. Reprinted 1989. ISBN:1-55653- 083-8. Available from EPSIG. <56> Association of American Publishers. The Markup of Tabular Material. 2nd edition, 1987. Reprinted 1989. ISBN: 1-55653- 085-4. Available from EPSIG. <57> Association of American Publishers. Reference Manual on Electronic Manuscript Preparation and Markup. 2nd edition, November 1987. Reprinted 1989. ISBN: 1-55653-084-6. Available from EPSIG. <58> Guittet, Christian (ed.) FORMEX: formalisation de l'échange de publications électronique = Formalised Exchange of Electronic Publications. Luxembourg: Office des Publications officielles des Communautés européennes, 1985. ISBN: 92-825-5399-X. The volume contains an introduction to SGML and implementation of the standard for electronic interchange of CEC and OPOCE documents. <59> Smith, Joan M. The Standard Generalized Markup Language (SGML): Guidelines for Editors and Publishers. British National Bibliography Research Fund, 26. 1987. ISBN: 0-7123-3111-5. ISSN: 0264-2972. The abstract for <60> generally pertains to this document as well. <60> Smith, Joan M. The Standard Generalized Markup Language (SGML): Guidelines for Authors. British National Bibliography Research Fund, 27. 1987. ISBN: 0-7123-3112-3. ISSN: 0264-2972. Abstract: These guidelines are for authors of scholarly publications who wish to prepare documents for a publisher on existing text entry devices, word processors and personal computers, adding markup to the text in accordance with the Standard Generalized Markup Language (SGML). A simple approach is adopted, based on the concept of a starter set of tags. An explanation of SGML is given and why markup should be used, and advice provided on what is to be done if the author has a publisher, has not yet got a publisher, or is his or her own publisher. As far as the preparation of the document is concerned, there is advice on keying conventions, when not to use stylistic and formatting characteristics of the system, and conditions under which its features and facilities may be used. The starter set of tags is explained, and how to deal with lists, tables, and figures. Cross referencing is addressed and the preparation of an index -- all with examples. Information is given on how to extend the starter set and how to cope with text the author may not be able to markup for any reason. How to deal with characters for printing, that cannot be imaged on the text entry device, is explained, also how to use abbreviations for lengthy character strings of a repetitive nature. For all other issues, the author is referred to the publisher, to the companion 'Guidelines for Editors and Publishers', and to the standard itself. <61> US Department of Defense. Military Specification. Markup Requirements and Generic Style Specification for Electronic Printed Output and Exchange of Text (SGML). CALS Phase 1.1 Core Requirement Document. MIL-M-28001A Draft (Superseding MIL-M- 28001, 15 December 1988). 17 July 1989. 501 pages. Further amendments, revisions and extensions to this standard are in progress as of 1991. The document is available in paper from the GCA and from any CALS document suppliers. It is also available free in electronic format from the CALS-BBS Internet forum: see <117>. One would connect to the server via anon-FTP and "get" the individual files from the pub/cals/28001a subdirectory. For machine parsable Document Type Declaration Sets and Output Specification Instance files, download the following in addition to the paginated "human readable" 28001a files: (1) 38784PID.EXE (Document Type Declaration Set, Contents of Public Identifier "-//USA-DOD//DTD MIL-M-38784B 900102//EN"); (1a) the revised/corrected file 38784-V2.EXE or 38784-V2.PID; (2) TEMP-PID.EXE (Document Type Declaration Set, Contents of Public Identifier "-//USA DOD//DTD TEMPLATE 900102//EN"); (3) OS-PID.EXE (Document Type Declaration Set, Contents of Public Identifier "-//USA-DOD//DTD OUTPUT SPEC 900102//EN"); (4) TEMP-FOS.EXE (FOSI = Formatted Output Specification Instance Template for OUTSPEC.PID). The fuller CALS Specifications and Requirements package is likewise available in looseleaf paper format from GCA, and free from the CALS-BBS Internet forum in electronic format. E.g., (1) MIL-STD 1840A (Military Standard. Automated Interchange of Technical Information. Superceding MIL-STD-1840); (2) MIL-R- 28002 (Military Specification. Raster Graphics Representation in Binary Format, Requirements for); (3) MIL D-28000 (Military Specification. Digital Representation for Communication of Product Data: IGES Application Subsets. With Amendment 1, of 20-December-1988); (4) MIL-D-28003 (Military Specification. Digital Representation for Communication of Illustration Data: CGM Application Profile); (5) MIL-HDBK-59a (Military Handbook. Department of Defense, Computer-Aided and Acquisition Logistic Support (CALS) Program Implementation Guide). Explanation of these core requirements for CALS and the relationships of the specifications to SGML is accomplished in a book by Joan M. Smith, An Introduction to CALS: The Strategy and the Standards. Technology Appraisals Limited, 1990. ISBN 1-871802-04-0. 143 pages. This introductory volume is also available from the GCA. <62> Vignaud, Dominique. L'édition structurée des documents: SGML application à l'édition français. Paris: Éditions du Cercle de la Librarie, 1989. ISBN: 2-7654 0420-8. This volume was prepared to assist French publishers with application of the SGML standard. It supplies a basic DTD, and additional materials are available (including electronic files) for extending the DTD. The book is said to be the first volume in a series L'édition structurée des documents, published by Éditions du Cercle de la Librarie. For availability, contact the Syndicat nationale de l'édition (SNE) or: Éditions du Cercle de la Librarie, 35 rue Grégorie-de-Tours, 75006 Paris, France. Additional details: see "SGML: application à l'édition français," SGML Users' Group Newsletter 13 (August 1989) 9; Yuri Rubinsky's brief review, "Can Imaginative Objects Have Intentions?" 10 (July 1989) 11; or "French Book DTD Available," 9 (March/April 1989) 15. The book is similar in purpose to the American (EPSIG/AAP) volume "Standard for Electronic Manuscript Preparation and Markup" published by NISO, <53>, and to the British volumes written by Joan Smith: <59> and <60>. Whereas the EPSIG/AAP standard for electronic publishing defined some 220 tags, Vignaud's DTD deliberately defines only 60 tags. ==================================================== ISO STANDARDS PUBLICATIONS GERMANE TO SGML ==================================================== The following ISO (standards) publications are listed in two groups: (A) the SGML standard and eight other standards/documents considered to belong to the fuller suite of 'SGML' standards; (B) other standards which are referenced by SGML or otherwise, for various reasons, are of interest to SGML users. In the first list, numerically, are: the ISO 8879 (SGML) standard <63> and its amendment <64>; ISO 9069 SDIF <65>, ISO/IEC Public Text Owner Identifiers <66>, ISO/IEC TR 9573 Techniques for Using SGML <67>, ISO/IEC TR 10037 Syntax-Directed Editing Systems <68>, ISO/IEC DIS 10179 DSSSL <69>, ISO/IEC DIS 10180 SPDL <70>, ISO/IEC CD 10743 SMDL <71>, and ISO/IEC DIS 10744 HyTime <72>. The ISO documents listed below are available from national member bodies of ISO, but many are also available from the Graphic Communications Association using phone (credit card) or FAX ordering. The GCA may be reached for ordering as follows: TEL: (1 703) 519-8157; FAX (1 703) 548-2867. GCA's postal address is: Graphic Communications Association; 100 Daingerfield Road, Alexandria, VA 22314. A smaller number of standards documents (e.g., ISO 8879, ISO 8879 Amendment, ISO 9069, ISO 9544, ISO 9573) are available from the secretary of the SGML Users' Group at member prices: contact Mr. Stephen G. Downie; SGML Users' Group, Secretary; c/o SoftQuad Inc.; 56 Aberfoyle Crescent, Suite 810; Toronto, Ontario; Canada M8X 2W4; TEL: +1 416 239 4801; FAX: +1 416 239 7105. The address for ISO is: ISO Central Secretariat; 1, rue de Varembé; Case Postale 56; CH-1211 Geneva 20; SWITZERLAND; TEL: (022) 34-12-40; FAX: +41-22-33-34-30; TELEX: 23 887 iso ch. The bibliographic information below is current through about November 1991, but incomplete citations and inaccuracies are certain to be found. Corrections/additions/updates in support of currency and accuracy will be welcomed. PRIMARY ISO DOCUMENTS RELATING TO SGML: <63> ISO 8879:1986. Information Processing -- Text and Office Systems -- Standard Generalized Markup Language (SGML). International Organization for Standardization. Ref. No. ISO 8879:1986 (E). Geneva/New York, 1986. A subset of SGML became a US FIPS (Federal Information Processing Standard) in 1988. The British Standards Institution adopted SGML as a national standard (BS 6868) in 1987, and in 1989 SGML was adopted by the CEN/CENELEC Standards Committees as a European standard, #28879. Australia has dual numbered versions of ISO 8879 SGML and ISO 9069 SDIF (AS 3514 - SGML 1987; AS 3649 - 1990 SDIF). A one-page NTIS technical note on ISO 8879 as a US FIPS document, FIPS-PUB-152, provides the following abstract for ISO 8879: Abstract "This citation summarizes a one-page announcement of technology available for utilization. A Federal Information Processing Standard (FIPS) recently approved by the Secretary of Commerce should help federal agencies improve their communications with publishing organizations. (FIPS are developed by NIST for use by the federal government.) The new standard, called Standard Generalized Markup Language (SGML), provides a common way for defining markup languages so documents can be transferred from author to publisher in a standardized format. By providing a coherent and unambiguous syntax for describing the elements within a document, SGML makes it easier to move unformatted textual data among different installations and processing systems. Developed by the International Organization for Standardization (ISO) and the American National Standards Institute (ANSI) with assistance from NIST, the SGML standard is already being used by the Computer-Aided Acquisition and Logistics Support (CALS) program of the Department of Defense to develop a military specification. NIST is providing technical support for the CALS program. In addition, NIST has developed the first set of conformance tests for SGML; ISO and ANSI are considering using these tests for their own test suites." See "Publishing Standard Allows for the Transfer of Documents from Author to Publisher," NTIS Tech Note, 081914000; National Bureau of Standards, Gaithersburg, MD; May 1989. The SGML standard is now (1991) in the process of its first five-year review. National member bodies of ISO and other entities are submitting revision statements to the ISO/IEC JTC1/SC18/WG8 for review. See, for example, statements by ANSI X3VI.8 and the SGML Users' Group, printed in the SGML Users' Group Newsletter 20 (September 1991) 20. For other possible addenda and changes to 8879, see "Recommendations for a Possible Revision of ISO 8879. ISO/IEC JTC1/SC18/WG8 N931 [Part I]," 12 (December 1989) 6-8 and "Recommendations for a Possible Revision of ISO 8879. Part II. ISO/IEC JTC1/SC18/WG8 N931," 13 (February 1990) 12-15; "Additional Recommendations for a Possible Revision of ISO 8879 - Information Processing - Text and Office Systems (ISO/IEC JTC1/SC18/WG8 N1013," 15 (August, 1990) 12-14. Balloting of 18 countries' national standards bodies (from 25) based upon review of the standard between November 15, 1990 and February 28, 1991 resulted in general confirmation of ISO 8879, with six requests for revision. WG8 will continue to review ISO 8879 in light of the comments and recommendations for revision, but the standard is thus confirmed through 1996. See details in "Replies on Review of ISO 8879 (SGML," EPSIG News 4/4 (December 1991) 8. <64> ISO 8879:1986 / A1:1988 (E). Information Processing -- Text and Office Systems -- Standard Generalized Markup Language (SGML), Amendment 1. Published 1988-07-01. Geneva: International Organization for Standardization, 1988. <65> ISO 9069:1988. Information Processing -- SGML Support Facilities -- SGML Document Interchange Format (SDIF). 13 September 1988. Geneva/New York: International Organization for Standardization, 1988. Also available as The British Standard Guide to SGML Document Interchange Format (SDIF), BS 7138 1989 (ISO 9069: 1988; see in "Snippets," SGML Users' Group Newsletter 14 (October 1989) 12. <66> ISO/IEC 9070:1991. Information Processing -- SGML Support Facilities -- Registration Procedures for Public Text Owner Identifiers. Second edition. 15 April 1991. The "public text" envisioned in this standard as applied to SGML might be DTDs (Document Type Definitions), or declaration subsets of DTDs, public entity sets, etc. Names include an owner name and an object identifier. Equivalent encodings for the names in ASN.1 and SGML may be supplied for interchange purposes. Note: "The intention of the amendment that has resulted in a 2nd edition is to extend 9070 beyond the simple boundaries of SGML only. It is now used by 9541 (and 10036) for the definition of 'structured names'. A New Work Item Proposal is being submitted to change the title and scope of 9070 to show its extended usefulness." (note from Paul Ellison, December 1991) <67> ISO/IEC TR 9573:1988 (E). Information Processing -- SGML Support Facilities -- Techniques for Using Standard Generalized Markup Language (SGML). December 09, 1988. Anders Berglund, editor. vi + 124 pages. A major revision of the TR underway (as of May 1990) will result in a new TR with (16) parts: (1) SGML Tutorial (2) Basic Techniques (3) Advanced Techniques (4) Using Short References for Identifying Markup (5) Using non-Latin Alphabets (6) Referencing and Synchronisation (7) Mathematics and Chemistry (8) Tables (9) Using SGML for Computer-to-Computer Interchange (10) Designing Applications for Database Interfacing (11) Application at ISO CS for International Standards and Technical Reports (12) Public Entity Sets for General and Publishing Symbols (13) Public Entity Sets for Mathematics and Science (14) Public Entity Sets for Latin Based Alphabets (15) Public Entity Sets for non-Latin Based Alphabets (16) Public Entity Sets for Ideograms (adapted from Ludo Van Vooren, "SGML Standards Committee Update: Activities of ISO SC 18 WG8," 14 (May 1990) 11-12. See also Joan M. Smith in "More Liaison Statements to ISO," SGML Users' Group Newsletter 13 (August 1989) 6-7. A description of this ISO document is found in "Publication of Techniques for Using SGML," SGML Users' Group Newsletter 11 (January 1989) 3-4. Further update of parts 1-5 of TR 9573 will be delayed until the 5-year revision of SGML (ISO 8879) is completed. <68> ISO/IEC TR 10037:1991. Information Processing -- SGML and Text Entry Systems -- Guidelines for SGML Syntax-Directed Editing Systems. 15 March 1991. Geneva: International Organization for Standardization, 1991. The document supplies technical guidance for the development of context- sensitive SGML editors. See "Guidelines for Syntax-Directed Editing Systems," SGML Users' Group Newsletter 14 (October 1989) 3. <69> ISO/IEC DIS 10179:1990. Text Composition -- Document Style Semantics and Specification Language (DSSSL). ISO Project 18.15.6.01. 1988, 1989, 1990. Edited by Sharon Adler. vi + 132 pages. See <1> for early commentary. Ocasionally SGML is criticized for its fundamental principle (sometimes declared 'misguided philosophy') of radically separating content from appearance. DSSSL addresses this issue, although as a distinct, separate processing matter: DSSSL acknowledges the need to support the exchange of semantic information about a document's layout structures and other presentation or processing features. From the Standard's introduction: "This International Standard defines the Document Style Semantics and Specification Language (DSSSL) for the specification of document processing, such as formatting and data management functions, with the initial focus on formatting to both print and on display media, and data conversion. The International Standard has been structured to permit future sections to be added to this International Standard to cover the areas of data management. The objective of the DSSSL Standard is to provide a formal and rigorous means of expressing the range of document production specifications, including high-quality typography, required by the graphic arts industry. These specifications will be expressed using standardized basic semantics or combinations of the basic standard DSSSL semantics. These semantics will allow users to specify fully the characteristics to be be applied during document processing, such as composition, pagination, and imposition. The DSSSL typographic semantics may be used to form the basis of a standard style sheet language. In addition, DSSSL includes General Language Transformation constructs which provide the capability to translate into an existing processing language, such as a data base update language (e.g., SQL) or a traditional text formatting language. A status summary based on note from Paul Ellison, December 5, 1991: The DIS ballot on DSSSL has closed, and sufficient votes have been received to allow DSSSL to proceed to IS once ballot comments have been resolved. Although a negative vote was received from three countries (UK, Germany and France), there were a considerable number of major and minor comments. In addition an extensive liaison statement has been received from SC18/WG3. Resolution of these comments will take many meetings. <70> ISO/IEC DIS 10180:1991. Information Processing -- Text Composition -- Standard Page Description Language (SPDL). Geneva: International Organization for Standardization, 1991. For a summary, see: (1) SGML Users' Group Newsletter 20 (September 1991) 17-18; Peter J. Robinson, and Stephen M. Strasen, "Standard Page Description Language," Computing Communications 12/2 (April 1989) 85-92; (2) "Text Composition Standards," SGML Users' Group Newsletter 15 (January 1990) 7-8. Note: ISO/IEC 10180 has now passed DIS ballot with no negative votes. The joint editors are expected to have the final text ready for publication during 1992 (so Paul Ellison, December 1991). <71> ISO/IEC CD 10743:1991. Information Technology -- Standard Music Description Language (SMDL). April 1, 1991. SMDL "defines a language for the representation of music information, either alone, on in conjunction with text, graphics, or other information needed for publishing or business purposes." Multimedia time sequence information in also supported. SMDL is a HyTime application conforming to ISO/IEC DIS 10744 Hypermedia/Time- based Structuring Language (HyTime), and an SGML application conforming to Standard Generalized Markup Language (ISO 8879:1986). An earlier version was published by ANSI (American National Standards Institute), as ANSI X3V1.8M Journal of Development. ANSI Project X3.542-D. Standard Music Description Language (SMDL). X3V1.8M/SD-8. 60 pages. Sixth Draft. April 15, 1990. See a description of SMDL in: Steven R. Newcomb, "Standard Music Description Language Complies with Hypermedia Standard," IEEE Computer 24/7 (July 1991) 76-79. <72> ISO/IEC DIS 10744:1991. Information Technology -- Hypermedia/Time-based Structuring Language (HyTime). Edited by Charles F. Goldfarb (with assistance from Steven R. Newcomb). 10-October-1991. Voting on the Draft International Standard terminates on April 10, 1992. Copies are available from the SIGhyper group, <109> below, as well as from any national member of ISO. "HyTime is a standard neutral markup language for representing hypertext, multimedia, hypermedia, and time- and space-based documents in terms of their logical structure. Its purpose is to make hyperdocuments interoperable and maintainable over the long term. HyTime can be used to represent documents containing any combination of digital notations. HyTime is parsable as Standard Generalized Markup Language (ISO 8879:1986). HyTime provides standardized means of expressing (1) intra- and extra-document locations, and arbitrary links between them, (2) the scheduling of multimedia objects in 'finite coordinate spaces,' and (3) rendering instructions for arbitrarily projecting such objects onto other finite coordinate spaces, and other constructs." = Abstract from CACM 34/11 (November 1991) 67-83. An earlier version was published as an ANSI (American National Standards Institute) standard: ANSI X3V1.8M Journal of Development. ANSI Project X3.749-D. HyperMedia/Time-Based Structuring Language (HyTime). X3V1.8M/SD- 7. 68 pages. Sixth Draft. April 15, 1990. OTHER STANDARDS RELATED TO ISO 8879:1986 (SGML): <73> ISO 639:1988 (E/F). Code for the Representation of Names of Languages. First edition, 1988-04-01. Reference number is ISO 639:1988 (E/F). iii + 17 pages. Geneva: International Organization for Standardization, 1988. This document is a technical revision of ISO 639:1967, prepared by Technical Committee ISO/TC 37. The two-character language codes of ISO 639 are relevant to SGML encoding in two respects. First, the SGML standard (ISO 8879) itself specifies that declaration of 'public text language' should be given using the language code(s) from ISO 639; see ISO 8879-1986(E) page 36, section 10.2.2.3. Second, the WSD (Writing System Declaration) implemented in the Text Encoding Initiative uses the two-character language code of ISO 639 (as amended) as a 'language.code' attribute of the 'nat.language' declaration, specifying the language in which the WSD is written. ISO 639 contains much other information about the use of language symbols, registration of new symbols, etc. The language codes of ISO 639 are said to be "devised primarily for use in terminology, lexicography and linguistics, but they may be used for any application requiring the expression of languages in coded form." The registration authority for ISO 639 is given as Infoterm, Österreiches Normungsinstitut (ON), Postfach 130, A-1021 Vienna, AUSTRIA. The two-character language codes of ISO 639 are recognized as being inadequate for use as SGML language attributes when tagging text, viz, for use as global 'lang' attributes attached to any element to identify the language of the text element or a language shift. In principle, there should be nothing wrong with tagging language using SGML elements rather than attributes, if the encoder has principled reasons for not using attributes (e.g., indexing engines which read simple tags but not SGML attributes). But the two-character codes of ISO 639 are neither sufficiently mnemonic nor complete for the world's languages: whereas ISO 639 supplies codes for only about 136 languages, the Ethnologue published by the Summer Institute of Linguistics identifies over 6100 languages (see Ethnologue: Languages of the World, ed. Barbara Grimes. 11th edition. Dallas, TX: Summer Institute of Linguistics, 1988). A revision of ISO 639 completed late 1990 is described as supplying 3-character language codes (following MARC 3-character language codes in part), based upon the code sequence of the American National Standard (ANSI Z39.53). This draft will be circulated for worldwide review in 1991/92. It remains to be seen whether these new ISO 639 3-character codes qualify mnemonically for use in SGML tagging and if the set is complete. <74> ISO 646:1991. Information Processing -- 7-bit Coded Character Set for Information Interchange. Geneva: International Organization for Standardization, 1991. (646 IRV identical to ASCII) <75> ISO 2014:1976. Writing of Calendar Dates in All-Numeric Form. Geneva/New York: ISO (International Organization for Standardization), 1976. Now superseded by ISO 4166 (?). <76> ISO 2022:1982. Information Processing -- ISO 7-bit and 8-bit Coded Character Sets -- Code Extension Techniques. <77> ISO 2375:1985. Data Processing -- Procedure for Registration of Escape Sequences. Geneva/New York: International Organization for Standardization, 1985. <78> ISO 4873:1985. 8-bit Code for Information Interchange -- Structure and Rules for Implementation. Geneva/New York: International Organization for Standardization, 1985. (In 1990: under review as ISO/DIS 4873:1990.) <79> ISO 6429:1988. Additional Control Functions for Character Imaging Devices. <80> ISO DIS 6937:1990. Coded Character Sets for Text Communication. ISO DIS 6937 is under review as CD 6937:1990. The titled parts are: ISO DIS 6937-1:1983 General Information; ISO DIS 6937- 2:1983 Latin alphabetic and non-alphabetic graphic characters; ISO DIS 6937-2:1989 Addendum 1, Latin alphabetic and non- alphabetic graphic characters. Parts 3 and 4 (DIS 6937-3 and DIS 6937-4) are now superseded by DIS 10538. <81> ISO DIS 7350-2:1990. Text Communication -- Registration of Graphic Character Subrepertoires of the Graphic Character Repertoire of ISO 10367. Geneva/New York: International Organization for Standardization, 1990. <82> ISO 8613:1988. Information Processing -- Text and Office Systems -- Office Document Architecture (ODA) and Interchange Formats. 1988. For addenda, see also "Other ISO News. Addenda to ISO 8613," SGML Users' Group Newsletter 14 (October 1989) 3- 4. The ODA/ODIF standard in 8 parts. It is also available as a CCITT document, according to Erik Naggum, for considerably less money. The fascicles are available free from ECMA to qualifying parties. "ODA is also available from your favorite CCITT outlet as Blue Book Volume VII, fascicle VII.6, Terminal equipment and protocols for telematic services, T.400-T.418, and costs a nominal CHF 47 (or CHF 57 if you order it from CCITT yourself). ISO 8613 and the T.400-series are supposedly identical, as per ISO 8613-1, Annex B, Relationships with other standards, B.2 Other standards, section B.2.1, second paragraph: 'The text of ISO 8613-1 to ISO 8613-8 are identical to the texts in the correspondingly numbered CCITT Recommendations T.411 to T.418 except for mandated stylistic differences and provisions of ISO 8613 that are outside the scope of these Recommendations'." (Erik Naggum , Naggum Software, Oslo, Norway, News comp.text, July 13, 1990) Fuller bibliographic coverage for ODA/ODIF is not practical here, given the immediate goals of the document and space limitations. As pointers to the literature on ODA, here are five citations: (1) Appelt, Wolfgang. Document Architecture in Open Systems: The ODA Standard. Berlin/Heidelberg/New York: Springer-Verlag, 1991. ISBN: 3-540-54539-5. (2) Bormann, U.; Bormann, C.; Bathe, C. "SDE -- A WYSIWYG Editing and Formatting System for ODA and SGML." In Esprit '88: Putting the Technology to Use. Proceedings of the 5th Annual ESPRIT Conference (Brussels, November 14-17, 1988). II:1075-1095. Amsterdam/New York: Elsevier/North-Holland, 1988. (3) Brown, H. "Standards for Structured Documents," The Computer Journal 32/6 (December 1989) 505-514. (4) Dawson, Frank; Nielsen, Fran. "ODA and Document Interchange Office Document Architecture Standard." UNIX Review 8/3 (March 1990) 50-57. (5) Rosenberg, Jonathan; Sherman, Mark; Marks, Ann; Akkerhuis, Jaap. Multi-media Document Translation: ODA and the EXPRES Project. New York, NY: Springer-Verlag, 1991. ISBN 0-387-97397-4 [U.S.]; ISBN 3-540- 97397-4 [Germany]. See also the excellent survey article in <24>. <83> ISO 8632:1987. Information Processing Systems - Computer Graphics - Metafile for the Storage and Transfer of Picture Description Information (CGM). Geneva/New York: International Organization for Standardization, 1987. Note: the standard is in several parts. Part 2: Character Encoding; Part 3: Binary Encoding; Part 4: Clear-Text Encoding. <84> ISO 8859:1987-. Information Processing -- 8-bit Single-Byte Coded Graphic Character Sets. Multiple parts, 1987-1991. Titles for parts 1,2 7,8 are given below. Character set and character code issues are not the fundamental concern of SGML, since any code sets can be declared and managed by SGML. On the other hand, SGML users typically value SGML for its intelligent handling of multi-lingual text and document production, so it is natural that code sets are of some interest. See also the multi-byte code standards ISO 10646 <93> and Unicode <95> below. Anyone interested ISO 8859 discussions may subscribe to the electronic mailing list: ISO8859@JHUVM.BITNET, ASCII/EBCDIC character set related issues. Owner = HART@APLVM (Ed Hart). A variety of ASCII/EBCDIC character set related issues are discussed on the ISO8859 list, including (but not limited to) the following: (1) Definitions of codes (2) Translations between ASCII and EBCDIC ISO 8859/1 (8-bit ASCII with international accented characters for Latin alphabets) and ANSI equivalent X3.134.2-198x (3) IBM Country Extended Code Pages (CECPs) particularly, U.S./Canada English CECP 37, V1; and Code Page 850 (PC, PS/2); (4) Application/Program Product support of characters and codes. <85> ISO 8859-1: 1987 (E). Information Processing -- 8-Bit Single- Byte Coded Graphic Character Sets -- Part 1: Latin Alphabet No. 1. First edition. 15 February 1987. Geneva/New York: International Organization for Standardization, 1987. <86> ISO 8859-2: 1987 (E). Information Processing -- 8-Bit Single- Byte Coded Graphic Character Sets -- Part 2: Latin Alphabet No. 2. First edition. 15 February 1987. Geneva/New York: International Organization for Standardization, 1987. <87> ISO 8859-7:1987 (E). Information Processing -- 8-Bit Single- Byte Coded Graphic Character Sets -- Part 7: Latin/Greek Alphabet. First edition. 15 November 1987. Geneva/New York: International Organization for Standardization, 1987. <88> ISO 8859-8:1988 (E). Information Processing -- 8-Bit Single- Byte Coded Graphic Character Sets -- Part 8: Latin/Hebrew Alphabet. First edition. 1 June 1988. Geneva/New York: International Organization for Standardization, 1988. <89> ISO DIS 9541:1989. Information Processing -- Font and Character Information Interchange. Its parts are: ISO DIS 9541-1.2 Part 1: Architecture; ISO DIS 9541-2.2 Part 2: Interchange Format. Note from Paul Ellison, December 5, 1991 follows. "ISO 9541 Parts 1 & 2 and ISO 10036 are now published ISO standards. An error in Part 2 has been found, and a 'defects procedure' will be set up. DIS 9541 Part 3 is currently (December 1991) out for DIS ballot. User Requirements were developed for Part 4 (Application Specific Properties). User Requirements (UR) and New Work Item Proposals (NP) were developed for two amendments: (1) To Part 1 to add additional properties for the improved setting of Latin- based and of East Asian languages, and for the setting of script-based languages (eg Arabic and Hindi). This NP will include changes to Part 2 to allow the additional properties to be interchanged; (2) To Part 2 only, to add facilities for the interchange of partial fonts, subsets of fonts, and families of fonts." <90> ISO TR 9544:1988. Information Processing -- Computer-Assisted Publishing -- Vocabulary. 15 July 1988. 43 pages. Due for three-year review as of late 1991; discussions are being held to change the editorship (Paul Ellison is currently nominated). <91> ISO/DIS 10036:1989. Procedure for Registration of Glyph and Glyph-Collection Identifiers. Includes the text of ISO DIS 9541 on registration. <92> ISO DIS 10538: . Control Functions for Text Communication. Incorporates ISO 6937, parts 3 and 4, into a separate standard. <93> ISO/IEC DIS 10646. Information technology - Universal Coded Character Set (UCS). The DIS voting ended 1991-06-06. Note: (from Harry Gaylord, December 1991) "SC2 decided on a revised DIS to be sent out for a four month voting period at Rennes, France in October. The revised DIS should be sent to national bodies in January/February 1992." As of late 1991, ISO 10646 DIS and the consortial 'Unicode' standard, an alternative multi- byte code standard, were in a period of 'merger.' Both standards efforts attempt to define a multi-byte character encoding large enough to account for the world's major writing systems; the 'merger' is a complex story which can only be summarized here. The following paragraphs present a characterization of the ISO 10646 effort from the perspective of the list owner of the electronic discussion group. Entry <95> below, following the ISO documents list, supplies information on Unicode. ISO10646 List: Multi-byte Code Issues. ISO10646@JHUVM.BITNET Owner = HART@APLVM (Ed Hart). The purpose of the list is to serve as a clearing house for information on and discussion of multi-byte coded-character-set issues. The ISO 10646 draft and Unicode draft standards represent two different approaches to encoding the world's characters into a multi-byte code. Background to the disucssion: People are looking at multi-byte codes as a way to solve many of the problems we are experiencing with single-byte, 7-bit and 8-bit codes. Although most of us do not need all 191 of the characters in the ISO 8859-1 character set (repertoire), we frequently need characters outside of this set; for example, bullets or nice quotation marks for professional looking documents, symbols for mathematics and science, etc. The reason for developing multi-byte codes is that processing ONE multi-byte code appears easier than several single-byte codes. As of March, 1990, two coding schemes have emerged. The International Organization for Standardization (ISO) Subcommittee 2, Working Group 2 (SC2/WG2) has developed the ISO 10646 Multi-Octet Code. It is now a "draft proposed" standard (two levels removed from being an international standard). The ISO working group has been working on this project for the last 6 years and it has been subject to unusually wide review for a proposed standard. The other draft standard is the result of the work of a consortium of U.S. companies, mostly from the west coast. It is called Unicode. Both of these draft standards enable the worlds communication (newspapers and magazines) and business characters, ideographs, and symbols to be encoded for storage and communication between computers. However, each uses a different approach to making the inevitable tradeoffs. <94> TR XXXX Operational Model for Text Description and Processing Language. <95> Unicode: A Standard International Character Code for Multilingual Information Processing. Unicode is noteworthy in *not* being an ISO standard, but promising to become a major standard, more or less compatible with ISO 10646. Compare <93> above. The Unicode Consortium. The Unicode Standard: Worldwide Character Encoding, Version 1.0. Volume 1. Edited by Erica Liederman. Addison-Wesley, 1991. xx + 682 pages. ISBN 0-201- 56788-1. An "implementor's version" of the Unicode Book volume one (looseleaf in binder, same text as the bound Addison-Wesley volume, no ISBN) comes with a diskette containing chapters 4-6. It is available from the Unicode Secretariat, c/o Metaphor Computer Systems; see the address below. An earlier draft of Unicode was: Unicode 1.0. Draft Standard, Final Review Document. The Unicode Consortium, December, 1990. "Unicode is a fixed-width 16-bit multilingual character encoding scheme that efficiently and unambiguously represents the world's normal text characters for electronic information processing. Unicode was devised by a group of individuals and companies, including Apple, IBM, Metaphor, Microsoft, NeXT, Research Libraries Group (RLG), Sun, Xerox and others. "Unicode is designed to meet the need for a simple multilingual character standard in electronic information systems. The model for Unicode is the defacto use of ASCII, with its simple, unambiguous, fixed-width characters. Fixed-width characters simplify information processing in text streams, flat text files, strings, character arrays, database fields, procedure arguments and returns. They simplify text operations such as insertion and deletion, truncation, indexing the nth character, etc. Conversely, variable width, context dependent encodings or code-page switching (that are common in other multilingual standards) complicate information interchange and programming. Since ASCII's eight-bit character size is inadequate to handle multilingual text, and proposals for 32-bit standard are unnecessarily cumbersome, Unicode adopts a 16- bit architecture which extends the benefits of the ASCII to multilingual text and adds new strengths. "A 16-bit character code can uniquely specify any character in any language that is currently used in electronic information systems or is likely to be used in the foreseeable future. Moreover, since Unicode characters are consistently 16-bits wide, regardless of language, no escape sequence or control code is required to specify any character in any language. Unicode handles alphabetic and ideographic character sets simultaneously and with equal facility. "Basic (kernel) computer programs that use Unicode to represent characters but do not display or print text can often remain undisturbed when new languages or characters are introduced. Programmers seeking to adapt a Unicode conformant program to a new language environment can allocate their full energies to specifying the contents of character sorting and string handling algorithms, providing proper fonts, and developing appropriate character rendering routines (e.g., date, time, and currency formats). "Unicode is a product of the collaboration of multilingual engineers, managers, linguists, and information specialists from various corporations world-wide. A Unicode Consortium was formed in 1991 to bring Unicode to the world under the sponsorship of a multi-corporate, multi-national, non-profit organization. Membership in the consortium is open to all who support Unicode principles. (Adapted from network postings) Contacts: (1) Unicode Consortium; c/o Kenneth Whistler; Metaphor Computer Systems; 1965 Charleston Road; Mountain View, CA 94043; USA; Email (Internet): whistler@zarasun.metaphor.com; TEL: (1 415) 691-3600 OR (2) Richard McGowan; c/o NeXT Computer, Inc; 900 Chesapeake Drive; Redwood City, CA 94063; Email (Internet): McGowan@next.com; TEL: (1 415) 780-4522; FAX: (1 415) 780-3714. ================================================= SERIAL PUBLICATIONS DEDICATED TO SGML ================================================= Contents: * : The SGML Newsletter * SGML Users' Group Bulletin * SGML Users' Group Newsletter * EPSIG News CALS-SGML: * SGML Report * EuroCALS Newsletter * Barlow Report * CALS Journal * CALS Report <96> : The SGML Newsletter. This dedicated SGML publication is one of several forms of support given to SGML by the Graphic Communications Association (GCA). is GCA's premier publication organ covering SGML, published jointly by the GCA and SGML Associates, Inc. as "The Technical Journal of the SGML Community." It covers CALS SGML, SGML events and conferences, implementation case studies, SGML tutorials, product news, interviews and other topics. Current editors (1991) are Sharon C. Adler, Marion Elledge, Charles Goldfarb, Yuri Rubinsky and Ludo Van Vooren. Current subscription prices (December 1990) are annual 75 dollars US for GCA members, 150 dollars for non- members in the US and Canada, 200 dollars for foreign subscribers, and 125 dollars for foreign GCA Members. Other special offers are available with orders for back issues. Six issues per year. See entry <105> below for addresses and further information on the GCA. <97> SGML Users' Group Bulletin and <98> SGML Users' Group Newsletter The SGML Users' Group Bulletin (ISSN: 0269-2538) and SGML Users' Group Newsletter (ISSN: 0952-8008) are the primary publications sponsored by the International SGML Users' Group. Both publications are included as benefits of membership in the SGML Users' Group. The Newsletter appears as an 'occasional' publication (about quarterly) with approximately 20 pages per issue; the Bulletin is a more substantial, refereed publication that appears about semi-annually. Joan Smith served as the first editor of the SGMLUG Newsletter, for about 4 years. Pamela Gennusa (Datalogics/Database Publishing Systems, Ltd.) is the current Newsletter editor, with Joan Smith and Yuri Rubinsky serving as corresponding editors. Editorial responsibility for the Bulletin changes frequently. See entry <106> below for addresses and further information on the International SGML Users' Group. <99> EPSIG News (ISSN 1042-3737). EPSIG publishes the quarterly newsletter EPSIG News in support of the ANSI/NISO manuscript standard Z39.59-1988, and more generally in support of SGML. EPSIG News is mailed to members (annual dues 200 US dollars) and subscribers (annual dues 50 US dollars). Articles carried in EPSIG News (especially press reports and major product announcements) are also sometimes printed in and/or the SGMLUG Newsletter. EPSIG News accepts advertising for SGML services and products, as well as classified advertising. See entry <107> below for addresses and further information on EPSIG. <100> SGML Report. Occasional newsletter covering SGML. Written by Pat Byrne, edited by Steve Goodyear. Complimentary subscriptions available upon request. Contact: AGFA CAPS; 200 Ballardvale Street; Wilmington, MA 01887; (1 508) 658-5600. <101> EuroCALS Newsletter. The EuroCALS Newsletter is a publication of the CALS in Europe Special Interest Group, a SIG of the International SGML Users' Group. ISSN: 0958-2711. Contact: David Ardron, Secretary, CALS in Europe SIG; Ferranti Computer Systems Ltd,; Western Road, Bracknell, Berkshire RG12 1RA; UNITED KINGDOM; TEL: +44-344-483232; FAX: 44-344-54639; TELEX: 848117 FERBKL G. Subscription rate (1991): 20 pounds UK. <102> Barlow Report. Occasional (quarterly) newsletter covering CALS and CALS-SGML. Written by Bob Barlow, edited by Steve Goodyear. Complimentary subscriptions available upon request. Contact: AGFA CAPS; 200 Ballardvale Street; Wilmington, MA 01887; (1 508) 658-5600. <103> CALS Journal. Quarterly publication for the CALS community, new in 1992. The editorial advisory board includes David Bettwy, Howard M. Bloom, Judith A. Fincher, Rear Adm. Roland G. Freeman III, Pamela Gennusa, Frank Gilbane, James E. Giles III, David Gunning, Donald Hall, Terry R. Savage, H. B. Stormfeltz. Contact: CALS Journal; 14407 Big Basin Way; Saratoga, CA 95070- 6008; USA. TEL: (1 408) 867-7900; FAX: (1 408) 867-9800. Subscription rate: 48 US dollars annual. <104> CALS Report: Design, Implementation and Support Data for Concurrent Engineering and ILS. Monthly report on CALS, including CALS-SGML issues. ISSN: 0897-991X. Edited by William G. Beazley. Contact: CALS Report; Knowledge Base International; 13939 Northwest Freeway, Suite 270; Houston, TX 77040; USA. TEL: (1 713) 690-7644. Subscription rate: 250 US dollars annual. ====================================================================== SGML SUPPORTING ORGANIZATIONS AND SPECIAL INTEREST GROUPS (SIGS) ====================================================================== Contents: * Graphic Communications Association (GCA) * SGML Users' Group (SGMLUG) * Electronic Publishing Special Interest Group (EPSIG) * SGML DSIG/DBSIG (Database Special Interest Group) * SGML SIGhyper (Hypertext and Multimedia SIG) * European Workgroup on SGML (EWS) * CALS In Europe Special Interest Group <105> Graphic Communications Association (GCA) The GCA is a national affiliate of Printing Industries of America, and is a leading SGML sponsor. Historically, GCA executives have played strategic leadership roles in promoting the development of the SGML standard; cf. <36>. Marion Elledge is the current Director of Informatin Technologies; the GCA's current Board of Directors (Information and Technologies track) in 1990 includes David Mayer (Autotrol Technologies), Jon Slangerup (also Board Secretary; Federal Express) and Yuri Rubinsky (President, SoftQuad). GCA sponsors numerous conferences, tutorials, workshops and seminars on SGML, CALS and other technologies related to electronic publishing and information processing. GCA's SGML events are announced in direct mailings: its dedicated SGML newsletter , in the occasional newsletter TECHInfo: News for the Information Management Community, and in a quarterly magazine Perspectives. GCA events are also covered in each issue of the SGML Users' Group Newsletter, and in EPSIG News. The GCA distributes ISO documents and other SGML-related publications, with preferential rates for GCA members. Included are CALS specifications packages and SGML "handbooks" by Martin Bryan, Eric van Herwijnen, Joan Smith and Charles Goldfarb. Also available is the GCA's publication The SGML Source Guide. The Graphic Communications Association's Guide to Standard Generalized Markup Language (SGML) Systems, Software, Service, Consultants, Seminars and Resources. Edited by Marion Elledge. ISBN 0-933505-13-2. See <134>. Address: Graphic Communications Association; Attention: Marion Elledge (Director, Information Technologies); 100 Daingerfield Road; Alexandria, VA 22314 USA; TEL: (703) 519-8160; FAX (703) 548-2867; TELEX: 510-600-0889. <106> SGML User's Group (SGMLUG) The International SGML Users' Group was founded in 1984 by Joan M. Smith, who served as its first president until 1990; the current president editor is Pamela Gennusa of Datalogics and MID/Information Logistics Group Ltd. The objectives of the group are to promote the use of the Standard Generalized Markup Language and to provide a forum for exchange of information about SGML. The Constitution of the SGML Users' Group is published on pp. 7-8 of the SGML Users' Group Newsletter 6 (November 1987); see also "SGML Users' Group Constitution," SGML Users' Group Bulletin 1/1 (1986) 5-7; "Standard Generalized Markup Language. Users' Group," Literary and Linguistic Computing 3/1 (1988) 54. Executive Council members for 1991 are as follows. Officers: Pamela Gennusa (President), William W. Davis Jr. (Vice President), Stephen G. Downie (Secretary), Francis J. Cave (Treasurer); Members: Charles F. Goldfarb (Honarary Technical Consultant), David W. Penfold (Bulletin Editor); Elected Members: Sharon Adler, Marion Elledge, Paul Ellison, James D. Mason, Lynne Price , Dieke van Wijnen. For an address list of elected members, national chapters, SIGs and pending chapters of the SGMLUG, see the SGML Users' Group Newsletter 20 (September 1991) 9-11. The Annual General Meeting (AGM) of the Users' Group is held in conjunction with the International Markup Conference sponsored by the Graphic Communications Association (GCA). Regional and national chapters and SIGs also meet occasionally throughout the year. There are now several national chapters of SGMLUG (e.g., Dutch, German, Japanese, Norwegian, Swiss, Canadian, New York) and several SGMLUG SIGs (special interest groups, e.g., SGML Database SIG; SGML SIGhyper - Hypertext and Multimedia; European Workshop on SGML). Some of these groups have dedicated serial publications in paper and electronic format. Membership in the SGMLUG includes up-to-date information about SGML and related standards, reduced rates at events sponsored by the The SGML Users' Group, a copy of each issue of the SGML Users' Group Newsletter and SGML Users' Group Bulletin, discounts on books central to SGML, and 10-percent discount for the MarkUp Conferences. The secretary of the SGML Users' Group maintains for members a bibliography of other SGML-related publications in a document "SGML Library," and will supply copies of these documents to group members. These volumes include ISO standards documents and other SGML-related publications available to members at discounted rates include: Address: Mr. Stephen G. Downie; SGML Users' Group, Secretary; c/o SoftQuad Inc.; 56 Aberfoyle Crescent, Suite 810; Toronto, Ontario; Canada M8X 2W4; TEL: +1 416 239 4801; FAX: +1 416 239 7105. <107> Electronic Publishing Special Interest Group (EPSIG) EPSIG (Electronic Publication Special Interest Group) is collaboration between the AAP (with 250 member firms in the US) and OCLC (Online Computer Library Center), serving 11,000 libraries. EPSIG was invited by the AAP to carry out development and promotion of the EPSIG/AAP "Electronic Manuscript Standard" for the preparation, publication and interchange of electronic manuscripts; see <53> above. The primary goals of EPSIG are to: (1) Promote the adoption and proliferation of the Electronic Manuscript Standard; (2) Organize and present tutorials and technical programs; (3) Provide an information clearinghouse for documents pertaining to the Standard, EPSIG, and electronic publishing; (4) Provide electronic mail to EPSIG members; (5) Provide a toll-free helpline to provide advice and assistance with the Standard; (6) Publish a quarterly newsletter; (7) Publish and sell current and future EPSIG manuals related to the Standard; (8) Coordinate standards input from members as revisions to the Standard are required. As of late 1991, electronic discussion lists were set up for committee work on the Math and Tables specifications in the EMS. The final versions of this committee work will be submitted to be balloted as an amendment to ANSI/NISO Z39.59-1988. For the committee work on Math, contact William B. Wolff of the American Mathematical Society (Email: wbw@math.ams.com); for Tables, contact Paul Grosso of ArborText (pbg@arbortext.com). See further in Paul Grosso's note, "First Meeting of Math/Tables Update Committee Held," EPSIG News 4/4 (December 1991) 1-2. EPSIG's newsletter EPSIG News is published quarterly (ISSN: 1042- 3737), and receipt of EPSIG News is a benefit of subscriber status. EPSIG members receive discounts on SoftQuad and Software Exoterica software products. EPSIG Members also receive discounts at SGML conferences sponsored by the GCA. EPSIG promotes the EMS (American National Standard for Electronic Manuscript Preparation and Markup. (ANSI/NISO Z39.59-1988)) as well as several Guides and instructional materials pertinent to electronic document preparation and data interchange; see <54>, <55>, <56>, <57>. Address: EPSIG (Electronic Publishing Special Interest Group); Attention: Betsy Kiser, EPSIG Manager, MC 278; c/o OCLC; 6565 Frantz Road; Dublin, OH 43017-0702 USA; TEL: (614) 764-6195; FAX: (614) 764- 6096. <108> SGML Database Special Interest Group (DSIG or DBSIG) The SGML DBSIG is an international effort loosely organized under the SGML Users' Group and its national chapters for the development of SGML(document) database strategies. Contacts: Mr. Hans Mabelis; SGML DBSIG, Secretary; c/o Matrices Software; Westeinde 14; 1017 ZP Amsterdam; THE NETHERLANDS; TEL: +31- 20-255-006; FAX: +31-20-247-948; Han Schouten, SGML DSIG; Research Center for Technical and Physical Engineering in Agriculture (TFDL); Mansholtlaan 12; 6700 AJ Wageningen; THE NETHERLANDS; TEL: +31-8370- 19143; FAX: +31-8370-11312. See: (1) Han Schouten, "SGML*CASE: The Storage of Documents in Databases," SGML Users' Group Bulletin 4/1 (1989) 1-14; (2) Han Schouten, "Draft Tender re: Documents in Databases," SGML Users' Group Newsletter 15 (January 1990) 12-14. A major draft proposal for SGML DSIG sponsored development of a prototype document processing environment in which documents are stored as databases. The environment would support SGML, but also other SGML-related standards like DSSSL -- "as an alternative for the sequential access strategy characteristic of standard SGML." Details on the objectives, tasks, funding, deliverables, rights and duties of participants, project management, (etc.) are described. Proposed tasks include specification of a gross system architecture, definition of modelling techniques, building and verifying semantic equivalence of all models with SGML and DSSSL, facilities for loading SGML DTDs, facilities to unload DTDs without loss of information, creation of a DTD editor, creation of a structured document editor, building of retrieval facilities, building a document formatter. <109> SGML SIGhyper The SGML Users' Group Special Interest Group on Hypertext and Multimedia (SGML SIGhyper) is different from all other groups whose focus is hypertext and multimedia theory, technology, and creativity. The sole interest of SGML SIGhyper is the promulgation of information about the "HyTime" Hypermedia/Time-based Structuring Language (ISO/IEC Draft International Standard 10744), as a worldwide standard technical framework for integrated open hypermedia. As an application of SGML, Hytime is a "standard neutral markup language for representing hypertext, multimedia, hypermedia, and time- and space-based documents in terms of their logical structure." See further item <72>, "ISO/IEC DIS 10744" above. The organizers of SIGhyper hope that the publishing, documentation, entertainment, education, and information processing industries will evaluate HyTime as an appropriate standard toward which they all can migrate in an orderly and profitable fashion. See: (1) Steven R. Newcomb (et al.), "'HyTime': The Hypermedia/Time-based Document Structuring Language," CACM 34/11 (November 1991) 67-83; (2) on SMDL (Standard Music Description Language), an application of HyTime, thus similarly based upon SGML: Steven R. Newcomb, "Standard Music Description Language Complies with Hypermedia Standard," IEEE Computer 24/7 (July 1991) 76-79. The current (1991) chairman of SIGhyper is: Steven R. Newcomb; c/o TechnoTeacher, Inc.; 1810 High Road; Tallahassee, FL 32303-4408; USA; TEL: +1 904 422 3574; FAX: +1 904 386 2562; Internet: srn@cmr.fsu.edu. The current (1991) vice chairman and maintainer of the SIGhyper on- line library is: Erik Naggum; c/o Naggum Software; Box 1570 Vika; 0118; OSLO, NORWAY; Internet: erik@naggum.no OR enag@ifi.uio.no. Current SIGhyper documents available for public access from the online library are on the following hosts: ftp.ifi.uio.no:/SIGhyper and at mailer.cc.fsu.edu:/pub/sgml. <110> European Workgroup on SGML (EWS) The EWS is a cooperative effort by publishers, database developers, typesetters and other representatives from commercial and academic sectors to produce standard DTDs for scientific articles. The DTDs are to be used to facilitate document interchange between researchers, and between scholars and publishers. The Workgroup is loosely associated with the International SGML Users' Group. For specific details on the published MAJOUR DTDs and current work, see below entry <132> subsection (b). Questions regarding the full activities of the EWS may be directed to Holger Wendt or Dieke van Wijnen. Addresses: Holger Wendt; Springer Verlag GmbH & Co. KG; Postfach 105280; Tiergartenstrasse 17; D-6900 Heidelberg 1; GERMANY; TEL: +49-6221-487-324; FAX: +49-6221-43982; Email (BITNET) WENDT@DHDSPRI6; Dieke van Wijnen; New Media Department; Wolters Kluwer Academic Publishers; Spuiboulevard 50, Postbus 989; 3300 AZ Dordrecht; THE NETHERLANDS; TEL: +31-78-334-264; FAX: 31-78-334-254. See further in: (1) "Report on the 5th MAJOUR- body meeting of the European Workshop on SGML (EWS) at CERN, Geneva, on 30-Sept-1991," (SGML PROJECT REPORT NO 5) by Paul Ellison, Director-SGML Project, UNIVERSITY OF EXETER COMPUTER UNIT, 14 November 1991; (2) "European Workshop on SGML," SGML Users' Group Newsletter 20 (September 1991) 22; (3) "European Workgroup on SGML Plan to Become SIG," SGML Users' Group Newsletter 19 (April 1991) 9; (4) "European Workgroup to Publish Draft DTD for Scientific Articles," SGML Users' Group Newsletter 18 (November 1990) 13. <111> CALS In Europe Special Interest Group This SIG under the aegis of the International SGML Users' Group was formed in 1989 under the leadership of Joan Smith. See her article in the inaugural issue of the main publication organ: "The Proposal to Form a SIG for CALS in Europe," EuroCALS Newsletter 1 (September 1989) 2. The objectives of the SIG are to acquaint all interested parties with the requirements of the US DoD CALS program, and to provide assistance in meeting those requirements. Membership in the SIG includes copies of the Newsletter, reduced registration fee rates for events sponsored by the SIG, and discounted rates for various other CALS/SGML conferences. Contact: David Ardron, Secretary, CALS in Europe SIG; Ferranti Computer Systems Ltd,; Western Road, Bracknell, Berkshire RG12 1RA; UNITED KINGDOM; TEL: +44-344-483232; FAX: 44-344-54639; TELEX: 848117 FERBKL G. Subscription rate (1991): 20 pounds UK. ===================================================================== SGML ELECTRONIC DISCUSSION FORUMS AND ONLINE SUPPORT CENTERS ===================================================================== Contents: * Usenet Newsgroup: comp.text.sgml * SGML Project, University of Exeter * TEI-L Discussion Forum * SGML-L Mailing List * MARKUP-L Discussion Forum * CALS-BBS <112> Usenet Newsgroup: comp.text.sgml The Usenet Newsgroup comp.text.sgml is a dedicated forum for discussion of SGML and associated standards (ODA, SDIF, DSSSL, SPDL, etc.). It began in Fall 1990 thanks to the vision and energy of Edward Vielmetti (University of Michigan Math Department, Ann Arbor, MI; Internet: emv@math.lsa.umich.edu). As of late 1991, comp.text.sgml hosted the most concentrated and high-quality electronic discussion of SGML on any academic network. The discussion group has support from a number of experts within the SGML community (e.g., contributions from SoftQuad; Software Exoterica; Electronic Book Technologies; Open Text Systems; InfoDesign; Naggum Software; ArborText; Avalanche Corporation). An introductory FAQ file on SGML ("Frequently Asked Questions") written for comp.text.sgml is noted under section 1 above; cf. entry <29>. The discussion forum is accessible from any VAX, UNIX or other site which receives Usenet News, viz, from among any of the 40,000 Usenet sites worldwide. Software for reading News on a local system varies greatly; see your local computer gurus. Various sites archive the posting from comp.text.sgml in spool and public archive areas, and sometimes with indexes. The ARCHIE servers may be used to locate such sites. As of late 1991, some hosts were listed as follows: Host mcsun.eu.net (192.16.202.1) Location: /newsarchive/comp/text/sgml DIRECTORY rwxrwxr-x 2048 Dec 4 23:53 sgml Host aix370.rrz.uni-koeln.de (134.95.136.1) Location: /usenet/comp.archives/text/sgml Location: /usenet/comp.archives/auto/comp.text.sgml Host utsun.s.u-tokyo.ac.jp (133.11.11.11) Location: /misc DIRECTORY rwxrwxr-x 512 Oct 31 11:51 sgml Host src.doc.ic.ac.uk (146.169.3.7) Location: /usenet/comp.archives/text/sgml Location: /usenet/comp.archives/auto/comp.text.sgml Location: /.zapme/comp.archives/auto/comp.text.sgml Host rusmv1.rus.uni-stuttgart.de (129.69.1.12) Location: /soft/kommunikation/news/spool/comp/text/sgml Host cs.dal.ca (129.173.4.5) Location: /pub/comp.archives/comp.text.sgml Host ftp.syd.dit.csiro.au (130.155.128.3) Location: anonymous:[.comp_text_sgml] <113> SGML Project, University of Exeter In November, 1990, a two-year project was awarded by the UK Information Systems Committee (formerly the Computer Board for Universities and Research Councils) to the University of Exeter to evaluate SGML products for use in UK Universities and Research Council establishments. The project staff are now located within the University's Computer Unit and the project is directed by Paul Ellison, a member of the relevant ISO working committee and a long- time proponent of SGML. The aims of the project are: (1) to investigate commercial products and review them for possible use in the UK Academia; (2) to investigate the current use of SGML within and without Academia; (3) to assess possible requirements for SGML systems in UK Academia; (4) to investigate the required utilities (e.g., editors, translators, formatters) and make recommendations concerning possible acquisition; (5) to define, in consultation with academic users, a vocabulary of element and entity names and develop general Document Type Definitions; (6) to maintain a library of DTDs; (7) to function as a center for information on the use of SGML; (8) to cooperate with AGOCG (the Advisory Board on Computer Graphics) in increasing the awareness of SGML in Academia. The project was first proposed within the context of an AGOCG sponsored workshop on the use of SGML in UK universities, organized by Anne Mumford of Loughborough University. SGML was one of the standards chosen by the AGOCG for structuring and distribution of university-related information containing graphics (research documents, teaching aids, view graphs). A set of recommendations issued to the AGOCG and the UK academic community called for such an initiative as Paul Ellison now directs (information supplied by Anne Mumford and Paul Ellison). The SGML Project's host machine will be fully addressable for anonymous-FTP from 01/01/92. The machine's address will be: sgml1.ex.ac.uk (144.173.6.61) INTERNET uk.ac.exeter.sgml1 JANET (N.B. the machine name sgml1 = "sgml_one") A library of DTDs, tools etc. is to be created, and details of this service will be widely publicized. Any donations (of DTDs etc.), or suggestions as to what this library should contain, would be most welcome. The SGML Users' Group Parser Materials (the ARCSGML v1.0 parser toolkit, VM2 etc), are currently held on the SGML Project's host machine. Until 01/01/92 these are only available over JANET, via an e-mail request (contact the Project for more details on both), or via anonymous-FTP from other sites; see entry <118> below for a list of anon-FTP sites hosting the ARC-SGML tools. For academic and research sites within the UK, the SGML Project has been given funding to make visits to give lectures, to run workshops, or give advice on SGML implementation and use. This service is entirely free to such sites. The SGML Project has published notices in all the university and research institution site newsletters in the UK. Currently it has a mailing-list of 200+ contact names, and each will automatically receive any updates, reports and notices produced by the Project. Additional contact names are always welcome (any country). To contact the SGML Project: Michael Popham; SGML Project; Computer Unit - Laver Building; North Park Road; University of Exeter; Exeter EX4 4QE; UNITED KINGDOM; TEL: +44 392 263946; TEL: +44 392 263939; FAX: +44 392 211630; Email: (JANET): sgml@exeter.ac.uk (Office); (JANET): M.G.Popham@exeter.ac.uk (Project Officer); Paul Ellison on (JANET): P.A.Ellison@exeter.ac.uk (Project Director). Further references: (1) "The SGML Project." SGML Users' Group Newsletter 18 (November 1990) 13; (2) Advisory Group on Computer Graphics. Document Exchange: The Use of SGML in the UK Academic and Research Community. Workshop Proceedings 5-7 March 1990. Edited by Anne Mumford. See <28>: the volume's front matter contains the recommendations proposed for the 'Exeter' SGML group. <114> TEI-L Discussion Forum (on BITNET) TEI-L is the electronic discussion forum for the international Text Encoding Initiative, implementing SGML for markup of texts in academic applications, particularly the humanities. The list owners are Michael Sperberg-McQueen and Lou Burnard, TEI editors. TEI-L is the primary electronic forum for the TEI, normally open for subscription to all who request subscriber status; specialized forums hosted on the same computer (TEI-REP, TEI-ANA, TEI-META, TEI-DOC) are normally closed lists used by TEI subcomittees and working groups. While the TEI-L forum is not designated for discussing the theoretical and practical merits of SGML per se, the TEI encoding is an implementation of SGML, and thus some theoretical and practical discussions of SGML will be heard. Many SGML-related working papers and provisional DTDs are available in conjunction with the archived TEI-L discussions. Send email or postal mail to Wendy Plotkin requesting document TEI A0, "Current Documents," which lists some 120 working papers, committee reports and minutes records -- many of which address TEI-SGML issues. Address: Wendy Plotkin, Email (Internet): u49127@uicvm.uic.edu, OR u49127@uicvm.cc.uic.edu OR u49127@UICVM.BITNET; Postal: Wendy Plotkin, TEI-Rm 168; UIC Computer Center; Computer Center (M/C 135); University of Illinois at Chicago; Box 6998; Chicago, IL 60680. If you wish to subscribe to the TEI-L electronic forum (there are no subscription fees), send an interactive BITNET message, or (from a BITNET site, or from one of the other public networks), send standard mail to the BITNET address listserv@uicvm with the single line as the first line (any subject line will be ignored): subscribe tei-l YOUR NAME If you wish to obtain archived files from this discussion, send interactive BITNET message or mail message to the BITNET address listserv@uicvm with one or more lines patterned after this example (for October, 1991 archive): get tei-l log9110 <115> SGML-L Mailing List (on BITNET) BITNET: SGML-L@DHDURZ1 SGML-L was only minimally active through most of calendar 1991, but showed some signs of vitality late in 1991. The List Owners are Joachim Lammarsch (X92@DHDURZ1.BITNET or X33@DHDURZ1) and Mr. Holger Wendt (WENDT@DHDSPRI6). Mr. Wendt serves as the SGML-LIST editor. To subscribe to the mailing list, send an interactive BITNET message or mail message (from any network) to the BITNET address listserv@dhdurz1 with the command: subscribe SGML-L YOUR NAME substituting your personal name for "YOUR NAME." <116> MARKUP-L (GLDV-AK fuer TEI-Guideline-Anpassung) The objectives of the Goettingen-based GLDV-AK fuer TEI-Guideline- Anpassung and associated MARKUP-L discussion list are given in a poster submitted by Guenter Koch, 8-October-1991, "Mitgliederbrief und Protokoll der konstituierenden Sitzung des GLDV-Arbeitskreises: 'Kodierung und Normung maschinenlesbarer Texte,' Universitaet Trier am 27.9.1991, 14.-15.30 Uhr." These objectives relate primarily to the evaluation and use of TEI (and SGML) encoding standards within the German research comunity, but the discussions and resources will be of broader interest, covering such areas as TEI applied to machine- readable lexica, hypermedia, and non-Roman languages (e.g., Sanskrit). The role of the discussion group in particular is described as follows: "Um eine schnelle und kostenguenstige Kommunikation zwischen den Teilnehmern des Arbeitskreises zu ermoeglichen, haben wir auf dem Listserver LISTSERV@DGOGWDG1.BITNET eine Liste mit dem Namen MARKUP-L eingerichtet. Sie soll dazu dienen, Fragen bei der Arbeit mit SGML und den TEI-Richtlinien zu diskutieren, Erfahrungen mit Programmen auszutauschen und Einschaetzungen einschlaegiger Buecher weiterzugeben. Auch arbeitskreisinterne, organisatorische Nachrichten, z.B. wo und wann ein naechstes Arbeitskreistreffen stattfindet, sollen ueber die Liste verbreitet werden. Des weiteren enthaelt die Liste eine Reihe Dateien, die von den TEI-L und anderen fachlich benachbarten Kommunikationslisten stammen und nuetzliches Arbeitsmaterial bieten, so die TEI DTDs, Bibliographien zu SGML, zum Thema Hypertext und SGML etc." The two list owners are Guenter Koch, GWD Goettingen (BITNET: GKOCH@DGOGWDG1) and Peter Scherber, GWD Goettingen (BITNET: PSCHERB@DGOGWDG1). The LISTSERVer may be addressed via its BITNET address (LISTSERV@DGOGWDG1) or by the Internet address (listserv@ibm.gwdg.de) using standard LISTSERVer commands: sub markup-l , index markup-l, review markup-l, get , etc. The LISTSERVer currently (December 1991) contains general information files on TEI, SGML (bibliography), DTDs from TEI phase 1 work, and discussion logs. <117> CALS-BBS The primary CALS(-SGML) documents are alluded to above; see <61>. Several of these CALS-specific SGML documents and related information files are available via telephone dialup or Internet anonymous-FTP. Of special interest are MIL-M-28001 SGML and MIL-M-28001A SGML (Markup Requirements and Generic Style Specification for Electronic Printed Output and Exchange of Text), held in ASCII file format and containing relevant parsable DTDs. The file server also contains electronic information and/or full text for: (1) MIL-HDBK-59A; (2) MIL-STD-1840A "Automated Interchange of Technical Information"; (3) MIL-D-28000 "Initial Graphics Exchange Specification (IGES); (4) MIL-R-28002 Group 4 Raster; (5) MIL-D-28003 Computer Graphics Metafile (CGM). Amendments for some of these items are also on the server. Direct access to the files (but without BBS interface) is via anonymous-FTP: FTP to durer.cme.nist.gov (129.6.32.4) and 'cd' to pub/cals. The subdirectories are: 1840a; 28000; 28000a; 28001; 28002; 28003; 28003a; 59a; arc; autocom; brochure; bulletin; cals-net; cals- rpt; citis; ctn; expo91; ietm; isg; misc; scenario -- where content is readily discernible. Files are held in ASCII, WordPerfect and other binary formats as appropriate. In the subdirectory 28001, for example, one would find the following files: 28001a.exe; 38784-v2.exe; 38784pid.exe; app-a10-.exe; app-a50.exe; app-a70a.exe; app-a70b.exe; app-a70c.exe; app-a80.exe; app-b10-.exe; app-b50.exe; app-b70.exe; app-b80.exe; app-b90.exe; app-c10-.exe; app-d10-.exe; app-d50.exe; dtds.exe; m28001a.exe; os-pid.exe; pid-fos.exe; read.me; temp-fos.exe; temp-pid.exe. The files '.exe' are self-extracting compressed files that may be unpacked on an MS-DOS machine simply by typing the . Direct dialup for the BBS is supported for phone via (1 301) 948-8966 (a four line rotary); or (via Internet) TELNET to 129.6.32.173 and follow the login instructions. The BBS Main Menu contains seven areas: 1 - Meetings and Conferences; 2 - Special Announcements; 3 - CALS Information Sources; 4 - Standards News; 5 - Technology News; 6 - BBS Operation and Software; 7 - SME & SOLE Bulletins. CALS-MIL-STD documents are online, available for downloading: Directory 1. - CALS MIL-SPECS, MIL-STDS, & MIL-HDBKS; Directory 2. - DOD Publications About CALS Program; Directory 3. - Standards Activities Information; Directory 4. - Miscellaneous Information. Also included in the BBS support is a file on CALS-Related Software Packages, maintained by the Tech Transfer Committee of the DOD/ISG Small Business Task Group. For questions about the CALS-BBS, contact Dave Bettwy via email, the CALS BBS message system, or by telephone at (1 301) 975-6641. ============================================================ ENTRY-LEVEL SGML PARSERS AND RELATED SGML SOFTWARE ============================================================ INTRODUCTION: No opinions, judgments, recommendations or warranties are offered with respect to the SGML products noted below. The selection reflects a simple, minimal goal of pointing the interested beginner to some low-cost, entry-level (PC and Macintosh microcomputers) SGML tools, and to public domain sources which may be freely used or modified in experimentation. Special place is given to ARC-SGML and derivative materials, since they have generously been placed (via liberal license terms) into 'public domain.' Anyone wishing to evaluate SGML software for commercial or large-scale production purposes should follow normal industry standards in making such evaluation. Three published resources described under section 9 below may be of assistance: (1) The SGML Source Guide <134>, edited for the GCA by Marion Elledge, supplies summary information on SGML products and services; (2) Bibliography on SGML (Standard Generalized Markup Language) and Related Issues <133>, Cover/Duncan/Barnard, contains 117 entries on SGML products, services, and development projects; (3) SGML Products and Services <135>, compiled for the CALS in Europe SIG by Joan Smith, covers CALS-SGML software. A technical report prepared by Nicholas Duncan for the TEI summarized major companies/products in the SGML arena: Nicholas Duncan, "Companies Producing SGML-Related Products," TEI Syntax and Metalanguage Committee, Technical Report TEI MLP28 (October 2, 1990); the document is available from Wendy Plotkin; see <114>. Other primary resources are the seasonal and annual trade shows, SGML workshops, (GCA) TechDoc conferences, (GCA) MarkUp and SGML Conferences and other electronic publishing events. Printed resources include the trade publications and dedicated SGML newsletters, which regularly feature new SGML software products: CALS Report, SGML Report, SGML Users' Group Newsletter ("New Products and Services" section), SGML Users' Group Bulletin, CALS Journal, EuroCALS Newsletter, , EPSIG News, Seybold Report on Publishing Systems, Release 1.0, etc. Included in the online CALS-BBS support is a file on CALS-Related Software Packages, maintained by the Tech Transfer Committee of the DOD/ISG Small Business Task Group: anon-FTP to durer.cme.nist.gov (129.6.32.4), and get the file pub/cals/bulletin/bullet51. See more at <117>. A large proportion of serious (industry, government, academic) SGML work is carried out on workstations, mini- and mainframe computers. These SGML-aware editors, transducers, translators and other facilities are numerous, and in general could not be evaluated here. For a summary of some SGML-aware editors, see: Dale Waldt, "Overview of SGML-Smart Text Editors," 17 (December 1990) 12-15; he reviews IBM TextWrite; Datalogics WriterStation; SoftQuad Author/Editor; Yard Software Write-It; Software Exoterica CheckMark). The parsers and other software resources noted below are these: * ARC-SGML -Original ARC-SGML 'VM2' Release for MS-DOS, with C-Sources and Documentation by Charles Goldfarb -UNIX port of ARC-SGML, by James Clark -'SGMLS' version of ARC-SGML, by James Clark -Macintosh port of ARC-SGML, by Wally Wedel * Amsterdam (Free University) Parser * Software Exoterica (XGML Validator; XGML Normalizer, XGML Translator, XGML OmniMark) * SEMA Group/Yard Software Systems Limited (Mark-It Light) * SoftQuad (Author/Editor) * E2S (EASE -- E2S Advanced SGML Editor) * Electronic Book Technologies (MS-Windows) DynaText SGML Browse/Search * Avalanche Development Company (FastTag, IMSYS) - Intelligent Autotagging * qwertz/FORMAT -- SGML to LaTeX (and nroff/troff) Translator * NIST "SGML parser materials" * Public DTD Collections <118> ARC-SGML <119> Original ARC-SGML 'VM' Release for MS-DOS, with C-Sources and Documentation by Charles Goldfarb In July 1991, a significant set of SGML Parser Materials was made available via the SGML Users Group (SGMLUG). The materials consist of source code (in C), executable binary (for PCs), examples, and documentation. The SGMLUG has been granted "an irrevocable, royalty- free, worldwide, non-exclusive license to use, execute, reproduce, display, perform and distribute copies of, and to prepare derivative works based upon these materials; and the right to authorize others to do any of the foregoing". The grantor of the license to the SGMLUG, who has chosen to remain anonymous, does not warrant the Parser Materials in any way. Dr. Charles Goldfarb, the Honorary Technical Consultant to the SGMLUG, has created some documentation to introduce the materials. The SGML Project at the University of Exeter has agreed to make the Parser Materials and the associated README file available on the academic network, and to act as a collector of enhancements and changes to them. Reviews of the parser appeared on the Usenet Newsgroup comp.text.sgml in July-August 1991 by François Chahuneau, AIS/Berger Levrault; Michael Sperberg-McQueen University of Illinois at Chicago; David G. Durand (dgd@cs.bu.edu) Boston University Computer Science. The Exeter group and David Durand are keeping lists of bug reports, modifications, etc. If you are in the UK and have access to JANET, you may use blue book file transfer to obtain the files from uk.ac.exeter.sgml1; use the following address and protocol: uk.ac.exeter.sgml1 username:sgmlbox, password:sgmlbox. If you have any problems contact sgml@uk.ac.exeter, or phone (+44) 392 263946, or fax (+44) 392 211630. Exeter plans to have anonymous-FTP support by the end of calendar 1991. The summary above was adapted from a public posting by Paul Ellison to TEI-L, 18-July-1991. Fuller description may be found in the associated README files on the relevant file servers, and in "SGML Parser Materials Donated to [the] SGML Users' Group," SGML Users' Group Newsletter 20 (September 1991) 3-4. The ARC-SGML parser materials are available via anonymous FTP from several sites, typically under a directory ARC-SGML: Host sgml1.ex.ac.uk (144.173.6.61) Or uk.ac.exeter.sgml1 on JANET - see <113> above Location /sgmlbox Host mailer.cc.fsu.edu (128.186.6.103) Location: /pub/sgml/ARC-SGML Host ftp.ifi.uio.no (129.240.88.1) Location: /pub/SIGhyper/ARC-SGML/distrib Host src.doc.ic.ac.uk (146.169.3.7) Location: /text/sgml DIRECTORY rwxr-xr-x 512 Dec 13 05:03 ARC-SGML <120> UNIX Port of ARC-SGML, by James Clark The sources included in ARC-SGML have been modified to facilitate compiling under UNIX (James Clark); the sources and patches are available on several anonymous-FTP hosts. The filename is typically: arcsgml-1.0jclark.tar.Z or something similar. Host mailer.cc.fsu.edu (128.186.6.103) Location: /pub/sgml/ARC-SGML.UNIX FILE rw-r--r-- 289727 Aug 5 11:19 arcsgml-1.0jclark.tar.Z Host src.doc.ic.ac.uk (146.169.3.7) Location: /text/sgml/ARC-SGML.UNIX FILE rw-r--r-- 289727 Aug 5 11:19 arcsgml-1.0jclark.tar.Z Host nic.funet.fi (128.214.6.100) Location: /pub/languages/sgml FILE rw-rw-r-- 289727 Aug 4 16:17 arcsgml-1.0jclark.tar.Z Host ftp.ifi.uio.no (129.240.88.1) <121> 'SGMLS' Version of ARC-SGML, by James Clark "A beta-test version of sgmls, a SGML parser derived from the ARCSGML parser materials, is now available for anonymous ftp. . . Sgmls outputs a simple, line oriented, ASCII representation of a SGML document's Element Structure Information Set, which can easily be parsed by awk, perl, C or whatever. The idea is that sgmls can be used as the front end for a structure-controlled SGML application. It is primarily intended for Unix systems, but it works also on MS- DOS. I've tested it on the following architecture/OS/compiler combinations: sparc/SunOS 4.1.1/cc, sparc/SunOS 4.1.1/gcc, 386/SVR3.2/cc, 286/MS-DOS/Borland C++ 2.0. It should be straightforward to port to most systems that have 8-bit bytes and a character set consistent with ISO 646 IRV. (The beta version of sgmls (arguably) contains minor bugs which prevent it from being fully conforming, but the author "aims to make sgmls fully conforming to ISO 8879.") Adapted from a posting by James Clark: jjc@jclark.com (James Clark) to Usenet Newsgroup comp.text.sgml; Subject: sgmls available; Date: 28- Oct-91. Provisionally (December 1991) 'sgmls' is available on several anon-FTP hosts: Host ftp.ifi.uio.no (129.240.88.1) Location: /SIGhyper/ARC-SGML/jclark FILE r--r--r-- 757760 Oct 28 18:23 sgmls-0.3.tar Host mailer.cc.fsu.edu (128.186.6.103) Location: /pub/sgml/jclark FILE rw-r--r-- 249967 Oct 29 11:35 sgmls-0.3.tar.Z Host utsun.s.u-tokyo.ac.jp (133.11.11.11) Location: /misc/sgml FILE rw-r--r-- 249967 Oct 31 11:46 sgmls-0.3.tar.Z Host src.doc.ic.ac.uk (146.169.3.7) Location: /other3/text/sgml/jclark FILE r--r--r-- 249967 Oct 29 11:35 sgmls-0.3.tar.Z Host isfs.kuis.kyoto-u.ac.jp (130.54.20.1) Location: /ftpmail/utsun.s.u-tokyo.ac.jp/misc/sgml FILE rw-rw-r-- 249967 Dec 19 19:31 sgmls-0.3.tar.Z Host puffin.doc.ic.ac.uk (146.169.3.7) Location: /text/sgml/jclark FILE r--r--r-- 249967 Oct 29 11:35 sgmls-0.3.tar.Z <122> Macintosh Port of ARC-SGML, by Wally Wedel According to a posting to Usenet Newsgroup comp.text.sgml by Wally Wedel (31-December-1991), a version of ARC-SGML ported to the Macintosh platform is available via anonymous-FTP from the site rascal.ics.utexas.edu. The path to 'rascal' is not clearly indicated, but the archive files are available on several other servers, as listed below. "I have recently finished porting ARC SGML 1.0 as released by the SGML Users' Group to the Apple Macintosh environment. This archive is being released to the InterNet community on an as-is, no-support basis. The Compactor archive has been submitted to rascal.ics.utexas.edu. . ." "ARC SGML 1.0 for the Macintosh: This archive is a port of an SGML parser called ARC SGML to the Apple Macintosh environment. Original sources of this parser are available from the SGML Users Group and various FTP sites around the world. Users should read the document LICENSE.DOC in the docs directory to determine the legal status of the original code. My modifications impose no further restrictions on use. This code is supplied on an as-is basis with no warranties of any kind. . ." "This code does contain errors. James Clark has caught and fixed some in his SGMLS code for UNIX. There are no doubt more. Moreover this code is not for the beginner or the timid. A copy of the SGML Handbook by your side when working with it is highly recommended. . .This port has been a 'spare time' project to take a close look at the implementation. I have no plans for further enhancements in spite of the appearance of the todo list in the docs directory." "This archive unpacks a folder which I maintain inside my MPW Folder hierarchy. I build the Think C version by copying the vm.9 project and the sources from sgmlc and sgmlh into a folder named ARC SGML in my Think C Development folder. . . A good place to start reading about this port is the document 'Macintosh port' in the docs directory." (Wally Wedel 12/30/91. Archive Name: ARC SGML 1.0.sea) Contact: Wally Wedel; U S WEST Advanced Technologies; 4001 Discovery Drive, Suite 390; Boulder, CO 80303 USA; Email (Internet): wwedel@uswest.com; AppleLink: D5100; TEL: (1 303) 541-6052 Anonymous-ftp locations for ARC-SGML.MAC are (or were -- use ARCHIE): Host mailer.cc.fsu.edu (128.186.6.103) Location: /pub/sgml/ARC-SGML.MAC FILE rw-r--r-- 461687 Jan 7 14:25 arc-sgml-10.hqx Host sumex-aim.stanford.edu (36.44.0.6) Location: /info-mac/app FILE rw-r--r-- 461687 Jan 3 12:27 arc-sgml-10.hqx Host nro.cs.athabascau.ca (131.232.1.1) Location: /wuarchive/mirrors/info-mac/app FILE r--r--r-- 461687 Jan 3 12:27 arc-sgml-10.hqx Host lth.se (130.235.16.3) Location: /pub/mac/info-mac/app FILE rw-r--r-- 461687 Jan 5 01:18 arc-sgml-10.hqx Host src.doc.ic.ac.uk (146.169.3.7) Location: /text/sgml/ARC-SGML.MAC FILE r--r--r-- 404959 Jan 7 14:25 arc-sgml-10.hqx.Z Location: /micros/mac/sumex/app FILE r--r--r-- 404959 Jan 3 12:27 arc-sgml-10.hqx.Z Location: /mac/sumex/app FILE r--r--r-- 404959 Jan 3 12:27 arc-sgml-10.hqx.Z Location: /micros/mac/umich/etc/compsci FILE r--r--r-- 391587 Jan 5 07:06 arcsgml1.0.cpt.hqx.Z Location: /mac/umich/etc/compsci FILE r--r--r-- 391587 Jan 5 07:06 arcsgml1.0.cpt.hqx.Z Host puffin.doc.ic.ac.uk (146.169.3.7) Location: /text/sgml/ARC-SGML.MAC FILE r--r--r-- 404959 Jan 7 14:25 arc-sgml-10.hqx.Z Location: /micros/mac/sumex/app FILE r--r--r-- 404959 Jan 3 12:27 arc-sgml-10.hqx.Z Location: /mac/sumex/app FILE r--r--r-- 404959 Jan 3 12:27 arc-sgml-10.hqx.Z Location: /micros/mac/umich/etc/compsci FILE r--r--r-- 391587 Jan 5 07:06 arcsgml1.0.cpt.hqx.Z Location: /mac/umich/etc/compsci FILE r--r--r-- 391587 Jan 5 07:06 arcsgml1.0.cpt.hqx.Z Host sics.se (192.16.123.90) Location: /pub/info-mac/app FILE rw-r--r-- 461687 Jan 8 05:14 arc-sgml-10.hqx Host shark.mel.dit.csiro.au (144.110.16.11) Location: /info-mac/app FILE rw-r--r-- 461687 Jan 4 10:32 arc-sgml-10.hqx Host ditmela.mel.dit.csiro.au (144.110.16.11) Location: /info-mac/app FILE rw-r--r-- 461687 Jan 4 10:32 arc-sgml-10.hqx Host wuarchive.wustl.edu (128.252.135.4) Location /mirrors/info-mac/app FILE -r--r--r-- archive 461687 Jan 3 06:27 arc-sgml-10.hqx <123> Amsterdam (Free University) Parser A description of the Amsterdam Parser (ASP) is found in the article cited sub section 1 above, <45>. "The Amsterdam SGML Parser uses an LL(1) parser generator, notably LLgen, for both DTD and document parsing. Actually, SGML is not LL(1), [so the developers] used the 'conflict resolvers' from LLgen to overcome the problems [they] came across." Questions regarding the parser and its current status might best be directed to the Vrije Universiteit: Prof. Dr. J.C. van Vliet, Internet email: hans@cs.vu.nl. As a last resort, contact Jos Warmer at PTT Research (no longer at VU): jos@pttrnl.nl OR possibly JB_Warmer@pttrnl.nl. The Amsterdam Parser is available via anonymous-FTP on the hosts listed below, though support for the parser (obtained in this manner) should not necessarily be expected. Host star.cs.vu.nl (130.37.24.6) DIRECTORY rwxr-xr-x 512 Aug 7 10:34 Sgml FILE rw-r--r-- 419803 Aug 7 10:23 Sgml.tar.Z Host nic.funet.fi (128.214.6.100) Location: /pub/languages/sgml FILE rw-r--r-- 419803 Aug 12 02:14 Sgml.tar.Z Host isfs.kuis.kyoto-u.ac.jp (130.54.20.1) Location: /ftpmail/ftp.ricoh.co.jp/pub/misc/sgml FILE rw-rw-r-- 419803 Dec 18 17:44 Sgml.tar.Z Host theta.iis.u-tokyo.ac.jp (130.69.48.4) Location: /sgml FILE rw-rw-r-- 419803 Dec 13 13:12 Sgml.tar.Z <124> Software Exoterica - XGML Validator (XGML Normalizer, XGML Translator, XGML OmniMark) An entry-level validating SGML parser distributed by EPSIG and GCA for just sixty-five (65) US dollars is Software Exoterica's Validator(tm). XGML Validator is an application of Software Exoterica's XGML Translator, and contains a subset of features implemented in the fully-functional XGML Normalizer and XGML Tester. Validator is a batch SGML parser with error detection but no entity expansion or normalization features. XGML Normalizer also produces a version of the document with all shortrefs and entities expanded; Tester will generate an even more detailed version of the document in various formats, including the format suggested by the newest ISO standard for conformance testing. The Validator parser validates SGML declarations, DTDs and document instances: it detects all errors in a single pass, reports errors in a diagnostic transcript, recovers intelligently from errors and always parses the full document. Validator requires a PC (MS-DOS) with 386 CPU and at least 4 megabytes of memory. See further description in: (1) "EPSIG Sells XGML Validator," The Seybold Report on Publishing Systems 20/9 (January 28 1991) 28; (2) "EPSIG and GCA Selling Exoterica Validator," SGML Users' Group Newsletter 20 (September 1991) 32-33, or (3) 17 (December 1990) 4, or EPSIG News 3/4 (December 1990) 9. Ordering from GCA or EPSIG: EPSIG; c/o OCLC, Mail Code 278; 6565 Frantz Road; Dublin, OH 43017- 0702; TEL: (1 614) 764-6195; FAX: (1 614) 764-6096. GCA address: Graphic Communications Association; 100 Daingerfield Road, Alexandria, VA 22314; TEL: (1 703) 519-8160; Telex: 510-600-0889; FAX: (1 703) 548-2867. Software Exoterica's related products include an SGML editor and several tools for SGML translation/transduction. (1) XGML CheckMark is a full-screen SGML editor available for the Mac II. It was originally developed as a conversion tool, but can be used for a variety of authoring/writing tasks. CheckMark uses actual DTDs, not rules files, and can validate documents while they are being edited. Tags may be entered into documents from menus or directly from the keyboard. (2) XGML Translator is an SGML-based string transducer which uses a string conversion language called XTRAN. Translator is available for Unix systems and 386 DOS microcomputers. Translator's functions are up-, down-, context- and cross-translations -- creating various kinds of SGML <<->> non-SGML conversions of documents and structured information. Exoterica considers XGML Translator appropriate for third generation applications. See Brian Travis, "Review of the Exoterica XGML Translator," 17 (December 1990) 5- 11. (3) XGML OmniMark is Exoterica's tool for fourth generation applications using markup languages. OmniMark script-writing and document translation facilities can be used to convert SGML documents to input languages of other languages, and to convert the output languages of other to SGML-defined languages (e.g., AAP, CALS), and to convert between arbitrary languages and data formats. OmniMark is available for MS-DOS (386-class machines), Microsoft Windows 3.0, OS/2, Macintosh, SCO UNIX, DEC VAX, and most UNIX systems. See EPSIG News 4/4 (December 1991) 5-6. Queries about XGML Validator and the full line of Exoterica's XGML products may be directed to: Software Exoterica Corporation; 383 Parkdale Avenue, Suite 406; Ottawa, Ontario; CANADA K1Y 4R4; TEL: 1- 613-722-1700; TEL: 1-800-565-9465 (1-800-565-XGML); FAX: 1-613-722- 5706. <125> SEMA Group/Yard Software Systems Limited (Mark-It Light) Mark-It Light is a "low cost" entry-level validating parser from the SEMA Group, and runs on MS-DOS 286 and 386 class microcomputers. It is based upon the so-called Sobemap parser, named after the European company now known as the SEMA Group. Mark-It Light supports a subset of features found in SEMA's fuller Mark-It package. Mark-It includes, as part of the package, a structured document editor called Write-It, which is equivalent to IBM's TextWrite and SoftQuad's Author/Editor. The Mark-It parser supports SGML more completely than most other SGML parsers: multiple concurrent documents, subdocuments, data tags, attributes, marked sections, rank, tag omission (minimization), short tags, short references, formal public identifiers, model inclusions, links (simple, implicit and explicit links). Additionally, Mark-It uses SGML together with a regular expression pattern recognizer to control the process of converting files both into and out of SGML form: "users can easily convert to and from TeX, including the LaTeX and JLaTeX forms, without having to be provided with a preconfigured TeX conversion package." Other components in the suite include Parse-It (file creator and parser), Compile-It (SGML DTD compiler for Write-It) and Lisp-It (an SGML-aware interpreter useful for applications development). Compiled versions of Mark-It and related tools are also available for UNIX and most mainframe computers, while C source code is available to OEM developers. The Mark-It Light parser itself thus supports basic validation: markup using the complete range of tag minimization features (data tagging, SGML tag ranking option), and interactive correction of markup errors; it also provides access to the full set of 256 character codes (extended ASCII). See: (1) "Yard Release[s] Mark-It Light," SGML Users' Group Newsletter 18 (November 1990) 22; (2) "Mark-It Light," EPSIG News 4/1 (March 1991) 4; (3) Martin Bryan and Ed Warnshuis, "Comments from our readers (letter to the editor)." The Seybold Report on Publishing Systems 20/10-11 (February 25, 1991) 2-4; (4) "Five New Products from Sema," EPSIG News 4/3 (September 1991) 7. Contacts: In Europe, Products Manager; SEMA Group Systems Ltd.; Avonbridge House, Bath Road, Chippenham, Wiltshire SN15 2BB; TEL +44- 249-656-194 (Paul Moorhead); FAX +44-249-655-723; or, Martin Bryan; Yard Software Systems; 29 Oldbury Orchard; Churchdown; Glos GL3 2PU, UNITED KINGDOM; TEL: +44 452 714029; or Griet Descheemaeker; SEMA Group Belgium; Place du Champ de Mars 5; Boite 40;B-1050 Brussels; BELGIUM; TEL: +32 2 508 53 23; In the US: John W. Oster II; Yard Software USA; 125 North Main Street; Bel Air, MD 21014; TEL: 301/838- 1911; FAX: 301/838-1913. <126> SoftQuad (Author/Editor) Transparent SGML parsing facilities are incorporated into SoftQuad's popular Author/Editor and related SGML products. Author/Editor (A/E) is an SGML(-aware) text editor and word-processor used as a major component in electronic publishing by a wide range of developers. Author/Editor uses pre-compiled DTDs rather than directly-editable ASCII DTD files (viz, DTDs based upon the AAP tagsets, CALS-SGML DTDS, ATA DTDs, or other DTDs shipped with A/E). Users who wish to customize standard DTDs or build new DTDs rely upon a second SoftQuad product, RulesBuilder (or its batch counterpart, 'buildRules'). The user is informed by A/E of all legal tags at any given point in the text by means of pull-down menus, from which the tags may be selected. Valid SGML document structure (i.e., conforming to the rules file) is enforced because it is not possible to type SGML tags directly into the file. Recent (1991) enhancements to A/E include a macro programming language (Scheme, based upon LISP), a new tables editor and graphics module. A/E interfaces with a number of other related products to support text and graphics formatting of the SGML files (SoftQuad Publishing Software). A/E is currently (1991) supported for both Apple Macintosh and DOS (MS-Windows), as well as for SunOS, Ultrix, Sun Open Windows, DEC Motif, etc. Prices (1991, with 12-month support contract) are about 1995 dollars US (DOS), 3295 dollars US (UNIX) and 1695 dollars US (Mac), though academic discounts will likely be arranged. See further description: (1) "SoftQuad Author/Editor: An SGML Context-sensitive Text Entry System." Pp. 99- 103 in Tools for Humanists, 1989. A Guidebook to the Software and Hardware Fair Held in Conjunction with the Dynamic Text [6-9 June 1989 Toronto]. Toronto, Ontario: Centre for Computing in the Humanities, 1989; (2) "SoftQuad - New Language/Platform," SGML Users' Group Newsletter 19 (April 1991) 21; (3) "SoftQuad Adds Table Editor," EPSIG News 4/4 (December 1991) 4. Contact: In North America: SoftQuad Inc.; 56 Aberfoyle Crescent, Suite 810; Toronto, Ontario; Canada M8X 2W4; TEL: +1 416 239 4801; +1-800- 387-2777; FAX: +1 416 239 7105. Email: dns@sq.com (David Slocombe). In Europe: Open Information Technology (UK) Ltd.; Marble Arch; King Street; Knutsford; Cheshire WA16 6HD; UNITED KINGDOM; TEL: +44 565 50021; FAX: +44 565 51093. <127> E2S (EASE -- E2S Advanced SGML Editor) "EASE enables the production of Basic SGML documents (see clause 15.1.1 of ISO 8879), using any PC running DOS 3.30 (or higher) and with 550 kB free memory. It also runs on UNIX graphical workstations, under X-windows. EASE consists of two integrated parts, the DTD- editor and the DI-Editor, both of which offer on-line and context- sensitive control and help in the use of SGML (syntax- oriented editing). They also offer on-line, context-sensitive (and hypertext- like) help in the use of the editors themselves. The user interface is window-based, with pull-down menus for the various options/functions -- which can be accessed not only via the mouse, but also via the keyboard or function keys. All nine official languages of the E.C. are supported simultaneously, and unformatted output can be sent straight to an ASCII or PostScript printer. On the PC version, individual copies of the software require a protection key (dongle) to be fitted to the Centronics (parallel) port." "The DTD-editor is used to create, edit, validate and compile DTDs for use with the DI-editor. Inexperienced users can use options from the pull-down menus to build DTDs, whilst others can enter the text directly (with DTD-editor automatically picking up any syntactic errors). Other DTDs stored in plain ASCII files, can be imported into the DTD-editor for parsing, validation, and compilation (provided that they conform to the requirements suitable for producing Basic SGML documents). On the basis of the (compiled) DTD, the DI-editor displays the elements for the minimal logical structure of a document of that type. Users may then either edit the logical structure of the document (all actions being checked for semantic correctness), or select appropriate elements and swap into the text window for data entry (and/or the entry of further valid elements). It is also possible to import a plain text file, conforming to ISO 2022, and add valid markup according to the DTD; the resulting document will be a validated Basic SGML document (extended for multilinguism)." Reference: "New SGML Editor from E2S," EPSIG News 4/3 (September 1991) 6-7. Normal licence fees: are 1,250 dollars US (PC); 6,500 dollars US, (workstation); University licence fees: PC: 125 dollars US + medium, documentation, administration and shipment: 500 dollars US; Workstation: 650 dollars US + medium, documentation, administration and shipment: 500 dollars US. This information thanks to Caroline de Vleeschauwer (Email: cdv%e2s@relay.EU.net, Date: Thu, 19 Dec 91) Contact: Joost CARDOEN, Managing Director, E2S, Moutstraat 100; B-9000 Gent, Belgium, TEL: +32/91/21.03.83, FAX: +32/91/20.31.91, e-mail: jca@e2s.be; OR Ronny Verkest, Sales Manager; E2S; Moutstraat 100; B- 9000 Gent; Belgium; TEL: +32(91)21 03 83; FAX: +32(91)20 31 91; Email: e2s@e2s.be (Internet). <128> Electronic Book Technologies (MS-Windows) DynaText SGML Browse/Search DynaText supports SGML document indexing/searching/browsing and is implemented for MS-DOS (Windows), as well as for various UNIX platforms. It allows users to read, query, dynamically display and annotate electronic "books" or other structured information. DynaText uses SGML element tags to automatically generate hyperlinks to associated material such as diagrams, tables, and explicit cross references; it allows users to add their own link types/behavior through simple style sheet entries. Electronic style sheets are held in ascii-editable files with SGML syntax. This mechanism can be employed by users who want to create dynamic multi-media documents. Style definitions may be used to set the display characteristics (font type, size, color) including visibility or suppression of each SGML element. The principle of conditional visibility of elements (and element classes) in response to style sheets and icon clicks permits rapid customization of electronic books where a variety of document editions is desired. DynaText builds a full text index of the SGML document and (unlike other indexers that simply report occurrences within an entire document) can report occurrences within SGML components. Hit-list statistics for each document section provides an unprecedented level of search precision that enables users to find terms within the relevant sections of the document quickly. Wild cards and regular expressions may be used in queries, eliminating the need for exact string matches; Boolean logic (AND, OR) may also be specified. The indexer supports synonym lists that act like special purpose thesauri that enable access to information though a variety of synonymous terms. This feature is especially useful in acronym-laden technical reference manuals. In addition to displaying SGML-structured character text, DynaText supports a variety of popular raster formats (such as PICT, TIFF, Sun Raster and CCITT fax formats) to facilitate capture of associated artwork. The system also supports an open architecture for integration with multi-media applications allowing sound, animation and video supplements to be added to existing reference documents. Discounts up to 80 percent are available to academic institutions. See fuller descriptions of the product in: (1) "DynaText: Electronic Book Engine from EBT [Electronic Book Technologies]: First to Handle any SGML Application," Seybold Report on Publishing Systems 20/2 (September 24, 1990) 18-22; (2) Steven J. DeRose, "DynaText: Electronic Book Indexer/Browser," EPSIG News 3/4 (December 1990) 1-2; (3) "Electronic Book Technologies," SGML Users' Group Newsletter 20 (September 1991) 32; (4) "Electronic Book Technologies' Dynatext (SGML compiler). Text Tools: Beyond Search and Retrieval." Release 1.0 91/7 (July 31, 1991) 15-17. Contacts: (US): Electronic Book Technologies, Inc.; One Richmond Square; Providence, RI 02906 USA; TEL: (401) 421-9550; FAX: (401) 421-9551; (Internet): sjd%ebt-inc@uunet.uu.net (Steven J. DeRose), or ebt-inc!sjd@uunet.uu.net (that's ebt- inc!sjd@uunet.uu.net); (Internet): lrr%ebt-inc@uunet.uu.net (Louis R. Reynolds); in Europe: EBT International; 20, Pre de la Ferme; 1261 Gingins; SWITZERLAND; TEL: +41-22-69-24-24; FAX: +41-22- 69-24-25. <129> Avalanche Development Company (FastTag, IMSYS) Intelligent Autotagging Retro-fitting electronic data files held in proprietary format with SGML tagging, or converting paper documents to SGML-tagged files is frequently accomplished with the use of intelligent automatic markup software. The "autotagging" software from Software Exoterica (XGML Translator, XGML OmniMark) was noted briefly above,; see sub <124> and compare <44>. Other products for SGML conversion/translation are Avalanche Development's tools FastTag and IMSYS: they incorporate autotagging software using visual recognition techniques to generate structured output text. Using a VRE(tm) Visual Recognition Engine, FastTag reads visual clues: (1) from an output file in the case of electronic files or, (2) in conjunction with OCR, to convert paper- based documents into SGML electronic file formats. FastTag reckons with graphics objects, tables and similar text structures. Input may be ASCII, Calera PDA files, DCA/RTF files, WordPerfect or Word files (other wordprocessor formats supported), OCR/ICR scanner files, DECWrite, Interleaf, and so forth; output is user-definable, but may be specified as SGML, RTF, FrameMaker (Maker Interchange Format, MIF), Ventura Publisher, Interleaf, Datalogics, troff, nroff, tbl and so forth. FastTag and IMSYS are supported on MS-DOS, as well as on a variety of UNIX, VMS and Ultrix systems. The autotagging technology is licensed by a number of OEM developers (Xerox Information Systems (Kurzweil), Bell Atlantic (DocuSource), Shaffstall, IBM and DEC). See: (1) "Avalanche Development's FastTAG." Release 1.0 91/7 (July 31, 1991) 14-16; (2) "Avalanche - New Developments," SGML Users' Group Newsletter 19 (April 1991) 18-19; (3) "Avalanche FASTTAG to Support EMS DTDs," EPSIG News 3/2 (June 1990) 5-6 (Avalanche announces intention to support DTDs of the EPSIG/AAP Electronic Manuscript Standard); (4) "Systems Solutions for the '90s: In-Plant, Commercial. [Avalanche Development]," Seybold Report on Publishing Systems 18/14 (April 24, 1989) 26-43 (reviews Avalanche's Imsys.CALS, based on its Visual Recognition Engine, which interprets visual cues in the output file of word processors and generates a tagged ASCII file. It thus provides means of updating files to CALS-compliant SGML format). Related products include Proof Positive (spelling, grammar and style checking facilities for use with Interleaf and FrameMaker) and hypertext interface facilities. Contact: Eileen Quirk, Director of Marketing and Sales; Avalanche Development Company; 947 Walnut Street; Boulder, CO 80302 USA; (303) TEL: 449-5032; FAX: (303) 449-3246. Email (Internet): support@avalanche.com, sales@avalanche.com, apps@avalanche.com. <130> qwertz/FORMAT -- SGML to LaTeX (and nroff/troff) Translator Version 1.1 of the qwertz SGML documenting processing system is now available. Format is an SGML to LaTeX (and nroff/troff) translator. It consists of: (1) SGML document type definitions for the LaTeX document styles (articles, books, reports, letters, slides), for BibTeX bibliographies and for Unix manual pages; (2) Programs for translating SGML documents of the these types into LaTeX and nroff/troff; (3) A command for extracting source code from documentation, which is useful for a simple kind of "literate programming". Format allows LaTeX documents to be created using powerful and comfortable SGML editors, such as Author/Editor, in a quasi WYSIWYG manner. It also provides a layer of abstraction from LaTeX, allowing documents to be more easily translated into formats for other document processing systems, such as nroff/troff or MS-WORD. Anonymous-FTP locations (use ARCHIE for updates): Host gmdzi.gmd.de (129.26.1.90) Location: /pub/gmd FILE rw-r--r-- 959 Sep 30 11:51 sgml2latex-format.readme FILE rw-r--r-- 1336911 Sep 30 11:51 sgml2latex-format.tar.Z Host utsun.s.u-tokyo.ac.jp (133.11.11.11) Location: /misc/sgml FILE rw-r--r-- 1336911 Oct 31 11:48 sgml2latex-format.tar.Z Host liasun3.epfl.ch (128.178.36.30) Location: /pub/tex FILE rw-r--r-- 1336911 Oct 23 20:28 sgml2latex-format.tar.Z Contact: Thomas F. Gordon; German National Research Center for Computer Science (GMD); Schloss Birlinghoven, D-5205 Sankt Augustin 1, Germany; Internet email: thomas@gmdzi.gmd.de; phone: (+49 22441) 14- 2665. (Adapted from Usenet posting to comp.text.sgml; Subject: New Version of Format; Date: 30-Sep-91) <131> NIST "SGML parser materials" The older NIST parser materials are reported to be out-of-date and somewhat unreliable. Furthermore, questions persist concerning the NIST's support for these code sources, and, in light of apparently stronger current support for ODA (e.g., NIST ODA SIG in the OSI Implementors Workshop), for SGML more generally. It would be wise to consult with NIST and with a recognized SGML guru before spending time building on these SGML parser tools. Host stag.math.lsa.umich.edu (141.211.64.23) FILE rw-r--r-- 1731 Sep 15 1990 nist-sgml.cheater FILE rw-r--r-- 202923 Sep 15 1990 nist-sgml.tar.Z Host nic.funet.fi (128.214.6.100) Location: /pub/languages/sgml FILE rw-r--r-- 1731 Aug 12 02:29 nist-sgml.cheater FILE rw-r--r-- 202923 Aug 12 02:29 nist-sgml.tar.Z Host theta.iis.u-tokyo.ac.jp (130.69.48.4) Location: /sgml FILE rw-rw-r-- 871 Dec 13 13:13 sgml.cheater.Z FILE rw-rw-r-- 198363 Dec 13 13:15 sgml.cpio.Z <132> Public DTD Collections Public DTDs and entity sets for SGML are printed in the standards documents and in most handbooks, and are usually shipped with commercial SGML software. The nineteen (19) character entity sets published in Annex D of the ISO 8879 SGML Standard are legally unencumbered, and will probably become widely available in electronic format (use ARCHIE to search for filenames like "ISOlat1," "ISOlat2," ISOgrk1," "ISOgrk2," "ISOgrk3," "ISOgrk4," "ISOcyr1," "ISOcyr2," "ISOnum," "ISOdia," "ISOpub," "ISObox," "ISOtech," "ISOamso," "ISOamsb," "ISOamsr," "ISOamsa," "ISOamsn," "ISOamsc," etc.). Currently, sample DTDs in usable ASCII format may be obtained on the academic networks from several sources: (a) DTDs produced by the Text Encoding Initiative (TEI) are available from the BITNET listserv(er) on UICVM. The filenames and content of these DTDs can be expected to undergo change during 1992, but they currently have a VM/CMS filespec of ' DTD' (e.g., TEI1 DTD; TEIHDR1 DTD; TEIWSD1 DTD; TEIBASE1 DTD; TEIFRON1 DTD; TEIBACK1 DTD; TEILOW1 DTD; TEICRYS1 DTD; TEILING1 DTD; TEIREND1 DTD; TEIDRAM1 DTD; TEITC1 DTD; TEITC1 DTD). A number of related files on TEI syntax are also available (SYNTAX MEMO; OVERVIEW SYNTAX; BASICS SYNTAX; DOCUMENT SYNTAX; SGMLDECL SYNTAX; DTD SYNTAX). Use interactive BITNET command or mail to request the files, "get " and item <114> above for other standard BITNET LISTSERVer syntax. The DTDs may be obtained also from the LISTSERVer in Europe (Goettingen) which hosts MARKUP-L: LISTSERV@DGOGWDG1 on BITNET or listserv@ibm.gwdg.de on the Internet; see <116>. (b) MAJOUR (Modular Application for Journals). The EWS (European Workgroup on SGML) has published a DTD for the headers of scientific articles (MAJOUR-Header DTD). In cooperation with the CERN EPS (European Physical Society), EWS has now also produced a "body" standard DTD for complete scientific articles. The DTDs developed by EWS and EPS/CERN are based upon AAP article DTDs. See above entry <110> for further details on EWS. Free copies of the MAJOUR DTD(s) may be obtained from the STM (International Group of Scientific Technical and Medical Publishers): contact Ms Harriet de Hoog, STM Secretariat, Keizersgracht 462, 1016 GE Amsterdam, THE NETHERLANDS; FAX +31-20-38-15-66. See further in "European Workshop on SGML," SGML Users' Group Newsletter 20 (September 1991) 22. Alternately, contact the GCA for a free copy of MAJOUR (with purchase of other materials from the GCA's printed materials); see SGML Users' Group Newsletter 20 (September 1991) 32. (c) A HyTime DTD (declarations set) is located on the online SIGhyper forums: Host ftp.ifi.uio.no (129.240.88.1) and Host mailer.cc.fsu.edu (128.186.6.103). Disk space has been secured at the University of Oslo, Department of Informatics for a larger collection of public DTDs: see ftp.ifi.uio.no:/pub/SGML for contributions that may be available from that archive (e.g., for the "general" document DTD 'PUBLIC "ISO 8879:1986//DTD General Document//EN"' from ISO 8879 Annex E as amended). (d) Public DTDs and entity sets should be available from Exeter. One may periodically check the online library of the Exeter SGML Project for its collections: anon-FTP to sgml1.ex.ac.uk (144.173.6.61). See item <113> above. (e) The CALS-BBS forum described sub item <117> above contains several CALS-related DTDs, usually in self-extracting DOS compressed files (filename.exe). (f) DTDs supporting the AAP/EPSIG manuscript standard are available from EPSIG for 10 US dollars plus shipping; see <107>. The same files may be accessible from file servers: use ARCHIE to search for AAPARTCL.TXT; AAPBOOK.TXT; AAPSERL.TXT; SHORTREF.TXT or similar filenames. (g) The "Information Architecture" working group of the OSF Documentation Special Interest Group (IA WG of the OSF Doc SIG) will be releasing public DTDs during the first and second quarters of 1992. Contact Fred Dalrymple, the Open Software Foundation; TEL: (1 617) 621-8855, or John Bowe, Email (Internet): bowe@osf.org. =========================================================== FURTHER BIBLIOGRAPHY AND RESOURCE GUIDES FOR SGML =========================================================== <133> Cover, Robin; Duncan, Nicholas; Barnard, David. Bibliography on SGML (Standard Generalized Markup Language) and Related Issues. Technical Report 91-299. Queen's University, Kingston, Ontario. February, 1991. ISSN 0836-0227. 312 pages. A revised print version of a bibliographic and information database (compiled by Robin Cover), structured in SGML-database and formatted with SGML ->> BibTeX utilities developed at Queen's University by Nick Duncan and David Barnard. For print copies, contact: (1) Department of Computing and Information Science; Queen's University; Kingston, Ontario, Canada K7L 3N6; TEL: (613) 545- 6056; Email (Internet): heather@qucis.queensu.ca, or (2) the Graphic Communications Association, <105>. The printed version of the database contains a "Short Bibliography" of 67 essential references, and a fuller "Main Bibliography" with 1403 citations (many with abstracts). The second major section is an SGML Directory for some 117 SGML- supporting groups in academia, government, or industry: each entry supplies addresses, descriptions of software products or SGML services, and references. In detail: Table of Contents 1 Introduction.............................................1 1.1 What is SGML? (Robin Cover)..............................2 1.2 Content of the Bibliography (Robin Cover)................2 1.3 Format of the Bibliography (Nicholas Duncan).............9 1.4 Database Access and Collection Maintenance...............9 2 Acknowledgments (Robin Cover)...........................13 3 Acronym List............................................15 4 Short Bibliography......................................20 5 Main Bibliography.......................................25 6 Directory of SGML Supporting Groups....................160 6.1 Index of Entries.......................................160 6.2 Directory..............................................163 7 Appendices.............................................261 A Processing the Bibliography............................261 B BiBTeX Enhancements....................................262 C Document Type Definitions (DTDs).......................265 D ISO 8879-1986 Entities (LAT1, LAT2, ISOdia, ISOmin)....270 E Language Codes for SGML (ISO 639 and ANSI/MARC)........293 New bibliographic references and other SGML information for this database are welcome: please send citations (published or unpublished materials: technical reports, working papers, internal memoranda, articles, product announcements, product reviews) to Robin Cover via electronic or postal mail. Plans are underway to make the SGML database available electronically via a BITNET listserver (listserv@uicvm.bitnet or listserv@uicvm.uic.edu) and on Internet via anonymous FTP. <134> The SGML Source Guide. The Graphic Communications Association's Guide to Standard Generalized Markup Language (SGML) Systems, Software, Service, Consultants, Seminars and Resources. Edited by Marion Elledge. Graphic Communications Association, February, 1991. 6" x 8". 105 pages. ISBN: 0-93505-13-2. Several SGML-related standards documents distributed by GCA are listed and annotated in this Guide. Listings of SGML suppliers are in alphabetical order and provide information on the type of business, name and description of products or services, and prices. The Guide is issued on a subscription basis in looseleaf format; updates are issued quarterly or as information is accumulated. <135> SGML Products and Services. A document covering primarily CALS- SGML, produced by Joan Smith for the CALS in Europe SIG. Cost is approximately 20 UK pounds. Contact: David Ardron, Secretary, CALS in Europe SIG; Ferranti Computer Systems Ltd,; Western Road, Bracknell, Berkshire RG12 1RA; UNITED KINGDOM; TEL: +44-344-483232. ===================== POSTSCRIPT ===================== COPYLEFT. The bibliography above is copyright (c) Robin Cover, December 1991. Everyone is granted permission to freely copy and distribute the file in any format whatever, for non-commercial purposes, so long as the entire document is reproduced, together with this notice. Corrections and additions will be received with gratitude. PERSPICUITY IN PUNCTUATION. Several painful compromises have been required to produce this network-safe ASCII version. Somewhat against the spirit of SGML, I have used angle brackets to delimit entry numbers, both serially and as cross-references; any editor supporting regular-expressions can be used to convert these particular delimiters to conventional square-brackets. Likewise, the use of parentheses for square brackets in running text is infelicitous, where the latter more clearly signified (editorially) added text, and nested better within parenthesized expressions. SGML character entities have been used to represent extended ASCII accented characters, for which I offer no apology: anyone wishing to print the bibliography on paper copy should first change out these entities (e.g., é, É, ü, à, ø, ç, č). Thus, with the exception of @ ("@" IRV 4/0 = decimal 64), I have restricted the character set to the 'safe' subset of (non- national-use) ISO 646 IRV characters, but at the risk of complete clarity in a few contexts. According to network authorities, use of any characters outside the following 'safe' ISO 646 subset is dangerous, especially across national boundaries, since unanticipated and often undetectable instances of EBCDIC-ASCII translation during transmission will have mutagenic effects upon the 'non-safe' IRV/ASCII characters. This wisdom warns that the 'safe' ISO 646 subset contains ONLY the following (non national-use) characters: a b c d e f g h i j k l m n o p q r s t u v w x y z A B C D E F G H I J K L M N O P Q R S T U V W X Y Z 0 1 2 3 4 5 6 7 8 9 " % & ' ( ) * + , - . / : ; < = > ? _ SPACE (SP, IRV 2/0) ---------------------------------------------------------------------- Robin Cover BITNET: zrcc1001@smuvm1 ("one-zero-zero-one") 6634 Sarah Drive Internet: zrcc1001@vm.cis.smu.edu Dallas, TX 75236 USA Internet: robin@utafll.uta.edu ("uta-ef-el-el") TEL: (214) 296-1783 Internet: robin@ling.uta.edu FAX: (214) 709-3387 Internet: robin@txsil.sil.org ----------------------------------------------------------------------