[This local archive copy mirrored from the canonical site: http://www.balise.com/current/balise4desc.htm; links may not have complete integrity, so use the canonical document at this URL if possible.]

AIS Software Announces Balise 4.0

November 1997

AIS Software announces the new Balise 4.0 Release and unveils its delivery schedule. This major release brings a significant set of new features:

Balise Release 4.0 will be available end of January 1998.

Release 4.0 overview

Below are some more details about each of these features. As you will see, this represents a *major* functionality upgrade for Balise! Feel free to use to send us your comments or ask for more details about any of these features.

Direct handling of Word/RTF documents

Goal

To be able to process RTF documents to convert them efficiently & easily to other formats, typically some flavour of SGML.

Usage/Form

A new scanner in Balise that is activable using the currently available Open Parser Interface. The RTF document scanner generates "ESIS" events corresponding to an implicit "RTF DTD", which models every logical piece of information one can find in an RTF document.

You will then be able to write Balise programs to handle the "restructuration" phase of the transformation. The big benefit is that low-level decoding and analysis will be automatically performed for you.

Details

The following MS-Word features are supported

The first version will only support "Western RTF". RTF-J (Japanese) will be considered in a later phase (which might in fact be superseded by support of the new "Universal" RTF 1.5 which comes with Word 97).

Native support of the emerging XML standard

Goal

To be able to handle XML documents directly in Balise and manipulate them exactly as we can today manipulate SGML documents.

Beyond XML concerns, this module will bring much flexibility in handling tagged files without explicit DTDs at hand.

Usage/Form

A new parser in Balise that is activable using the currently available Open Parser Interface.

A prototype of this parser is already available as a free unsupported add-on to Balise 3.1. The official release which will be an integral component of Balise 4.0 implements the XML-LANG specification as it has stabilized over this last quarter.

Details

New set of functions for manipulating SGML DTD information ("DTD API")

Goal

To be able to access and manipulate DTD information while processing an SGML instance. In particular, DTD API gives access to the details of entity and/or element declarations beyond what is currently implemented in Balise 3.1.

Usage/Form

A set of new functions and a predefined data type which models DTD information.

When DTD information is loaded, it becomes a Balise data structure that can be browsed and even updated using the existing Balise mechanisms.

This mechanism will subsume the Balise 3.1 entity access mechanism which remains available for compatibility reasons.

Details

Handling of markup-level SGML structures

Goal

To be able to handle specific SGML-to-SGML transformations that require fine-grain knowledge of the SGML markup, typically beyond the ESIS information level supported by Balise 3.1.

As a related extension, Balise also now provides access to the SGML simple and implicit LINK information.

Usage/Form

A set of new SGML event types is introduced, which correspond to this new information : external entity boundaries, marked sections boundaries, SGML comments, etc.

Corresponding content rules are also added to the Balise language. For performance reasons, these events will only be raised as an option.

Details

The following non-ESIS events are introduced:

SGML LINK information is transferred through Balise applicative attributes.

Transparent handling of large, disk-based indexed structures ("persistent maps")

Goal

To be able to easily manipulate indexed structures that are too large to fit in memory and/or should be shared and reused through multiple Balise sessions.

Usage/Form

A set of Balise functions that provides a very similar interface to that of Balise maps, but manipulate disk-based structures.

Details

Such disk-based maps are indexed (access is very efficient) and are scalable from very small to very large numbers (several millions) of items.

Processing of large SGML documents stored as disk-based SGML trees ("disk-based Balise Documents")

Goal

Since Release 3.0, Balise has provided an API for manipulating SGML instances (or parts of instances) as "Balise documents" represented as SGML elements trees.

This has proved to be a very efficient and powerful feature for some kinds of SGML transformations involving structural manipulations. The main limitation of this tree-based manipulation approach was that SGML trees were handled in memory.

This new feature precisely removes this limit by providing an unlimited disk-based implementation of Balise Documents.

Usage/Form

Disk-based Documents will be manipulated exactly through the same API as memory-based documents available in Rel 3.1. This is made possible by the Abstract Document Interface implemented in Balise.

Only a few new functions are provided such as creation of a disk-based document from an SGML instance, or opening a disk-based document for manipulation and update.

Details


 
AIS Software
15 - 17 Rue Rémy Dumoncel
F-75014 Paris
France
Tel: (+33) 1 40 64 43 00
Fax: (+33) 1 40 64 43 10
email: info@balise.com