SGML: Free Soft: MtStr - Multilingual string library

SGML: Free Soft: MtStr - Multilingual string library


Received: from UICVM.UIC.EDU by UICVM.UIC.EDU (LISTSERV release 1.8b) with NJE
          id 7226 for TEI-L@UICVM.UIC.EDU; Fri, 14 Jun 1996 14:06:29 -0500
Date:         Fri, 14 Jun 1996 14:03:29 CDT
Reply-To: Jean Ve'ronis <veronis@univ-aix.fr>
Sender: "TEI (Text Encoding Initiative) public discussion list"
              <TEI-L@UICVM.UIC.EDU>
From: Jean Ve'ronis <veronis@univ-aix.fr>
Organization: CNRS - Universite' de Provence
Subject:      Free Soft: MtStr - Multilingual string library
Comments: cc: poul.andersen@LUX.DG13.cec.be, Jacques.Andre@irisa.fr,
          multext@univ-aix.fr
To: Multiple recipients of list TEI-L <TEI-L@UICVM.UIC.EDU>

-------------------------------------------------------------------------

              *** ANNOUNCING FREE SOFTWARE ALPHA RELEASE ***

                  MtStr - Multilingual string library


MtStr is a C library for UN*X developed in the context of the MULTEXT
project, which extends the usual functions provided in the C character
and string ctype and string libraries, in order to accommodate
multi-lingual text processing. MtStr is designed especially for texts
encoded using SGML. It offers the following capabilities:

   * it extends functions such as isupper, strcmp, strtok, etc., to
     handle several different character sets as well as SGML entities;
   * it offers additional functions for translation between different
     character sets and/or SGML entities;
   * it extends functions such as isupper, islower, etc., to handle
     characters with diacritics;
   * it provides means to handle different lexicographic orderings;
   * it provides functions for conversion between character classes
     (e.g., between the accented and unaccented versions of a given
     character, etc.);
   * it handles both fixed-width character sets (in which all characters
     have the same byte size) and variable-width character sets (in
     which characters may have different byte sizes);

MtStr internally maps all characters to Unicode/ISO 10646. The functions
are table-driven and can therefore be easily customized to handle
modification or additional character sets, as needed.

Source code and documentation can be freely downloaded for
non-commencial, non-military use (see our user agreement) from:

     
http://www.lpl.univ-aix.fr/projects/multext/MtStr/

Note that MtStr is an alpha version with bugs and limitations. It is
being distributed "as is" in order to solicit feedback. We invite the
user community to send comments and advice, provide additional tables,
etc.

Jean Ve'ronis
Multext project Coordinator_______________veronis@univ-aix.fr