RXP Description From: ftp://ftp.cogsci.ed.ac.uk/pub/richard/rxp-beta8.tar.gz
Date: 980527
-----------------------------------------------------------------

This is a beta-test release of RXP, a non-validating XML parser in C.

RXP is free for individual, research and educational use and for
evaluation.  This release may not be redistributed.  Commercial
licences will be available for future releases.  See the file COPYRIGHT
for copyright and disclaimer.

RXP is based on the W3C recommendation of 10th February 1998;
deviations from the recommendation should probably be considered bugs.

It can be compiled in 8- or 16-bit character mode.  In 8-bit mode, the
internal encoding is a superset of ASCII, in which all characters above
0xa0 are treated as name characters.  Characters are not translated on
input or output.  This means that well-formed documents in ASCII and
ISO-8859-N should work.  In 16-bit mode, the internal encoding is
UTF-16 and the supported input encodings are ISO-8859-N (1 <= N <= 9),
UTF-16 and UTF-8.

The system has been developed under Unix (Solaris and FreeBSD), but has
also been compiled and run under MS Windows.  It should work with minor
changes on other systems.

To compile under Unix, type "make" for 8-bit mode, or
"make CHAR_SIZE=16" for 16-bit mode.

A simple application (called rxp) is provided that parses and writes
XML data, optionally expanding entities, defaulting attributes, and
translating to a different output encoding.

There is a short manual page for rxp, but no other documentation yet.

Notes on 8- and 16-bit character modes:

The type char8 is typedefed to char.  Unsigned char would have been
better than char, but causes too many compiler warnings in
applications.

The type char16 is typedefed to unsigned short.

The type Char is typedefed to either char8 or char16 according to the
compilation mode.
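
The typedef arrangement described in the notes above can be sketched
roughly as follows.  This is an illustration only, not a copy of the RXP
headers: it assumes that the CHAR_SIZE value passed to make reaches the
compiler as a preprocessor macro, and the helpers char_len and
high_name_char8 are hypothetical names invented for this example.  The
second helper shows why a plain (possibly signed) char needs a cast
before it is compared with 0xa0 in 8-bit mode.

```c
#include <assert.h>
#include <stddef.h>

/* Assumption: CHAR_SIZE comes from the build (e.g. "make CHAR_SIZE=16").
   Default to 8 here so that the sketch compiles on its own. */
#ifndef CHAR_SIZE
#define CHAR_SIZE 8
#endif

typedef char char8;             /* plain char, as the notes describe */
typedef unsigned short char16;  /* 16-bit code unit, as the notes describe */

/* Char follows the compilation mode. */
#if CHAR_SIZE == 16
typedef char16 Char;
#else
typedef char8 Char;
#endif

/* Hypothetical helper, not part of RXP:
   length of a zero-terminated Char string in either mode. */
static size_t char_len(const Char *s)
{
    size_t n = 0;

    assert(s != NULL);
    while (s[n] != 0)
        n++;
    return n;
}

/* Hypothetical helper, not part of RXP:
   the 8-bit-mode rule that characters above 0xa0 count as name
   characters.  char8 may be signed, so cast to unsigned char before
   comparing; otherwise high characters would look negative. */
static int high_name_char8(char8 c)
{
    return (unsigned char)c > 0xa0;
}
```

Code written against Char rather than char8 or char16 directly, as
char_len is here, should compile unchanged in both modes.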