xml encoding detector in C


Date:     Thu, 22 Apr 1999 18:10:40 -0400
From:     John Cowan <cowan@locke.ccil.org>
To:       XML Dev <xml-dev@ic.ac.uk>
Subject:  ANNOUNCE: xml encoding detector in C

I have written an XML encoding detector function in C. It would be easy to translate it to Java, but I thought that C would be the most useful in different contexts. It uses only Standard C facilities.

There is a subroutine called "xmlenc" which accepts a FILE* argument and returns a (static) string representing the encoding. I believe it handles all the cases in Appendix F correctly, including the EBCDIC one.

There is also a test-harness main program that can generate some sample files in EBCDIC and 16-bit Unicode (8-bit ASCII-compatible files are easy to find). This part can be stripped out, as indicated by the comments, in order to use the routine in some server program.

No copyright, no warranty; I assert the moral right to be known as the author.

Download from:

      http://www.ccil.org/~cowan/XML/xmlenc.c


John Cowan      http://www.ccil.org/~cowan              cowan@ccil.org
        You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
        You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
                Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 
981-02-3594-1