New version of search engine for XML documents
Date: Tue, 31 Aug 1999 09:57:31 -0700 From: Walter Underwood <wunder@infoseek.com> To: xml-dev@ic.ac.uk Subject: ANNOUNCE: new version of search engine for XML documents
Commercial announcement:
Ultraseek Server 3.1 is available with improved support for XML. This is an internet search engine based on Infoseek/GO.com search, and designed for ease-of-admin, scalability, and relevant results. This is our third release with XML support. Ultraseek Server first supported indexing XML documents in version 2.1, way back in September 1998.
The XML support is designed for providing search to users over sets of XML documents. It is not designed as a repository for authors, as a database-like search (SQL, XQL, etc.), nor for arbitrary searches depending on element context (XPath). It is designed for people to type in a word or a phrase and get the most relevant documents without having to learn a query language.
The XML support in 3.1 allows admins to map text inside elements to search fields, and to have different mappings for different root element names (we can't rely on DTDs, since we only require well-formedness, not validity). If one document type uses <author>, another uses <docAuthor>, another uses <byline>, and yet another uses <creator>, all of those can be mapped to "author:name" searches.
It ships with default mappings for:
- TEI
- DocBk XML V3.1.3
- the Bosak religion collection
- the Bosak Shakespear collection
- FGDC Content Standard for Digital Geospatial Metadata
- XMLNews-Story
and a fallback default mapping that covers some common conventions, like <title> or <TITLE> for the document title.
Simple links in XLink format (that is, "href" attributes) will be followed by the spider. Elements qualified by namespaces can be mapped.
For more information, see http://software.infoseek.com/. For detailed XML questions, you can ask me, but for more general software support you'll probably get more prompt answers from our usual support e-mail (it doesn't take vacations).
There is also a 30-day free trial, which is another way to get answers to questions.
Special thanks to James Clark for his high-quality parser (Expat) and his liberal license. And continuing special thanks to the Python community.
wunder
Walter R. Underwood wunder@infoseek.com wunder@best.com (home) http://software.infoseek.com/cce/ (my product) http://www.best.com/~wunder/ 1-408-543-6946
xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@ic.ac.uk Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
Prepared by Robin Cover for the The SGML/XML Web Page archive.