The Cover PagesThe OASIS Cover Pages: The Online Resource for Markup Language Technologies
SEARCH | ABOUT | INDEX | NEWS | CORE STANDARDS | TECHNOLOGY REPORTS | EVENTS | LIBRARY
SEARCH
Advanced Search
ABOUT
Site Map
CP RSS Channel
Contact Us
Sponsoring CP
About Our Sponsors

NEWS
Cover Stories
Articles & Papers
Press Releases

CORE STANDARDS
XML
SGML
Schemas
XSL/XSLT/XPath
XLink
XML Query
CSS
SVG

TECHNOLOGY REPORTS
XML Applications
General Apps
Government Apps
Academic Apps

EVENTS
LIBRARY
Introductions
FAQs
Bibliography
Technology and Society
Semantics
Tech Topics
Software
Related Standards
Historic
Last modified: February 08, 2002
Protein Extensible Markup Language (PROXIML)

[October 26, 2001] PROXIML is a research 'Bioinformatics Project' [CMPS243] hosted at the University of California, Santa Cruz. The principal investigator is Douglas C. McArthur. "Problems associated with existing protein data formats, such as PDB and mmCIF, indicate a need for a more self-describing and machine-readable approach to exchanging protein-related data. XML (eXtensible Markup Language) is an ideal solution for this particular problem. However, most existing XML-based efforts (such as BIOML and ProML) rely on the W3C XML DTD (document type definition) to describe and validate the structure of their contents. The XML DTD approach imposes severe limitations upon both the structure and ability to validate an XML document. An alternative utilizing the W3C XML Schema approach to document definition overcomes many of these limitations. This approach has recently been adopted for CML, a general purpose chemical markup language. As an extension of the CML schema, PROXIML can encode the relevant details of protein structure in a more robust and well-structured fashion than other currently available data formats..." [Status: 2001-03-16.]

The popularity of XML in the area of bioinformatics has clearly grown in the past few years. XML provides the capability of representing protein data in a single, standardized data structure. However, the structure of XML documents defined using a DTD is limited to representing data in a hierarchical tree fashion. While some portion of protein-related data can be effectively stored in this way, a significant amount of protein-related data is better represented as an arbitrary graph rather than a hierarchical tree. The XML Schema approach, coupled with XML Linking Language (XLink) allows representation of non-hierarchical data within an XML document in a self-describing fashion. Additionally, validation of both the structure and the content (with regard to specific datatypes) is greatly facilitated by an XML Schema vs. the XML DTD. By combining elements of three separate XML-based languages using an XML Schema approach, PROXIML is able to encode the relevant details of protein structure in a more robust and well-structured fashion than the current PDB and mmCIF data formats. Adoption however will ultimately depend heavily on the availability of tools (such as viewers and converters) that support the new format..."

References:


Hosted By
OASIS - Organization for the Advancement of Structured Information Standards

Sponsored By

IBM Corporation
ISIS Papyrus
Microsoft Corporation
Oracle Corporation

Primeton

XML Daily Newslink
Receive daily news updates from Managing Editor, Robin Cover.

 Newsletter Subscription
 Newsletter Archives
Globe Image

Document URI: http://xml.coverpages.org/proximl.html  —  Legal stuff
Robin Cover, Editor: robin@oasis-open.org