[Provisional description for MEDLINE XML and associated XML data formats.]
From a 1999-09 Bulletin: "As part of its reinvention efforts, NLM continues to refine the format that will be used for our forthcoming data creation and maintenance system and for distribution of MEDLINE data. This new XML-based format will be used when NLM offers its leased data via ftp. The new format will be a greatly expanded and somewhat modified version of the SGML-based format currently used by publishers who submit citation and abstract data electronically to NLM for entry to see PubMed. We plan to offer the XML format via ftp in parallel with continued distribution of the data in ELHILL Unit Record Format (EURF) on tape until all licensees have had sufficient time to transition over to the new format."
[July 29, 2000] Documentation on XML DTDs: Sample NLM Data Available: MEDLINE. Sample MEDLINE data in NLM's new XML format (May 9, 2000 version) is available for ftp. Instructions are below. Please see http://www.nlm.nih.gov/bsd/licensee.html for more information on NLM's XML format. The NLM MEDLINE DTD is available at: http://www.nlm.nih.gov/databases/dtd/nlmmedline.dtd. This new DTD defines the entities and references the MedlineCitation DTD (http://www.nlm.nih.gov/databases/dtd/nlmmedlinecitation.dtd) which in turn references the NLMCommon DTD (http://www.nlm.nih.gov/databases/dtd/nlmcommon.dtd). The MEDLINE DTD, therefore, is the "parent" DTD and the starting point for MEDLINE licensees. There are two groups of sample records available as compressed and also uncompressed files. One (file name sampmed1) contains close to 41,000 MEDLINE records and the other (file name is sampmed2) contains a small sampling of 21 representative records. Information for Licensees of NLM Data can be found at: http://www.nlm.nih.gov/bsd/licensee.html. Cache version 2000-07-29 below.
PubMed SGML DTD ['NCBI PubMed DTD Version 1.5, September 29, 1997']
See also: National Library of Medicine (NLM) - SGML Applications