[This local archive copy mirrored from: http://www.ncbi.nlm.nih.gov/PubMed/spec.html; see the canonical version of the document.]

The NCBI PubMed Project

NCBI standard publisher data format

This is our standard data format for publishers to use in submitting citation data to NCBI for processing into the MEDLINE or PubMed databases. This is a tagged format, meaning that each part of a citation is preceded by a <Tag> string and followed by a </Tag> string, where Tag is some appropriate label. The first two sections list the tags used and what information they enclose; the third gives an example. If you need further assistance, please email us at pubmed@ncbi.nlm.nih.gov.

Note! While we encourage the use of this format, we do not require it; any format that provides the same level of detail is adequate. If you wish to use another format, please email us and provide a sample.

This document contains:


The SGML Tags

Data Tags (R = Required, O = Optional) :

ArticleSet (R) An entire submission of the set of articles. Each issue of a given journal must be enclosed in these tags.
Article (R) An article submission. Each article must be enclosed in these tags.

Special Characters

Characters not in the standard ASCII character set must be represented using standard SGML entity codes. For instance, "&ccedil;" represents a c cedilla. "&rsquo;" in the example below is a right single-quote.

Example:

<ArticleSet>
<Article>
<Journal>
<PublisherName>AAAS</PublisherName>
<JournalTitle>SCIENCE</JournalTitle>
<Issn>9731-864X</Issn>
<Volume>271 suppl. 3</Volume>
<Issue>5</Issue>
<PubDate>
<Year>1996</Year>
<Month>Mar</Month>
<Day>3</Day>
</PubDate>
</Journal>
<ArticleTitle>The Elasticity of a Single Supercoiled DNA Molecule</ArticleTitle>
<FirstPage>1835</FirstPage>
<LastPage>1837</LastPage>
<Language>EN</Language>
<AuthorList>
<Author>
<FirstName>Kenneth</FirstName>
<MiddleName>S.</MiddleName>
<LastName>Strick</LastName>
<Suffix>Jr.</Suffix>
<Affiliation>
Laboratoire de Biophysique de l’ADN, Institut Pasteur, 25-28 rue du Dr Roux, Paris, 75015 France. </Affiliation>
</Author>
<Author>
<FirstName>J.-F.</FirstName>
<LastName>Allemand</LastName>
<Affiliation>
<Institution> Laboratoire de Physique Statistique de l&rsquo;ENS, </Institution>
<StreetAddress>24 rue Lhomond</StreetAddress>
<City>Paris</City>
<PostalCode>75015</PostalCode>
<Country>France</Country>
</Affiliation>
</Author>
</AuthorList>
<PublicationType>JOURNAL ARTICLE</PublicationType>
<FullTextURL> http://www.oup.co.uk/nar/Volume_24/Issue_02/5c0194_gml.abs.html </FullTextURL>
<SummaryURL>
uk,fr,sp,it,de,nd,no::http://www.oup.co.uk/nar;
jp::http://www.nar.com/nar;
*::http://www.nar.com/nar
</SummaryURL>
<PublisherId>sc271_5_1835</PublisherId>
<Abstract>
Single linear DNA molecules were bound at multiple sites at one extremity to a treated glass cover slip and at the other to a magnetic bead. The DNA was therefore torsionally constrained. A magnetic field was used to rotate the beads and thus to coil and pull the DNA. The stretching force was determined by analysis of the Brownian fluctuations of the bead. Here, the elastic behavior of individual &lgr; DNA molecules over- and underwound by up to 500 turns was studied. A sharp transition was discovered from a low to a high extension state at a force of ∼0.45 piconewtons for underwound molecules and at a force of ∼ 3 piconewtons for overwound ones. These transitions, probably reflecting the formation of alternative structures in stretched coiled DNA molecules, might be relevant for DNA transcription and replication.
</Abstract>
<Keywords>
magnetic field;DNA transcription;Elasticity
</Keywords>
</Article>
</ArticleSet>

SUBSET OF LANGUAGE CODES

The following is a subset of the ISO 639 standard for language codes.

CODE    LANGUAGE
----    --------
DA      Danish
DE      German
EN      English
EL      Greek
ES      Spanish
FR      French
IT      Italian
IW      Hebrew
JA      Japanese
NL      Dutch
NO      Norwegian
RU      Russian
SV      Swedish
ZH      Chinese

The NCBI PubMed DTD

<!DOCTYPE ArticleSet [

<!-- ncbi.dtd; NCBI PubMed DTD Version 1.5, September 29, 1997 -->

<!ENTITY % ISOlat1 PUBLIC "ISO 8879-1986//ENTITIES Added Latin 1//EN">
%ISOlat1;
<!ENTITY % ISOlat2 PUBLIC "ISO 8879-1986//ENTITIES Added Latin 2//EN">
%ISOlat2;
<!ENTITY % ISOnum PUBLIC "ISO 8879-1986//ENTITIES Numeric and Special Graphic//EN">
%ISOnum;
<!ENTITY % ISOpub PUBLIC "ISO 8879-1986//ENTITIES Publishing//EN">
%ISOpub;
<!ENTITY % ISOgrk1 PUBLIC "ISO 8879-1986//ENTITIES Greek Letters//EN">
%ISOgrk1;
<!ENTITY % ISOgrk2 PUBLIC "ISO 8879-1986//ENTITIES Monotoniko Greek//EN">
%ISOgrk2;
<!ENTITY % ISOgrk3 PUBLIC "ISO 8879-1986//ENTITIES Greek Symbols//EN">
%ISOgrk3;
<!ENTITY % ISOtech PUBLIC "ISO 8879-1986//ENTITIES General Technical//EN">
%ISOtech;
<!ENTITY % ISOdia PUBLIC "ISO 8879-1986//ENTITIES Diacritical Marks//EN">
%ISOdia;
<!ENTITY % ISOamso PUBLIC "ISO 8879-1986//ENTITIES Added Math Symbols: Ordinary//EN">
%ISOamso;
<!ENTITY % ISOamsb PUBLIC "ISO 8879-1986//ENTITIES Added Math Symbols: Binary Operators//EN">
%ISOamsb;
<!ENTITY % ISOamsr PUBLIC "ISO 8879-1986//ENTITIES Added Math Symbols: Relations//EN">
%ISOamsr;
<!ENTITY % ISOamsn PUBLIC "ISO 8879-1986//ENTITIES Added Math Symbols: Negated Relations//EN">
%ISOamsn;
<!ENTITY % ISOamsa PUBLIC "ISO 8879-1986//ENTITIES Added Math Symbols: Arrow Relations//EN">
%ISOamsa;
<!ENTITY % ISOamsc PUBLIC "ISO 8879-1986//ENTITIES Added Math Symbols: Delimiters//EN">
%ISOamsc;
<!ENTITY % ISObox PUBLIC "ISO 8879-1986//ENTITIES Box and Line Drawing//EN">
%ISObox;
<!ENTITY % ISOcyr1 PUBLIC "ISO 8879-1986//ENTITIES Russian Cyrillic//EN">
%ISOcyr1;
<!ENTITY % ISOcyr2 PUBLIC "ISO 8879-1986//ENTITIES Non-Russian Cyrillic//EN">
%ISOcyr2;

<!ENTITY % data "(#PCDATA | e1 | e2 | e3 | e4 | e5 | e6
     | e7 | e8 | e9 | sup | inf)*">

<!-- This is the top level element -->
<!ELEMENT ArticleSet - - ( Article+)>

<!ELEMENT Article - - ( Journal, Replaces?, ArticleTitle, FirstPage,
      LastPage?, Language?, AuthorList?,
      PublicationType?, FullTextURL?, SummaryURL?,
      PublisherId?, Abstract?, Keywords? )>
<!ELEMENT Journal - - ( PublisherName, JournalTitle, Issn,
      Volume, Issue?, Part?, PubDate,
      HoldDate? )>

<!ELEMENT PublisherName - - (#PCDATA)>
<!ELEMENT JournalTitle - - (#PCDATA)>
<!ELEMENT Issn - - (#PCDATA)>
<!ELEMENT Volume - - (#PCDATA)>
<!ELEMENT Issue - - (#PCDATA)>
<!ELEMENT Part - - (#PCDATA)>

<!ELEMENT PubDate - - ( year, month?, day?, season? )>
<!ELEMENT Year - - (#PCDATA)>
<!ELEMENT Month - - (#PCDATA)>
<!ELEMENT Day - - (#PCDATA)>
<!ELEMENT Season - - (#PCDATA)>
<!-- End of PubDate group -->

<!ELEMENT HoldDate - - ( Year, Month, Day )>

<!-- End of Journal group -->

<!ELEMENT Replaces - - (#PCDATA)>
<!ELEMENT ArticleTitle - - (#PCDATA, SubTitle? )>
<!ELEMENT SubTitle - - (#PCDATA)>
<!-- End of ArticleTitle group -->

<!ELEMENT FirstPage - - (#PCDATA)>
<!ELEMENT LastPage - - (#PCDATA)>
<!ELEMENT Language - - (#PCDATA)>
<!ELEMENT AuthorList - - ( Author+ )>
<!ELEMENT Author - - ( (FirstName, MiddleName?, LastName, Suffix?)  |
           CollectiveName), Affiliation? )>

<!ELEMENT FirstName - - (#PCDATA)>
<!ELEMENT MiddleName - - (#PCDATA)>
<!ELEMENT LastName - - (#PCDATA)>
<!ELEMENT CollectiveName - - (#PCDATA)>
<!ELEMENT Suffix - - (#PCDATA)>

<!ELEMENT Affiliation - - ( #PCDATA | ( Institution?, Division?,
      StreetAddress?, ( City | PostalCode | Country )*,
      Phone?, Fax?, Email? )>
<!ELEMENT Institution - - (#PCDATA)>
<!ELEMENT Division - - (#PCDATA)>
<!ELEMENT City - - (#PCDATA)>
<!ELEMENT Country - - (#PCDATA)>
<!ELEMENT StreetAddress - - (#PCDATA)>
<!ELEMENT PostalCode - - (#PCDATA)>
<!ELEMENT Phone - - (#PCDATA)>
<!ELEMENT Fax - - (#PCDATA)>
<!ELEMENT Email - - (#PCDATA)>
<!-- End of Affiliation group -->

<!-- End of Author group -->

<!-- End of AuthorList group -->

<!ELEMENT PublicationType - - (#PCDATA)>
<!ELEMENT FullTextURL - - (#PCDATA)>
<!ELEMENT SummaryURL - - (#PCDATA)>
<!ELEMENT PublisherId - - (#PCDATA)>
<!ELEMENT Abstract - - (%data;)+>
<!ELEMENT Keywords - - (%data;)+>

<!-- End of Article group -->

<!-- End of ArticleSet group -->

<!-- Commonly used formatting elements -->

<!ELEMENT e1 - - (%data;)*>
<!ELEMENT e2 - - (%data;)*>
<!ELEMENT e3 - - (%data;)*>
<!ELEMENT e4 - - (%data;)*>
<!ELEMENT e5 - - (%data;)*>
<!ELEMENT e6 - - (%data;)*>
<!ELEMENT e7 - - (%data;)*>
<!ELEMENT e8 - - (%data;)*>
<!ELEMENT e9 - - (%data;)*>
<!ELEMENT sup - - (%data;)*>
<!ELEMENT sub - - (%data;)*>
<!ELEMENT inf - - (%data;)*>
]>



Credits: Brandon Brylawski, Alexander Levitsky

Last Modified: September 29, 1997