Contents
The Open Archives Initiative has published an Implementation Guideline specification for Conveying Rights Expressions About Metadata in the OAI-PMH Framework. This specification defines mechanisms for data providers to associate XML-based rights expressions with harvested metadata that is queried and delivered via service providers using the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH).
Detailed examples are provided in the specification for declaring rights using the Creative Commons and GNU licenses; however, the rights expression mechanism under the OAI-PMH data model is agnostic as the particular rights expression language used by the data provider. The XML-based rights expressions indicate how data may be used, shared, and modified after it has been harvested.
The OAI Protocol for Metadata Harvesting (OAI-PMH) has now become the most widely adopted approach for publication of both "data" and "metadata" by online digital libraries and archive centers. A list of registered OAI conforming repositories (perhaps not current) identifies some 286 digital library projects using this federated database approach, and the OAIster Digital Library Production Service at the University of Michigan stores some 5,366,375 records of digital resources from 472 institutions.
The essence of the open archives approach is "to enable access to Web-accessible material through interoperable repositories for (meta-)data sharing, publishing and archiving. OAI develops and promotes a low-barrier interoperability framework and associated standards based upon open protocols. In the OAI model, a data provider maintains one or more repositories (web servers) that support the OAI-PMH as a means of exposing metadata; a service provider issues OAI-PMH requests to data providers and uses the metadata as a basis for building value-added services.
According to the online tutorial, OAI-PMH provides "a simple technical option for data providers to make their metadata available to services, based on the open standards HTTP (Hypertext Transport Protocol) and XML (Extensible Markup Language). The metadata that is harvested may be in any format that is agreed by a community (or by any discrete set of data and service providers), although unqualified Dublin Core is specified to provide a basic level of interoperability. MARCXML, METS, and OLAC are also popular supported formats. The OAI-PMH protocol is based on HTTP with support for flow control. Request arguments are issued as GET or POST parameters. OAI-PMH supports six request types, known as verbs; responses are encoded in XML syntax. OAI-PMH supports any metadata format encoded in XML, and OAI-PMH defines a single XML Schema to validate responses to all OAI-PMH requests."
The new OAI Guidelines document specifies the mechanisms by which rights expressions can be conveyed at three different levels representing OAI-PMH data model entities: the record level, the repository level, and the set aggregation level.
At the record level, metadata returned in OAI-PMH ListRecords and GetRecord responses "is packaged in <record> elements. Each <record> may include one or more <about> containers that provide information about the metadata. A rights expression can be associated with the metadata by including an optional <rights> package in an <about> container. The specification defines an XML schema to which such <rights> packages must conform. The schema allows for actual rights expressions to be specified by one of two mutually exclusive alternatives: in-line within a <rights> package, or by-reference via a URI of a network-accessible rights expression. In the by-reference encoding, there are no restrictions on the nature of the resource referenced by this URI; however, to facilitate machine processing, it is recommended that an XML formatted rights expression is accessible via the <rightsReference> URI."
At the repository level, "the response to the OAI-PMH Identify request contains information about the repository. This response may include one or more <description> containers that provide an extensible mechanism for repository description. This specification defines an optional mechanism for delivering, within a <description> container, a manifest of the rights expressions used in the repository. The purpose of this manifest, defined in an XML schema, is to provide a convenient way for harvesters to determine the rights expressions associated with metadata disseminated by a repository.
For rights expressed at the set level, the response to the OAI-PMH ListSets request contains information about the set structure of the repository. Sets are a mechanism to provide selective harvesting from a repository. This response may include one or more <setDescription> containers that provide an extensible mechanism for set description."
The editor's announcement for the Implementation Guideline on Conveying Rights Expressions clarifies how the guidelines specify a mechanism for including rights expressions that pertain to the contents of the <metadata> parts of records in OAI-PMH responses. No new rights expression language has been created. Instead, the specification provides a mechanism to include existing and future XML rights expressions. Description of rights expressions associated with set and repository aggregations is supported through manifests of rights expressions in set and repository descriptions. The design has been guided by the need for simple and clear semantics that will allow service-providers to make harvesting and use decisions based on these rights expressions."
The XML-based markup system for declarative rights expression for OAI-PMH exemplifies a growing preference for light-weight and medium-weight rights management for digital resources on the Internet. While the media industry tries to promlugate a uniform commercial DRM method for draconian rights restriction and enforcement, this goal has repeatedly failed, due to DRM patent terrorism and litigation about DRM patents, and the public lack of interest in fee-based software that invades personal computers. The OAI-PMH approach, like Creative Commons licenses and related solutions, places minimum importance upon legal enforcement: instead, it enables the creation of massive federated databases supported by unified search engines that allow consumers to identify digital content matching their requirements, and then to access digital assets using protocols based upon open standards.
The OAI-PMH Guideline on Conveying Rights Expressions notes that "actions in response to a failure of a harvester to abide by rights expressions are outside the scope of this specification." Harvesters "should look for and abide by the contents of any rights expressions included using this mechanism within records they harvest, [and] clients should act in good faith and abide by the rights expressions about the metadata they harvest."
The rapidly growing popularity of the "Commons" approach is understandable, especially when compared to the commercial DRM alternative, which tends to view every user as a pirate and a criminal. Public sensibilities about openness (open standards, open protocols, open source software, democratized open access to digital information), strengthened by the success of Google, Creative Commons, RSS/Atom, hundreds of public OAI-PMH repositories, and "the Worldwide Web" in general, are making it extremely difficult to convince consumers to pay for intrusive commercial DRM software that regularly prevents the user from doing something very sensible with digital information (select, copy, print, view-a-second-time, forward) that deserves to be used and shared on reasonable terms. The clear terms for "reasonable use" as supported by OAI-PMH rights expression and Creative Commons licenses provide an adequate model for many (perhaps most) use cases in a civil society that depends upon informed citizens.
In March 2005 the ODRL Initiative announced the release of the ODRL Creative Commons Profile as a Draft Specification. This draft "describes the semantics of the Creative Commons licenses and defines how they can be represented using a Profile of the ODRL rights expression language." The Open Digital Rights Language (ODRL) specification is a publicly available XML-based standard language and vocabulary for the expression of terms and conditions over assets. It covers a core set of semantics for these purposes including the rights holders and the expression of permissible usages for asset manifestations. Both ODRL and Creative Commons licenses use XML markup to express rights assertions in digital works, and neither has licensing requirements for use of the technology. In July 2004, the DCMI Usage Board announced approval of the rights-related terms "License" and "Rights Holder" in machine-readable metadata declarations for Dublin Core.
Bibliographic Information
Conveying Rights Expressions About Metadata in the OAI-PMH Framework. Protocol Version 2.0 of 2002-06-14. Edited by Carl Lagoze (OAI Executive; Cornell University - Computer Science); Herbert Van de Sompel (OAI Executive; Los Alamos National Laboratory - Research Library); Michael Nelson (Old Dominion University - Computer Science); Simeon Warner (Cornell University - Computer Science). Series: Implementation Guidelines for the Open Archives Initiative Protocol for Metadata Harvesting. Document Version: 2005/05/03T22:55:00Z. Version URL: http://www.openarchives.org/OAI/2.0/guidelines-rights.htm. This document is one part of the Implementation Guidelines that accompany the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH).
Support for the development of the OAI-PMH and for other Open Archives Initiative activities has come from a number of sources including the National Science Foundation, the Digital Library Federation, and the Coalition for Networked Information (CNI).
Contributors: "The development of these guidelines was inspired, in part, by the work of the RoMEO project. Individuals who have played a significant role in the development of these guideline are: Caroline Arms (Library of Congress), Chris Barlas (Rightscom), Tim Cole (University of Illinois at Urbana-Champaign), Mark Doyle (American Physical Society), Henk Ellerman (Erasmus Electronic Publishing Initiative), John Erickson (Hewlett Packard & DSpace), Elizabeth Gadd (Loughborough University & RoMEO), Brian Green (EDItEUR), Chris Gutteridge (Southampton University & eprints.org), Carl Lagoze (Cornell University & OAI), Mike Linksvayer (Creative Commons), Uwe Müller (Humboldt University), Michael Nelson (Old Dominion University & OAI), John Ober (California Digital Library), Charles Oppenheim (Loughborough University & RoMEO), Sandy Payette (Cornell University), Andy Powell (UKOLN, University of Bath), Steve Proberts (Loughborough University & RoMEO), Herbert Van de Sompel (Los Alamos National Laboratory & OAI), and Simeon Warner (Cornell University, arXiv & OAI)."
About the Open Archives Initiative (OAI)
The Open Archives Initiative develops and promotes interoperability standards that aim to facilitate the efficient dissemination of content. The Open Archives Initiative has its roots in an effort to enhance access to e-print archives as a means of increasing the availability of scholarly communication. Continued support of this work remains a cornerstone of the Open Archives program. The fundamental technological framework and standards that are developing to support this work are, however, independent of the both the type of content offered and the economic mechanisms surrounding that content, and promise to have much broader relevance in opening up access to a range of digital materials. As a result, the Open Archives Initiative is currently an organization and an effort explicitly in transition, and is committed to exploring and enabling this new and broader range of applications. As we gain greater knowledge of the scope of applicability of the underlying technology and standards being developed, and begin to understand the structure and culture of the various adopter communities, we expect that we will have to make continued evolutionary changes to both the mission and organization of the Open Archives Initiative.
Support for Open Archives Initiative activities has come from the Digital Library Federation, the Coalition for Networked Information, and from the National Science Foundation.
Technical work on the OAI has been carried out by a set of technical committees, organized for specific purposes such as protocol releases. These committees evaluate the effectiveness of the OAI interoperability architecture and propose changes and enhancements based on community experience... The OAI Executive Committee manages the operational details of the Initiative under the oversight of the Steering Committee and with the support of the Technical Committee..." [adapted from About]
Principal References
- OAI-PHM rights:
- OAI and OAI-PMH
- Open Archives Initiative
- About OAI
- Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)
- Implementation Guidelines for the Open Archives Initiative Protocol for Metadata Harvesting
- Specification for an OAI Static Repository and an OAI Static Repository Gateway
- XML Schema to describe content and policies of repositories in the e-print community
- Registered OAI conforming repositories. On 2005-05-05, some 286 digital library projects were registered. The table may be sorted by Repository Name, base URL, or Repository Identifier.
- Base URLs of registered data providers. XML format.
- Registered OAI Service Providers. These participants issue OAI protocol requests to the systems of data providers and use the returned metadata as a basis for building value-added services.
- Example OAI Service Providers:
- OAIster: A project of the University of Michigan Digital Library Production Service, creating a collection of freely available, previously difficult-to-access, academically-oriented digital resources, searchable by anyone; 5,366,375 records from 472 institutions as of 5-May-2005
- ARC: A unified, cross archive search service from Old Dominion University Digital Library Research Group
- Public Knowledge Project (PKP): Open Archives Harvester, 122419 indexed papers from 120 archives as of 5-May-2005.
- DP9: An OAI Gateway Service for Web Crawlers. XML/XSLT for processing; supports OAI name resolution and service linking.
- Open Language Archives Community (OLAC): Creating a virtual library of language resources; OLAC metadata is disseminated using the metadata harvesting protocol of the Open Archives Initiative (OAI).
- OAI Tools
- Open Archives Initiative Frequently Asked Questions (FAQ)
- Open Archives Forum online tutorial. By Leona Carpenter, for OA-Forum and UKOLN.
- OAI News
- OAI mailing lists:
- OAI-general. For general information about OAI-related activities.
- OAI-implementers. List to discuss implementation of the OAI protocol.
- "Resource Harvesting within the OAI-PMH Framework." By Herbert Van de Sompel (Los Alamos National Laboratory, Research Library), Michael L. Nelson (Old Dominion University, Computer Science Department), Carl Lagoze (Cornell University, Computing and Information Science) and Simeon Warner ( Cornell University). From D-Lib Magazine Volume 10 Number 12 (December 2004).
- Creative Commons
- Creative Commons web site
- Science Commons web site
- iCommons: Creative Commons Worldwide. Internationalization of the Creative Commons Idea.
- Creative Commons Licenses: Examples
- "Creative Commons Project" - Main reference page.
- "ODRL Initiative Releases Draft Specification of ODRL Creative Commons Profile." News story 2005-03-07.
- "Science Commons Develops Legal and Technical Tools for Sharing Scientific Innovation." News story 2004-12-31.
- "Delivering Classics Resources with TEI-XML, Open Source, and Creative Commons Licenses." News story 2004-04-28.
- "RSS 2.0 Specification Published by Berkman Center Under Creative Commons License." News story 2003-07-22.
- RoMEO Project:
- Other XML Schemas for rights expression:
- "DCMI Usage Board Announces Approval of Metadata Terms for Digital Rights Declaration." The Usage Board of the Dublin Core Metadata Initiative (DCMI) announced approval of the rights-related terms "License" and "Rights Holder" in machine-readable metadata declarations. The proposed standard of practice for encoding formal rights declarations identifying IP owners and machine-readable licenses provides a basis for distributed de-centralized rights management systems. Use of Dublin Core specifications and Creative Commons licenses involves no royalty or usage fee, as is true also for the XML-based Open Digital Rights Language (ODRL).
- Metadata Encoding and Transmission Standard (METS). A draft XML Rights Schema for METS has been prepared to support declaration and documentation of some basic facts about the digital collections being created and/or included in institutional digital repositories; see the announcement.
- ODRL Creative Commons Profile. The draft describes the semantics of the Creative Commons licenses and defines how they can be represented using a Profile of the Open Digital Rights Language (ODRL) rights expression language. ODRL is an XML-based rights expression language which has no license restrictions for use.
- RelLicense. By Tantek Çelik and Technorati. RelLicense, discussed in a blog entry, is one of several MicroFormats. By adding rel="license" to a hyperlink, a page indicates that the destination of that hyperlink [reports the] license for the current page. E.g., with the following hyperlink <a href="http://creativecommons.org/licenses/by/2.0/" rel="license">cc by 2.0</a> the author indicates that the page is licensed under a Creative Commons 2.0 Attribution Required license.
- "Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)" - Main reference page.
- "Patents and Open Standards" - Main reference page.
- "XML and Digital Rights Management (DRM)" - Main reference page.