[Rocket Science - "Mappa.Mundi's first installment of Rocket Science explores how Invisible Worlds' EDGARspace takes a step above the SEC's EDGAR database."]
space.cgi Interface: "Space.cgi is a web proxy which makes the services of the SpaceServer engine available to traditional web browsers. The EDGARspace portal is built using this interface. Underneath the space.cgi interface is a rich architecture of protocols, servers, and other modules."
"The core concept to understand is the retrieve, evaluate, publish paradigm. All calls to space.cgi exercise this paradigm: data are retrieved from the SpaceServer. The metadata are fed into an evaluate script (or a series of scripts) to look for relationships among the data, then the results of the evaluate stage are fed into the publish stage for formatting."
"The SpaceServer and the underlying SpaceEngine (the data store) use XML as a way of structuring data. While our interfaces have selected several elements to query on, you can specify any of the elements and attributes in the Document Type Definition (DTD) that was used to check data into the SpaceServer. Currently, our SpaceServer is aware of two kinds of data: (1) Internet RFCs (doc.rfc) (2) SEC EDGAR Documents (doc.edgar)"
XML DTDs - The DTDs define the structure of data stored in the SpaceServer. By looking at these DTDs, you can see what types of queries are possible.
[September 16, 1999] "Internet Pioneers Build a Better EDGAR Using XML. New Web Service Delivers Dramatically Improved Searches of SEC Filings With the First XML-based Financial Information System." - "Invisible Worlds, a San Francisco-based startup company headed by a team of Internet veterans responsible for many of the most significant innovations and standards behind the Internet, today unveiled the EDGARspace portal, a new Web service that delivers dramatically refined searches of the U.S. Securities and Exchange Commission's (SEC) EDGAR filings. 'The EDGARspace service is the first glimpse into the way XML-based Internet information systems will work in the future,' said Invisible Worlds' CEO and Chairman Carl Malamud, who first put EDGAR on the Internet five years ago. 'For the first time, you can reach inside EDGAR filings for gems of knowledge that were previously buried in text and also rise above the immense collection of documents to make sense of broad searches.' The EDGARspace portal, one of the largest XML-based (Extensible Markup Language) information systems ever developed, demonstrates the potential of this new Web standard by giving investment, financial and research professionals better ways to search for and find information that had been difficult to obtain. . . The SEC filings are enhanced using industry-standard XML to tag key information. For example, a search can target all insider-trading reports within an investor's portfolio, or look for Initial Public Offerings within a particular industry segment. EDGARspace organizes search results with XML 'meta-information,' making large result sets more manageable. Complex search results can be organized by any type of meta-information, such as filing type, document section, industry code, date or company name."
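The idea of organizing a result set by a chosen piece of meta-information can be sketched in a few lines of Python. The result records, their field names, and the `organize` function are hypothetical; the announcement does not describe how EDGARspace implements this.

```python
from collections import defaultdict

# Hypothetical search results; the field names are assumptions for illustration.
RESULTS = [
    {"title": "Annual report", "filing_type": "10-K", "company": "Acme Corp"},
    {"title": "Current report", "filing_type": "8-K", "company": "Acme Corp"},
    {"title": "Annual report", "filing_type": "10-K", "company": "Example Inc"},
]


def organize(results, key):
    """Group result titles under each value of the chosen meta-information key."""
    groups = defaultdict(list)
    for result in results:
        # Results lacking the key are collected under a placeholder label.
        groups[result.get(key, "(unknown)")].append(result["title"])
    return dict(groups)
```

The same result set can be regrouped by passing a different key, for example `organize(RESULTS, "company")` instead of `organize(RESULTS, "filing_type")`.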
"The Importance of Being EDGAR. Who We Are, What We Do." By Carl Malamud (CEO, Invisible Worlds). "A general introduction to SpaceServers and other components of the Blocks Architecture."
EDGARspace portal - example of an application that uses space.cgi, the web proxy interface to the SpaceServer.
Danny Goodman SpaceKit - example of an application that uses space.cgi, the web proxy interface to the SpaceServer.