[June 29, 2001] Log Markup Language (LOGML) "is an XML 1.0 application designed to describe log reports of web servers. Web-data mining is one of the current hot topics in computer science. Mining data that has been collected from web server logfiles, is not only useful for studying customer choices, but also helps in organizing web pages. This is accomplished by knowing which web pages are most frequently accessed by the web surfers. The structure of a web site is represented as a web graph (see the XGMML draft specification http://www.cs.rpi.edu/~puninj/XGMML/). In mining the data from the log statistics, we use the web graph in annotating the log information. Further we give summary reports, comprising of information such as client sites, types of browsers and the usage time statistics. We also gather the client activity in a web site as a subgraph of the web site graph. This subgraph can be used to get better understanding of general user activity in the web site. In LOGML, we create a new XML vocabulary to structurally express the contents of the logfile information."
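The idea of annotating a web graph with log statistics, and of treating one client's activity as a subgraph, can be illustrated with a minimal Python sketch. It does not use the LOGML or XGMML vocabularies: the `(client, referrer, page)` tuple layout, the function names, and the sample requests are all assumptions made for illustration.

```python
from collections import Counter


def build_web_graph(requests):
    """Count page hits (nodes) and link traversals (edges).

    Each request is a hypothetical (client, referrer, page) tuple;
    referrer is None when the page was reached directly.
    """
    node_hits = Counter()
    edge_hits = Counter()
    for _client, referrer, page in requests:
        node_hits[page] += 1
        if referrer is not None:
            edge_hits[(referrer, page)] += 1
    return node_hits, edge_hits


def client_subgraph(requests, client):
    """Restrict the annotated graph to the requests of one client."""
    return build_web_graph([r for r in requests if r[0] == client])


# Invented sample data, not taken from any real logfile.
requests = [
    ("10.0.0.1", None, "/index.html"),
    ("10.0.0.1", "/index.html", "/docs.html"),
    ("10.0.0.2", None, "/index.html"),
    ("10.0.0.2", "/index.html", "/about.html"),
    ("10.0.0.1", "/docs.html", "/index.html"),
]

node_hits, edge_hits = build_web_graph(requests)
sub_nodes, sub_edges = client_subgraph(requests, "10.0.0.1")
```

Here `node_hits` and `edge_hits` play the role of the annotated site graph, while `sub_nodes` and `sub_edges` hold only the pages and hyperlinks that one client actually visited.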
"Motivation: Log reports are the compressed version of logfiles. Web masters in general save web server logs in several files. Usually each logfile contains a single day of information. Due to disk space limitation, old log data gets deleted to save new log information. Generally, web masters generate HTML reports of the logfiles and do not have problems keeping them for a long period of time since the HTML reports are an insignificant size. If a web master likes to generate reports for a large period of time, he has to combine several HTML reports to produce a final report. LOGML is conceived to make this task easier. Web masters can generate LOGML reports of logfiles and combine them on a regular basis without much effort. LOGML files can be combined with XSLT to produce HTML reports. LOGML offers the flexibility to combine them with other XML applications, such as SVG, to produce graphics of the statistics of the reports. LOGML in addition can be combined with RDF to provide some metadata information about the web server that is being analyzed. LOGML is based on XGMML that is a language to describe graphs. LOGML can be seen as a snapshot of the web site as the user visits web pages and traverses hyperlinks. LOGML also provides an efficient way to save the user sessions..."
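Combining per-day XML reports into one report for a longer period can be sketched as follows. This is only an illustration under stated assumptions: the `<report>` and `<page url="..." hits="..."/>` elements are a hypothetical simplified format, not the actual LOGML schema, and `merge_reports` is an invented helper name.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def merge_reports(xml_reports):
    """Sum per-page hit counts across several report documents.

    Assumes a hypothetical format: <report><page url=... hits=.../></report>.
    """
    totals = Counter()
    for text in xml_reports:
        root = ET.fromstring(text)
        for page in root.iter("page"):
            totals[page.get("url")] += int(page.get("hits"))

    # Build the combined report, one <page> element per distinct URL.
    merged = ET.Element("report")
    for url, hits in sorted(totals.items()):
        ET.SubElement(merged, "page", url=url, hits=str(hits))
    return merged


# Invented sample reports for two days.
day1 = ('<report><page url="/index.html" hits="10"/>'
        '<page url="/docs.html" hits="4"/></report>')
day2 = ('<report><page url="/index.html" hits="7"/>'
        '<page url="/about.html" hits="2"/></report>')

merged = merge_reports([day1, day2])
merged_xml = ET.tostring(merged, encoding="unicode")
```

Because the merged result is itself an XML document in the same shape as its inputs, it could be merged again with later reports or passed to a stylesheet to render HTML.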