The Evolution of RSS | WebReference

The Evolution of RSS

current page23456789
[next]

The Evolution of RSS

By Andrew King (aking@webreference.com)

Abstract

We look at how RSS has evolved from its humble beginnings through present day and beyond. We survey all versions of RSS, including a feature comparison, a new RSS usage survey, and format and validation information. We also interview the people and explore the standards behind RSS. Learn how the newest version of RSS will move us towards a more Semantic Web.

Update (051401): Now includes more background standards material, 0.92 information, a Dan Libby interview, and extensive reference links.

Introduction

Rich Site Summary (RSS) is a lightweight XML vocabulary for describing metadata about Web sites, ideal for news syndication. Originated by UserLand Software in 1997 and used by Netscape to populate Netscape's My Netscape portal with external newsfeeds ("channels") RSS has taken on a life of its own and has become perhaps the most popular XML format today. Thousands of Web sites today use RSS as a "what's new" mechanism to drive traffic their way. This article discusses the various versions of RSS, the standards behind them (DTDs, RDF, namespaces), and the future of RSS. This article assumes you have some familiarity with RSS. For more information see our introduction to RSS.

Netscape Drops RSS

In the last week of April 2001 as part of their My Netscape redesign, AOL stopped including external RSS news feeds in their service. They also (probably inadvertently) removed the RSS 0.91 validator and most importantly the DTD, which most 0.91 RSS feeds point to. Thankfully, in response to feedback, Dan Libby (the author of the RSS 0.9/0.91 DTDs) reports that Netscape has replaced the missing files, though not the validator. While probably an oversight, the missing DTD caused some validating XML parsers to choke on the missing file, and the RSS community is abuzz. They ask: Where do we go from here?

This episode raises two important issues:

The temporary loss of the USENET archives after Google's acquisition of Deja.com is a similar example.

Tristan Louis, freelance reporter at tnl.net says, "The problem with Netscape is showing some clear vulnerability in the use of XML as it relies on everyone pointing to a single DTD. When that DTD is gone, problems ensue.... We need to ensure redundancy across the network as a whole."

Rael Dornfest creator of O'Reilly's Meerkat RSS news portal and co-editor of the RSS 1.0 spec says, "RSS needs to, 1.0 or otherwise, move away from the static DTD model and reliance on a single point of availability (read: inevitable failure)."

Lars Marius Garshol, who wrote the original 0.9 DTD, says: "What I don't understand is all this fuss over Netscape removing the DTD. A well-designed RSS tool, whether it validates or not, would not use the DTD at Netscape's site in any case. There are several mechanisms which can be used to control the dereferencing of references from XML documents to their DTDs. These should be used, if not the result will be as described in this article."

RSS Lives

All is not lost, fellow syndicators. Your RSS files didn't break, and there are alternative RSS validators. Most of the aggregators (like Meerkat and My UserLand) work without a DTD as they have validation turn off or work with local copies. The good news is that Netscape put the RSS files back up, including Dan Libby's old Netscape RSS 0.91 spec, explaining their philosophy for RSS. Dan also reports Netscape posted his "futures" document, which "was actually the first RSS format I came up with," and has some similarities to RSS 1.0. Dan credits UserLand, appropriately, for their contribution to RSS 0.91.

http://my.netscape.com/publish/formats/rss-0.9.dtd
http://my.netscape.com/publish/formats/rss-0.91.dtd
http://my.netscape.com/publish/formats/schema-1.0.dtd
http://my.netscape.com/publish/formats/rss-spec-0.91.html
http://my.netscape.com/publish/help/futures.html

To find out more, check out the three RSS-related discussion lists at the end of this article.

RSS Validators

I asked Dave Winer (CEO of UserLand, primary contributor to the 0.91 spec, and author of the 0.92 spec) about validators now that Netscape/AOL/Time-Warner has removed theirs. It turns out that UserLand already has one. You just may not have known it. It's located at:

http://aggregator.userland.com

Dave updated his validator to accept 0.91, 0.92, and 1.0 RSS files. Just enter the URL of your RSS file and the validator will do its thing. Valid feeds can be registered into UserLand's public RSS database, which is used by some aggregators. UserLand's RSS validation/registration procedure is actually better than Netscape's now defunct process since it occurs in two easy steps.

Leigh Dodds has a prototype open source RSS 1.0 validator available at:

http://www.ldodds.com/rss_validator/1.0/validator.html

Leigh says that updating his script (based on Schematron, which uses XSLT style sheets) to validate 0.91 files would be easy, just plug in another XSLT.

Contents


current page23456789
[next]


Created: May 03, 2001
Revised: April 14, 2003

URL: http://webreference.com/authoring/languages/xml/rss/1/