MSC Stirling: RSS First Counter

(from http://www.faganfinder.com/search/rss.php)

What is RSS?

RSS can actually be explained in three words, but depending on who you ask and what version of RSS you are speaking about, it may stand for Really Simple Syndication, Rich Site Summary, RDF Site Summary, or a variation on one of those. None of that matters to you anyhow. Another thing that you don’t need to care about is the versions. There are 0.90 and 0.91 (created by Netscape), 1.0 (by RSS-DEV), and 0.9x and 2.0 (by UserLand Software) versions, but almost all applications that handle RSS feeds can read all the different versions. There’s also a similar format called Atom, explained below.

RSS is a text-based format, a type of XML. You should know that only because often RSS files are often labeled as XML. RSS version 1.0 is also RDF (whatever), which, again, is important only because an RSS file may be labeled as RDF. RSS files (which are also called RSS feeds or channels) simply contain a list of items. Usually, each item contains a title, summary, and a link to a URL (e.g. a web page). Other information, such as the date, creator’s name, etc., may also be available. The most common use for RSS files is for news and other reverse-chronologically ordered websites like blogs. For example, this particular page on Fagan Finder has a changes log, which is also available in RSS format. An item’s description may contain all of a news article, blog post, etc., or just an extract or summary. The item’s link will usually point to the full content (although it may also point to what the content itself links to).

When a website has an RSS feed, it is said to be “syndicated.” There are various other syndication formats besides RSS (such as Atom), but RSS is by far the most widely used and supported today. RSS files do not have a common file extension, although they frequently end in one of .xml, .rss, or .rdf (note that other extensions may be used also). The term “scraping” refers to creating an RSS feed for a website that doesn’t provide one itself (i.e. scraping the text off of the page). That is, scraped feeds are not created by the same people who created the content within the feed. Scraped RSS feeds may stop working if the page changes its layout.

What is Atom?
Atom is a format quite similar to RSS. It was created by people who felt that RSS could be improved upon, and some that disagreed with some of the politics regarding RSS. Some people are heavily involved in the (quite unimportant, in my opinion) argument as to which format is better. The Atom format is in development, but as of February 2004, Atom version 0.3 is stable. There are pros and cons to the format, but that’s more complex than I am going to deal with here. The basic difference is that while Atom is somewhat more complex (for producers of Atom feeds), it is also able to carry more complex information, and it is consistent across the syndication, storage, and editing of information. Just about everything on this page which discusses RSS applies equally well to Atom. You can learn more about Atom at the official website, AtomEnabled.org.

Subscribing to a Feed
There is no agreed-upon standard for how to subscribe to an RSS feed, although some developers are working on this. So there are roughly two ways to subscribe. One is to enter the URL of the RSS feed into your aggregator. The other is to follow a subscribe link from a web page; the problem with this is that practically every aggregator has a different way of doing this. So you might see links labeled as subscribe with Bloglines or add to My Yahoo!. Two services exist to deal with this problem: Syndication Subscription Service and quickSub, which was inspired by the former. Links labeled as will take you to a page on the Syndication Subscription Service which itself contains direct links to subscribing using various aggregators. QuickSub is similar. Links to RSS feeds using the quickSub tool will popup a list of links to subscribing with various aggregators.

Very nice RSS aggregators will allow you to enter in the URL of a web page and it will then read its RSS feed. These tools support RSS auto-discovery (and anyone reading this who is writing RSS applications I encourage to use the Ultra-liberal RSS locator). Most RSS aggregators, unfortunately, aren’t that nice; you’ll need to copy and paste the RSS URL into your aggregator.

Finding the RSS Feed for a Website
The websites you already read may have an RSS feed. So you want to find it. Go to BlogStreet’s RSS Discovery tool and enter in a website. It will return the feed for you.
If that doesn’t find the RSS feed, go to the website whose feed you’re looking for; if it has one, then it probably includes a link to it. Try looking on the page’s menu (usually left side or right side) and the footer. Most often RSS feeds are linked to with an small icon. The most common is an XML icon like this:

but there are a number of variations on label (RSS, RSS2, XML, RDF, Atom), colour, and size, such as and . Other times there may not be an image, but text with one of those lables, or a link labeled “Syndicate this site.” You may also see a variation on the standard XML icon such as and ; these are direct links to subscribing to an RSS feed in AmphetaDesk and Radio UserLand (both are RSS aggregators) respectively. Radio’s coffee cup icon is sometimes shown alone. If you are running one of these news aggregators, click on the icon to subscribe, otherwise just use the usual icon. Note that not all XML icons link to RSS feeds, because there are many other XML formats. If it isn’t labeled or self-evident, just try reading the file in an aggregator; if it doesn’t work, it is probably not an RSS feed.

If you still haven’t found the RSS feed for a website you can try searching in one of the tools listed below. Failing that, write an e-mail to the webmaster and suggest that they create an RSS feed. If they don’t know what that is, you can point them over to this RSS Workshop and these RSS specifications. Lastly, you can scrape the website. MyRSS is a tool for scraping. A number of news aggregators, such as Syndirella, have the ability to “create” an RSS feed, but the feed will only be accessable to people using Syndirella.

News Is Free
News Is Free is an online news aggregator and a directory of RSS feeds, over 5,000 of them. You can search (also available on Fagan Finder), or browse the directory. News Is Free also provides scraped RSS feeds for a number of websites.

BlogStreet’s RSS Directory
BlogStreet contains a number of blog-related tools. The directory lists the RSS feeds of over 10,000 blogs, organized alphabetically.

2 comments:

Eleni Bouga, Athens, GR said...: Hello! Ann

I think your article was quite analytical and helpful for people like me who know nothing about information technology.
I was amazed by how this RSS (Really Simple Syndication) works and of the new options available now for business to capitalize on. Also, your description on RSS and Atom further facilitated my research on podcasting, as I am supposed to give a presentation about it next week.
In fact, your article inspired me to do some further research on the subject. And I found that Google started a new service account for its users called Google Reader. The surface looks like a Google email account but instead of traditional emails you receive updates from your favourite websites or blogs by just subscribing to them with the RSS technology. Fascinating or what?

Check out: http://www.google.com/reader/view/); 10 March 2007 at 21:24
Ann Ha said...: Thank you for reading my work, i will be posting more information on RSS as the course goes on. I will have blog title which will be in more 'friendly' language for all readers. So keep watching.

:-)
Ann; 12 March 2007 at 13:37

MSC Stirling

Blog Labels

Thursday, 1 March 2007

RSS First Counter

2 comments:

Google Translate Tool

Comments - Classmates

Comments - Non Classmates

Subscribe Now

Personal Site

PRWeek UK Technology News

PR Newswire: All Releases

PR Resources

Classmates" blogs

Blog Archive

Visits

Thought of the week

RSS - Presentation by Netconcept