Abstract
In this article we'll show you how to turn your HTML files into mini
databases by embedding XML tags within your Web pages to describe the
enclosed content. We'll then build a Web-based Perl client that queries
and displays the contents of these embedded XML tags. Finally, we'll
extend what we've learned by developing a script that builds a top-news
summary for http://www.internetnews.com.
The introduction and adoption of XML, a simplified version of SGML, is poised to make a
monumental, yet fundamental advancement in how we store, share, and interact with data.
Thanks in part to its simplicity, the Web has
created a global information revolution that has redefined communication and human
interaction. As a result, these global information repositories on the Web are becoming
large and unmanageable. Even worse, HTML does not contain methods for organizing
or classifying data. This makes it difficult for users to find pertinent
information.
XML, on the other hand, is capable
of independently defining and describing data in an easy to understand human and machine
readable format. This frees the data from proprietary formats, which limit the scope,
effectiveness, and longevity of information.
But how can we use XML in our sites
today? After all, HTML is incompatible with XML. Read on and you'll find out.
| xml-fetch.pl |
View source | Try it |
| This script retrieves a file from the Web, queries it for a specified list of comma delimited tags, and displays the tag name, attributes, and content. |
|
|
|
|
|