Looks like the Semantic Web hurts Russell Beattie’s branes. Hurts mine too. But, I tried explaining what I think I understand in a comment on his blog and I figure it’s worth reposting here for ridicule and correction:
Did you happen to catch Tim Berners-Lee on NPR Science Friday today? Not sure if you get the broadcast there, or listen to the stream. He was expounding on the Semantic Web a bit.
Maybe I’ll take a shot at explaining, since I think I understand the idea. Likely I’ll fail miserably, but here goes.
First simple thing: Look at your weblog page. What would it take to extract the list of people from your blogroll, just given your URL? What about the titles of all the weblog posts on that page?
You, personally, can extract that information very easily since you, as a learned human, grasp the semantics of the page quite quickly. (The semantics are, basically, what’s what and what’s where and what does it all describe.)
Imagine a document containing exactly all of the same info your weblog page presents - only the data is completely, easily accessible to a robot in a universal, easily handled format.
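To make that a little more concrete, here is a minimal sketch of what such a robot-friendly document might boil down to: a flat list of (subject, property, value) statements. Every URL, property name, and person below is made up for illustration; none of it comes from a real vocabulary or a real weblog.

```python
# Hypothetical data: all URLs and property names are invented for this sketch.
PAGE = "http://example.org/weblog"

triples = [
    (PAGE, "blogroll", "http://example.org/people/alice"),
    (PAGE, "blogroll", "http://example.org/people/bob"),
    ("http://example.org/people/alice", "name", "Alice"),
    ("http://example.org/people/bob", "name", "Bob"),
    (PAGE, "post", "http://example.org/weblog/1"),
    (PAGE, "post", "http://example.org/weblog/2"),
    ("http://example.org/weblog/1", "title", "First post"),
    ("http://example.org/weblog/2", "title", "Second post"),
]


def objects(statements, subject, prop):
    """Return every value stated for (subject, prop), in document order."""
    return [o for s, p, o in statements if s == subject and p == prop]


def blogroll_names(statements, page):
    """Names of the people linked from the page's blogroll."""
    return [
        name
        for person in objects(statements, page, "blogroll")
        for name in objects(statements, person, "name")
    ]


def post_titles(statements, page):
    """Titles of the posts listed on the page."""
    return [
        title
        for post in objects(statements, page, "post")
        for title in objects(statements, post, "title")
    ]
```

With the data in this shape, both questions from earlier (who is on the blogroll, what are the post titles) become simple lookups rather than scraping.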
Furthermore, imagine that the schema describing the data to be found on your page is in that same format. And then, imagine that the document describing the construction of schema is in that same format. And then imagine that the decomposition continues, all of the way down to base data types and relationships. Eventually, the whole thing rests on the back of a turtle - er I mean a sort of universal client.
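A rough sketch of that "descriptions all the way down" idea, assuming a made-up table of which type is described in terms of which, and a made-up set of base types the client already implements:

```python
# Hypothetical descriptions: each type is explained in terms of another type.
DESCRIBED_BY = {
    "WeblogPost": "Document",
    "Document": "Resource",
    "Blogroll": "ListOfPeople",
    "ListOfPeople": "List",
}

# Hypothetical base types this particular client understands natively.
BUILT_IN = {"Resource", "List"}


def resolve(type_name, described_by=DESCRIBED_BY, built_in=BUILT_IN):
    """Follow descriptions until reaching a type the client implements.

    Returns the list of types visited, ending in a built-in one.
    Raises LookupError if the descriptions run out or loop.
    """
    path = [type_name]
    while path[-1] not in built_in:
        nxt = described_by.get(path[-1])
        if nxt is None or nxt in path:
            raise LookupError(f"no known description path for {type_name!r}")
        path.append(nxt)
    return path
```

For example, `resolve("WeblogPost")` walks from `WeblogPost` to `Document` to `Resource` and stops there, because `Resource` is something this imaginary client already knows how to handle.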
Now, what if every single page on the web were available in this manner? No scraping, no regex, no tricks. I could use the entire web as a database and execute queries that draw from data available on a myriad of disparate URLs. My client can figure out what to do with whatever it finds at a URL by chasing descriptions and meta-descriptions until it reaches the level of understanding implemented in the client.
Going out on a limb here, but imagine a practical example: “Hello computer, find me 2 roundtrip tickets for 7 days anytime in the next 10 weeks, for under US$300 each, to a vacation spot where the weather this time of year is usually warm and sunny, the exchange rate is better than 3 to 1 US dollar, and is rated as better than average by Ann Arbor, MI bloggers.”
Assume my semantic web client knows some URLs to airlines, to international weather services, to exchange rates, and to vacation spot reviews in weblogs in Ann Arbor, MI. Assume that there are schema available for the things these URLs describe. Assume that my semantic web client can parse my natural language query.
So, it takes my request, goes out and snags the URLs appropriate to the various topics involved. Once it has all it needs to process the data in each URL, it can find me the answer to my query, based on data pulled from all over the place.
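Here is a toy version of that last step, skipping the natural language parsing and the actual fetching. The URLs, property names, destinations, fares, and exchange rates are all invented for the sketch; a dictionary stands in for the network, and the blogger ratings are left out to keep it short.

```python
# Stand-in for the network: hypothetical URLs mapped to the statements
# each one might publish. All figures are made up.
SOURCES = {
    "http://example.org/airline": [
        ("Cancun", "roundtrip_usd", 280),
        ("Oslo", "roundtrip_usd", 650),
        ("Lisbon", "roundtrip_usd", 290),
    ],
    "http://example.org/weather": [
        ("Cancun", "usual_weather", "warm and sunny"),
        ("Oslo", "usual_weather", "cold"),
        ("Lisbon", "usual_weather", "warm and sunny"),
    ],
    "http://example.org/exchange": [
        ("Cancun", "local_per_usd", 9.5),
        ("Oslo", "local_per_usd", 8.0),
        ("Lisbon", "local_per_usd", 1.1),
    ],
}


def lookup(statements, prop):
    """Map each subject to its value for the given property."""
    return {s: o for s, p, o in statements if p == prop}


def find_destinations(sources, max_fare, min_rate):
    """Merge statements from every source, then filter on all criteria."""
    merged = [t for statements in sources.values() for t in statements]
    fares = lookup(merged, "roundtrip_usd")
    weather = lookup(merged, "usual_weather")
    rates = lookup(merged, "local_per_usd")
    return sorted(
        place
        for place, fare in fares.items()
        if fare < max_fare
        and weather.get(place) == "warm and sunny"
        and rates.get(place, 0) > min_rate
    )
```

The point is only that once the data from different URLs shares one shape, a query spanning all of them is an ordinary join: `find_destinations(SOURCES, max_fare=300, min_rate=3)` never needs to know which site each fact came from.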
Now, get nuttier and bring in some intelligence with robots that can do some inference and reasoning. Say I throw out some facts: Mammals breathe oxygen. Men are mammals. Joe is a man. With the right client, the query “Give me all oxygen breathers,” will include Joe in its results.
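The oxygen example can be sketched with a single naive rule applied until nothing new turns up: if A is a B, then whatever is stated about B also holds for A. This is just an illustration of the idea, not how any particular reasoner works, and the property names `is_a` and `breathes` are my own assumptions.

```python
FACTS = {
    ("mammal", "breathes", "oxygen"),
    ("man", "is_a", "mammal"),
    ("Joe", "is_a", "man"),
}


def infer(facts):
    """Apply the rule 'A is_a B, and B P X  =>  A P X' until no new facts appear."""
    known = set(facts)
    while True:
        derived = set()
        for a, p1, b in known:
            if p1 != "is_a":
                continue
            for c, p2, x in known:
                if c == b:
                    derived.add((a, p2, x))
        if derived <= known:
            return known
        known |= derived


def oxygen_breathers(facts):
    """Everything that can be shown to breathe oxygen, stated or inferred."""
    return sorted(s for s, p, o in infer(facts) if p == "breathes" and o == "oxygen")
```

Nothing in `FACTS` says directly that Joe breathes oxygen, but the result of `oxygen_breathers(FACTS)` includes him (alongside the categories `man` and `mammal`, since this toy rule does not distinguish individuals from classes).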
Whew. There. That’s what I think I understand about the Semantic Web.
About Me
Hi there! My name is Leslie Michael Orchard. I'm a serially enthusiastic, caffeine-dependent {web,mad,computer} scientist and {tech,scifi} writer working for the Mozilla Corporation and living near Ann Arbor / Detroit in Michigan.
2 Comments
Yope! This explanation should make the concept pretty clear for everyone…
But there’s one little thing I wonder about: you assume so much about what your semantic web client can do, that you could actually go on assuming it can grasp the semantics of any web page without any further tagging, couldn’t you?
just kidding :-))
For an extension of the semantic web to word meanings you may want to look at:
http://jorl.com/inventions/num/index.htm
Probably the best, first use of this idea would be to add a duplicate metatag in webpages, for keywords, giving the “disambiguating” “number-words” corresponding to the keywords, and helping everybody get the search results they want. No more finding out about Mercury the car, planet, and God when you wanted to know whether your teeth were poisoning you…
One Trackback
[...] 0xDECAFBAD » What the frell is the Semantic Web Did you happen to catch Tim Berners-Lee on NPR Science Friday today? Not sure if you get the broadcast there, or listen to the stream. He was expounding on the Semantic Web a bit. (tags: decafbad.com 2008 mes1 dia8 athome semanticweb check&get 2002 blog_post) [...]