This Year's posts

Archive for May, 2005

School’s Out

Thursday, May 19th, 2005

So I’m finally done with everything school related for the semester. Woot!

I’ve got about a week off now before I start my internship– prepare for for some ‘buffer flushing.’ I have a lot of things to blog about and try to get done and I’m going to try to clear it all out in the next week (doesn’t sound like much of a break). Hopefully within the week my inbox, todo list and reading list will be empty.

And, as a sidenote, people have started using for:{delicious username} to allow people to send each other links. I’ve just subscribed to the RSS feed for for:ryansking tag– feel free so send me links that way.

Music Baton Meme

Tuesday, May 17th, 2005

Jonas has just passed me the Music Baton Meme. I don’t usually do these, but I need a break from studying from my exam (last one of the semester! woot!) that I have in the morning.

Total volume of music: 27.15 GB (including a bunch of podcasts)

Song playing right now: Be Mine, by David Gray

Five songs I’m listening to alot (lately):

Most recent purchase: Eisley, Room Noises

Of course, if you want to know more, you can always check my Audioscrobbler profile.

The Self-Organized Web

Monday, May 16th, 2005

Update: Added a quote from Tim Bray that I had forgotten.

I just came across a paper called The Self-Organized Web: The Yin to the Semantic Web’s Yang . Though it was written awhile ago (at least, I assume so, since it was apparently written before Yahoo bought Overture).

They start off by saying:

The Web programmer’s task— …—is to use these patterns [self-organized structure] to make the Web more digestible to users. This goal complements the Semantic Web’s goal: to have humans help make the Web more digestible for computers. Exploiting the self-organized Web will improve tomorrow’s algorithms; manually adding computer-friendly annotations to the Semantic Web will help today’s less-sophisticated algorithms to cope.

The key here is that our goal is to find ways that allow people to create meaninful content which is consumable by computers. I think SW folk have failed by not focusing on this factor enough. Conventional wisdom seems to be that people won’t have to read the source, they’ll just be able to use tools to generate it for them. I believe this premise to be false. TBL and company though that no one would really have to read and understand HTML, we’d all just have tools to create it for us. I think experience has shown this expectation to be unfounded– it is quite common for ‘civilians’ (non-geeks) to read and write HTML and most learn it by example, by reading others’ HTML. I don’t think there’s any reason to believe that the future will be any different.*

Update: I just refound an old post by Tim Bray which hits the same point:

RDF has ignored what I consider to be the central lesson of the World Wide Web, the “View Source” lesson. The way the Web grew was, somebody pointed their browser at a URI, were impressed by what they saw, wondered “How’d they do that?”, hit View Source, and figured it out by trial and error.

[/Update]

Back to the paper– they finish with…

But for the foreseeable future, efforts to leverage the self- organized Web will complement efforts to build the Semantic Web. Both open up opportunities for innovative new algorithms—data mining on one hand, symbolic inference on the other. Where these efforts meet, tools will arise for vastly improved search, filtering, personalization, economic efficiency, and scientific understanding of the social forces and trends reflected in the Web. We believe that the Web’s properties—structure, content, and explicit or inferred metadata—will continue to evolve in a decentralized and self-organized way. Users will benefit most if work on creating the Semantic Web coevolves with work on tools for data-driven analysis of the self-organized Web.

I whole-heartedly believe that the only way to succesfully create a more meaningful world-wide web is to build it in and on the current world-wide web.

In an sense the WWW and Semantic Web must meet in the middle. People writting for the www should try their best to make their material more meaningful, which will allow both people and computers to make more use of it.

Viva la revolution!

* Of course, this could be a result of the lack of quality authoring tools for (good) HTML.

Ontology is Overrated

Monday, May 16th, 2005

Clay Shirky has published an Essay entitled Ontology is Overrated — Categories, Links, and Tags. It does a great job of summarizing the pro-folksonomy/tagging arguments. A sample:

Today I want to talk about categorization, and I want to convince you that a lot of what we think we know about categorization is wrong. In particular, I want to convince you that many of the ways we’re attempting to apply categorization to the electronic world are actually a bad fit, because we’ve adopted habits of mind that are left over from earlier strategies.

World’s First Technorati Intern

Thursday, May 12th, 2005

Update: Sorry to those who got the shaft from Spam Karma. I’ve turned it off now, so you should be able to add comments.

It’s official. I’m interning at Technorati this summer. Thanks to Tantek, Niall, Jonas and whoever else put in a good word for me (from the inside ;-) ).

And, in case anyone’s wondering, there’s absolutely no doubt that without this blog I wouldn’t have this opportunity. Here’s the formula:

  1. Write about something you care about.
  2. Try to write it well.
  3. Figure out who else cares about it.
  4. Get them to read your stuff.
  5. 5. Let everyone know you’re looking for work.
  6. ???
  7. Profit!

Though steps 6 and 7 are out of your control, but 1-5 are within the grasp of anyone with at least a bit of writing skills.

Update: Niall (via IRC) says I should encourage people to contribute to OSS projects, as well. To paraphrase, “don’t wait around for the corporate master to take notice, open up a text editor and hack.” Couldn’t have said it better myself.