Daily Kos

blogrolls, tags, and similarity

Thu Mar 22, 2007 at 04:52:02 PM PDT

Well, I couldn't upgrade the dKosopedia today, but I've had an idea in the back of my mind for a while and decided to look at it. This is kinda sorta related to tags, of course. Right now diaries get a single set of tags, which is suboptimal. Eventually, I'm hoping before dK4, but certainly as part of dK4, each user will be able to assign their own sets of tags to diaries; in other words diaries will have multiple sets of tags. (This is nothing startling, it's moving from the Flickr model to the del.icio.us model.) In addition, there's nothing that says that only diaries can receive tags; you should be able to tag other resources as well. If you think about it, we do that already with comments. Each comment can be tagged "Recommended" or "Troll", with visible counts of the numbers of users who have so tagged.

More...

Well, what else can we tag? How about users? Wouldn't you want to tag different users based on their interests or reliabilty or humor? I would. And then why not assign tags to yourself to reflect your interests.

Well, we sort of do that now, via the new personal Blogrolls. In a way, the blogs we put on our Blogrolls are proxy tags for our interests. We're tagging ourselves. Well, then, so who are we?

I took everybody's Blogroll entries as of 3 or 4 days ago and counted our entries. 1271 of us have blogrolls, with 10094 entries covering 3853 blogs.

Here's the top 25:

24firedoglake.com213
45talkingpointsmemo.com206
99crooksandliars.com199
29digbysblog.blogspot.com174
1atrios.blogspot.com156
56mydd.com135
160boomantribune.com125
159huffingtonpost.com118
58salon.com/opinion/greenwald106
102juancole.com104
161myleftwing.com/frontpage.do98
95americablog.blogspot.com93
32myleftwing.com87
253thinkprogress.org76
61patriotboy.blogspot.com75
290rawstory.com70
189eurotrib.com67
191mediamatters.org66
226riverbendblog.blogspot.com59
63rudepundit.blogspot.com57
28pandagon.net56
105dneiwert.blogspot.com56
78talkleft.com50
91bonddad.blogspot.com49
101glenngreenwald.blogspot.com44

The first number there is my internal blog id. Ignore it. You'll notice some blogs, like Glenn Greenwald's, have more than one url spec, which screws things up. I took off www. prefixes, and trailing slashes, but still...

Anyway, this is sort of a teaser, I guess. I'll put the whole list up on meta.dkosopedia.com somewhere, with a search form -- stick in a blog name and it'll return all those who have that blog in their blogroll. It would help if there was a more standard means of specification. How about myleftwing.com instead of myleftwing.com/frontpage.do, for example.

Then the next step is implementing a similarity algorithm. You put in a username and get back all the most similar users, based on blogrolls. All you interested in healthcare issues can find each other.

This is a fun thing, not even really useful as a proof of concept, but I'm hoping some of the code can be used later when determining similarities based on tags.

For the moment, a quick and dirty version will be here a little later tonight. It's dinner time.

Tags: tags, blogrolls, dKosopedia, Daily Kos (all tags) :: Previous Tag Versions

Permalink | 22 comments

Permalink | 22 comments