Skip to main content

View Diary: Election Race Diary Roundup (11/11 - Final 2006 Edition) (92 comments)

Comment Preferences

  •  I don't think you were being... (1+ / 0-)
    Recommended by:
    Alma

    ... incomprehensible at all. Well, not necessarily anyway. I just got such tunnel-vision on this thing that I got to pay attention to precious little elsewhere on the site during that period. So you've probably given perfectly understandable explanations that I've just missed.

    And yeah, the one thing I can say for sure is that it was 81 diaries. I started 8/22 and we missed one day thereafter until 11/11.

    Make a difference today. Who better than you?

    by sidinny on Wed Nov 15, 2006 at 04:51:44 AM PST

    [ Parent ]

    •  81 (1+ / 0-)
      Recommended by:
      Alma

      Thanks, its always the last few that cause problems, and I needed to know they were there so I knew how hard to look.

      I have them all now - numbers to follow quickly.

      # 37 days 'til the light starts to return

      by jotter on Wed Nov 15, 2006 at 09:09:59 AM PST

      [ Parent ]

    •  once again (1+ / 0-)
      Recommended by:
      Alma

      From 81 Election Roundup diaries, I could recover 7511 links to Daily Kos stories/diaries.  

      Excluding any with the phrase "Election Race Roundup" or Election Race Diary Rescue Roundup" in the title left 6873, which after removing duplicates left 6849 unique diaries, compared to your number of 6827.  

      I don't think we can get closer without comparing final lists.  Probably a few  extra non round up diaries were mentioned along the way that I counted and you didn't.  But at least I'm getting as many diaries as you, instead of fewer.

      There were 24 diaries mentioned more than once.  These were mentioned in 23 diaries. Two such diaries had 8 mentions of duplicates, 11 had two, and 10 had one.  

      Now the hard part.  The whole point of the exercise was to pull out the race assignments the Roundup team made to each diary, for use as tags.

      I now can parse out 6881 such assignments, involving 6782 diaries and 573 races.  That means I've missed assignments for 60-70 diaries.  I took a look at those, and it appears they are mostly due to formatting discrepancies.  That caused my parsing to fail, but means the assignments can be retrieved.  I haven't done that yet.  

      Here are the numbers to date comparing roundup with database tagged election diaries.

      Roundup tags: 573, only in roundup: 264
      Roundup diaries: 6782, only in roundup: 2341

      KOSDB tags: 715, only in KOSDB: 406
      KOSDB diaries: 6532, only in KOSDB: 2091

      Tags found in both Roundup and KOSDB: 309
      Diaries found in both Roundup and KOSDB: 4441

      Tags found in one or the other but not both: 670
      Diaries found in one or the other but not both: 4432

      Union of Roundup and KOSDB tags: 979 diaries: 8873

      What I hope to do with this, once the final group are manually curated, is to hand off this list to a group of people interested in tagging, and make sure that all the diaries from the election roundup series get the tags you assigned as well as a tag indicating it was part of the election roundup series.

      Once that is done it will be possible to compare all  diaries with a particular race tag to those that got rounded up.

      Do you see any problem with that?  

      # 36 days 'til the light starts to return

      by jotter on Wed Nov 15, 2006 at 01:21:01 PM PST

      [ Parent ]

      •  Okay, think I got the basics... (0+ / 0-)

        Looks like you're just adding another layer to the cross-referencing on these. I thought of doing something like that by adding our own tag designation to diaries in the early stages but it seemed way too labor intensive. Makes sense, though, if you have the folks willing to do it.

        My thought is that adding something simple yet distinctive like 2006 (or '06) ERR (or ERDR - both have been used) would probably do the trick. And I certainly don't have a problem with anything, I think it's amazing that you're putting all the time into looking at this. I just wish we had been able to work with you on the front end, we probably could have organized things to make your life a lot easier.

        If there's anything else, just give a yell (or drop an email, it might be easier).

        Make a difference today. Who better than you?

        by sidinny on Thu Nov 16, 2006 at 11:18:24 AM PST

        [ Parent ]

        •  I don't have any "folks" (1+ / 0-)
          Recommended by:
          Alma

          but I'm hoping to faciitate an automated clean up.

          I can't find any existing ERR / ERDR tags in use - what am I doing wrong?  I tried 2006 or '06 in front or after with and without a space.

          You guys were too busy mining to pay attention to formatting - that's fine.  Besides it turns out it was really amazingly good as is.

          I'll let you know of any progress.

          # 35 days 'til the light starts to return

          by jotter on Thu Nov 16, 2006 at 02:01:21 PM PST

          [ Parent ]

Subscribe or Donate to support Daily Kos.

Click here for the mobile view of the site