View Diary: Public Release: USAGate DOJ Email Database (176 comments)

  •  my db vs. this db. (3+ / 0-)
    Recommended by:
    motherlowman, 4Freedom, drational

    As I've said a couple of times, all I gave drational was the text of the emails. I've run those emails through scripts I wrote in Perl and mangled the data much, much more than just stripping the text files. Here's the basic rundown.

    table1
    id - unique id for each docid
    docid - actual docid, e.g. ASG00000001
    filename - filename of the document stored in my archive, soon to be stored in the db
    parts - number of elements within each document

    table2
    id - same as table1
    part - element number, 1 through however many the document has
    to - recipients
    from - sender
    cc - CC'd recipients
    subject - subject <description>
    time - time sent (can provide multiple formats)

    table3
    id - same as table1
    data - full text of the TXT file, i.e. the contents of the page
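
The three tables above could be sketched roughly as follows. This is only an illustration, assuming SQLite via Python's `sqlite3` module; the comment does not say which database engine is used, and the column types, keys, and the `create_db` helper are guesses rather than the author's actual schema.

```python
import sqlite3

# Assumed types and keys; only the table and column names come from the rundown.
SCHEMA = """
CREATE TABLE table1 (
    id       INTEGER PRIMARY KEY,   -- unique id for each docid
    docid    TEXT UNIQUE NOT NULL,  -- e.g. ASG00000001
    filename TEXT,                  -- file name of the document in the archive
    parts    INTEGER                -- number of elements in the document
);
CREATE TABLE table2 (
    id      INTEGER NOT NULL REFERENCES table1(id),
    part    INTEGER NOT NULL,       -- element number, starting at 1
    "to"    TEXT,                   -- quoted: TO is an SQL keyword
    "from"  TEXT,                   -- quoted: FROM is an SQL keyword
    cc      TEXT,
    subject TEXT,
    time    TEXT,                   -- time sent, stored as text here
    PRIMARY KEY (id, part)
);
CREATE TABLE table3 (
    id   INTEGER PRIMARY KEY REFERENCES table1(id),
    data TEXT                       -- full text of the TXT file
);
"""


def create_db(path=":memory:"):
    """Open a database (in memory by default) and create the three tables."""
    conn = sqlite3.connect(path)
    conn.executescript(SCHEMA)
    return conn
```

Keying table2 on (id, part) is one way to model "emails within emails": each document has one row in table1 and one row in table2 per element.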

    My thoughts are stated above, but I will be creating tables as I see the need to. If drational will incorporate my db into his, he's more than welcome to my data, as I said from the beginning. I'd much rather work with people on this, creating a voice. My plan was to have, by Friday, a complete archive of the data, searchable by element (emails within emails).

    I can create filters, but you can, for instance, query the db where from: = rove@gwb.com, or filter on to: or CC: in a similar way.
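
A query of the kind described might look like the sketch below. It assumes SQLite through Python's `sqlite3` module and a table2 laid out as in the rundown; the `find_elements` helper, the column types, and every demo row are hypothetical and are not taken from the actual archive.

```python
import sqlite3


def find_elements(conn, address):
    """Return (id, part) pairs where address is the sender or a recipient.

    to/cc may hold several recipients, so they are matched by substring,
    case-insensitively. Note this also matches longer addresses that
    merely contain the given one.
    """
    sql = '''
        SELECT id, part FROM table2
        WHERE lower("from") = lower(:addr)
           OR instr(lower(coalesce("to", '')), lower(:addr)) > 0
           OR instr(lower(coalesce(cc, '')), lower(:addr)) > 0
        ORDER BY id, part
    '''
    return conn.execute(sql, {"addr": address}).fetchall()


# Demo with made-up rows (placeholders only, not real archive data).
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE table2 (id INTEGER, part INTEGER, "to" TEXT, "from" TEXT, '
    "cc TEXT, subject TEXT, time TEXT)"
)
conn.executemany(
    "INSERT INTO table2 VALUES (?, ?, ?, ?, ?, ?, ?)",
    [
        (1, 1, "a@example.com", "rove@gwb.com", None, "demo", None),
        (1, 2, "rove@gwb.com, b@example.com", "a@example.com", None, "demo", None),
        (2, 1, "b@example.com", "a@example.com", "c@example.com", "demo", None),
        (3, 1, "b@example.com", "a@example.com", "Rove@gwb.com", "demo", None),
    ],
)
matches = find_elements(conn, "rove@gwb.com")
print(matches)
```

Passing the address as a bound parameter rather than pasting it into the SQL string keeps the query safe if the search term ever comes from a web form.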
