Update: I inadvertantly failed to adequately acknowledge the main driver of the text search feature of the website: ichi brown. Ichi spent countless hours working on this, and my failure to acknowledge reflects my poor grasp of the technicals of website building....
Here is the long-promised research tool for DOJ emails, brought to you by Tech Guru (in my mind this translates to "deity") nuketeacher with the assistance of 18 Kos volunteers and myself. To my knowledge, this is the first and largest collective effort by volunteers to assimilate open source data and convert it into a public, referenced, research tool.
http://www.trainingdb.com
This tool has already been shared with numerous reporters in the mainstream and alternative media, as well as with staffers at the HJC. I, in my capacity as "organizer of the project", have received personal comments of gratitude from many of them that they find it useful for their research.
This database includes all of the 2086 unique emails included in the HJC-released documents from the DOJ. It is fully searchable and referenced, so you can easily see the email "with your own eyes". Also included is referenced biographical information for many of the involved personnel. Finally, as a second functionality, we also include a "Text Search" tool that enables search of all of the text of all of the PDF documents.
How to use the Search Engine
The tool works with any web browser.
Main Function- Email Locator
After linking to the site, click on the button by person of interest to bring up that person's biography and search the database by sender and/or recipient. Click "Show" to see all of the emails that person sent and/or recieved. Then click on "email ID" to bring up a window with a link to the document containing the desired email. Depending on your browser capabilities, clicking the document link should take you directly to the the page on which the email appears. If your browser is configured not to show PDF files, but rather downloads documents, make a notation of the page number so you can find the page within the downloaded document.
Disclaimer This function of the website is COMPLETE with respect to identifying all of the unique emails in the 9229 pages of DOJ documents released publicly by the HJC. Dates, senders, recipients, and a link to the PDF file and page are included for every available email released. You can quickly see any email you wish, with your own eyes. We still need help filling in the blanks with biographical data of the main players. We also have some "editing issues" such as to clean up spelling errors and improve nomenclature consistency.
.
.
.
Second Function- Text Search
In addition to the emails, you can search all of the text of all 9229 documents on the HJC website. Using the text search function at the bottom of any main page, enter search terms. This will bring up any documents containing your search, along with the link to the original PDF.
Disclaimer
Regarding the text search, we have parsed the PDFs (thanks to volunteers ICHI and MaverickModerate), so the text of all of the released documents is searchable using the "Text Search" function of the website. Most of the released documents were scanned files (i.e. pictures) of the documents. They have been put through an OCR process which is never perfect. Some effort has also been made to clean up these files with common OCR errors, but not all of them have been found, as you will see. This fuctionality of the website is still a work in progress. Regardless of the text errors, you can still link the original document to see the source.
.
.
.
Acknowledgements
Thanks again to the hard work of all the Kos Volunteers who helped bring this together. They have put in countless hours and should be commended. They include:
Audrey, MsWings, Ethan's Mom, WTF, Madhaus, davidincleveland, Howard, Fanaa, Brian, Michelle, Michael, Miss Butter, Pandora, Eli, OkieByAccident, Keith, Marco, Gray, Tracie, Thom K in LA, leveymg, Ichi and Valerie.
And a Special Big Thank You is owed to MaverickModerate, who has been working all along on the database development. I have inadvertantly omitted him from acknowledgement in prior diaries.....
Another Special Big Thank You to the volunteers who cranked away this weekend to help polish the database for public release.
Others have made helpful suggestions or volunteered to work on the data in different ways:
rhfactor, Bob R, ehill, DrReason, George.... If I have left anyone out, apologies and thanks again.
.
.
Notes
Please see the following for an example of analysis performed using the database:
Big Analysis
If you search my diaries, you will find other examples.
Please note that if you want an excel version of the database, or would like to volunteer and have editing access on the website, you can contact me at the email address in my profile. The likelihood is more documents will be released with respect to this scandal, as well as others. If you would like to contribute effort to helping our press and Congress in their oversight role, Join Us.