View Diary: NSA whistleblower Russell Tice on Countdown w/KO, Day Two (update X4) (152 comments)

  •  The problem of writing a driver... (1+ / 0-)
    Recommended by:
    blueoasis

    ... to generate sufficiently realistic test data is exactly the same as the problem of training a classifier without actually reading the communications used to train it.

    You can't use the classifier you are building to generate the training set.

    And you can't build a test generator without training it on real data, either.  The patterns being sought are too statistically subtle to reproduce in your test set without the use of real-world data.
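
To make that concrete, here is a toy sketch, not anything resembling a real system. Everything in it is an assumption made for illustration: the two-feature Gaussian model, the weak correlation of 0.15 standing in for a "subtle pattern", and the function names. A generator built from guesses about each feature alone misses the pattern; one fitted to the real sample reproduces it, but only because it looked at the real sample.

```python
import random
import statistics


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)


def correlated_pairs(rng, n, rho):
    """Draw n pairs of standard-normal features with correlation rho."""
    pairs = []
    for _ in range(n):
        x = rng.gauss(0, 1)
        y = rho * x + (1 - rho ** 2) ** 0.5 * rng.gauss(0, 1)
        pairs.append((x, y))
    return pairs


def real_world(rng, n, rho=0.15):
    """Hypothetical stand-in for real data carrying a weak pattern."""
    return correlated_pairs(rng, n, rho)


def naive_generator(rng, n):
    """Test generator built without real data: features are independent."""
    return correlated_pairs(rng, n, 0.0)


def fitted_generator(rng, n, sample):
    """Test generator that first measures the pattern in a real sample."""
    xs, ys = zip(*sample)
    return correlated_pairs(rng, n, pearson(xs, ys))


if __name__ == "__main__":
    rng = random.Random(0)
    real = real_world(rng, 20000)
    for name, data in [
        ("real", real),
        ("naive", naive_generator(rng, 20000)),
        ("fitted", fitted_generator(rng, 20000, real)),
    ]:
        print(name, round(pearson(*zip(*data)), 3))
```

The naive generator gets each feature right on its own and still comes out with a correlation near zero, so a classifier tested on it never sees the pattern it is supposed to find.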

    Quick to judge, Quick to anger, Slow to understand; Ignorance and prejudice and fear walk hand in hand. -- Neil Peart

    by JRandomPoster on Fri Jan 23, 2009 at 12:35:33 AM PST
