We’ve spent the past year helping the intelligence community (IC) build a brand new system from the ground up. It mines massive amounts of structured and unstructured data (from the high tens of millions to more than 100 million documents), stores the results in a highly flexible, massively distributable semantic meta-model, and lends itself to fierce analysis by the men and women who protect our country. We rely heavily on Java-based open source software to do it.
This presentation begins with an overview of the problem domain and our experiences with performing semantic analysis on massive amounts of messy structured and unstructured data, but quickly transitions to a discussion of how we’ve leveraged open source software to create a powerful solution stack for the IC. Specifically, we’ll discuss:
A working demonstration using publicly available data will accompany the presentation (and will hopefully be available at a public URL for users to tinker with during and after the presentation).
Vice President of Engineering – Digital Reasoning Systems, a firm that specializes in data mining at scale
Comments
Ben – it’s actually not classified. I just need to have it reviewed before I can make it available. Once I’m able to do that, I’ll get it posted. Thanks for the kind words, BTW.
The presentation was informative, but it was a little disappointing that it’s classified and so unavailable.