Sessions tagged with 'hadoop'

Aaron Kimball (Cloudera, Inc.)
Cloudera's Introduction to Hadoop provides a solid foundation for those seeking to understand large-scale data processing with MapReduce and Hadoop. This session is appropriate for attendees who need to analyze data with Hadoop's MapReduce paradigm.
Kevin Weil (Twitter, Inc.)
How does Twitter analyze its massive dataset? What tools do we use, and where do we focus our analysis? In this talk, I will discuss our transition from a MySQL-based to a Hadoop-based data infrastructure and our use of Pig (a scripting language built on top of Hadoop) to democratize big-data analysis across the company. I will present concrete examples of interesting analyses at each step.
Aaron Kimball (Cloudera, Inc.)
Cloudera's Introduction to Hadoop provides a solid foundation for those seeking to understand large-scale data processing with MapReduce and Hadoop. This session is appropriate for attendees who are new to Hadoop and are seeking to understand where Hadoop is appropriate and how it fits with existing systems.
Robin Anil (Google)
Data is exploding all over the internet. There is immense knowledge within this huge volume of information that needs to be unlocked. We need to mine patterns, find clusters, organize content, and predict the future. In this talk, we will show what these methods are and how the new Apache Mahout project is attempting to solve these problems in a scalable way by utilizing Hadoop.
Matthew Russell (Digital Reasoning Systems)
A non-classified case study describing how we built a highly scalable system for the U.S. intelligence community on a stack of MALLET, Hadoop/Cassandra, and Flare/Flex: MALLET lends itself to state-of-the-art NLP, Hadoop/Cassandra yield a massively distributed back end, and Flare/Flex provide the tools for creating a great UI/UX capable of performing advanced analysis.