• Intel
  • Microsoft
  • Google
  • Sun Microsystems
  • BT
  • IBM
  • Yahoo! Inc.
  • Zimbra
  • Atlassian Software Systems
  • Disney
  • EnterpriseDB
  • Etelos
  • Ingres
  • JasperSoft
  • Kablink
  • Linagora
  • MindTouch
  • Mozilla Corporation
  • Novell, Inc.
  • Open Invention Network
  • OpSource
  • RightScale
  • Silicon Mechanics
  • Tenth Planet
  • Ticketmaster
  • Voiceroute
  • White Oak Technologies, Inc.
  • XAware
  • ZDNet

Sponsorship Opportunities

For information on exhibition and sponsorship opportunities at the conference, contact Sharon Cordesse at scordesse@oreilly.com.

Media Partner Opportunities

Download the Media & Promotional Partner Brochure (PDF) for more information on trade opportunities with O'Reilly conferences, or contact mediapartners@oreilly.com.

Press and Media

For media-related inquiries, contact Maureen Jennings at maureen@oreilly.com.

OSCON Newsletter

To stay abreast of conference news and to receive email notification when registration opens, please sign up for the OSCON newsletter (login required).

Contact Us

View a complete list of OSCON 2008 Contacts

Processing Large Data with Hadoop and EC2

Derek Gottfrid (The New York Times)
Location: E146
Average rating: ****.
(4.17, 6 ratings)

This talk will cover processing large quanties of data using Hadoop running on top of Amazon’s EC2 machines. It will cover the theory of the MapReudce/Hadoop model and its applicability to solve different kinds of problems. Gottfrid will provided a brief of overview of AWS EC2 and S3 and look in detail at some of the work he has done using these pieces; some of it is described in the Self Service Prorated Super Computing Fun blog post.

Photo of Derek Gottfrid

Derek Gottfrid

The New York Times

Derek Gottfrid is a Senior Software Architect at The New York Times. He has been involved in building many key parts of the nytimes.com infrastructure, including search, web serving, e-mail distribution, and platform development. Derek has led efforts to improve the use of open source software within the Times and is responsible for the open source project dbslayer —a database connection pooling server. He also blogs regularly about his open source work at open.nytimes.com.

OSCON 2008