Sponsors

  • Microsoft
  • Nebula
  • Google
  • SugarCRM
  • Facebook
  • HP
  • Intel
  • Rackspace Hosting
  • WSO2
  • Alfresco
  • BlackBerry
  • CUBRID
  • Dell
  • eBay
  • Heroku
  • InfiniteGraph
  • JBoss
  • LeaseWeb
  • Liferay
  • Media Temple, Inc.
  • OpenShift
  • Oracle
  • Percona
  • Puppet Labs
  • Qualcomm Innovation Center, Inc.
  • Rentrak
  • Silicon Mechanics
  • SoftLayer Technologies, Inc.
  • SourceGear
  • Urban Airship
  • Vertica
  • VMware
  • (mt) Media Temple, Inc.

Sponsorship Opportunities

For information on exhibition and sponsorship opportunities at the convention, contact Sharon Cordesse at scordesse@oreilly.com

Download the OSCON Sponsor/Exhibitor Prospectus

Contact Us

View a complete list of OSCON contacts

Personal schedule for Gregory Altman

Download or subscribe to Gregory Altman's schedule.

Location: Oregon Ballroom 203/204
Sarah Novotny (NGINX), Bradford Stephens (Drawn to Scale)
Opening remarks by the OSCON Data program chairs, Sarah Novotny and Bradford Stephens. Read more.
Keynote
Location: Oregon Ballroom 203/204
Tom Quisel (OkCupid)
Average rating: ***..
(3.22, 9 ratings)
Dive into the distributed system that powers OkCupid’s match searches. Learn how we use C++, event-based programming, and SSDs to solve problems that crop up when building a high performance, high availability distributed system. Read more.
Keynote
Location: Oregon Ballroom 203/204
Benjamin Black (Boundary)
Average rating: ***..
(3.67, 12 ratings)
Keynote by Benjamin Black, Co-founder, fast_ip. Read more.
Keynote
Location: Oregon Ballroom 203/204
Steve Yegge (Google)
Average rating: ****.
(4.71, 17 ratings)
It's 2021. You have a petabyte drive on your keychain, your startup company leases bulk cloud storage by the exabyte, and you have a million cores for data crunching. You even can have your own copy of the entire world's public semantic data. What do you do with it? If you're not sure yet, I've got plenty of ideas for you. Read more.
Keynote
Location: Oregon Ballroom 203/204
Average rating: **...
(2.50, 2 ratings)
An open microphone question and answer session with the morning's keynote speakers. Read more.
Data: Hadoop
Location: C123
Tom Hanlon (Cloudera)
Average rating: ****.
(4.27, 11 ratings)
Hadoop gives you the ability to process massive amounts of data at scale. This presentation will show you how hadoop makes use of commodity hardware to allow you to build a system that scales, that deals gracefully with failure of individual nodes, and gives you the power of Map/Reduce to process Petabytes. Read more.
Data: NoSQL Databases
Location: B118-119
Patrick Lightbody (New Relic)
Average rating: **...
(2.78, 9 ratings)
Between the NoSQL movement and new cloud offerings, it seems there are new storage options popping up every day. How do you select which one is the best for your project? The truth is that it's unlikely one option is best for all your needs. This session walks you through the various options considered by one startup and how it selected five separate storage engines - and has no regret doing so! Read more.
Data: Roulette
Location: C123
Gleicon Moraes (7co.cc)
Average rating: *....
(1.88, 8 ratings)
Ever had to dig into a system that misused the most basic features of a RDBMS ? Better yet - after the whole NoSQL storm had you wondered why it didn't shown before when you had to twist your schema to fit into something it was not designed for ? Check on this anti-patterns collection and feel better that you are not alone - and how you can benefit from it even not having big data around. Read more.
Benoit Sigoure (StumbleUpon, Inc.)
Average rating: ****.
(4.30, 10 ratings)
OpenTSDB is an open-source, distributed time series database designed to monitor large clusters of commodity machines at an unprecedented level of granularity. OpenTSDB enables operations teams to keep track in real-time of all the metrics exposed by operating systems, applications and network equipment, and makes the data easily accessible. Read more.
Data: Hadoop
Location: C121/122
Greg Fodor (Etsy)
Average rating: ***..
(3.75, 4 ratings)
The data & analytics teams at Etsy build up and tear down more than a thousand independent Hadoop clusters on EC2 each month. This talk discusses the benefits of this approach, where Elastic Map Reduce serves as a "meta-cluster" in which on-demand Hadoop clusters can be created, used, and shut down quickly and easily. Read more.
Data: NoSQL Databases
Location: B118-119
Ezra Zygmuntowicz (VMware Inc)
Average rating: ****.
(4.00, 2 ratings)
Redis is an entry in the new breed of nosql databases. But it takes a different approach that makes it much more interesting then most of the other key/value stores in the same category. Come learn what makes redis so useful that it seems everyone is adding it to their toolbox. Read more.
Theo Schlossnagle (OmniTI/Circonus)
Average rating: ****.
(4.38, 8 ratings)
The art of dealing with real-time data is not new. In fact, much of the world's economy is propped up my making decisions on data sub milliseconds. The technology is there, we have the power. We'll take a whirlwind tour of the open-source Esper system and understand how to integrate it into your stack to enable rapid decision making on real-time data from anywhere in your architecture. Read more.
Data: Relational
Location: C121/122
Tags: dba_dude
Bruce Momjian (EnterpriseDB)
Average rating: ****.
(4.00, 1 rating)
Multiversion Concurrency Control (MVCC) allows Postgres to offer high concurrency even during significant database read/write activity. MVCC specifically offers behavior where "readers never block writers, and writers never block readers". This talk explains how MVCC is implemented in Postgres and highlights optimizations which minimize the downsides of MVCC. This talk is for advanced users. Read more.
Data: Hadoop
Location: C124
Arun Murthy (Hortonworks Inc.)
Average rating: ***..
(3.00, 4 ratings)
YARN is the next generation of Hadoop Map-Reduce designed to scale out much further while allowing for running applications other than pure Map-Reduce in a highly fault-tolerant manner. Read more.
Jonathan Seidman (Orbitz Worldwide), Ramesh Venkataramaiah (Orbitz Worldwide)
Average rating: **...
(2.75, 8 ratings)
An overview of the state of the art for bringing together the analytical power of the R language with the big data capabilities of Hadoop. Read more.
Aaron Kimball (Magnify Consulting)
Average rating: ***..
(3.62, 8 ratings)
This talk introduces an open-source SQL-based system for continuous or ad-hoc analysis of streaming data built on top of Flume-based data collection for Hadoop. Attendees will understand how to use a new tool to extend their Hadoop data collection pipeline with real-time streaming analytics. Read more.
Location: B118-119
Tom White (Cloudera)
Average rating: ***..
(3.33, 3 ratings)
Apache Whirr is a way to run distributed systems - such as Hadoop, HBase, Cassandra, and ZooKeeper - in the cloud. Whirr provides a simple API for starting and stopping clusters for evaluation, test, or production purposes. This talk explains Whirr's architecture and shows how to use it. Read more.
Location: B118-119
Brian Aker (HP)
Average rating: ***..
(3.50, 2 ratings)
Many people view topics like Map/Reduce and queue systems as advanced concepts that require in-depth knowledge and time consuming software setup. Gearman is changing all that by making this barrier to entry as low as possible with an open source, distributed job queuing system. Read more.
Data: NoSQL Databases
Location: C124
Rusty Klophaus (Basho Technologies)
Average rating: ****.
(4.67, 3 ratings)
The Basho engineering team has been working to make Riak more queryable with the addition of built-in indexing plus a SQL-style query language. In this talk, Rusty describes the usage, benefits, limitations, and evolution of this this functionality, called Secondary Indices. He also covers the challenges and pitfalls of adding indexing to a distributed datastore. Read more.
Noah Pepper (Lucky Sort), Homer Strong (Lucky Sort)
Average rating: ***..
(3.18, 11 ratings)
We produce gorgeous LaTeX reports while harnessing the power of R on the backend. The data is pulled from our PostgreSQL database, the analysis and visualizations are fast and distributed thanks to Redis. We'll talk about weaving together open source tools to build powerful analytics reporting engines that rival the commercial alternatives. Read more.
Event
Location: Oregon Ballroom
Average rating: ****.
(4.79, 24 ratings)
If you had five minutes on stage what would you say? What if you only got 20 slides and they rotated automatically after 15 seconds? Would you pitch a project? Launch a web site? Teach a hack? We’re going to find out when we conduct our third Ignite event at OSCON. Read more.
Event
Location: See BoF Schedule for Locations
Average rating: ***..
(3.00, 1 rating)
Birds of a Feather (BoF) sessions provide face to face exposure to those interested in the same projects and concepts. BoFs can be organized for individual projects or broader topics (best practices, open data, standards). BoFs are entirely up to you. We post your topic online and onsite and provide the space and time. You provide the engaging topic. Read more.
Location: Oregon Ballroom 203/204
Sarah Novotny (NGINX), Bradford Stephens (Drawn to Scale)
Opening remarks by the OSCON Data program chairs, Sarah Novotny and Bradford Stephens. Read more.
Keynote
Location: Oregon Ballroom 203/204
Dwight Merriman (10gen)
Average rating: ***..
(3.71, 7 ratings)
Much has been made of scalability as a driver for choosing a database, but the choice of a database influences much more than the scaling architecture. Different database choices drive different data models which in turn influence the development process. Read more.
Keynote
Location: Oregon Ballroom 203/204
Adrian Cockcroft (Battery)
Average rating: ****.
(4.44, 9 ratings)
Keynote by Adrian Cockcroft, Cloud Architect, Netflix. Read more.
Keynote
Location: Oregon Ballroom 203/204
Brian Aker (HP)
Average rating: ***..
(3.50, 8 ratings)
We love data, and today we generate data in astronomical amounts. When we hit save on a document, snap a photo, or fill out a form online, we want to know that this data will persist, and we want to know that we can share, access, or reference it in the future. For any meaningful use, we need to how data relates to other data. Read more.
Data: Big Data
Location: B118-119
Jay Kreps (LinkedIn)
Average rating: ****.
(4.11, 9 ratings)
The last few years have brought a wealth of new data technologies organized around horizontal scalability. This talk will cover the essential infrastructure areas: real-time stream processing, offline data crunching, large-scale data deployments and live serving. The focus will be on how these ingredients come together to enable innovative data-driven products at LinkedIn. Read more.
Jean-Daniel Cryans (Cloudera)
Average rating: ****.
(4.00, 4 ratings)
Imagine for a moment doing a JOIN on two HBase tables, crazy talk right? Well now you can thanks to Hive. True, it is only meant to be used in a batch context, but we have being doing it for a few months now at StumbleUpon and our analysts and engineers love it. This presentation will cover how the Hive-HBase integration works and how we use it at our company. Read more.
Data: Relational
Location: C123
Selena Deckelmann (PostgreSQL)
Average rating: ****.
(4.12, 8 ratings)
PostgreSQL continues to provide a major release every year full of improvements, better performance and features that measure up to the most popular commercial databases. Our 2011 release, 9.1, is no exception! Read more.
Data: Real-Time and Streaming
Location: C121/122
John Hugg (VoltDB)
Average rating: ***..
(3.50, 4 ratings)
In this talk, we will introduce a simple formula for all Big Data applications: Big Data = Fast Data + Deep Data. Through a use-case format, we will discuss the specialized requirements for real-time (“fast”) and analytic (“deep”) data management. Read more.
Data: Big Data
Location: B118-119
Jared Williams (New York State Senate), Noel Hidalgo (World Economic Forum), Graylin Kim (New York State Senate)
Average rating: ***..
(3.50, 2 ratings)
The story of the development team and what lessons we learned in building Open Legislation - an open government platform. It will detail our transition from a MySQL back end to an application fully powered by Lucene, the data quality and efficiency issues that we’ve had to address, and how we’re now trying to rebuild internal trust after our iterative and initially shaky development process. Read more.
Data: NoSQL Databases
Location: B118-119
Tags: nosql_nerd
Dwight Merriman (10gen)
Average rating: ****.
(4.00, 3 ratings)
One of the challenges that comes with moving to MongoDB is figuring how to best model your data. While most developers have internalized the rules of thumb for designing schemas for RDBMSs, these rules don't always apply to MongoDB. Read more.
Data: Big Data
Location: C123
Kate Matsudaira (SEOmoz)
Average rating: ***..
(3.50, 10 ratings)
Building large data applications can present a unique set of technical challenges because things that often work well in the conventional development environment can become incredibly arduous or expensive when applied on a much bigger scale. This talk will cover some of those challenges and potential solutions for each. Read more.
David Pacheco (Joyent), Brendan Gregg (Netflix)
Average rating: ***..
(3.00, 3 ratings)
We'll present the architecture and implementation of a Node.js/DTrace-based distributed platform for analyzing the performance of cloud applications in real-time. We'll do a live demo on a real, internet-facing cloud and discuss some of the interesting performance pathologies we've found and explained using this tool. Read more.
Location: C123
Scott Andreas (Boundary Inc.)
Average rating: ***..
(3.87, 15 ratings)
This language-agnostic proposal focuses upon concepts and strategies critical to the design and implementation of asynchronous systems and data processing layers. Key components include a survey of implementation strategies for non-blocking edge tiers, patterns for building out a distributed worker / processing tier, along with several horror stories of cascading failures and their resolution. Read more.
Data: Roulette
Location: C124
Peter Neubauer (Neo Technology)
Average rating: **...
(2.00, 1 rating)
Location-based services are hot, but geographic datasets are complex. But this shouldn’t put you off writing awesome location-aware services. This talk will show how to create spatial models and query the Open Street Map dataset together with social data using the Neo4j graph database. Read more.
Data: Scaling
Location: B118-119
Andy Blyler (Barracuda Networks), Lindsay Snider
Average rating: ****.
(4.00, 1 rating)
Solr, an open source enterprise search server, scales very well within an index (vertical scaling). It is when you have multiple indexes (horizontal scaling) that it starts to get hairy, which happens a lot when you are hosting a cloud based solution for multiple users. In this session we will discuss these issue as well as the techniques of how to overcome them in-depth. Read more.
Location: C121/122
Robert Treat (OmniTI)
Average rating: ****.
(4.17, 6 ratings)
Everyone thinks they know what sharding is and how to do it, but simple horizontal read scaling is the small potatoes. In this talk we'll focus on the sharding pattern for large scale read/write architectures, based on real world implementations. Supporting millions of users on commodity hardware doesn't need magical software, just careful application of the right scalability pattern. Read more.
Event
Location: Expo Hall
Average rating: ***..
(3.92, 24 ratings)
Grab a drink and kick off the 13th edition of OSCON by meeting and mingling with exhibitors and fellow attendees. Read more.
Event
Location: Hall B
Average rating: ****.
(4.22, 37 ratings)
Step right up and join us at the O'Reilly OSCON Carnival. There will be games, clowns, sumo wrestling, log rolling, tattoos, and lots more. There's free food, free wine, and free beer. You’ve never seen a carnival like this. Trust us. Read more.
Event
Location: 411 NW Park Ave.
Average rating: ****.
(4.08, 12 ratings)
Join Puppet Labs and SwellPath Interactive at their headquarters in the Pearl District. The party is free, as in free beer, food and fun. Two floors, two open bars, and more. Take the Green or Yellow line (free transit) west to Union Station and walk 2 blocks west to 411 NW Park Ave. Read more.
Location: Portland Ballroom
Average rating: ***..
(3.63, 19 ratings)
Keynotes today will be shared by OSCON, OSCON Data, and OSCON Java. Read more.
Keynote
Location: Portland Ballroom
Jono Bacon (Canonical Ltd)
Average rating: **...
(2.64, 55 ratings)
In this new keynote, Jono Bacon, author of The Art of Community (O'Reilly), founder of the Community Leadership Summit and award-winning Community Manager for the global Ubuntu community, talks about the new opportunities and challenges we face in understanding the art and science of community leadership. Read more.
Keynote
Location: Portland Ballroom
Ariel Waldman (Spacehack.org)
Average rating: ****.
(4.35, 62 ratings)
From launching robots into space to discovering distant galaxies: how people are creating open source space exploration and hacking science. Read more.
Data: NoSQL Databases
Location: Oregon Ballroom 204
Bradley Holt (Found Line)
Average rating: ***..
(3.12, 8 ratings)
CouchDB is a document-oriented database that uses JSON documents, has a RESTful HTTP API, and employs map/reduce views for querying data. This tutorial will teach web developers the concepts they need to get started using CouchDB in their projects. Libraries are available for CouchDB’s RESTful HTTP API in many programming languages and we will take a look at some of the more popular ones. Read more.
Data: Roulette
Location: Oregon Ballroom 203
Krishna Sankar (Tata America International)
Average rating: ***..
(3.00, 3 ratings)
Algorithms are getting raunchier, tools more potent and competitions more intimate! Let us mix analytics tools (like R & Mahout) and a dash of algorithmics to work on BigData Analytics competitions and see if the answer is always 42. In the process we will explore and apply a few good algorithms, to the Heritage Health competition … Read more.
Data: Roulette
Location: Oregon Ballroom 204
Dhruv Bansal (Infochimps), Winnie Hsia (Infochimps)
You have an idea for an app. Great! First you have to munge and maintain the data. Did you know there is one data API to pull clean, updated data from multiple sources? It slices, it dices, it serves out data on geo, social & more! And you don't need even touch MySQL. Mash up some data with the Infochimps Data Scientists Jacob Perkins, Dhruv Bansal and Ham the Incredible Coding Chimp. Read more.
Event
Location: Expo Hall
Average rating: ***..
(3.27, 11 ratings)
Quench your thirst with vendor-hosted libations and snacks while you check out all the cool stuff in the expo hall. Read more.
Event
Location: See BoF Schedule for Locations
Average rating: *****
(5.00, 1 rating)
Birds of a Feather (BoF) sessions provide face to face exposure to those interested in the same projects and concepts. BoFs can be organized for individual projects or broader topics (best practices, open data, standards). BoFs are entirely up to you. We post your topic and provide the space and time. You provide the engaging topic. Read more.
Keynote
Location: Portland Ballroom
Jim Zemlin (The Linux Foundation)
Average rating: ****.
(4.28, 29 ratings)
On the eve of Linux’ 20th anniversary, Jim Zemlin invites the OSCON audience into his "Bizarro World” of 2011. The world of computing has been turned upside down. Microsoft’s stock is down. They now are filing anti-trust suits, not being the subject of them. Heck, Microsoft is even contributing code to Linux. And for good reason. Read more.
Keynote
Location: Portland Ballroom
Eri Gentry (BioCurious)
Average rating: ****.
(4.19, 31 ratings)
Join Eri Gentry, founder of BioCurious, the world’s first “hackerspace for biology” on a journey from garage biology to community lab. Read more.
Keynote
Location: Portland Ballroom
John Graham-Cumming (CloudFlare)
Average rating: ****.
(4.14, 21 ratings)
This talk tells the behind-the-scenes story of the apology campaign complete with source code, tips on dealing with the old-school media, how Twitter helped and didn't, and a call for people who want to change the world to be "reasonably unreasonable" because nothing ever gets done by the reasonable. Read more.
Keynote
Location: Portland Ballroom
Gabe Zichermann (Gamification.Co & Gamification Summit)
Average rating: ****.
(4.03, 33 ratings)
Creating engaging user experiences in software have become the mantra of businesses big and small - but what about open source? Do we do enough user-centric design and are we creating the kind of long-term user engagement we want? What are the challenges for open source advocates and developers to building truly engaging experiences and how can gamification make open-everywhere a reality? Read more.
Event
Location: See BoF Schedule for Locations
Birds of a Feather (BoF) sessions provide face to face exposure to those interested in the same projects and concepts. BoFs can be organized for individual projects or broader topics (best practices, open data, standards). BoFs are entirely up to you. We post your topic and provide the space and time. You provide the engaging topic. Read more.
Event
Location: Jupiter Hotel @ the Dream Tent
Average rating: ***..
(3.33, 3 ratings)
Thursday, July 28th, (mt) Media Temple Party! held at the Jupiter Hotel @ the Dream Tent with an Open Bar/All you can eat Tacos/DJ! Read more.