Personal schedule for Clive Boulton
Download or
subscribe to Clive Boulton's
schedule.
Location: Oregon Ballroom 203/204
Opening remarks by the OSCON Data program chairs, Sarah Novotny and Bradford Stephens.
Read more.
Keynote
Location: Oregon Ballroom 203/204
Dive into the distributed system that powers OkCupid’s match searches. Learn how we use C++, event-based programming, and SSDs to solve problems that crop up when building a high performance, high availability distributed system.
Read more.
Keynote
Location: Oregon Ballroom 203/204
Keynote by Benjamin Black, Co-founder, fast_ip.
Read more.
Keynote
Location: Oregon Ballroom 203/204
It's 2021. You have a petabyte drive on your keychain, your startup company leases bulk cloud storage by the exabyte, and you have a million cores for data crunching. You even can have your own copy of the entire world's public semantic data. What do you do with it? If you're not sure yet, I've got plenty of ideas for you.
Read more.
Keynote
Location: Oregon Ballroom 203/204
An open microphone question and answer session with the morning's keynote speakers.
Read more.
We're being surrounded by data: Open government data, streaming media, and data we're creating as we track our lives and connect with our communities. Learn how to leverage easy to use tools to combine this together for our personal and organization decision making without requiring complex processes or training.
Read more.
Adding security to an existing product is never easy, but our team at Yahoo added strong authentication to Apache Hadoop by integrating it with Kerberos. This project was delivered on time and is currently deployed on all of Yahoo's 40,000 Hadoop computers. Come learn how we added security to and why it matters.
Read more.
In this workshop, one of the core MongoDB committers will present the fundamental principles of MongoDB, how to set up and interact with the database, and what to consider when building applications using a document-based data model.
Read more.
What happens when you write data to disk? We'll explore everything between your programming language and the spinning platters - both optimizations and dangerous pitfalls.
Read more.
Redis is an entry in the new breed of nosql databases. But it takes a different approach that makes it much more interesting then most of the other key/value stores in the same category. Come learn what makes redis so useful that it seems everyone is adding it to their toolbox.
Read more.
This talk introduces an open-source SQL-based system for continuous or ad-hoc analysis of streaming data built on top of Flume-based data collection for Hadoop.
Attendees will understand how to use a new tool to extend their Hadoop data collection pipeline with real-time streaming analytics.
Read more.
We produce gorgeous LaTeX reports while harnessing the power of R on the backend. The data is pulled from our PostgreSQL database, the analysis and visualizations are fast and distributed thanks to Redis. We'll talk about weaving together open source tools to build powerful analytics reporting engines that rival the commercial alternatives.
Read more.
Event
Location: Gather (Double Tree Hotel bar)
Join other Android developers for happy hour at Gather in the Double Tree Hotel on Monday evening. Meet face-to-face and share experiences with other developers working on Android. The first 100 people there get a free drink ticket.
Read more.
Event
Location: Oregon Ballroom
If you had five minutes on stage what would you say? What if you only got 20 slides and they rotated automatically after 15 seconds? Would you pitch a project? Launch a web site? Teach a hack? We’re going to find out when we conduct our third Ignite event at OSCON.
Read more.
Event
Location: See BoF Schedule for Locations
Birds of a Feather (BoF) sessions provide face to face exposure to those interested in the same projects and concepts. BoFs can be organized for individual projects or broader topics (best practices, open data, standards). BoFs are entirely up to you. We post your topic online and onsite and provide the space and time. You provide the engaging topic.
Read more.
Location: Oregon Ballroom 203/204
Opening remarks by the OSCON Data program chairs, Sarah Novotny and Bradford Stephens.
Read more.
Keynote
Location: Oregon Ballroom 203/204
Much has been made of scalability as a driver for choosing a database,
but the choice of a database influences much more than the scaling
architecture. Different database choices drive different data models
which in turn influence the development process.
Read more.
Keynote
Location: Oregon Ballroom 203/204
Keynote by Adrian Cockcroft, Cloud Architect, Netflix.
Read more.
Keynote
Location: Oregon Ballroom 203/204
We love data, and today we generate data in astronomical amounts.
When we hit save on a document, snap a photo, or fill out a form online, we
want to know that this data will persist, and we want to know that we can
share, access, or reference it in the future.
For any meaningful use, we need to how data relates to other data.
Read more.
Keynote
Location: Oregon Ballroom 203/204
The first OSCON Data Innovation Award winner will be announced.
Read more.
The last few years have brought a wealth of new data technologies organized around horizontal scalability. This talk will cover the essential infrastructure areas: real-time stream processing, offline data crunching, large-scale data deployments and live serving. The focus will be on how these ingredients come together to enable innovative data-driven products at LinkedIn.
Read more.
In this talk, we will introduce a simple formula for all Big Data applications: Big Data = Fast Data + Deep Data. Through a use-case format, we will discuss the specialized requirements for real-time (“fast”) and analytic (“deep”) data management.
Read more.
Whether you're a beginner Web guy or a veteran DBA, whether you get hands dirty with any code or just manage systems, you still must know algorithms. How come? Because that knowledge enables you to optimize your work, conduct correct benchmarks, and make educated decisions. We'll show you how knowing only a little about SQL internals can help so much with tuning things.
Read more.
One of the challenges that comes with moving to MongoDB is figuring how to best model your data. While most developers have internalized the rules of thumb for designing schemas for RDBMSs, these rules don't always apply to MongoDB.
Read more.
This talk will cover lessons learned in building Urban Airship's large-scale data warehouse in EC2 including PostgreSQL, Kafka, Cassandra, HBase and Hadoop.
Read more.
This language-agnostic proposal focuses upon concepts and strategies critical to the design and implementation of asynchronous systems and data processing layers. Key components include a survey of implementation strategies for non-blocking edge tiers, patterns for building out a distributed worker / processing tier, along with several horror stories of cascading failures and their resolution.
Read more.
A talk about how to scale foursquare using MongoDB and Scala.
Read more.
Location: Portland Ballroom
Keynotes today will be shared by OSCON, OSCON Data, and OSCON Java.
Read more.
Keynote
Location: Portland Ballroom
In this new keynote, Jono Bacon, author of The Art of Community (O'Reilly),
founder of the Community Leadership Summit and award-winning Community
Manager for the global Ubuntu community, talks about the new
opportunities and challenges we face in understanding the art and
science of community leadership.
Read more.
Keynote
Location: Portland Ballroom
Keynote
Location: Portland Ballroom
The world is changing, and so is Microsoft. We are continuing down the path of even greater openness and interoperability in new ways . . . not just in development, but rising to meet the challenges and opportunities of the cloud and becoming flexible and nimble in the world of mobile.
Read more.
Keynote
Location: Portland Ballroom
From launching robots into space to discovering distant galaxies: how people are creating open source space exploration and hacking science.
Read more.
Keynote
Location: Portland Ballroom
The popularity of NoSQL opens up an endless array of possible uses but also causes its own set of problems. Riak, a NoSQL offering created by Basho solves this by claiming to have no single point of failure. Proving this goes a long way to dispelling the concerns within an enterprise to begin adopting a non-relational solution.
Read more.
CouchDB is a document-oriented database that uses JSON documents, has a RESTful HTTP API, and employs map/reduce views for querying data. This tutorial will teach web developers the concepts they need to get started using CouchDB in their projects. Libraries are available for CouchDB’s RESTful HTTP API in many programming languages and we will take a look at some of the more popular ones.
Read more.
Algorithms are getting raunchier, tools more potent and competitions more intimate! Let us mix analytics tools (like R & Mahout) and a dash of algorithmics to work on BigData Analytics competitions and see if the answer is always 42. In the process we will explore and apply a few good algorithms, to the Heritage Health competition …
Read more.
The open source database landscape has never been in more turmoil, and yet the popularity of Postgres continues to grow and grow. Get up to speed on what you need to know to administer the world's most advanced open source database, including installation, configuration, tuning, and how best to use PostgreSQL's community resources; with special focus on Postgres 9 and the upcoming 9.1 release.
Read more.
This hands-on tutorial aims at learning the basics of the important machine learning algorithms in Mahout. It aims to help you get it up and running on a Hadoop cluster. Mahout is open source implementation of a collection of algorithms designed from ground up to sift through terabytes of data and help bring out important patterns which are otherwise not in the reach of standard tools.
Read more.
You have an idea for an app. Great! First you have to munge and maintain the data. Did you know there is one data API to pull clean, updated data from multiple sources?
It slices, it dices, it serves out data on geo, social & more! And you don't need even touch MySQL.
Mash up some data with the Infochimps Data Scientists Jacob Perkins, Dhruv Bansal and Ham the Incredible Coding Chimp.
Read more.
Event
Location: Expo Hall
Quench your thirst with vendor-hosted libations and snacks while you check out all the cool stuff in the expo hall.
Read more.