Data

Today’s system architectures embrace many flavors of data: relational, NoSQL, big data and streaming

Add to your personal schedule
Location: Portland 255
Steve Francia (10gen)
Average rating: ***..
(3.76, 21 ratings)
This hands-on session will introduce the audience to building applications with MongoDB - the open source document-oriented NoSQL database. The tutorial will take the user through building a simple location-based (like foursquare) from start to finish. Attendees will finish the session with a working application they use to check into locations around Portland from any HTML5 enabled phone! Read more.
Add to your personal schedule
Location: E145-146
Hadley Wickham (Rice University / RStudio)
Average rating: ****.
(4.38, 21 ratings)
Learn the basics of R for data science: what makes R special as a language, and what R packages are most important for data manipulation, visualisation and modelling. Read more.
Add to your personal schedule
Location: Portland 252
John David Duncan (Oracle Corp.), Craig L Russell (Oracle Corporation)
Average rating: **...
(2.00, 17 ratings)
A tutorial on setting up MySQL Cluster 7.2 and developing hybrid SQL/NoSQL applications using the Cluster/J and Memcached APIs. Read more.
Add to your personal schedule
Location: F150
Tags: openstack
Average rating: **...
(2.29, 14 ratings)
Monty Taylor, manager of Automation and Deployment at HP, will be our guest speaker and will be running a lab session. This will be an in-depth, hands-on session on how to set-up OpenStack. We'll walk through setting up devstack, with the end result of creating a working OpenStack development environment by the end of the night. Read more.
Add to your personal schedule
Location: E145-146
Krishna Sankar (Tata America International)
Average rating: **...
(2.33, 12 ratings)
Social media has become the true mirror of the society & no doubt, Twitter is silver behind the glass. An understanding of the underlying network models reflected by the tweets & associated metadata enables one to infer and predict. In this tutorial, we will derive domain metrics like Cliques and Brand Rank by applying SNA principles via Twitter APIs. Read more.
Add to your personal schedule
Location: E145-146
Tags: postgresql
Christophe Pettus (PostgreSQL Experts, Inc.)
Average rating: ****.
(4.25, 8 ratings)
You have your shiny new PostgreSQL source tarball or package, but what to do with it? In one intense tutorial, we'll go through everything need to install, configure, and maintain your new, tuned, replicated, back-uped PostgreSQL installation. Read more.
Add to your personal schedule
Location: D135
Jeremie Miller (Singly), Thomas Muldowney (Singly)
Average rating: ***..
(3.75, 4 ratings)
Learn how to build apps on a unified open source API combining data from Facebook, Twitter, Google, Github, Foursquare, Instagram, Tumblr, Linkedin, Fitbit, Wordpress, Runkeeper, Dropbox, and more, includes hands-on hack time to get a working dev environment up and running. Read more.
Add to your personal schedule
Location: Portland 252
Arun Murthy (Hortonworks Inc.)
Average rating: ***..
(3.00, 14 ratings)
The Apache Hadoop project is becoming the de-facto big-data platform. The community is gearing up the first major release of Hadoop in over 2 years. This talk will cover the major highlights of the release and also the mechanics of what it takes to deliver a major Hadoop release. Arun C Murthy is VP, Apache Hadoop at ASF and the Release Manager for this release. Read more.
Add to your personal schedule
Location: Portland 252
Nathan Marz (Twitter)
Average rating: ****.
(4.46, 13 ratings)
Storm is an open-source realtime computation system relied upon by Twitter for much of its analytics. Storm does for realtime computation what Hadoop did for batch computation. It has a huge range of applications and combines ease of use with a robust foundation. Since being open-sourced, Storm has been adopted by over 25 companies. Read more.
Add to your personal schedule
Location: Portland 252
Dave Revell (Urban Airship), Nate Putnam (Urban Airship )
Average rating: ***..
(3.29, 7 ratings)
Turning billions of events into near-realtime analytics is hard. Urban Airship collects events from hundreds of millions of mobile apps and turns them into meaningful analytics using open source technology like Hadoop, Kafka and HBase. We’ll cover near-realtime big data scaling techniques from the architectural level to the operational level. Read more.
Add to your personal schedule
Location: Portland 252
Charles Bell (Oracle)
Average rating: **...
(2.25, 4 ratings)
Building sensor networks, while challenging, can be a data rich endeavor. But what do you do with all of the data you collect? How do you store and make sense of the results? Where do you store the information? This session explores the options available and demonstrates how to store the data in a database system for easy retrieval. Read more.
Add to your personal schedule
Location: Portland 252
Tags: php, nosql, mongodb
Steve Francia (10gen)
Average rating: ***..
(3.75, 8 ratings)
It is common to use multiple systems as part of the infrastructure of an application, but it’s sometimes unclear to developers when to use MongoDB alongside a relational database and what the best practices are. This presentation will introduce MongoDB, make the case for hybrid applications, and outline several real-world examples of such applications. Read more.
Add to your personal schedule
Location: Portland 252
Nate McCall (Apigee)
Average rating: ****.
(4.50, 2 ratings)
Integrating a distributed database with standard test-driven development techniques can be next to impossible, especially the breadth and complexity of failure scenarios that need to be created. This Session, led by Nate McCall of DataStax, will show attendees how to make the best of the open source utilities and projects available for integrating Apache Cassandra with your testing environment. Read more.
Add to your personal schedule
Location: D136
Moderated by: Peter Zaitsev
Database backed Full-Text Search (MySQL) and why companies like Craigslist, LivingSocial, and Boardreader from a technical perspective have chosen to utilize Sphinx. Read more.
Add to your personal schedule
Location: Portland 252
Kim Rees (Periscopic)
Average rating: ***..
(3.00, 11 ratings)
Data, data everywhere, but not a structured bit. Open data is all the rage, but often this data is poorly formatted or not very accessible. This session will discuss various ways to pry open the oyster of public data. Read more.
Add to your personal schedule
Location: Portland 252
Average rating: ***..
(3.50, 2 ratings)
The web consists of free-form links, and Google has excelled at quickly searching through this information. But, finding structured data, such as databases, spreadsheets, and tables is hard: they contain few links into and out of these documents. This talk discusses some of our efforts to find and present this data (focusing on government-generated), making it universally accessible and useful. Read more.
Add to your personal schedule
Location: Portland 252
Calvin Sun (Twitter)
Average rating: **...
(2.33, 3 ratings)
This is a general session on InnoDB; give a brief overall of InnoDB architecture and its main features; Discuss the current state of InnoDB; also covers InnoDB roadmap. Read more.
Add to your personal schedule
Location: Portland 252
Luís Soares (Oracle)
Average rating: ****.
(4.33, 3 ratings)
This session presents how can MySQL replication be used in advanced setups for aggregating data from multiple masters, scaling out to hundreds of servers or even to integrate data into more esoteric slaves like non-relational stores. Read more.
Add to your personal schedule
Location: Portland 252
Matthew Soldo (Heroku, Inc)
Average rating: ***..
(3.33, 15 ratings)
Recent shifts in the tech world - including PaaS, cloud-services, and NoSQL - have dramatically altered the manner in which software is written, deployed, and run. This talk will discuss how PostgreSQL fits into - and can potentially take advantage of - this world. Read more.
Add to your personal schedule
Location: Portland 252
Pat Patterson (Salesforce.com)
Average rating: ***..
(3.88, 8 ratings)
This session provides an overview of PostgreSQL 9.1 Foreign Data Wrappers, a mechanism for retrieving data from remote data sources. We will contrast the native C interface with the Python interface provided via the Multicorn project. A real-world example will retrieve business data from salesforce.com and combine it with data held in native PostgreSQL tables using a simple SQL JOIN. Read more.
Add to your personal schedule
Location: Portland 252
Peter Zaitsev (Percona Inc)
Average rating: ***..
(3.60, 5 ratings)
MySQL's configuration file is often the focus of too much attention, and too much tweaking of variables that make no difference -- or worse, have the potential to negatively impact performance. The sample default configuration files that come with MySQL are unfortunately not very helpful or good, either. We'll looking in creating a better one in this session. Read more.
Add to your personal schedule
Location: D135
Bill Fox J.D., M.A. (LexisNexis), Jo Prichard (LexisNexis Risk Solutions)
Average rating: ****.
(4.40, 5 ratings)
In this session, two case studies will be presented on leveraging Big Data and an open source Big Data processing platform to detect relationships at levels not previously detected. This session will give a behind-the-scenes look at how to program rapid data delivery queries with Big Data to solve real world problems along with anecdotal examples from the field. Read more.
Add to your personal schedule
Location: Portland 252
Tags: optimizer
Bruce Momjian (EnterpriseDB)
Average rating: ****.
(4.33, 6 ratings)
The optimizer is the "brain" of the database, interpreting SQL queries and determining the fastest method of execution. This talk uses the explain command to show how the optimizer interprets queries and determines optimal execution. Read more.
Add to your personal schedule
Location: E144
Ian Plosker (Basho Technologies, Inc)
Average rating: **...
(2.60, 5 ratings)
Watch as data models compete for the top prize. Who will win? Contestants will be judged on performance, ease of querying, and scalability. Join us to find out who will be America's Next Top Data Model. Read more.
Add to your personal schedule
Location: Portland 252
Andreas Kollegger (Neo Technology)
Average rating: ****.
(4.11, 18 ratings)
In this session, Andreas Kollegger will take you on a whirlwind tour of the current NoSQL landscape. He'll give a crash course overview of the four main categories of NoSQL databases, and discuss what's currently lacking to make the enterprise adopt NoSQL, and how to solve it. Read more.
Add to your personal schedule
Location: E144
Leon Stein (Decide)
Average rating: *****
(5.00, 2 ratings)
These days it is not uncommon to have 100s of gigabytes of data that needs to be sliced and diced then delivered fast and rendered quickly. This talk seeks to cover some strategies for caching large data sets without tons of expensive hardware, but through software and data design. Read more.

Sponsors

For information on exhibition and sponsorship opportunities at the conference, contact Sharon Cordesse at (707) 827-7065 or scordesse@oreilly.com.

View a complete list of OSCON contacts