Skip to main content
Databases & Datastores

Databases & Datastores

No longer just the passive target of active code, data, data structures, and data stores often define the structure of programs. How should you store your data?

Add to your personal schedule
Cloud | Databases & Datastores
Portland 255
Tutorial Please note: to attend, your registration must include Tutorials.
Erik Rose (Mozilla), Laura Thomson (Mozilla Corporation)
Average rating: ****.
(4.36, 33 ratings)
Elasticsearch provides a powerful combination of clustered full-text search, synonyms, faceting, and geographic math, but there's a big gap between its documentation and real life. We'll tell hard-won war stories, work through hands-on examples, and show what happens behind the scenes, leaving you equipped to get the best use out of Elasticseach in your projects. Read more.
Add to your personal schedule
Databases & Datastores | Operations & System Administration
D139/140
Tutorial Please note: to attend, your registration must include Tutorials.
Patrick McGarry (Red Hat)
Average rating: ****.
(4.67, 3 ratings)
This Introduction to Ceph tutorial will include a mix of lecture and instructor-led demonstrations that will introduce students to the Ceph distributed storage system, the challenges it addresses, its architecture, and solutions it offers. Students will leave understanding how Ceph works, how it can be integrated with your services and applications, and how it works alongside OpenStack. Read more.
Add to your personal schedule
Databases & Datastores | Operations & System Administration
E143/144
Tutorial Please note: to attend, your registration must include Tutorials.
Colin Charles (SkySQL Ab/MariaDB Foundation)
Average rating: ***..
(3.25, 4 ratings)
The MySQL world is full of tradeoffs and choosing a High Availability (HA) solution is no exception. We demystify all the alternatives in an unbiased nature. Preference is of course only given to opensource solutions. Read more.
Add to your personal schedule
Databases & Datastores | Operations & System Administration
Portland 256
Tutorial Please note: to attend, your registration must include Tutorials.
Josh Berkus (PostgreSQL Experts, Inc.)
Average rating: ***..
(3.44, 9 ratings)
So, you’ve inherited a PostgreSQL server. Congratulations? This tutorial will cover the essential care and feeding of a Postgres server so that you can get back to your real job. Read more.
Add to your personal schedule
Databases & Datastores
E143/144
Tutorial Please note: to attend, your registration must include Tutorials.
Michael Hunger (Neo Technology)
Average rating: ****.
(4.71, 7 ratings)
This training offers the first step in building a good knowledge of graph databases, and covers the core functionality of the open source Neo4j graph database. With a mixture of theory and hands-on practice sessions, you will quickly learn how easy it is to work with a powerful graph database using Cypher as the query language. Read more.
Add to your personal schedule
Databases & Datastores | Java & JVM
E145/146
Tutorial Please note: to attend, your registration must include Tutorials.
Avi Bryant (Stripe)
Average rating: ****.
(4.50, 8 ratings)
Scalding is an open source framework developed at Twitter that provides a high level abstraction over Hadoop MapReduce, letting you concisely specify complex data analysis pipelines using simple Scala operations like map, filter, join, group, and sum. This introductory tutorial does not require experience with either Hadoop or Scala. Read more.
Add to your personal schedule
Databases & Datastores
Portland 256
Average rating: ****.
(4.00, 11 ratings)
Elasticsearch is about more than just search. It’s currently being used in production for everything from traditional text search, to big data analytics, to distributed document storage. This talk will introduce you to Elasticsearch’s REST API, and discuss the basics of full text search and analytics with Elasticsearch. Read more.
Add to your personal schedule
Kevin Kluge (Elasticsearch), Steve Mayzak (Elasticsearch)
Average rating: ****.
(4.86, 7 ratings)
Elasticsearch is an open-source document store known for enabling search and real-time analytics on large data sets. In this presentation we will walk through the development of an application that monitors the Parrot AR.Drone. This application will collect metrics from the drone and then transform them to JSON for storage and real-time analysis in Elasticsearch. Read more.
Add to your personal schedule
Databases & Datastores
Portland 256
Steve Francia (MongoDB)
Average rating: ***..
(3.88, 8 ratings)
Find out why some people claim Go and MongoDB are a 'pair made in heaven' and 'the best database driver they've ever used' in this talk by Gustavo Niemeyer, the author of the mgo driver, and Steve Francia, the drivers team lead at MongoDB Inc. Read more.
Add to your personal schedule
Databases & Datastores
Portland 256
Carina C. Zona (ZeroVM)
Average rating: *....
(1.50, 4 ratings)
Development challenges us to code for users’ personal world. Users give push-back to ill-fitted assumptions about their own name, gender, sexual orientation, important relationships, & other attributes that are individually meaningful. We'll explore how to develop software that brings real world into focus & that allows individuals to authentically reflect their personhood & physical world. Read more.
Add to your personal schedule
Databases & Datastores
Portland 256
John Griffith (SolidFire)
Average rating: ***..
(3.00, 1 rating)
In this session, SolidFire's John Griffith will review some of the key features included within OpenStack Block Storage to help achieve the enterprise storage functionality they require to host production applications in their cloud infrastructure. Read more.
Add to your personal schedule
Databases & Datastores
Portland 256
Curtis Poe (All Around The World)
Average rating: *****
(5.00, 5 ratings)
Many expert programmers who write complex SQL without a second thought still struggle with database design. Unfortunately, many introductory topics cause eyes to glaze over when we read 'transitive dependencies' and 'Boyce-Codd normal form'. When you're done with this talk, you'll understand the basics of creating a database that won't make a DBA yell at you. We won't even use (many) big words. Read more.
Add to your personal schedule
Databases & Datastores
Portland 256
Fangjin Yang (Metamarkets), Gian Merlino (Metamarkets)
Average rating: ****.
(4.62, 8 ratings)
The maturation and development of open source technologies has made it easier than ever for companies to derive insights from vast quantities of data. In this session, we will cover how to build a real-time analytics stack using Kafka, Storm, and Druid. This combination of technologies can power a robust data pipeline that supports real-time ingestion and flexible, low-latency queries. Read more.
Add to your personal schedule
Jim Tommaney (InfiniDB)
Average rating: ***..
(3.00, 1 rating)
The Global Database of Events, Language, and Tone (GDELT) is an initiative to construct a catalog of human societal-scale behavior and beliefs across all countries of the world. Analysis of this data set requires addressing typical data quality and data skew issues. Use a combined Hadoop + SQL on Hadoop stack to cleanse the data and deliver insights into the state of the world. Read more.
Add to your personal schedule
Databases & Datastores
Portland 256
Eric Tschetter (Tidepool.org)
Average rating: ****.
(4.33, 3 ratings)
This talk will focus on the motivation, design, and architecture of Druid (druid.io), an open-source, real-time analytical data store. Druid is used in production at several organizations to facilitate rapid exploration of high dimensional spaces. Druid can maintain a 95% query latency under 1 second on data sets with >50 billion rows and 2 trillion impressions in tables with 30+ dimensions. Read more.
Add to your personal schedule
Jeanne Holm (Jet Propulsion Laboratory, NASA)
Average rating: ****.
(4.17, 6 ratings)
The underpinnings of open government are transparency and citizen participation. In re-imagining a new Data.gov (the open data, open government initiative for the White House), this was taken to heart. This system was created using open source and with comments, issues, and commits worked with the public all along the way. Read more.
Add to your personal schedule
Michael Brewer (UGA: Franklin College OIT)
Average rating: ****.
(4.25, 4 ratings)
Have you tried some recursion in your SQL? In this session, we will go over the concept of Common Table Expressions (CTE), also known as WITH queries. We will explore syntax, features, and use cases for this powerful SQL construct. Read more.
Add to your personal schedule
Databases & Datastores
Portland 256
Gwen Shapira (Cloudera)
Average rating: **...
(2.50, 2 ratings)
Do you use Hadoop for large scale data analysis? Do your data scientists love R? This presentation will discuss the challenges of scaling R to multi-terabyte data sets and how RHadoop can be used to solve them. Read more.
Add to your personal schedule
Databases & Datastores
Portland 256
Alex Brandt (Rackspace Hosting)
Average rating: ***..
(3.17, 6 ratings)
There has been an explosion in datastore technologies. There are five main types of datastores: Relational, Column Family, Graph, Key-Value and Document. Polyglot Persistence, or the ability to have many different types of datastores interacting with one application, is becoming more prominent and beginning to take center stage. Read more.
Add to your personal schedule
Databases & Datastores
Portland 256
Henning Diedrich (Eonblast Corporation), Robert Virding (Erlang Solutions Ltd.)
Average rating: ***..
(3.67, 3 ratings)
Discover why Electronic Arts goes Erlang and hear about a powerful, reactive server architecture that supports a highly concurrent, analyzable and secure simulation stack for gaming. Learn how to easily script composable entities using a server environment purpose-built for event-driven programming, which is scalable under load, resilient and enables evaluation of huge data sets in real-time. Read more.
Add to your personal schedule
Databases & Datastores
Portland 256
Keith Fiske (OmniTI, Inc)
Average rating: ***..
(3.20, 5 ratings)
After using PostgreSQL for a while, you realize that there are missing features that would make it significantly easier to use in large production environments. Thankfully, it's extremely easy to make add-ons to enable some of those features right now, even without knowing C! This talk will discuss projects I've worked on and show how easy it is to make an impact in the PostgreSQL community. Read more.
Add to your personal schedule
Databases & Datastores
Portland 256
Byron Ruth (The Children's Hospital of Philadelphia)
Average rating: ****.
(4.00, 6 ratings)
There is an adage that given enough data, a data scientist can answer the world's questions. The untold truth is that the majority of work happens during the ETL and data preprocessing phase. In this talk I discuss Origins, an open source Python library for extracting and mapping structural metadata across heterogenous data stores. Read more.
Add to your personal schedule
Chad Naber (Intel), David Elfi (Intel Corporation)
Average rating: **...
(2.50, 2 ratings)
Do you know how long could it take to your team start producing value in the Big Data and Machine Learning area? This talk shows a real team experience starting from scratch to a functional Big Data and Machine Learning platform using several open source tools such as Apache Hadoop, Apache Hive and Python frameworks SciPy/Numpy/scikit-learn Read more.