Personal schedule for Cole Tuininga
Download or
subscribe to Cole Tuininga's
schedule.
Hadoop
Location: E141/E142
Please note: to attend, your registration must include
Tutorials.
Cloudera's Introduction to Hadoop provides a solid foundation for those seeking to understand large scale data
processing with MapReduce and Hadoop. This session is appropriate for attendees who are new to Hadoop and
are seeking to understand where Hadoop is appropriate and how it fits with existing systems.
Read more.
Hadoop
Location: E141/E142
Please note: to attend, your registration must include
Tutorials.
Cloudera's Introduction to Hadoop provides a solid foundation for those seeking to understand large scale data
processing with MapReduce and Hadoop. This session is appropriate for attendees who need to use Hadoop to
analyze data with Hadoop's MapReduce paradigm.
Read more.
Moderated by: Michael Widenius
MariaDB is a database server that offers drop-in replacement functionality for MySQL. MariaDB is built by some of the original authors of MySQL, with assistance from the broader community of Free and open source software developers. MariaDB offers a rich set of feature enhancements including alternate storage engines, server optimizations, and patches.
Read more.
Python
Location: D137/138
Please note: to attend, your registration must include
Tutorials.
Although Python programs may be slow for certain types of tasks, there are many different ways to improve performance. This tutorial will introduce optimization strategies and demonstrate techniques to implement them.
Participants will learn how to decide what might be the optimal solution for a certain performance problem. Participants are strongly recommended to bring laptops.
Read more.
Hadoop
Location: E141/E142
Please note: to attend, your registration must include
Tutorials.
HBase is a distributed, sparse column-oriented store modeled after Google's BigTable and built on Hadoop's
Distributed File System (HDFS). This talk will explain the use cases for using HBase and how to use it.
Read more.
Databases
Location: Portland 256
Please note: to attend, your registration must include
Tutorials.
Moore's Law has run its course, yet despite the growing demands placed
on databases, traditional solutions offer little alternative to vertical
scaling. Come learn step-by-step how to use Apache Cassandra to turn a
cluster of inexpensive commodity servers in to a massively scalable
distributed datastore.
Read more.
NoSQL (or NOSQL -- Not Only SQL) is sometimes justly criticized for being too broad a category, but after thirty years of the relational database being the instinctive choice for data storage, publicizing the concept that One Size Does Not Fit All is a Good Thing. This talk will present some axes along which to evaluate database products, applied to some of today's popular NoSQL products.
Read more.
We were fortunate this past year to develop two of the larger Django applications out there – in the span of 12 weeks: michaelmoore.com and Santa Fe Institute's santafe.edu. Between the two, these sites have multiple layers of memcached caching, multiple web servers and database servers, integrated site search (Lucene/SOLR and Google GSA), DjangoCMS, and integrations with iCal and Alfresco.
Read more.
Database scalability means different things to different people. Vertical vs. Horizontal scaling? Federating vs. Sharding? Despite the labels database scalability tends to fall into a few common patterns that anyone can apply. In this talk we'll discuss factors for applying these patterns including the life-cycle of your database, how hardware affects your choices, and tools to help you on the way
Read more.
Sharding is a hot topic. Every big web site is using some sharding technique with home made solutions. The quest for the silver bullet goes on without apparent good results. This session will present two MySQL storage engines (Spider and Vertical partitioning) that implement transparent sharding techniques.
Read more.
MongoDB (from "humongous") is a high-performance, open source, schema-free document-oriented database.
Read more.
The need for database systems that scale efficiently has led to many alternatives to the traditional RDBMS. This talk presents an overview of these new non-relational databases, collectively referred to as "NoSQL," followed by an in-depth examination of SourceForge.net's deployment of MongoDB, an open-source NoSQL database.
Read more.
Moderated by: Gabrielle Roth
Get together with other PostgreSQL users & developers!
Read more.
Launch an API that can survive! Learn about unexpected load recovery techniques, analytic best practices and testing approaches to make sure your API runs smoothly & thrives with these tips from the trenches. Clay Loveless is Mashery's Chief Architect, the leading API management solution provider. With over 100 high-volume API customers, Mashery manages a broad range of enterprise API deployments.
Read more.
MySQL users have an insatiable need for speed, capacity, and availability, all at a reasonable cost. This session will provide technical overview of the approach that Schooner engineering took to optimize MySQL Enterprise and InnoDB with flash memory, multi-core processors, and DRAM to achieve an 8x improvement in performance relative to existing systems.
Read more.
Does Python have Design Patterns? You bet! Whatever the misguided meme going around is claiming to the contrary, every field of human endeavor has Patterns, and so of course does Python. This talk shows how and why, recapping what Patterns are all about, Design patterns in particular, and presenting examples of how they work best in Python, both singly and as part of a Language of Patterns.
Read more.
There are a number of toolkits available that make it much easier than ever before to design delightful, intuitive user interfaces for the terminal window. This talk will explore several options for Python, including cmd, curses, newt/snack and urwid. I'll compare the different approaches for different application domains, and show some shortcuts for the impatient.
Read more.
Perl
Location: Portland 256
Have something you want to say to many people? Want 5 minutes to do it? This is your chance.
Want to see 16 speakers on a variety of topics? This is your session.
Read more.
Computers are getting wider, not faster. If you want your code to run faster, it has to have some parallelism. This is hard, and threads probably aren't the answer. There is a lot of new concurrency technology on the scene. This talk surveys the 2010 state of the art in tools to empower developers to write concurrent code, and makes some predictions.
Read more.
One piece of software we've found to be particularly useful in scaling our site is Scribe, an open source system for aggregating massive amounts of logging data from thousands of machines, or more generally moving around large amounts of data in an asynchronous and mostly-reliable way.
Read more.
Perl
Location: Portland 256
So you want your code to run faster. This talk is for you. We're going to discuss some of the low-hanging fruit of optimization -- a few things that will make most Perl programs run significantly faster. We'll cover common bottlenecks, efficient usage of popular CPAN modules, and more.
Read more.