Personal schedule for Shane Curcuru
Download or
subscribe to Shane Curcuru's
schedule.
Hadoop
Location: E141/E142
Please note: to attend, your registration must include
Tutorials.
Cloudera's Introduction to Hadoop provides a solid foundation for those seeking to understand large scale data
processing with MapReduce and Hadoop. This session is appropriate for attendees who are new to Hadoop and
are seeking to understand where Hadoop is appropriate and how it fits with existing systems.
Read more.
Hadoop
Location: E141/E142
Please note: to attend, your registration must include
Tutorials.
Cloudera's Introduction to Hadoop provides a solid foundation for those seeking to understand large scale data
processing with MapReduce and Hadoop. This session is appropriate for attendees who need to use Hadoop to
analyze data with Hadoop's MapReduce paradigm.
Read more.
Operations
Location: Portland 256
Please note: to attend, your registration must include
Tutorials.
Internet traffic spikes aren't what they used to be. It is now evident that even the smallest sites can suffer the attention of the global audience. This presentation dives into techniques to avoid collapse under dire circumstances. Looking at some real traffic spikes, we'll pinpoint what part of the architecture is crumbling under the load; then, walk though stop-gaps and complete solutions.
Read more.
Databases
Location: Portland 256
Please note: to attend, your registration must include
Tutorials.
Moore's Law has run its course, yet despite the growing demands placed
on databases, traditional solutions offer little alternative to vertical
scaling. Come learn step-by-step how to use Apache Cassandra to turn a
cluster of inexpensive commodity servers in to a massively scalable
distributed datastore.
Read more.
NoSQL (or NOSQL -- Not Only SQL) is sometimes justly criticized for being too broad a category, but after thirty years of the relational database being the instinctive choice for data storage, publicizing the concept that One Size Does Not Fit All is a Good Thing. This talk will present some axes along which to evaluate database products, applied to some of today's popular NoSQL products.
Read more.
Are you the 'point' person for your team? Do you have sweaty palms, headaches, and a calendar full of meetings? You may have an affliction called 'manager'. This condition is treatable through analysis and therapy. We'll examine how you may have arrived at this state and how you can once again regain your self-respect and that of your peers. Hear real-life stories of both good and bad leadership.
Read more.
You already use the open source Apache Tomcat servlet container to serve your web applications, and this presentation will show you how to secure your web application running on Tomcat. We'll cover security fixes that will give your web application production-ready security when running on Tomcat. Improve your web site's security through these best practice techniques.
Read more.
Behind the scenes of many successful open source projects is a team of elves who keep the critical project infrastructure (mailing lists, websites, networks, mirrors, etc.). How does Apache run Apache? How does kernel.org run Linux? Learn some of their secrets in this session as the folks behind the curtain come out and share their experiences with the OSCON community.
Read more.
How does Twitter analyze its massive dataset? What tools do we use, and where do we focus our analysis?
In this talk, I will discuss our transition from a MySQL-based to a Hadoop-based data infrastructure and our use of Pig (a scripting language built on top of Hadoop) to democratize big-data analysis across the company. I will present concrete examples of interesting analyses at each step.
Read more.
Apache Traffic Server is an Open Source project implementing a caching HTTP proxy server, donated to the Apache Foundation by Yahoo! We will examine the technical details behind TS, what it's good for, and how you can configure it to accelerate your web traffic.
Read more.
Data is exploding all over the internet. There is immense knowledge within this huge volume of information that needs to be unlocked. We need to Mine patterns, Find clusters, Organize content and Predict the future. In this talk, we will show what these methods are and how the new Apache Mahout project is attempting to solve these problems in a scalable way by utilizing Hadoop.
Read more.