Personal schedule for David Kale

Download or subscribe to David Kale's schedule.

Hadoop
Location: E141/E142
Tags: cloud, hadoop
Please note: to attend, your registration must include Tutorials.
Aaron Kimball (Cloudera, Inc.)
Average rating: ***..
(3.29, 17 ratings)
Cloudera's Introduction to Hadoop provides a solid foundation for those seeking to understand large scale data processing with MapReduce and Hadoop. This session is appropriate for attendees who are new to Hadoop and are seeking to understand where Hadoop is appropriate and how it fits with existing systems. Read more.
Cloud Computing
Location: D139/140
Please note: to attend, your registration must include Tutorials.
Eric Day (craigslist), Brian Aker (HP)
Average rating: ***..
(3.87, 15 ratings)
Many people view topics like Map/Reduce and queue systems as advanced concepts that require in-depth knowledge and time consuming software setup. Gearman is changing all that by making this barrier to entry as low as possible with an open source, distributed job queuing system. This session dives into advanced use cases that demonstrate the power and flexibility of distributed architectures. Read more.
Hadoop
Location: E141/E142
Tags: cloud, hadoop
Please note: to attend, your registration must include Tutorials.
Aaron Kimball (Cloudera, Inc.)
Average rating: ***..
(3.38, 16 ratings)
Cloudera's Introduction to Hadoop provides a solid foundation for those seeking to understand large scale data processing with MapReduce and Hadoop. This session is appropriate for attendees who need to use Hadoop to analyze data with Hadoop's MapReduce paradigm. Read more.
Event
Location: Birds of a Feather
Average rating: ***..
(3.40, 5 ratings)
Following the planned sessions during the day, it's time for OSCON attendees to take the floor. BoFs are informal conversations that you and other participants plan. Visit the BoF page for more details and to sign up to lead a BoF of your own. Read more.
Cloud Computing
Location: Portland 251
Please note: to attend, your registration must include Tutorials.
Krishna Sankar (Tata America International)
Average rating: **...
(2.42, 26 ratings)
This tutorial will provide an in-depth tutorial on various forms of NOSQL (NotOnlySQL) datastores (key/value, data structure store, document store and wide column stores) for working with semi- structured data. The data ranges from web logs to social and knowledge graphs to configuration data stores for cloud infrastructures and other domains. Read more.
Hadoop
Location: E141/E142
Please note: to attend, your registration must include Tutorials.
Aaron Kimball (Cloudera, Inc.)
Hive is a powerful data warehousing application built on top of Hadoop which allows you to use SQL to access your data. This tutorial is appropriate for people that have experience with SQL and want to analyze large data sets using Hadoop and HiveQL. Read more.
Databases
Location: Portland 256
Please note: to attend, your registration must include Tutorials.
Eric Evans (Rackspace)
Average rating: ***..
(3.08, 25 ratings)
Moore's Law has run its course, yet despite the growing demands placed on databases, traditional solutions offer little alternative to vertical scaling. Come learn step-by-step how to use Apache Cassandra to turn a cluster of inexpensive commodity servers in to a massively scalable distributed datastore. Read more.
Hadoop
Location: E141/E142
Please note: to attend, your registration must include Tutorials.
Aaron Kimball (Cloudera, Inc.)
HBase is a distributed, sparse column-oriented store modeled after Google's BigTable and built on Hadoop's Distributed File System (HDFS). This talk will explain the use cases for using HBase and how to use it. Read more.
Products & Services
Location: E142
Tags: mysql
Giuseppe Maxia (Continuent)
Average rating: **...
(2.86, 7 ratings)
MySQL 5.1 has been GA for 18 months. It is reliable and efficient. Demanding users are also looking expectantly at the goodies offered by MySQL 5.5, available in beta, where more performance and features are in store. If speed is what you are looking for, you can have it today with MySQL 5.1, by using the InnoDB plugin, which is GA as of MySQL 5.1.47. Read more.
Databases
Location: Portland 255
Jonathan Ellis (DataStax)
Average rating: ***..
(3.79, 24 ratings)
NoSQL (or NOSQL -- Not Only SQL) is sometimes justly criticized for being too broad a category, but after thirty years of the relational database being the instinctive choice for data storage, publicizing the concept that One Size Does Not Fit All is a Good Thing. This talk will present some axes along which to evaluate database products, applied to some of today's popular NoSQL products. Read more.
Mark Hinkle (Citrix), John Willis (Enstratius), Alex Honor (ControlTier open source automation project)
Average rating: **...
(2.70, 20 ratings)
The proliferation of cloud computing is inevitable, hosted apps, software-as-as-service and now dynamic on-demand utility computing is becoming the norm. The session will be a “fire-side” chat style discussion of the types of challenges presented by IT management operations personnel and how they can manage cloud infrastructure using open source tools. Read more.
Databases
Location: Portland 255
Average rating: ***..
(3.73, 26 ratings)
Database scalability means different things to different people. Vertical vs. Horizontal scaling? Federating vs. Sharding? Despite the labels database scalability tends to fall into a few common patterns that anyone can apply. In this talk we'll discuss factors for applying these patterns including the life-cycle of your database, how hardware affects your choices, and tools to help you on the way Read more.
Alasdair Allan (The Thing System, Inc.)
Average rating: ***..
(3.40, 10 ratings)
The iPhone platform is surprisingly powerful, capable of performing fairly advanced feats of computer-vision in (near to) real-time. The talk walks attendees through the procedure of cross-compiling the OpenCV computer vision library for the iPhone Simulator and device hardware, and building a simple application to perform face recognition using the iPhone's camera. Read more.
Health
Location: F151
Arien Malec (HHS/ONC (Contractor))
Average rating: ***..
(3.80, 5 ratings)
NHIN Direct project is a collaboration between the U.S. government, providers, HIT vendors, and other experts to improve how the U.S. health care system handles digital patient data. This talk will cover the project, the Open Source software that exists to support the effort as well as what is still needed to make this successful and how you can get involved. Read more.
Jason Brittain (eBay Inc.)
Average rating: ***..
(3.33, 12 ratings)
You already use the open source Apache Tomcat servlet container to serve your web applications, and this presentation will show you how to secure your web application running on Tomcat. We'll cover security fixes that will give your web application production-ready security when running on Tomcat. Improve your web site's security through these best practice techniques. Read more.
Average rating: ***..
(3.00, 2 ratings)
Open source software developed by Tolven has incorporated principles for assuring privacy from the Health Record Banking Alliance in order to fulfill national requirements for privacy protection of health care information in the Netherlands. The RijnmondNet project provides a valuable model for securing exchange of personal health care information in the United States. Read more.
Matthew Russell (Digital Reasoning Systems)
Average rating: **...
(2.69, 13 ratings)
A non-classified case study that describes how we've built a stack based on MALLET, Hadoop/Cassandra, and Flare/Flex to build a highly scalable system for the U.S. intelligence community: MALLET lends itself to state of the art NLP, Hadoop/Cassandra yield a massively distributed back end, and Flare/Flex provide the tools for creating a great UI/UX capable of performing advanced analysis. Read more.
Health
Location: F151
Andrew Hart (NASA Jet Propulsion Laboratory), David Kale (Children's Hospital Los Angeles), Heather Kincaid (NASA Jet Propulsion Laboratory)
Average rating: ***..
(3.75, 4 ratings)
Medical informatics lags behind the progress of other “big data” domains, in large part because data is often held hostage in proprietary applications and schema. We present a grid software solution to this problem that utilizes NASA JPL’s Object Oriented Data Technology (OODT) and is being deployed at Children’s Hospital Los Angeles to enable new data-driven clinical decision support tools. Read more.
Databases
Location: E145/E146
Kevin Weil (Twitter, Inc.)
Average rating: ****.
(4.20, 15 ratings)
How does Twitter analyze its massive dataset? What tools do we use, and where do we focus our analysis? In this talk, I will discuss our transition from a MySQL-based to a Hadoop-based data infrastructure and our use of Pig (a scripting language built on top of Hadoop) to democratize big-data analysis across the company. I will present concrete examples of interesting analyses at each step. Read more.
Health
Location: F151
Tags: google, health, api
Roni Zeiger (Smart Patients)
Average rating: **...
(2.71, 7 ratings)
Google Health is an application with an open API, and its long term success depends on the developer community building useful applications that help people achieve their health goals. In this talk, we will describe this model and the role of developers who create specialized solutions - especially mobile ones - for people with specific health needs. Read more.
Robin Anil (Google)
Average rating: ***..
(3.62, 8 ratings)
Data is exploding all over the internet. There is immense knowledge within this huge volume of information that needs to be unlocked. We need to Mine patterns, Find clusters, Organize content and Predict the future. In this talk, we will show what these methods are and how the new Apache Mahout project is attempting to solve these problems in a scalable way by utilizing Hadoop. Read more.
Health
Location: F151
Tags: emr, vista
David Uhlman (clearhealth inc.)
Average rating: **...
(2.50, 2 ratings)
The VistA system created by the Department of Veterans Affairs is by most measures the most successful medical record ever devised. We'll take a detailed look at ClearHealth's multi-year odyssey of re-implementing VistA using contemporary languages, tools, and databases as well as insight into the core features and usability that make VistA so successful. Read more.
Sam Faus (Sujansky & Associates, LLC)
Average rating: ***..
(3.33, 3 ratings)
The Common Platform is an open source personal health data repository built on a Java-based SOAP web service architecture. Developed as part of the Robert Wood Johnson Project HealthDesign program, the design goal was to enable the development of personal health applications by providing a platform that supports the storage and access of personal health data for innovative analysis and display. Read more.
Health
Location: F151
Tags: health_it
Vaibhav Bhandari (Microsoft, Health Solutions Group)
Average rating: ****.
(4.25, 4 ratings)
This talk focuses on practical solutions for interfacing various HealthCare Silos (like Labs, Medications, Imaging and EMR systems) to Personally controlled HealthCare records (Microsoft HealthVault, Google Health, Dossia) and public health networks (PHIN). We will analyze and present relevant software solutions for working with ontologies, HealthCareIT Standards and data security regulations. Read more.
Teddy Bachour (Microsoft)
Average rating: ***..
(3.00, 2 ratings)
The Microsoft Connected Health Platform (CHP) provides open toolkits and guidance for the information and communication technology (ICT) community to help them speed architecture, design and deployment of interoperable, efficient, and scalable e-Health infrastructures and solutions for the health industry. Read more.
Health
Location: F151
Tags: emr, data, graphviz
David Uhlman (clearhealth inc.)
Average rating: ***..
(3.67, 3 ratings)
The ongoing nationwide adoption of EMR presents enormous new opportunities and challenges for collecting, analyzing and reporting data for patient outcome improvement, cost control, and efficiency in care. We'll take a look at a number of open tools available and techniques to apply them to healthcare data including neural nets, data visualization and statistical modeling. Read more.
  • Intel
  • Microsoft
  • Google
  • Facebook
  • Rackspace Hosting
  • (mt) Media Temple, Inc.
  • ActiveState
  • CommonPlaces
  • DB Relay
  • FireHost
  • GoDaddy
  • HP
  • HTSQL by Prometheus Research
  • Impetus Technologies Inc.
  • Infobright, Inc
  • JasperSoft
  • Kaltura
  • Marvell
  • Mashery
  • NorthScale, Inc.
  • Open Invention Network
  • OpSource
  • Oracle
  • Parallels
  • PayPal
  • Percona
  • Qualcomm Innovation Center, Inc.
  • Rhomobile
  • Schooner Information Technology
  • Silicon Mechanics
  • SourceGear
  • Symbian
  • VoltDB
  • WSO2
  • Linux Pro Magazine

Sponsorship Opportunities

For information on exhibition and sponsorship opportunities at the conference, contact Sharon Cordesse at scordesse@oreilly.com

Download the OSCON Sponsor/Exhibitor Prospectus

Media Partner Opportunities

Download the Media & Promotional Partner Brochure (PDF) for information on trade opportunities with O'Reilly conferences or contact mediapartners@ oreilly.com

Press and Media

For media-related inquiries, contact Maureen Jennings at maureen@oreilly.com

OSCON Newsletter

To stay abreast of conference news and to receive email notification when registration opens, please sign up for the OSCON Newsletter (login required)

OSCON 2.0 Ideas

Have an idea for OSCON to share? oscon-idea@oreilly.com

Contact Us

View a complete list of OSCON contacts