Sponsors

Sponsorship Opportunities

For information on exhibition and sponsorship opportunities at the convention, contact Sharon Cordesse at scordesse@oreilly.com

Download the OSCON Data Sponsor/Exhibitor Prospectus

Media Partner Opportunities

For information on trade opportunities with O'Reilly conferences, contact mediapartners@oreilly.com

Press and Media

For media-related inquiries, contact Maureen Jennings at maureen@oreilly.com

OSCON Bulletin

To stay abreast of convention news and announcements, please sign up for the OSCON email bulletin (login required)

Contact Us

View a complete list of OSCON contacts

OSCON: Data 2011 Speakers

New speakers are being confirmed regularly. Please check back often to see the latest additions to the program.

Brian Aker (HP)

Brian has spent his life working on the details of how to build and scale out
systems. He is currently working on a new MicroKernel designed MySQL
called Drizzle and is building the plumbing required for a new generation of
large scale computer deployment. He also spends time working on Apache
Modules, Memcached, and Gearman.

In the past, he has been involved with projects for the Army Engineer Corps, The VirtualHospital, Splunk, MySQL, Slashdot, and was a Distinguished Engineer at Sun Microsystems. He calls Seattle his home since that is where his dog Rosalynd is.

Andrew Aksyonoff (Sphinx Technologies)

Andrew is the author of Sphinx, an open-source full-text search engine designed especially for indexing databases.

Siddharth Anand (LinkedIn)

Siddharth “Sid” Anand is a software architect/developer with deep experience building and scaling high-traffic web sites. He is currently an engineer in the Search Infrastructure team @ LinkedIn. Before joining search, Sid spent some time in LinkedIn’s Analytics Platform team building self-service Hadoop-based analytics. Prior to joining LinkedIn, Sid held technical and leadership positions at Netflix, Etsy, eBay, and Siebel Systems.

Outside of work, Sid helps venture-backed startups with issues such as managing for growth – scaling people, traffic, and data.

He speaks at a few conferences every year and is a PMC for QCon SF.

Scott Andreas (Boundary Inc.)

Scott Andreas is an Engineer at Boundary, Inc. hell-bent on quality, efficiency, and performance in highly-concurrent network programming and asynchronous distributed systems.

With a background in Java, Scala, and Ruby and new production deployments in Erlang, his current work involves designing, implementing, and deploying event processing systems with a team of engineers to create a real-time distributed network analytics platform. Previously at Urban Airship, Andreas worked with a team to design and implement a scalable mobile messaging platform backed by a clustered service designed to serve millions of concurrent clients on a handful of commodity servers.

Red-lining services under development, measuring results, and quantifying improvements in terms of infrastructure cost and business value are a favorite pastime.

Robin Anil (Google)

Robin is a Committer at the Apache Software Foundation where he works with the Mahout Machine Learning community. He is also a co-author of “Mahout in Action” by Manning Publications, a book on how Mahout is used to perform Machine learning on Terabytes of data with ease.

He used to be a Tech Lead on the ML infrastructure for Minekey Inc, a Valley-based startup focusing on recommendations and behavioral targeting for publisher content. He was introduced to the newly born Mahout community through the Google Summer of Code program while he was a dual-degree student at IIT Kharagpur. Since then, he has been trying to model machine learning algorithms into the Map/Reduce format and has successfully merged his Complementary Naive Bayes...

Dhruv Bansal (Infochimps)

Dhruv Bansal cofounded Infochimps in 2008 after realizing that there was no single place to easily find updated data online. He is currently pursuing his PhD in Physics from the University of Texas at Austin.

Josh Berkus (PostgreSQL Experts)

Josh Berkus has been a member of the PostgreSQL Core Team since 2003 and has been working as a database consultant since 1998. His consulting specialties include:

  • Database design
  • Performance tuning and troubleshooting
  • Stored procedure debugging
  • Training
  • Reporting and BI
  • ETL and data cleaning

In addition to his PostgreSQL skills, Josh also:

  • Manages projects
  • Advises companies on working with open source communities
  • Speaks at 6 to 12 conferences a year, worldwide
  • Is an expert in SQL Server 2000
  • Co-founded the San Francisco PostgreSQL User Group

Josh’s work experience includes 8 years of consulting on database applications, primarily building applications for the legal and HR industries and performance tuning. He was also head of Sun Microsystems’ PostgreSQL...

Jeremy Bingham (Dailykos.com)

Jeremy Bingham is the technical director for DailyKos.com, a progressive Democratic weblog that is one of the largest sites in the political blogosphere.

He has worked for Daily Kos for the last seven years handling the technology side of the operation, and was a co-founder of SB Nation, a network of sports-related sites.

Benjamin Black (Boundary)

Mr. Black most recently cofounded and was CEO of Boundary. He is a
recognized expert in large-scale technology infrastructure design,
implementation, and operation. Previously, he was an engineering
director in the ECN group at Microsoft Windows Live, where he helped
build one of the largest content delivery networks in the world. As
the manager of website engineering at Amazon, he led the design and
implementation of a new, layer 3 network for Amazon’s production
network. As principal engineer for Amazon infrastructure, he
co-authored the first documents on what later became EC2. As senior
manager for information security, he owned and successfully delivered
Sarbanes-Oxley and PCI DSS compliance for Amazon infrastructure,
including the...

Andy Blyler (Barracuda Networks)

Andy is a husband, programmer, system administrator, entrepreneur, musician, private pilot, & optimist. He is the lead software engineer on the Barracuda Networks Backup product.

Roger Bodamer (10gen)

Roger heads the West Coast Operations for 10gen, the company that
develops and supports the open source database MongoDB. He has over 20 years of experience of building and delivering great and innovative
products to market and has deep expertise and knowledge of database
architectures and internals. Roger holds several patents for database
and middleware technology. His experience leading product development
and engineering teams includes 12 years with Oracle’s Database and
Application Server development organization where he pioneered
products that delivered heterogeneous interoperability, as well as
several years as SVP of product operations and engineering at Apple’s
PowerSchool division. Roger also held leadership positions at OuterBay
and Efficient Frontier. He earned a Bachelor’s degree...

Adrian Cockcroft (Battery Ventures)

Adrian Cockcroft has had a long career working at the leading edge of technology. He’s always been fascinated by what comes next, and he writes and speaks extensively on a range of subjects. At Battery, he advises the firm and its portfolio companies about technology issues and also assists with deal sourcing and due diligence.

Before joining Battery, Adrian helped lead Netflix’s migration to a large scale, highly available public-cloud architecture and the open sourcing of the cloud-native NetflixOSS platform. Prior to that at Netflix he managed a team working on personalization algorithms and service-oriented refactoring.

Adrian was a founding member of eBay Research Labs, developing advanced mobile applications and even building his own homebrew phone, years before iPhone and Android launched. As a distinguished...

Jean-Daniel Cryans (Cloudera)

Jean-Daniel is a Database Engineer at StumbleUpon. When he’s not developing HBase or supporting its usage inside the company, he’s helping others with the Hadoop stack. Jean-Daniel has been a committer on the Apache HBase project since 2008.

Justin Dow (Mozilla Corporation)

Justin, known to most people as “jabba” (a longtime nickname and his IRC nick), works as a Systems Administrator for Mozilla, managing the Infrastructure team. In his spare time he likes tinkering with classic cars and spending time with his family.

Aurelian Dumitru (Dell, Inc)

Aurelian “A.D.” Dumitru is the Dell | Hadoop chief architect. In that role he is responsible for all architecture decisions and long-term strategy for Hadoop. A.D. has over 20 years of experience. He has been with Dell for more than 11 years in various engineering, architecture, and management positions. His background is in hyperscale massively parallel compute systems. His interests are in automated process control, intelligent processes, and machine learning. Over the years he has authored or made significant contributions to more than 20 patent applications, from RFID and automated process controls to software security and mathematical algorithms.

Ted Dunning (MapR Technologies)

Ted Dunning is Chief Application Architect at MapR Technologies and a committer and PMC member for the Apache Mahout project. He contributes to the Mahout clustering, classification, and matrix decomposition algorithms. He was the chief architect behind the MusicMatch (now Yahoo Music) and Veoh recommendation systems and built fraud detection systems for ID Analytics.

Ted Dziuba (eBay Local/Milo.com)

Co-founder of Milo.com, a local shopping search engine, which was acquired by eBay in December 2010. Now lead engineer for eBay Local.

Jonathan Ellis (DataStax)

Jonathan is CTO and co-founder at DataStax (formerly Riptano). Prior to DataStax, Jonathan worked extensively with Apache Cassandra while employed at Rackspace. Prior to Rackspace, Jonathan built a multi-petabyte, scalable storage system based on Reed-Solomon encoding for backup provider Mozy. In addition to his work with DataStax, Jonathan is project chair of Apache Cassandra.

Greg Fodor (Etsy)

Greg Fodor is currently an engineer on Etsy’s “data wranglers” team, responsible for building products around ‘big data’ at Etsy.

Bill Fox J.D., M.A. (LexisNexis)

Bill Fox, JD, MA, is Senior Director of Healthcare at LexisNexis Risk Solutions. Prior to LexisNexis, Mr. Fox was Vice President of Program Integrity at MAXIMUS Federal. He was a Partner at the law firm of Post and Schell, and was Deputy Chief of Economic and Cyber Crime at the Philadelphia District Attorney’s office and a Special Assistant U.S. Attorney in Philadelphia. Mr. Fox is a Senior Fellow at the Jefferson School of Population Health in Philadelphia and a frequent speaker on topics of healthcare fraud, compliance and quality. He has also served as a tactical strength and conditioning consultant to the U.S. Navy SEALs.

LexisNexis Risk Healthcare Solutions develops and implements innovative data driven technology solutions across the healthcare continuum with a focus...

Brendan Gregg (Netflix)

Brendan Gregg is a senior performance architect at Netflix, where he does large scale computer performance design, analysis, and tuning. He is the author of the book “Systems Performance”, and recipient of the USENIX 2013 LISA Award for Outstanding Achievement in System Administration. He has previously worked as a performance and kernel engineer, and has created performance analysis tools included in multiple operating systems, as well as visualizations and methodologies.

Jeff Hamann (Forest Informatics)

Dr. Jeff Hamann, president of Forest Informatics, is obsessed with analyzing data from, developing tools for, and presenting collaborative, geek-friendly stories and solutions for forests, foresters, and forest owners.

He has developed econometrics and forest simulation packages for the R statistical computing environment, has authored a book for the Use-R series from Springer called “Forest Analytics with R”, and loves to combine the incongruent worlds of tech, humans, and everything that separates them.

He holds a Bachelor of Science in Forestry from Humboldt State University, and an MS and PhD in Forest Biometrics and Forest Engineering from Oregon State University.

His passions include Forest Management, Operations Research, Computer Science, Geographic Information Systems, Financial Analysis, and of course Open Source Software.

Tom Hanlon (Cloudera)

Tom Hanlon is currently an instructor at Cloudera where he delivers courses on the wonders of the Hadoop ecosystem.

Before beginning his relationship with Hadoop and large distributed data, he had a happy and lengthy relationship with MySQL with a focus on web operations.

He has been a trainer for MySQL, Sun, and Percona.

Russell Hanson (RSI/Harvard/TCIN)

Russell Hanson was born in Palo Alto, CA on May 2, 1981 and grew up in California and Bloomington, IN. From 1999 to 2003 he studied in Portland, Oregon at Reed College, completing a bachelor’s degree in Physics. In 2005 he received an M.S. in Biology from Georgia Tech and his PhD studies were in Chemistry at Boston University. His research interests are varied and span machine learning and statistical learning, chemical engineering, biophysical and medicinal chemistry, cancer and cancer biology, quantitative finance, algorithmic trading and technology, synthetic biology and computational biology, quantum computation and information, natural language processing, and mathematical logic and recursion theory. Past academic appointments include Harvard, MIT, and the Technical University of Berlin. An entrepreneur and businessman, in 2008 he founded SciColab.com...

Harry Heymann (foursquare)

Harry Heymann is the Engineering Lead for foursquare. Before joining foursquare, Harry spent five years at Google where he worked on Google Payments, Dodgeball and various backend advertising systems. Previously, he held roles at Microsoft and Intel. Harry has a bachelor’s degree in engineering from Carnegie Mellon University. In addition to pursuing his obvious interest in mobile technologies, Harry is currently obsessed with exploring the Scala programming language.

Noel Hidalgo (World Economic Forum)

Noel Hidalgo works at the intersection of politics, community, technology and art. An advocate for free and open government data, open source software, open communities, free culture, Dutch utility bicycles and transparent government, he is an established global leader in progressive political and technology communities and an Eagle Scout. He is a member of the Royal Society of Arts (UK), a member of the British Council’s Transatlantic Network 2020, and on the board of advisors for Digital Democracy, a non-profit in New York City. He is one of three co-organizers of New Yorkʼs only meetup dedicated to open government, the Open New York Forum.

Bradley Holt (Found Line)

Bradley Holt is a web developer, entrepreneur, free/open source software contributor, community facilitator, speaker, and an author. He is the co-founder of Found Line, a creative studio with capabilities in web development, web design, and print design. He is a minor contributor of source code, bug reports, or documentation to several free/open source projects including PHP and Zend Framework. He organizes the Burlington, Vermont PHP Users Group and is a co-organizer of Vermont Code Camp. He has spoken at SXSW Interactive, OSCON, OSCON Data, the jQuery Conference, ZendCon, and CouchConf. He is the author of Writing and Querying MapReduce Views in CouchDB and Scaling CouchDB, both published by O’Reilly Media.

He blogs at bradley-holt.com...

Winnie Hsia (Infochimps)

Winnie is the Community Manager for Infochimps. She has been a career-long advocate of community building and was the pioneer for social media initiatives at Whole Foods Market and Infosys. She has been quoted in publications including The Wall Street Journal, The Financial Times, Business Week and Mashable. She won the Shorty Award for Best Brand for Whole Foods Market in 2010.

In her spare time, she runs Austin’s premier aerial arts collective, Sky Candy, and works to make the world a more connected and happier place.

John Hugg (VoltDB)

John Hugg is one of the architects of the VoltDB database, where he spends his day building open source, scalable and enterprise ready transaction processing tools.

He has spent his professional career working on non-traditional solutions to data management problems, including large-dataset, non-parametric statistics, column-oriented analytics systems, cloud deployments of RDBMSs and XML databases.

Philipp Janert (Principal Value, LLC)

After previous careers in physics and software development, Philipp K. Janert currently provides consulting services for data analysis, algorithm development, and mathematical modeling.

He is the author of two books on data analysis: “Data Analysis with Open Source Tools” (O’Reilly) and “Gnuplot in Action – Understanding Data with Graphs” (Manning Publications).

He holds a Ph.D. in theoretical physics from the University of Washington. Visit his company website at www.principal-value.com

Haidong Ji (Self-employed)

Haidong “Alex” Ji is a hacker who loves to tinker and automate things. During the day, he works with databases, Python, C#, and PowerShell. At night he hacks with whatever tools are necessary, Python and web scraping of late to support a charity. In between, he enjoys spending time with his family and two wonderful dogs.

Richard Kelm (Sphinx Search)

From humble beginnings focused on storing and maintaining data in MySQL, Richard matriculated to Sphinx Search, where he’s learned the value of synthesizing mountains of text and attributes into meaning for individuals to consume.

Graylin Kim (New York State Senate)

Graylin is a recent graduate of Rensselaer Polytechnic Institute and a member of the Rensselaer Center for Open Source Software at RPI. He officially joined the development team at the NY Senate CIO in June of 2011 and has been working informally with them since the summer of 2010.

Aaron Kimball (Zymergen, Inc.)

Aaron Kimball is the CTO of Zymergen, Inc. Zymergen uses high-throughput techniques, combined with big data analysis, to improve genetic strains for microbial chemical production. Aaron has been working with Hadoop since 2007. In 2008 he was Cloudera’s first employee, where he wrote Apache Sqoop and MRUnit and delivered a lot of Hadoop training. In 2010, Aaron founded WibiData and assumed the role of Chief Architect. WibiData helps organizations build big data applications. Aaron holds a BS in Computer Science from Cornell University and an MS in Computer Science from the University of Washington.

Jeffrey Kirkell (Project Management Institute)

Jeff Kirkell is a technologist, geek, overall agile fanboy, and aspiring cynic who spends his time formulating ways to take the best bits of open source and proprietary technologies to create systems that wow the business side with capabilities. Considering himself more an artist than engineer, and priding himself in having installed more data and application systems on his laptop than anyone he knows, Jeff usually works in the Rich Internet Application space. Other interests include Triple-Stores, Key-Value stores, and various SemWeb goodness, in addition to destroying systems for testing purposes and forcing widespread open source adoption.

Rusty Klophaus (Basho Technologies)

Rusty Klophaus is a Senior Software Engineer at Basho Technologies where he focuses on building distributed, fault-tolerant applications to store and retrieve Big Data. He is currently on the core engineering team of Riak Search and is the lead engineer designing Secondary Indices. Before joining Basho, Rusty launched an Erlang-based startup; before that he managed multi-million dollar technology products and guided multiple project teams at an Internet advertising company based in New York City. When he’s not hacking, Rusty plays guitar and organizes the Hacker News Meetup Group of Washington, DC.

Jay Kreps (Confluent)

Jay is the co-founder and CEO of Confluent, a company focused on Apache Kafka.

Previously Jay was one of the primary architects for LinkedIn where he focused on data infrastructure and data-driven products.

He was among the original authors of a number of open source projects in the scalable data systems space, including Voldemort, Azkaban, Kafka, and Samza.

Patrick Lightbody (New Relic)

Patrick runs product management for New Relic, the leader in software analytics. Prior to that, Patrick started two startups in the website monitoring and load testing space, which were acquired by Gomez and Neustar, respectively.

Patrick is an avid open source contributor, having founded OpenQA, created Selenium Remote Control, and co-created Struts 2.

Patrick has held management and software engineering positions with Jive Software, Spoke Software, and Cisco Systems.

Ryan Lowe (Percona)

Ryan Lowe is a high performance consultant for Percona, where he advises clients on performance and scalability best practices for a variety of application architectures. In his free time, he likes to blog, play with agent-based simulation models and travel the world.

Kate Matsudaira (SEOmoz)

Kate Matsudaira is Vice President of Engineering at SEOmoz, where she is responsible for managing the core technology team. Prior to SEOmoz, she filled the role of VP Engineering at another startup, Delve Networks (acquired by Limelight). At Delve she helped create and monetize a very large distributed system used for online video delivery and video search. Prior to that she worked at other leading technology companies like Amazon.com, Microsoft, and Sun Microsystems.
Kate has extensive knowledge of building large scale distributed web systems, web services, and search. Kate has a B.S. in Computer Science from Harvey Mudd College, and has completed graduate work at the University of Washington in both Business and Computer Science (M.S.).

Yoshinori Matsunobu

Yoshinori Matsunobu is a database and infrastructure architect at DeNA (http://www.dena.jp/en/index.html), living in Tokyo. Yoshinori’s primary responsibility at DeNA is to make our database infrastructure more reliable, faster and more scalable. Before joining DeNA, Yoshinori worked at MySQL/Sun/Oracle as a lead consultant in APAC for four years. Yoshinori has written eight MySQL related technical books so far and has published technical articles about MySQL, Linux, and Java for a monthly database magazine since 2004.

Dwight Merriman (10gen)

Dwight is CEO of 10gen which sponsors the open source MongoDB project. Dwight is an active MongoDB committer.

Dwight was cofounder of DoubleClick and CTO of DoubleClick for 10 years. Dwight is the original architect of the DoubleClick DART ad serving system.

Bruce Momjian (EnterpriseDB)

Bruce Momjian is a co-founder of the PostgreSQL Global Development Group, and has worked on PostgreSQL since 1996. He is the author of PostgreSQL: Introduction and Concepts, published by Addison-Wesley. Bruce is employed by EnterpriseDB. Previously, he was employed by SRA Japan and Great Bridge LLC, both PostgreSQL support companies. He has spoken at many international open-source conferences. Prior to his involvement with PostgreSQL, Bruce worked as a consultant, developing custom database applications for some of the world’s largest law firms. As an academic, Bruce was a high school computer science teacher for five years, holds a Masters in Education, and is currently an adjunct professor at Drexel University.

Gleicon Moraes (7co.cc)

Gleicon Moraes is a developer using Python, Go and Erlang who loves distributed systems, non-relational databases and OSS. His day job is delivery and operations manager.

Arun Murthy (Hortonworks Inc.)

Arun is the lead of the next generation MapReduce project in Apache Hadoop. Arun has been a full-time contributor to Apache Hadoop since its inception in 2006. He is a long-time committer and member of the Apache Hadoop PMC and jointly holds the current world sorting record using Apache Hadoop. Prior to co-founding Hortonworks, Arun was responsible for all MapReduce code and configuration deployed across the 42,000+ servers at Yahoo!. In essence, he was responsible for running Apache Hadoop’s MapReduce as a service for Yahoo!. Follow Arun on Twitter: @acmurthy.
He is directly responsible for every bit of code and configuration of Map-Reduce deployed at over 40,000 machines running Apache Hadoop at Yahoo. He jointly holds the world-record for sorting data using Hadoop...

Peter Neubauer (Neo Technology)

Peter is co-founder of a number of popular Open Source projects such
as Neo4j, Tinkerpop, OPS4J and Qi4j. Peter loves connecting things,
writing novel prototypes and throwing together new ideas and projects
around graphs and society-scale innovation. Right now, Peter is
concentrating on turning Open Source projects into profitable
enterprises at Neo Technology, the company sponsoring the development
of Neo4j, the Graph Database. Also, Peter is a Mentor helping startups
at Startupbootcamp Copenhagen and organizing events like
http://www.thoughtmade.com and TEDx Öresund.

Sarah Novotny (NGINX)

Sarah Novotny is a technical evangelist and community manager for NGINX. Novotny has run large scale technology infrastructures as a Systems Engineer and a Database administrator for Amazon.com and the ill-fated Ads.com. In 2001, she founded Blue Gecko, a remote database administration company, with two peers from Amazon. Blue Gecko was sold to DatAvail in 2012. She’s also curated teams and been a leader in customer communities focused on high availability web application and platform delivery for Meteor Entertainment and Chef.

Novotny regularly talks about technology infrastructure and geek lifestyle. She is additionally a program chair for O’Reilly Media’s OSCON. Her technology writing and adventures as well as her more esoteric musings are found at sarahnovotny.com.

Owen O'Malley (HortonWorks)

Owen O’Malley is a software architect on Hadoop working for HortonWorks, which is a new startup focusing on Hadoop development. Prior to co-founding HortonWorks, he and the rest of the HortonWorks team worked at Yahoo developing Hadoop. He has been contributing patches to Hadoop since before it was separated from Nutch, and was the original chair of the Hadoop Project Management Committee. Before working on Hadoop, he worked on Yahoo Search’s WebMap project, which builds a graph of the known web and applies many heuristics to the entire graph that control search. Prior to Yahoo, he wandered between testing (UCI), static analysis (Reasoning), configuration management (Sun), and software model checking (NASA). He received his PhD in Software Engineering from University of California, Irvine.

Erik Onnen (Urban Airship)

Erik is a Senior Engineer and Analytics team lead at Urban Airship, the leading provider of Push, In-App Purchase and Subscription services for mobile applications. He specializes in distributed systems at scale. Prior to joining Urban Airship, Erik was a platform lead for Jive Software where he led the development of the activity and recommendation engine services.

David Pacheco (Joyent)

David Pacheco is the lead engineer of Joyent’s Introspection Team, which develops Cloud Analytics and other tools for observing software in the cloud. Previously a member of Sun’s Fishworks team, David worked on several areas of the Sun Storage 7000 series of appliances including remote replication, fault management, and flash device support.

Josh Patterson (Cloudera)

Master’s Thesis: self-organizing mesh networks
Published in IAAI-09: TinyTermite: A Secure Routing Algorithm

Conceived, built, and led Hadoop integration for the openPDC project at TVA (Smartgrid stuff). Led small team which designed classification techniques for timeseries and Map Reduce. Open source work at http://openpdc.codeplex.com

Now: Solutions Architect at Cloudera

Noah Pepper (Lucky Sort)

CEO of Lucky Sort, a Portland-based startup building an intuitive and powerful platform for realtime text analysis using visual analytics and machine learning.

Tom Quisel (OkCupid)

Tom Quisel is OkCupid’s Director of Technology. For the past four years, he’s been devoted to helping millions across the world find love through smarter algorithms and better data analysis. Currently, Tom is giving his all learning to lead the phenomenally talented technical team at OkCupid. The path is as littered with pitfalls as it is with opportunities, but he wouldn’t trade it for any other position in the world. Tom has a B.S. in Computer Science from Carnegie Mellon where he focused on Statistics and Machine learning. Tom also studied bioinformatics at EPFL in Switzerland, where he co-authored a paper on extracting meaningful information from large but noisy biological datasets. In his free time Tom loves to hike the hills of upstate New...

Inaam Rana (Oracle)

Inaam Rana is a core member of the InnoDB development team at MySQL. He joined the InnoDB team in April 2007. His area of focus has been performance and scalability of the InnoDB storage engine. He has been mostly working on the buffer cache layer, low level concurrency and OS layer. Inaam has had a chance to contribute to most of the scalability features that were included in the InnoDB plugin. In his previous work experience he was involved in development of IBM’s DB2 LUW and PostgreSQL database engines. Inaam currently lives in Toronto, Canada.

Krishna Sankar (Blackarrow.tv)

Krishna Sankar is a Chief Data Scientist at blackarrow.tv, where he is focusing on enhancing user experience via inference, intelligence & interfaces. Earlier stints include Principal Architect/Data Scientist at Tata America Intl, Director of Data Science at a bioinformatics startup, and Distinguished Engineer at Cisco. He has been speaking at various conferences (OSCON, PyCon, PyData) about predicting NFL [http://goo.gl/QCpaO8], Spark [http://goo.gl/E4kqMD], Data Science [http://goo.gl/9pyJMH], Machine Learning [http://goo.gl/SXF53n], and Social Media Analysis [http://goo.gl/D9YpVQ], and has been guest lecturing at the Naval Postgraduate School. His other passion is Lego Robotics – you will find him at the St. Louis FLL World Competition as Robots Design Judge.

Theo Schlossnagle (OmniTI/Circonus)

Theo Schlossnagle is a Founder and Principal at OmniTI where he designs and implements scalable solutions for highly trafficked sites and other clients in need of sound, scalable architectural engineering. He is the architect of the highly scalable Ecelerity mail transport agent. Theo is a participant in various open source communities including OpenSolaris, Linux, Apache, PostgreSQL, perl, and many others. He is a published author in the area of scalability and distributed systems as well as a veteran speaker in the open source conference circuit.

Theo founded several successful startups as engineering focused organizations including: OmniTI, Circonus, Message Systems and Fontdeck.

Jonathan Seidman (Orbitz Worldwide)

Jonathan has spent more than 15 years as a software developer, with a focus in the last few years on processing large data sets using tools such as Hadoop. Currently, Jonathan is a Lead Engineer on the Business Intelligence/Big Data team at Orbitz Worldwide. Jonathan is also a co-founder and organizer of the Chicago Hadoop User Group and founder of the Chicago Big Data User Group.

Corey Shields (Mozilla Corporation)

Corey is a manager for Mozilla’s IT Systems team. He has been an open source advocate since first being introduced to Slackware on an InfoMagic CD in the mid-’90s.

Before Mozilla, Corey worked as a Unix supercomputing administrator for Indiana University and as a systems manager for the Open Source Lab at Oregon State University.

Benoit Sigoure (StumbleUpon, Inc.)

Benoit is a software engineer with a strong UNIXy/Linux background. He specializes in designing, writing & running large-scale distributed serving systems that serve millions of users. He has a deep understanding of the entire technology stack (including Google’s), from on-wire protocols and low-level implementation details all the way up to high-level designs used in high-availability distributed systems (both in software and in the datacenter).

Benoit designed and implemented OpenTSDB, an open-source, highly scalable, distributed monitoring system.

Adam Silberstein (Yahoo!)

Adam Silberstein is a Research Scientist in the Systems group at Yahoo! Research. His main area of interest is large-scale data management, including work in online serving systems, batch processing systems, and management of social data. His work has appeared in top database conferences such as SIGMOD, VLDB and ICDE. Prior to joining Yahoo!, Adam completed his Ph.D. at Duke University in 2007.

Nicolas Spiegelberg (Facebook)

Nicolas Spiegelberg is a storage engineer on the Facebook messaging team. He helped implement the HBase storage solution for Facebook Messages from design to deployment. Additionally, Nicolas is an HBase committer and PMC member who has contributed many critical features such as HDFS data reliability, Bloom filters, and an enhanced compaction algorithm.

UA-Huntsville: Master’s in Computer Engineering

Bradford Stephens (Drawn to Scale)

Bradford is the founder and CEO of Drawn to Scale, creators of the Spire database. Spire is a SQL database built on Hadoop and HBase, similar to Google F1. Drawn to Scale has customers powering large web apps, mobile infrastructures, telecoms, social networks, and more. A long-time user of Hadoop and HBase, Bradford has built large infrastructures at various startups and enterprises, and worked on Microsoft SQL Server. He holds degrees in Computer Science and Political Science, and spent several years as a Campaign Manager in politics at the Presidential and U.S. House levels.

Homer Strong (Lucky Sort)

Homer Strong is a data hacker in Portland. His background is in statistics, but he knows how to wrangle a Unix terminal too.

Calvin Sun (Twitter)

Calvin Sun is a senior engineering manager at Twitter, managing MySQL development. Prior to that, he managed the InnoDB team at Oracle. Calvin was a development manager at MySQL from 2006 to 2008, managing MySQL storage engines and relationships with third-party storage engine vendors. He has more than 15 years of database development experience, mostly on various storage engines.

Lars Thalmann (Oracle)

Dr. Lars Thalmann is the development director for MySQL replication, backup, and connectors. He is responsible for the strategy and development of these features and leads the corresponding engineering teams. Thalmann has worked with MySQL development since 2001, when he started as a MySQL Cluster developer. More recently, he has driven the creation and development of the MySQL backup feature, has guided the evolution of MySQL replication since 2004, and has been a key player in the development of MySQL Cluster replication. Thalmann holds a doctorate in Computer Science from Uppsala University, Sweden. He is one of the authors of the “MySQL High Availability” book.

Laura Thomson (Mozilla Corporation)

Laura Thomson is a Senior Software Engineer at Mozilla Corporation on the web team, after spending much of the previous decade as a consultant and trainer on various Open Source technologies.

Laura is the co-author of “PHP and MySQL Web Development” and “MySQL Tutorial”. She is a veteran speaker at Open Source conferences worldwide.

Robert Treat (OmniTI)

With more than fifteen years of experience building database-backed, Internet-based systems at multiple Fortune 500 companies, Robert is now CEO of OmniTI, a technical services firm focused on providing web application development and infrastructure management at scale. An author and speaker at conferences worldwide, Robert is a recognized expert within the industry on topics such as Open Source, databases, and managing operations at scale. He occasionally blogs at http://xzilla.net.

Andrew Turner (GeoIQ)

Andrew Turner is the CTO at GeoIQ, the company behind GeoCommons, a geospatial visualization and analytics platform. He co-founded Mapufacture, a personalized geospatial search and aggregation system that was acquired by FortiusOne in August 2008 in order to combine real-time feeds with large GIS datasets. Andrew is focused on collaboration and user-generated content around location and time. He is actively involved in open-data projects such as OpenStreetMap and VoteReport, as well as open-source projects like Mapstraction and GeoPress. He regularly speaks at conferences on the benefits of open-source software and geospatial standards to communities and organizations. Andrew wrote the O’Reilly shortcut “Introduction to Neogeography” and the “Trends in Where2.0” business report in Spring 2008. He is also published in MacTech and Make magazine on...

Ramesh Venkataramaiah (Orbitz Worldwide)

Ramesh is a member of the Operations and Engineering Team at Orbitz Worldwide with a focus on analysis of distributed, high-availability systems in the travel data domain. His passion is the fusion of distributed, multi-active datacenter infrastructure, parallel processing, and analysis platforms like R for Big Data. He believes that the next generation of data science and support infrastructure, in combination with many Machine Learning methodologies, is a must for the smooth operation of enterprise data centers and private clouds. He received a Ph.D. in Ocean Engineering from Texas A&M University with a focus on instrumentation and sensor platforms.

Tom White (Cloudera)

Tom White has been an Apache Hadoop committer since February 2007, and is a member of the Apache Software Foundation. He is the author of “Hadoop: The Definitive Guide” for O’Reilly. Previously he worked as an independent consultant specializing in Hadoop, and before that was co-founder and Lead Developer at Kizoom, a UK mobile application startup. Tom has a Bachelor’s degree in Mathematics from the University of Cambridge, and a Master’s degree in History and Philosophy of Science from the Universities of Leeds, UK, and Florence, Italy.

Christine White

My research interests center on spatial data infrastructure, with emphasis on the study of geography (B.S. and M.S. in Geography from University of South Carolina). I have worked at Esri in Redlands, CA for four years as a consultant, helping large organizations manage and share their spatial data. Last fall, our company decided to release the Esri Geoportal Server product as 100% open source, under the Apache 2.0 license on SourceForge. Since then, I’ve enjoyed learning about and participating hands-on in the open source community, and being a contributor instead of just a beneficiary of open source solutions.

Tom Wilkie (Acunu Ltd)

Tom leads the engineering team at Acunu. He previously worked in a range of engineering roles at both Citrix and XenSource, managing key customer projects with major enterprises. As one of the first employees at XenSource, he played a key part in developing the XenServer management stack. He holds an MA in Computer Science from Cambridge University, where he was also a research assistant.

Jared Williams (New York State Senate)

Jared is part of the Open Government movement started by the New York State Senate. He began as an intern in February of 2010 and was hired on in August of that year. He graduated with a BS in Computer Science from SUNY Albany.

Steve Yegge (Google)

Steve Yegge is a software engineer with over twenty years of industry experience. He has worked on everything from embedded systems to distributed systems to e-commerce, online games and much more. Steve has been at Google for the past eight years, where he has worked on Ads and Music systems, and most recently on a project to organize and serve all the world’s compiler output. Prior to joining Google, Steve was a Senior Engineering Manager at Amazon.com, where he spent nearly seven years leading teams in Developer Tools and Customer Service systems. Steve graduated from the University of Washington with a B.S. in Computer Science.

Ezra Zygmuntowicz (VMware Inc)

Ezra Zygmuntowicz has been a Rubyist for 9 years now. Over the course of that time he has written many open source projects, including Merb (which was merged with Rails to help build a better architecture for Rails 3.0), Nanite, redis-rb, and many more. He wrote the first book on Rails deployment for the Pragmatic Programmers and also launched one of the first commercial Rails sites at the end of 2004 (http://yakima-herald.com).

Ezra has gravitated towards the deployment and tooling of Ruby applications. He cofounded Engine Yard in 2006 and has spoken at every single RailsConf except for last year.

Ezra is currently working for VMware helping to build a new cloud operating system that will redefine the way people deploy and manage their development and production...