SQLBits 2013

ETL with Hadoop and MapReduce

Big Data is hot! And the magic word is Hadoop. But what is it? And more importantly: what can I do with it? In this session we will cover some basics of Hadoop, MapReduce, Pig and Hive.
How can we use Hadoop in our traditional BI solutions?
Apache Hadoop is a collection of tools used mainly in the Big Data space. We can use Hadoop (together with MapReduce, Pig and Hive) as part of our ETL process to gain insight from our (un)structured data sources.
In this session we will cover these basics, but also introduce some internals of Hadoop.