SQLBits 2013
ETL with Hadoop and MapReduce
Big Data is hot! And the magic word is Hadoop. But what is it? And more important: what can I do with it? In this session we will cover some basics of Hadoop, MapReduce, Pig and Hive.
Big Data is hot! And the magic word is Hadoop. But what is it? And more important: what can I do with it? And how can we use it in our traditional BI solutions?
Apache Hadoop is a collection of tools mainly used nowadays in the Big Data space. With Hadoop (and MapReduce, Pig and Hive) we can use it as part of our ETL process to get insight from our (un)structured data sources.
In this session we will cover these basics, but also introduce some internals of Hadoop.
Apache Hadoop is a collection of tools mainly used nowadays in the Big Data space. With Hadoop (and MapReduce, Pig and Hive) we can use it as part of our ETL process to get insight from our (un)structured data sources.
In this session we will cover these basics, but also introduce some internals of Hadoop.
Speakers
Jan Pieter Posthuma's previous sessions
Extending Power BI With Your Own Custom Visual
Microsoft Power BI supports custom visuals from the Office Store (store.office.com). In this session I will demonstrate what is needed for starting building your own.
Hadoop: From Hive with Stinger to Tez
For long Hadoop and MapReduce were linked together. With the release of YARN, Hortonworks, Microsofts Hadoop partner, launched the Stinger initiative to speedup Hive a 100x. But wat is it? In this session I will show you.
ETL with Hadoop and MapReduce
Big Data is hot! And the magic word is Hadoop. But what is it? And more important: what can I do with it? In this session we will cover some basics of Hadoop, MapReduce, Pig and Hive.