SQLBits 2015
Hadoop: From Hive with Stinger to Tez
For a long time, Hadoop and MapReduce were tightly linked. With the release of YARN, Hortonworks, Microsoft's Hadoop partner, launched the Stinger initiative to speed up Hive by 100x. But what is it? In this session I will show you.
For a long time, Hadoop and MapReduce were tightly linked.
Additional data manipulation libraries, like Hive, were added to query the
stored data more easily. But as data volumes and cluster sizes grew,
MapReduce became too slow, and with the release of Hadoop 2, YARN was
introduced to schedule the resources in a Hadoop cluster. In parallel with
the creation of YARN, Hortonworks, Microsoft's Hadoop partner, launched the
Stinger initiative to speed up Hive by 100x in three waves: the first was the
introduction of ORC files, the second wave optimized the query engine, and with
the third wave Tez was released. In this session we will look at all the
different aspects of Hive/Stinger/Tez, from the history and future to the internals and
practical use.
Speakers
Jan Pieter Posthuma's previous sessions
Extending Power BI With Your Own Custom Visual
Microsoft Power BI supports custom visuals from the Office Store (store.office.com). In this session I will demonstrate what you need to start building your own.
Hadoop: From Hive with Stinger to Tez
For a long time, Hadoop and MapReduce were tightly linked. With the release of YARN, Hortonworks, Microsoft's Hadoop partner, launched the Stinger initiative to speed up Hive by 100x. But what is it? In this session I will show you.
ETL with Hadoop and MapReduce
Big Data is hot! And the magic word is Hadoop. But what is it? And more importantly: what can I do with it? In this session we will cover some basics of Hadoop, MapReduce, Pig and Hive.