SQLBits 2015

Hadoop: From Hive with Stinger to Tez

For long Hadoop and MapReduce were linked together. With the release of YARN, Hortonworks, Microsofts Hadoop partner, launched the Stinger initiative to speedup Hive a 100x. But wat is it? In this session I will show you.

For long Hadoop and MapReduce were linked together. Additional data manipulation libraries, like Hive, were added to query the stored data more easily. But with the growth of the amount of data and cluster sizes, MapReduce became too slow and with the release of Hadoop2 YARN was introduced to schedule the resources in a Hadoop cluster. Parallel with the creation of YARN Hortonworks, Microsofts Hadoop partner, launched the Stinger initiative to speedup Hive a 100x in three waves: the first van the introduction of the ORC files, the second wave optimized the query engine and with the third wave Tez was released. In this session we will look at all the different aspects of Hive/Stinger/Tez, from the history and future to the internals and practical use.

Jan Pieter Posthuma's previous sessions

Extending Power BI With Your Own Custom Visual

Microsoft Power BI supports custom visuals from the Office Store (store.office.com). In this session I will demonstrate what is needed for starting building your own.

Hadoop: From Hive with Stinger to Tez

ETL with Hadoop and MapReduce

Big Data is hot! And the magic word is Hadoop. But what is it? And more important: what can I do with it? In this session we will cover some basics of Hadoop, MapReduce, Pig and Hive.

Hadoop: From Hive with Stinger to Tez

Speakers

Jan Pieter Posthuma

azurebi.jppp.org

Jan Pieter Posthuma's previous sessions