Big Data is here to stay, no doubt about that. And Hadoop is THE framework for Big Data. But what is Hadoop? Is it a friend or foe for SQL Server DBA? Let’s explore what Hadoop is and how to make it work together with SQL Server.
I will discuss Hadoop features, components and extensions (Map-Reduce, YARN, Spark, Hive and Impala). I will show how to start a proof-of-concept Hadoop project on a small budget, including hardware, software, network, installation, configuration, testing, and tuning details. We will walk through several demos on a small desktop, 3-node Hadoop cluster with data access via ODBC from familiar Windows tools (Excel, SSMS, SSIS, SSRS, SSAS).