SQLBits 2020

SQL Server Big Data Clusters: The Full Story

A complete overview about SQL Server Big Data Clusters including it's major components and use-cases!

In the realm of data storage and processing, there are two major technologies which we deal with every day. On one side, we have relational data that is stored inside SQL Server, and on the other side, non-relational or very large datasets that do not fit the relational model which are stored on big data clusters like Hadoop or Spark. This introduces challenges when having to combine datasets across both these technologies. SQL Server was never built to process huge datasets in a distributed fashion or to handle non-relational data very well, meaning that in many cases you would have to resort to bringing your relational data into Hadoop or Spark clusters. SQL Server 2019 has the answer with Big Data Clusters: it combines SQL Server with HDFS and Spark! In this session we are going to explore the capabilities of the exciting new feature. How does it work and how can we work with datasets that are non-relational?

What you need to know first: (01:00)
Big Data Cluster Architecture: (11:01)
Demo: Data Virtualization: (21:22)
Demo: Storage Pool: (27:05)
Demo: End to End Real World Scenario: (32:32)
Managing a Big Data Cluster: (41:22)

Speaker

Ben has been working with SQL Server since SQL Server 6.5, mainly in the BI/Datawarehousing field. He is a Data Platform MVP, MCSE, MPP and a BimlHero Certified Expert. He is also a regular speaker at international events and an author.

Ben Weissman (he/him)'s Sessions

Flight of Innovation: Enhancing On-Prem and Multi-Cloud SQL Instances with a Cloud Cockpit

Fueling Your Future: Embracing the Rewards of Data Platform Community Involvement

Be more responsible around AI - Less Bias, More Ethics

Azure Arc in 50 Minutes

Never ETL again, thanks to Synapse Link. Really?

Keynote by The Community

Why is understanding Kubernetes important for your career as a Data Professional?

Azure Arc-enabled SQL MI – More than just another kind of SQL Server

(Almost) all about Azure Arc - in 20 Minutes

SQL Server Big Data Clusters: The Full Story

The Self-Tuning SSIS Package