Find the Spark as a SQL data warehouse developer
Proposed session for SQLBits 2026
TL;DR
You are a data warehouse developer who has been using SQL Server on-premises or in Azure to build your data warehouses. You are comfortable writing SQL scripts and stored procedures and creating views to fuel your data warehouse. You have heard about the new kid in town, Spark, but you have not yet taken the plunge and are wondering whether you have to, or whether you should.
If that sounds like you, then this is the session for you. The session is built largely on my own experience. I had been doing data warehousing in SQL for years and had decided Spark was not for me. I felt there was enough to do in SQL, so there was no reason to add Spark to the tools I was using. It seemed like a lot of learning, and when you are an old dog that can be difficult to do 😊.
Session Details
You are a data warehouse developer who has been using SQL Server on-premises or in Azure to build your data warehouses. You are comfortable writing SQL scripts and stored procedures and creating views to fuel your data warehouse. You have heard about the new kid in town, Spark, but you have not yet taken the plunge and are wondering whether you have to, or whether you should.
If that sounds like you, then this is the session for you. The session is built largely on my own experience. I had been doing data warehousing in SQL for years and had decided Spark was not for me. I felt there was enough to do in SQL, so there was no reason to add Spark to the tools I was using. It seemed like a lot of learning, and when you are an old dog that can be difficult to do 😊.
The reality is that Spark offers some very cool features and it's not that different from how you have been doing things. In fact, the concepts behind building a data warehouse do not change much, even though it's built in a data lake and is called a data lakehouse. Yes, the technology is different, and the language is either very different or only slightly different depending on which one you choose, but transitioning was a lot easier than I thought.
We will start by comparing the SQL data warehouse and the Spark lakehouse: what they have in common and where they differ. Then we will look at a demo of how you could do things in a Spark lakehouse, comparing it throughout with how we are used to doing them in a SQL data warehouse. We will then move on to how to acquire the right knowledge to get started with Spark, as well as how to try it out without breaking the bank.
After attending this session you will be better able to judge whether Spark lakehouses are for you and how you can go about learning the required skills. You will also know how to get started for free or at very low cost.
3 things you'll get out of this session
To help attendees with a SQL background understand the differences between a SQL Server data warehouse and a Spark lakehouse
To show how the transition from SQL Server to Spark does not have to be difficult
To inspire SQL Server data warehouse developers to try out Spark
Speakers
Ásgeir Gunnarsson's other proposed sessions for 2026
Best Practices for Sharing Power BI Content with External Users - 2026
Data Quality Validations in Fabric Spark - 2026
From Chaos to Control: Orchestrating Lakehouse Workloads in Microsoft Fabric - 2026
From Chaos to Control: Orchestrating Lakehouse Workloads in Microsoft Fabric Part 1 - 2026
From Chaos to Control: Orchestrating Lakehouse Workloads in Microsoft Fabric Part 2 - 2026
Workspace strategy for Lakehouse/Warehouse in Microsoft Fabric - 2026
Panel Debate: Real-World Microsoft Fabric Administration - Lessons from the Trenches - 2026
Ásgeir Gunnarsson's previous sessions
Power BI Governance quick start
For many, the mention of governance conjures images of massive effort and inconvenience for themselves and end users. It's seen as a hindrance and a waste of time. But it doesn't have to be that way.
In this session we will talk about how we can start to tackle Power BI governance. We will look at a few things you can get started with very quickly that will get you far in setting up your governance strategy.
We will look at things such as documentation, roles and monitoring. The hope is that you go back to your organization with a clear picture of where to start and a better feeling about governance.
5 Power Automate flows that could give immediate value in your organization
In this session we will demonstrate how to create value fast with Power Automate. We will do this by showing 5 types of flows that can give value with little development effort.
Managing Power BI workspaces
This session will help you find the best strategy for managing your Power BI workspaces. Should you go for as much automation as possible or follow a careful manual process? Should you have a few people controlling everything, or should everyone do as they want? All this and more you will find in this session.
What's Going On in My Power BI Environment?
As Power BI is a self-service tool, it can be hard for administrators to monitor it. Power BI is fast improving in this context but there still isn’t a consistent way of monitoring it.
Power BI Governance overview
Governance of your Power BI environment is very important. Setting up structure around it will allow developers (IT or business) to develop Power BI content the right way, first time as well as aid administrators
Impact of weather on English Premier League in Power BI
By using open data and web scraping in Power BI we will examine whether weather has an impact on games in the English Premier League. We will look at the advantages and disadvantages of using open data and web page data, and at how Power BI works with them.