SQLBits 2023
Building a Lakehouse on the Microsoft Intelligent Data Platform
The Data Lakehouse - it's an emerging architecture being driven by the Spark community, but it has a real place within the Microsoft ecosystem. But where do you start? What makes it different? How does it work within Synapse Analytics specifically?
This session aims to give you that context. We'll look at how Spark-based engines work and how we can use them within Synapse Analytics. We'll dig into Delta, the underlying file format that enables the Lakehouse, and take a tour of how the Synapse compute engines interact with it. Finally, we'll draw out our whole data architecture, understanding how the Lakehouse serves our whole data community.
Speakers
Craig Porteous's other proposed sessions for 2026
Centralise or Federate - How to Scale your Lakehouse - 2026
Do You Really Need a Data Lakehouse? Separating Hype from Business Need - 2026
Craig Porteous's previous sessions
Zero to Lakehouse in Microsoft Fabric
Fabric is Microsoft's unified software-as-a-service data platform, built around a Data Lakehouse architecture. In this session I'll share an array of Data Lakehouse architecture patterns, and demonstrate how you can build a full Data Lakehouse platform without ever touching an Azure resource.
Building a Lakehouse on the Microsoft Intelligent Data Platform
This session aims to give you that context. We'll look at how Spark-based engines work and how we can use them within Synapse Analytics. We'll dig into Delta, the underlying file format that enables the Lakehouse, and take a tour of how the Synapse compute engines interact with it. Finally, we'll draw out our whole Lakehouse architecture
Designing Data Architectures that InfoSec will actually approve
In this session I'll guide you through a secure reference architecture with Data Factory, Databricks, Data Lake, and Azure Synapse, working together as a secure, fully productionised platform. Each has its own idiosyncrasies, but this session will teach you the options available and the pitfalls to avoid.
Why the Lakehouse?
In this session I'll cover what the Data Lakehouse architecture is, where it fits against existing architectures like a data warehouse, and why you should build one. We'll also cover the underlying technology options to arm you with all of the information you need to plan your next data platform.
Keynote by The Community
Ben and Rob have found some wonderful folk to actually do the important parts of the community keynote, on the theme of "How to be a nonpassive member of the data community".
Simon Whiteley
advancinganalytics.co.uk/blog
Simon Whiteley's previous sessions
Behind the Hype - Architecture Trends in Data
Seasoned Data Engineer and YouTube grumbler Simon Whiteley takes us on a journey through the current industry trends and buzzwords, carving through the hype to get at the underlying ideals. Which is going to last and which is a sales gimmick? Which bandwagon might actually take you in the right strategic direction?
Nose-Dive Narratives: Slide Karaoke 2024
Get ready to wrap up a serious day of learning with a dash of humor, spontaneity, and friendly competition! SQLBits presents "Slide Karaoke" where SQLBits speakers reveal their hidden talents while vying for bragging rights. This session promises to be a one-of-a-kind experience that will leave you in stitches and awe, and the speakers scrambling for their non-existent notes!
Behind the Hype - Architecture Trends in Data
In this session, seasoned data engineer and YouTube grumbler Simon Whiteley takes us on a journey through the current industry trends and buzzwords, carving through the hype to get at the underlying ideals.
Building a Lakehouse on the Microsoft Intelligent Data Platform
This session aims to give you that context. We'll look at how Spark-based engines work and how we can use them within Synapse Analytics. We'll dig into Delta, the underlying file format that enables the Lakehouse, and take a tour of how the Synapse compute engines interact with it. Finally, we'll draw out our whole Lakehouse architecture
Bringing Data Lakes to your Purview
A short, fast dive into the specific elements of Azure Purview that work well with Data Lakes, and how you can implement them yourself.
Value-Driven Analytics Development
Ever spent an age releasing a data model, only to find no-one uses it? There's a better way of working, driven by both technology & agile working practices. Let me tell you about Value-Driven Development & DataOps.
Databricks, Delta Lake and You
Databricks, Lakes & Parquet are a match made in heaven, but explode with extra power when using Delta Lake. This session will dive into the details of how Databricks Delta works and how to make the most of it.
The Azure Spark Showdown - Databricks VS Synapse Analytics
Azure now has two slick, platform-as-a-service Spark offerings, but which one should you choose? Separate specialist tools or a one-size-fits-all solution? Join Simon as he compares and contrasts the Spark offerings.
Azure SQL DataWarehouse: 0-100 (DWUs)
Azure SQLDW - WHAT, WHERE, WHEN and HOW to use it.