22-25 April 2026
SQLBits 2022

Creating a Metadata Driven Orchestration Framework Using Azure Data Integration Pipelines

We'll explore delivering this framework within an enterprise and consider an architect’s perspective on a wider platform of ingestion/transformation workloads with multiple batches and execution stages.
Azure Data Factory and Synapse Integration Pipeline are the undisputed PaaS resources within the Microsoft Cloud for orchestrating data workloads. With a 100+ Linked Service connections, a flexible array of both control flow and data flow Activities there isn't much these pipelines can’t do as a wrapper over our data platform solutions. That said, the service may still require the support of other Azure resources for the purposes of logging, monitoring, compute and storage. In this session we’ll will focus on exactly that point and explore the problem faced when structuring many integration pipelines in a highly scaled architecture.
Once coupled with other resources, we’ll look at one possible solution to this problem of pipeline organisation to create a dynamic, flexible, metadata driven processing framework that complements our existing solution pipelines. Furthermore, we will explore how to bootstrap multiple orchestrators (across tenants if needed), design for cost with nearly free Consumption Plans and deliver an operational abstraction over all our processing pipelines.
Finally, we'll explore delivering this framework within an enterprise and consider an architect’s perspective on a wider platform of ingestion/transformation workloads with multiple batches and execution stages.

Feedback Link https://sqlb.it/?7026

Speakers

Paul Andrew

mrpaulandrew.com

Paul Andrew's previous sessions

An Evolution of Data Architectures - Lambda, Kappa, Delta, Mesh & Fabric
How has advancements in highly scalable cloud technology influenced the design principals we apply when building data platform solutions?
 
Building an Azure Data Analytics Platform End-to-End
Based on real world experience let’s think about just how far the breadth of our knowledge now needs to reach when starting from nothing and building a complete Microsoft Azure Data Analytics solution.
 
Creating a Metadata Driven Orchestration Framework Using Azure Data Integration Pipelines
We'll explore delivering this framework within an enterprise and consider an architect’s perspective on a wider platform of ingestion/transformation workloads with multiple batches and execution stages.
 
ETL in Azure Made Easy with Data Factory Data Flows
What happens when you combine a cloud orchestration service with a Spark cluster?! The answer is a feature rich, graphical, scalable data flow environment to rival any ETL tech we’ve previously had available in Azure.
 
Using Azure DevOps for Azure Data Factory
DevOps as a concept does not always translate to the technology when implemented. In this session we'll explore that problem when working with Azure Data Factory and what the different cloud only CI/CD options are.
 
Complex Azure Orchestration w Dynamic Data Factory Pipelines
If you have already mastered the basics of Azure Data Factory (ADF) and are now looking to advance your knowledge of the tool this is the session for you.
 
Building an End to End IoT Solution Using Pi Sensors & Azure
Demonstrating an end to end IoT solution providing real-time sensor data from a Raspberry Pi into an Azure IoT Hub, through Stream Analytics, then with outputs to Power BI and SQL DB. Learn how to build this simplified IoT solution from scratch.