Azure Data Factory and Synapse Integration Pipelines are the go-to PaaS resources within the Microsoft Cloud for orchestrating data workloads. With 100+ Linked Service connections and a flexible array of both control flow and data flow Activities, there isn't much these pipelines can't do as a wrapper over our data platform solutions. That said, the service may still require the support of other Azure resources for logging, monitoring, compute and storage. In this session we'll focus on exactly that point and explore the problem faced when structuring many integration pipelines in a highly scaled architecture.
Once coupled with other resources, we'll look at one possible solution to this problem of pipeline organisation: a dynamic, flexible, metadata-driven processing framework that complements our existing solution pipelines. Furthermore, we will explore how to bootstrap multiple orchestrators (across tenants if needed), design for cost with nearly free Consumption Plans, and deliver an operational abstraction over all our processing pipelines.
Finally, we'll explore delivering this framework within an enterprise and consider an architect's perspective on a wider platform of ingestion/transformation workloads with multiple batches and execution stages.

Presented by Paul Andrew at SQLBits 2022