SQLBits 2024

Python in Fabric for Data Engineers: all you need to know

The data engineering landscape is evolving. T-SQL proficiency isn't enough now. Dive into modern tools like Delta Lake, Jupyter Notebooks, and MS Fabric with Python. This session covers Python syntax, Jupyter in Fabric, data manipulation with Pandas, PySpark for Big Data, and data extraction/loading techniques for various formats.
The world of data engineering is changing. Being fluent in T-SQL is not enough anymore. Whether you want to implement a Delta Lake using Databricks, use Jupyter Notebooks to analyse your data or build an ETL pipeline in Microsoft Fabric, Python is the new kid on the block.

In this demo-filled session I will get you up to speed with Python and teach you all the tips and tricks you need to know. The following topics will be covered:
- Syntax overview
- Leverage Jupyter Notebooks in Fabric
- Data Wrangling with Pandas
- Select, Group, Filter and Join data
- Big Data with PySpark
- Extract data from SQL Server, CSV files and Parquet files
- Load data into a Lakehouse (MS Fabric) as Parquet files and (Delta) tables
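
As a taste of the select, filter, group and join steps listed above, a minimal Pandas sketch might look like the following. The sample data, table names and column names are invented for illustration and are not taken from the session's demos.

```python
import pandas as pd

# Hypothetical sample data; names and values are made up for illustration.
orders = pd.DataFrame({
    "order_id": [1, 2, 3, 4],
    "customer_id": [10, 20, 10, 30],
    "amount": [50.0, 20.0, 30.0, 5.0],
})
customers = pd.DataFrame({
    "customer_id": [10, 20, 30],
    "name": ["Ann", "Bob", "Cid"],
})

# Select: keep only the columns needed.
selected = orders[["customer_id", "amount"]]

# Filter: keep rows with an amount of at least 10.
filtered = selected[selected["amount"] >= 10]

# Group: total amount per customer.
totals = filtered.groupby("customer_id", as_index=False)["amount"].sum()

# Join: add the customer name (inner join on customer_id).
report = totals.merge(customers, on="customer_id", how="inner")

print(report)
```

The same four steps map closely onto SELECT, WHERE, GROUP BY and JOIN in T-SQL, which is one reason Pandas tends to be an approachable starting point for SQL developers.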