SQLBits 2018

Advanced Techniques for Cleaning Data using SQL Server 2017

If your organisation doesn't have dirty data, it's because you are not looking hard enough. Join this session to see what you can do to clean up your data properly, using SQL Server 2017.
How do you tackle dirty data in your business intelligence, data warehousing, or data science projects?

In this session, we will examine ways of cleaning up dirty customer data using technologies available with SQL Server 2017, including:

  • R
  • Python
  • AzureML and Machine Learning
  • SSIS

We will also examine techniques for cleaning data with artificial intelligence and advanced computing, such as knowledge-based systems, and with algorithms such as Levenshtein distance in its various implementations.
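
To give a flavour of the Levenshtein approach, here is a minimal Python sketch of the classic dynamic-programming edit distance, plus a small helper that maps a misspelled customer name to its nearest known spelling. It is an illustration only, not the session's own code: the function names `levenshtein` and `closest_match`, the `max_distance` threshold of 2, and the sample names are all assumptions made for this example.

```python
def levenshtein(a: str, b: str) -> int:
    """Number of single-character insertions, deletions, or
    substitutions needed to turn string a into string b."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))  # distances from "" to prefixes of b
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute (or match)
            ))
        previous = current
    return previous[-1]


def closest_match(name, candidates, max_distance=2):
    """Hypothetical helper: return the candidate closest to name,
    ignoring case and surrounding spaces, or None if nothing is
    within max_distance edits."""
    key = name.strip().lower()
    best, best_distance = None, max_distance + 1
    for candidate in candidates:
        distance = levenshtein(key, candidate.strip().lower())
        if distance < best_distance:
            best, best_distance = candidate, distance
    return best


if __name__ == "__main__":
    known_customers = ["John Smith", "Jane Smythe"]  # made-up sample data
    print(levenshtein("kitten", "sitting"))
    print(closest_match("Jonh Smith", known_customers))
```

The threshold is a judgment call: set it too low and genuine typos go unmatched, too high and distinct customers get merged. In practice it would likely need tuning per column, and short strings usually warrant a stricter limit than long ones.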

Join this session to explore your options for cleaning up your data properly.