Data
lakes have been around for several years and there is still much hype and
hyperbole surrounding their use. This session covers the basic design patterns
and architectural principles to make sure you are using the data lake and
underlying technologies effectively. We will cover things like best practices
for data ingestion and recommendations on file formats as well as designing
effective zones and folder hierarchies to prevent the dreaded data swamp. We’ll
also discuss how to consume and process data from a data lake. And we will
cover the often overlooked areas of governance and security best practices.
This session goes beyond corny puns and broken metaphors and provides
real-world guidance from dozens of successful implementations in Azure.