Implementing Enterprise Data Lake using Amazon (AWS) S3
Introduction & Background In the modern digital world, many of the smaller to medium sized organizations (even some good sized organizations) still stumble upon the problem of having ever growing data and strive hard to become a truly data driven organization while their underlying architecture for this purpose is not strong enough to help them achieve their mission. In this post, I would try to focus on that topic and thus the beneficiaries of this post will be all those Engineers and organizations who are just beginning to think of moving towards a solution that takes them away from their traditional data warehousing models while staying in the big data platform. In other words, who still use a lot of traditional DWH principles while they either already started adopting big data frameworks and technologies or just thinking of going in that direction and more specifically who are on the AWS platform. But it is not too difficult to safely rethink the whole approach in terms of