Posts

Showing posts from July, 2022

Difference Between Data Warehousing and Data Lake

Image
   Difference Between Data warehousing and Data Lake   The difference between the two most popular options for storing big data but according to the nature of data both have some important differences also so first understand it individually after that we will also know the differences.   Data Warehouse: - A  data warehouse is a repository for structured, filtered data that                                     has already been processed for a specific purpose .   Data Lake :- The data  lake is a vast pool of raw data, the purpose for which is not yet defined.  Difference:-   Data lakes store data from a wide variety of sources like IoT devices, real-time social media streams, users, and web application transactions. Sometimes this data is structured, but often, it’s quite messy because data is being ingested straight from the data source. Data warehouses, on other hand, contain historical data that has been cleaned to fit a relational schema.   Data lakes are used for the