Data Lake

 

Data Lake is a repository for large quantities and varieties of structured, semi-structured, and unstructured data in their raw format. Azure Data Lake is a big container of data, irrespective of the volume of data Data Lake can save it. There is no limit. Petabytes or Exabytes or even beyond. 
Data Lake, lets you load the data immediately, and later apply the transformations as per the need. To get a more structured approach, databases within the Data Lake can be created, and the data that held within the databases in the Data Lake is called the structured data. And once we have all the data into the Data Lake, then other technologies later like Hadoop or Databricks or Data factory, these technologies process that data and analyze that data and present it to the business. But unlike data warehouse, we don't have to think about what business wants in the initial stages of data ingestion.






Comments