data lake and schemas

Thirston Third 21 Reputation points
2022-02-03T00:41:32.57+00:00

Since data lakes store data in its raw form from the original data source how do you manage providing data to users to create reports when they're accustomed to snowflake and star schemas?

Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,466 questions
Azure Data Lake Analytics
{count} votes

1 answer

Sort by: Most helpful
  1. ShaikMaheer-MSFT 38,441 Reputation points Microsoft Employee
    2022-02-04T13:46:53.657+00:00

    Hi @Thirston Third ,

    Thank you for posting query in Microsoft Q&A Platform.

    Azure data lake storages are big data storages. Data engineers usually use them to dump raw data there and then from there ETL pipelines will be created to take that data and transform it to generate meaning full data out of it and finally load in Data ware house systems such as Azure Synapse Analytics.

    Once data is available in Datawarehouse systems we can either populate that data on to reports directly or we can create tabular model using Azure Analysis services on top of it. And then from tabular model we can take data to reports using power BI.

    Azure data factory or Azure Synapse pipelines can be used to create ETL pipelines.

    Azure Synapse Analytics will also gives capability to query data on data lake storages directly using SQL runtime or Spark runtime. Power BI can also directly connect with Azure Synapse Analytics to generate reports.

    Click here to know more about Azure data factory.
    Click here to know more about Azure Synapse Analytics.
    Click here to know more about Azure Analysis Service.
    Click here to know more about data lake storages.

    Hope this will help. Please let us know if any further queries.

    ----------

    Please consider hitting Accept Answer. Accepted answers helps community as well.


Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.