Cost of ADLS Gen2 when enabling Azure Synapse Link for Azure Cosmos DB

Jaydeep Suryawanshi 26 Reputation points
2024-08-26T06:43:39.3933333+00:00

Hi Team,

I am exploring Azure Synapse Link for Azure Cosmos DB. When I enable Azure Synapse Link, it requires an Azure Synapse workspace to be created, along with an Azure Data Lake Storage Gen2 account.

My questions are below.

  1. Why is Azure Data Lake Storage Gen2 needed?
    • The Microsoft docs say: 'This account will be the primary storage account for the workspace, holding catalog data and metadata associated with the workspace.'
    • If it only holds catalog data and metadata, why does it have to be Gen2?
    • Does this storage account hold the data from the Cosmos DB analytical store?
  2. If I use Azure Synapse Link, what would the average cost of the ADLS Gen2 storage be?

Thank you in advance.


1 answer

  1. Nehruji R 7,306 Reputation points Microsoft Vendor
    2024-08-27T09:31:15.7766667+00:00

    Hello Jaydeep Suryawanshi,

    Greetings! Welcome to Microsoft Q&A Platform.

    1. Why is Azure Data Lake Storage Gen2 needed?

    An Azure Synapse workspace requires a primary ADLS Gen2 account, which is why one is created along with the workspace. ADLS Gen2 provides scalable, secure storage that integrates well with Azure Synapse Analytics. Although the account primarily holds catalog data and metadata, the workspace also uses it for advanced analytics and big data workloads, which need the performance and scalability that ADLS Gen2 offers.

    2. Does this storage hold the data from the Cosmos DB analytical store?

    No, the data from the Cosmos DB analytical store is not stored in ADLS Gen2. The analytical store in Azure Cosmos DB is a fully isolated column store that is used for running analytics directly on the operational data without affecting the performance of transactional workloads. ADLS Gen2 is used for storing metadata and other workspace-related data, not the actual analytical data from Cosmos DB.

    3. If I am using Azure Synapse Link, what would the average cost of the ADLS Gen2 storage be?

    There is no single average figure, because the cost of ADLS Gen2 storage depends on the amount of data stored, how often it is accessed, and the operations performed on it. In general, you are charged for storage capacity, read/write operations, and data transfer. Azure Synapse Link itself is a cost-optimized, fully managed solution for operational analytics: it eliminates the extra storage and compute layers that traditional ETL pipelines need for analyzing operational data.

    Azure Cosmos DB analytical store follows a consumption-based pricing model, which is based on data storage and analytical read/write operations and queries executed. It doesn’t require you to allocate any throughput, as you do today for the transactional workloads. Accessing your data with highly elastic compute engines from Azure Synapse Analytics makes the overall cost of running storage and compute efficient. You can use the Azure Pricing Calculator to get a more accurate estimate based on your specific usage patterns.
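    As a rough illustration of how the ADLS Gen2 cost components above combine, here is a minimal Python sketch. The `AdlsRates` class, the `estimate_monthly_cost` function, and every rate in it are hypothetical placeholders, not actual Azure prices; take real rates for your region and access tier from the Azure Pricing Calculator. Data transfer charges are left out for brevity.

```python
from dataclasses import dataclass


@dataclass
class AdlsRates:
    """Hypothetical unit prices (placeholders, NOT real Azure rates)."""
    storage_per_gb_month: float  # price per GB stored per month
    write_per_10k_ops: float     # price per 10,000 write operations
    read_per_10k_ops: float      # price per 10,000 read operations


def estimate_monthly_cost(stored_gb: float, write_ops: int,
                          read_ops: int, rates: AdlsRates) -> float:
    """Sum the storage-capacity and read/write operation charges for one month."""
    storage = stored_gb * rates.storage_per_gb_month
    writes = write_ops / 10_000 * rates.write_per_10k_ops
    reads = read_ops / 10_000 * rates.read_per_10k_ops
    return round(storage + writes + reads, 2)


# Placeholder rates chosen only to make the arithmetic easy to follow.
placeholder_rates = AdlsRates(
    storage_per_gb_month=0.02,
    write_per_10k_ops=0.07,
    read_per_10k_ops=0.006,
)

# 50 GB stored, 200,000 writes and 1,000,000 reads in the month.
print(estimate_monthly_cost(50, 200_000, 1_000_000, placeholder_rates))
```

    Because the Cosmos DB analytical data is not stored in this account, `stored_gb` here covers only what the workspace itself writes (metadata and other workspace-related data), plus anything you choose to store there.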

    Reference: https://video2.skills-academy.com/en-us/azure/cosmos-db/synapse-link

    Hope this answer helps! Please let us know if you have any further queries. I’m happy to assist you further.


    Please do not forget to "Accept the answer" and "up-vote" wherever the information provided helps you. This can be beneficial to other community members.

    1 person found this answer helpful.
