Is it possible to encrypt and compress files on self-hosted runtime as part of ADF flow?

Sandeep Wadhwa 6 Reputation points
2020-11-09T19:06:20.71+00:00

I am looking for a way to first encrypt CSV files and then compress them before executing the ADF copy flow. Both the encryption and the compression need to happen on the self-hosted IR.

To visualize this a bit:

Step 1 - Kick-off the ADF flow

Step 2 - Encrypt the CSV files on Self-hosted IR

Step 3 - Compress the encrypted files on Self-hosted IR

Step 4 - Execute the copy flow in ADF to transmit the files from on-premises servers to Azure cloud

How can I achieve this through ADF flow?

Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.

1 answer

  1. MartinJaffer-MSFT 26,081 Reputation points
    2020-11-09T22:18:39.38+00:00

    Hello @Sandeep Wadhwa and welcome to Microsoft Q&A.

    I assume by "flow" you mean the general process, not Mapping Data Flow. (Mapping Data Flow does not run on self-hosted IR).

    In a Copy activity, when one of the Linked Services is set to use a Self-hosted IR, all of the Copy activity's work is done on that self-hosted IR.
    This means step 3 would be executed on the self-hosted IR.
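
As a rough illustration of what "set to use a Self-hosted IR" looks like, the linked service definition carries a `connectVia` reference to the integration runtime. The sketch below builds such a definition in Python and prints it as JSON. The names `OnPremFiles`, `svc_adf`, `MySelfHostedIR`, and the host path are placeholders I made up, and the password is left out on purpose, so treat this as an outline rather than a ready-to-deploy definition.

```python
import json


def file_server_linked_service(name, host, user_id, ir_name):
    """Build a file-system linked service definition bound to a self-hosted IR.

    All argument values are hypothetical placeholders supplied by the caller.
    """
    return {
        "name": name,
        "properties": {
            "type": "FileServer",
            "typeProperties": {
                "host": host,
                "userId": user_id,
                # Password intentionally omitted; in practice it would be a
                # secure string or a Key Vault reference.
            },
            # This reference is what routes the Copy activity's work
            # to the self-hosted integration runtime.
            "connectVia": {
                "referenceName": ir_name,
                "type": "IntegrationRuntimeReference",
            },
        },
    }


linked_service = file_server_linked_service(
    name="OnPremFiles",              # hypothetical linked service name
    host="\\\\fileserver\\export",   # hypothetical UNC path
    user_id="svc_adf",               # hypothetical account
    ir_name="MySelfHostedIR",        # hypothetical integration runtime name
)
print(json.dumps(linked_service, indent=2))
```

If neither linked service in the Copy activity has a `connectVia` pointing at a self-hosted IR, the copy runs on an Azure integration runtime instead.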

    Compression options are set in the Dataset. To go from uncompressed to compressed, set the compression type in the source dataset to "none", and set the compression type of your choice in the sink dataset.
    Since all work is done on the self-hosted IR, compression is done before transmitting.
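
A minimal sketch of the two dataset definitions, assuming Binary datasets (a reasonable fit if the files are already encrypted, though a DelimitedText dataset has its own compression settings). The source has no `compression` property, which corresponds to "none", and the sink asks for gzip. The dataset names, linked service names, container, and folder paths are placeholders of mine, and the exact codec spelling is worth checking against the current format documentation.

```python
import json


def binary_dataset(name, linked_service, location, compression=None):
    """Build a Binary dataset definition; compression is optional."""
    type_properties = {"location": location}
    if compression is not None:
        # Leaving this property out means the files are treated as uncompressed.
        type_properties["compression"] = compression
    return {
        "name": name,
        "properties": {
            "type": "Binary",
            "linkedServiceName": {
                "referenceName": linked_service,
                "type": "LinkedServiceReference",
            },
            "typeProperties": type_properties,
        },
    }


# Source: on-premises files read as-is (no compression property).
source = binary_dataset(
    name="OnPremFilesRaw",            # hypothetical dataset name
    linked_service="OnPremFiles",     # hypothetical linked service name
    location={"type": "FileServerLocation", "folderPath": "export"},
)

# Sink: blob storage, compressed by the Copy activity while writing.
sink = binary_dataset(
    name="BlobFilesGzip",             # hypothetical dataset name
    linked_service="LandingBlob",     # hypothetical linked service name
    location={
        "type": "AzureBlobStorageLocation",
        "container": "landing",
        "folderPath": "incoming",
    },
    compression={"type": "gzip", "level": "Optimal"},
)

print(json.dumps({"source": source, "sink": sink}, indent=2))
```

With this pairing the Copy activity reads the files uncompressed and writes them compressed. Because the source linked service is bound to the self-hosted IR, the compression happens there before the data leaves the on-premises network.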

    Step 2 is the tricky part. ADF has some built-in security for data in transit, but it sounds like you are asking for encryption at rest. Give me some time to think about how to do it.

