Solution ideas
This article describes a solution idea. Your cloud architect can use this guidance to help visualize the major components for a typical implementation of this architecture. Use this article as a starting point to design a well-architected solution that aligns with your workload's specific requirements.
This solution idea describes how to extend your on-premises big data investments to the cloud and transform your business by using the advanced analytics capabilities of Azure HDInsight.
Architecture
Download a Visio file of this architecture.
Dataflow
- Establish ExpressRoute between on-premises infrastructure and Microsoft datacenters, to allow private connection for reliable, speedy, and secure data replication from an on-premises Hadoop setup to an Azure HDInsight cluster.
- Install the WANdisco Fusion server in the same Azure Virtual Network as the HDInsight cluster, which allows the server to access the cluster in a secure manner.
- Install the WANdisco Fusion app on a HDInsight cluster (new or existing). In the License key field, enter the Public IP of the Fusion Server.
- Configure the Fusion App on an HDInsight cluster to set up continuous active replication from on-premises large data/Hadoop deployments to Azure HDInsight, multi-region replication, backup and restore, and more.
Components
- Apache Hadoop or Apache Spark
- Metadata store
- Local edge router
- Azure ExpressRoute circuit
- Microsoft Edge router
- Data replication (WANdisco's LiveData Migrator for Azure and LiveData Plane for Azure)
- Azure HDInsight
- Azure Virtual Network
Scenario details
This solution idea describes how to extend your on-premises big data investments to the cloud.
Potential use cases
The integration of WANdisco Fusion with Azure HDInsight presents an enterprise solution that enables organizations to meet stringent data availability and compliance requirements while seamlessly moving production data at petabyte scale from on-premises big data deployments to Microsoft Azure.
Contributors
This article is maintained by Microsoft. It was originally written by the following contributors.
Principal author:
- Aadi Manchanda | Cloud Solution Architect
To see non-public LinkedIn profiles, sign in to LinkedIn.
Next steps
Learn more about the component technologies:
- What is Azure ExpressRoute?
- Migrate your Hadoop data lakes with WANDisco LiveData Platform for Azure
- What is Azure HDInsight?
- What is Azure Virtual Network?
Related resources
Explore related architectures: