SQL Server Connector for Hadoop

Hadoop is an open source framework from Apache which enables you to process large datasets across multiple nodes. Hadoop Distributed File System (HDFS) is the primary storage system used by Hadoop applications.

The SQL Server-Hadoop Connector is a Sqoop-based connector that facilitates efficient data transfer between SQL Server 2008 R2 and Hadoop. Sqoop supports several databases including MySQL and HDFS. This connector is bidirectional. You can import as well as export the data.

The SQL Server Hadoop connector is available in two flavours:

    1. For SQL Server Parallel Data Warehousing.
    2. For SQL Server 2008 R2 and Denali.

SQL Server Hadoop connector RTM for SQL Server 2008 R2 and Denali can be downloaded from the link below:

http://www.microsoft.com/download/en/details.aspx?id=27584

For more details on Sqoop refer to the user guide link below:

http://archive.cloudera.com/cdh/3/sqoop-1.2.0-CDH3B4/SqoopUserGuide.html