question

SeuliDasMajumder-3800 avatar image
SeuliDasMajumder-3800 asked ·

is azure data lake analytics supports onlu storage Gen1

is azure data lake analytics account only support data lake storage Gen1?
Can blob storage be use for data analytics account?

data analytics account locations are very limited. if storage account is in different location than analytics account, will that able to access data from storage?

what are the differences between these different storage tye like Gen1, Gen2 and blob storage?

azure-blob-storageazure-data-lake-storageazure-data-lake-analytics
10 |1000 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

1 Answer

PRADEEPCHEEKATLA-MSFT avatar image
PRADEEPCHEEKATLA-MSFT answered ·

@SeuliDasMajumder-3800 Welcome to the Microsoft Q&A platform.


Is azure data lake analytics account only support data lake storage Gen1?


Yes, Azure Data Lake Analytics supports only ADLS Gen1.

Note: We do not have plans to support ADLA-ADLS Gen2.

ADLA does not work with ADLS Gen2. We recommend that customers use Azure Databricks or Azure HDInsight instead of ADLA when working with ADLS Gen2.


Data analytics account locations are very limited. if storage account is in different location than analytics account, will that able to access data from storage?


Azure Data Lake Analytics available in regions of East US 2, Central US, North Europe, and West Europe.

Note: Azure Data Lake Analytics and ADLS Gen1 accounts should be same region.

Example: If you deployed ADLA in Central US and the ADLS Gen1 should be in Central US.


what are the differences between these different storage tye like Gen1, Gen2 and blob storage?


Azure Blob Storage is a general purpose, scalable object store that is designed for a wide variety of storage scenarios.

Azure Data Lake Storage Gen1 is a hyper-scale repository that is optimized for big data analytics workloads.

The table in this article summarizes the differences between Azure Data Lake Storage Gen1 and Azure Blob Storage along some key aspects of big data processing.

Azure Data Lake Storage Gen2 is a set of capabilities dedicated to big data analytics, built on Azure Blob storage. Data Lake Storage Gen2 is the result of converging the capabilities of our two existing storage services, Azure Blob storage and Azure Data Lake Storage Gen1. Features from Azure Data Lake Storage Gen1, such as file system semantics, directory, and file level security and scale are combined with low-cost, tiered storage, high availability/disaster recovery capabilities from Azure Blob storage.

Reference: Introduction to Azure Data Lake Storage Gen2

Hope this helps. Do let us know if you any further queries.


Do click on "Accept Answer" and Upvote on the post that helps you, this can be beneficial to other community members.











2 comments Share
10 |1000 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

@PRADEEPCHEEKATLA-MSFT : As you mention to use Azure Databricks or Azure HDInsight if I have data in gen2 or blob, I need to know which service should I choose. In my scenario I have structured data coming from on prem SQL server. Also CSV files and some unstructured data are there which I need to put in data lake. For sql server data there will be an initial full load and then also ongoing changes have to put in lake. Next needs to transform those to create curated zone which will only have updated consistent records. Business will start doing some analysis on data in curated zone. For these scenario which service would be fit? would it be databricks or HDInsight? I have little bit idea about data bricks, but no idea in HDInsight, should I need to explore HDInsight for my data scenario?


0 Votes 0 · ·

@SeuliDasMajumder-3800, Azure HDInsight is a cloud distribution of the Hadoop components from the Hortonworks Data Platform (HDP). Azure HDInsight makes it easy, fast, and cost-effective to process massive amounts of data. You can use the most popular open-source frameworks such as Hadoop, Spark, Hive, LLAP, Kafka, Storm, R, and more.


Refer to Azure HDInsight .


Azure Databricks is a premium Spark offering that is ideal for customers who want their data scientists to collaborate easily and run their Spark based workloads efficiently and at industry leading performance.


Refer to Azure Databricks.


Refer: MSDN thread.


1 Vote 1 · ·