Using SynapseML v0.9.5 in Spark pool

Ali Davoudian 61 Reputation points
2022-02-20T16:44:58.593+00:00

We aim to use the geocoding features of SynapseML library v0.9.5 over Synapse Spark pools. However, this library needs Spark 3.2+ while Spark pools just support Spark 3.1 & 2.7. So we can just install SynapseML v0.9.4 but it does not have the geospatial features and we face an error when using "from synapse.ml.geospatial import *". So, does it mean that for now we can't do geocoding over Spark pools?

Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,422 questions
.NET Machine learning
.NET Machine learning
.NET: Microsoft Technologies based on the .NET software framework.Machine learning: A type of artificial intelligence focused on enabling computers to use observed data to evolve new behaviors that have not been explicitly programmed.
150 questions
Microsoft Entra ID
Microsoft Entra ID
A Microsoft Entra identity service that provides identity management and access control capabilities. Replaces Azure Active Directory.
19,665 questions
0 comments No comments
{count} votes

Accepted answer
  1. ShaikMaheer-MSFT 37,971 Reputation points Microsoft Employee
    2022-02-24T05:26:43.143+00:00

    Hi @Ali Davoudian ,

    Thank you for posting query in Microsoft Q&A Platform.

    As I understand query here, your goal is to use geocoding features of SynapeML library v0.9.5 over Synapse Spark pools. Please correct me if my understanding is wrong.

    Below version is the latest available version for SynapseML on Spark3.1 pool. Internal team will be updating our official doc as well soon. Please use this configuration magic instead to make this work.

    %%configure -f  
    {  
      "name": "synapseml",  
      "conf": {  
          "spark.jars.packages": "com.microsoft.azure:synapseml_2.12:0.9.5-13-d1b51517-SNAPSHOT",  
          "spark.jars.repositories": https://mmlspark.azureedge.net/maven,  
          "spark.jars.excludes": "org.scala-lang:scala-reflect,org.apache.spark:spark-tags_2.12,org.scalactic:scalactic_2.12,org.scalatest:scalatest_2.12",  
          "spark.yarn.user.classpath.first": "true"  
      }  
    }  
    

    Hope this helps. Please let us know how it goes.

    ----------------

    Please consider hitting Accept Answer button. Accepted answers helps community as well.


0 additional answers

Sort by: Most helpful