您现在访问的是微软AZURE全球版技术文档网站,若需要访问由世纪互联运营的MICROSOFT AZURE中国区技术文档网站,请访问 https://docs.azure.cn.

如何连接 Azure 数据共享和 Azure 监控范围How to connect Azure Data Share and Azure Purview

本文介绍如何将 Azure 数据共享 帐户连接到 azure 监控范围,并控制共享数据集 (数据空间中的传出和传入) 。This article discusses how to connect your Azure Data Share account with Azure Purview and govern the shared datasets (both outgoing and incoming) in your data estate. 各种数据管理人员可以跨边界(如组织、部门甚至数据中心)发现和跟踪数据沿袭。Various data governance personas can discover and track lineage of data across boundaries like organizations, departments and even data centers.

常见方案Common Scenarios

数据共享沿袭旨在提供有关根本原因分析和影响分析的详细信息。Data Share Lineage is aimed to provide detailed information for root cause analysis and impact analysis.

方案 1: 360 对合作伙伴组织或内部部门共享的数据集的视图Scenario 1: 360 view of datasets shared in/out for a partner organization or internal department

数据安全官可以查看与合作伙伴组织双向共享的所有数据集的列表。Data officers can see a list of all datasets that are bi-directionally shared with their partner organizations. 他们可以按组织名称搜索和发现数据集,并查看所有传出和传入共享的完整视图。They can search and discover the datasets by organization name and see a complete view of all outgoing and incoming shares.

方案2:根本原因分析-对传入共享的数据集的上游依赖关系 (使用者视图) Scenario 2: Root cause analysis - upstream dependency on datasets coming into organization (consumer view of incoming shares)

报表包含错误的信息,因为外部数据共享帐户存在上游数据问题。A report has incorrect information because of upstream data issues from an external Data Share account. 数据工程师可以理解上游故障、了解原因,并进一步联系共享所有者,以解决导致其数据差异的问题。The data engineers can understand upstream failures, be informed about the reasons, and further contact the owner of the share to fix the issues causing their data discrepancy.

方案3:对传出共享 (提供者视图的外部数据集的影响分析) Scenario 3: Impact analysis on datasets going outside organization (provider view of outgoing shares)

数据生成者需要知道在对数据集进行更改时将受到什么影响。Data producers want to know who will be impacted upon making a change to their dataset. 使用沿袭,数据生成者可以轻松理解使用 Azure 数据共享使用数据的下游内部或外部合作伙伴的影响。Using lineage, a data producer can easily understand the impact of the downstream internal or external partners who are consuming data using Azure Data Share.

Azure 数据共享和监控范围连接体验Azure Data Share and Purview connected experience

若要连接 Azure 数据共享和 Azure 监控范围帐户,请执行以下操作:To connect your Azure Data Share and Azure Purview account, do the following:

  1. 创建监控范围帐户。Create a Purview account. 监控范围帐户将收集所有数据共享沿袭信息。All the Data Share lineage information will be collected by a Purview account. 您可以使用现有帐户,也可以创建新的监控范围帐户。You can use an existing one or create a new Purview account.

  2. 将 Azure 数据共享连接到监控范围帐户。Connect your Azure Data Share to your Purview account.

    1. 在监控范围门户中,可以在 "管理中心" 下中转到 " 管理中心 ",并在 " 外部连接 " 部分连接 Azure 数据共享。In the Purview portal, you can go to Management Center and connect your Azure Data Share under the External connections section.

    2. 在顶部栏中选择 " + 新建 ",在弹出侧栏中查找 Azure 数据共享并添加数据共享帐户。Select + New on the top bar, find your Azure Data Share in the pop-up side bar and add the Data Share account. 将数据共享连接到监控范围帐户后运行快照作业,以使数据共享资产和沿袭信息在监控范围中可见。Run a snapshot job after connecting your Data Share to Purview account, so that the Data Share assets and lineage information is visible in Purview.

      用于链接 Azure 数据共享的管理中心

  3. 在 Azure 数据共享中执行快照。Execute your snapshot in Azure Data Share.

    • 使用 Azure 监控范围建立 Azure 数据共享连接后,可以对现有共享执行快照。Once the Azure Data share connection is established with Azure Purview, you can execute a snapshot for your existing shares.
    • 如果没有任何现有共享,请使用 Azure 数据共享门户来共享数据并订阅数据共享If you don’t have any existing shares, go to the Azure Data Share portal to share your data and subscribe to a data share.
    • 共享快照完成后,可以在监控范围中查看关联的数据共享资产和沿袭。Once the share snapshot is complete, you can view associated Data Share assets and lineage in Purview.
  4. 发现数据共享帐户并在你的监控范围帐户中共享信息。Discover Data Share accounts and share information in your Purview account.

    • 在监控范围帐户的主页中,选择 " 按资产类型浏览 ",并选择 " Azure 数据共享 " 磁贴。In the home page of Purview account, select Browse by asset type and select the Azure Data Share tile. 可以搜索帐户名称、共享名称、共享快照或合作伙伴组织。You can search for an account name, share name, share snapshot, or partner organization. 否则,在 "搜索结果" 页上为帐户名称、共享类型 (发送与接收的共享) 应用筛选器。Otherwise apply filters on the Search result page for account name, share type (sent vs received shares).

      "搜索结果" 页中的 Azure 数据共享

    重要

    要使数据共享资产显示在监控范围中,必须在将数据共享连接到监控范围之后运行快照作业。For Data Share assets to show in Purview, a snapshot job must be run after you connect your Data Share to Purview.

  5. 跟踪与 Azure 数据共享共享的数据集的沿袭。Track lineage of datasets shared with Azure Data Share.

    • 从 "监控范围搜索结果" 页上,选择 (接收/发送的数据共享快照) 并选择 " 沿袭 " 选项卡,以查看具有上游和下游依赖项的沿袭图形。From the Purview search result page, choose the Data share snapshot (received/sent) and select the Lineage tab, to see a lineage graph with upstream and downstream dependencies.

    使用 Azure 数据共享共享的数据集的沿袭

后续步骤Next steps