Get Microsoft Academic Graph on Azure storage

Details step-by-step instructions for setting up one-time or automatic provisioning of Microsoft Academic Graph to an Azure blob storage account.

Open Data License: ODC-BY

When using Microsoft Academic data (MAG, MAKES, etc.) in a product or service, or including data in a redistribution, please acknowledge Microsoft Academic using the URI https://aka.ms/msracad. For publications and reports, please cite the following article:

Note

Arnab Sinha, Zhihong Shen, Yang Song, Hao Ma, Darrin Eide, Bo-June (Paul) Hsu, and Kuansan Wang. 2015. An Overview of Microsoft Academic Service (MA) and Applications. In Proceedings of the 24th International Conference on World Wide Web (WWW '15 Companion). ACM, New York, NY, USA, 243-246. DOI=http://dx.doi.org/10.1145/2740908.2742839

Setup storage account to receive MAG graph releases

Create an Azure Subscription

Please create a new Azure subscription for the distribution previews. If your organization already using Azure, this could be a separate subscription under the same tenant id. If you start from scratch, for example “create Azure free account”, the subscription will be created under a new tenant id.

Create an Azure Storage Account

  1. Home > Create a resource > Storage > Storage account

    Create new storage account resource

  2. Provide following values to create an Azure Storage account. Then click "Review + create" button.

    Property Description
    Subscription From the drop-down, select your Azure subscription.
    Resource group Specify whether you want to create a new resource group or use an existing one. A resource group is a container that holds related resources for an Azure solution. For more information, see Azure Resource Group overview.
    Storage account name Provide a name for your Databricks workspace, e.g. magas<org_name>. You don’t need to use your organization name after the "magas", however the account name must be unique among all Azure Storage Accounts.
    Location Select whatever location (region) that is most appropriate for your existing Azure resources
    Performance Standard
    Account kind StorageV2 (general purpose v2)
    Replication LRS
    Access tier Cool

    Enter details for new storage account resource

  3. Verify that the information you entered is correct and click the "create" button

    Submit new storage account resource for creation

Note Azure storage account name and primary key

  1. Once notified that the storage account has been created, click “go to resource”

    Navigate to the new storage account resource

  2. Go to “access keys” and take note of the “storage account name” and the “primary key”

    Save new storage account resource name and access keys for later

  3. Make sure that you have these items of information:

    ✔️ The name of your Azure Storage (AS) account.

    ✔️ The access key of Azure Storage (AS) account.

Sign up for MAG provisioning

To sign up for MAG on AS distribution preview, send the following information using your school or company email account to academicapi@microsoft.com with the subject "Access request to Microsoft Academic Graph (MAG) on Azure Storage (AS) distribution preview":

  • Are you affiliated with a company or university?
    • If company, please provide the company’s name, your department/group and your role
    • If university, please provide the university’s name, department, group/advisor, and your role (undergraduate student, grad student, professor, etc.)
  • Brief description of the project you will be using MAG for
  • Name of your Microsoft sales representative, if you have one
  • Azure Storage (AS) account name
  • Azure Storage (AS) account primary access key
  • Which type of provisioning model you want:
    1. One-time provisioning of the most recent MAG release
    2. Automatic provisioning of each new MAG release (~every 1-2 weeks)

Note

MAG is currently in a free preview period, so there are no charges associated with the provisioning or use of the data/service itself. However Azure requires you to cover all costs associated with standard resource creation, usage, etc. For cost estimates associated with MAG please see the Pricing page.

Most research institutions have an "Enterprise Account" with Microsoft including Azure subscription. The pricing for Enterprise Accounts differ from the individual account shown in Azure's price calculator.

If you have an Enterprise Account, please check with your individual institution's Information Technology/Computer Center resource on the process of setting up Azure to get MAG. You might need to obtain a "Master Agreement #" and involve MLSP (Microsoft Licensed Solution Provider) for help.

Next steps

Advance to next sections to learn about entity data schema and using Azure Data Lake Analytics or Databricks for Microsoft Academic Graph.