Analyze your learning loop with an offline evaluation

Learn how to complete an offline evaluation and understand the results.

Offline Evaluations allow you to measure how effective Personalizer is compared to your application's default behavior, learn what features are contributing most to personalization, and discover new machine learning values automatically.

Read about Offline Evaluations to learn more.


  • A configured Personalizer loop
  • The Personalizer loop must have a representative amount of data - as a ballpark we recommend at least 50,000 events in its logs for meaningful evaluation results. Optionally, you may also have previously exported learning policy files you can compare and test in the same evaluation.

Steps to Start a new offline evaluation

  1. In the Azure portal, locate your Personalization resource.

  2. In the Azure portal, go to the Evaluations section and select Create Evaluation. In the Azure portal, go to the Evaluations section and select Create Evaluation.

  3. Configure the following values:

    • An evaluation name
    • Start and end date - these are dates in the past, that specify the range of data to use in the evaluation. This data must be present in the logs, as specified in the Data Retention value.
    • Optimization Discovery set to yes

    Choose offline evaluation settings

  4. Start the Evaluation by selecting Ok.


Evaluations can take a long time to run, depending on the amount of data to process, number of learning policies to compare, and whether an optimization was requested.

Once completed, you can select the evaluation from the list of evaluations.

Comparisons of Learning Policies include:

  • Online Policy: The current Learning Policy used in Personalizer
  • Baseline: The application's default (as determined by the first Action sent in Rank calls),
  • Random Policy: An imaginary Rank behavior that always returns random choice of Actions from the supplied ones.
  • Custom Policies: Additional Learning Policies uploaded when starting the evaluation.
  • Optimized Policy: If the evaluation was started with the option to discover an optimized policy, it will also be compared, and you will be able to download it or make it the online learning policy, replacing the current one.

Results chart of offline evaluation settings

Effectiveness of Features for Actions and Context.

Next steps