Transaction churn prediction

Transactional churn prediction helps predict if a customer will no longer purchase your products or services in a given time window. You can create new churn predictions on Intelligence > Predictions. Select My predictions to see other predictions that you've created.

For environments based on business accounts, we can predict transactional churn for an account and also a combination of account and another level of information like product category. Adding a dimension can help find out how likely it is that the account "Contoso" will stop buying the product category "office stationery." In addition, for business accounts, we can also use AI to generate a list of potential reasons why an account is likely to churn for a category of secondary level information.

Tip

Try the tutorial for a transaction churn prediction using sample data: Transaction churn prediction sample guide.

Prerequisites

  • At least Contributor permissions in Customer Insights.
  • Business knowledge to understand what churn means for your business. We support time-based churn definitions, meaning a customer is considered to have churned after a period of no purchases.
  • Data about your transactions/purchases and their history:
    • Transaction identifiers to distinguish purchases/transactions.
    • Customer identifiers to match transactions to your customers.
    • Transaction event dates, which define the dates the transaction occurred on.
    • The semantic data schema for purchases/transactions requires the following information:
      • Transaction ID: A unique identifier of a purchase or transaction.
      • Transaction Date: The date of the purchase or transaction.
      • Value of the transaction: The currency/numerical value amount of the transaction/item.
      • (Optional) Unique product ID: The ID of the product or service purchased if your data is at a line item level.
      • (Optional) Whether this transaction was a return: A true/false field that identifies if the transaction was a return or not. If the Value of the transaction is negative, we will also use this information to infer a return.
  • (Optional) Data about customer activities:
    • Activity identifiers to distinguish activities of the same type.
    • Customer identifiers to map activities to your customers.
    • Activity information containing the name and date of the activity.
    • The semantic data schema for customer activities includes:
      • Primary key: A unique identifier for an activity. For example, a website visit or a usage record showing the customer tried a sample of your product.
      • Timestamp: The date and time of the event identified by the primary key.
      • Event: The name of the event you want to use. For example, a field called "UserAction" in a grocery store might be a coupon use by the customer.
      • Details: Detailed information about the event. For example, a field called "CouponValue" in a grocery store might be the currency value of the coupon.
  • Suggested data characteristics:
    • Sufficient historical data: Transaction data for at least double the selected time window. Preferably, two to three years of transaction history.
    • Multiple purchases per customer: Ideally at least two transactions per customer.
    • Number of customers: At least 10 customer profiles, preferably more than 1,000 unique customers. The model will fail with fewer than 10 customers and insufficient historical data.
    • Data completeness: Less than 20% of missing values in the data field of the entity provided.

Note

For a business with high customer purchase frequency (every few weeks) it's recommended to select a shorter prediction window and churn definition. For low purchase frequency (every few months or once a year), choose a longer prediction window and churn definition.

Create a transaction churn prediction

  1. In Customer Insights, go to Intelligence > Predictions.

  2. Select the Customer churn model tile and select Use this model.

  3. In the Customer churn model pane, choose Transaction and select Get started.

Screenshot with selected transaction option in Customer churn model pane.

Name model

  1. Provide a name for the model to distinguish it from other models.

  2. Provide a name for the output entity using letters and numbers only, without any spaces. That's the name that the model entity will use.

  3. Select Next.

Define customer churn

  1. Set the Prediction window. For example, predict the risk of churn for your customers over the next 90 days to align to your marketing retention efforts. Predicting churn risk for a longer or shorter period of time can make it more difficult to address the factors in your churn risk profile, but it depends on your specific business requirements.

    Tip

    You can select Save draft at any time to save the prediction as a draft. You'll find the draft prediction in the My predictions tab to continue.

  2. Enter the number of days to define churn in the Churn definition field. For example, if a customer has made no purchases in the last 30 days, they might be considered as churned for your business.

  3. Select Next to continue.

Add required data

  1. Select Add data and choose the activity type in the side pane that contains the required transaction or purchase history information.

  2. Under Select activities, choose the specific activities from the selected activity type you'd like the calculation to focus on.

    Side pane showing choosing specific activities under the semantic type.

    If you haven't mapped the activity to a semantic type yet, select Edit to do so. The guided experience to map semantic activities opens. Map your data to the corresponding fields in the selected activity type.

  3. Map the semantic attributes to the fields that are required to run the model. If the fields below aren't populated, configure the relationship from your purchase history entity to the Customer entity. Select Save.

  4. In the Add required data step, select Next to proceed if you don't want to add more activities.

Add additional data (optional)

Configure the relationship from your customer activity entity to the Customer entity.

  1. Select the field that identifies the customer in the customer activity table. It can be directly related to the primary customer ID of your Customer entity.

  2. Select the entity that is your primary Customer entity.

  3. Enter a name that describes the relationship.

Customer activities

  1. Optionally, select Add data for Customer activities.

  2. Select the semantic activity type that contains the data you would like to use, then select one or more activities in the Activities section.

  3. Select an activity type that matches to the type of customer activity you're configuring. Select Create new and choose an available activity type or create a new type.

  4. Select Next, then Save.

  5. If you have any other customer activities you would like to include, repeat the steps above.

Set schedule and review configuration

  1. Set a frequency to retrain your model. This setting is important to update the accuracy of predictions as new data is ingested. Most businesses can retrain once per month and get a good accuracy for their prediction.

  2. Select Next.

  3. Review the configuration of the prediction. You can go back to prior steps by selecting Edit under the shown value. Or you can select a configuration step from the progress indicator.

  4. If all values are configured correctly, select Save and run to begin the prediction process. On the My predictions tab, you can see the status of your predictions. The process may take several hours to complete depending on the amount of data used in the prediction.

Review a prediction status and results

  1. Go to Intelligence > Predictions and select the My predictions tab.

  2. Select the prediction you want to review.

    • Prediction name: Name of the prediction provided when creating it.
    • Prediction type: Type of model used for the prediction
    • Output entity: Name of the entity to store the output of the prediction. You can find an entity with this name on Data > Entities. In the output entity, ChurnScore is the predicted probability of churn and IsChurn is a binary label based on ChurnScore with 0.5 threshold. The default threshold might not work for your scenario. Create a new segment with your preferred threshold. Not all customers are necessarily active customers. Some of them may not have had any activity for a long time and are considered as churned already, based on you churn definition. Predicting the churn risk for customers who already churned isn't useful because they are not the audience of interest.
    • Predicted field: This field is populated only for some types of predictions, and isn't used in churn prediction.
    • Status: Status of the prediction run.
      • Queued: Prediction is waiting for other processes to run.
      • Refreshing: Prediction is currently running to produce results that will flow into the output entity.
      • Failed: Prediction run has failed. Review the logs for more details.
      • Succeeded: Prediction has succeeded. Select View under the vertical ellipses to review the prediction
    • Edited: The date the configuration for the prediction was changed.
    • Last refreshed: The date the prediction refreshed results in the output entity.
  3. Select the vertical ellipses next to the prediction you want to review results for and select View.

    View control to see results of a prediction.

  1. There are three primary sections of data within the results page:
    • Training model performance: A, B, or C are possible scores. This score indicates the performance of the prediction and can help you make the decision to use the results stored in the output entity. Scores are determined based on the following rules:

      • A when the model accurately predicted at least 50% of the total predictions, and when the percentage of accurate predictions for customers who churned is greater than the baseline rate by at least 10%.

      • B when the model accurately predicted at least 50% of the total predictions, and when the percentage of accurate predictions for customers who churned is up to 10% greater than the baseline.

      • C when the model accurately predicted less 50% of the total predictions, or when the percentage of accurate predictions for customers who churned is less than the baseline.

      • Baseline takes the prediction time window input for the model (for example, one year), and the model creates different fractions of time by dividing it by 2 until it reaches one month or less. It uses these fractions to create a business rule for customers who have not purchased in this time frame. These customers are considered as churned. The time-based business rule with the highest ability to predict who is likely to churn is chosen as baseline model.

    • Likelihood to churn (number of customers): Groups of customers based on their predicted risk of churn. This data can help you later if you want to create a segment of customers with high churn risk. Such segments help to understand where your cutoff should be for segment membership.

    • Most influential factors: There are many factors that are taken into account when creating your prediction. Each of the factors has its importance calculated for the aggregated predictions a model creates. You can use these factors to help validate your prediction results, or you can use this information later to create segments that could help influence churn risk for customers.

Manage predictions

It's possible to optimize, troubleshoot, refresh, or delete predictions. Review an input data usability report to find out how to make a prediction faster and more reliable. For more information, go to Manage predictions.