Copy other people's work to do data science

Video 5: Data Science for Beginners series

One of the trade secrets of data science is getting other people to do your work for you. Find a clustering algorithm example in Azure AI Gallery to use for your own machine learning experiment.


Cortana Intelligence Gallery was renamed Azure AI Gallery. As a result, text and images in this transcript vary slightly from the video, which uses the former name.

To get the most out of the series, watch them all. Go to the list of videos

Other videos in this series

Data Science for Beginners is a quick introduction to data science in five short videos.

Transcript: Copy other people's work to do data science

Welcome to the fifth video in the series "Data Science for Beginners."

In this one, you’ll discover a place to find examples that you can borrow from as a starting point for your own work. You might get the most out of this video if you first watch the earlier videos in this series.

One of the trade secrets of data science is getting other people to do your work for you.

Microsoft has a cloud-based service called Azure Machine Learning Studio (classic). It provides you with a workspace where you can experiment with different machine learning algorithms, and, when you've got your solution worked out, you can launch it as a web service.

Part of this service is something called the Azure AI Gallery. It contains resources, including a collection of Azure Machine Learning Studio (classic) experiments, or models, that people have built and contributed for others to use. These experiments are a great way to leverage the thought and hard work of others to get you started on your own solutions. Everyone is welcome to browse through it.

Azure AI Gallery

If you click Experiments at the top, you'll see a number of the most recent and popular experiments in the gallery. You can search through the rest of experiments by clicking Browse All at the top of the screen, and there you can enter search terms and choose search filters.

Find and use a clustering algorithm example

So, for instance, let's say you want to see an example of how clustering works, so you search for "clustering sweep" experiments.

Search for clustering experiments

Here's an interesting one that someone contributed to the gallery.

Clustering experiment

Click on that experiment and you get a web page that describes the work that this contributor did, along with some of their results.

Clustering experiment description page

Notice the link that says Open in Studio (classic).

Open in Studio (classic) button

I can click on that and it takes me right to Azure Machine Learning Studio (classic). It creates a copy of the experiment and puts it in my own workspace. This includes the contributor's dataset, all the processing that they did, all of the algorithms that they used, and how they saved out the results.

Open a Gallery experiment in Machine Learning Studio (classic) - clustering algorithm example

And now I have a starting point. I can swap out their data for my own and do my own tweaking of the model. This gives me a running start, and it lets me build on the work of people who really know what they’re doing.

Find experiments that demonstrate machine learning techniques

There are other experiments in the Azure AI Gallery that were contributed specifically to provide how-to examples for people new to data science. For instance, there's an experiment in the gallery that demonstrates how to handle missing values (Methods for handling missing values). It walks you through 15 different ways of substituting empty values, and talks about the benefits of each method and when to use it.

Gallery experiments open in Machine Learning Studio (classic) - methods for missing values

Azure AI Gallery is a place to find working experiments that you can use as a starting point for your own solutions.

Be sure to check out the other videos in "Data Science for Beginners" from Microsoft Azure Machine Learning Studio (classic).

Next steps