Creating a Solution and Data Source (Intermediate Data Mining Tutorial)

Each Microsoft SQL Server Analysis Services project defines the schema for the objects in a single Analysis Services database. An Analysis Services database contains mining structures and mining models, Online Analytical Processing (OLAP) cubes, and supplemental objects such as data sources and data source views.

When you create an Analysis Services project by using Business Intelligence Development Studio, your solution is stored as a local file until the solution is deployed. When you deploy the solution, Analysis Services automatically creates a database that has the same name as the project. By default, Analysis Services uses the** localhost** instance for new projects. If you are using a named instance, or if you specified a different name for the default instance, you must change the deployment database property of the project to the location where you want to create your data mining objects.

After the database has been created, you can browse it, view models, or create predictions by using either SQL Server Management Studio or Business Intelligence Development Studio. You can continue to add objects to this database either by using Analysis Management Objects (AMO) or by re-opening the project file in Business Intelligence Development Studio.

You can also create a project but change the database name, which will enable you to reuse an existing database and add data mining objects to it. However, if you do this, you should review the project properties to specify whether the existing database will be updated, or whether it will be overwritten.

For more information about Analysis Services projects, see Defining an Analysis Services Project.

To create a new Analysis Services project for this tutorial

  1. Open Business Intelligence Development Studio.

  2. On the File menu, point to New, and then click Project.

  3. Verify that Analysis Services Project is selected in the Templates pane.

  4. In the Name box, name the new project DM_Intermediate.

  5. Click OK.

To change the instance where data mining objects are stored (optional)

  1. In Business Intelligence Development Studio, on the Project menu, click Properties.

  2. In the left side of the Property Pages pane, click Deployment.

  3. Verify that the Server name is localhost. If you are using a different instance, type the name of the instance. Click OK.

To change the deployment properties for a project (optional)

  1. In Solution Explorer, right-click the project, and then select Properties.

    -- or --

    In Business Intelligence Development Studio, on the Project menu, select Properties.

  2. In the left side of the Property Pages pane, click Deployment.

    In the Options pane, select Deployment Mode, and set the options to Deploy All to overwrite, or to Deploy Changes Only to update objects or add objects.

Creating a Data Source

In the Basic Data Mining Tutorial, you created a data source that stores connection information for the AdventureWorksDW2008R2 database. Follow the same steps to create the AdventureWorksDW2008R2 data source in this solution.

To create a data source

A single data source can support multiple data source views, and each data source view can have multiple tables. However, because the data source and data source view are deployed to your Microsoft SQL Server Analysis Services database together with the data mining models that you create, as a best practice you should include in the data source view only those tables that are required for each data mining model or group of models.

In the following lessons, you will add data source views to support the new mining structures and models. The market basket and sequence clustering lessons use the same data source, but in all other respects, the lessons are independent of each other and can be completed separately.


Data Source view

Lesson 2: Building a Forecasting Scenario (Intermediate Data Mining Tutorial)

Monthly sales reports for bicycle versions in different regions.

Lesson 3: Building a Market Basket Scenario (Intermediate Data Mining Tutorial)

Data from analysis of customer purchases. This data source view includes a nested table.

Lesson 4: Building a Sequence Clustering Scenario (Intermediate Data Mining Tutorial)

Data from analysis of customer purchases, enhanced by addition of a sequence identifier.

Lesson 5: Building Neural Network and Logistic Regression Models (Intermediate Data Mining Tutorial)

Preliminary performance tracking data from a call center.