Data Mining (SSAS)

Analysis Services provides an integrated platform for solutions that incorporate data mining. You can use either relational or cube data to create business intelligence solutions with predictive analytics.

Benefits of Data Mining

Data mining uses well-researched statistical principles to discover patterns in your data, helping you make intelligent decisions about complex problems. By applying the data mining algorithms in Analysis Services to your data, you can forecast trends, identify patterns, create rules and recommendations, analyze the sequence of events in complex data sets, and gain new insights.

In SQL Server 2012, data mining is powerful, accessible, and integrated with the tools that many people prefer to use for analysis and reporting. See the links in this section to get the broad background about data mining that you need to get started.

Key Data Mining Features

SQL Server provides the following features in support of integrated data mining solutions:

  • Multiple data sources: You do not have to create a data warehouse or an OLAP cube to do data mining. You can use tabular data from external providers, spreadsheets, and even text files. You can also easily mine OLAP cubes created in Analysis Services. However, you cannot use data from an in-memory database.

  • Integrated data cleansing, data management, and ETL: Data Quality Services provides advanced tools for profiling and cleansing data. Integration Services can be used to build ETL processes for cleaning data, and also for building, processing, training, and updating models.

  • Multiple customizable algorithms: In addition to providing algorithms such as clustering, neural networks, and decisions trees, the platform supports development of your own custom plug-in algorithms.

  • Model testing infrastructure: Test your models and data sets using important statistical tools as cross-validation, classification matrices, lift charts, and scatter plots. Easily create and manage testing and training sets.

  • Querying and drillthrough: Create prediction queries, retrieve model patterns and statistics, and drill through to case data.

  • Client tools: In addition to the development and design studios provided by SQL Server, you can use the Data Mining Add-ins for Excel to create, query, and browse models. Or, create custom clients, including Web services.

  • Scripting language support and managed API: All data mining objects are fully programmable. Scripting is possible through MDX, XMLA, or the PowerShell extensions for Analysis Services. Use the Data Mining Extensions (DMX) language for fast querying and scripting.

  • Security and deployment: Provides role-based security through Analysis Services, including separate permissions for drillthrough to model and structure data. Easy deployment of models to other servers, so that users can access the patterns or perform predictions

In This Section

The topics in this section introduce the principal features of SQL Server Data Mining and related tasks.