Data exploration and visualization tools on the Azure Data Science Virtual Machine

In data science, understanding the data is key. Visualization and data exploration tools help accelerate that understanding. The following tools, provided on the Data Science Virtual Machine (DSVM), make this step easier.

Apache Drill

What is it? An open-source SQL query engine for big data
Supported DSVM versions: Windows, Linux
How is it configured and installed on the DSVM? Installed in /dsvm/tools/drill* in embedded mode only
Typical uses: In-place data exploration without requiring extract, transform, load (ETL). Query different data sources and formats, including CSV, JSON, relational tables, and Hadoop.
How to use and run it: Desktop shortcut. See Get started with Drill in 10 minutes.
Related tools on the DSVM: Rattle, Weka, SQL Server Management Studio
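
As an illustration of in-place exploration with Drill, the following minimal Python sketch builds a SQL query against a file and submits it to Drill's REST interface. It assumes that Drill is already running in embedded mode on the DSVM and that its web endpoint is reachable on the default port, 8047. The file path and the helper names (build_query, build_payload, run_query) are hypothetical and are not part of the DSVM or of Drill.

```python
import json
import urllib.request

# Assumption: Drill's REST endpoint listens on its default port, 8047.
DRILL_URL = "http://localhost:8047/query.json"


def build_query(path, limit=10):
    """Return a Drill SQL statement that reads a file in place via the dfs plugin."""
    if "`" in path:
        raise ValueError("path must not contain backticks")
    return f"SELECT * FROM dfs.`{path}` LIMIT {int(limit)}"


def build_payload(sql):
    """Encode a SQL statement as the JSON body for Drill's REST query endpoint."""
    return json.dumps({"queryType": "SQL", "query": sql}).encode("utf-8")


def run_query(sql, url=DRILL_URL):
    """POST the query to a running Drill instance and return the result rows."""
    request = urllib.request.Request(
        url,
        data=build_payload(sql),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response).get("rows", [])


# Example usage (requires a running Drill instance; the path is hypothetical):
#   for row in run_query(build_query("/tmp/sample.csv", limit=5)):
#       print(row)
```

Because the file is queried where it sits, no load step is needed before exploring it. The same pattern applies to JSON files by changing the path.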

Power BI Desktop

What is it? An interactive data visualization and business intelligence (BI) tool
Supported DSVM versions: Windows
Typical uses: Data visualization and building dashboards
How to use and run it: Use the desktop shortcut (C:\Program Files\Microsoft Power BI Desktop\bin\PBIDesktop.exe), or run it from the Start menu.
Related tools on the DSVM: Visual Studio 2019, Visual Studio Code, Juno

Note

You need a Microsoft Office 365 account to access Power BI.

Rattle

What is it? A graphical user interface (GUI) for data mining using R
Supported DSVM versions: Windows, Linux
Typical uses: General-purpose GUI data mining tool for R
How to use and run it: Rattle is a UI tool. On Windows, open a command prompt, run R, and then inside R, run rattle(). On Linux, connect with X2Go, start a terminal, run R, and then inside R, run rattle().
Links to samples: Rattle
Related tools on the DSVM: LightGBM, Weka, XGBoost

Weka

What is it? A collection of machine-learning algorithms for data mining tasks. These algorithms can either be applied directly to a dataset or called from your own Java code. Weka contains tools for data preprocessing, classification, regression, clustering, association rules, and visualization.
Supported DSVM versions: Windows, Linux
Typical uses: General-purpose machine-learning tool
How to use and run it: On Windows, search for Weka on the Start menu. On Linux, sign in with X2Go, and then go to Applications > Development > Weka.
Links to samples: Weka samples
Related tools on the DSVM: LightGBM, Rattle, XGBoost