SQL reference for Databricks Runtime 7.3 LTS and above

This is a SQL command reference for users on clusters running Databricks Runtime 7.x and above in the Databricks Data Science & Engineering workspace and Databricks Machine Learning environment.


General reference

This general reference describes data types, functions, identifiers, literals, and semantics.

DDL statements

You use data definition statements to create or modify the structure of database objects in a database.
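As a minimal sketch of what data definition statements look like, the following creates, alters, and drops a table. The database name `demo_db`, the table name `events`, and its columns are hypothetical and chosen only for illustration.

```sql
-- Hypothetical names (demo_db, events) used for illustration only.

-- Create a database and a table inside it.
CREATE DATABASE IF NOT EXISTS demo_db;

CREATE TABLE IF NOT EXISTS demo_db.events (
  event_id   BIGINT,
  event_type STRING,
  event_time TIMESTAMP
) USING DELTA;

-- Change the table structure by adding a column.
ALTER TABLE demo_db.events ADD COLUMNS (source STRING);

-- Remove the table, then the (now empty) database.
DROP TABLE IF EXISTS demo_db.events;
DROP DATABASE IF EXISTS demo_db;
```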

DML statements

You use data manipulation statements to add, change, or delete data.
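The sketch below shows one statement for each of the three operations: add, change, and delete. It assumes a hypothetical Delta table `demo_db.events`; in Databricks Runtime, `UPDATE` and `DELETE FROM` apply to Delta tables.

```sql
-- Hypothetical table used for illustration only.
CREATE DATABASE IF NOT EXISTS demo_db;
CREATE TABLE IF NOT EXISTS demo_db.events (
  event_id   BIGINT,
  event_type STRING,
  event_time TIMESTAMP
) USING DELTA;

-- Add rows.
INSERT INTO demo_db.events VALUES
  (1, 'click', TIMESTAMP '2021-01-01 00:00:00'),
  (2, 'view',  TIMESTAMP '2021-01-01 00:05:00');

-- Change existing rows (Delta tables).
UPDATE demo_db.events SET event_type = 'tap' WHERE event_id = 1;

-- Delete rows (Delta tables).
DELETE FROM demo_db.events WHERE event_id = 2;
```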

Data retrieval statements

You use a query to retrieve rows from one or more tables according to the specified clauses. The full syntax and brief description of supported clauses are explained in the Query article. The related SQL statements SELECT and VALUES are also included in this section.

Databricks Runtime also provides the ability to generate the logical and physical plan for a query using the EXPLAIN statement.
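For illustration, here is a small sketch of the statements mentioned in this section. It uses an inline table so that it does not depend on any existing data; the alias `t` and the column names `id` and `name` are assumptions made for the example.

```sql
-- A standalone VALUES statement returns the listed rows.
VALUES (1, 'a'), (2, 'b');

-- A query over an inline table, with a filter and an ordering clause.
SELECT id, name
FROM VALUES (1, 'a'), (2, 'b'), (3, 'c') AS t(id, name)
WHERE id > 1
ORDER BY id;

-- EXPLAIN EXTENDED shows the logical plans and the physical plan
-- for the same query instead of running it.
EXPLAIN EXTENDED
SELECT id, name
FROM VALUES (1, 'a'), (2, 'b'), (3, 'c') AS t(id, name)
WHERE id > 1
ORDER BY id;
```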

Delta Lake statements

You use Delta Lake SQL statements to manage tables stored in Delta Lake format.

For details on using Delta Lake statements, see the Delta Lake guide.
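As a rough sketch, the statements below inspect and maintain a Delta table. The table `demo_db.events` and the column `event_type` are hypothetical, and the retention value is an example rather than a recommendation.

```sql
-- Hypothetical Delta table used for illustration only.
CREATE DATABASE IF NOT EXISTS demo_db;
CREATE TABLE IF NOT EXISTS demo_db.events (
  event_id   BIGINT,
  event_type STRING,
  event_time TIMESTAMP
) USING DELTA;

-- Show the history of operations recorded for the table.
DESCRIBE HISTORY demo_db.events;

-- Compact small files and co-locate data by a commonly filtered column.
OPTIMIZE demo_db.events ZORDER BY (event_type);

-- Remove files no longer referenced by the table and older than 168 hours.
VACUUM demo_db.events RETAIN 168 HOURS;
```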

Auxiliary statements

You use auxiliary statements to collect statistics, manage the Apache Spark cache, explore metadata, set configurations, and manage resources:

Analyze statement

Apache Spark Cache statements

Describe statements

Show statements

Configuration management

Resource management
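The sketch below gives one example for each category of auxiliary statement listed above. The table `demo_db.events` is hypothetical, and the configuration value is an arbitrary example.

```sql
-- Hypothetical table used for illustration only.
CREATE DATABASE IF NOT EXISTS demo_db;
CREATE TABLE IF NOT EXISTS demo_db.events (
  event_id   BIGINT,
  event_type STRING
) USING DELTA;

-- Analyze: collect table-level statistics for the optimizer.
ANALYZE TABLE demo_db.events COMPUTE STATISTICS;

-- Apache Spark cache: cache the table, then release it.
CACHE TABLE demo_db.events;
UNCACHE TABLE demo_db.events;

-- Describe: show column and table metadata.
DESCRIBE TABLE EXTENDED demo_db.events;

-- Show: list the tables in a database.
SHOW TABLES IN demo_db;

-- Configuration management: set a property, then read it back.
SET spark.sql.shuffle.partitions = 64;
SET spark.sql.shuffle.partitions;

-- Resource management: list JAR files added to the session.
LIST JAR;
```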

Security statements

You use security SQL statements to manage access to data.

For details on using these statements, see Data object privileges.
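As an illustrative sketch, the statements below grant, inspect, and revoke a privilege on a table. They assume that table access control is enabled on the cluster; the table `demo_db.events` and the user `someone@example.com` are hypothetical.

```sql
-- Hypothetical table and user; assumes table access control is enabled.

-- Allow the user to read the table.
GRANT SELECT ON TABLE demo_db.events TO `someone@example.com`;

-- Show the privileges the user holds on the table.
SHOW GRANT `someone@example.com` ON TABLE demo_db.events;

-- Take the privilege away again.
REVOKE SELECT ON TABLE demo_db.events FROM `someone@example.com`;
```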