Structured Streaming

Structured Streaming is the Apache Spark API that lets you express computation on streaming data in the same way you express a batch computation on static data. The Spark SQL engine performs the computation incrementally and continuously updates the result as streaming data arrives. For an overview of Structured Streaming, see the Apache Spark Structured Streaming Programming Guide. These articles provide introductory notebooks, details on how to use specific types of streaming sources and sinks, how to put streaming into production, and notebooks demonstrating example use cases:

API reference

For reference information about Structured Streaming, Azure Databricks recommends the following Apache Spark API reference:


For detailed information on how you can perform complex streaming analytics using Apache Spark, see the posts in this multi-part blog series:

Legacy Spark Streaming

For information about the legacy Spark Streaming feature, see: