DataFrame.Collect Method

Definition

Returns all rows in this DataFrame as a sequence of Row objects.

public System.Collections.Generic.IEnumerable<Microsoft.Spark.Sql.Row> Collect ();
member this.Collect : unit -> seq<Microsoft.Spark.Sql.Row>
Public Function Collect () As IEnumerable(Of Row)

Returns

An IEnumerable<Row> that contains every row in this DataFrame.

Remarks

This method moves all of the DataFrame's data into the application's driver process. On a very large dataset, doing so can crash the driver process with an OutOfMemoryError.
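
One way to limit that risk is to reduce the DataFrame before collecting it. The following is a minimal sketch, not a definitive pattern. It assumes the application is launched against a Spark runtime (for example through spark-submit with the Microsoft.Spark package). The application name, the 1,000-row range, and the limit of 10 are illustrative values, not part of this reference.

```csharp
using System;
using System.Collections.Generic;
using Microsoft.Spark.Sql;

class CollectExample
{
    static void Main()
    {
        // Hypothetical application name, chosen only for this sketch.
        SparkSession spark = SparkSession
            .Builder()
            .AppName("collect-example")
            .GetOrCreate();

        // Illustrative data: a single-column DataFrame of 1,000 long values.
        DataFrame df = spark.Range(0, 1000);

        // Limit the row count first so that only a small, bounded
        // result is moved into the driver process by Collect().
        IEnumerable<Row> rows = df.Limit(10).Collect();

        foreach (Row row in rows)
        {
            // Get(0) returns the value of the first column as an object.
            Console.WriteLine(row.Get(0));
        }

        spark.Stop();
    }
}
```

Calling Limit before Collect keeps the amount of data sent to the driver bounded no matter how large the source DataFrame is. Filtering or aggregating first serves the same purpose when a fixed row count is not appropriate.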
