DataFrame.Checkpoint(Boolean) Method

Definition

Returns a checkpointed version of this DataFrame.

public Microsoft.Spark.Sql.DataFrame Checkpoint (bool eager = true);
member this.Checkpoint : bool -> Microsoft.Spark.Sql.DataFrame
Public Function Checkpoint (Optional eager As Boolean = true) As DataFrame

Parameters

eager
Boolean

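Whether to checkpoint this DataFrame immediately. Defaults to true; when false, the checkpoint is deferred until the first action on the returned DataFrame.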

Returns

DataFrame

Checkpointed DataFrame

Remarks

Checkpointing can be used to truncate the logical plan of this DataFrame, which is especially useful in iterative algorithms where the plan may grow exponentially. The checkpointed data is saved to files inside the checkpoint directory set with SparkContext.SetCheckpointDir(String).
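
The following is a minimal sketch of how checkpointing might be used in an iterative job, not a definitive pattern. The application name, the checkpoint path /tmp/spark-checkpoints, the loop bounds, and the choice to checkpoint every fifth iteration are all assumptions made for illustration; on a cluster the directory would typically need to be on shared storage reachable by every executor.

```csharp
using Microsoft.Spark.Sql;

class CheckpointExample
{
    static void Main()
    {
        SparkSession spark = SparkSession
            .Builder()
            .AppName("checkpoint-example") // hypothetical application name
            .GetOrCreate();

        // The checkpoint directory must be set before calling Checkpoint.
        // The path below is an assumption for this sketch.
        spark.SparkContext.SetCheckpointDir("/tmp/spark-checkpoints");

        DataFrame df = spark.Range(0, 1000);

        // Each iteration adds another projection to the logical plan.
        for (int i = 1; i <= 20; i++)
        {
            df = df.WithColumn("id", df["id"] + 1);

            // Every fifth iteration (an arbitrary interval), checkpoint eagerly
            // so that later iterations start from a truncated plan.
            if (i % 5 == 0)
            {
                df = df.Checkpoint();
            }
        }

        // With eager set to false, the checkpoint is only marked here and is
        // written when the first action (Show, below) runs.
        DataFrame lazy = df.Checkpoint(eager: false);
        lazy.Show(5);

        spark.Stop();
    }
}
```

Checkpointing writes data to storage, so the interval is a trade-off: checkpointing too often adds I/O cost, while checkpointing too rarely lets the plan keep growing.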

Applies to