DataFrameReader Class
Definition
DataFrameReader provides functionality to load a DataFrame from external storage systems (e.g., file systems and key-value stores).
public sealed class DataFrameReader
type DataFrameReader = class
Public NotInheritable Class DataFrameReader
Inheritance
Object → DataFrameReader
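Examples
The following is a minimal sketch of the typical call chain, not an official sample. It assumes the reader is obtained from SparkSession.Read() in the same Microsoft.Spark library (that method is not documented on this page), and the file name people.csv and its columns are hypothetical.

```csharp
using Microsoft.Spark.Sql;

class CsvReadSketch
{
    static void Main()
    {
        // Assumption: a Spark runtime is available and the app is launched through spark-submit.
        SparkSession spark = SparkSession.Builder()
            .AppName("dataframereader-sketch")
            .GetOrCreate();

        // Configure the reader step by step, then load the data.
        DataFrame people = spark.Read()
            .Format("csv")                      // Format(String): choose the data source
            .Option("header", true)             // Option(String, Boolean): first row holds column names
            .Schema("name STRING, age INT")     // Schema(String): DDL-formatted schema (hypothetical columns)
            .Load("people.csv");                // Load(String): hypothetical path

        people.PrintSchema();
        people.Show();

        spark.Stop();
    }
}
```

Each configuration method returns the reader itself, so calls can be chained; no data is read until Load (or a format-specific method such as Csv) is called.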
Methods
Method | Description
Csv(String[]) | Loads CSV files and returns the result as a DataFrame.
Format(String) | Specifies the input data source format.
Jdbc(String, String, Dictionary<String,String>) | Constructs a DataFrame representing the database table (table) accessible via the JDBC URL (url), using the given connection properties.
Jdbc(String, String, IEnumerable<String>, Dictionary<String,String>) | Constructs a DataFrame representing the database table (table) accessible via the JDBC URL (url), using the given connection properties. The predicates parameter gives a list of expressions suitable for inclusion in WHERE clauses; each one defines one partition of the DataFrame.
Jdbc(String, String, String, Int64, Int64, Int32, Dictionary<String,String>) | Constructs a DataFrame representing the database table (table) accessible via the JDBC URL (url). Partitions of the table are retrieved in parallel based on the parameters passed to this method.
Json(String[]) | Loads JSON files (one object per line) and returns the result as a DataFrame.
Load() | Loads input as a DataFrame, for data sources that don't require a path (e.g., external key-value stores).
Load(String) | Loads input as a DataFrame, for data sources that require a path (e.g., data backed by a local or distributed file system).
Load(String[]) | Loads input as a DataFrame from the given paths.
Option(String, Boolean) | Adds an input option for the underlying data source.
Option(String, Double) | Adds an input option for the underlying data source.
Option(String, Int64) | Adds an input option for the underlying data source.
Option(String, String) | Adds an input option for the underlying data source.
Options(Dictionary<String,String>) | Adds input options for the underlying data source.
Orc(String[]) | Loads ORC files and returns the result as a DataFrame.
Parquet(String[]) | Loads Parquet files and returns the result as a DataFrame.
Schema(String) | Specifies the schema by using the given DDL-formatted string.
Schema(StructType) | Specifies the schema by using StructType.
Table(String) | Returns the specified table as a DataFrame.
Text(String[]) | Loads text files and returns a DataFrame whose schema starts with a string column named "value", followed by partition columns, if any.
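
The two partitioned Jdbc overloads listed above can be easier to follow with a concrete call. The sketch below is illustrative only: the JDBC URL, table name, column name, credentials, and predicates are all hypothetical, the argument order follows the signatures in the table, and the interpretation of the numeric arguments (partition column, lower bound, upper bound, number of partitions) is assumed from the corresponding Apache Spark API. It also assumes a matching JDBC driver is available to Spark and that the reader comes from SparkSession.Read().

```csharp
using System.Collections.Generic;
using Microsoft.Spark.Sql;

class JdbcReadSketch
{
    static void Main()
    {
        SparkSession spark = SparkSession.Builder()
            .AppName("jdbc-read-sketch")
            .GetOrCreate();

        // Hypothetical connection details; replace with real values.
        string url = "jdbc:postgresql://localhost:5432/shop";
        var properties = new Dictionary<string, string>
        {
            ["user"] = "reader",
            ["password"] = "secret"
        };

        // Range-based partitioning: Jdbc(String, String, String, Int64, Int64, Int32, Dictionary).
        // Assumed meaning: split the numeric column "order_id" between 1 and 1000000
        // into 8 partitions that are read in parallel.
        DataFrame byRange = spark.Read().Jdbc(
            url, "orders", "order_id", 1L, 1000000L, 8, properties);

        // Predicate-based partitioning: Jdbc(String, String, IEnumerable<String>, Dictionary).
        // Each predicate is a WHERE-clause expression and defines one partition.
        var predicates = new List<string>
        {
            "region = 'EU'",
            "region = 'US'",
            "region NOT IN ('EU', 'US')"
        };
        DataFrame byPredicate = spark.Read().Jdbc(url, "orders", predicates, properties);

        byRange.PrintSchema();
        byPredicate.PrintSchema();

        spark.Stop();
    }
}
```

With the predicate overload, the predicates should together cover every row exactly once; overlapping predicates would produce duplicate rows, and gaps would silently drop rows.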