RxDataSource

revoscalepy.RxDataSource(column_info: dict = None)

Description

Base class for all revoscalepy data sources. Can be used with head() and tail() to display the first and last rows of the data set.

Arguments

num_rows

Integer value specifying the number of rows to display starting from the beginning of the dataset. If not specified, the default of 6 will be used.

report_progress

Integer value with options:

  • 0: no progress is reported.
  • 1: the number of processed rows is printed and updated.
  • 2: rows processed and timings are reported.
  • 3: rows processed and all timings are reported.

Example

# Return the first 4 rows
import os
from revoscalepy import RxOptions, RxXdfData
sample_data_path = RxOptions.get_option("sampleDataDir")
ds = RxXdfData(os.path.join(sample_data_path, "AirlineDemoSmall.xdf"))
ds.head(num_rows=4)

# Return the last 4 rows
import os
from revoscalepy import RxOptions, RxXdfData
sample_data_path = RxOptions.get_option("sampleDataDir")
ds = RxXdfData(os.path.join(sample_data_path, "AirlineDemoSmall.xdf"))
ds.tail(num_rows=4)