dataaccess Package

Contains functionality providing blob file access methods.

When you use a class from the opendatasets package like the ChicagoSafety class, the dataaccess classes and functions in this package are used internally. In general, you won't need to use the functionality in the dataaccess package directly.

Modules

base_blob_info

Contains the blob info base class.

blob_parquet_descriptor

Contains the descriptor of blob parquet.

dataset_partition_prep

Contains functionality for specifying dataset partition preparation.

Partition preparation occurs automatically, when you use a opendatasets classe that requires a partition of data, such as the NycTlcGreen class.

pandas_data_load_limit

Contains functionality to control how the limit pandas data loads when parquet files are large.

With this module's functionality, you can specify how to limit how pandas data loads when parquet files are too large to load.