opendatasets package

Enable consuming Azure open datasets into dataframes and enrich customer data.

Packages

accessories

Accessory classes that help identify types of columns in data, e.g. lat/long, zipcode, time, etc.

aggregators

An aggregator defines the aggregations needed after the join.

If no aggregation is needed, use aggregator_all.

data

Init file for data resouces in publicholidays module.

dataaccess

Provide blob file access methods.

enrichers

An enricher is a class responsible for enrich customer data with open data.

But essentially they can be any data that make sense to be joined together.

granularities

Granularities used by enrichers, e.g. hourly, daily, closest X locations, etc.

selectors

Selectors define logics to select columns from both customer and public data to join together.

Examples: join by finding the nearest X locations, or by rounding to the same time granularity.

Modules

environ

Define runtime environment classes.

Classes

BostonSafety

Boston city safety class.

ChicagoSafety

Chicago city safety class.

NoaaGfsWeather

NOAA GFS forecast weather class.

NoaaIsdWeather

NOAA ISD historical weather class.

NycSafety

New York city safety class.

NycTlcFhv

NYC TLC FHV data class.

NycTlcGreen

NYC TLC green data class.

NycTlcYellow

NYC TLC yellow data class.

PublicHolidays

Public holiday class.

PublicHolidaysOffline

Public holiday class.

SanFranciscoSafety

San Francisco city safety class.

SeattleSafety

Seattle city safety class.

UsPopulationCounty

US population by county class.

UsPopulationZip

US population by zip class.

UsLaborCPI

US labor cpi class.

UsLaborPPIIndustry

US labor ppi industry class.

UsLaborPPICommodity

US labor ppi commodity class.

UsLaborEHENational

US labor ehe national class.

UsLaborEHEState

US labor ehe state class.

UsLaborLAUS

US labor laus class.

UsLaborLFS

US labor lfs class.

Wikipedia

Wikipedia class.

MNIST

MNIST data class.

SampleAdult

Sample Adult data class.

SampleBreastCancer

Sample Breast Cancer Wisconsin data class.

SampleHousing

Sample Boston hoursing data class.

SampleIris

Sample Iris data class.

Diabetes

Diabetes data class.