NoaaIsdWeather Class

Represents the National Oceanic and Atmospheric Administration (NOAA) Integrated Surface Dataset (ISD).

This dataset contains worldwide hourly weather history data (for example, temperature, precipitation, and wind) sourced from NOAA. For more information about this dataset, including column descriptions, different ways to access the dataset, and examples, see NOAA Integrated Surface Data in the Microsoft Azure Open Datasets catalog.

Initialize filtering fields.

Inheritance
NoaaIsdWeather

Constructor

NoaaIsdWeather(start_date: datetime = datetime.datetime(2008, 1, 1, 0, 0), end_date: datetime = datetime.datetime(2024, 2, 7, 0, 0), cols: List[str] | None = None, enable_telemetry: bool = True)

Parameters

start_date
datetime
default value: 2008-01-01 00:00:00

The date at which to start loading data, inclusive. If None, the default_start_date is used.

end_date
datetime
default value: 2024-02-07 00:00:00

The date at which to end loading data, inclusive. If None, the default_end_date is used.

cols
list[str]
default value: None

A list of column names to load from the dataset. If None, all columns are loaded. For information on the available columns in this dataset, see NOAA Integrated Surface Data.

enable_telemetry
bool
default value: True

Whether to enable telemetry on this dataset. If not specified, telemetry is enabled.

Remarks

The example below shows how to access the dataset.


   from azureml.opendatasets import NoaaIsdWeather
   from datetime import datetime
   from dateutil.relativedelta import relativedelta


   end_date = datetime.today()
   start_date = datetime.today() - relativedelta(months=1)
   isd = NoaaIsdWeather(start_date=start_date, end_date=end_date)
   isd_df = isd.to_pandas_dataframe()
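Because the dataset is hourly and worldwide, it can help to keep the date window short and to pass cols so that only the needed columns are loaded. The following is a minimal sketch of that idea, not an official recipe. The helper names trailing_window and load_isd are hypothetical, and the sketch assumes the azureml-opendatasets package is installed.

```python
from datetime import datetime, timedelta
from typing import List, Optional, Tuple


def trailing_window(days: int, now: Optional[datetime] = None) -> Tuple[datetime, datetime]:
    """Return (start_date, end_date) covering the last `days` days up to `now`."""
    end_date = now if now is not None else datetime.today()
    return end_date - timedelta(days=days), end_date


def load_isd(days: int = 7, cols: Optional[List[str]] = None):
    """Load a trailing window of ISD data into a pandas DataFrame (hypothetical helper)."""
    # Imported lazily so trailing_window stays usable without the package installed.
    from azureml.opendatasets import NoaaIsdWeather

    start_date, end_date = trailing_window(days)
    # cols=None loads every column, as described in the Parameters section.
    isd = NoaaIsdWeather(start_date=start_date, end_date=end_date, cols=cols)
    return isd.to_pandas_dataframe()
```

A call such as load_isd(days=7, cols=["usaf", "wban", "datetime", "temperature"]) would then request one week of data. The column names in that call are assumptions for illustration; check them against the column descriptions in the catalog before relying on them.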

Methods

filter

Filter the data to the time range between min_date and max_date.

filter(env: RuntimeEnv, min_date: datetime, max_date: datetime)

Parameters

env
RuntimeEnv
Required

The runtime environment.

min_date
datetime
Required

The minimum date of the time range.

max_date
datetime
Required

The maximum date of the time range.

Returns

The filtered data frame.
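To make the idea of a time-range filter concrete, here is a small standard-library sketch that works on plain dictionaries. It is not the library's implementation. The function name filter_by_time and the "datetime" key are assumptions, and treating both bounds as inclusive is also an assumption, borrowed from how the constructor describes start_date and end_date.

```python
from datetime import datetime
from typing import Any, Dict, Iterable, List


def filter_by_time(
    rows: Iterable[Dict[str, Any]],
    min_date: datetime,
    max_date: datetime,
    time_col: str = "datetime",  # assumed name of the timestamp field
) -> List[Dict[str, Any]]:
    """Keep rows whose timestamp lies in [min_date, max_date], bounds included."""
    return [row for row in rows if min_date <= row[time_col] <= max_date]


# Three illustrative observations; the values are made up for the sketch.
observations = [
    {"datetime": datetime(2024, 1, 1, 0), "temperature": 1.5},
    {"datetime": datetime(2024, 1, 2, 0), "temperature": 2.0},
    {"datetime": datetime(2024, 1, 3, 0), "temperature": 0.5},
]
kept = filter_by_time(observations, datetime(2024, 1, 1), datetime(2024, 1, 2))
```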

Attributes

default_end_date

default_end_date = datetime.datetime(2024, 2, 7, 0, 0)

default_start_date

default_start_date = datetime.datetime(2008, 1, 1, 0, 0)

id_column_name

id_column_name = 'ID'

latitude_column_name

latitude_column_name = 'latitude'

longitude_column_name

longitude_column_name = 'longitude'

usaf_column_name

usaf_column_name = 'usaf'

wban_column_name

wban_column_name = 'wban'
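
The column-name attributes are useful when code has to refer to the station identifier columns without hard-coding strings in several places. The sketch below groups rows by station using the 'usaf' and 'wban' values listed above. The helpers station_key and group_by_station are hypothetical, and combining the two identifiers into one key is an illustrative choice rather than something the class prescribes.

```python
from collections import defaultdict
from typing import Any, Dict, Iterable, List

# These mirror the attribute values listed above; with the package installed,
# NoaaIsdWeather.usaf_column_name and NoaaIsdWeather.wban_column_name could be read instead.
USAF_COLUMN = "usaf"
WBAN_COLUMN = "wban"


def station_key(row: Dict[str, Any]) -> str:
    """Build a single station key from the two identifier columns."""
    return f"{row[USAF_COLUMN]}-{row[WBAN_COLUMN]}"


def group_by_station(rows: Iterable[Dict[str, Any]]) -> Dict[str, List[Dict[str, Any]]]:
    """Group row dictionaries by their combined station key."""
    groups: Dict[str, List[Dict[str, Any]]] = defaultdict(list)
    for row in rows:
        groups[station_key(row)].append(row)
    return dict(groups)
```

With a pandas data frame, the same grouping could be expressed by passing both column names to groupby.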