A small sample of airline on-time performance data.
An .xdf file with 600000 observations on the following 3 variables.
ArrDelay: arrival delay, in minutes (stored as integer).
CRSDepTime: scheduled departure time (stored as float32).
DayOfWeek: day of the week (stored as a factor).
This data set is a small subsample of airline on-time performance data containing only 600,000 observations of 3 variables, so it will fit in memory on most systems. It is an .xdf file, which means that the data are stored in blocks. The AirlineDemoSmall.xdf data file contains 3 blocks and is compressed with zlib. The AirlineDemoSmallUC.xdf file contains the same data, but without compression.
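The block structure and variable storage types described above can be inspected directly. The following is a minimal sketch, assuming the RevoScaleR package is installed and its sample data directory contains AirlineDemoSmall.xdf; the object name `info` is only illustrative, and the code is skipped when the package is not available.

```r
# Sketch only: assumes RevoScaleR is installed; otherwise nothing is run.
if (requireNamespace("RevoScaleR", quietly = TRUE)) {
  library(RevoScaleR)

  # Locate the compressed sample file shipped with the package.
  airlineSmall <- file.path(rxGetOption("sampleDataDir"), "AirlineDemoSmall.xdf")

  # Request variable metadata and the number of rows stored in each block.
  info <- rxGetInfo(airlineSmall, getVarInfo = TRUE, getBlockSizes = TRUE)
  print(info)
}
```

Pointing the same call at AirlineDemoSmallUC.xdf should report the same variables and rows, since only the compression differs.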
The data are also available in text format in the file AirlineDemoSmall.csv. A smaller subset, with only 1010 rows and no missing data, is available in text format in the file AirlineDemo1kNoMissing.csv.
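Because the smaller text file has no missing data, it can be read with base R alone. This is a hedged sketch rather than a recommended workflow: it assumes RevoScaleR is installed (only to locate the sample data directory) and that the file has a header row; `csvPath` and `air1k` are illustrative names.

```r
# Sketch only: assumes RevoScaleR is installed so the sample directory can be found.
if (requireNamespace("RevoScaleR", quietly = TRUE)) {
  csvPath <- file.path(RevoScaleR::rxGetOption("sampleDataDir"),
                       "AirlineDemo1kNoMissing.csv")

  # Read the small text file into an ordinary data frame;
  # character columns are converted to factors.
  air1k <- read.csv(csvPath, stringsAsFactors = TRUE)

  # Show the column names, types, and first few values.
  str(air1k)
}
```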
American Statistical Association Statistical Computing Group, Data Expo '09.
U.S. Department of Transportation, Bureau of Transportation Statistics,
Research and Innovative Technology Administration. Airline On-Time Statistics.
airlineSmall <- file.path(rxGetOption("sampleDataDir"), "AirlineDemoSmall.xdf")
rxSummary(~ ArrDelay + CRSDepTime, data = airlineSmall)