US Population by County
US population by gender and race for each US county, sourced from the 2000 and 2010 Decennial Census.
This dataset is sourced from the United States Census Bureau’s Decennial Census Dataset APIs. Review Terms of Service and Policies and Notices for the terms and conditions related to the use of this dataset.
Note
Microsoft provides Azure Open Datasets on an “as is” basis. Microsoft makes no warranties, express or implied, guarantees or conditions with respect to your use of the datasets. To the extent permitted under your local law, Microsoft disclaims all liability for any damages or losses, including direct, consequential, special, indirect, incidental or punitive, resulting from your use of the datasets.
This dataset is provided under the original terms under which Microsoft received the source data. The dataset may include data sourced from Microsoft.
Volume and retention
This dataset is stored in Parquet format and has data for the years 2000 and 2010.
Storage location
This dataset is stored in the East US Azure region. Allocating compute resources in East US is recommended for affinity.
Columns
Name | Data type | Unique | Values (sample) | Description |
---|---|---|---|---|
countyName | string | 1,960 | Washington County Jefferson County | County name. |
decennialTime | string | 2 | 2010 2000 | The time when the decennial census took place, for example, 2010 or 2000. |
maxAge | int | 23 | 9 66 | Max of the age range. If it’s null, it’s across all ages or the age range has no upper bound, for example, age > 85. |
minAge | int | 23 | 35 67 | Min of the age range. If it’s null, it’s across all ages. |
population | int | 47,229 | 1 2 | Population of this segment. |
race | string | 8 | ASIAN ALONE TWO OR MORE RACES | Race category in Census data. If it’s null, it’s across all races. |
sex | string | 3 | Male Female | Male or female. If it’s null, it’s across both sexes. |
stateName | string | 52 | Texas Georgia | Name of the state in the US. |
year | int | 2 | 2010 2000 | Year (in integer) of the decennial time. |
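Because a null in race, sex, or minAge marks a row that already aggregates across that dimension, summing the population column over every row counts the same people more than once. The following is a minimal sketch of how the aggregate rows could be selected with pandas. The `sample` frame, its numbers, and the helper names `county_totals` and `totals_by_sex` are hypothetical, and the sketch assumes the dataset contains rows that are null in all three dimensions. It tests minAge rather than maxAge, since a null maxAge can also mean an open-ended age range.

```python
import pandas as pd

# Hypothetical rows that follow the documented schema; the numbers are made up.
sample = pd.DataFrame(
    {
        "stateName": ["Texas"] * 5,
        "countyName": ["Crockett County"] * 5,
        "year": [2010] * 5,
        "race": [None, None, None, "WHITE ALONE", "WHITE ALONE"],
        "sex": [None, "Male", "Female", "Male", "Female"],
        "minAge": [None, None, None, 5, 55],
        "maxAge": [None, None, None, 9, 59],
        "population": [300, 160, 140, 12, 11],
    }
)


def county_totals(df):
    """Keep rows that aggregate across race, sex, and age (all three are null)."""
    mask = df["race"].isna() & df["sex"].isna() & df["minAge"].isna()
    return df[mask]


def totals_by_sex(df):
    """Keep rows broken down by sex only (race and age are null, sex is set)."""
    mask = df["race"].isna() & df["sex"].notna() & df["minAge"].isna()
    return df[mask]


# Summing every row mixes aggregate rows with detailed rows and overcounts.
naive_total = int(sample["population"].sum())
county_total = int(county_totals(sample)["population"].sum())
sex_total = int(totals_by_sex(sample)["population"].sum())
print(naive_total, county_total, sex_total)
```

In this made-up sample the per-sex rows add up to the county total, while the naive sum is larger than both.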
Preview
decennialTime | stateName | countyName | population | race | sex | minAge | maxAge | year |
---|---|---|---|---|---|---|---|---|
2010 | Texas | Crockett County | 123 | WHITE ALONE | Male | 5 | 9 | 2010 |
2010 | Texas | Crockett County | 1 | ASIAN ALONE | Female | 67 | 69 | 2010 |
2010 | Texas | Crockett County | 111 | WHITE ALONE | Female | 55 | 59 | 2010 |
2010 | Texas | Crockett County | 64 | TWO OR MORE RACES | null | | | 2010 |
2010 | Texas | Crockett County | 18 | null | Male | 85 | | 2010 |
2010 | Texas | Crockett County | 16 | AMERICAN INDIAN AND ALASKA NATIVE ALONE | Female | | | 2010 |
2010 | Texas | Crockett County | 7 | WHITE ALONE | Male | 21 | 21 | 2010 |
2010 | Texas | Crockett County | 45 | null | Female | 85 | | 2010 |
2010 | Texas | Crockett County | 0 | NATIVE HAWAIIAN AND OTHER PACIFIC ISLANDER ALONE | Female | 67 | 69 | 2010 |
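The minAge and maxAge pairs in the preview can be turned into readable age-range labels. The sketch below is one possible way to do that and is not part of the dataset or the package. The helper `age_label` and the label wording are hypothetical, and reading a row with minAge 85 and no maxAge as "85+" (bound included) is an assumption.

```python
import pandas as pd


def age_label(min_age, max_age):
    """Build a label from one minAge/maxAge pair, following the column descriptions."""
    if pd.isna(min_age):
        # A null minAge means the row covers all ages.
        return "all ages"
    if pd.isna(max_age):
        # minAge set and maxAge null: open-ended range (assumed to include minAge).
        return f"{int(min_age)}+"
    if int(min_age) == int(max_age):
        # Single-year age group, such as 21 to 21.
        return str(int(min_age))
    return f"{int(min_age)}-{int(max_age)}"


# Age pairs copied from the preview rows; None stands for an empty cell.
preview_ages = pd.DataFrame(
    {"minAge": [5, 67, None, 85, 21], "maxAge": [9, 69, None, None, 21]}
)
preview_ages["ageRange"] = [
    age_label(lo, hi)
    for lo, hi in zip(preview_ages["minAge"], preview_ages["maxAge"])
]
print(preview_ages)
```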
Data access
Azure Notebooks
# This is a package in preview.
from azureml.opendatasets import UsPopulationCounty
population = UsPopulationCounty()
population_df = population.to_pandas_dataframe()
population_df.info()
Azure Databricks
# This is a package in preview.
from azureml.opendatasets import UsPopulationCounty
population = UsPopulationCounty()
population_df = population.to_spark_dataframe()
display(population_df.limit(5))
Azure Synapse
# This is a package in preview.
from azureml.opendatasets import UsPopulationCounty
population = UsPopulationCounty()
population_df = population.to_spark_dataframe()
# Display top 5 rows
display(population_df.limit(5))
Next steps
View the rest of the datasets in the Open Datasets catalog.