Making Sense of the US Census
Data Bytes Fall 2024
Survey: Please be sure to take our survey bit.ly/survey-data-bytes and let us know your feedback about our September 16th “Making Sense of the US Census” session.
Slides: The slides are available here (or above using the
Presentation
link).Recording: As soon as the recording is available for “Making Sense of the US Census” it will be made available on our Data Bytes Fall 2024 page.
Upcoming Data Bytes: See upcoming Data Bytes sessions at Data Bytes Fall 2024.
Census Resources
Learning
To learn more about the US Census Bureau, available censuses and surveys, and how to work with Census data, please visit Census Academy: Your Learning Hub for Data Skills.
Available Surveys
Census Survey Explorer: Use the Census Survey Explorer to discover U.S. Census Bureau surveys and censuses by filtering and searching by geography, frequency, topics, and subtopics.
List of Surveys: An alphabetical list of all Census Bureau surveys.
Accessing Census Data
QuickFacts: QuickFacts provides statistics for all states, counties, and cities and towns with a population of 5,000 or more.
data.census.gov: Your primary source for accessing census data.
Searching data.census.gov: There are two ways to search data.census.gov. Search using the natural language search, formulating your search in the format:
topic
ingeography
inyear
For example, if I wanted to find data on commuting in Baltimore in 2017, I would use the query:
commute
inBaltimore City
in2017
.If you are not successful, you can use the advanced search to manually specify your search filters.
Population Estimates Program (PEP): Produces estimates of the population for the United States, states, metropolitan and micropolitan statistical areas, counties, cities, towns, as well as for Puerto Rico and its municipios. The PEP is listed separately to data.census.gov
Accessing Census Microdata
Public: Access microdata from the American Community Survey using the Public Use Microdata Sample (PUMS) files.
Restricted Use Microdata: The Census Bureau maintains and enhances confidential microdata for use in social science research. These data are from censuses and surveys of businesses and households, linked employer-employee data, and administrative records from federal and state agencies and other sources.
These data are made available to qualified researchers at secure Federal Statistical Research Data Centers or, where authorized, through the secure Remote Access Program. See the list of remote access qualifying data.
To access restricted data, researchers will need to apply through the Standard Application Portal (SAP)
Historical: Decennial census records older than 72 years are released through the National Archives: archives.gov/research/census.
US and International Integrated Census Data
IPUMS: IPUMS provides census and survey data from around the world integrated across time and space. IPUMS integration and documentation makes it easy to study change, conduct comparative research, merge information across data types, and analyze individuals within family and community contexts. Data and services available free of charge.
IPUMS is a part of the Institute for Social Research and Data Innovation at the University of Minnesota.
- IPUMS USA: U.S. Census and American Community Survey microdata from 1850 to the present.
- IPUMS CPS: Current Population Survey microdata including basic monthly surveys and supplements from 1962 to the present.
- IPUMS International: World’s largest collection of census microdata covering over 100 countries, contemporary and historical.
- IPUMS Global Health: Health survey data from around the world.
- IPUMS NHGIS: Tabular U.S. Census data and GIS boundary files from 1790 to the present.
- IPUMS IHGIS: Tabular and GIS data from population, housing, and agricultural censuses around the world.
- IPUMS Time Use: Historical and contemporary time use data from 1930 to the present.
- IPUMS Health Surveys: Historical and contemporary U.S. health survey data from NHIS (1963-present) and MEPS.
- IPUMS Higher Ed: Survey data on the science and engineering workforce in the U.S. from 1993 to the present.
Application Programming Interface (APIs)
The Census Data Application Programming Interface (API) is an API that gives the public access to raw statistical data from various Census Bureau data programs. Data is aggregated the data and associated with Census geographic boundary/areas defined by a FIPS code. Data is associated with a specific reference year, otherwise known as a “vintage”.
Many programming languages and statistical software provide simplified interfaces for working with the US Census data:
Stata
getcensus
is a Stata package for loading American Community Survey (ACS) data from the U.S. Census Bureau API into Stata: https://centeronbudget.github.io/getcensus/
Python
census
is a Python package that provides access to the American Community Survey (ACS) and SF1: https://pypi.org/project/census/
ipumspy
: ipumspy is a collection of python tools for working with data from IPUMS and for accessing that data via API.
Currently only IPUMS microdata collections are supported:
R
tidycensus
is an R package that allows users to interface with a select number of the US Census Bureau’s data APIs including the American Community Survey (ACS), Decennial census, Population Estimates Program (PEP), Public Use Microdata Series (PUMS), and return tidyverse-ready data frames, optionally with simple feature geometry included: https://walker-data.com/tidycensus/index.html
ipumsr
: ipumsr provides an R interface for handling IPUMS data, allowing users to:
Easily read files downloaded from the IPUMS extract system
Request data, download files, and get metadata from certain IPUMS collections
Interpret and process data using the contextual information that is included with many IPUMS files