Primary tabs

Other Access

The information on this page (the dataset metadata) is also available in these formats.

JSON RDF

via the DKAN API

Vaccinating the US

The data provided below constitute all source material used for the Vaccinating the United States for Coronavirus dashboard on Tableau Public. The dashboard relies on three distinct data sources: CDC Vaccination data (updated daily), demographic data from the American Community Survey (2019), and employment estimates from the Bureau of Labor Statistics (2019). All the data are imported into a public Google Sheet (CDC Daily Vaccination Data - Google Sheets) for easy import into Tableau. The scripts we used to download the data can be found in this Github repo and below is a description of each of the tabs in the Google Sheet:

  • state, us, data_dictionary: The vaccination data web scraped daily from this CDC vaccination dashboard: https://covid.cdc.gov/covid-data-tracker/#vaccinations. Specifically, we use the following CDC API (https://covid.cdc.gov/covid-data-tracker/COVIDData/getAjaxData?id=vaccin...) to download the data daily at 1 PM EDT and 5 PM EDT. You can find some definitions of the data fields being displayed on the CDC dashboard here and information on how the CDC compiles the data here. But as far as we can tell, this API is mostly undocumented and contains many more fields than is visible on the CDC dashboard. Below are the fields we make use of from the API and what we believe they represent:

    • Date: The date the data was updated
    • Location: The 2- or 3-digit abbreviation of the state or agency
    • LongName: The long form human readable name of the state or agency
    • Administered_Dose1_Recip: The number of people who have received at least one dose of the two dose vaccines (i.e., either received 1 dose of the Pfizer or the Moderna vaccines)
    • Administered_Dose1_Recip_18Plus: The number of people aged 18 or above who have received at least one dose of the two dose vaccines (i.e., either received 1 dose of the two dose vaccines (i.e., either received 1 dose of the Pfizer or the Moderna vaccines).
    • Series_Complete_Yes: The number of people who have are fully vaccinated (i.e., either received 2 doses of the Pfizer/Moderna vaccines or 1 does of the Johnson and Johnson vaccine)
    • Series_Complete_18Plus: The number of people aged 18 or above who have are fully vaccinated (i.e., either received 2 doses of the Pfizer/Moderna vaccines or 1 does of the Johnson and Johnson vaccine)
    • Dist_Per_100K: The total number of doses distributed per 100,000 people
    • Doses_Distributed: The total number of doses distributed
  • state_timeseries, us_timeseries, timeseries_data_dictionary: In addition to the CDC API, we also pull a timeseries version of the CDC data from the Our World in Data repository. This data is compiled by the Our World in Data team daily from the same CDC dashboard, but they also maintain a timeseries record going back to January 12, 2021. We download their data daily from this URL: https://raw.githubusercontent.com/owid/covid-19-data/master/public/data/.... The code that performs the web scraping daily can be found here: https://github.com/ajjitn/covid-vaccines-scraping .

  • state_acs_data, state_data_only: Demographic data for each state and the US were pulled from the American Community Survey’s 2019 five-year estimates published by the US Census Bureau. The research team used the tidycensus R package to pull race, ethnicity, age, median income, and total population breakdowns. We also compiled employment data from the Bureau of Labor Statistics to capture industry-specific estimates as of May 2019 for two groups: educational services (sector 61 link to BLS XLS) and health care and social assistance (sector 62: link to BLS XLS). The research team filtered the data to get top-line sector-wide estimates at the state-level, and calculated employment percentages based on the total population estimates from the ACS data . The R script to pull the ACS and BLS data is available in our github repo: https://github.com/ajjitn/covid-vaccines-scraping .

  • Tile_Map, state_polygons: Helper datasets used by Tableau to generate the map of all states in the US. These are just a listing of all the states in the US and the polygon vertices that make up each state.

These fields are compatible with DCAT, an RDF vocabulary designed to facilitate interoperability between data catalogs published on the Web.
FieldValue
Modified
2021-04-22
Release Date
2021-04-15
Identifier
0328eefe-f4cf-46ab-92aa-ac6dd9d0e3bf
License
Contact Name
Urban Institute
Contact Email
Public Access Level
Public
These fields are specific to Urban Institute.
FieldValue
Geographic Level
Original Data Source
Citation Requirements

Urban Institute. 2021. Vaccinating the US. Accessible from https://datacatalog.urban.org/dataset/vaccinating-us. Data originally sourced from Centers for Disease Control, 5 year American Community Survey (2015-2019) and Bureau of Labor Statistics, developed at the Urban Institute, and made available under the ODC-BY 1.0 Attribution License.