The data provided below constitute all source material used for the Vaccinating the United States for Coronavirus dashboard on Tableau Public. The dashboard relies on three distinct data sources: CDC Vaccination data (updated daily), demographic data from the American Community Survey (2019), and employment estimates from the Bureau of Labor Statistics (2019). All the data are imported into a public Google Sheet (CDC Daily Vaccination Data - Google Sheets) for easy import into Tableau. The scripts we used to download the data can be found in this Github repo and below is a description of each of the tabs in the Google Sheet:
-
state, us, data_dictionary: The vaccination data web scraped daily from this CDC vaccination dashboard: https://covid.cdc.gov/covid-data-tracker/#vaccinations. Specifically, we use the following CDC API (https://covid.cdc.gov/covid-data-tracker/COVIDData/getAjaxData?id=vaccination_data) to download the data daily at 1 PM EDT and 5 PM EDT. You can find some definitions of the data fields being displayed on the CDC dashboard here and information on how the CDC compiles the data here. But as far as we can tell, this API is mostly undocumented and contains many more fields than is visible on the CDC dashboard. Below are the fields we make use of from the API and what we believe they represent:
- Date: The date the data was updated
- Location: The 2- or 3-digit abbreviation of the state or agency
- LongName: The long form human readable name of the state or agency
- Administered_Dose1_Recip: The number of people who have received at least one dose of the two dose vaccines (i.e., either received 1 dose of the Pfizer or the Moderna vaccines)
- Administered_Dose1_Recip_18Plus: The number of people aged 18 or above who have received at least one dose of the two dose vaccines (i.e., either received 1 dose of the two dose vaccines (i.e., either received 1 dose of the Pfizer or the Moderna vaccines).
- Series_Complete_Yes: The number of people who have are fully vaccinated (i.e., either received 2 doses of the Pfizer/Moderna vaccines or 1 does of the Johnson and Johnson vaccine)
- Series_Complete_18Plus: The number of people aged 18 or above who have are fully vaccinated (i.e., either received 2 doses of the Pfizer/Moderna vaccines or 1 does of the Johnson and Johnson vaccine)
- Dist_Per_100K: The total number of doses distributed per 100,000 people
- Doses_Distributed: The total number of doses distributed
-
state_timeseries, us_timeseries, timeseries_data_dictionary: In addition to the CDC API, we also pull a timeseries version of the CDC data from the Our World in Data repository. This data is compiled by the Our World in Data team daily from the same CDC dashboard, but they also maintain a timeseries record going back to January 12, 2021. We download their data daily from this URL: https://raw.githubusercontent.com/owid/covid-19-data/master/public/data/vaccinations/us_state_vaccinations.csv. The code that performs the web scraping daily can be found here: https://github.com/ajjitn/covid-vaccines-scraping .
-
state_acs_data, state_data_only: Demographic data for each state and the US were pulled from the American Community Survey’s 2019 five-year estimates published by the US Census Bureau. The research team used the tidycensus R package to pull race, ethnicity, age, median income, and total population breakdowns. We also compiled employment data from the Bureau of Labor Statistics to capture industry-specific estimates as of May 2019 for two groups: educational services (sector 61 link to BLS XLS) and health care and social assistance (sector 62: link to BLS XLS). The research team filtered the data to get top-line sector-wide estimates at the state-level, and calculated employment percentages based on the total population estimates from the ACS data . The R script to pull the ACS and BLS data is available in our github repo: https://github.com/ajjitn/covid-vaccines-scraping .
-
Tile_Map, state_polygons: Helper datasets used by Tableau to generate the map of all states in the US. These are just a listing of all the states in the US and the polygon vertices that make up each state.
Dataset Info
- Modified 2024-07-03
- Release Date 2021-04-15
- Temporal Coverage
- License odc-by
- Granularity
- Contact Name Urban Institute
- Contact Email [email protected]
- Public Access Level public
Urban Extended Info
- Modified 2024-07-03
- Release Date 2021-04-15
- Geographic Level
- Data Value
- Data Quality
-
Urban Publications
- Citation Requirements Urban Institute. 2021. Vaccinating the US. Accessible from https://datacatalog.urban.org/dataset/vaccinating-us. Data originally sourced from Centers for Disease Control, 5 year American Community Survey (2015-2019) and B…, developed at the Urban Institute, and made available under the ODC-BY 1.0 Attribution License.