This is a collaboration with Rebecca Hasdell and David Rehkopf.
Data Sources: Census Current Population Survey (CPS)
Source files available: https://github.com/alice1020/COVID-unemployment-and-income-supports
Data come from the Current Population Survey, which have been download from: https://cps.ipums.org/cps/index.shtml (registration required).
The codebooks for the extraction are the files:
Core_2019_2020_Extr_Codebook.xml for the core set of questions for monthly samples from 01-2019 to 04-2020. Variables selected: YEAR, SERIAL, MONTH, HWTFINL, CPSID, ASECFLAG, PERNUM, WTFINL, CPSIDP, FAMSIZE, NCHILD, ELDCH, YNGCH, EMPSTAT, LABFORCE, OCC, CLASSWKR, UHRSWORKT, UHRSWORK1, UHRSWORK2, AHRSWORKT, AHRSWORK1, AHRSWORK2, MARBASECIDP, LNKFW1MWT, LNKFW1YWT, LNKFW8WT, LNKFWMIS45WT, LNKFWMIS14WT, LNKFWMIS58WT, PANLWT, EARNWT, HOURWAGE, PAIDHOUR, EARNWEEK, UHRSWORKORG, WKSWORKORG.
ASEC_2019_Extr_Codebook.xml for the 2019 Annual Social and Economic Supplement (ASEC) of 03-2019. Among all the variables we selected: YEAR, SERIAL, MONTH, CPSID, ASECFLAG, ASECWTH, MARBASECIDH, ASECOVERH, PERNUM, CPSIDP, ASECWT, OCCLY, CLASSWLY, WORKLY, FTOTVAL, INCTOT, INCWAGE, INCLONGJ, OINCBUS, OINCFARM, OINCWAGE, EITCRED.
The Current Population Survey has monthly data for a core set of questions, which include employment status, occupation, geographical information, etc. Extensive data about socio-economic variables, such as earnings, Earned Income Tax Credit (EITC), health insurance, etc. are collected once a year (March) by the Social and Economic Supplement (ASEC) module. CPS is a survey which has basically been used for the analysis of cross-sectional data. It has a sampling scheme defined as 4-8-4. One household stays in the sample for max. 16 months (4+8+4). The household is interviewed during each of the first 4 months after entering the survey, followed by 8 without interview. The household is then interviewed again for 4 consecutive month, one year (4+8) after its first wave in order to match the seasonality.
The following figure visualizes the longitudinal structure of the survey. Our data ample cover the time span from January 2019 to April 2020. Each line represent an individual longitudinal record. Wave March 2019 is my master wave since it is the most recent information about EICT and earnings we have. Gray represents when the individual is out of the CPS’s sample.
Statistical Exploration
We are interested in analyzing the effects of COVID-19 on the employment status with a focus on the Earned Income Tax Credit (EITC).
Individual Level Longitudinal Analysis
At an individual/household level the sample size for individuals and households observed both in March 2019 (ASEC wave) and April 2020 corresponds to 19,013 individuals and 10,456 households. Among those a sub-sample of 4,700 individuals grouped in 3,984 households are part of the labor force. Visually these are mirrored in the three waves at the top of of Fig. 1. At the end, ~7% of the households in the sub-sample have received EITC.
Level | # Obs. 03-19 & 04-20 | In the Labor Force | with EITC > 0 |
---|---|---|---|
Individual | 31,806 | 14,945 | 1,175 |
Household | 13,666 | 9,391 | 1,137 |
Approximately 20% of people who received the EITC in 2019 are experiencing unemployment between February and April 2020.
Codes for this part are available in the file: Disaggregated_Analysis_April.R
Aggregate Occupational Level Analysis
We analyze the CPS data by aggregating over the occupational categories. Our cross-sectional unit becomes the occupational category. The data structure becomes the following
Occupation | Time | % Unempl | Ave. Earn | Ave. EITC |
---|---|---|---|---|
1 | 2020-02 | |||
1 | 2020-03 | |||
1 | 2020-04 | |||
2 | 2020-02 | |||
2 | 2020-03 | |||
2 | 2020-04 | |||
. | 2020-02 | |||
. | 2020-03 | |||
. | 2020-04 | |||
N | 2020-02 | |||
N | 2020-03 | |||
N | 2020-04 |
Employment vs Unemployment
We explore the impact of COVID-19 on employment.Hours Worked
Weekly Earnings
EITC by Occupational Category
Codes for this part are available in the file: Aggregated_Analysis_April.R