PMC Articles

Race, employment, and the pandemic: An exploration of covariate explanations of COVID-19 case fatality rate variance

PMCID: PMC9894486

PMID: 36730260


Abstract

We derive a simple asymptotic approximation for the long-run case fatality rate of COVID-19 (alpha and delta variants) and show that these estimations are highly correlated to the interaction between US State median age and projected US unemployment rate (Adj. r 2 = 60%). We contrast this to the high level of correlation between point (instantaneous) estimates of per state case fatality rates and the interaction of median age, population density and current unemployment rates (Adj. r 2 = 50.2%). To determine whether this is caused by a “race effect,” we then analyze unemployment, race, median age and population density across US states and show that adding the interaction of African American population and unemployment explains 53.5% of the variance in COVID case fatality rates for the alpha and delta variants when considering instantaneous case fatality rate. Interestingly, when the asymptotic case fatality rate is used, the dependence on the African American population disappears, which is consistent with the fact that in the long-run COVID does not discriminate on race, but may discriminate on access to medical care which is highly correlated to employment in the US. The results provide further evidence of the impact inequality can have on case fatality rates in COVID-19 and the impact complex social, health and economic factors can have on patient survival.


Full Text

The dynamics of COVID-19 have been extensively studied since the beginning of the pandemic (see e.g., [1–6]). As with many natural systems, the dynamics of COVID-19 are complex with non-linear second order and higher effects playing a role in the dynamics of transmission and fatality [7–10]. In particular, factors such as age, race, population density, socioeconomic status, and income inequality are all associated with [11–15] COVID-19 mortality. Unemployment alone was negatively associated with morbidity in the first wave of the 1918–19 Spanish Flu pandemic, [16] but with a few exceptions [17–19] has not been thoroughly explored in this manner for COVID-19.
In this paper, we show evidence that unemployment is correlated with case fatality rate (CFR) when taking median age into consideration, which partially contradicts results in [18]. We explain this contradiction by the fact that we are using bulk US state-level data, while [18] uses zip-code level data which is known to have substantially more noise in measurement. We also address the question of whether these correlations are caused by the known higher COVID-19 mortality rates of racial minorities in the US and show that by incorporating this data we can explain 53.5% of the variance in case fatality rates. We also construct an asymptotic estimate for the (true) case fatality rate and show that this is correlated to an interaction between age and projected unemployment. This is important since both CFR and unemployment rate are transient measures. Interestingly, when the asymptotic estimates are used, the dependence on race disappears (as it should) suggesting that COVID does not discriminate on race, but may discriminate on access to medical care which is highly correlated to employment in the US.
A rich scholarly tradition—one that spans the writings of Anna Julia Cooper [20, 21] and W. E. B. Du Bois (particularly [22, 23]) to the more recent research of C. L. R. James (e.g., [24, 25]) William Julius Wilson [26] and Sandy Darity (see [27, 28])—teaches us that an appreciation for the interconnections between race and class is foundational to the study of social inequality. It is therefore impossible for theoretically-grounded and empirically-rigorous research on “racial disparities” and health outcomes to ignore the role of class divisions [29–32]. We therefore join the growing body of “disparities” scholarship by taking race/class associations seriously, particularly as they pertain to the coronavirus pandemic. In many ways, COVID-19 has exacerbated pre-existing social inequalities, deepening the divisions between rich and poor, White and non-White, etc. [33, 34]. Focusing on unemployment as a key economic indicator [35, 36], we investigate the role that race and class play in explaining variations in COVID-19 fatality rates.
We approach the problem of modeling case fatality ratio (CFR) using an asymptotic analysis of an underlying Susceptible-Infected-Recovered-Deceased (SIRD) model, which is not explicitly calculated. Analysis is then carried out over data from the fifty US States. Standard epidemiologic compartment models show that over the course of a pandemic, the case fatality ratio decreases over time to a limiting value. Consider a compartmented epidemic model (e.g., an SIRD model). Let I be the instantaneous infected population and D be the deceased population. Further let C be the instantaneous cumulative case load. In an S1 File model, C ≡ I. SIRD models are inherently non-linear, usually having no closed form solution. Using a Taylor approximation, we can linearize the right-hand-side of the differential equations [37]. This allows us to approximate a closed form solution. Assuming a constant population size then linearizing about the current state of the compartmented epidemic model, the linearized solutions have form:

Here, κ and δ adjust the case fatality ratio to account for the initial condition (current value) and α and β are rate constants. The terms C∞ and D∞ are the limiting values of case load and the deceased population respectively. The simplicity of the solutions lies in the underlying linearization of the nonlinear SIRD dynamics and allows the derivation of a statistical model to be fit. Dividing the terms, the model for case fatality ratio is then:

where ϵ ∼ N(0, σ) is a noise variable accounting for experimental error. Here, A = −κ/D∞, B = −δ/D∞ and r∞ = 1/K is the long-run case fatality rate. We note: starting from an SIR model would create a more complex form for these expressions. Since it’s unclear whether COVID-19 caseload is following any simple SIR dynamic, we use the more parsimonious model. In the case of COVID-19, the measured caseload is believed to be a fraction of the true caseload. That is:

As t → ∞, we may assume τ is relatively stable (constant) and thus the approximated case fatality rate is too large by a factor of τ.
The choice of t* in Eq (1) is open. By varying t* over a number of possibilities and varying the size of the data tail we choose, we can construct a family (ensemble) of models following Eq (1) and use (i) the distribution on the computed r∞ to create upper and lower bounds on the estimator and (ii) use the various models to construct upper and lower bound predictors on the future behavior of CFR. For this study, we varied t* from 150 (days from January 22, 2020) to 330 (days from January 22, 2020) in increments of 10 days. The tail of the data used to fit Eq (1) started at day t* and proceeded to day 430, the last day of the data sample used.
Predicting the final CFR from early data is challenging, as there are confounding variables that may cause the model to over- or underestimate the true CFR [38]. For example, early models of COVID-19 did not account for the very high number of undiagnosed persons with COVID-19, leading to significant over-estimation of CFR. Further, different states may be at different points in the epidemic, leading to variances between states in CFR estimates. However, regardless of the ultimate CFR, the limiting behavior of CFR remains useful in analyzing factors that contribute to COVID-19 fatalities, because while different states may be at different places in their epidemic, the limiting behaviors should be invariant if we make two assumptions.
We use Adjusted-r2 (rather than p-values) as one of our primary measures of goodness of fit. The goal of r2 in this context is to assess whether specific predictors or explanatory variables have a significant effect on the dependent variable. Many of our Adjusted-r2 values fall between 0.5 and 0.6. Ozili [39] discusses acceptable r2 values concluding that in social science, with its erratic human-behavior data, r2 values between 0.1 and 0.5 are considered adequate provided some/most of the explanatory variables are statistically significant, r2 between 0.5 and 0.99 are considered quite good. This is reflected in clinical data, where studies with 0.2 < r2 < 0.4 have been accepted [40]. Even traditional regression analysis would consider an 0.5 < r2 < 0.7 to have a moderate effect size [41] and Henseler et al. [42] proposed general guidance that r2 of 0.5 was moderate (compared to r2 of 0.75, which was considered substantial and r2 of 0.25, which was considered weak).
In comparing CFRs between states, it is reasonable to ignore the effects of the undiagnosed over short time frames, because the population percentage of undiagnosed can be considered constant over recent history. Analysis of variations in states’ CFR supports this. To compute instantaneous case fatality rate, we use data from Johns Hopkins COVID Dashboard (https://github.com/CSSEGISandData/COVID-19) from April 8, 2021. We then compare forecasts to data from October, 2020 and December 2020. We use these dates because after this time, vaccination was widely underway in the US and the less severe [43] Omicron variant would be responsible for further waves, thus distorting the underlying data on the alpha and delta variants.

Fig 1 shows case fatality rates for 6 randomly selected states representing different regions of the country along with the fits of Eq (1).
We also assume that τ is related to the proportion of the population tested, denoted p, within each state. Here:

To assess the possibility that τ is not consistent across states, we regress rnow against p. This is shown in Fig 2.
It is also possible to use the per-state asymptotic CFR to estimate the asymptotic US CFR assuming that as t → ∞ the entire population will be exposed to COVID19 and considering a no-vaccine scenario. We use the weighted average:

where P is the total US population, P is the population of state s and is the asymptotic CFR of state s. Using this we obtain a 100% orders-statistic confidence interval on the CFR of (0.008, 0.019) with a pointwise estimator of 0.016. This is consistent with the single fit of cumulative US data, which estimates a nation-wide CFR of 0.0184, which is on the high-end of the confidence region. This is illustrated in Fig 3.
Verity et al., estimate the true 95% confidence interval on CFR as (0.0039, 0.0133) [38]. Comparing our CI with theirs, we see there is statistical agreement in the two approaches. Using Verity et al.’s model, we can estimate the under-sampling rate τ for the whole U.S. is contained in the interval (1.40, 4.76), which is consistent with past estimates.
Using the correction to r∞ we can compare the asymptotic case fatality rate to the instantaneous case fatality rate using the data from October 20, 2020. This is shown in Fig 4.
In this data, four states: Alabama, Kansas, Oklahoma, and Vermont, have asymptotic case fatality rates that are more than 1.5 times lower than their current estimated case fatality rate. However, if we apply the correction factors derived above, we see that all of those states may already be explained by unidentified cases. This is shown in Fig 5 where we show the region in which these states would lie if we correct using the estimate of τ derived for the U.S.
If we use the instantaneous (computed) case fatality rate over all states, 50.2% of the variance in this variable can be explained by the interaction of three simple variables: current unemployment rate [44] (u), population density [45] (d) and median age [45] (a) with the model:

We use the log of the population because it generally improves adjusted r2 while simultaneously decreasing AIC. The unemployment rate is provided by the US Dept. of Labor and reflects the unemployment recorded on March, 2021. (Descriptive statistics for all data used in this paper can be found in the S1 File). The parameter table for the model with significance is shown in below.

On its own, unemployment is correlated with instantaneous case fatality rate (p < 0.001) and explains 27% of the variance. The model in Eq (2) along with the correlation of unemployment and instantaneous case fatality rate is shown in Fig 6. We tested a model in which we predict unemployment using the inverse of median age, population density and computed case fatality rate. This model explains only 26.9% (adjusted r2) of the variance and the population density is not a significant estimator of unemployment as would be expected if Eq (2) could be used to solve for u. This suggests that we are not just seeing a simple correlation.
The correlation between unemployment and instantaneous (computed) case fatality rate is consistent over time. If we use monthly unemployment data and the instantaneous CFR computed on (or about) the 20th of each month, then we can see that the correlation between instantaneous CFR and monthly unemployment were consistent since May 2020 (see Table 4 in the S1 File). The Bonferroni corrected p-value for all fits is p = 0.0498, showing that all these correlations are significant simultaneously at the 0.05 level. These results suggests a strong correlation between CFR and the existence of a correlation between structural unemployment in a state and its asymptotic CFR.
When the case fatality rates are corrected using asymptotic approximation and we replace r with r∞ in Eq (2), then 52.2% of the variance in r∞ is explained. However, current unemployment is a transient, while r∞ is used to measure long-run behavior. Supported by the correlation between instantaneous CFR and monthly unemployment illustrated in Table 4 in the S1 File, we construct an asymptotic projection for unemployment using a power-law:

This projection is constrained so that c is not allowed to fall below unemployment from August 2019 [44], which was a point of historically low unemployment. As time goes to infinity, we can compute u∞. The relationship between unemployment in August 2019 and the projected unemployment is shown in Fig 7. Most states were projected to return to their pre-COVID unemployment levels. However, a small set of states may have longer term higher levels of unemployment as a result of COVID.
When we replace u in Eq (2) with u∞, and adjust Eq (2) to be:

we obtain the fit table:

On its own, the projected unemployment rate explains 27% of the variance in the asymptotic case fatality rate (p < 0.001) and again illustrates the correlation between case fatality and structural inequality and poverty. When interacting with median age and population density, 60.0% of the variance is explained. These fits are illustrated in Fig 8.
There is a known relationship between CFR and minority status, with minority populations bearing the greater brunt of COVID-19. It is therefore reasonable to expect that the relationship between unemployment and instantaneous CFR is a result of the higher rate of unemployment among minorities, which is illustrated in Fig 9 in the S1 File (p < 0.001).
We note that the correlation between minority population proportion and August 2019 unemployment is only weak (p = 0.056). This will be instructive when we evaluate the role minority status plays in asymptotic CFR. We now evaluate the relationship between minority population proportion and CFR using the simple linear model:

where m is the minority population proportion and as before r is the CFR. Because there is substantial geographic diversity among minorities in the US, we perform 3 analyses: (i) an analysis of the relationship between minority proportion and instantaneous CFR in all 50 states, (ii) in the lower 48 states and (iii) in states where the population is composed of at most 30% minorities (total). Results of these analyses are shown in Tables 1 to 3.
When we alter Eq (2) to include an interaction term between the proportion of African Americans in a state and unemployment we have:

Here A is the proportion of African Americans in the state. The resulting model parameters are all significant (see below).

This model explains 53.5% (Adjusted r2) of the variation in the data and suggests that unemployment, age, population density and minority status interact in complex ways to increase the instantaneous CFR.
When we consider asymptotic CFR and modifying Eq (4) as:

no terms involving the proportion of African Americans in a state are significant (in Eq (5), p = 0.485 for the coefficient of Au∞). Even replacing Au∞ with A is not significant. This is sensible since in the long-run, COVID is not a disease that discriminates on race. It seems to be a disease that discriminates on access to medical care required to treat acute pneumonia that may occur as a result of infection.
The correlation between COVID-19 case fatality rate and both race and unemployment is consistent with the prevailing theory that minority disparity in COVID-19 outcomes stems from systemic racial injustice [46] which drives disparity across all sociodemographic covariates. This theory is further corroborated by analysis which has shown that mortality for persons able to access hospital care does not differ between African American and White patients after adjusting for sociodemographic factors and comorbidities [47], that racial disparities in COVID-19 appear to be driven primarily by unequal infection risks (i.e., sociodemographics) rather than case fatality rates [48], and that racial segregation explains higher African American mortality to such a degree that no as-yet unmeasured confounder is within the range of plausible covariates [14, 49]. Our model is also supported by the current (December 7, 2021) CFR of 0.016 [50] which was our point-wise estimation of CFR from data through March 2021. The pointwise estimator as of March 2022 is 0.012 [50], reflecting improved vaccinations, treatments and the decreased aggressiveness of the Omicron variant [51]. However, this value is still well within our forecast confidence interval. In states with minority populations <30%, case fatality rates remain correlated with unemployment, but not with race. This suggests that unemployment alone is an independent risk factor for COVID-19, and may be considered in evaluating individual and group risk of COVID-19. Further, in populations with lower minority proportions, unemployment is a greater risk factor than race.
It is important that these findings not be misconstrued to suggest that racial inequity does not exist in states with low minority populations. Rather, it should be taken as evidence that, when it comes to social stratification, race and class are two sides of the same coin [43].
In this paper we analyzed the interactions of the complex social phenomena of age, race, unemployment, population density and COVID-19 case fatality rate using both pointwise time series data and an asymptotic projection of unemployment and case fatality rate. We showed correlations to non-linear interactions between age, unemployment and age and race to COVID-19 CFR. We also showed that in the limit as time goes to infinity, we expect COVID-19 CFR to uncorrelate to race, which is consistent with the fact that COVID-19 fatality does not discriminate by race but may discriminate by access to medical care, which is predicted by socio-economic status in the US. This work is contrasted with the work in [18], which did not find correlation of CFR to unemployment but analyzed zip-code level data, which may have had higher noise levels in the unemployment rates. Future work should determine whether the observations made here can be translated to zip-code level data and further determine the impact of socio-economic status and race on COVID-19 fatalities.
More generally, our research speaks to the need for careful race/class analyses, for this need becomes especially apparent if one considers the pandemic’s disproportionately adverse impact on both “health-” and “wealth-” related outcomes in African American communities [52–58]. As we note in the introduction, unequal outcomes across all aspects of the COVID-19 pandemic have highlighted systemic racial injustice in our society, and are appropriately driving calls for systemic change. The present paper offers data-driven support for such change, but also that unemployment is an independent, persistent risk factor for harm from the COVID-19 pandemic. Achieving equitable healthcare in the U.S. requires overcoming racial and economic disparities. Our work underscores the idea that the disparate impacts of COVID-19 must be both understood and addressed within an explicit framework of how race and class structure myriad arenas of American public life.


Sections

"[{\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.ref001\", \"pone.0274470.ref006\", \"pone.0274470.ref007\", \"pone.0274470.ref010\", \"pone.0274470.ref011\", \"pone.0274470.ref015\", \"pone.0274470.ref016\", \"pone.0274470.ref017\", \"pone.0274470.ref019\"], \"section\": \"Introduction\", \"text\": \"The dynamics of COVID-19 have been extensively studied since the beginning of the pandemic (see e.g., [1\\u20136]). As with many natural systems, the dynamics of COVID-19 are complex with non-linear second order and higher effects playing a role in the dynamics of transmission and fatality [7\\u201310]. In particular, factors such as age, race, population density, socioeconomic status, and income inequality are all associated with [11\\u201315] COVID-19 mortality. Unemployment alone was negatively associated with morbidity in the first wave of the 1918\\u201319 Spanish Flu pandemic, [16] but with a few exceptions [17\\u201319] has not been thoroughly explored in this manner for COVID-19.\"}, {\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.ref018\", \"pone.0274470.ref018\"], \"section\": \"Introduction\", \"text\": \"In this paper, we show evidence that unemployment is correlated with case fatality rate (CFR) when taking median age into consideration, which partially contradicts results in [18]. We explain this contradiction by the fact that we are using bulk US state-level data, while [18] uses zip-code level data which is known to have substantially more noise in measurement. We also address the question of whether these correlations are caused by the known higher COVID-19 mortality rates of racial minorities in the US and show that by incorporating this data we can explain 53.5% of the variance in case fatality rates. We also construct an asymptotic estimate for the (true) case fatality rate and show that this is correlated to an interaction between age and projected unemployment. This is important since both CFR and unemployment rate are transient measures. Interestingly, when the asymptotic estimates are used, the dependence on race disappears (as it should) suggesting that COVID does not discriminate on race, but may discriminate on access to medical care which is highly correlated to employment in the US.\"}, {\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.ref020\", \"pone.0274470.ref021\", \"pone.0274470.ref022\", \"pone.0274470.ref023\", \"pone.0274470.ref024\", \"pone.0274470.ref025\", \"pone.0274470.ref026\", \"pone.0274470.ref027\", \"pone.0274470.ref028\", \"pone.0274470.ref029\", \"pone.0274470.ref032\", \"pone.0274470.ref033\", \"pone.0274470.ref034\", \"pone.0274470.ref035\", \"pone.0274470.ref036\"], \"section\": \"Introduction\", \"text\": \"A rich scholarly tradition\\u2014one that spans the writings of Anna Julia Cooper [20, 21] and W. E. B. Du Bois (particularly [22, 23]) to the more recent research of C. L. R. James (e.g., [24, 25]) William Julius Wilson [26] and Sandy Darity (see [27, 28])\\u2014teaches us that an appreciation for the interconnections between race and class is foundational to the study of social inequality. It is therefore impossible for theoretically-grounded and empirically-rigorous research on \\u201cracial disparities\\u201d and health outcomes to ignore the role of class divisions [29\\u201332]. We therefore join the growing body of \\u201cdisparities\\u201d scholarship by taking race/class associations seriously, particularly as they pertain to the coronavirus pandemic. In many ways, COVID-19 has exacerbated pre-existing social inequalities, deepening the divisions between rich and poor, White and non-White, etc. [33, 34]. Focusing on unemployment as a key economic indicator [35, 36], we investigate the role that race and class play in explaining variations in COVID-19 fatality rates.\"}, {\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.s002\", \"pone.0274470.ref037\"], \"section\": \"Materials and methods\", \"text\": \"We approach the problem of modeling case fatality ratio (CFR) using an asymptotic analysis of an underlying Susceptible-Infected-Recovered-Deceased (SIRD) model, which is not explicitly calculated. Analysis is then carried out over data from the fifty US States. Standard epidemiologic compartment models show that over the course of a pandemic, the case fatality ratio decreases over time to a limiting value. Consider a compartmented epidemic model (e.g., an SIRD model). Let I be the instantaneous infected population and D be the deceased population. Further let C be the instantaneous cumulative case load. In an S1 File model, C \\u2261 I. SIRD models are inherently non-linear, usually having no closed form solution. Using a Taylor approximation, we can linearize the right-hand-side of the differential equations [37]. This allows us to approximate a closed form solution. Assuming a constant population size then linearizing about the current state of the compartmented epidemic model, the linearized solutions have form:\\n\\nHere, \\u03ba and \\u03b4 adjust the case fatality ratio to account for the initial condition (current value) and \\u03b1 and \\u03b2 are rate constants. The terms C\\u221e and D\\u221e are the limiting values of case load and the deceased population respectively. The simplicity of the solutions lies in the underlying linearization of the nonlinear SIRD dynamics and allows the derivation of a statistical model to be fit. Dividing the terms, the model for case fatality ratio is then:\\n\\nwhere \\u03f5 \\u223c N(0, \\u03c3) is a noise variable accounting for experimental error. Here, A = \\u2212\\u03ba/D\\u221e, B = \\u2212\\u03b4/D\\u221e and r\\u221e = 1/K is the long-run case fatality rate. We note: starting from an SIR model would create a more complex form for these expressions. Since it\\u2019s unclear whether COVID-19 caseload is following any simple SIR dynamic, we use the more parsimonious model. In the case of COVID-19, the measured caseload is believed to be a fraction of the true caseload. That is:\\n\\nAs t \\u2192 \\u221e, we may assume \\u03c4 is relatively stable (constant) and thus the approximated case fatality rate is too large by a factor of \\u03c4.\"}, {\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.e002\", \"pone.0274470.e002\", \"pone.0274470.e002\"], \"section\": \"Materials and methods\", \"text\": \"The choice of t* in Eq (1) is open. By varying t* over a number of possibilities and varying the size of the data tail we choose, we can construct a family (ensemble) of models following Eq (1) and use (i) the distribution on the computed r\\u221e to create upper and lower bounds on the estimator and (ii) use the various models to construct upper and lower bound predictors on the future behavior of CFR. For this study, we varied t* from 150 (days from January 22, 2020) to 330 (days from January 22, 2020) in increments of 10 days. The tail of the data used to fit Eq (1) started at day t* and proceeded to day 430, the last day of the data sample used.\"}, {\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.ref038\"], \"section\": \"Materials and methods\", \"text\": \"Predicting the final CFR from early data is challenging, as there are confounding variables that may cause the model to over- or underestimate the true CFR [38]. For example, early models of COVID-19 did not account for the very high number of undiagnosed persons with COVID-19, leading to significant over-estimation of CFR. Further, different states may be at different points in the epidemic, leading to variances between states in CFR estimates. However, regardless of the ultimate CFR, the limiting behavior of CFR remains useful in analyzing factors that contribute to COVID-19 fatalities, because while different states may be at different places in their epidemic, the limiting behaviors should be invariant if we make two assumptions.\"}, {\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.ref039\", \"pone.0274470.ref040\", \"pone.0274470.ref041\", \"pone.0274470.ref042\"], \"section\": \"Materials and methods\", \"text\": \"We use Adjusted-r2 (rather than p-values) as one of our primary measures of goodness of fit. The goal of r2 in this context is to assess whether specific predictors or explanatory variables have a significant effect on the dependent variable. Many of our Adjusted-r2 values fall between 0.5 and 0.6. Ozili [39] discusses acceptable r2 values concluding that in social science, with its erratic human-behavior data, r2 values between 0.1 and 0.5 are considered adequate provided some/most of the explanatory variables are statistically significant, r2 between 0.5 and 0.99 are considered quite good. This is reflected in clinical data, where studies with 0.2 < r2 < 0.4 have been accepted [40]. Even traditional regression analysis would consider an 0.5 < r2 < 0.7 to have a moderate effect size [41] and Henseler et al. [42] proposed general guidance that r2 of 0.5 was moderate (compared to r2 of 0.75, which was considered substantial and r2 of 0.25, which was considered weak).\"}, {\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.ref043\"], \"section\": \"Case fatality rate modeling\", \"text\": \"In comparing CFRs between states, it is reasonable to ignore the effects of the undiagnosed over short time frames, because the population percentage of undiagnosed can be considered constant over recent history. Analysis of variations in states\\u2019 CFR supports this. To compute instantaneous case fatality rate, we use data from Johns Hopkins COVID Dashboard (https://github.com/CSSEGISandData/COVID-19) from April 8, 2021. We then compare forecasts to data from October, 2020 and December 2020. We use these dates because after this time, vaccination was widely underway in the US and the less severe [43] Omicron variant would be responsible for further waves, thus distorting the underlying data on the alpha and delta variants.\"}, {\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.g001\", \"pone.0274470.e002\"], \"section\": \"Case fatality rate modeling\", \"text\": \"\\nFig 1 shows case fatality rates for 6 randomly selected states representing different regions of the country along with the fits of Eq (1).\"}, {\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.g002\"], \"section\": \"Case fatality rate modeling\", \"text\": \"We also assume that \\u03c4 is related to the proportion of the population tested, denoted p, within each state. Here:\\n\\nTo assess the possibility that \\u03c4 is not consistent across states, we regress rnow against p. This is shown in Fig 2.\"}, {\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.g003\"], \"section\": \"Case fatality rate modeling\", \"text\": \"It is also possible to use the per-state asymptotic CFR to estimate the asymptotic US CFR assuming that as t \\u2192 \\u221e the entire population will be exposed to COVID19 and considering a no-vaccine scenario. We use the weighted average:\\n\\nwhere P is the total US population, P is the population of state s and  is the asymptotic CFR of state s. Using this we obtain a 100% orders-statistic confidence interval on the CFR of (0.008, 0.019) with a pointwise estimator of 0.016. This is consistent with the single fit of cumulative US data, which estimates a nation-wide CFR of 0.0184, which is on the high-end of the confidence region. This is illustrated in Fig 3.\"}, {\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.ref038\"], \"section\": \"Case fatality rate modeling\", \"text\": \"Verity et al., estimate the true 95% confidence interval on CFR as (0.0039, 0.0133) [38]. Comparing our CI with theirs, we see there is statistical agreement in the two approaches. Using Verity et al.\\u2019s model, we can estimate the under-sampling rate \\u03c4 for the whole U.S. is contained in the interval (1.40, 4.76), which is consistent with past estimates.\"}, {\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.g004\"], \"section\": \"Case fatality rate modeling\", \"text\": \"Using the correction to r\\u221e we can compare the asymptotic case fatality rate to the instantaneous case fatality rate using the data from October 20, 2020. This is shown in Fig 4.\"}, {\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.g005\"], \"section\": \"Case fatality rate modeling\", \"text\": \"In this data, four states: Alabama, Kansas, Oklahoma, and Vermont, have asymptotic case fatality rates that are more than 1.5 times lower than their current estimated case fatality rate. However, if we apply the correction factors derived above, we see that all of those states may already be explained by unidentified cases. This is shown in Fig 5 where we show the region in which these states would lie if we correct using the estimate of \\u03c4 derived for the U.S.\"}, {\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.ref044\", \"pone.0274470.ref045\", \"pone.0274470.ref045\", \"pone.0274470.s002\"], \"section\": \"CFR and unemployment\", \"text\": \"If we use the instantaneous (computed) case fatality rate over all states, 50.2% of the variance in this variable can be explained by the interaction of three simple variables: current unemployment rate [44] (u), population density [45] (d) and median age [45] (a) with the model:\\n\\nWe use the log of the population because it generally improves adjusted r2 while simultaneously decreasing AIC. The unemployment rate is provided by the US Dept. of Labor and reflects the unemployment recorded on March, 2021. (Descriptive statistics for all data used in this paper can be found in the S1 File). The parameter table for the model with significance is shown in below.\\n\"}, {\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.e008\", \"pone.0274470.g006\", \"pone.0274470.e008\"], \"section\": \"CFR and unemployment\", \"text\": \"On its own, unemployment is correlated with instantaneous case fatality rate (p < 0.001) and explains 27% of the variance. The model in Eq (2) along with the correlation of unemployment and instantaneous case fatality rate is shown in Fig 6. We tested a model in which we predict unemployment using the inverse of median age, population density and computed case fatality rate. This model explains only 26.9% (adjusted r2) of the variance and the population density is not a significant estimator of unemployment as would be expected if Eq (2) could be used to solve for u. This suggests that we are not just seeing a simple correlation.\"}, {\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.s002\"], \"section\": \"Temporal and asymptotic analysis\", \"text\": \"The correlation between unemployment and instantaneous (computed) case fatality rate is consistent over time. If we use monthly unemployment data and the instantaneous CFR computed on (or about) the 20th of each month, then we can see that the correlation between instantaneous CFR and monthly unemployment were consistent since May 2020 (see Table 4 in the S1 File). The Bonferroni corrected p-value for all fits is p = 0.0498, showing that all these correlations are significant simultaneously at the 0.05 level. These results suggests a strong correlation between CFR and the existence of a correlation between structural unemployment in a state and its asymptotic CFR.\"}, {\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.e008\", \"pone.0274470.s002\", \"pone.0274470.ref044\", \"pone.0274470.g007\"], \"section\": \"Temporal and asymptotic analysis\", \"text\": \"When the case fatality rates are corrected using asymptotic approximation and we replace r with r\\u221e in Eq (2), then 52.2% of the variance in r\\u221e is explained. However, current unemployment is a transient, while r\\u221e is used to measure long-run behavior. Supported by the correlation between instantaneous CFR and monthly unemployment illustrated in Table 4 in the S1 File, we construct an asymptotic projection for unemployment using a power-law:\\n\\nThis projection is constrained so that c is not allowed to fall below unemployment from August 2019 [44], which was a point of historically low unemployment. As time goes to infinity, we can compute u\\u221e. The relationship between unemployment in August 2019 and the projected unemployment is shown in Fig 7. Most states were projected to return to their pre-COVID unemployment levels. However, a small set of states may have longer term higher levels of unemployment as a result of COVID.\"}, {\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.e008\", \"pone.0274470.e008\"], \"section\": \"Temporal and asymptotic analysis\", \"text\": \"When we replace u in Eq (2) with u\\u221e, and adjust Eq (2) to be:\\n\\nwe obtain the fit table:\\n\"}, {\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.g008\"], \"section\": \"Temporal and asymptotic analysis\", \"text\": \"On its own, the projected unemployment rate explains 27% of the variance in the asymptotic case fatality rate (p < 0.001) and again illustrates the correlation between case fatality and structural inequality and poverty. When interacting with median age and population density, 60.0% of the variance is explained. These fits are illustrated in Fig 8.\"}, {\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.s002\"], \"section\": \"CFR and race\", \"text\": \"There is a known relationship between CFR and minority status, with minority populations bearing the greater brunt of COVID-19. It is therefore reasonable to expect that the relationship between unemployment and instantaneous CFR is a result of the higher rate of unemployment among minorities, which is illustrated in Fig 9 in the S1 File (p < 0.001).\"}, {\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.t001\", \"pone.0274470.t003\"], \"section\": \"CFR and race\", \"text\": \"We note that the correlation between minority population proportion and August 2019 unemployment is only weak (p = 0.056). This will be instructive when we evaluate the role minority status plays in asymptotic CFR. We now evaluate the relationship between minority population proportion and CFR using the simple linear model:\\n\\nwhere m is the minority population proportion and as before r is the CFR. Because there is substantial geographic diversity among minorities in the US, we perform 3 analyses: (i) an analysis of the relationship between minority proportion and instantaneous CFR in all 50 states, (ii) in the lower 48 states and (iii) in states where the population is composed of at most 30% minorities (total). Results of these analyses are shown in Tables 1 to 3.\"}, {\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.e008\"], \"section\": \"Combined model\", \"text\": \"When we alter Eq (2) to include an interaction term between the proportion of African Americans in a state and unemployment we have:\\n\\nHere A is the proportion of African Americans in the state. The resulting model parameters are all significant (see below).\\n\\nThis model explains 53.5% (Adjusted r2) of the variation in the data and suggests that unemployment, age, population density and minority status interact in complex ways to increase the instantaneous CFR.\"}, {\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.e014\", \"pone.0274470.e016\"], \"section\": \"Combined model\", \"text\": \"When we consider asymptotic CFR and modifying Eq (4) as:\\n\\nno terms involving the proportion of African Americans in a state are significant (in Eq (5), p = 0.485 for the coefficient of Au\\u221e). Even replacing Au\\u221e with A is not significant. This is sensible since in the long-run, COVID is not a disease that discriminates on race. It seems to be a disease that discriminates on access to medical care required to treat acute pneumonia that may occur as a result of infection.\"}, {\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.ref046\", \"pone.0274470.ref047\", \"pone.0274470.ref048\", \"pone.0274470.ref014\", \"pone.0274470.ref049\", \"pone.0274470.ref050\", \"pone.0274470.ref050\", \"pone.0274470.ref051\"], \"section\": \"Discussion\", \"text\": \"The correlation between COVID-19 case fatality rate and both race and unemployment is consistent with the prevailing theory that minority disparity in COVID-19 outcomes stems from systemic racial injustice [46] which drives disparity across all sociodemographic covariates. This theory is further corroborated by analysis which has shown that mortality for persons able to access hospital care does not differ between African American and White patients after adjusting for sociodemographic factors and comorbidities [47], that racial disparities in COVID-19 appear to be driven primarily by unequal infection risks (i.e., sociodemographics) rather than case fatality rates [48], and that racial segregation explains higher African American mortality to such a degree that no as-yet unmeasured confounder is within the range of plausible covariates [14, 49]. Our model is also supported by the current (December 7, 2021) CFR of 0.016 [50] which was our point-wise estimation of CFR from data through March 2021. The pointwise estimator as of March 2022 is 0.012 [50], reflecting improved vaccinations, treatments and the decreased aggressiveness of the Omicron variant [51]. However, this value is still well within our forecast confidence interval. In states with minority populations <30%, case fatality rates remain correlated with unemployment, but not with race. This suggests that unemployment alone is an independent risk factor for COVID-19, and may be considered in evaluating individual and group risk of COVID-19. Further, in populations with lower minority proportions, unemployment is a greater risk factor than race.\"}, {\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.ref043\"], \"section\": \"Discussion\", \"text\": \"It is important that these findings not be misconstrued to suggest that racial inequity does not exist in states with low minority populations. Rather, it should be taken as evidence that, when it comes to social stratification, race and class are two sides of the same coin [43].\"}, {\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.ref018\"], \"section\": \"Conclusion\", \"text\": \"In this paper we analyzed the interactions of the complex social phenomena of age, race, unemployment, population density and COVID-19 case fatality rate using both pointwise time series data and an asymptotic projection of unemployment and case fatality rate. We showed correlations to non-linear interactions between age, unemployment and age and race to COVID-19 CFR. We also showed that in the limit as time goes to infinity, we expect COVID-19 CFR to uncorrelate to race, which is consistent with the fact that COVID-19 fatality does not discriminate by race but may discriminate by access to medical care, which is predicted by socio-economic status in the US. This work is contrasted with the work in [18], which did not find correlation of CFR to unemployment but analyzed zip-code level data, which may have had higher noise levels in the unemployment rates. Future work should determine whether the observations made here can be translated to zip-code level data and further determine the impact of socio-economic status and race on COVID-19 fatalities.\"}, {\"pmc\": \"PMC9894486\", \"pmid\": \"36730260\", \"reference_ids\": [\"pone.0274470.ref052\", \"pone.0274470.ref058\"], \"section\": \"Conclusion\", \"text\": \"More generally, our research speaks to the need for careful race/class analyses, for this need becomes especially apparent if one considers the pandemic\\u2019s disproportionately adverse impact on both \\u201chealth-\\u201d and \\u201cwealth-\\u201d related outcomes in African American communities [52\\u201358]. As we note in the introduction, unequal outcomes across all aspects of the COVID-19 pandemic have highlighted systemic racial injustice in our society, and are appropriately driving calls for systemic change. The present paper offers data-driven support for such change, but also that unemployment is an independent, persistent risk factor for harm from the COVID-19 pandemic. Achieving equitable healthcare in the U.S. requires overcoming racial and economic disparities. Our work underscores the idea that the disparate impacts of COVID-19 must be both understood and addressed within an explicit framework of how race and class structure myriad arenas of American public life.\"}]"

Metadata

"{\"Data Availability\": \"All relevant data are within the paper and its \", \"Outbreaks\": \"COVID-19\", \"Submission Version\": \"1\"}"