Abstract
During the first year of the COVID-19 pandemic, the number of incarcerated people in the United States decreased by at least 16%—the largest, fastest reduction in prison population in American history. Using an original dataset curated from public sources on prison demographics across all 50 states and the District of Columbia, we show that incarcerated white people benefited disproportionately from this decrease in the U.S. prison population, and the fraction of incarcerated Black and Latino people sharply increased. This pattern deviates from a decade-long trend before 2020 and the onset of COVID-19, during which the proportion of incarcerated Black people was declining. Using case studies of select states, we explore and quantify mechanisms that could explain these disparities: temporary court closures that led to fewer prison admissions, changes in the frequency of police interactions, and state-level prison release policies that sought to de-densify congregate settings. These findings illuminate how systemic racism pervades juridical and penal institutions and is the engine of mass incarceration in America.
1 Introduction
Mass incarceration in the United States is distinguished by striking racial disparities and a rate of imprisonment that surpasses all other nations, with 2.12 million people behind bars in 2019 [1–5]. As a result of a combination of structural inequities and discriminatory enforcement, Black and Latino people are more likely to be stopped by police [6], held in jail pretrial [7], charged with more serious crimes [8], and sentenced more harshly than white people [9, 10]. These practices have made Black American men six times as likely to be incarcerated as white men and Latino men 2.5 times as likely [11, 12].
In this study, we demonstrate how the COVID-19 pandemic—which produced the largest, most rapid single-year decrease in prison population in American history—amplified existing disparities in the nation’s prison system. We observe a convergent pattern across the country: a substantial decrease in the overall number of people incarcerated (approximately 200,000), but a significant increase in the proportion of incarcerated Black and other non-white people. This trend represents a strong departure from patterns preceding the pandemic, where the proportion of Black incarcerated individuals had been declining for nearly a decade. Although data reporting methods on racial demographics in prisons have made it difficult for researchers to identify the disparity that widened in 2020, we curate a large dataset across all 50 states and the District of Columbia that offers an unprecedented view into the dynamics of prison populations before, during, and after the pandemic.
To explore potential mechanisms that led to the dramatic reduction in U.S. prison populations and that exacerbated racial disparities, we utilize case studies of states, and analyze data across three developments: (i) interruptions to judicial proceedings that followed the lockdown policies that many institutions and non-essential businesses implemented in March 2020, (ii) changes in the frequency of police interactions during the pandemic, and (iii) decarceration policies that several states enacted to release individuals under a specific set of criteria. All three contain signatures of racial bias and are indicative of uniform and deeply entrenched racial disparities in the criminal justice system. We discuss our result in light of the ethics of public health interventions, national debates about the future direction of policing and incarceration, and the importance of data infrastructure in responsible public policy.
2 Results
2.1 Overall decrease in prison populations across every state
The state prison population in the United States decreased by at least 16.3% between March 2020 and July 2021, from approximately 1.23 million to 1.03 million people incarcerated (Figure 1A). Regardless of its pre-2020 population trends, this decrease in prison population occurred in every state, and, in most, started in early to mid-April 2020 (see Figure A.1 for a state-by-state look at prison populations over time). For instance, some states entered 2020 with a steadily-declining prison population (e.g. Massachusetts, South Carolina, California, among others); others had maintained relatively stable prison populations before 2020 (e.g. Virginia, Georgia, Iowa, etc.); even in states with a growing prison population before the pandemic (e.g. Alabama, Indiana, Montana, etc.), we see large reductions in the prison population during the COVID-19 pandemic.
As of October 2021, several states’ prison populations continue to decrease (Arizona, Massachusetts, Tennessee, Louisiana, Pennsylvania, among others) and have done so monotonically throughout the pandemic. Other states’ prison populations dropped sharply in the early months of the pandemic but have seen their prison populations begin to approach pre-pandemic levels through 2021 (North Dakota, South Dakota, Montana, West Virginia, among others). In Figure A.1, we plot time series for each state’s prison population over the last several years. Additionally, in Table A.1, we give an overview for each state’s approach for reporting prison population statistics, along with how we collected each state’s data.
2.2 Changing racial demographics of the prison population
Amid large-scale reductions in the prison population, the relative number of Black and other non-white incarcerated persons increased (Figure 1B; see Figure A.4 for comparison between effects among non-white vs. Black populations). Perhaps most alarming is the extent to which this sharp increase deviates from a decade-long trend before 2020: incarcerated Black people have accounted for a declining share of the total prison population over the last several years, from approximately 40% of the prison population in 2012 to 37.8% at the start of 2020 (decreasing by 0.275% each year, approximately linearly). This relative increase in the amount of incarcerated Black people is seen in almost every state, which often have very different approaches to crime and incarceration. In Figure 2, we show the percentage of incarcerated individuals who are Black across nine states. Despite the variability in pre-2020 trends, we see the same overall pattern: During the COVID-19 pandemic, the relative number of Black people in prison has increased. In several states and in the national average, this spike is temporary, eventually returning to its pre-pandemic level. We explore possible explanations for this reversal in subsequent sections, but the most likely reason is that the pace of prison admissions began to return to typical rates in early 2021.
2.3 Mechanisms of disparity: Admissions and releases
Consider a time series of a state’s prison population that does not notably change over several years. In order for this to occur, there needs to be approximately the same number of admissions and releases during this time period. This also means that in order for the demographic makeup of the prison population to remain stable, the relative number of admissions and releases by race also need to be roughly equivalent and constant over time. If there are sustained periods where there are more admissions (or releases) of a certain racial group, that will skew the overall demographic distribution of the prison population.
In our case, understanding this dynamic offers us a path toward identifying and isolating potential mechanisms that could bring about the spike in the proportion of Black incarcerated people during the COVID-19 pandemic. Namely, the observed spike in Figure 1B is either due to a disparity in who was admitted into prison during the pandemic or who was released, or both. In the remainder of this section, we present three data-based case studies from different states that highlight the structural racism within the American criminal justice system. These case studies incorporate data from Florida, Texas, and Arkansas—three states from which we were able to find useful, high quality data. While we do not claim that these states are representative of every justice system nationwide, each case study provides evidence for one potential mechanism that contributed to the trends from Figure 1B. There are surely more, but the general lack of public data hinders our ability to systematically study the issues addressed in each case. As such, these case studies also underscore the fundamental importance of public data in the study of inequality in the United States.
Disruptions in court operations
In every state except Nebraska, courts closed at the beginning of the pandemic. This dramatically reduced or altogether halted admissions into prisons for several months, starting around April 2020 [14–16]. Releases from prisons did not experience a commensurate reduction in volume and, as such, disruptions in court proceedings are likely the largest contributor to the reductions in states’ prison populations [17].
The Virginia Department of Corrections acknowledges this in their Annual Report from 2020 [18], “The reduction in [average daily population] is directly attributed to the suspension of intake due to COVID-19.” Similarly, a spokesperson for the Michigan Department of Corrections estimated that half of the reductions in incarcerated population were due to a decline in prisoner intake from courts and county jails [19].
Here we can use data from Florida as a case study, showing how changes to typical court proceedings can potentially lead to new racial biases in the prison population [20–23]. In Figure 3, we plot monthly trial statistics from circuit criminal defendants in Florida. Prior to March 2020, an average of 14,000 defendants were disposed each month (i.e., pass through the court system and either have their charges dropped, agree to a guilty plea, go to a jury trial, or go to a non-jury trial; Figure 3A). Starting in March 2020, the number of defendants decreased sharply, reaching nearly 4,000 in May 2020; as a result, we see a backlog of cases begin to accumulate (Figure 3B).
Between March 2020 and June 2020, more than 99% of cases did not go to trial (up from an average of approximately 97% prior to 2020). An increase in the proportion of cases that get resolved pre-trial means that a greater percent of all defendants passing through the Florida courts system will either agree to a guilty plea or see their charges dropped entirely. Importantly, both of these can be sources of bias in the resulting prison population.
First, we know from previous research that Black defendants are almost 70% more likely than white defendants to receive a plea deal that involves spending time in prison [8]. Second, we show here that the percent of cases that were dismissed entirely increased from an average of 10% before 2020 up to 15% in June 2020 (Figure 3C), and this increase in dismissed cases is strongly correlated with the percent of non-white incarcerated individuals (lagged one month to account for time delays in sentencing; Figure 3D). This case study highlights multiple potential sources of bias that can stem from disruptions in the court system, and it emphasizes the need for more states to make similar sentencing data available to the public. Ultimately, the absence of data about defendants’ race limits our ability to directly connect case-dismissals to prison demographic distributions. However, we observe these same correlations between percent of dismissed cases and percent of non-white incarcerated people prior to the COVID-19 pandemic, suggesting that this relationship is potentially more general and not merely an anomaly due to the pandemic.
Changes in police interactions
Another potential source of bias in terms of who gets admitted into prisons is related to changes in how certain populations are policed during the pandemic [25]. While this is (like other topics) difficult to study quantitatively due to a general lack of data in most states, here we can point to data from Texas as an illustrative case study. Texas Highway Patrol releases monthly data about violations and traffic stops recorded by highway patrol officers [24]. This dataset includes details about the drivers involved in the traffic stop, including the race and sex of the driver.
In Figure 4, we plot the percent of all monthly traffic violations that involved drivers who were white (Figure 4A) and drivers who were Black or Latino (Figure 4B) between January 2018 and December 2020. On average before March 2020, Black and Latino drivers accounted for 61% of traffic violations in Texas. In April 2020, however, this number climbed to nearly 70% of all traffic violations. Additionally, these numbers may still represent an under-estimate of the true proportion of traffic violations by Black and Latino drivers in the dataset; an analysis from 2015 found that the Texas Department of Public Safety consistently misreported and under-counted the number of Latino drivers that were stopped [26].
Decarceration
Several states enacted policies designed to de-densify prisons in response to pressure from public health officials. In some states, this directive came from an executive order from the governor; in others, this directive originated in the legislature. In Utah, for example, policies around releases are designed, approved, and implemented by the Board of Pardons and Parole (BPP)—an entirely separate entity from the courts and the Department of Corrections. According to the BPP, incarcerated people who are eligible for early release needed to be already characterized as a non-violent offender, be within 90 days of release (this was later extended to 180 days [27]), and have an approved address to stay at after their release. In Arkansas, under authorization from Governor Hutchinson (Executive Orders 20-06 and 20-16 [28]), the parole made 1,243 incarcerated people eligible for early release as of April 30, 2020. These eligible incarcerated people needed to have a parole plan in place, be medically screened (i.e., tested and screened for symptoms of COVID-19), and undergo final approval by the Arkansas Department of Corrections director in order to be released,. In Figure 5, we compare the demographics of the prison population in Arkansas in May 2020 to the demographics of incarcerated people eligible for early release under Governor Hutchinson’s authorization [28]; despite the fact that 57.2% of the Arkansas prison population was white, over 72% of the incarcerated people eligible for early releases were white—a disparity that we would not expect to see in a prison system absent of biases in sentencing.
This is not to suggest that that every state with COVID-19 based release policies was subject to the same bias observed in Arkansas. Indeed, in Figure A.5, we compare multiple states releases-to-incarcerated ratio (i.e., percent of releases of a given race, divided by the percent of the total prison population of the same race—if this ratio is 1.0, then there is a proportional number of releases of a given race as one would expect, given the demographic composition of the prison population). In Figure A.5 we show that while, on average, Black incarcerated people are released at disproportionately low rates (and incarcerate white people are released at disproportionately high rates), the effect of COVID-19 based releases can temporarily bring about less biased release patterns. We see this in Massachusetts especially in May and November of 2020, and we are encouraged that at least in one state, it is possible to carefully decarcerate in a way that does not exacerbate existing racial inequalities.
3 Discussion
Several concerning patterns emerge through the unprecedented decrease in US prison population during the COVID-19 pandemic. After declining steadily for the last decade, the percent of Black and other non-white incarcerated people sharply during 2020, a trend that was present across the country. We collected and validated a vast dataset to detect the spike in Black and other non-white incarceration proportions during the COVID-19 pandemic. The dataset we curated for this study is itself a core contribution. In order to obtain data across all 50 states, we had to manually collect data from individual Departments of Corrections, and by filing Freedom of Information Act requests (see Methods). The continued lack of granular data across the police, court, and prison system prevents us from accurately measuring racial disparities or the mechanisms that underlie them.
Given the heterogeneity in the structures of criminal justice systems across states, fully disentangling the precise cause of national patterns is challenging. There is a lack of standardized, systematically-reported data across the country that would permit a detailed interrogation of individual state patterns. However, we explore several potential explanatory mechanisms through case studies, including biases in admissions due to atypical court proceedings (see: Florida case study), biases in admissions due to increased police interactions (see: Texas case study), and biases in releases due to public health interventions (see: Arkansas case study).
We stress that these case studies are not intended to offer a definitive account of the racial disparities that emerged during the COVID-19 pandemic; they instead constitute investigations of three aspects of the justice systems (court admissions, policing, and releases) in three different states. Together, however, they offer potential mechanisms for the trends in Figure 1, and carry evidence for bias in the same direction as the national spike in incarcerations proportions among Black and other non-white people.
The most probable cause of the large-scale disparities that were observed during the pandemic are disruptions in the court system. Our results indicate that in several settings (see: Florida case study), the increase in Black and Latino incarceration proportions could in part be due to relative increases in pre-trial case dismissals (Figure 3D) and pre-trial plea deals. Plea deals in particular have long been demonstrated to result in a disproportionate number of Black defendants spending time in prison [8, 21, 22]. Consequently, interruptions in court proceedings led circuit courts to amplify a process long understood to drive inequalities, which likely contributed to the patterns outlined in this study.
According to an analysis of data from 12 states’ court caseloads as of June 2021, there was a backlog of almost two million active pending cases awaiting legal outcomes [29]. Starting in early 2022, there will increased pressure to clear this backlog, placing an additional workload on public defenders in particular [30]. Given the backlog of cases and the increased prevalence of algorithmic risk assessment tools [31], it is not unreasonable to imagine judges may start to rely even more on algorithmic-based approaches to clearing cases in the name of meeting standards for speedy trials. Additionally, with staffing shortages on top of an already overburdened prison system [32], we can expect to see conditions for incarcerated people devolve even further.
Another possible mechanism that we highlight in case studies involves the role of disparities in police interactions during the pandemic. A body of literature has demonstrated that Black and Latino communities are policed far more heavily than their white counterparts [1– 5, 33], and has identified factors that underlie bias in law enforcement [34, 35]. In Texas, the share of Black and Latino drivers who received traffic violations increased nearly 10% during the pandemic window. There are many potential explanations for why we see this abrupt and increased bias emerge in these data from Texas during the pandemic. While traffic stops are not the only way for people to enter the judicial system—nor are they the primary source of prison admissions—Texas offers an illustrative case study with evidence that policing has changed during the COVID-19 pandemic, which could ultimately bring about changes in the racial distribution of the prison population observed during the pandemic. Note also that this specific dataset—traffic stops—points to the structural nature of racial disparities in the United States carceral system. That is, during the pandemic, we saw large reductions in mobility and commuting patterns across the United States—regional reductions that are strongly correlated with how many jobs in the area allow workers to work from home [36]. Jobs that are more likely to require in-person attendance (i.e., jobs designated as “essential work”) are disproportionately held by Black and Latino people [37].
The last of our case-driven investigations involved state-driven decarceration policies. While they played a relatively minor role in the nationwide pandemic prison population decline, they still carried the signature of racial bias that is a feature of the pandemic. Maintaining the largest and most expansive prison system in the world is a major public health threat, especially in the context of infectious diseases [38]. In particular, severely overcrowded conditions have presented a challenging and fundamental public health threat during the COVID-19 pandemic [39]. The physical and administrative structure of prisons provided constraints on ways to quarantine incarcerated people and de-densify congregate settings [39–43]. In recognition of these circumstances, several states enacted policies and initiated executive orders to release individuals who they deem eligible [17].
As a public health intervention, decarceration is a highly effective way to mitigate outbreaks inside and outside of prisons [39, 41–45]. During the pandemic, criteria for decarceration differed from state-to-state, but often included factors such as the age of the incarcerated person and the offense for which they were convicted (e.g. nonviolent drug offenders) [46]. In this study, we able to quantify disparities in the efforts to de-densify prisons (see: Arkansas case study), which suggests that even decarceration policies widely understood to be consistent with effective and ethical public health practice (and that are assumed to be “race blind”) are also susceptible to biases. And one of the most important consequences of bias in releases is not only about who is released, but who is left behind: the increase in the proportion of incarcerated Black and other non-white people translates to their being at a heightened risk of exposure to SARS-CoV-2.
The COVID-19 pandemic has disrupted every facet of public life. In this study, we examine how these disruptions reveal structural racism underlying mass incarceration. As is the case with many large and complex social phenomena, the dynamics of the prison populations are defined by interactions between multiple actors, that in combination, create surprising or troubling results. In response to these findings, society has an ethical obligation to act, and reform the criminal legal system towards more equitable ends.
4 Data & Methods
4.1 Prison population by state over time
Time series data about states’ prison population over time were collected manually through scraping Departments of Corrections websites, as well as direct requests to state officials through public record requests (e.g. Freedom of Information Act requests, etc.). For every state in our dataset we sought the most temporally resolved data as possible. We collected population data at either weekly, monthly, quarterly, or, for some states, yearly levels. The most common form of data we were able to collect is the number of currently incarcerated persons in a given state, on a monthly timescale. In Table A.1, we link to the data source for every state in our dataset, and in Section A.1, we show how the prison population of every state has changed over time.
We compared the data collected here to data from other organizations that report statistics about the U.S. prison population—the Bureau of Justice Statistics (BJS) [4] and the Vera Institute for Justice [47]—and find high overlap between all three of the datasets. In Section A.2, we identify where our data differs from the BJS data, and we offer an explanation for why we are confident in accuracy of our approach (e.g. in several cases, we received the data directly from the states’ Departments of Corrections, via public records requests).
For every state in this dataset, the total prison population includes both male and female incarcerated people (something that is not always the case in studies about the U.S. carceral system, which so often focuses on male incarcerated people). In New Mexico and Vermont, “Transgender” or “Other” are also listed as gender categories, though this practice is not widely adopted in reporting statistics about the incarcerated population. In 27 states, incarcerated race statistics are separated by “male”, “female”, and “total”, and further characterizing the interaction between race and sex in admissions and release bias during the COVID-19 pandemic remains future work.
4.2 State policy data
4.2.1 Court closures and reduced intake capacity
Qualitative data on the closure and reopening of all 50 state court systems were collected primarily through the administrative orders and/or press releases of each state system’s Supreme or Superior Court or chief judicial officer as well as through local news coverage. The vast majority of states suspended all in-person proceedings with the exception of limited emergency matters between March 12 and March 20, 2020. Several states that adopted policies early in this period issued increasingly strict guidance as the pandemic worsened. New Jersey, for example, suspended new trials on March 12 and issued a two-week suspension on municipal court proceedings on March 14 before finally suspending all proceedings (with emergency exceptions) on March 15. In addition to closing judicial buildings and suspending proceedings, most court closures also extended statute of limitations and filing deadlines due to pandemic disruption. A handful of states, Pennsylvania and Texas among them, permitted or encouraged courts to begin conducting remote proceedings in their initial closure orders, though the adoption of remote proceedings was not widespread in this initial lockdown stage.
Court reopening policies were significantly more heterogeneous than the initial closures, though trials remained suspended in most states through at least early-Summer 2020 (and in most cases substantially later). The earliest such policies appeared at the beginning of April 2020, with most aimed at giving regional and local judges discretion to begin hearing proceedings remotely (e.g. Louisiana, Massachusetts, Florida, Iowa, among others). A substantially larger group of states adopted reopening guidelines between late-April and mid-May, many of which allowed essential judicial staff to return to offices following new public health guidance while also maintaining remote proceedings and expanding the number of non-trial proceedings that courts could conduct remotely. Further reopenings and the resumption of limited in-person proceedings took place in many states throughout June, July, and August 2020, though trial proceedings remained suspended. Notably, several states, especially those that adopted phased reopening plans, restricted in-person proceedings and further delayed trial resumption with the Fall-Winter 2020-21 COVID surge. In many states, most administrative orders restricting court operations have at the time of publishing been rescinded, though others, California notably among them, still retain certain accommodations including the option for remote proceedings.
4.2.2 Release policy data
Data on COVID release policies, where they existed, were collected from states’ individual corrections/prison bureau systems, governors’ executive orders, and local news coverage. Fifteen states did not adopt any official release policy, though our data nevertheless shows that there were still reductions in the overall prison population during the pandemic in all of these states. The remaining 35 states adopted policies with varying degrees of specificity and effectiveness, though many overlapped in their broadest contours, allowing consideration for early release to be granted to incarcerated people at increased public health risk (either due to age or underlying health condition) and for those nearing parole and/or the end of their prison sentences.
Almost all states with such policies did, however, adopt a restriction preventing the release of those incarcerated for violent crimes or sex offenses. North Dakota was an outlier in this regard. Of the 120 people the state initially released from prison in March 2020, 14 were serving time for violent crime convictions and 11 were convicted of sex offenses. New York’s release policy was notably more restrictive (on paper at least) than many other states—only those incarcerated for “non-criminal technical parole violations” were eligible for COVID release. As an example of one state’s release policy, we include below an excerpt from the Virginia Department of Corrections’ policy on releases [48], from April 24, 2020:
The Director of the Department of Corrections is authorized to consider early release for individuals with less than one year left to serve while the COVID-19 emergency declaration is in effect. Offenders convicted of a Class 1 felony or a sexually violent offense are not eligible for consideration. The exact number of individuals eligible for early release consideration will change depending on the length of the emergency declaration order. The [Department of Corrections] will identify those that are eligible for consideration using the procedures it has developed to ensure public safety and will notify offenders who are to be released under the early release plan. A diagnosis of COVID-19 is not a release factor.
The following Early Release Criteria will be utilized in considering an incarcerated person for early release pursuant to legislation:
Release Date: The inmate’s Good Time Release Date must be calculated and verified in order for the incarcerated to be considered.
Inmate Medical Condition: The inmate’s medical condition will be considered.
Offense History: By legislative mandate, early release does not apply to inmates convicted of a Class 1 felony or a sexually violent offense. Consideration for early release will be based on the seriousness of the current offense, in descending order as follows: Non-violent Offense, Felony Weapons Offenses, Involuntary Manslaughter, Voluntary Manslaughter, Robbery, Felony Assault, Abduction, Murder, Sex Offense
Viable Home Plan: The incarcerated person must have a documented approved home plan to be considered.
Good Time Earning Level: The inmate’s current good time earning level must be I or II to be considered.
No Active Detainers: Inmates must have no active detainer to be considered.
No Sexually Violent Predator Predicate Offenses: Inmates convicted of one or more sexually violent offenses established in §37.2-903 of the Code of Virginia are not eligible pursuant to legislation.
Recidivism Risk: Inmates must have a risk of recidivism of medium (5-7) or low (1-4), as identified by the validated COMPAS instrument, to be considered.
Note especially the inclusion of the COMPAS risk assessment tool, which is used in court systems across the U.S. as a way of quantifying an offender’s likelihood of re-offending (recidivism). Over the last several years, we’ve seen a growing body of scholarly work devoted to identifying problematic and harmful racial and economic biases that arise when algorithmic risk assessment tools are used in practice [49–54]. COMPAS, in particular, has been the subject of a number of studies that take a critical look at the effectiveness—and ethics—of these risk assessment tools in the justice system [31, 50]; in one study, COMPAS was found to predict recidivism 61% of the time, but at the same time, Black people were almost twice as likely to be labeled as high risk for re-offending but not actually re-offend [31].
Ultimately, further research is needed to quantify demographic patterns in the incarcerated individuals who were released across different states, and because there was such high heterogeneity in different states’ policies, it remains an open question whether we will see the same broad, systematic racial differences among the people who were released. However, as has been the case throughout the COVID-19 pandemic, heterogeneous policy responses across localities has typically had detrimental effects on our collective response to the pandemic [55].
4.3 Study definitions of race and ethnicity
The data that we collected for the study used definitions of racial and ethnic groups that were determined by the agencies that collected the data. When the authors are discussing race and ethnicity in their interpretations, they are referring to the historical categories that have social, cultural and political consequences. We use the term “Latino” to describe persons that are otherwise described as “Hispanic” in many settings. We have used the term “non-white” in select locations, as not all states had data disaggregated into the same set of categories. And so for some analyses, “non-white” directly describes the available data. For a table of the race categories reported by every state in our dataset, see Table A.6.
Recent advances in medical conventions have prompted discipline-wide introspection about the ways that race and ethnicity are discussed and used in research [56]. This is of critical importance to health equity and racial justice, and while in this work we rely on race statistics reported by states’ Departments of Correction, future work will critically examine the differences in approaches for reporting race and ethnicity statistics of incarcerated populations. Notably, it is important to know whether a state’s statistical reports use race categories that have been self-reported by the incarcerated person or whether it is interviewer-observed, which is often the case in administrative databases. These approaches are quite different and often result in inaccuracies in measurement of racial disparities [57].
Data Availability
All data produced in the present study are available upon reasonable request to the authors.
Additional information
Software and data availability
The dataset and Python code to reproduce the analyses and construction of the database is available at https://github.com/jkbren/decarceration-project.
Author contributions
B.K., C.B.O., S.V.S., & E.H. conceived the project. C.B.O., S.V.S., & E.H. directed the project. B.K. directed the construction of the data science pipeline. B.K., B.J.S., Z.B., P.K., J.S., & N.K. collected data. B.K., C.B.O., B.J.S., and S.V.S. conducted analyses. B.K., C.B.O., E.A.W., T.E.R., S.V.S., & E.H. interpreted and integrated the results. B.K., C.B.O., B.J.S., E.A.W., T.E.R., S.V.S., & E.H. contributed to researching, writing, and editing the final manuscript.
A Supplemental Information
A.1 State-by-state breakdowns
In Figure A.1, we show the prison population over time for all 50 states and the District of Columbia. In Table A.1, we give an overview of the scope of each state’s data in our dataset. In Table A.2, we list every state in order of the maximum reduction in prison population, alongside the month that this decrease was observed.
A.2 Comparison across prison population datasets
Other organizations collect and report data about prison populations over time. In order to situate the data used here within a broader body of work studying U.S. prison population trends, we validate against data released by the Bureau of Justice Statistics [4] (BJS) and the Vera Institute for Justice [47] (Vera). In Figure A.2, we plot the BJS’s yearly estimates of the number of people in state prisons across the United States since 2014. At the time of writing, the yearly “Prisoners in 2020” report has not been released by the BJS (though see “Prisoners in 2019” [4] for similar report), and as such, we only compare our data to BJS data ending on December 31, 2019. We concatenate the BJS data with the Vera data to approximate a “ground truth” estimate for the prison population over time.
We note several key points. First, starting in 2020, our dataset almost exactly matches the Vera dataset. Prior to 2020, our dataset reports a prison population that is approximately 1% smaller than the BJS data. After investigating what could have brought about these differences between the two datasets, we identified five states with the largest between-dataset differences (Montana, Florida, Texas, Virginia, and Ohio; see Table A.3). Because of these discrepancies, we took additional care to confirm that the data we had collected was exactly what was reported by states’ Departments of Corrections websites (or sometimes, through Freedom of Information Act requests). In Section A.2.1, we describe the rationale for why we are confident in the data included in the present study, and we also directly link to the data sources used to offer transparency in the data collection process.
A.2.1 Comparison to Bureau of Justice Statistics data
To our knowledge, the scale of the data that we assembled in this work is unique among the available public datasets about states’ prison populations over time. In Table A.3, we dive deeper into the discrepancies between the data used here and those that were released by the Bureau of Justice Statistics [4]. We offer explanations that reconcile why we may observe such differences, and we conclude that the data reported here is consistent with state prison population statistics reported by states’ Department of Corrections.
A.3 Court system, policing, and inmate release data
In addition to data about states’ prison populations and prison policy, we also used state-specific data about outcomes of court proceedings, traffic stops, and inmate releases in order to tell a broader story about the structural effects of mass incarceration during the COVID-19 pandemic.
A.3.1 Florida trial statistics data
To collect the court proceedings data used in Section 2.3 [20], we manually downloaded monthly data about the statewide data on the outcomes of Circuit Criminal Defendants between January 2018 and June 2020. We summarize this dataset in Table A.4.
A.3.2 Texas highway patrol data
Pursuant to Senate Bill 701, the Texas Department of Public Safety provides high resolution datasets about policing, public safety, and law (https://capitol.texas.gov/BillLookup/History.aspx?LegSess=82R&Bill=SB701) [24]. We downloaded monthly traffic stops data starting in January 2018, until December 2020. While we only used data about the reported race of the drivers involved in traffic stops, the dataset includes a rich range of personal and situational variables about the drivers and police officers involved in the traffic stop (for a list of the variables included see https://www.dps.texas.gov/sites/default/files/documents/highwaypatrol/docs/thpviolationscolnames.pdf).
A.3.3 Arkansas eligible release data
While there are not particularly noticeable changes after March 2020 in the trends of both curves in Figure A.5C, here we directly quantify racial disparities among the inmates who were eligible for release [28]. Using data released by the Arkansas Department of Corrections, we joined inmate identification numbers to their listed race and sex according to the Arkansas Department of Corrections Inmate Search tool (https://apps.ark.org/inmate_info/index.php).
A.4 Citation diversity statement
Recent work has quantified bias in citation practices across various scientific fields; namely, women and other minority scientists are often cited at a rate that is not proportional to their contributions to the field [58–65]. In this work, we aim to be proactive about the research we reference in a way that corresponds to the diversity of scholarship in this field. To evaluate gender bias in the references used here, we obtained the gender of the first/last authors of the papers cited here through either 1) the gender pronouns used to refer to them in articles or biographies or 2) if none were available, we used a database of common namegender combinations across a variety of languages and ethnicities. By this measure (excluding citations to datasets/organizations, citations included in this section, and self-citations to the first/last authors of this manuscript), our references contain 31% woman(first)-woman(last), 19% woman-man, 12% man-woman, 10% man-man, 0% nonbinary, 15% man solo-author, and 13% woman solo-author. This method is limited in that an author’s pronouns may not be consistent across time or environment, and no database of common name-gender pairings is complete or fully accurate.
Acknowledgements
The authors thank Harrison Hartle, Stefan McCabe, Timothy LaRock, Ryan Gallagher, and members of the Northeastern University NULab for helpful conversations and tips with constructing the dataset. The findings and conclusions in this study are those of the authors and do not necessarily represent the official position of The Rockefeller Foundation.
Footnotes
↵* b.klein{at}northeastern.edu; brandon.ogbunu{at}yale.edu; sscarpino{at}rockfound.org; elizabeth.hinton{at}yale.edu
References
- [1].↵
- [2].
- [3].
- [4].↵
- [5].↵
- [6].↵
- [7].↵
- [8].↵
- [9].↵
- [10].↵
- [11].↵
- [12].↵
- [13].↵
- [14].↵
- [15].
- [16].↵
- [17].↵
- [18].↵
- [19].↵
- [20].↵
- [21].↵
- [22].↵
- [23].↵
- [24].↵
- [25].↵
- [26].↵
- [27].↵
- [28].↵
- [29].↵
- [30].↵
- [31].↵
- [32].↵
- [33].
- [34].↵
- [35].↵
- [36].↵
- [37].↵
- [38].↵
- [39].↵
- [40].
- [41].↵
- [42].
- [43].↵
- [44].
- [45].↵
- [46].↵
- [47].↵
- [48].↵
- [49].↵
- [50].↵
- [51].
- [52].
- [53].
- [54].↵
- [55].↵
- [56].↵
- [57].↵
- [58].↵
- [59].
- [60].
- [61].
- [62].
- [63].
- [64].
- [65].↵