National Center for State Courts

 

Improving Justice through Leadership
and Service to the Courts

     

  

Trial Court Perforrmance Standards & Measurement System

Measure 3.3.3: Equality and Fairness in Sentencing

One application of Standard 3.3 is sentencing in criminal cases. Because the imposition of criminal sanctions deprives individuals of their liberty, the fairness of the process and corresponding outcomes is an important topic for the measurement of court performance. In fact, some courts might regard fairness in sentencing to be among the most critically important goals that it should strive to meet. However, fairness in sentencing is understandably very difficult to measure.24 Even the most refined measurement will produce results more suggestive than definitive, which is not astonishing given the difficulty of sentencing for trial judges. Just as the trial judge must weigh, balance, and take into account many factors, the court researcher must identify, measure, and interpret the effects of many complex factors, including some that are difficult to express as a precise scale of measurement.

Hence, trial courts take on a very daunting task by attempting to measure fairness in sentencing. Why? Because of the sensitive nature of conclusions about fairness, a court will want to know that the conclusions are valid to the greatest extent possible. However, sound conclusions require a rigorous methodology, which requires a substantial commitment of time, quantitative skills, and resources. Thus, without intending to deter courts from applying this measure, honesty requires acknowledging the labor-intensive aspect of the measurement process necessary to reach the kind of conclusions the court is likely to want to draw. (Note: The same point applies equally to the measurement of fairness in bail decisions, Measure 3.3.4.)

What does fairness in sentencing mean? According to Standard 3.3, "trial courts give individual attention to cases deciding them without undue disparity among like cases and only upon legally relevant factors." Translated into more operational terms, the standard is saying that the imposition of punishment should not be on the basis of a defendant’s race or gender. For example, African Americans should not receive longer sentences than non-African Americans simply because they are African American. Different sentences should be the product of differences in criminal backgrounds, offense severity, circumstances surrounding the offense, and other legally relevant factors.

Finally, while equality and fairness are positive standards, they are observed in the negative. Courts are urged to be equal and fair in their treatment, but their performance is measured in terms of outcomes that are not supposed to occur—inequality, disparity, and inconsistency.

Planning/Preparation. Courts should consider four steps in planning to undertake the measure. First, some familiarity with the literature on sentencing might prove useful. The most comprehensive volume, Research on Sentencing: The Search for Reform, is published by the National Academy of Sciences and available in most public and college libraries. The volume is written from the researcher’s perspective, however, and contains some articles of a technical nature. A complementary article, "Racial Discrimination" by Rose Matsui Ochi, which appeared in 1985 in The Judges’ Journal, illustrates how research results are interpreted and used by practitioners who seek to eliminate bias in sentencing. This article is also useful because it references additional readings on the topic that are readily available.

A second step is for the court to examine its capacity for conducting a rigorous measurement process. If the court lacks a staff person skilled in quantitative analysis, it might find it helpful to ask for guidance and assistance from a staff member of a State sentencing commission, State administrative office of the courts, or local university to assist in designing a plan of data collection, analysis, and interpretation.

A third step is to set some boundaries on the scope of the measurement process. Despite the fact that researchers construct very complex quantitative models of sentencing, the proposed measure is intended to help a court assess itself and not necessarily to advance the state of knowledge. Hence, it permits the court to limit the scope and detail of its inquiry without sacrificing the validity of the results. As an example, the court needs to decide what aspect of sentencing is of greatest importance. Is it more important to determine fairness in the types of sentences that defendants receive (e.g., incarceration versus probation) or in the length of sentences imposed (e.g., are men incarcerated for longer periods of time than women)? Are both aspects equally important?

Finally, before applying the measure, the court should discuss how it plans to interpret the results. The results will be in the form of numbers called coefficients that are based on the application of quantitative techniques to information gathered from individual case files. There will be a coefficient for each legally relevant (e.g., prior record and offense committed by the offender) and each extra-legally relevant factor (e.g., race of offender). The coefficient measures the impact of a particular factor, controlling for the effects of all other factors. If the legally irrelevant factors are not influencing outcomes, the numerical value of their respective coefficients will not be statistically different from zero. For example, knowing that an offender is a man will not predict the sentence any better than knowing that the offender is a woman. Additionally, the coefficients of all legally relevant factors should be significantly larger than those of irrelevant factors. If they are, one reasonably can draw the conclusion that there is limited bias in sentencing and that sentencing is primarily a product of legally relevant factors. If the court knows what to look for in advance, it will be more prepared to interpret and use the results both internally for self-improvement and for presentation to interested groups outside the court.

Defining the Data Elements. Although the exact delineation of legally relevant and legally irrelevant factors may vary somewhat across States because of differences in substantive and procedural law, some distinctions likely will be valid in almost all situations. For the purposes of demonstrating the utility of the measure, therefore, it is assumed that legally relevant factors include offense seriousness, quality of the evidence, prior criminal record, and current legal status. Irrelevant factors include demographic, socioeconomic, and social stability attributes, and case processing attributes.25 Based on that assumption, a court meeting Standard 3.3 has sentencing outcomes that can be explained more on the basis of those legally relevant factors than on factors deemed irrelevant.

In addition to identifying a set of determinants of sentencing outcomes, the initial measurement step involves specifying the outcomes of sentencing. Two related outcomes are especially important:

  • In/Out Decision. Is the offender sentenced to a term of institutional incarceration? Or is the offender given some alternative such as probation, restitution, community service, or fine?

  • Length of Sentence. How long is the period of institutionalized incarceration?

The first outcome distinguishes between convicted offenders who are sentenced to prison or jail and those who are given a sentence outside these institutions. The second outcome focuses on the length of the sentence in years, months, or days imposed on individuals sentenced to jail or prison.

Legally relevant factors: Concerning the seriousness of the offense, a basic judgment must be made to focus on either a broad range of offenses or to isolate particular offense categories (e.g., robbery, burglary). The first option is to consider a large set of offenses and to rank them according to severity (e.g., homicide, robbery, rape, assault, weapons, drug sale, drug possession, burglary, forgery, and theft).26 Additionally, other indicators may be used to gauge the more specific degrees of severity, such as the use of a dangerous weapon, the extent of injury to the victim, the amount of property taken, and whether the offender was a principal or accessory to the offense.

Although some version of the first approach is highly recommended, a second option is to focus on selected offenses separately. If particular offenses are deemed of such importance to the court and the community that they merit special attention, this approach may be appropriate. However, this option lacks the representativeness of the first option, which encompasses the full range of offenses. Hence, we generally recommend some version of the first option.

The quality of the evidence is extremely difficult to measure and may be known fully only by the participants involved in each individual case. As a result, retrospective reliance on case records for information only approximates the complete and correct picture of the strength of the evidence. Possible indicators include the number of prosecution witnesses, the number of expert witnesses, the number of exhibits, the submission of laboratory tests, and so forth. A limitation to these indicators, of course, is that they relate primarily to the few cases that go to trial.

Prior criminal history is usually information presented to the court from State law enforcement records. Although some law enforcement information systems are more detailed than others, criminal history generally is measured in terms of the number of prior adult felony convictions, the elapsed time since the last conviction, whether the last conviction was for the same offense as the current charge, and the current legal status of the individual at the time of arrest (e.g., on parole or probation).

Legally irrelevant factors: Demographic, socioeconomic, and social stability factors are a combination of quantitative indicators such as age (years), income (earned income per month), education (number of years) and categories such as gender (male versus female), race (white versus nonwhite), employment status (employed versus unemployed), and marital status (married versus nonmarried).

The case processing characteristics are all categories. Pretrial release status may be divided between those offenders on bail, those detained at least part of the time between arrest and final disposition, and those detained all of the time. Disposition similarly can be separated among those offenders who pled guilty, those convicted by a bench trial, and those convicted by a jury trial.

A final factor is the judge presiding over a sentencing decision. Each judge need only be identified by an alphabetic character (e.g., Judge A, Judge B, Judge C, and so forth). The measure is intended to determine if any judge has an influence on sentencing that is greater than generally accepted legal factors. Sentencing outcomes involve the distinction between institutional incarceration and some alternative to incarceration. This distinction captures the in/out decision. For the length of the sentence, a standardized measure is the percentage of the statutory maximum imposed in the actual sentence. Because some sentences may involve a range, the minimum of the sentence imposed should be used in calculating the percentage. This standardization permits different offenses to be compared despite their differences in severity.

Data Collection. In most jurisdictions, virtually all of the factors and sentencing outcomes can be measured against information contained in presentence investigation reports and closed court case records. A court can use these sources by drawing a random sample of approximately 1,000 closed cases and selecting from that pool those cases in which a conviction was obtained by guilty plea or trial. (The remaining cases should not be discarded because they can be used as part of the data set for Measure 3.3.4, Equality and Fairness in Bail Decisions.) Of this pool, 70 percent are likely to involve some sort of conviction, which means that these 700 cases can be used to examine the factors associated with the in/out sentencing decision. Of these cases, approximately half will involve a sentence of institutional incarceration, providing the basis for assessing the factors associated with the length of the sentence.

The measurement of sentencing and sentencing outcomes described above needs to be translated into a more specific and detailed form prior to the review of court case records and presentence investigation reports. A data collection form should be constructed for the purpose of applying the sorts of indices suggested for the different factors. (Please see Form 3.3.3, Illustrative Sentencing Data Collection Form.)

Data Analysis and Report Preparation. The question of whether legally relevant factors are more powerful predictors of sentencing outcomes than are irrelevant factors is addressed by the use of statistical models. These models are available in many software computer programs that are likely to be familiar to sentencing commission staff, court researchers in a State administrative office, or university professors. One or more of these individuals will likely know how to use an appropriate software program to analyze the data collected on the data collection form. Specifically, the expert will know what particular quantitative techniques should be applied to determine the independent impact of each legally relevant and irrelevant factor on the two types of sentencing decisions.

In the case of the in/out decision, an appropriate technique is logit analysis. Logit analysis is designed to indicate the independent effects of various factors on different categories (e.g., a sentence of institutional incarceration versus one of nonincarceration). The numbers generated by logit analysis include coefficients for each factor. The sign (±) of the coefficient indicates whether there is a positive (e.g., the more serious the offense, the more likely the sentence will involve incarceration) or inverse (e.g., the longer the length of time since the last conviction, the less likely the sentence will involve incarceration) relationship between each factor and the outcome. A comparison of the magnitude of the coefficients will indicate the relative importance of each factor in determining whether an offender is sentenced to prison as opposed to some alternative sentence.

The issue of the length of sentences for incarceration is examined appropriately through the use of regression analysis. Regression analysis is designed to indicate the independent effects of factors on an interval factor such as the number of months to be served. Similar to the logit analysis, coefficients are generated by regression analysis. They indicate if there is a positive (e.g., the older the offender, the longer the sentence) or inverse (e.g., the higher the offender’s level of education, the shorter the sentence) relationship between each factor and the length of the sentence.27

The coefficients bear upon the central purpose of the measure in two ways. First, if the legally irrelevant factors are not influencing outcomes, the coefficients associated with them should not be statistically different from zero.28 Second, the coefficients of all legally relevant factors should be significantly larger than those of irrelevant factors.

Looking at the coefficients associated with the different factors, the court can begin to assess their implications for fairness in sentencing. Do the results signal that legally irrelevant factors are having undue influence on the likelihood of incarceration or the length of sentences? Or do the results signal that irrelevant factors fail to account for the court’s decisions to sentence offenders to prison or the length of prison sentences? In sum, do the results indicate that sentencing decisions are the product primarily of legally relevant factors and that irrelevant factors are of limited significance?

Depending on what the results indicate, the court can use the information as a guide to reviewing its sentencing policies, practices, and procedures. The results might suggest the need for special training programs for newly appointed judges, especially those who come from private civil practice backgrounds. Or, the results might suggest the need for a courtwide training program on current developments in substantive and procedural criminal law.

________________________________________
24 The measure proposed outlines a statistical approach to assessing whether there is undue disparity and bias in a court's proceedings. However, it is not a complete treatment of every aspect of particular techniques and their interpretation. For this reason, the court may wish to consult outside experts when applying the measure.
25 The definition of the data elements and the proposed methods of data analysis reflect the input and advice of academic sentencing experts and former staff of the U.S. Sentencing Commission. Their opinions were solicited to achieve maximum statistical validity, although future research is likely to use even more refined methods in this growing area of research.
26 An offense severity scale can be developed by assigning numerical weights to different offenses. The U.S. Sentencing Commission has constructed such a scale.
27 For discussion of parallel applications of this technique to case processing data, see R. Flemming, P. Nardulli, and J. Eisenstein, "The Timing of Justice in Felony Trial Courts," Law & Policy 9 (1987); and M. Luskin and R. Luskin, "Why So Fast, Why So Slow: Explaining Case Processing Time," Journal of Criminal Law & Criminology 77 (1989).
28 A coefficient may be greater but not statistically greater than zero because the factor under consideration (e.g., race) does not have consistent, uniform effects on what is being measured (e.g., sentence length). However, a statistical test performed by the software will indicate whether each coefficient is significantly greater than zero.

Go to Form 3.3.3

Go to Measure 3.3.4

Go to Standard 3.3

Go to Table of Contents

 

line.gif (1068 bytes)
Copyright © 2001 National Center for State Courts
Last Modified: January 23, 2005