Supplementary

1110-0005_Arrest Estimation.docx

Age, Sex, and Race of Persons Arrested 18 Years of Age and Over; Age, Sex, and Race of Persons Arrested Under 18 Years of Age

Supplementary

OMB: 1110-0005

Document [docx]
Download: docx | pdf



Procedures for Estimating in the Uniform Crime Reporting (UCR) Program

James H. Noonan, PhD

The UCR Program will begin offense estimation in the 2013 Crime in the United States (CIUS) publication. The following information details the intended method for achieving a valid and reliable estimate and uses an example of Arrest offense estimation to demonstrate the employed statistical model.

Currently, the UCR Program publishes Arrest offense data by agency population group, region, city, county, and suburban area shown in Arrest tables 30-69 in CIUS. The FBI UCR Program does estimate arrest data. Table 29, the Estimated Number of Arrests, in CIUS provides the estimated number of persons arrested for the 28 offenses for which the FBI’s UCR Program collects data. The FBI calculates arrest rates using data from all law enforcement agencies submitting a complete year of arrest data, and then multiplying this complete-response arrest rate by the total population (including those of incomplete and non-reporters) in each arrest offense and population group and then multiplying by 100,000 for each arrest offense. However, arrest data is not imputed and then aggregated to the national level. The FBI UCR Program does not impute missing arrest data from either nonreporting or partially reporting jurisdictions. Published estimates are based only on data form those jurisdictions submitting 12 months of arrest data because the arrests from partial and incomplete reporting agencies are excluded from the rate calculation.



National Estimation

For 2012, 17,010 agencies reported data to the FBI, and of those 15,387 reported a complete 12 months of data.1 However, for arrests, only 12,196 reported 12 months of data which means the missing information must be imputed. National estimates for offense data will be computed by summing agencies reporting a complete set of data with agencies who are estimated using one of two imputation methods based on the number of months the agency reported for the calendar year.









Number of Agencies by Number of Months of Competed UCR Reporting


Number of Months

Frequency

Percent

Valid Percent

Cumulative Percent

1

102

.6

.6

.6

2

94

.6

.6

1.2

3

141

.8

.8

2.0

4

70

.4

.4

2.4

5

87

.5

.5

2.9

6

91

.5

.5

3.4

7

108

.6

.6

4.1

8

116

.7

.7

4.8

9

152

.9

.9

5.6

10

226

1.3

1.3

7.0

11

436

2.6

2.6

9.5

12

15,387

90.5

90.5

100.0

Total

17,010

100.0

100.0

 



Imputation for Agencies with 3-11 Months of Data

For agencies reporting 3-11 months of data to the FBI, it is assumed the agency best represents itself for imputing missing months. As such, the follow formula will be used to impute the agency’s arrest data:





For example if an agency reported 15 arrests in seven months of reporting there would be an estimated 26 arrests for the agency.



Imputation for Agencies with Less Than Three Months of Data

For agencies reporting less than three months of data, the FBI UCR Program will impute the entire year of arrest data based on the rate of arrest per 100,000 persons within the agency’s population group multiplied by the agency’s population.



For example, Philadelphia arrest data was not reported in 2012. The arrest rate for agencies over one million in population is 4,275.4 per 100,000 persons and Philadelphia has a population of 1,538,957 persons. Therefore its estimated arrest count for 2012 is 65,797.



Combining Estimates With Reported Totals

Once the agencies are imputed, their arrest totals will be summed with the agencies that reported a complete 12 months of arrest data to derive the national estimate. For 2012, there were 53,941 reported arrests by 12 month agencies and 12,397 estimated arrests. Therefore, the estimated number of arrests in the United States in 2012 is 66,338 arrests.



Limitations and Future Improvements

The imputation method demonstrated in this paper’s estimation model is based on the assumption that agencies within population groups are most similar to one another. Further study needs to be done to determine if regional characteristics of crime would help better stratify agencies into more representative groups. Until other characteristics, such as poverty rates, employment rates, or other socioeconomic variables become attributable to specific agencies and statistically tested for validity, the best stratification variable the FBI UCR Program has is agency population group. The FBI UCR Program is also assuming that the best representation of an agency’s missing data is the agency’s data reported in other months, but there is no evidence which definitively shows that a partial report’s own data qualifies as a representative sample.

Once new variables are tested and validated, other techniques for imputation may be used, such as regression analysis. Other types of imputation models, such as bootstrapping, are not currently available with the FBI UCR Program’s current technological or staffing resources; however, the Program is working toward a partnership with the Bureau of Justice Statistics (BJS) which will allow for shared resources in the future which could handle more sophisticated and statistically valid and reliable methods. Arrests may have seasonal variation that will need to be studied and identified for use in future estimations.

Agencies with zero-population are not represented in the imputation model. The FBI UCR Program will examine the value of longitudinal and ratio-based models for future estimation methods. For example, there may be a reliable ratio between offenses which can best represent missing data, including those with zero-population. Using an agency’s previous reporting patterns over the last five years may also yield valuable estimation results, but these models are not yet tested. Discussions concerning these methods with the BJS showed there is value in examining new cross-sectional, longitudinal, and ratio models, but they will take some time to develop and validate.

A final potential future improvement for nationally estimating crime data is testing UCR data for representativeness. If the data are tested and found to support the idea that UCR data is a statistically representative sample of reported crime data, then sampling estimation models can be used to estimate national reported crime totals. Of course, should the tests show the data are not representative; the other aforementioned imputation models will still be available to the FBI UCR Program to meet the need for determining national crime estimates.

Given the limitations, this estimation proposal is proposed as the next step for the FBI UCR Program to enhance its value for estimating crime data. It is not intended as a once-and-for-all solution for estimation. It is also expected that a current National Academy of Sciences and future partnering opportunities with the BJS and other UCR stakeholders will provide recommendations for estimation that are not currently considered or anticipated in this document.

This proposed method also has other uses within the UCR Program, including providing estimations for other reported data collections, including Part I crimes, police employment data, aron data, hate crime, cargo theft, human trafficking, and the Law Enforcement Officers Killed and Assaulted program. Specific methodological considerations need to be addressed for unique data concerns within each collection, but a method similar to the arrest estimation example can be used as the base estimation model for those programs.

1 FBI UCR Program. (2010). Crime by County Datafile. Processed May 13, 2013.

4/3/2014 Page | 3

File Typeapplication/vnd.openxmlformats-officedocument.wordprocessingml.document
Authorjnoonan
File Modified0000-00-00
File Created2021-01-28

© 2025 OMB.report | Privacy Policy