1
MEMORANDUM FOR: Lisa Clement
Survey Director for CPS & Time Use, Associate Directorate for Demographic Programs
From: Ruth Ann Killion
Chief, Demographic Statistical Methods Division
Subject: Source and Accuracy Statement for the October 2013 CPS Microdata File on School Enrollment
Attached is the statement on the source of the data and accuracy of the estimates for the October 2013 CPS Microdata File on School Enrollment.
If you have any questions or need additional information, please contact Stephen Clark of the Demographic Statistical Methods Division via email at [email protected].
Attachment
cc:
G. Weyland (ADDP)
K. Woods
N. McKee
B. Kominski (SEHSD)
W. Savino (ACSD)
D. Smallwood
J. Sweeney
C. Bennett
J. Farber (DSMD)
Y. Cheng
J. Scott
I. Irizarry-Pietri
T. Kennel
D. Hornick
S. Clark
M. Herbstritt
R. Hoop
S. Eaton
R. McElroy
1Source of the Data and Accuracy of the Estimates for the
October 2013 CPS Microdata File on School Enrollment
Basic CPS 1
October 2013 Supplement 2
Estimation Procedure 2
Sampling Error 3
Nonsampling Error 3
Nonresponse 3
Sufficient Partial Interview 4
Coverage 4
Comparability of Data 5
A Nonsampling Error Warning 6
Standard Errors and Their Use 6
Estimating Standard Errors 7
Generalized Variance Parameters 7
Standard Errors of Estimated Numbers 8
Standard Errors of Estimated School Enrollment Numbers 9
Standard Errors of Estimated Percentages 11
Standard Errors of Estimated Differences 12
Standard Errors of Quarterly or Yearly Averages 13
Technical Assistance 13
Table 1. CPS Coverage Ratios: October 2013 5
Table 2. Estimation Groups of Interest and Generalized Variance Parameters 8
Table 3. Population Controls for School Enrollment Age Groups: October 2013 10
Table 4. Parameters for Computation of Standard Errors for Labor Force Characteristics:
October 2013 14
Table 5. Parameters for Computation of Standard Errors for School Enrollment
Characteristics: October 2013 15
Table 6. Year Factors for Non-School Enrollment Characteristics 16
Table 7. Year Factors for School Enrollment Characteristics 17
Table 8. Factors for Regional Standard Errors and Parameters 17
1Source of the Data and Accuracy of the Estimates for the
October 2013 CPS Microdata File on School Enrollment
SOURCE OF THE DATA
The data in this microdata file are from the October 2013 Current Population Survey (CPS). The U.S. Census Bureau conducts the CPS every month, although this file has only October data. The October survey uses two sets of questions, the basic CPS and a set of supplemental questions. The CPS, sponsored jointly by the Census Bureau and the U.S. Bureau of Labor Statistics, is the country’s primary source of labor force statistics for the entire population. The Census Bureau and the National Center for Educational Statistics also jointly sponsor the supplemental questions for October.
1Basic CPS. The monthly CPS collects primarily labor force data about the civilian noninstitutionalized population living in the United States. The institutionalized population, which is excluded from the population universe, is composed primarily of the population in correctional institutions and nursing homes (98 percent of the 4 million institutionalized people in Census 2010). Interviewers ask questions concerning labor force participation about each member 15 years old and over in sample households. Typically, the week containing the nineteenth of the month is the interview week. The week containing the twelfth is the reference week (i.e., the week about which the labor force questions are asked).
The CPS uses a multistage probability sample based on the results of the decennial census, with coverage in all 50 states and the District of Columbia. The sample is continually updated to account for new residential construction. When files from the most recent decennial census become available, the Census Bureau gradually introduces a new sample design for the CPS.
In April 2004, the Census Bureau began phasing out the 1990 sample1 and replacing it with the 2000 sample, creating a mixed sampling frame. Two simultaneous changes occurred during this phase-in period. First, primary sampling units (PSUs)2 selected for only the 2000 design gradually replaced those selected for the 1990 design. This involved 10 percent of the sample. Second, within PSUs selected for both the 1990 and 2000 designs, sample households from the 2000 design gradually replaced sample households from the 1990 design. This involved about 90 percent of the sample. The new sample design was completely implemented by July 2005.
In the first stage of the sampling process, PSUs are selected for sample. The United States is divided into 2,025 PSUs. The PSUs were redefined for this design to correspond to the Office of Management and Budget definitions of Core-Based Statistical Area definitions and to improve efficiency in field operations. These PSUs are grouped into 824 strata. Within each stratum, a single PSU is chosen for the sample, with its probability of selection proportional to its population as of the most recent decennial census. This PSU represents the entire stratum from which it was selected. In the case of strata consisting of only one PSU, the PSU is chosen with certainty.
Approximately 72,000 housing units were selected for sample from the sampling frame in October. Based on eligibility criteria, 11 percent of these housing units were sent directly to computer-assisted telephone interviewing (CATI). The remaining units were assigned to interviewers for computer-assisted personal interviewing (CAPI).3 Of all housing units in sample, about 60,000 were determined to be eligible for interview. Interviewers obtained interviews at about 54,000 of these units. Noninterviews occur when the occupants are not found at home after repeated calls or are unavailable for some other reason.
1October 2013 Supplement. In October 2013, in addition to the basic CPS questions, interviewers asked supplementary questions of household members three years old and over on school enrollment.
1Estimation
Procedure. This survey’s estimation procedure adjusts
weighted sample results to agree with independently derived
population estimates of the civilian noninstitutionalized population
of the United States and each state (including the District of
Columbia). These population estimates, used as controls for the CPS,
are prepared monthly to agree with the most current set of population
estimates that are released as part of the Census Bureau’s
population estimates and projections program.
The population controls for the nation are distributed by demographic characteristics in two ways:
Age, sex, and race (White alone, Black alone, and all other groups combined).
Age, sex, and Hispanic origin.
The population controls for the states are distributed by race (Black alone and all other race groups combined), age (0-15, 16-44, and 45 and over), and sex.
The independent estimates by age, sex, race, and Hispanic origin, and for states by selected age groups and broad race categories, are developed using the basic demographic accounting formula whereby the population from the 2010 Census data is updated using data on the components of population change (births, deaths, and net international migration) with net internal migration as an additional component in the state population estimates.
The net international migration component in the population estimates includes a combination of the following:
Net international migration of the foreign born;
Net migration between the United States and Puerto Rico;
Net migration of natives to and from the United States; and
Net movement of the Armed Forces population to and from the United States.
Because the latest available information on these components lags the survey date, it is necessary to make short-term projections of these components to develop the estimate for the survey date.
1ACCURACY OF THE ESTIMATES
A sample survey estimate has two types of error: sampling and nonsampling. The accuracy of an estimate depends on both types of error. The nature of the sampling error is known given the survey design; the full extent of the nonsampling error is unknown.
Sampling Error. Since the CPS estimates come from a sample, they may differ from figures from an enumeration of the entire population using the same questionnaires, instructions, and enumerators. For a given estimator, the difference between an estimate based on a sample and the estimate that would result if the sample were to include the entire population is known as sampling error. Standard errors, as calculated by methods described in “Standard Errors and Their Use,” are primarily measures of the magnitude of sampling error. However, they may include some nonsampling error.
Nonsampling Error. For a given estimator, the difference between the estimate that would result if the sample were to include the entire population and the true population value being estimated is known as nonsampling error. There are several sources of nonsampling error that may occur during the development or execution of the survey. It can occur because of circumstances created by the interviewer, the respondent, the survey instrument, or the way the data are collected and processed. For example, errors could occur because:
• The interviewer records the wrong answer, the respondent provides incorrect information, the respondent estimates the requested information, or an unclear survey question is misunderstood by the respondent (measurement error).
• Some individuals who should have been included in the survey frame were missed (coverage error).
• Responses are not collected from all those in the sample or the respondent is unwilling to provide information (nonresponse error).
• Values are estimated imprecisely for missing data (imputation error).
• Forms may be lost, data may be incorrectly keyed, coded, or recoded, etc. (processing error).
To minimize these errors, the Census Bureau applies quality control procedures during all stages of the production process including the design of the survey, the wording of questions, the review of the work of interviewers and coders, and the statistical review of reports.
Two types of nonsampling error that can be examined to a limited extent are nonresponse and undercoverage.
Nonresponse. The effect of nonresponse cannot be measured directly, but one indication of its potential effect is the nonresponse rate. For the October 2013 basic CPS, the household-level nonresponse rate was 9.86 percent. The person-level nonresponse rate for the school enrollment supplement was an additional 8.0 percent.
Since the basic CPS nonresponse rate is a household-level rate and the school enrollment supplement nonresponse rate is a person-level rate, we cannot combine these rates to derive an overall nonresponse rate. Nonresponding households may have fewer persons than interviewed ones, so combining these rates may lead to an overestimate of the true overall nonresponse rate for persons for the school enrollment supplement.
Sufficient Partial Interview. A sufficient partial interview is an incomplete interview in which the household or person answered enough of the questionnaire for the supplement sponsor to consider the interview complete. The remaining supplement questions may have been edited or imputed to fill in missing values. Insufficient partial interviews are considered to be nonrespondents. Refer to the supplement overview attachment in the technical documentation for the specific questions deemed critical by the sponsor as necessary to be answered in order to be considered a sufficient partial interview.
1Coverage. The concept of coverage in the survey sampling process is the extent to which the total population that could be selected for sample “covers” the survey’s target population. Missed housing units and missed people within sample households create undercoverage in the CPS. Overall CPS undercoverage for October 2013 is estimated to be about 15 percent. CPS coverage varies with age, sex, and race. Generally, coverage is larger for females than for males and larger for non-Blacks than for Blacks. This differential coverage is a general problem for most household-based surveys.
1The CPS weighting procedure partially corrects for bias from undercoverage, but biases may still be present when people who are missed by the survey differ from those interviewed in ways other than age, race, sex, Hispanic origin, and state of residence. How this weighting procedure affects other variables in the survey is not precisely known. All of these considerations affect comparisons across different surveys or data sources.
1A common measure of survey coverage is the coverage ratio, calculated as the estimated population before poststratification divided by the independent population control. Table 1 shows October 2013 CPS coverage ratios by age and sex for certain race and Hispanic groups. The CPS coverage ratios can exhibit some variability from month to month.
Table 1. CPS Coverage Ratios: October 2013 |
|||||||||||
|
Total |
White only |
Black only |
Residual race |
Hispanic |
||||||
Age group |
All people |
Male |
Female |
Male |
Female |
Male |
Female |
Male |
Female |
Male |
Female |
0-15 |
0.86 |
0.87 |
0.85 |
0.90 |
0.87 |
0.75 |
0.79 |
0.81 |
0.78 |
0.87 |
0.83 |
16-19 |
0.80 |
0.81 |
0.78 |
0.84 |
0.82 |
0.73 |
0.67 |
0.75 |
0.73 |
0.83 |
0.84 |
20-24 |
0.71 |
0.69 |
0.73 |
0.72 |
0.77 |
0.60 |
0.67 |
0.65 |
0.62 |
0.67 |
0.76 |
25-34 |
0.82 |
0.80 |
0.85 |
0.83 |
0.87 |
0.67 |
0.79 |
0.70 |
0.73 |
0.75 |
0.85 |
35-44 |
0.85 |
0.82 |
0.88 |
0.85 |
0.91 |
0.72 |
0.81 |
0.73 |
0.81 |
0.78 |
0.88 |
45-54 |
0.88 |
0.87 |
0.89 |
0.89 |
0.89 |
0.76 |
0.87 |
0.85 |
0.83 |
0.81 |
0.82 |
55-64 |
0.88 |
0.86 |
0.89 |
0.88 |
0.91 |
0.81 |
0.86 |
0.80 |
0.82 |
0.80 |
0.95 |
65+ |
0.92 |
0.93 |
0.91 |
0.94 |
0.91 |
0.94 |
0.96 |
0.86 |
0.80 |
0.84 |
0.82 |
15+ |
0.85 |
0.84 |
0.87 |
0.86 |
0.88 |
0.74 |
0.82 |
0.76 |
0.77 |
0.78 |
0.85 |
0+ |
0.85 |
0.85 |
0.86 |
0.87 |
0.88 |
0.74 |
0.81 |
0.77 |
0.77 |
0.80 |
0.85 |
Notes: (1) The Residual race group includes cases indicating a single race other than White or Black,
and cases indicating two or more races.
Hispanics may be any race. For a more detailed discussion on the use of parameters for
race and ethnicity, please see the “Generalized Variance Parameters” section.
1Comparability of Data. Data obtained from the CPS and other sources are not entirely comparable. This results from differences in interviewer training and experience and in differing survey processes. This is an example of nonsampling variability not reflected in the standard errors. Therefore, caution should be used when comparing results from different sources.
Data users should be careful when comparing the data from this microdata file, which reflects Census 2010-based controls, with microdata files from January 2003 through December 2011, which reflect 2000 census-based controls. Ideally, the same population controls should be used when comparing any estimates. In reality, the use of the same population controls is not practical when comparing trend data over a period of 10 to 20 years. Thus, when it is necessary to combine or compare data based on different controls or different designs, data users should be aware that changes in weighting controls or weighting procedures can create small differences between estimates. See the discussion following for information on comparing estimates derived from different controls or different sample designs.
Microdata files from previous years reflect the latest available census-based controls. Although the most recent change in population controls had relatively little impact on summary measures such as averages, medians, and percentage distributions, it did have a significant impact on levels. For example, use of Census 2010-based controls results in about a 0.2 percent increase from the 2000 census-based controls in the civilian noninstitutionalized population and in the number of families and households. Thus, estimates of levels for data collected in 2012 and later years will differ from those for earlier years by more than what could be attributed to actual changes in the population. These differences could be disproportionately greater for certain population subgroups than for the total population.
Users should also exercise caution because of changes caused by the phase-in of the Census 2000 files (see “Basic CPS”)4. During this time period, CPS data were collected from sample designs based on different censuses. Three features of the new CPS design have the potential of affecting published estimates: (1) the temporary disruption of the rotation pattern from August 2004 through June 2005 for a comparatively small portion of the sample, (2) the change in sample areas, and (3) the introduction of the new Core-Based Statistical Areas (formerly called metropolitan areas). Most of the known effect on estimates during and after the sample redesign will be the result of changing from 1990 to 2000 geographic definitions. Research has shown that the national-level estimates of the metropolitan and nonmetropolitan populations should not change appreciably because of the new sample design. However, users should still exercise caution when comparing metropolitan and nonmetropolitan estimates across years with a design change, especially at the state level.
Caution should also be used when comparing Hispanic estimates over time. No independent population control totals for people of Hispanic origin were used before 1985.
1A Nonsampling Error Warning. Since the full extent of the nonsampling error is unknown, one should be particularly careful when interpreting results based on small differences between estimates. The Census Bureau recommends that data users incorporate information about nonsampling errors into their analyses, as nonsampling error could impact the conclusions drawn from the results. Caution should also be used when interpreting results based on a relatively small number of cases. Summary measures (such as medians and percentage distributions) probably do not reveal useful information when computed on a subpopulation smaller than 75,000.
For additional information on nonsampling error including the possible impact on CPS
data when known, refer to references [2] and [3].
1Standard Errors and Their Use. 1The sample estimate and its standard error enable one to construct a confidence interval. A confidence interval is a range about a given estimate that has a specified probability of containing the average result of all possible samples. For example, if all possible samples were surveyed under essentially the same general conditions and using the same sample design, and if an estimate and its standard error were calculated from each sample, then approximately 90 percent of the intervals from 1.645 standard errors below the estimate to 1.645 standard errors above the estimate would include the average result of all possible samples.
1A particular confidence interval may or may not contain the average estimate derived from all possible samples, but one can say with specified confidence that the interval includes the average estimate calculated from all possible samples.
1Standard errors may also be used to perform hypothesis testing, a procedure for distinguishing between population parameters using sample estimates. The most common type of hypothesis is that the population parameters are different. An example of this would be comparing the percentage of men who were part-time workers to the percentage of women who were part-time workers.
1Tests may be performed at various levels of significance. A significance level is the probability of concluding that the characteristics are different when, in fact, they are the same. For example, to conclude that two characteristics are different at the 0.10 level of significance, the absolute value of the estimated difference between characteristics must be greater than or equal to 1.645 times the standard error of the difference.
1The Census Bureau uses 90-percent confidence intervals and 0.10 levels of significance to determine statistical validity. Consult standard statistical textbooks for alternative criteria.
1Estimating Standard Errors. The Census Bureau uses replication methods to estimate the standard errors of CPS estimates. These methods primarily measure the magnitude of sampling error. However, they do measure some effects of nonsampling error as well. They do not measure systematic biases in the data associated with nonsampling error. Bias is the average over all possible samples of the differences between the sample estimates and the true value.
1Generalized Variance Parameters. While it is possible to compute and present an estimate of the standard error based on the survey data for each estimate in a report, there are a number of reasons why this is not done. A presentation of the individual standard errors would be of limited use, since one could not possibly predict all of the combinations of results that may be of interest to data users. Additionally, data users have access to CPS microdata files, and it is impossible to compute in advance the standard error for every estimate one might obtain from those data sets. Moreover, variance estimates are based on sample data and have variances of their own. Therefore, some methods of stabilizing these estimates of variance, for example, by generalizing or averaging over time, may be used to improve their reliability.
1Experience has shown that certain groups of estimates have similar relationships between their variances and expected values. Modeling or generalizing may provide more stable variance estimates by taking advantage of these similarities. The generalized variance function is a simple model that expresses the variance as a function of the expected value of the survey estimate. The parameters of the generalized variance function are estimated using direct replicate variances. These generalized variance parameters provide a relatively easy method to obtain approximate standard errors for numerous characteristics. In this source and accuracy statement, Table 4 provides the generalized variance parameters for labor force estimates, and Table 5 provides generalized variance parameters for characteristics from the October 2013 supplement. Also, tables are provided that allow the calculation of parameters for prior years and parameters for U.S. states and regions. Tables 6 and 7 provide factors to derive prior year parameters. Table 8 provides factors and population controls to derive U.S. state and regional parameters.
The basic CPS questionnaire records the race and ethnicity of each respondent. With respect to race, a respondent can be White, Black, Asian, American Indian and Alaskan Native (AIAN), Native Hawaiian and Other Pacific Islander (NHOPI), or combinations of two or more of the preceding. A respondent’s ethnicity can be Hispanic or non-Hispanic, regardless of race.
The generalized variance parameters to use in computing standard errors are dependent upon the race/ethnicity group of interest. The following table summarizes the relationship between the race/ethnicity group of interest and the generalized variance parameters to use in standard error calculations.
Table 2. Estimation Groups of Interest and Generalized Variance Parameters |
|
Race/ethnicity group of interest |
Generalized variance parameters to use in standard error calculations |
Total population |
Total or White |
White alone, White AOIC, or White non-Hispanic population |
Total or White |
Black alone, Black AOIC, or Black non-Hispanic population |
Black |
Asian alone, Asian AOIC, or Asian non-Hispanic population |
Asian, AIAN, NHOPI |
AIAN alone, AIAN AOIC, or AIAN non-Hispanic population |
|
NHOPI alone, NHOPI AOIC, or NHOPI non-Hispanic population |
|
Populations from other race groups |
Asian, AIAN, NHOPI |
Hispanic population |
Hispanic |
Two or more races – employment/unemployment and educational attainment characteristics |
Black |
Two or more races – all other characteristics |
Asian, AIAN, NHOPI |
Notes: (1) AIAN is American Indian and Alaska Native and NHOPI is Native Hawaiian and Other Pacific Islander.
AOIC is an abbreviation for alone or in combination. The AOIC population for a race group of interest includes people reporting only the race group of interest (alone) and people reporting multiple race categories including the race group of interest (in combination).
Hispanics may be any race.
Two or more races refers to the group of cases self-classified as having two or more races.
Standard Errors of Estimated Numbers. The approximate standard error, sx, of an estimated number from this microdata file can be obtained by using the formula:
(1)
1Here x is the size of the estimate and a and b are the parameters in Table 4 associated with the particular type of characteristic. When calculating standard errors from cross-tabulations involving different characteristics, use the set of parameters for the characteristic that will give the largest standard error.
1Illustration 1
Suppose there were 5,858,000 unemployed men (ages 16 and up) in the civilian labor force. Use the appropriate parameters from Table 4 and Formula (1) to get
Illustration 1 |
|
Number of unemployed males in the civilian labor force (x) |
5,858,000 |
a parameter (a) |
-0.000032 |
b parameter (b) |
2,971 |
Standard error |
128,000 |
90-percent confidence interval |
5,647,000 to 6,069,000 |
The standard error is calculated as
1The 90-percent confidence interval is calculated as 5,858,000 ± 1.645 × 128,000.
1A conclusion that the average estimate derived from all possible samples lies within a range computed in this way would be correct for roughly 90 percent of all possible samples.
1Standard Errors of Estimated School Enrollment Numbers. The approximate standard error, sx, of an estimated school enrollment number from this microdata file can be obtained by using the formula:
(2)
Here x is the size of the estimate, T is the population control in Table 3 for the total number of persons in a specific age group and b is the parameter in Table 5 associated with the particular type of characteristic. If Table 3 does not contain the age group of interest, use the smallest age group available in the table that does contain the age group of interest. When calculating standard errors for numbers from cross-tabulations involving different characteristics, use the set of parameters for the characteristic that will give the largest standard error.
Table 3. Population Controls for School Enrollment Age Groups: October 2013 |
||||
Age group |
Total or White |
Black |
Asian, AIAN, NHOPI |
Hispanic |
3+ |
300,823,503 |
67,022,486 |
27,909,820 |
53,808,926 |
3-4 |
9,104,972 |
3,356,476 |
1,536,653 |
5,156,498 |
3-6 |
18,468,594 |
6,748,256 |
3,058,357 |
10,303,707 |
3-17 |
21,469,534 |
6,748,256 |
3,058,357 |
10,303,707 |
3-24 |
71,163,498 |
19,645,959 |
8,491,670 |
19,667,631 |
5-24 |
93,246,754 |
25,365,494 |
10,770,461 |
24,267,716 |
6-13 |
84,141,782 |
22,009,018 |
9,233,808 |
19,111,218 |
14-17 |
34,093,693 |
9,886,807 |
4,348,728 |
10,856,770 |
15+ |
263,003,252 |
55,961,320 |
22,942,000 |
43,505,219 |
15-17 |
25,002,326 |
6,402,676 |
2,606,289 |
9,363,924 |
15-19 |
250,958,082 |
53,152,127 |
22,024,439 |
43,505,219 |
15-24 |
21,298,077 |
5,775,600 |
2,606,289 |
9,363,924 |
16-17 |
21,298,077 |
5,775,600 |
2,606,289 |
9,363,924 |
16-24 |
43,381,333 |
11,495,135 |
4,885,080 |
13,964,009 |
18-19 |
16,720,729 |
4,270,501 |
1,729,154 |
3,654,363 |
18-24 |
38,803,985 |
9,990,036 |
4,007,945 |
8,254,448 |
20-21 |
16,720,729 |
4,270,501 |
1,729,154 |
3,654,363 |
20-24 |
38,803,985 |
9,990,036 |
4,007,945 |
8,254,448 |
22-24 |
22,083,256 |
5,719,535 |
2,278,791 |
4,600,085 |
25+ |
22,083,256 |
5,719,535 |
2,278,791 |
4,600,085 |
25-29 |
22,083,256 |
5,719,535 |
2,278,791 |
4,600,085 |
25-34 |
207,576,749 |
41,656,992 |
17,139,359 |
29,541,210 |
30-34 |
20,972,036 |
5,142,371 |
2,266,434 |
4,310,623 |
35+ |
41,741,181 |
10,141,109 |
4,516,349 |
8,603,957 |
Notes: (1) AIAN is American Indian and Alaska Native and NHOPI is Native Hawaiian and Other Pacific Islander.
(2) Hispanics may be any race.
Illustration 2
Suppose there were 4,449,000 three and four year olds enrolled in school and 8,101,000 total children in that age group. Use the appropriate b parameter from Table 5 and Formula (2) to get
Illustration 2 |
|
Number of three and four year olds enrolled in school (x) |
4,449,000 |
Total (T) |
8,101,000 |
b parameter (b) |
2,453 |
Standard error |
70,000 |
90-percent confidence interval |
4,334,000 to 4,564,000 |
The standard error is calculated as
1The 90-percent confidence interval is calculated as 4,449,000 ± 1.645 × 70,000.
A conclusion that the average estimate derived from all possible samples lies within a range computed in this way would be correct for roughly 90 percent of all possible samples.
1Standard Errors of Estimated Percentages. The reliability of an estimated percentage, computed using sample data for both numerator and denominator, depends on both the size of the percentage and its base. Estimated percentages are relatively more reliable than the corresponding estimates of the numerators of the percentages, particularly if the percentages are 50 percent or more. When the numerator and denominator of the percentage are in different categories, use the parameter from Table 4 or 5 as indicated by the numerator.
1The approximate standard error, sy,p, of an estimated percentage can be obtained by using the formula:
(3)
1Here y is the total number of people, families, households, or unrelated individuals in the base of the percentage, p is the percentage (0 ≤ p ≤ 100), and b is the parameter in Table 4 or 5 associated with the characteristic in the numerator of the percentage.
1Illustration 3
Suppose there were 17,051,000 people aged 18 to 21, and 48.5 percent were enrolled in college. Use the appropriate parameter from Table 5 and Formula (3) to get
Illustration 3 |
|
Percentage of people aged 18-21 enrolled in college (p) |
48.5 |
Base (y) |
17,051,000 |
b parameter (b) |
2,131 |
Standard error |
0.56 |
90-percent confidence interval |
47.60 to 49.40 |
1The standard error is calculated as
1The 90-percent confidence interval for the estimated percentage of people aged 18 to 21 enrolled in college is from 47.60 to 49.40 percent (i.e., 48.5 ± 1.645 × 0.56).
1Standard Errors of Estimated Differences. The standard error of the difference between two sample estimates is approximately equal to
(4)
where sx1 and sx2 are the standard errors of the estimates, x1 and x2. The estimates can be numbers, percentages, ratios, etc. This will result in accurate estimates of the standard error of the same characteristic in two different areas, or for the difference between separate and uncorrelated characteristics in the same area. However, if there is a high positive (negative) correlation between the two characteristics, the formula will overestimate (underestimate) the true standard error.
1Illustration 4
Suppose that of the 6,990,000 employed men between 20-24 years of age, 30.6 percent were part‑time workers, and of the 6,696,000 employed women between 20-24 years of age, 44.5 percent were part-time workers. Use the appropriate parameters from Table 4 and Formulas (3) and (4) to get
Illustration 4 |
|||
|
Male (x1) |
Female (x2) |
Difference |
Percentage working part‑time (p) |
30.6 |
44.5 |
13.9 |
Base (y) |
6,990,000 |
6,696,000 |
- |
b parameter (b) |
2,971 |
2,782 |
- |
Standard error |
0.95 |
1.01 |
1.39 |
90-percent confidence interval |
29.04 to 32.16 |
42.84 to 46.16 |
11.61 to 16.19 |
The standard error of the difference is calculated as
1The 90-percent confidence interval around the difference is calculated as 13.9 ± 1.645 × 1.39. Since this interval does not include zero, we can conclude with 90 percent confidence that the percentage of part-time women workers between 20-24 years of age is greater than the percentage of part-time men workers between 20-24 years of age.
Standard Errors of Quarterly or Yearly Averages. For information on calculating standard errors for labor force data from the CPS which involve quarterly or yearly averages, please see the “Explanatory Notes and Estimates of Error: Household Data” section in Employment and Earnings, a monthly report published by the U.S. Bureau of Labor Statistics.
Technical Assistance. If you require assistance or additional information, please contact the Demographic Statistical Methods Division via e-mail at [email protected].
11
Table 4. Parameters for Computation of Standard Errors for Labor Force Characteristics: October 2013 |
||
Characteristic |
a |
b |
|
|
|
Total or White |
|
|
|
|
|
Civilian labor force, employed |
-0.000016 |
3,068 |
Not in labor force |
-0.000009 |
1,833 |
Unemployed |
-0.000016 |
3,096 |
|
|
|
Civilian labor force, employed, not in labor force, and unemployed |
|
|
Men |
-0.000032 |
2,971 |
Women |
-0.000031 |
2,782 |
Both sexes, 16 to 19 years |
-0.000022 |
3,096 |
|
|
|
Black |
|
|
|
|
|
Civilian labor force, employed, not in labor force, and unemployed |
|
|
Total |
-0.000151 |
3,455 |
Men |
-0.000311 |
3,357 |
Women |
-0.000252 |
3,062 |
Both sexes, 16 to 19 years |
-0.001632 |
3,455 |
|
|
|
Hispanic, may be of any race |
|
|
|
|
|
Civilian labor force, employed, not in labor force, and unemployed |
|
|
Total |
-0.000141 |
3,455 |
Men |
-0.000253 |
3,357 |
Women |
-0.000266 |
3,062 |
Both sexes, 16 to 19 years |
-0.001528 |
3,455 |
|
|
|
Asian, American Indian and Alaska Native, Native Hawaiian and Other Pacific Islander |
|
|
|
|
|
Civilian labor force, employed, not in labor force, and unemployed |
|
|
Total |
-0.000346 |
3,198 |
Men |
-0.000729 |
3,198 |
Women |
-0.000659 |
3,198 |
Both sexes, 16 to 19 years |
-0.004146 |
3,198 |
|
|
|
Notes: (1) These parameters are to be applied to basic CPS monthly labor force estimates.
The Total or White, Black, and Asian, AIAN, NHOPI parameters are to be used for both alone and in combination race group estimates.
For nonmetropolitan characteristics, multiply the a and b parameters by 1.5. If the characteristic of interest is total state population, not subtotaled by race or ethnicity, the a and b parameters are zero.
For foreign-born and noncitizen characteristics for Total and White, the a and b parameters should be multiplied by 1.3. No adjustment is necessary for foreign-born and noncitizen characteristics for Black, Hispanic, and Asian, AIAN, NHOPI parameters.
For the groups self-classified as having two or more races, use the Asian, AIAN, NHOPI parameters for all employment characteristics.
1Table 5. Parameters for Computation of Standard Errors for School Enrollment Characteristics: October 2013 |
||||
Characteristics |
b |
|||
Total or White |
Black |
Asian, AIAN, NHOPI |
Hispanic |
|
PEOPLE |
||||
Persons enrolled in school: |
|
|
|
|
Total............................................................ |
2,131 |
2,410 |
2,410 |
2,744 |
Children 13 and under................................ |
2,453 |
2,775 |
2,775 |
3,159 |
|
|
|
|
|
Marital status, household and family |
|
|
|
|
characteristics, health insurance |
|
|
|
|
Some household members.......................... |
4,687 |
6,733 |
6,733 |
11,347 |
All household members.............................. |
5,695 |
9,929 |
9,929 |
16,733 |
|
|
|
|
|
FAMILIES, HOUSEHOLDS, OR UNRELATED INDIVIDUALS |
||||
Income, earnings.............................................. |
2,016 |
2,201 |
2,201 |
3,709 |
|
|
|
|
|
Marital status, household and family |
|
|
|
|
characteristics, educational attainment, |
|
|
|
|
population by age/sex....................... |
1,860 |
1,683 |
1,683 |
2,836 |
|
|
|
|
|
Notes: (1) These parameters are to be applied to the October 2013 School Enrollment Supplement data.
AIAN is American Indian and Alaska Native and NHOPI is Native Hawaiian and Other Pacific Islander.
(3) Hispanics may be any race. For a more detailed discussion on the use of parameters for race and ethnicity, please see the “Generalized Variance Parameters” section.
(4) The Total or White, Black, and Asian, AIAN, NHOPI parameters are to be used for both alone and in combination race group estimates.
(5) For nonmetropolitan characteristics, multiply the a and b parameters by 1.5. If the characteristic of interest is total state population, not subtotaled by race or ethnicity, the a and b parameters are zero.
(6) For foreign-born and noncitizen characteristics for Total and White, the a and b parameters should be multiplied by 1.3. No adjustment is necessary for foreign-born and noncitizen characteristics for Black, Asian, AIAN, NHOPI, and Hispanic parameters.
(7) For the group self-classified as having two or more races, use the Asian, AIAN, NHOPI parameters for all characteristics except employment, unemployment, and educational attainment, in which case use Black parameters.
In 1994, we calculated school enrollment parameters directly from the 1994 CPS data. Since that time, the school enrollment parameters have been based on these updated parameters. Therefore, when calculating past school enrollment parameters, a separate set of year factors should be used.
Table 6 shows the prior year factors to apply to the non-school enrollment parameters.
Table 6. Year Factors for Non-School Enrollment Characteristics (1942-2013) |
|||||
Time Period |
Total or White |
Black |
Asian, AIAN, NHOPI |
Hispanic |
|
a and b |
a |
b |
a and b |
a and b |
|
January 2003 – Present |
1.00 |
1.00 |
1.00 |
1.00 |
1.00 |
July 2001 – December 2002 (SCHIP) |
1.00 |
1.20 |
1.00 |
NA |
1.00 |
January 1996 - June 2001 (Non-SCHIP) |
1.11 |
1.33 |
1.11 |
NA |
1.11 |
April 1989 - December 1995 |
1.03 |
1.23 |
1.03 |
NA |
1.03 |
April 1988 - March 1989 |
1.14 |
1.37 |
1.14 |
NA |
1.20 |
January 1985 - March 1988 |
0.96 |
1.15 |
0.96 |
NA |
0.96 |
January 1982 - December 1984 |
0.96 |
1.15 |
0.96 |
NA |
1.35 |
March 1973 - December 1981 |
0.86 |
1.03 |
0.86 |
NA |
1.20 |
January 1967 - February 1973 |
0.86 |
1.03 |
0.86 |
NA |
1.20 |
May 1956 - December 1966 |
1.29 |
1.55 |
1.29 |
NA |
1.81 |
August 1942 - April 1956 |
1.93 |
2.32 |
1.96 |
NA |
2.71 |
Notes: (1) These factors are for use with the 2013 non-School Enrollment a and b parameters. (2) AIAN, NHOPI are American Indian and Alaska Native, Native Hawaiian and Other Pacific Islander.
and ethnicity, please see the “Generalized Variance Parameters” section. |
Table 7 shows prior year factors to apply to school enrollment parameters.
Table 7. Year Factors for School Enrollment Characteristics (1945-2013) |
|||||
Time Period |
Total or White |
Black |
Asian, AIAN, NHOPI |
Hispanic |
|
a and b |
a |
b |
a and b |
a and b |
|
January 2003 – Present |
1.00 |
1.00 |
1.00 |
1.00 |
1.00 |
July 2001 - December 2002 (SCHIP) |
1.00 |
1.20 |
1.00 |
NA |
1.00 |
January 1996 - June 2001 (Non-SCHIP) |
1.11 |
1.33 |
1.11 |
NA |
1.11 |
March 1995 - December 1995 |
1.03 |
1.23 |
1.03 |
NA |
1.03 |
April 1989 - February 1995 |
1.19 |
1.70 |
1.42 |
NA |
2.10 |
April 1988 - March 1989 |
1.32 |
1.89 |
1.58 |
NA |
2.45 |
January 1985 - March 1988 |
1.11 |
1.60 |
1.33 |
NA |
1.97 |
January 1982 - December 1984 |
1.11 |
1.60 |
1.33 |
NA |
2.76 |
March 1973 - December 1981 |
0.99 |
1.43 |
1.19 |
NA |
2.46 |
January 1967 - February 1973 |
0.99 |
1.43 |
1.19 |
NA |
2.46 |
May 1956 - December 1966 |
1.49 |
2.14 |
1.78 |
NA |
3.69 |
October 1945 - April 1956 |
2.24 |
3.21 |
2.67 |
NA |
5.54 |
Notes: (1) These factors are for use with the 2013 School Enrollment a and b parameters. (2) AIAN, NHOPI are American Indian and Alaska Native, Native Hawaiian and Other Pacific Islander.
|
1Table 8 provides the U.S. regional factors to apply to parameters in order to calculate standard errors for U.S. regional estimates.
Table 8. Regional Factors to Apply To 2013 Parameters |
|
Type of characteristic |
Factor |
U. S. totals |
1.00 |
Regions: Northeast |
1.06 |
Midwest |
1.06 |
South |
1.07 |
West |
1.02 |
REFERENCES
[1] Bureau of Labor Statistics. 2004. Employment and Earnings. “Redesign of the Sample for the Current Population Survey.” Volume 51 Number 11, May 2004. Washington, DC: Government Printing Office. pp 4-6.
[2] U.S. Census Bureau. 2006. Current Population Survey: Design and Methodology. Technical Paper 66. Washington, DC: Government Printing Office. (http://www.census.gov/prod/2006pubs/tp-66.pdf)
[3] Brooks, C.A. and Bailar, B.A. 1978. Statistical Policy Working Paper 3 - An Error
Profile: Employment as Measured by the Current Population Survey. Subcommittee on
Nonsampling Errors, Federal Committee on Statistical Methodology, U.S. Department of
Commerce, Washington, DC. (http://www.fcsm.gov/working-papers/spp.html)
[4] U.S. Census Bureau, July 15, 2009, “Estimating ASEC Variances with Replicate Weights Part I: Instructions for Using the ASEC Public Use Replicate Weight File to Create ASEC Variance Estimates.”
http://usa.ipums.org/usa/repwt/Use_of_the_Public_Use_Replicate_Weight_File_final_PR.doc
Accessed: February 3, 2012
1 For detailed information on the 2000 sample redesign, please see reference [1].
2 The PSUs correspond to substate areas (i.e., counties or groups of counties) that are geographically contiguous.
3 For further information on CATI and CAPI and the eligibility criteria, please see reference [2].
4 The phase-in process using the 2010 Census files will begin April 2014.
File Type | application/vnd.openxmlformats-officedocument.wordprocessingml.document |
File Title | Template Source and Accuracy Statement |
Author | John M. Finamore |
File Modified | 0000-00-00 |
File Created | 2021-01-24 |