|
|
|
SUPPORTING STATEMENT B
U.S. Department of Commerce
U.S. Census Bureau
The American Community Survey
OMB Control No. 0607-0810
B. Collections of Information Employing Statistical Methods
ACS Households
The U.S. Census Bureau samples about 298,000 housing unit (HU) addresses each month including Puerto Rico; about 293,500 are mailable addresses. The mailing operations are conducted through the U.S. Postal Service and use first-class postage rates for all pieces. For addresses that were mailed materials but did not respond by mail, internet, or by calling our telephone questionnaire assistance line, the Census Bureau selects a subsample of all households and assigns them to the nonresponse follow-up data collection mode (also referred to as CAPI). Unmailable household addresses are sampled and included in the nonresponse follow-up data collection mode.
In 2022, the HU sample yielded an estimated combined response rate of 63 percent for the self-response modes. The HU nonresponse follow-up follow-up yielded an estimated response rate of approximately 62 percent in 2022. The 2022 final weighted HU response rate1 for ACS was 83 percent.
ACS Group Quarters
In addition to the ACS data collection from households, the data are also collected from a sample of group quarters (GQ) facilities and residents. Interviewers use the CAPI Group Quarters Facility questionnaire (GQFQ) in English or Spanish when making initial telephone contact to schedule an appointment and to conduct a telephone or personal visit at the sample GQ and also to generate the subsample of persons for ACS interviews. The GQ has the option to provide the interviewer with a hard copy of the listing information or upload an electronic listing of the residents in the online eListing application to generate the sub-sample of persons for ACS interviews. An introductory letter is mailed to the sample GQ approximately two weeks prior to the period when an interviewer may begin making contact with the GQ. The Spanish GQFQ instrument is used for ACS data collection at Puerto Rico GQs. A subset of the ACS HU questions is used for the interviews with sample residents in GQs. Resident-level personal interviews with sampled GQ residents are conducted in English and Spanish using CAPI, but bilingual (English/Spanish) paper questionnaires can also be used for self-response. Beginning in 2024, GQ residents in some GQs will have the option to self-respond to the survey online. The GQ CAPI, internet, and paper questionnaires contain questions for one person. The GQ CAPI also excludes certain questions for residents and institutional group quarters that are out of scope to reduce burden. Interviewers may call or conduct additional personal visits to the GQ and/or sample residents to obtain missing or incomplete ACS GQ forms until the closeout of each sample panel.
In 2022, the final weighted GQ response rate2 for ACS was 82.1 percent.
Describe the procedures for the collection of information including:
Statistical methodology for stratification and sample selection,
Estimation procedure,
Degree of accuracy needed for the purpose described in the justification,
Unusual problems requiring specialized sampling procedures, and
Any use of periodic (less frequent than annual) data collection cycles to reduce burden.
The ACS employs a two-phase, two-stage sample design. The first-phase sample consists of two separate address samples: Period 1 and Period 2. These samples are chosen at different points in time. Both samples are selected in two stages of sampling, a first-stage and a second-stage. Subsequent to second-stage sampling, the majority of sample addresses are randomly assigned to one of the 12 months of the sample year (the exception is for addresses in remote Alaska, which are assigned to either January or July). The second-phase of sampling occurs when the CAPI sample is selected. This document describes the sampling for 2022; the methods used in 2025 are the same.
The Period 1 sample is selected during September and October of the year prior to the sample year (e.g., the 2022 Period 1 sample was selected in September and October of 2021). Approximately half of a year’s sample is selected at this time. Sample addresses that are not in remote Alaska are randomly assigned to one of the first six months of the sample year; sample addresses in remote Alaska are assigned to the first six months as a whole to address access issues.
Period 2 sampling occurs in February and March of the sample year (e.g., the 2022 Period 2 sample was selected during February and March of 2022). This sample accounts for the remaining half of the overall first-phase sample. Period 2 sample addresses that are not in remote Alaska are randomly assigned to one of the last six months of the sample year; Period 2 sample addresses in remote Alaska are assigned to the last six months as a whole.3
A subsample of nonresponding addresses and of any addresses deemed unmailable is selected for the CAPI data collection mode.4
The following steps are used to select the first-phase and second-phase samples in both periods.
First stage sampling defines the universe for the second stage of sampling through three steps. First, all addresses that were in a first-phase sample within the past four years are excluded from eligibility. This ensures that no address is in sample more than once every five years. The second step is to select a 20 percent systematic sample of “new” units, i.e., those units that have never appeared on a previous Master Address File extract. Each new address is systematically assigned either to the current year or to one of four back-samples. This procedure maintains five equal partitions (samples) of the universe. The third step is to randomly assign all eligible addresses to a partition.5
Second-stage sampling uses 16 sampling strata in the United States.6 The stratum-level rates used in second-stage sampling account for the first-stage selection probabilities. These rates are applied at a block level to addresses in the United States by calculating a measure of size for each of the following geographic entities:
Counties.
Places.
School Districts (elementary, secondary, and unified).
American Indian Areas (AI).
Tribal Subdivisions.
Alaska Native Village Statistical Areas (ANVSA).
Hawaiian Homelands (HH).
Minor Civil Divisions – in Connecticut, Maine, Massachusetts, Michigan, Minnesota, New Hampshire, New Jersey, New York, Pennsylvania, Rhode Island, Vermont, and Wisconsin.7
Census
Designated Places – in Hawaii only.
The measure of size for all areas is an estimate of the number of occupied HUs in the area except for AI areas, Tribal Subdivisions, ANVSA, and HH. This is calculated by multiplying the number of ACS addresses by an estimated occupancy rate at the block level. A measure of size for each census tract is also calculated in the same manner.
For AI areas, Tribal Subdivisions areas, and ANVSA, the measure of size is the estimated number of occupied HUs multiplied by the proportion of people reporting AI or Alaska Native (alone or in combination) in the 2020 Census.
For HH, the measure of size is the estimated number of occupied HUs multiplied by the proportion of people reporting Native Hawaiian (alone or in combination) in the 2020 Census.
Each block is then assigned the smallest positive, nonzero measure of size from the set of all entities of which it is a part. The 2022 second-stage sampling strata and the overall first-phase sampling rates are shown in Table 1. Table 2 includes the rates used in Puerto Rico.
The overall first-phase sampling rates are calculated using the distribution of ACS valid addresses by second-stage sampling stratum in such a way as to yield an overall target sample size for the year of 3,576,000 (1,788,000 for each period) in the United States and Puerto Rico. The first-phase rates are adjusted for the first-stage sample to yield the second-stage selection probabilities.
Table 1. First-phase Sampling Rate Categories for the United States
Sampling |
Type of Area |
Rate Definitions |
2022 Sampling Rates Period 1 |
|
|
Stratum |
|
|
|
|
|
1 |
0 < MOS1 < 200 |
15.00% |
15.00% |
15.00% |
|
2 |
200 ≤ MOS < 400 |
10.00% |
10.00% |
10.00% |
|
3 |
400 ≤ MOS < 800 |
7.00% |
7.00% |
7.00% |
|
4 |
800 ≤ MOS < 1200 |
2.80 × BR |
3.95% |
3.94% |
|
5 |
1200 ≤ MOS and 0 < TRACTMOS2 < 400 |
3.50 × BR |
4.93% |
4.92% |
|
6 |
1200 ≤ MOS and 0 < TRACTMOS < 400 HR3 |
0.92 × 3.50 × BR |
4.54% |
4.53% |
|
7 |
1200 ≤ MOS and 400 ≤ TRACTMOS < 1000 |
2.80 × BR |
3.95% |
3.94% |
|
8 |
1200 ≤ MOS and 400 ≤ TRACTMOS < 1000 HR |
0.92 × 2.80 × BR |
3.63% |
3.62% |
|
9 |
1200 ≤ MOS and 1000 ≤ TRACTMOS < 2000 |
1.70 × BR |
2.40% |
2.39% |
|
10 |
1200 ≤ MOS and 1000 ≤ TRACTMOS < 2000 HR |
0.92 × 1.70 × BR |
2.20% |
2.20% |
|
11 |
1200 ≤ MOS and 2000 ≤ TRACTMOS < 4000 |
BR4 |
1.41% |
1.41% |
|
12 |
1200 ≤ MOS and 2000 ≤ TRACTMOS < 4000 HR |
0.92 × BR |
1.30% |
1.29% |
|
13 |
1200 ≤ MOS and 4000 ≤ TRACTMOS < 6000 |
0.60 × BR |
0.85% |
0.84% |
|
14 |
1200 ≤ MOS and 4000 ≤ TRACTMOS < 6000 HR |
0.92 × 0.60 × BR |
0.78% |
0.78% |
|
15 |
1200 ≤ MOS and 6000 ≤ TRACTMOS |
0.35 × BR |
0.49% |
0.49% |
|
16 |
1200 ≤ MOS and 6000 ≤ TRACTMOS HR |
0.92 × 0.35 × BR |
0.45% |
0.45% |
1MOS = measure of size (estimated number occupied housing units) of the smallest governmental entity
2TRACTMOS = the measure of size (MOS) at the census tract level
3HR = areas where predicted levels of completed mail and internet interviews are > 60%
4BR = base sampling rate
Table 2. First-phase Sampling Rate Categories for Puerto Rico
Sampling |
Type of Area |
Rate Definitions |
2022 Sampling Rates |
|
|
Stratum |
|
|
Period 1 |
Period 2 |
|
3 |
400 ≤ MOS1 < 800 |
7.00% |
7.00% |
7.00% |
|
5 |
1200 ≤ MOS and 0 < TRACTMOS2 < 400 |
3.50 × BR |
4.82% |
4.82% |
|
7 |
1200 ≤ MOS and 400 ≤ TRACTMOS < 1000 |
2.80 × BR |
3.85% |
3.85% |
|
9 |
1200 ≤ MOS and 1000 ≤ TRACTMOS < 2000 |
1.70 × BR |
2.34% |
2.34% |
|
11 |
1200 ≤ MOS and 2000 ≤ TRACTMOS < 4000 |
BR3 |
1.38% |
1.38% |
|
13 |
1200 ≤ MOS and 4000 ≤ TRACTMOS < 6000 |
0.60 × BR |
0.83% |
0.83% |
1MOS = measure of size (estimated number occupied housing units) of the smallest governmental entity
2TRACTMOS = the measure of size (MOS) at the census tract level
3BR = base sampling rate
After each block is assigned to a second-stage sampling stratum in each period, a systematic sample of addresses is selected from the second-stage universe (equal to the first-stage sample) within each county and county equivalent.
After the second stage of sampling, sample addresses selected during Period 1 sampling that are not in remote Alaska are allocated to one of the first six months of the sample year. Sample addresses selected during Period 2 sampling that are not in remote Alaska are assigned to a month between July through December, inclusively. Sample addresses in remote Alaska are assigned to January or July in Period 1 and Period 2 sampling, respectively.
Due to the difficulties in field operations during specific months of the year and the extremely seasonal population in these areas, data collection operations in Remote Alaska differ from the rest of the country. In both the main and supplemental HU address samples, the month assigned for each Remote Alaska HU address is based on the county, place, ANVSA, or block group (in that order) in which it is contained. The Census Bureau assigns all designated addresses located in each of these geographical entities to either the first half or the second half of the year in such a way as to balance workloads between the halves of the year and to keep groups of cases together geographically. Addresses are sorted for each month by county and geographical order in the address frame and all sample addresses are sent directly to CAPI (bypassing mail for the HU sample) in the appropriate month
The addresses from which CAPI subsamples are selected can be divided into two groups. (1) the first group includes addresses that are not eligible for any other data collection operation; these consist of a) unmailable addresses and b) those in remote Alaska areas. (2) the second group includes addresses that are eligible for the other data collection operations but for which no response was obtained prior to CAPI subsampling—these consist of mailable addresses not in a remote Alaska area.
Most unmailable addresses are selected for CAPI at a rate of 2-in-3. Mailable addresses from which a response was not obtained by the time of the CAPI operation are sampled at rates of 1-in-2, 2-in-5, and 1-in-3; these rates are set at the tract level. All sample addresses (unmailable and mailable) in HH, ANVSA, AI areas, and remote Alaska are selected for CAPI. See Table 3 for the CAPI subsampling rates.
All non-responding addresses in Puerto Rico are subsampled for CAPI at a 1-in-2 rate.
Table 3. Second-Phase (CAPI) Subsampling Rates for the United States
Address and Tract Characteristics |
CAPI Subsampling Rate |
Addresses in Remote Alaska* |
Take all (100.0%) |
Addresses in Hawaiian Homelands, Alaska Native Village statistical areas and a subset of American Indian areas* |
Take all (100.0%) |
Unmailable addresses that are not in the previous two categories |
66.7% |
Mailable addresses in tracts with predicted levels of completed mail and Internet interviews prior to CAPI subsampling between 0% and 35%, inclusive |
50.0% |
Mailable addresses in tracts with predicted levels of completed mail and Internet interviews prior to CAPI subsampling greater than 35% and less than or equal to 50% |
40.0% |
Mailable addresses in all other tracts |
33.3% |
The 2022 group quarters (GQ) sampling frame was divided into two strata: a small GQ stratum and a large GQ stratum. Small GQs have expected populations of 15 or fewer people residing at the GQ, while large GQs have expected populations of more than 15 people residing at the GQ.
Samples were selected in two phases within each stratum. In general, GQs were selected in the first phase and then residents of sampled GQs were selected in the second phase. Both phases differ between the two strata, as described in detail below. Each sampled GQ was randomly assigned to one or more months in 2022; it was in these months that their person samples were selected.
There were two stages of selecting small GQs for sample.
First stage
The small GQ universe is divided into five groups that are approximately equal in size. All new small GQs are systematically assigned to one of these five groups on a yearly basis, with about the same probability (20 percent) of being assigned to any given group. Each group represents a second-stage sampling frame, from which GQs are selected once every five years. The 2022 second-stage sampling frame was used in 2017 as well, and would be used in 2027, 2032, etc.
Second stage
Small GQs were systematically selected from the 2022 second-stage sampling frame. Each GQ had the same second-stage probability of being selected within a given state, where the probabilities varied between states. Table 4 below shows these probabilities.
Note that the GQ sampling rate for Puerto Rico was 2.5 percent.
Table 4. 2022 Group Quarter Targeted Sampling Rates for the U.S. by State
State |
Target Rate |
State |
Target Rate |
State |
Target Rate |
Alabama |
1.17% |
Kentucky |
2.32% |
North Dakota |
3.90% |
Alaska |
2.89% |
Louisiana |
2.37% |
Ohio |
2.39% |
Arizona |
1.78% |
Maine |
2.64% |
Oklahoma |
2.15% |
Arkansas |
2.00% |
Maryland |
2.46% |
Oregon |
2.26% |
California |
2.19% |
Massachusetts |
2.05% |
Pennsylvania |
2.58% |
Colorado |
1.76% |
Michigan |
2.54% |
Rhode Island |
2.42% |
Connecticut |
2.38% |
Minnesota |
2.31% |
South Carolina |
1.97% |
Delaware |
4.90% |
Mississippi |
2.25% |
South Dakota |
3.46% |
District of Columbia |
2.73% |
Missouri |
2.14% |
Tennessee |
2.25% |
Florida |
1.98% |
Montana |
3.39% |
Texas |
2.02% |
Georgia |
2.29% |
Nebraska |
2.41% |
Utah |
1.83% |
Hawaii |
3.09% |
Nevada |
3.37% |
Vermont |
4.25% |
Idaho |
2.23% |
New Hampshire |
2.52% |
Virginia |
2.40% |
Illinois |
2.46% |
New Jersey |
2.61% |
Washington |
1.88% |
Indiana |
2.32% |
New Mexico |
2.61% |
West Virginia |
2.14% |
Iowa |
2.18% |
New York |
2.12% |
Wisconsin |
2.23% |
Kansas |
2.34% |
North Carolina |
2.00% |
Wyoming |
6.83% |
Second Phase of Sample Selection for Small GQs
Individuals were selected for sample from each GQ that was selected for sample in the first phase of sample selection. If 15 or fewer people were residing at a GQ at the time an interviewer visited the GQ, then all of them were selected for sample. Otherwise, if more than 15 people were residing at the GQ, then the interviewer selected a systematic sample of ten people from the GQ’s roster.
The targeted state-level sampling rates are the probabilities of selecting any given person in a GQ; it is around these probabilities that the sample design is based. These probabilities reflect both phases of sample selection, and they varied by state. The probabilities for 2022 are shown in Table 4.
The sample was designed so that the second-phase sampling rate would be 100 percent for small GQs (i.e., select the entire expected population of 15 or fewer people for sample in every sampled small GQ). This means the probability of selecting any person in a small GQ was designed to equal the probability of selecting the small GQ itself.
All large GQs in scope for the ACS were eligible to being sampled in 2022. This means there was only a single stage of sampling in this phase. This stage consists of systematically assigning “hits” to GQs independently in each state, where each hit represents ten people to be sampled.
In general, a large GQ has either Z or Z+1 hits assigned to it. The value for Z is dependent on both the GQ’s expected population size and its within-state target sampling rate, shown in Table 4. When this rate is multiplied by a GQ’s expected population, the result is a GQ’s expected person sample size. If a GQ’s expected person sample size is less than ten, then Z = 0; if it is at least ten but less than 20, then Z = 1; if it is at least 20 but less than 30, then Z = 2; and so on.
If a GQ has an expected person sample size that is less than ten, then this method effectively gives the GQ a probability of selection that is proportional to its size; this probability is the expected person sample size divided by ten. If a GQ has an expected person sample size of ten or more, then it is in sample with certainty and is assigned one or more hits.
Individuals are selected within each GQ to which one or more hits are assigned in the first phase of selection. There are ten people selected at a GQ for every hit assigned to the GQ. The individuals are systematically sampled from a roster of people residing at the GQ at the time of an interviewer’s visit. The exception is if there are far fewer persons residing in a GQ than expected—in these situations, the number of people to sample at the GQ is reduced to reflect the GQ’s actual population. In cases where fewer than ten people reside in a GQ at the time of a visit, the interviewer will select all of them for sample.
As for small GQs, the targeted state-level sampling rates are the probabilities of selecting any given person in a GQ. The probabilities are shown in Table 4. Note that these rates are the same as for everyone in small GQs.
As an example, suppose a GQ in Indiana has an expected population of 250. The target sampling rate in Indiana is 2.32 percent, meaning any given person in a GQ in Indiana has about a 1-in-43.1 chance of being selected. This rate, combined with the GQ’s expected population of 250, means that the expected number of individuals selected for sample in this GQ is 5.8 (2.32 percent × 250). Since this is less than ten, this GQ has either 0 or 1 hits assigned to it (Z = 0). The probability of it being assigned a hit is the GQ’s expected person sample size of 5.8 divided by 10, or 58 percent.
As a second example, suppose a GQ in Idaho has an expected population of 1,000. The target sampling rate in Idaho is 2.23 percent, meaning any given person in a GQ in Idaho has about a 1-in-44.8 chance of being selected. This rate, combined with the GQ’s expected population of 1,000, means that the expected number of individuals selected for sample in the GQ is 22.3 (2.23 percent × 1,000); this GQ is assigned either two or three hits (Z = 2). To decide whether it is assigned two or three hits, we consider the remainder above 20, i.e., 2.3 and use the same calculation as the previous example to determine the probability of assigning an additional hit. Dividing 2.3 by 10 yields a 23 percent chance of being assigned three hits and a 77 percent chance of remaining with the assigned two hits.
All sample GQs are assigned to one or more months (interview months); these are the months in which interviewers will visit a GQ to select a person sample and conduct interviews. All small GQs, all large GQs that are assigned only one hit, all remote Alaska GQs, all sampled military facilities, and all sampled correctional facilities (regardless of how many hits a military or correctional facility is assigned) are assigned to a single interview month. Remote Alaska GQs are assigned to either the first half or second half of the year; federal prisons are assigned to September; all the others are randomly assigned one interview month. Military ships are restricted to March–December to allow time for the Census Bureau to let the point of contact for these ships know what month the sampled ships are in.
All large GQs that are assigned multiple hits, but are not in any of the categories above, have each hit randomly assigned to a different interview month. If a GQ has more than 12 hits assigned to it, then multiple hits are assigned to one or more interview months for the GQ. For example, if a GQ has 15 hits assigned to it, then there are three interview months in which two hits are assigned and nine interview months in which one hit is assigned. One restriction is applied to college dormitories, whose hits are randomly assigned to nonsummer months only (i.e., January through April and September through December).
Bureau of Prison Group Quarters
They are sampled separately from other GQs using the same procedure shown above and are all assigned to the September interview month as before. The Census Bureau uses the most up-to-date information through two files delivered from the Bureau of Prisons–a facilities file listing each federal prison including address and contact information, and a list of all federal inmates. The Census Bureau samples inmates directly from the file, and the questionnaires for inmates in federal prisons are preprinted with the names of inmates in sample.
Remote Alaska
Remote Alaska is a set of rural areas in Alaska that are difficult to access and for which all addresses are treated as unmailable. There are approximately 30,000 HU addresses and 500 GQs in Remote Alaska. Using the same procedure as the housing unit sample, the GQ remote Alaska sample is assigned twice a year. We allow up to six months to complete the HU and GQ data collection for each of the two data collection periods.
Data collection instruments are available to respondents and interviewers in English and Spanish. Respondents may also complete the survey via a phone interview with bilingual staff in about 15 languages. There are toll-free numbers to receive assistance in Spanish, Chinese, Russian, Korean, and Vietnamese.
Additional methods for maximizing response are explored as part of testing conducted by the ACS, called methods panel tests (OMB Control No. 0607-0936). As details of specific strategies for testing are determined they will be provided as part of a non-substantive change request.
The Census Bureau is continuously engaging and responding to stakeholders to adapt the way we gather data andadminister the survey. The ACS Methods Panel Tests program (OMB Control No. 0607-0936) allows the Census Bureau to respond to emerging trends and changes in our nation that spawn new data needs by building on our comprehensive research agenda. This work not only improves the ACS, but also allows the Census Bureau to innovate responsively across key aspects of our work. The ACS Methods Panel Tests program also provides an opportunity to research and test elements of survey data collection that relate to the decennial census.
Testing allows the Census Bureau the opportunity to improve data quality, reduce data collection costs, improve questionnaire content and data collection materials, as well as react to emerging needs.
The Census Bureau will collect and process these data. Within the Census Bureau, please consult the following individuals for further information on their area of expertise.
Statistical Aspects
Steven Hefter Chief, ACS Sample Design Branch
Decennial Statistical Studies Division
Phone: (301) 763-4082
Overall Data Collection
Cassie Fisher Chief, Housing Unit Self Response Branch
American Community Survey Office
Phone: (301) 763-5440
1 This represents the final weighted response rate for the sample which was selected for 2022. This differs from the response rate published for out 1-year data products as the universe for that rate differs slightly than for the rate shown here.
2 This represents the final weighted response rate for the sample which was selected for 2022. This differs from the response rate published for out 1-year data products as the universe for that rate differs slightly than for the rate shown here.
3 Remote Alaska assignments are made so that the sample addresses are approximately evenly distributed between the two data collection periods.
4
All nonmailable and nonresponding addresses in the following areas
are now sent to CAPI: all Hawaiian Homelands, all Alaska Native
Village statistical areas, American Indian areas with an estimated
proportion of American Indian population ≥ 10%.
5
Most of the period assignments are made during Period 1 sampling.
The only assignments in Period 2 sampling are made for addresses
that were not part of the process in Period 1, e.g., new
addresses.
6
Beginning with the 2011 sample the ACS implemented a change to the
stratification, increasing the number of sampling strata and
changing how the sampling rates are defined. Prior to 2011 there
were seven strata; there are now 16 sampling strata. Table 1
gives a summary of these strata and the rates.
7 These are the states where MCDs are active, functioning governmental units.
Page
|
File Type | application/vnd.openxmlformats-officedocument.wordprocessingml.document |
Author | Mary Reuling Lenaiyasa (CENSUS/PCO FED) |
File Modified | 0000-00-00 |
File Created | 2024-09-06 |