Response to the Office of Management and Budget
2011 California Health Interview Survey (CHIS 2011)
OMB #: 0925-0598
Would you please ask the Program if there is an update regarding section B2.2. copied below?
B.2.2. Estimation Procedure
CHIS data will be statistically weighted to account for the differential probability of selecting persons into the sample, and the weights will be raked to the various domains of California population totals. Specific strategy for drawing and weighting the CHIS sample will depend on the results of the recently fielded CHIS 2009, and as such, final specifications are not available at this time. The methods detailed below, therefore, enumerate the anticipated estimation procedure.
Unlike CHIS 2007 where the cell sample was screened and data collected from cell-only households, CHIS 2009 collects information from all households identified in the cell sample. These include cell only adults, adults that receive most of their calls in their cell phone, adults who receive some of their cells and some on their landlines, and adults who received few or no calls on their cell phone. The same groups are also identified in the landline sample if the adult has a cell phone.
Since the landline and cell phone populations and samples overlap and the drawn samples are probability samples, we can use a multiple frame estimation approach to combine and create weights for these samples. This approach follows the ideas of Hartley (1962) and is different from the approach used to combine the landline and surname samples. The proposed method is needed because we cannot determine the multiple probabilities of selection of all units in the sample. The method we plan to use is outlined in Brick, Flores Cervantes, Norman, and Lee (submitted).
There are three domains of interest in the overlapping frames. The first domain called a includes all adults in households with only landline service, the second domain called b includes all adults in cell-only households, and the third domain called ab includes all adults in households with both landline and cell phones. Let be a characteristic for adults in a domain (e.g., the number of adults with health insurance). Let be the estimate of computed using the landline sample, and let the estimate of computed using the records in cell phone sample. An estimate of using the landline sample is
,
where is the estimate computed using the records from landline only households and is the estimate computed using the adults with a landline and cell phone from in the landline sample. In a similar way, an estimate of based on the cell phone sample is where is the estimate computed using the adults with a landline and cell phone from the cell phone sample and is the estimate computed using the records from cell only households.
Notice that neither nor are unbiased estimates of . However, an unbiased estimate of can be computed as
,
where ( ) is the composite or weighting factor. The value of was chosen to minimize the bias of .
Before creating the composite weights, both samples (landline and cell) were poststratified separately to control totals defined by telephone service or type (i.e., persons in landline only households, persons in cell phone only households, and persons in households with both services). The distribution of telephone usage for California was derived from the National Health Interview Survey for January to June 2010 for the West region. The poststratified person weight, is computed as
where is the person weight (i.e., adult, child, or adolescent) and is the control total by telephone service.
Once the samples were poststratified, a composite weight that combined the landline and cell phone sample was created. Based on the research by Brick et. al (submitted), using the composite factor was used to reduce the bias of estimates computed using both samples. This factor and its complement ( ) can be seen as an additional weighting adjustment factors to apply to the poststratified weights. The expression of the composite weight, , is
where is the poststratified person weight above. The total sum of weights is an estimate of the total eligible population in California.
Since the landline and cell phone samples are independent samples, the estimates of variance can be computed using replication or linearization (i.e., Taylor series approximation).
In summary, the supplemental samples (i.e., geographic and surnames samples) are combined with the landline sample at the beginning of the weighting process. The cell phone sample and the combined landline-supplemental samples are first poststratified to telephone status or usage, combined through a composite factor, and then raked altogether.
April 8, 2011
File Type | application/vnd.openxmlformats-officedocument.wordprocessingml.document |
Author | Royce Park |
File Modified | 0000-00-00 |
File Created | 2021-02-01 |