Attachment XII - Modernizing Data Collection APIs and Webscraping

ATT_XII_Modernizing Data Collection_APIs_&_Web_Scraping.pdf

Consumer Price Index Housing Survey

Attachment XII - Modernizing Data Collection APIs and Webscraping

OMB: 1220-0163

Document [pdf]
Download: pdf | pdf
U.S. Bureau of
L abor Statistics

THE CONSUME R PRICE INDEX
Modernizing Data Collection: APIs & Web Scraping

To continue to produce high quality data, the CPI is looking to supplement traditional survey collection with
more modernized methods. While the BLS already includes prices for goods and services purchased online,
the CPI is exploring APIs and web scraping - extracting data directly from websites. APIs and web scraping are
efficient and cost-effective methods of collection that will make it even easier for businesses and organizations
to participate in CPI surveys.

What are APIs?
Some establishments provide Application
Programming Interfaces (APIs) to allow
partners to access information on their
website. Data collection through an API
is often easier and more straight-forward

What is the CPI?
The Consumer Price Index (CPI) is a measure of the average
change over time in the prices paid by urban consumers for
a market basket of goods and services. The CPI is one of the
most closely watched Principal Federal Economic Indicators
(PFEIs) produced by the U.S. Bureau of Labor Statistics (BLS).
Some of the people who follow the CPI closely include land-

than maintaining web scraping code over

lords (to help them calculate changes in rental prices), Social

time. By entering certain parameters, such

Security recipients (to anticipate their next cost-of-living ad-

as SKU code and store address, APIs can

justment), and Wall Street financial institutions and traders (to

identify specific products and prices.

determine potential moves of the stock and bond markets).

Your Voluntary Cooperation

What is web scraping?

Data collection for the CPI will not pursue any web scraping

Web scraping is a process through which

activities unless permission has been granted. The agency will

information is gathered and copied from
the web for analysis. Web scraping uses
software that simulates human web surfing
to collect existing information from a
company’s website without website
disruption.

engage in this activity in a responsible and transparent manner.
BLS will take steps to make sure CPI data collection programs
have minor impact on the website by limiting the time of day to
execute web scraping programs, number of fetches per hour/
day, time interval between requests, and type of data extracted.
This report is authorized by law, 29 U.S.C.2. Your voluntary cooperation is needed to make the results of this survey comprehensive,
accurate, and timely.

Your Confidential Participation

Why use APIs and web scraping?

The Bureau of Labor Statistics, its employees, agents, and

•	 Enhance the ease of participating in

for statistical purposes only and will hold the information in

CPI surveys by saving businesses and
organizations time and resources.
•	 Utilize high-speed methods to acquire
large volumes of information amassing
more data than staff collecting

partner statistical agencies will use the information you provide
confidence to the full extent permitted by law. In accordance
with the Confidential Information Protection and Statistical
Efficiency Act (44 U.S.C. 3572) and other applicable Federal
laws, your responses will not be disclosed in identifiable form
without your informed consent. Per the Federal Cybersecurity
Enhancement Act of 2015, Federal information systems are
protected from malicious activities through cybersecurity
screening of transmitted data.

How can I see CPI data?
Information is always available on the BLS-CPI homepage at www.bls.gov/cpi/home.htm.
New CPI data appear in a news release usually issued between the 10th and 15th of the month, reporting the data for the
previous month. Also at this time, CPI data is reported in various media, such as television, newspapers, and public websites.

U.S. Bureau of
L abor Statistics

THE CONSUME R PRICE INDEX
BLS national and regional offices

Washington, DC

Bureau of Labor Statistics
2 Massachusetts Avenue, NE
Washington, DC 20212
(202) 691-7000
[email protected]

Atlanta

Bureau of Labor Statistics
61 Forsyth Street, SW, Room 7T50
Atlanta, GA 30303
(404) 893-4222
[email protected]

Boston

Bureau of Labor Statistics
JFK Federal Building, E-310
Boston, MA 02203
(617) 565-2327
[email protected]

Chicago

Bureau of Labor Statistics
J.C. Kluczynski Federal Office Building
230 South Dearborn Street, Room 960
Chicago, IL 60604
(312) 353-1880
[email protected]

Dallas

Bureau of Labor Statistics
525 South Griffin Street, Room 221
Dallas, TX 75202
(972) 850-4800
[email protected]

Kansas City

Bureau of Labor Statistics
Two Pershing Square Building
2300 Main Street, Suite 1190
Kansas City, MO 64108
(816) 285-7000
[email protected]

New York

Bureau of Labor Statistics
New York-New Jersey Information Office
201 Varick Street, Room 808
New York, NY 10014
(646) 264-3600
[email protected]

Philadelphia

Bureau of Labor Statistics
1835 Market Street
Suite 1946
Philadelphia, PA 19103-2924
(215) 597-3282
[email protected]

San Francisco
Bureau of Labor Statistics
90 7th Street, Suite 14-100
San Francisco, CA 94103
(415) 625-2270
[email protected]

Questions?

If you have any questions or comments regarding any aspect of the survey, you may
contact:
Division of Consumer Prices and Price Indexes
Bureau of Labor Statistics
2 Massachusetts Avenue, NE
Washington, DC 20212
call (202) 691-6991, or
email: [email protected].
The U.S. Office of Management and
Budget (OMB) has approved this collection
of information and has assigned 1220-0039
as the control number. Without OMB approval and this number, we would not be
able to conduct this survey.


File Typeapplication/pdf
File TitleModernizing Data Collection: APIs & Web Scraping
File Modified2021-07-08
File Created2019-09-09

© 2024 OMB.report | Privacy Policy