High Frequency Phone Survey 2020-2024

Ethiopia, 2020 - 2024

Reference ID

ETH_2020-2024_HFPS_v15_M

Producer(s)

World Bank

Created on

Jan 16, 2021

Last modified

Aug 02, 2023

Page views

140108

Downloads

3747

Identification

Survey ID number

ETH_2020-2024_HFPS_v15_M

Title

High Frequency Phone Survey 2020-2024

Abbreviation or Acronym

HFPS 2020-24

Country

Name	Country code
Ethiopia	ETH

Study type

Socio-Economic/Monitoring Survey [hh/sems]

Series Information

The World Bank is providing support to countries to help mitigate the spread and impact of the new coronavirus disease (COVID-19) and beyond. One area of support is data collection to inform evidence-based policies that may help mitigate the effects of this disease and of other economic and social problems. Towards this end, the World Bank is leveraging the Living Standards Measurement Study - Integrated Survey on Agriculture (LSMS-ISA) program to implement high-frequency phone surveys in five African countries: Nigeria, Ethiopia, Uganda, Tanzania, and Malawi. This effort is part of a broader first wave of World Bank-supported national longitudinal high-frequency surveys that can be used to help assess the economic and social implications of the COVID-19 pandemic and other socio-economic shocks on households and individuals.

Abstract

The COVID-19 pandemic is expected to have severe impacts on the welfare of Ethiopian households. To monitor these impacts, the team selected a subsample of households that had been interviewed for the Living Standards Measurement Study (LSMS) in 2019, covering urban and rural areas in all regions of Ethiopia. The 15-minute questionnaire covers a series of topics, such as knowledge of COVID-19 and mitigation measures, access to routine healthcare as public health systems are increasingly under stress, access to educational activities during school closures, employment dynamics, household income and livelihood, income loss and coping strategies, and external assistance.

The survey is implemented through Computer Assisted Telephone Interviewing with a modular approach, which allows modules to be dropped or added in different waves of the survey. Survey data collection started at the end of April 2020 and households are called back every three to four weeks for a total of seven survey rounds to track the impact of the pandemic as it unfolds and inform government action. This provides data to the government and development partners in near real-time, supporting an evidence-based response to the crisis.

The sample of households was drawn from the sample of households interviewed in the 2018/2019 round of the Ethiopia Socioeconomic Survey (ESS). The extensive information collected in the ESS, less than one year prior to the pandemic, provides a rich set of background information on the COVID-19 High Frequency Phone Survey of households which can be leveraged to assess the differential impacts of the pandemic in the country.

Kind of Data

Sample survey data [ssd]

Unit of Analysis

Individual and household

Version

Version Description

Version 15: Edited, anonymized dataset for public distribution

Version Date

2025-01-10

Version Notes

This version contains updated household weights for round 13 to round 19.

Scope

Notes

The Ethiopia - High Frequency Phone Survey covered the following topics:

Household Roster (Rounds 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
Knowledge Regarding the Spread of COVID-19 (Round 1)
Behavior and Social Distancing (Rounds 1, 3, 6, 7)
Access to Basic Services (Rounds 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
Employment and Non-farm Enterprise (Rounds 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 13, 14, 16, 18)
Income Loss and Coping (Rounds 1, 2, 3, 4, 5, 6, 7)
Food Security (Rounds 1, 2, 3, 4, 11, 15)
Aid and Support/ Social Safety Nets (Rounds 1, 2, 3, 4, 5, 6, 7, 9)
Agriculture (Rounds 3, 4, 5, 6, 7, 9, 14, 19)
Locusts (Rounds 4, 6, 7)
WASH (Rounds 4, 9)
Education and Childcaring (Rounds 8, 11)
Credit (Round 8)
Migration (Round 8)
Return Migration (Round 8)
SWIFT (Round 11)
Youth Aspirations and Employment (Round 12)
Access to Health Services (Rounds 13, 14, 15, 16, 17, 18)
Food and Non-food Prices (Rounds 13, 14, 15, 16, 17, 18, 19)
COVID-19 Vaccine (Round 14)
Economic Sentiments (Rounds 14, 15, 17, 18)
Shock/Coping (Round 16)
Subjective Welfare (Round 16)
Food Insecurity Experience Scale (Rounds 17, 18)

Coverage

Geographic Coverage

National coverage - rural and urban

Universe

The survey covered all de jure households excluding prisons, hospitals, military barracks, and school dormitories.

Producers and sponsors

Primary investigators

Name
World Bank

Producers

Name	Role
Central Statistical Agency	Collaborator

Funding Agency/Sponsor

Name	Abbreviation	Role
United States Agency for International Development	USAID	Funded the study
The World Bank Group	WB	Funded the study
Global Financing Facility	GFF	Funded the study

Sampling

Sampling Procedure

The sample of the HFPS-HH is a subsample of the 2018/19 Ethiopia Socioeconomic Survey (ESS). The ESS is built on a nationally and regionally representative sample of households in Ethiopia. ESS 2018/19 interviewed 6,770 households in urban and rural areas. In the ESS interview, households were asked to provide phone numbers, either their own or that of a reference household (i.e., friends or neighbors), so that they could be contacted in follow-up ESS surveys should they move from their sampled location. At least one valid phone number was obtained for 5,374 households (4,626 owning a phone and 995 with a reference phone number). These households established the sampling frame for the HFPS-HH.

To obtain representative strata at the national, urban, and rural level, the target sample size for the HFPS-HH is 3,300 households: 1,300 in rural areas and 2,000 in urban areas. In rural areas, we attempted to call all phone numbers included in the ESS, as only 1,413 households owned phones and another 771 households provided reference phone numbers. In urban areas, 3,213 households owned a phone and 224 households provided reference phone numbers. To account for non-response and attrition, all 5,374 households were called in round 1 of the HFPS-HH.

The total number of completed interviews in round one is 3,249 households (978 in rural areas, 2,271 in urban areas).
The total number of completed interviews in round two is 3,107 households (940 in rural areas, 2,167 in urban areas).
The total number of completed interviews in round three is 3,058 households (934 in rural areas, 2,124 in urban areas).
The total number of completed interviews in round four is 2,878 households (838 in rural areas, 2,040 in urban areas).
The total number of completed interviews in round five is 2,770 households (775 in rural areas, 1,995 in urban areas).
The total number of completed interviews in round six is 2,704 households (760 in rural areas, 1,944 in urban areas).
The total number of completed interviews in round seven is 2,537 households (716 in rural areas, 1,821 in urban areas).
The total number of completed interviews in round eight is 2,222 households (576 in rural areas, 1,646 in urban areas).
The total number of completed interviews in round nine is 2,077 households (553 in rural areas, 1,524 in urban areas).
The total number of completed interviews in round ten is 2,178 households (537 in rural areas, 1,641 in urban areas).
The total number of completed interviews in round eleven is 1,982 households (442 in rural areas, 1,540 in urban areas).
The total number of completed interviews in round twelve is 888 households (204 in rural areas, 684 in urban areas).
The total number of completed interviews in round thirteen is 2,876 households (955 in rural areas, 1,921 in urban areas).
The total number of completed interviews in round fourteen is 2,509 households (765 in rural areas, 1,744 in urban areas).
The total number of completed interviews in round fifteen is 2,521 households (823 in rural areas, 1,698 in urban areas).
The total number of completed interviews in round sixteen is 2,336 households.
The total number of completed interviews in round seventeen is 2,357 households.
The total number of completed interviews in round eighteen is 2,237 households (701 in rural areas, 1,536 in urban areas).
The total number of completed interviews in round nineteen is 2,566 households (806 in rural areas, 1,760 in urban areas).
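
The counts above lend themselves to a quick attrition check. The following is a minimal sketch in Python (the source itself ships no code), using a handful of the totals listed above to compute retention relative to round 1 and the urban share of completed interviews. The names COMPLETED, retention, and urban_share are illustrative only, and the urban count is derived as total minus rural.

```python
# Completed interviews for selected rounds, copied from the list above:
# round -> (total households, rural households).
COMPLETED = {
    1: (3249, 978),
    2: (3107, 940),
    8: (2222, 576),
    12: (888, 204),
    13: (2876, 955),
    19: (2566, 806),
}


def retention(round_no, baseline=1):
    """Completed interviews in round_no as a share of the baseline round."""
    return COMPLETED[round_no][0] / COMPLETED[baseline][0]


def urban_share(round_no):
    """Urban share of completed interviews, derived as (total - rural) / total."""
    total, rural = COMPLETED[round_no]
    return (total - rural) / total


if __name__ == "__main__":
    for r in sorted(COMPLETED):
        print(f"Round {r:>2}: retention {retention(r):.1%}, "
              f"urban share {urban_share(r):.1%}")
```

These are unweighted fieldwork counts, so the shares describe survey completion rather than population estimates.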

Weighting

To obtain unbiased estimates from the sample, the information reported by households needs to be adjusted by a sampling weight (or raising factor) w_h. Himelein, K. (2014) outlines eight steps for constructing such weights; we follow six of them to construct the sampling weights for the HFPS-HH:

1. Begin with base weights from the Ethiopia Socioeconomic Survey ESS 2018/19 for each household.
2. Incorporate the probability of sub-selection of round 1 units for each of the phone survey households. We calculate the probability of selection for each of the 20 strata in the ESS (urban and rural in each of the 11 regions, except for Addis Ababa, where there is only an urban stratum), taking the number of completed phone interviews in the stratum as the numerator and the number of ESS households in the stratum as the denominator.
3. Pool the weights in Steps 1 and 2.
4. Derive attrition-adjusted weights for all individuals by running a logistic response propensity model based on characteristics of the household head (i.e., education, labor force status, demographic characteristics), characteristics of the household (consumption, assets, financial characteristics), and characteristics of the dwelling (house ownership, overcrowding).
5. Trim weights by replacing the top two percent of observations with the 98th percentile cut-off point; and
6. Post-stratify weights to known population totals to correct for the imbalances across our urban and rural sample. In doing so, we ensure that the distribution in the survey matches the distribution in the ESS.

Additional technical details and explanations on each of the steps briefly outlined above can be found in Himelein, K. (2014).
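
The weighting steps can be illustrated with a short Python sketch. It is not the survey team's implementation. It assumes that the response propensity in the attrition step has already been predicted by a separately fitted logistic model, that the 98th percentile is taken with a simple nearest-rank rule, and that post-stratification rescales weights within each urban/rural group to a supplied total. All function names and example numbers are hypothetical.

```python
import math


def subselection_probability(completed, ess_households):
    """Step 2: completed phone interviews / ESS households in one stratum."""
    return completed / ess_households


def pooled_weight(base_weight, p_select):
    """Step 3: combine the ESS base weight with the sub-selection probability."""
    return base_weight / p_select


def attrition_adjust(weight, response_propensity):
    """Step 4 (assumed form): inflate by the inverse of a predicted
    response propensity that comes from a model fitted elsewhere."""
    return weight / response_propensity


def percentile_nearest_rank(values, pct):
    """Nearest-rank percentile (an assumption; other definitions exist)."""
    ordered = sorted(values)
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def trim_weights(weights, pct=98):
    """Step 5: cap weights above the pct-th percentile at that cut-off."""
    cap = percentile_nearest_rank(weights, pct)
    return [min(w, cap) for w in weights]


def post_stratify(weights, groups, totals):
    """Step 6: rescale so weights in each group sum to its known total."""
    sums = {}
    for w, g in zip(weights, groups):
        sums[g] = sums.get(g, 0.0) + w
    return [w * totals[g] / sums[g] for w, g in zip(weights, groups)]


if __name__ == "__main__":
    p = subselection_probability(completed=50, ess_households=100)
    w = attrition_adjust(pooled_weight(100.0, p), response_propensity=0.5)
    print(w)  # weight for one hypothetical household before trimming
```

Trimming before post-stratification, as in the listed order, means the final rescaling restores the group totals that trimming would otherwise reduce.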

Survey instrument

Questionnaires

The survey questionnaires were administered to all the households in the sample. The questionnaires consisted of the following sections:

Baseline (Round 1)

Household Identification
Interview Information
Household Roster
Knowledge Regarding the Spread of Coronavirus
Behavior and Social Distancing
Access to Basic Services
Employment
Income Loss and Coping
Food Security
Aid and Support/ Social Safety Nets

Round 2

Household Identification
Household Roster
Access to Basic Services
Employment
Income Loss and Coping
Food Security
Aid and Support/ Social Safety Nets

Round 3

Household Identification
Household Roster
Behavior and social distancing
Access to Basic Services
Employment
Income Loss and Coping
Food Security
Agriculture
Aid and Support/ Social Safety Nets

Round 4

Household Identification
Household Roster
Access to Basic Services
Employment
Income Loss and Coping
Food Security
Agriculture
Aid and Support/ Social Safety Nets
Locusts
WASH

Round 5

Household Identification
Household Roster
Access to Basic Services
Employment
Income Loss and Coping
Aid and Support/ Social Safety Nets
Agriculture
Livestock

Round 6

Household Identification
Household Roster
Behavior and Social Distancing
Access to Basic Services
Employment
Income Loss and Coping
Aid and Support/ Social Safety Nets
Agriculture
Locusts

Round 7

Household Identification
Household Roster
Behavior and Social Distancing
Access to Basic Services
Employment
Income Loss and Coping
Aid and Support/ Social Safety Nets
Agriculture
Locusts

Round 8

Household Identification
Household Roster
Access to Basic Services
Employment
Education and Childcaring
Credit
Migration
Return Migration

Round 9

Household Identification
Household Roster Update
Access to Basic Services
Employment
Aid and Support/ Social Safety Nets
Agriculture
WASH

Round 10

Household Identification
Household Roster Update
Access to Basic Services
Employment

Round 11

Household Identification
Household Roster Update
Access to Basic Services
Employment
Education and Childcaring
Food Insecurity Experience Scale
SWIFT

Round 12

Household Identification
Household Roster Update
Youth Aspirations and Employment

Round 13

Household Identification
Household Roster Update
Access to Health Services
Employment
Food Prices

Round 14

Household Identification
Household Roster Update
Access to Health Services
COVID-19 Vaccine
Employment
Economic Sentiments
Food Prices
Agriculture

Round 15

Household Identification
Household Roster Update
Access to Health Services
Economic Sentiments
Food Insecurity Experience Scale
Food Prices

Round 16

Household Identification
Household Roster Update
Access to Health Services
Employment and Non-farm Enterprises
Food and Non-food prices
Shocks and Coping Strategies
Subjective Welfare

Round 17

Household Identification
Household Roster Update
Access to Health Services for Individual Household Members (Sample A)
Access to Health Services for Households (Sample B)
Food and Non-food prices
Economic Sentiments
Food Insecurity Experience Scale

Round 18

Household Identification
Household Roster Update
Access to Health Services for Individual Household Members
Food and Non-food prices
Economic Sentiments (Sample B)
Food Insecurity Experience Scale (Sample A)

Round 19

Household Identification
Household's Residential Location Verification
Household Roster Update
Food and Non-food Prices
Agriculture Crop
Agriculture Livestock

Data collection

Dates of Data Collection

Start	End	Cycle
2020-04-22	2020-05-13	Round 1
2020-05-14	2020-06-03	Round 2
2020-06-04	2020-06-26	Round 3
2020-07-27	2020-08-14	Round 4
2020-08-24	2020-09-17	Round 5
2020-09-21	2020-10-14	Round 6
2020-10-19	2020-11-10	Round 7
2020-12-01	2020-12-21	Round 8
2020-12-28	2021-01-22	Round 9
2021-02-01	2021-02-23	Round 10
2021-04-12	2021-05-11	Round 11
2021-06-01	2021-06-20	Round 12
2022-10-03	2022-11-05	Round 13
2022-12-13	2023-01-13	Round 14
2023-02-23	2023-03-25	Round 15
2023-03-26	2023-05-21	Round 16
2023-07-11	2023-08-04	Round 17
2023-10-02	2023-10-30	Round 18
2024-01-05	2024-02-02	Round 19
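
A table of fieldwork dates like this one is easy to sanity-check programmatically. The sketch below, in Python with only the standard library, flags rows whose end date precedes the start date or whose start falls on or before the end of an earlier round, and computes a round's length in days. The function names are hypothetical, and only the first three rows of the table are used as sample input.

```python
from datetime import date


def check_rounds(rows):
    """Return a list of problems found in (start, end, label) rows."""
    problems = []
    latest_end = None
    for start, end, label in rows:
        s, e = date.fromisoformat(start), date.fromisoformat(end)
        if e < s:
            problems.append(f"{label}: end precedes start")
        if latest_end is not None and s <= latest_end:
            problems.append(f"{label}: starts before the previous round ended")
        latest_end = e if latest_end is None else max(latest_end, e)
    return problems


def duration_days(start, end):
    """Inclusive number of calendar days between two ISO dates."""
    return (date.fromisoformat(end) - date.fromisoformat(start)).days + 1


ROUNDS_1_TO_3 = [
    ("2020-04-22", "2020-05-13", "Round 1"),
    ("2020-05-14", "2020-06-03", "Round 2"),
    ("2020-06-04", "2020-06-26", "Round 3"),
]

if __name__ == "__main__":
    print(check_rounds(ROUNDS_1_TO_3))
    print(duration_days("2020-04-22", "2020-05-13"))
```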

Mode of data collection

Computer Assisted Telephone Interview [cati]

Data Collectors

Name
Laterite BV

Data Collection Notes

The Ethiopia COVID-19 High Frequency Phone Survey of Households (HFPS) was conducted using Computer Assisted Telephone Interview (CATI) techniques. The household questionnaire was implemented using the CATI software SurveyCTO. Each enumerator was given a tablet, which they used to conduct the interviews, along with data bundles to be used on their own mobile phone devices.

DATA COMMUNICATION SYSTEM: SurveyCTO's built-in data monitoring functions were used. Each enumerator was provided with a data bundle, allowing for internet connectivity and daily synchronization of their tablet. Data was sent to the server daily. Senior Field Supervisors served as the first step in ensuring data quality. They reviewed the survey with enumerators twice daily via one-on-one calls and were always available to address any concerns that arose during an interview. At the same time, a Research Analyst was in charge of checking the uploaded data daily to correct errors and to prevent them in future surveys. The following data quality checks were completed:

• Daily SurveyCTO monitoring: This included outlier checks, skipped questions, a review of “Other, specify” and other text responses, and enumerator comments. Enumerator comments were used to suggest new response options or to highlight situations where existing options should be used instead. Monitoring also included a review of variable relationship logic checks and checks of the logic of answers. Finally, outliers in phone variables such as survey duration or the percentage of time audio was at a conversational level were monitored. A survey duration of close to 15 minutes and a conversation-level audio percentage of around 40% were considered normal.

• Dashboard review: This included monitoring individual enumerator performance, such as the number of calls logged, duration of calls, percentage of calls responded to and percentage of non-consents. Non-consent reason rates and attempts per household were monitored as well. Duration analysis using R was used to monitor each module's duration and estimate the time required for subsequent rounds. The dashboard was also used to track overall survey completion and preview the results of key questions.

• Daily Data Team reporting: The Field Supervisors and the Data Manager reported daily feedback on call progress, enumerator feedback on the survey, and any suggestions to improve the instrument, such as adding options to multiple choice questions or adjusting translations.

• Audio audits: Audio recordings were captured during the consent portion of the interview for all completed interviews, for the enumerators' side of the conversation only. The recordings were reviewed for any surveys flagged by enumerators as having data quality concerns and for an additional random sample of 2% of respondents. A range of lengths were selected to observe edge cases. Most consent readings took around one minute, with some longer recordings due to questions on the survey or holding for the respondent. All reviewed audio recordings were completed satisfactorily.

• Back-check survey: Field Supervisors made back-check calls to a random sample of 5% of the households that completed a survey in Round 1. Field Supervisors called these households and administered a short survey, including (i) identifying the same respondent; (ii) determining the respondent's position within the household; (iii) confirming that a member of the data collection team had completed the interview; and (iv) a few questions from the original survey.
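
The audio audits and back-checks both rely on random samples of completed interviews (2% and 5% respectively). The source does not describe how those samples were drawn. As a minimal sketch, assuming simple random sampling without replacement from a list of household identifiers, a reproducible draw could look as follows. The function name, the seed, and the identifiers are hypothetical.

```python
import random


def draw_sample(ids, fraction, seed):
    """Draw a reproducible simple random sample of the given fraction.

    Assumes sampling without replacement; at least one unit is selected.
    """
    ids = list(ids)
    size = max(1, round(fraction * len(ids)))
    return sorted(random.Random(seed).sample(ids, size))


if __name__ == "__main__":
    households = range(1000)  # placeholder identifiers
    backcheck = draw_sample(households, 0.05, seed=2020)
    audit = draw_sample(households, 0.02, seed=2020)
    print(len(backcheck), len(audit))
```

Fixing the seed makes the selection auditable: rerunning the draw on the same list returns the same households.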

Data processing

Data Editing

DATA CLEANING
At the end of data collection, the raw dataset was cleaned by the Research team. This included formatting the data and correcting results based on monitoring issues, enumerator feedback, and survey changes. The details are as follows.

Variable naming and labeling:
• Variable names were changed to reflect the lowercase question name in the paper survey copy, and a word or two related to the question.

• Variables were labeled with longer descriptions of their contents and the full question text was stored in Notes for each variable.

• “Other, specify” variables were named similarly to their related question, with “_other” appended to the name.

• Value labels were assigned where relevant, with options shown in English for all variables, unless preloaded from the roster in Amharic.

Variable formatting:
• Variables were formatted as their object type (string, integer, decimal, time, date, or datetime).

• Multi-select variables were saved both as a single space-separated variable and as multiple binary variables showing the yes/no value of each possible response.

• Time and date variables were stored as POSIX timestamp values and formatted to show Gregorian dates.

• Location information was left in separate ID and Name variables, following the format of the incoming roster. IDs were formatted to include only the variable-level digits, and not the higher-level prefixes (2-3 digits only).

• Full Household and Enumeration Area ID variables were given leading 0s to match incoming roster format.
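
The formatting conventions above can be illustrated with a few small Python helpers. This is a sketch under stated assumptions, not the cleaning code used for the survey: the binary variable naming pattern (stem plus option code), the ID width, and the use of UTC when converting timestamps are all hypothetical choices.

```python
from datetime import datetime, timezone


def expand_multiselect(raw, codes, stem):
    """Turn a space-separated multi-select answer into 0/1 indicators.

    The stem_code naming pattern is an assumption for illustration.
    """
    chosen = set(raw.split())
    return {f"{stem}_{code}": int(str(code) in chosen) for code in codes}


def pad_id(value, width):
    """Left-pad an ID with zeros so it matches a fixed-width roster format."""
    return str(value).zfill(width)


def posix_to_date(timestamp):
    """Render a POSIX timestamp as a Gregorian ISO date (UTC assumed)."""
    return datetime.fromtimestamp(timestamp, tz=timezone.utc).date().isoformat()


if __name__ == "__main__":
    print(expand_multiselect("1 3", [1, 2, 3], "coping"))
    print(pad_id(1234, 6))
    print(posix_to_date(1587513600))
```

Keeping both the space-separated string and the indicator variables lets users either tabulate single options directly or recover the full answer pattern.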

Observation and variable arrangement:

• Only consented surveys were kept in the dataset, and all personal information and internal survey variables were dropped from the clean dataset.

• Roster data is separated from the main data set and kept in long-form but can be merged on the key variable (key can also be used to merge with the raw data).

• In the main dataset, ii4_resp_id and cs7_hhh_id are the roster IDs of the respondent and household head respectively, and can be merged with individual_id in the roster.

• The variables were arranged in the same order as the paper instrument, with observations arranged according to their submission time.
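
As a minimal sketch of the merge described above, the Python snippet below indexes long-form roster rows by key and individual_id and then looks up the respondent and household head referenced by ii4_resp_id and cs7_hhh_id in the main data. The variable names key, individual_id, ii4_resp_id, and cs7_hhh_id come from the text; the age field, the example values, and the helper names are hypothetical.

```python
def index_roster(roster_rows):
    """Index long-form roster rows by (key, individual_id)."""
    return {(row["key"], row["individual_id"]): row for row in roster_rows}


def lookup_member(main_row, roster_index, id_field):
    """Return the roster row that id_field points to, or None if absent."""
    return roster_index.get((main_row["key"], main_row[id_field]))


# Toy data: 'age' is a hypothetical roster variable used for illustration.
roster = [
    {"key": "uuid-a", "individual_id": 1, "age": 45},
    {"key": "uuid-a", "individual_id": 2, "age": 39},
    {"key": "uuid-b", "individual_id": 1, "age": 62},
]
main = [
    {"key": "uuid-a", "ii4_resp_id": 2, "cs7_hhh_id": 1},
    {"key": "uuid-b", "ii4_resp_id": 1, "cs7_hhh_id": 1},
]

if __name__ == "__main__":
    idx = index_roster(roster)
    for row in main:
        respondent = lookup_member(row, idx, "ii4_resp_id")
        head = lookup_member(row, idx, "cs7_hhh_id")
        print(row["key"], respondent["age"], head["age"])
```

In a statistical package the same operation would be a keyed merge of the main file onto the roster; the dictionary index here plays that role.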

Backcheck data review: Results of the backcheck survey were compared against the originally captured survey results using the bcstats command in Stata. This command produces a comparison of variables and identifies any discrepancies. Any discrepancies identified were then examined individually to determine whether they were within reason.
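
To make the idea of a backcheck comparison concrete, the following Python sketch computes, for each compared variable, the share of back-checked households whose answer differs from the original interview. It is a simplified illustration and not a reimplementation of bcstats; the identifier household_id and the compared variables are hypothetical.

```python
def discrepancy_rates(original, backcheck, id_field, variables):
    """Per-variable share of matched households whose answers differ."""
    by_id = {row[id_field]: row for row in original}
    differing = {v: 0 for v in variables}
    matched = 0
    for bc_row in backcheck:
        orig_row = by_id.get(bc_row[id_field])
        if orig_row is None:
            continue  # back-check record without an original interview
        matched += 1
        for v in variables:
            if orig_row.get(v) != bc_row.get(v):
                differing[v] += 1
    if matched == 0:
        return {v: None for v in variables}
    return {v: differing[v] / matched for v in variables}


# Toy records; variable names are placeholders, not survey variables.
original = [
    {"household_id": "001", "resp_sex": 1, "hh_size": 5},
    {"household_id": "002", "resp_sex": 2, "hh_size": 3},
]
backcheck = [
    {"household_id": "001", "resp_sex": 1, "hh_size": 5},
    {"household_id": "002", "resp_sex": 2, "hh_size": 4},
]

if __name__ == "__main__":
    print(discrepancy_rates(original, backcheck, "household_id",
                            ["resp_sex", "hh_size"]))
```

A nonzero rate is only a prompt for review: some variables can legitimately change between the interview and the back-check call.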

Data Access

Citation requirements

Use of the dataset must be acknowledged using a citation that includes:

the identification of the Primary Investigator
the title of the survey (including country, acronym, and year of implementation)
the survey reference number
the source and date of download

World Bank. Ethiopia - High Frequency Phone Survey 2020-2024. Ref: ETH_2020-2024_HFPS_v15_M. Dataset downloaded from www.microdata.worldbank.org on [date].

Disclaimer and copyrights

Disclaimer

The user of the data acknowledges that the original collector of the data, the authorized distributor of the data, and the relevant funding agency bear no responsibility for use of the data or for interpretations or inferences based upon such uses.

Contacts

Name	Affiliation	Email
LSMS	The World Bank Group	lsms@worldbank.org

Metadata production

DDI Document ID

DDI_ETH_2020-2024_HFPS_v15_M

Producers

Name	Abbreviation	Affiliation	Role
Development Economics Data Group	DECDG	The World Bank	Documentation of the DDI

Date of Metadata Production

2023-08-02

Metadata version

DDI Document version

Version 15 (January 2025). Household weights have been updated for round 13 to round 19.

Version date

2025-01-10
