IHSN Survey Catalog
  • Home
  • Microdata Catalog
  • Citations
  • Login
    Login
    Home / Central Data Catalog / ETH_2020-2024_HFPS_V15_M
central

High Frequency Phone Survey 2020-2024

Ethiopia, 2020 - 2024
Get Microdata
Reference ID
ETH_2020-2024_HFPS_v15_M
Producer(s)
World Bank
Metadata
Documentation in PDF DDI/XML JSON
Created on
Jan 16, 2021
Last modified
Aug 02, 2023
Page views
97827
Downloads
2368
  • Study Description
  • Data Dictionary
  • Downloads
  • Get Microdata
  • Identification
  • Version
  • Scope
  • Coverage
  • Producers and sponsors
  • Sampling
  • Survey instrument
  • Data collection
  • Data processing
  • Data Access
  • Disclaimer and copyrights
  • Contacts
  • Metadata production
  • Identification

    Survey ID number

    ETH_2020-2024_HFPS_v15_M

    Title

    High Frequency Phone Survey 2020-2024

    Abbreviation or Acronym

    HFPS 2020-24

    Country
    Name Country code
    Ethiopia ETH
    Study type

    Socio-Economic/Monitoring Survey [hh/sems]

    Series Information

    The World Bank is providing support to countries to help mitigate the spread and impact of the new coronavirus disease (COVID-19) and Beyond. One area of support is for data collection to inform evidence-based policies that may help mitigate the effects of this disease and other economic and social problems. Towards this end, the World Bank is leveraging the Living Standards Measurement Study - Integrated Survey on Agriculture (LSMS-ISA) program to implement high-frequency phone surveys in 5 African countries - Nigeria, Ethiopia, Uganda, Tanzania, and Malawi. This effort is part of a broader first wave of World Bank-supported national longitudinal high-frequency surveys that can be used to help assess the economic and social implications of the COVID-19 pandemic and other socio-economic shocks on households and individuals.

    Abstract

    The potential impacts of the COVID-19 pandemic in Ethiopia are expected to be severe on Ethiopian households' welfare. To monitor these impacts on households, the team selected a subsample of households that had been interviewed for the Living Standards Measurement Study (LSMS) in 2019, covering urban and rural areas in all regions of Ethiopia. The 15-minute questionnaire covers a series of topics, such as knowledge of COVID and mitigation measures, access to routine healthcare as public health systems are increasingly under stress, access to educational activities during school closures, employment dynamics, household income and livelihood, income loss and coping strategies, and external assistance.

    The survey is implemented using Computer Assisted Telephone Interviewing, using a modular approach, which allows for modules to be dropped and/or added in different waves of the survey. Survey data collection started at the end of April 2020 and households are called back every three to four weeks for a total of seven survey rounds to track the impact of the pandemic as it unfolds and inform government action. This provides data to the government and development partners in near real-time, supporting an evidence-based response to the crisis.

    The sample of households was drawn from the sample of households interviewed in the 2018/2019 round of the Ethiopia Socioeconomic Survey (ESS). The extensive information collected in the ESS, less than one year prior to the pandemic, provides a rich set of background information on the COVID-19 High Frequency Phone Survey of households which can be leveraged to assess the differential impacts of the pandemic in the country.

    Kind of Data

    Sample survey data [ssd]

    Unit of Analysis

    Individual and household

    Version

    Version Description

    Version 15: Edited, anonymized dataset for public distribution

    Version Date

    2025-01-10

    Version Notes

    This version contains updated household weights for round 13 to round 29.

    Scope

    Notes

    The Ethiopia - High Frequency Phone Survey covered the following topics:

    • Household Roster (Round 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
    • Knowledge Regarding the Spread of COVID-19 (Round 1)
    • Behavior and Social Distancing (Round 1, 3, 6, 7)
    • Access to Basic Services (Round 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
    • Employment and Non farm Enterprise (Rounds 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 13, 14, 16, 18)
    • Income Loss and Coping (Round 1, 2, 3, 4, 5, 6, 7)
    • Food Security (Rounds 1, 2, 3, 4, 11, 15)
    • Aid and Support/ Social Safety Nets (Round 1, 2, 3, 4, 5, 6, 7, 9)
    • Agriculture (Round 3, 4, 5, 6, 7, 9, 14, 19)
    • Locusts (Rounds 4, 6, 7)
    • WASH (Round 4, 9)
    • Education and Childcaring (Round 8, 11)
    • Credit (Round 8)
    • Migration (Round 8)
    • Return Migration (Round 8)
    • SWIFT (Round 11)
    • Youth Aspirations and Employment (Round 12)
    • Access to Health Services (Round 13, 14, 15, 16, 17, 18)
    • Food and Non Food Prices (Round 13, 14, 15, 16, 17, 18, 19)
    • COVID-19 Vaccine (Round 14)
    • Economic Sentiments (Round 14, 15, 17, 18)
    • Shock/Coping (Round 16)
    • Subjective Welfare (Round 16)
    • Food Insecurity Experience Scale (Round 17, 18)

    Coverage

    Geographic Coverage

    National coverage - rural and urban

    Universe

    The survey covered all de jure households excluding prisons, hospitals, military barracks, and school dormitories.

    Producers and sponsors

    Primary investigators
    Name
    World Bank
    Producers
    Name Role
    Central Statistical Agency Collaborator
    Funding Agency/Sponsor
    Name Abbreviation Role
    United States Agency for International Development USAID Funded the study
    The World Bank Group WB Funded the study
    Global Financing Facility GFF Funded the study

    Sampling

    Sampling Procedure

    The sample of the HFPS-HH is a subsample of the 2018/19 Ethiopia Socioeconomic Survey (ESS). The ESS is built on a nationally and regionally representative sample of households in Ethiopia. ESS 2018/19 interviewed 6,770 households in urban and rural areas. In the ESS interview, households were asked to provide phone numbers either their own or that of a reference household (i.e. friends or neighbors) so that they can be contacted in the follow-up ESS surveys should they move from their sampled location. At least one valid phone number was obtained for 5,374 households (4,626 owning a phone and 995 with a reference phone number). These households established the sampling frame for the HFPS-HH.

    To obtain representative strata at the national, urban, and rural level, the target sample size for the HFPS-HH is 3,300 households; 1,300 in rural and 2,000 households in urban areas. In rural areas, we attempt to call all phone numbers included in the ESS as only 1,413 households owned phones and another 771 households provided reference phone numbers. In urban areas, 3,213 households owned a phone and 224 households provided reference phone numbers. To account for non-response and attrition all the 5,374 households were called in round 1 of the HFPS-HH.

    The total number of completed interviews in round one is 3,249 households (978 in rural areas, 2,271 in urban areas).
    The total number of completed interviews in round two is 3,107 households (940 in rural areas, 2,167 in urban areas).
    The total number of completed interviews in round three is 3,058 households (934 in rural areas, 2,124 in urban areas).
    The total number of completed interviews in round four is 2,878 households (838 in rural areas, 2,040 in urban areas).
    The total number of completed interviews in round five is 2,770 households (775 in rural areas, 1,995 in urban areas).
    The total number of completed interviews in round six is 2,704 households (760 in rural areas, 1,944 in urban areas).
    The total number of completed interviews in round seven is 2,537 households (716 in rural areas, 1,1821 in urban areas).
    The total number of completed interviews in round eight is 2,222 households (576 in rural areas, 1,646 in urban areas).
    The total number of completed interviews in round nine is 2,077 households (553 in rural areas, 1,524 in urban areas).
    The total number of completed interviews in round ten is 2,178 households (537 in rural areas, 1,641 in urban areas).
    The total number of completed interviews in round eleven is 1,982 households (442 in rural areas, 1,540 in urban areas).
    The total number of completed interviews in round twelve is 888 households (204 in rural areas, 684 in urban areas).
    The total number of completed interviews in round thirteen is 2,876 households (955 in rural areas, 1,921 in urban areas).
    The total number of completed interviews in round fourteen is 2,509 households (765 in rural areas, 1,744 in urban areas).
    The total number of completed interviews in round fifteen is 2,521 households (823 in rural areas, 1,698 in urban areas).
    The total number of completed interviews in round sixteen is 2,336 households.
    The total number of completed interviews in round seventeen is 2,357 households.
    The total number of completed interviews in round eighteen is 2,237 households (701 in rural areas, 1,536 in urban areas).
    The total number of completed interviews in round nineteen is 2,566 households (806 in rural areas, 1,760 in urban areas).

    Weighting

    To obtain unbiased estimates from the sample, the information reported by households needs to be adjusted by a sampling weight (or raising factor) w_h. To construct the sampling weights, we follow the steps outlined in Himelein, K. (2014), which outlines eight steps, of which we follow six, to construct the sampling weights for the HFPS-HH:

    1. Begin with base weights from the Ethiopia Socioeconomic Survey ESS 2018/19 for each household
    2. Incorporate probability of sub-selection of round 1 unit for each of the phone survey households. We calculate the probability of selection for each of the 20 strata in the ESS (urban and rural in each of the 11 regions except for Addis Ababa where we only have an urban stratum) by creating the numerators as the number of completed phone interviews and the denominator as the number of households in the ESS for each stratum.
    3. Pool the weights in Steps 1 and 2.
    4. Derive attrition-adjusted weights for all individuals by running a logistic response propensity model based on characteristics of the household head (i.e. education, labor force status, demographic characteristics), characteristics of the household (consumption, assets, financial characteristics), and characteristics of the dwelling (house ownership, overcrowding).
    5. Trim weights by replacing the top two percent of observations with the 98th percentile cut-off point; and
    6. Post-stratify weights to known population totals to correct for the imbalances across our urban and rural sample. In doing so, we ensure that the distribution in the survey matches the distribution in the ESS.
    • Additional technical details and explanations on each of the steps briefly outlined above can be found in Himelein, K. (2014).

    Survey instrument

    Questionnaires

    The survey questionnaires were administered to all the households in the sample. The questionnaires consisted of the following sections:

    Baseline (Round 1)

    • Household Identification
    • Interview Information
    • Household Roster
    • Knowledge Regarding the Spread of Coronavirus
    • Behavior and Social Distancing
    • Access to Basic Services
    • Employment
    • Income Loss and Coping
    • Food Security
    • Aid and Support/ Social Safety Nets

    Round 2

    • Household Identification
    • Household Roster
    • Access to Basic Services
    • Employment
    • Income Loss and Coping
    • Food Security
    • Aid and Support/ Social Safety Nets

    Round 3

    • Household Identification
    • Household Roster
    • Behavior and social distancing
    • Access to Basic Services
    • Employment
    • Income Loss and Coping
    • Food Security
    • Agriculture
    • Aid and Support/ Social Safety Nets

    Round 4

    • Household Identification
    • Household Roster
    • Access to Basic Services
    • Employment
    • Income Loss and Coping
    • Food Security
    • Agriculture
    • Aid and Support/ Social Safety Nets
    • Locusts
    • WASH

    Round 5

    • Household Identification
    • Household Roster
    • Access to Basic Services
    • Employment
    • Income Loss and Coping
    • Aid and Support/ Social Safety Nets
    • Agriculture
    • Livestock

    Round 6

    • Household Identification
    • Household Roster
    • Behavior and Social Distancing
    • Access to Basic Services
    • Employment
    • Income Loss and Coping
    • Aid and Support/ Social Safety Nets
    • Agriculture
    • Locusts

    Round 7

    • Household Identification
    • Household Roster
    • Behavior and Social Distancing
    • Access to Basic Services
    • Employment
    • Income Loss and Coping
    • Aid and Support/ Social Safety Nets
    • Agriculture
    • Locusts

    Round 8

    • Household Identification
    • Household Roster
    • Access to Basic Services
    • Employment
    • Education and Childcaring
    • Credit
    • Migration
    • Return Migration

    Round 9

    • Household Identification
    • Household Roster Update
    • Access to Basic Services
    • Employment
    • Aid and Support/ Social Safety Nets
    • Agriculture
    • WASH

    Round 10

    • Household Identification
    • Household Roster Update
    • Access to Basic Services
    • Employment

    Round 11

    • Household Identification
    • Household Roster Update
    • Access to Basic Services
    • Employment
    • Education and Childcaring
    • Food Insecurity Experience Scale
    • SWIFT

    Round 12

    • Household Identification
    • Household Roster Update
    • Youth Aspirations and Employment

    Round 13

    • Household Identification
    • Household Roster Update
    • Access to Health Services
    • Employment
    • Food Prices

    Round 14

    • Household Identification
    • Household Roster Update
    • Access to Health Services
    • COVID-19 Vaccine
    • Employment
    • Economic Sentiments
    • Food Prices
    • Agriculture

    Round 15

    • Household Identification
    • Household Roster Update
    • Access to Health Services
    • Economic Sentiments
    • Food Insecurity Experience Scale
    • Food Prices

    Round 16

    • Household Identification
    • Household Roster Update
    • Access to Health Services
    • Employment and Non-farm Enterprises
    • Food and Non-food prices
    • Shocks and Coping Strategies
    • Subjective Welfare

    Round 17

    • Household Identification
    • Household Roster Update
    • Access to Health Services for Individual Household Members (Sample A)
    • Access to Health Services for Households (Sample B)
    • Food and Non-food prices
    • Economic Sentiments
    • Food Insecurity Experience Scale

    Round 18

    • Household Identification
    • Household Roster Update
    • Access to Health Services for Individual Household Members
    • Food and Non-food prices
    • Economic Sentiments (Sample B)
    • Food Insecurity Experience Scale (Sample A)

    Round 19

    • Household Identification
    • Household's Residential Location Verification
    • Household Roster Update
    • Food and Non-food Prices
    • Agriculture Crop
    • Agriculture Livestock

    Data collection

    Dates of Data Collection
    Start End Cycle
    2020-04-22 2020-05-13 Round 1
    2020-05-14 2020-06-03 Round 2
    2020-06-04 2020-06-26 Round 3
    2020-07-27 2020-08-14 Round 4
    2020-08-24 2020-09-17 Round 5
    2020-09-21 2020-10-14 Round 6
    2020-09-19 2020-11-10 Round 7
    2020-12-01 2020-12-21 Round 8
    2020-12-28 2021-01-22 Round 9
    2021-02-01 2021-02-23 Round 10
    2021-04-12 2021-05-11 Round 11
    2021-06-01 2021-06-20 Round 12
    2022-10-03 2020-11-05 Round 13
    2022-12-13 2023-01-13 Round 14
    2023-02-23 2023-03-25 Round 15
    2023-03-26 2023-05-21 Round 16
    2023-07-11 2023-08-04 Round 17
    2023-10-02 2023-10-30 Round 18
    2024-01-05 2024-02-02 Round 19
    Mode of data collection
    • Computer Assisted Telephone Interview [cati]
    Data Collectors
    Name
    Laterite BV
    Data Collection Notes

    The Ethiopia- COVID-19 High Frequency Phone Survey of Households (HFPS) was conducted using Computer Assisted Telephone Interview (CATI) techniques. The household questionnaire was implemented using the CATI software, SurveyCTO. Each enumerator was given a tablet which they used to implement the interviews, along with data bundles to be used on their own mobile phone devices.

    DATA COMMUNICATION SYSTEM: SurveyCTO's built-in data monitoring functions are used. Each enumerator was provided with a data bundle, allowing for internet connectivity and daily synchronization of their tablet. Data was sent to the server daily. Senior Field Supervisors served as the first step in ensuring data quality. Senior Field Supervisors reviewed the survey with enumerators twice daily via one-on-one calls and were always available to address any concerns that arose while performing an interview. At the same time, a Research Analyst was in charge of checking the uploaded data daily to correct errors and work to prevent them in future surveys. The following data quality checks were completed:

    • Daily SurveyCTO monitoring: This included outlier checks, skipped questions, a review of “Other, specify”, other text responses, and enumerator comments. Enumerator comments were used to suggest new response options or to highlight situations where existing options should be used instead. Monitoring also included a review of variable relationship logic checks and checks of the logic of answers. Finally, outliers in phone variables such as survey duration or the percentage of time audio was at a conversational level were monitored. A survey duration of close to 15 minutes and a conversation-level audio percentage of around 40% was considered normal.

    • Dashboard review: This included monitoring individual enumerator performance, such as the number of calls logged, duration of calls, percentage of calls responded to and percentage of non-consents. Non-consent reason rates and attempts per household were monitored as well. Duration analysis using R was used to monitor each module's duration and estimate the time required for subsequent rounds. The dashboard was also used to track overall survey completion and preview the results of key questions.

    • Daily Data Team reporting: The Field Supervisors and the Data Manager reported daily feedback on call progress, enumerator feedback on the survey, and any suggestions to improve the instrument, such as adding options to multiple choice questions or adjusting translations.

    • Audio audits: Audio recordings were captured during the consent portion of the interview for all completed interviews, for the enumerators' side of the conversation only. The recordings were reviewed for any surveys flagged by enumerators as having data quality concerns and for an additional random sample of 2% of respondents. A range of lengths were selected to observe edge cases. Most consent readings took around one minute, with some longer recordings due to questions on the survey or holding for the respondent. All reviewed audio recordings were completed satisfactorily.

    • Back-check survey: Field Supervisors made back-check calls to a random sample of 5% of the households that completed a survey in Round 1. Field Supervisors called these households and administered a short survey, including (i) identifying the same respondent; (ii) determining the respondent's position within the household; (iii) confirming that a member of the the data collection team had completed the interview; and (iv) a few questions from the original survey.

    Data processing

    Data Editing

    DATA CLEANING
    At the end of data collection, the raw dataset was cleaned by the Research team. This included formatting, and correcting results based on monitoring issues, enumerator feedback and survey changes. The details are as follows.

    Variable naming and labeling:
    • Variable names were changed to reflect the lowercase question name in the paper survey copy, and a word or two related to the question.

    • Variables were labeled with longer descriptions of their contents and the full question text was stored in Notes for each variable.

    • “Other, specify” variables were named similarly to their related question, with “_other” appended to the name.

    • Value labels were assigned where relevant, with options shown in English for all variables, unless preloaded from the roster in Amharic.

    Variable formatting:
    • Variables were formatted as their object type (string, integer, decimal, time, date, or datetime).

    • Multi-select variables were saved both in space-separated single-variables and as multiple binary variables showing the yes/no value of each possible response.

    • Time and date variables were stored as POSIX timestamp values and formatted to show Gregorian dates.

    • Location information was left in separate ID and Name variables, following the format of the incoming roster. IDs were formatted to include only the variable level digits, and not the higher-level prefixes (2-3 digits only.)

    • Full Household and Enumeration Area ID variables were given leading 0s to match incoming roster format.

    Observation and variable arrangement:

    • Only consented surveys were kept in the dataset, and all personal information and internal survey variables were dropped from the clean dataset.

    • Roster data is separated from the main data set and kept in long-form but can be merged on the key variable (key can also be used to merge with the raw data).

    • In the main dataset, ii4_resp_id and cs7_hhh_id are the roster IDs of the respondent and household head respectively, and can be merged with individual_id in the roster.

    • The variables were arranged in the same order as the paper instrument, with observations arranged according to their submission time.

    Backcheck data review: Results of the backcheck survey are compared against the originally captured survey results using the bcstats command in Stata. This function delivers a comparison of variables and identifies any discrepancies. Any discrepancies identified are then examined individually to determine if they are within reason.

    Data Access

    Citation requirements

    Use of the dataset must be acknowledged using a citation which would include:

    • the Identification of the Primary Investigator
    • the title of the survey (including country, acronym and year of implementation)
    • the survey reference number
    • the source and date of download

    World Bank. Ethiopia - High Frequency Phone Survey 2020-2024. Ref: ETH_2020-2024_HFPS_v15_M. Dataset downloaded from www.microdata.worldbank.org on [date].

    Disclaimer and copyrights

    Disclaimer

    The user of the data acknowledges that the original collector of the data, the authorized distributor of the data, and the relevant funding agency bear no responsibility for use of the data or for interpretations or inferences based upon such uses.

    Contacts

    Contacts
    Name Affiliation Email
    LSMS The World Bank Group lsms@worldbank.org

    Metadata production

    DDI Document ID

    DDI_ETH_2020-2024_HFPS_v15_M

    Producers
    Name Abbreviation Affiliation Role
    Development Economics Data Group DECDG The World Bank Documentation of the DDI
    Date of Metadata Production

    2023-08-02

    Metadata version

    DDI Document version

    Version 15 (January 2025). Household weights have been updated for round 13 to round 19.

    Version date

    2025-01-10

    Back to Catalog
    IHSN Survey Catalog

    © IHSN Survey Catalog, All Rights Reserved.