The 2011-12 National Risk and Vulnerability Assessment (NRVA) is a survey, which provides national and international stakeholders with information that is required for monitoring development progress and formulate development policies and programmes. The survey was conducted by the Central Statistics Organization (CSO) of the Islamic Republic of Afghanistan and provides results that are representative at national and provincial level. It covered 20,828 households and 159,224 persons across the country, and is unique in the sense that it also includes the nomadic Kuchi population of Afghanistan.
Kind of Data
Sample survey data [ssd]
Unit of Analysis
Unit of Analysis
Producers and sponsors
Authoring entity/Primary investigators
Central Statistics Organization (CSO)
Government of the Islamic Republic of Afghanistan
Funded the study
The sampling design of the NRVA 2011-12 was developed to produce results that are representative at national and provincial level, as well as for Shamsi calendar seasons. In total 35 strata were identified, 34 for the provinces of Afghanistan and one for the nomadic Kuchi population. Stratification by season was achieved by equally distributing data collection over 12 months within the provinces. For the Kuchi population, the design only provided sampling in winter and summer when communities tend to temporarily settle. Given the total sample size of 21,000 and uniform sample size per stratum, each province and the Kuchi stratum was assigned with 600 households to be interviewed.
The sampling frame used for the resident population in the NRVA 2011-12 was the pre-census household listing conducted by CSO in 2003-05. Households were selected on the basis of a two-stage cluster design within each stratum. In the first stage Enumeration Areas (EAs) were selected as Primary Sampling Units (PSUs) with probability proportional to EA size (PPS). Subsequently, in the second stage ten households were selected as the Ultimate Sampling Unit (USU). The design thus provided for 60 clusters per province, implying data collection of five clusters (50 households) per province per month and in total 170 clusters (1,700) households per month and 2,040 clusters (20,400 households) in the full year of data collection.
The Kuchi sample was designed on basis of the 2003-04 National Multi-sectoral Assessment of Kuchi (NMAK-2004). For this stratum a community selection was implemented with PPS and a second stage selection with again a constant cluster size of ten households. The 60 clusters (600 households) for this stratum were equally divided between the summer and winter periods within the survey period.
Deviations from the Sample Design
The reality of survey taking in Afghanistan imposed a number of deviations from the sampling design. In the first six fieldwork months areas that were inaccessible due to insecurity were replaced by sampled areas that were scheduled for a later month, in the hope that over time security conditions would improve and the original cluster interviews could still be conducted. In view of sustained levels of insecurity, from the sixth month of data collection onward clusters in inaccessible areas were replaced by clusters drawn from a reserve sampling frame that excluded insecure districts. In addition, delays in fieldwork caused an uneven seasonal coverage.
Sample weights were calculated for up-scaling the surveyed households and population to the total number of households and population in Afghanistan. The calculation was based on the official CSO population estimate by province for January 2012 and average provincial household size derived from the survey. In view of the unequal distribution of the sample across seasons, a post-stratification adjustment was imposed to give equal weight to the seasons.
Dates of Data Collection (YYYY/MM/DD)
Mode of data collection
Type of Research Instrument
The core of NRVA 2011-12 is a household questionnaire consisting of 15 subject sections, 11 administered by male interviewers and answered by the male household representative (usually the head of household), and four asked by female interviewers from female respondents. In addition, the questionnaire included three modules for identification and monitoring purposes.
In addition to household information, data were collected at community level through two community questionnaires, one male and one female Shura questionnaire. Finally, the NRVA survey instrument included a questionnaire to collect data on market prices for food items and a few other commodities.
Central Statistics Organization
Government of the Islamic Republic of Afghanistan
Data processing in CSO Headquarters was done in parallel to the fieldwork and started upon arrival of the first batch of completed questionnaires in May 2011. The first stage consisted of manual checking by three questionnaire editors. Subsequently, the questionnaire batch was submitted for data entry. The data entry staff received two rounds of training before actual data capture started. In the course of the survey, the team was expanded to 30 operators to keep up to eliminate the backlog that arose due to double data entry.
Data capture was done with a specially designed MS Access programme, which was piloted to ensure a smooth performance. The database was equipped with VB coding to perform basic consistency and range checks. The database programme also included several data-cleaning and data-management procedures for process monitoring and daily back-ups by the Database Director.
The principle of double data entry was introduced to avoid high levels of manual data capture errors. For each of the double-entered batches integrity checks were performed at individual, household and batch level. Emerging issues were resolved by a team of seven data editors. A complementary MS Access programme identified discrepancies between the batches of double-entered data, which were subsequently reconciled and again tested for integrity.
Further data editing was first performed on the MS Access database. This database was then transferred to Stata software for the application of programmes to identify data flaws and either perform automatic imputation or manual screen editing. Data processing was completed in September 2012.
Use of the dataset must be acknowledged using a citation which would include:
- the Identification of the Primary Investigator
- the title of the survey (including country, acronym and year of implementation)
- the survey reference number
- the source and date of download
Disclaimer and copyrights
The user of the data acknowledges that the original collector of the data, the authorized distributor of the data, and the relevant funding agency bear no responsibility for use of the data or for interpretations or inferences based upon such uses.