Survey of Living Conditions and Household Budgets 2005-2006
2nd Survey of Living Conditions
Integrated Survey (non-LSMS) [hh/is]
The Survey of Living Conditions and Household Bugetary Survey (SLC/HBS) is the second in a series of household surveys. The first SLC was done in 1995. Both the 1995 and the 2005 surveys were designed by Consultants hired by the CDB in collaboration with the CSO. The 2005 - 2006 survey also included a household budget survey component which was used to develop a national basket of expenditure shares for updating the Consumer Price Index. The primary purpose of the survey was to measure poverty indicators, the headcount, Forster-Greer-Thorbecke and other related indicators.
The main objective of the survey is to assess the living conditions of the population of St Lucia and to develop a national basket of goods and services for updating of the consumer price index. The survey contains information on housing conditions, cost of accomodation, cost of rountine household maintenance and repairs, annual cost of purching furniture and furishings for the household, cost of vehicle operations, where items are purchased, migration, anthopometric data, demographics, health, education, labour force, crime, clothing expenses, health expenses and income.
Kind of Data
Sample survey data [ssd]
Unit of Analysis
Individuals, households, spenders (defined as persons age 18 and over and employed)
The scope of the Survey of Living Conditions and Household Budgets 2006 includes:
- HOUSEHOLD: Household characteristics, household listing, cost of accomodation, cost of rountine household maintenance, major types of household expenses, annual cost of funiture, home grown production consumed, vehicle expenses, migration and anthropometric data
- PERSONS: Demographics, health, education, labour force, clothing expenses, health expenses and income
- SPENDER: Data on Expenditure made by the spender
- CHILDREN: Children's characteristics, birth registration, breastfeeding, care of illness, immunization and anthropometry
consumption/consumer behaviour [1.1]
general health [8.4]
specific diseases and medical conditions [8.9]
drug abuse, alcohol and smoking [8.3]
equality and inequality [12.4]
economic conditions and indicators [1.2]
income, property and investment/saving [1.5]
compulsory and pre-school education [6.2]
post-compulsory education [6.5]
health care and medical treatment [8.5]
National coverage, all Administrative Districts
The survey covered all de jure household members (usual residents), the fertility section of the person questionnaire covers all women aged 15-49 years resident in the household, the anthropometic section covers all children aged 0-4 years (under age 5) resident in the household and the expenditure data covers all spenders 18 year of age and over and employed.
Producers and sponsors
Central Statistical Office for Saint Lucia
Ministry of Economic Affairs
Caribbean Development Bank
Government of St Lucia - Ministry of Social Transformation
Kairi Consultants Ltd
Consultants hired by Caribbean Development Bank
1302 households were selected for the sample. Of these, 1222 were occupied households and were successfully interviewed for a response rate of 94%. Within these households, 4319 persons were successfully interviewed (response rate 93.9%).
The stratification is done by district and is based on the percentage of agricultural workers for rural EDs (Enumeration Districts) and percentage of professional workers for urban EDs. There are two stages of selection, firstly the selection of EDs in all Districts then the selection of households using a random start and systematic selection proceedure. Households which refused or could not be contacted were replaced.
The sample frame used was based on the May 2001 Census and the sample size was 2.78% of the frame. Stratification was done on the district (District) and then by ED (Enumeration District) and finally by household (hhno).
Sample weights were calculated for all of the data files by district.
Sample weights for the household data were computed as the inverse of the probability of selection of the household, computed at the sampling domain level represented by the district.
The household weights were adjusted for non-response at the domain level, and were then normalized by a constant factor so that the total weighted number of households equals the total unweighted number of households. The household weight variable is called WT and is used with the Household data and the Person data. Sample weights for the diaries were computed separately based on the number of diaries collected for each household within each domain of study.
Dates of Data Collection
Data Collection Mode
Data Collection Notes
The pre-test for the survey took place from July 15, 2005 - July 25, 2005 and included 7 interviewers who would later become supervisors and enumerators for the main survey.
Each interviewing team comprised of 3-4 interviewers, together with a supervisor and a driver in some cases. A total of 35 interviewers, 6 supervisors and 4 field editors were used. Data collection took place over a period of about 6 months from September 2, 2005 until March 31, 2006. Interviewing took place everyday throughout the fieldwork period but was organised in two week periods in conformity with the collection of daily expenditure diaries.
Four Questionnaires will be administered to each household; each household will be visited three times. On the first visit the Household and the Individual Questionnaires will be administered. At the start of the first week one Daily Diary of Expenditure Questionnaire for the household and a Memory Jogger notebook for each spender in the household will be left with household respondents to record all purchases over the next one-week period. The 2nd visit to the household will occur at the end of the first week at which point the Daily Diary completed by the household for the first week and all memory jogger notebooks will be collected and the second week’s Diary and memory joggers will be left. The 3rd visit to the household will occur at the end of the second week at which point the Daily Diary completed by the household and memory joggers from each spender for the second week will be collected.
Central Statistical Office
Ministry of Finance and Economic Affairs
Four Questionnaires were administered to each household; each household was visited at least three times. On the first visit the Household and the Individual Questionnaires were administered. At the start of the first week one Daily Diary of Expenditure Questionnaire for the household and a Memory Jogger notebook for each spender in the household was left with household respondents to record all purchases over the next one-week period. The 2nd visit to the household occurred at the end of the first week at which point the Daily Diary completed by the household for the first week and all memory jogger notebooks were collected and the second week’s Diary and memory joggers were left. The 3rd visit to the household occurred at the end of the second week at which point the Daily Diary completed by the household and memory joggers from each spender for the second week was collected.
THE HOUSEHOLD EXPENDITURE SURVEY QUESTIONNAIRES
There are four (4) questionnaires to be administered in the survey.
1. S.L.C.H.B Household Questionnaire
2. S.L.C.H.B Individual/Person Questionnaire
3. S.L.C.H.B Memory Jogger
4. S.L.C.H.B Household Daily Diary of Expenditure
THE STRUCTURE OF THE QUESTIONNAIRES
Front Page: Identification and control
Section 1: Housing conditions and household assets
Section 2 Part 1,2,3: Expenditure on accommodation, owned and rented
Section 2 Part 4: Expenditure on accommodation - Repair and maintenance of dwelling
Section 3 : Major types of household expenses
Section 4 Part 1: Furniture, furnishings and household equipment
Section 4 Part 2: Repairs and servicing of household articles
Section 5 : Agriculture products produced and consumed at home
Section 6: Transportation
Section 7: Regularity of purchase and main type of outlet
Section 8: For Heads of households only (Status of previous household head)
Section 9: To be completed for all former household members living away from the household in the past five years
Section 10: For children under the age of five years
Control: Identification and control
Section 1: Characteristics – For all persons
Section 2: Migration – For all persons
Section 3: Health – For all persons
Section 4: Education – For all persons
Section 5: Employment – For person 15 years and over
Section 6: Marital, union status and fertility for persons – For persons over the age of 15 years
Section 7: Crime
Section 8: Clothing and footwear consumed in the last 3 months
Section 9: Other expenses
Section 10: Other Disbursements
Section 11: Income
Front Page: Identification and control
Daily Record: Pages 1 to 7
Back Page: Notes on the method of completing the daily diary
Daily Diary of Expenditure
Front Page: Identification and control
Pages 2 – 3: Notes on the method of completing the daily diary
Example: Example of method of completion (Pages 4, 5, 6)
Day One: Daily expenditures (Pages 7, 8, 9, 10)
Day Two: Daily expenditures (Pages 11, 12, 13, 14)
Day Three: Daily expenditures (Pages 15, 16, 17, 18)
Day Four: Daily expenditures (Pages 19, 20, 21, 22)
Day Five: Daily expenditures (Pages 23, 24, 25, 26)
Day Six: Daily expenditures (Pages 27, 28, 29, 30)
Day Seven: Daily expenditures (Pages 31, 32, 33, 34)
Data editing took place at a number of stages throughout the processing, including:
a) Office editing and coding
b) During data capture which involved the scanning and verification of the data
c) Structure checking and completeness was done in SQL 2000 Enterprise Server
d) Secondary editing was done in SPSS
e) Structural checking of SPSS data files
Detailed documentation of the editing of data can be found in the "Data Editing and coding guidelines" document provided as an external resource.
Data were processed in clusters, with each cluster being processed as a complete unit through each stage of data processing. Each cluster goes through the following steps:
1) Questionnaire reception
2) Office editing and coding
3) Batch preparation (No of Pages in Batch)
5) Verification of Data which recognised by software OCR engine
6) Back up of raw data
7) Secondary editing of Data in SQL Server and structural editing of data takes place
8) Edited data back up
After all clusters are processed, all data is concatenated together and then the following steps are completed for all data files:
10) Export to Foxpro where summary variables are included, weights are added then data is exported to SPSS where further editing takes place in 3 files (houses, persons, diaries)
11) Recoding of variables needed for analysis
12) Adding of sample weights
13) Calculation of adult equivalent expenditure quintiles and merging into data
14) Checking of SPSS files
15) Data quality tabulations
16) Production of analysis tabulations
Details of each of these steps can be found in the data processing documentation, data editing guidelines, data processing programs in SQL Server 2000, Fox pro V8 and SPSS, and tabulation guidelines.
Data scanning and verification was conducted by 2 scanner operators and 4 verifier operators in tow shifts, supervised by 2 verification supervisors, using a total of 5 computers (4 verification computers plus one supervisors' computer). All data verification was conducted at the Central Statistical Office with a highly structured data capture program built into the scanning template, using system controlled approach that controlled entry of each variable. All range checks and skips were controlled by the program and operators could not override these. A limited set of consistency checks were also included in the data scanning program template. Open-ended responses ("Other" answers) were not entered or coded.
Structure and completeness checking ensured that all questionnaires for the cluster had been entered, were structurally sound. 100% verification of all variables was performed due to the scanning process used.
After completion of all processing in TELEform V10.2, all individual cluster files were stored in SQL Server and a proceedure for constant and regular backup was implemented through the process of data scanning and validation. For tabulation, and more specifically data quality tabulations and other analysis activities SPSS versions 14.0 was used.
After transferring all files to SPSS, certain variables were recoded for use as background characteristics in the tabulation of the data, including grouping age, education, geographic areas as needed for analysis. In the process of recoding ages and dates some random imputation of dates (within calculated constraints) was performed to handle missing or "don't know" ages or dates. Additionally, a wealth (asset) index of household members was calculated using principal components analysis, based on household assets, and both the score and quintiles were included in the datasets for use in tabulations.
Estimates of Sampling Error
Estimates from a sample survey are affected by two types of errors: 1) non-sampling errors and 2) sampling errors. Non-sampling errors are the results of mistakes made in the implementation of data collection and data processing. Numerous efforts were made during implementation of the 2005-2006 MICS to minimize this type of error, however, non-sampling errors are impossible to avoid and difficult to evaluate statistically.
If the sample of respondents had been a simple random sample, it would have been possible to use straightforward formulae for calculating sampling errors. However, the SLC/HBS 2005-2006 sample is the result of a multi-stage stratified design, and consequently needs to use more complex formulae. The CENVAR module of the IMPS 4.1 has been used to calculate sampling errors for the SLC/HBS 2005-2006. This module uses the Taylor linearization method of variance estimation for survey estimates that are means or proportions.
Sampling errors have been calculated for a select set of statistics (all of which are proportions due to the limitations of the Taylor linearization method) for the national sample, urban and rural areas, and for each of the five regions. For each statistic, the estimate, its standard error, the coefficient of variation (or relative error -- the ratio between the standard error and the estimate), the design effect, and the square root design effect (DEFT -- the ratio between the standard error using the given sample design and the standard error that would result if a simple random sample had been used), as well as the 95 percent confidence intervals (+/-2 standard errors).
Details of the sampling errors are presented in the sampling errors appendix to the report and in the sampling errors table presented in the external resources.
A series of data quality tables and graphs are available to review the quality of the data and include the following:
- Age distribution of the household population
- Age distribution of eligible children and children for whom the mother or caretaker was interviewed
- Age distribution of children under age 5 by 3 month groups
- Presence of mother in the household and person interviewed for the under 5 questionnaire
- School attendance by single year age
- Sex ratio at birth among children ever born, surviving and dead by age of respondent
- Distribution of women by time since last birth
- Scatter plot of weight by height, weight by age and height by age
- Graph of male and female population by single years of age
- Population pyramid
The results of each of these data quality tables are shown in the appendix of the final report and are also given in the external resources section.
The general rule for presentation of missing data in the final report tabulations is that a column is presented for missing data if the percentage of cases with missing data is 1% or more. Cases with missing data on the background characteristics (e.g. education) are included in the tables, but the missing data rows are suppressed and noted at the bottom of the tables in the report (not in the SPSS output, however).
Central Statistical Office for Saint Lucia
Edwin St Catherine, Director of Statistics
CSO, Tel: 758 453 7670
Non-anonymized versions of the datasets are confidential.
Use of the dataset must be acknowledged using a citation which would include:
- the Identification of the Primary Investigator
- the title of the survey (including country, acronym and year of implementation)
- the survey reference number
- the source and date of download
Disclaimer and copyrights
The user of the data acknowledges that the original collector of the data, the authorized distributor of the data, and the relevant funding agency bear no responsibility for use of the data or for interpretations or inferences based upon such uses.
(c) 2006, Saint Lucia Central Statistics Office
DDI Document ID
Central Statistical Office
Documentation of Data
Date of Metadata Production
DDI Document version
Version 01: Adopted from " DDI-SLU-CSO-2008-v1.1" DDI that was done by metadata producer mentioned in "Metadata Production" section.