Skip to main content

Training: A Way to Improve Data Quality

Currently, I'm involved in the field team training for a second round of data collection for the Bricks project. As we're in the intervention part of the study, this round is going to be as interesting and challenging as the previous round. There are a few important aspects which, if emphasized at the time of training, could help not only improve the data quality but also reduce the time and effort needed for consecutive rounds of data cleaning.

Here are a couple of pointers from our training:
  • Correct Respondent Identification: If the enumerators are entrusted with the responsibility of identifying the respondent, then the recruitment criteria should be clearly explained and listed. Along with this, all possible cases of contradictions, eliminations and preferences should also be explained. If there is already an identified respondent then the replacement cases should be dealt with specifically. Under all circumstances, any conflict should be reported immediately and the field team should take note of such cases.
  • Codes for options: All the codes for the answers should be discussed and explained properly. Often, there are individual interpretations which lead to bias in selection of the answers for the same response. This can be clearly observed when we start looking at the data enumerator-wise. To avoid this mistake, at the time of training there can be one master respondent who can answer to a group and then the dummy data entered can be verified.
  • Translation errors: Correct translation of the question is very important for the enumerator to understand the context of the question. The question will not make any sense to the respondent if it is not in the correct context and hence, the data  collected can be inaccurate. At times, there are variations in the dialect of the language and so it is important for the enumerators to know about these variations
  • Comments and Closing: Since there are always a few observations which an enumerator captures during the course of the survey, it is very helpful to record these observations. Having specific and to the point comments is very useful as it gives us an idea about the respondent’s state of mind at the time of response. Also, the closing status of each respondent should clearly indicate how successful the survey has been. For e.g. separate codes for Refusal, completion, substitution etc. should be used at the end of the survey.
  • Pilot experience: The field experience from the pilot rounds should be shared with the enumerators. This gives them a pre-field training experience and all possible questions from respondents can be sorted and discussed to bring about more clarity.
  • Handling exceptions: In case of refusals or incomplete surveys, enumerators should be trained to convince respondents of the objectives and benefits of the data collection. Also, it should be explained that refusals are the individual right of the respondent.
  • Mathematical calculations: This is another important aspect during data collection, particularly if there are conversions involved. There are two important points that should be dealt with at the time of training. The unit of data is very important and so it should be clearly noted at the time of response. Secondly, sometimes (like in case of land area, unit of crop production etc) there are mixed units (large- Quintals  and small- Kilograms). In these cases, the conversions have to be done so that the data is in one standard unit. With a lot of practice and the help of the calculators, this can be done easily and accurately or surveyors must be given the option to select units within the survey so these conversions can be done at a later point in time.

Though this list is not exhaustive, if dealt with meticulously at the time of training, it can definitely deliver improved data quality. Along with this, if there is digital data collection with programmed checks and bounds, the errors can be further minimized. I personally feel that, the more effort we put at the time of training, the better data quality we can get.


  1. A very nice article. Basics of data collection and surveyor training is explained in simple words.


Post a Comment

Popular Posts

Vocationalisation of education in India: Current Scenario, Key Challenges and New directions

“Every handicraft has to be taught not merely mechanically as is done today, but scientifically. This is to say, the child should learn the why and wherefore of every process.” - Gandhi’s Philosophy of Education

The greatest challenge in Indian education system today is to provide skill based education to the youth. This is exacerbated by a mismatch in demand and supply for the skilled workforce. The penetration of vocational education and training remains poor not only in rural areas, but also in urban regions where there is a higher installed capacity to impart the same. This post is an attempt to make the readers understand the need of vocational education in India. Also, this is an attempt to summarise a few recommendations on the same. 
A recent survey (61st round) conducted by the NSSO found that:

1. The percentage of population that completed primary education was 70%, but less than 10% went on to complete a graduation course and above. Almost 97% of individuals in the age bracket…

Rockstar of Financial Inclusion: Business Correspondent Model of India

About Author:  Jatinder Handoo is a social business enthusiast and a branchless banking practitioner. Currently works at FINO PayTech Ltd and is based out of Mumbai. He is reachable at
India is a hot bed of financial exclusion. A country which houses nearly 16% of the global population  has more than 65% of its people outside the formal financial system (Global Findex 2012). The Indian banking system has adopted multiple approaches to make universal financial inclusion a reality right from early days Indian post-independence banking system. Be it bank nationalization in 1969 or formation of Regional Rural Banks. Formation of NABARD or fostering microfinance through Bank-SHG linkage programme in early 90’s. A shimmering ray hope was rekindled with the growth of JLG based microfinance, however later studies made it clear that the model is credit led, concentrated predominately in the southern region of India thus could not be seen as painting complete financial…

A Platform for Knowledge - Enabling people to learn ..

I received a rather interesting link/website via my email today. The link read as MR University and all I could think of was, "Ok, this must be another website portal of some university or college". Well, on clicking the link and looking through the contents of the site, I was pleasantly surprised. The site is an online education portal or platform that allows users or teachers to upload short videos on topics or lessons they wish to impart. First topic that I come across is Development Economics.
The intent of the website is eloquently put out by the two economists, Tyler Cowen and Alex Tabarrok in the intro video. What started as a blog focusing on economics and its various implications in understanding why things are the way they are around us, has now an interesting addition. A video portal titled MRUniversity or Marginal Revolution University that focuses on online education with subjects pertaining to economics. It brought back to my mind,…