Hostname: page-component-78c5997874-94fs2 Total loading time: 0 Render date: 2024-11-10T17:03:42.427Z Has data issue: false hasContentIssue false

Evaluation of data accuracies within a comprehensive geospatial-health data surveillance platform: SOMAARTH Demographic Development and Environmental Surveillance Site, Palwal, Haryana, India

Published online by Cambridge University Press:  27 December 2018

Natasha J. Howard
Affiliation:
Wardliparingga Aboriginal Research Unit, Division of Health Sciences, University of South Australia, North Terrace, Adelaide, South Australia, Australia South Australian Health and Medical Research Institute, Adelaide, South Australia
Shikha Dixit
Affiliation:
SOMAARTH Demographic Development and Environmental Surveillance Site, International Clinical Epidemiology Network (INCLEN) Trust, New Delhi, India
Hasan Raja Naqvi
Affiliation:
Department of Geography, Faculty of Natural Sciences, Jamia Millia Islamia, New Delhi, India
Atiqur Rahman
Affiliation:
Department of Geography, Faculty of Natural Sciences, Jamia Millia Islamia, New Delhi, India
Catherine Paquet
Affiliation:
School of Health Sciences, University of South Australia Division of Health Sciences, Adelaide, South Australia, Australia
Mark Daniel
Affiliation:
Health Research Institute, University of Canberra Faculty of Health, Canberra, Australian Capital Territory, Australia Department of Medicine, St. Vincentʼs Hospital, The University of Melbourne, Melbourne, Australia
Narendra K. Arora*
Affiliation:
SOMAARTH Demographic Development and Environmental Surveillance Site, International Clinical Epidemiology Network (INCLEN) Trust, New Delhi, India
*
Author for correspondence: Narendra K. Arora, E-mail: nkarora@inclentrust.org
Rights & Permissions [Opens in a new window]

Abstract

Evidence exists of an increasing prevalence of chronic conditions within developed and developing nations, notably for priority population groups. The need for the collection of geospatial data to monitor the health impact of rapid social-environmental and economic changes occurring in these countries is being increasingly recognized. Rigorous accuracy assessment of such geospatial data is required to enable error estimation, and ultimately, data utility for exploring population health. This research outlines findings from a field-based evaluation exercise of the SOMAARTH DDESS geospatial-health platform. Participatory-based mixed methods have been employed within Palwal-India to capture villager perspectives on built infrastructure across 51 villages. This study, conducted in 2013, included an assessment of data element position and attribute accuracy undertaken in six villages, documenting mapping errors and land parcel changes. Descriptive analyses of 5.1% (n = 455) of land parcels highlighted some discrepancies in position (6.4%) and attribute (4.2%) accuracy, and land parcel changes (17.4%). Furthermore, the evaluation led to a refinement of the existing geospatial health platform incorporating ground-truthed reflections from the participatory field exercise. The evaluation of geospatial data accuracies contributes to understandings on global public health surveillance systems, outlining the need to systematically consider assessment of environmental features in relation to lifestyle-related diseases.

Type
Original Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
Copyright © The Author(s) 2018

Background

Increased attention has been drawn to the role of environmental factors in the aetiology of chronic diseases, such as cardiovascular disease and type 2 diabetes [Reference Daniel, Lekkas, Cargo, Stankov and Brown1Reference Diez Roux and Mair3]. Notably, there is a need to focus on the burden of contemporary population health challenges experienced within low- and middle-income countries [Reference Daar, Singer, Leah Persad, Pramming, Matthews, Beaglehole, Bernstein, Borysiewicz, Colagiuri, Ganguly, Glass, Finegood, Koplan, Nabel, Sarna, Sarrafzadegan, Smith, Yach and Bell4]. Mohindra et al. [Reference Mohindra, Mukherjee, Khan and Thresia5] have specifically outlined for India the ‘high level of health needs and new public health challenges arising in the context of rapid economic growth and social change (p. 1)’. Key strategies and methods recommended for pursuing an equity-oriented public health research agenda were the investment in data systems and development of inter-disciplinary approaches. Despite the importance of these recommendations, there has been limited platforms focusing on the need for collection, maintenance and surveillance of geospatial-health data systems to address complex social determinants of health embedded within the places in which people live [Reference Daniel, Kestens and Paquet6Reference Chaix, Kestens, Bean, Leal, Karusisi, Meghiref, Burban, Fon Sing, Perchoux, Thomas, Merlo and Pannier8].

Traditionally, public health surveillance systems have been defined by the World Health Organization (WHO) as ‘the continuous, systematic collection, analysis and interpretation of health-related data needed for the planning, implementation, and evaluation of public health practice’ [Reference Campostrini and McQueen9]. Broadening public health surveillance systems to incorporate geospatial information on risk conditions beyond conventional health-related data will require in the first instance the systematic capture of detailed environmental influences on health. Demographic and health surveillance systems within low- and middle-income countries, such as those aligned with the INDEPTH network [Reference Sankoh and Byass10], have emerged to explore the dynamic nature of population health, within and across communities. An expansion beyond traditional health issues, such as infectious diseases, vaccine sciences, reproductive health and access to safe water, have been supported by calls for surveillance systems to explore behavioural risk factors associated with chronic diseases [Reference Campostrini, McQueen, Taylor and Daly11]. Moreover, the rapid social and economic changes also require systems to dynamically capture demographic and health changes in relation to the environmental risk conditions, through the application of remote-sensing and geographic information system (GIS) technologies. To date, demographic and health surveillance systems within low- and middle-income countries have not routinely captured environmental risk conditions. Evidence on which risk conditions to intervene on is poor, however, and exacerbated by a lack of data infrastructures that integrate the environmental (i.e. spatial) exposures, risk factors and health outcomes needed to elucidate the environmental factors and mechanisms as applied across or within different settings. Most notably, advances in geospatial-health infrastructures will be required within disadvantaged and minority populations.

Matthews et al. [Reference Matthews, Moudon and Daniel12] have supported the need for the collection of geospatial data, yet concurrently have outlined the vital requirement to assess these data for quality, and thus utility for application within population health. Research into place and health has seen the undertaking of geospatial data validation of existing administrative data sources such as seen with food retail and physical activity resources [Reference Paquet, Daniel, Kestens, Leger and Gauvin13], and valuation or landuse databases [Reference Sharkey and Horel14], for example, commercial database validation using the Australian telephone directory, ‘Yellow Pages’ [Reference Hooper, Middleton, Knuiman and Giles-Corti15]. Such research confirms that information obtained from commercial databases must be treated with caution; nonetheless, these sources have potential to further our understandings on place–health relations. Zhang et al. have outlined the importance of understanding measurement errors for applications in spatial epidemiology, yet did not explore beyond the inaccuracies of geospatial reference points (e.g. x and y coordinates in space) [Reference Zhang, Manjourides, Cohen, Hu and Jiang16]. Furthermore, assessments of applications such as Google Street View [Reference Rundle, Bader, Richards, Neckerman and Teitler17] and Google Earth [Reference Clarke, Ailshire, Melendez, Bader and Morenoff18] have been undertaken. Other assessments of geospatial data have seen the collection within urban areas of residential features according to a direct observation scale [Reference McDonell and Waters19]. Field-based measurement tools are beneficial in their ease to administrate and ability to capture aspects of the built and social environment, as well as demonstrating value within rural regions where access to secondary data sources is limited. There has been no study to our knowledge within low- and middle-income countries that has undertaken a validation of environmental data features integrated with social or health surveillance systems.

To address the identified need for geospatial health surveillance platforms, the International Clinical Epidemiology Network (INCLEN) Trust developed a large-scale surveillance system within the Palwal District, Haryana, India – the SOMAARTHFootnote Footnote 1 Demographic Development and Environment Surveillance Site (DDESS) [Reference Dixit, Arora, Rahman, Howard, Singh, Vaswani, Das, Ahmed, Mathur, Tandon, Dasgupta, Chaturvedi, Jethwaney, Dalpath, Prashad, Kumar, Gupta, Dube and Daniel20]. During the SOMAARTH platform preparation, a field-based evaluation of geospatial data element accuracy was undertaken within rural Indian villages undergoing rapid environmental, economic and social change. The encounters during the evaluation informed the refinement of data elements and will provide future directions and considerations for public health surveillance data systems within the context of low- and middle-income countries.

Methods

Study background

SOMAARTH DDESS is located in Palwal District, the 21st District of Haryana State, covering a regional area of 135 933 hectares and divided into four blocks; Palwal, Hodal, Hassanpur and Hathin (Fig. 1).

Fig. 1. SOMAARTH Demographic and Development Environmental Surveillance Site, study region, Palwal District, Haryana State, India.

The Palwal District relies predominantly on an agricultural industry, with these foundations crucial to the region's economic livelihood. There are rapid social, economic and urban form changes being witnessed and anticipated in the near future with the development, in 2011, of the Kundali-Manesar-Palwal (KMP) Expressway. Furthermore, the villages that are located in proximity to the National Highway-2 (NH-2) Delhi-Agra Highway and Delhi National Capital Region (NCR) are experiencing transformation of agrarian land to educational, commercial and industrial lands, most notably with new manufacturing and service industries within these areas. These developments most likely reflect the population growth of 25.5% between 2001 and 2011 in the Palwal District [21].

SOMAARTH DDESS geospatial health platform

The SOMAARTH DDESS includes 51 villages and three blocks which are bounded by the NH-2 on the east, Palwal-Mewat State Highway on the north-western side and Nuh-Hodal State Highway on the south side (Fig. 1). The data platform enables measurement of rapid regional development through the village-specific capture of proximities to foci including businesses, industrial and economic zones, road networks, and the nature of the social environment (e.g. education, income, caste and religion). The SOMAARTH DDESS is innovative in the development of a comprehensive GIS that will complement a demographic, development and environmental surveillance system incorporating social, behavioural and health data, allowing for place–health insights into both communicable and non-communicable disease outcomes. Human Research Ethics Committee approvals have been received from the INCLEN Trust International Committee (Ref No.s IIEC 010 and IIEC002) and Lucknow Ethics Committee (Ref 23/LEC/10).

A mixed-method approach to SOMAARTH DDESS geospatial data development included four steps: (1) an on-site participatory exercise with local village members to produce paper-based village maps; (2) digitization of land parcels and features of the built and physical environment using very high-resolution satellite imagery; (3) further attribution, validation and refinement of geospatial data layers through villager and field worker consultation and completion of a population Census and (4) data update and surveillance system maintenance.

QuickBird™ satellite imagery was used to prepare various geospatial data on rural areas and store the information (digital village layer) within a GIS domain [Reference Dixit, Arora, Rahman, Howard, Singh, Vaswani, Das, Ahmed, Mathur, Tandon, Dasgupta, Chaturvedi, Jethwaney, Dalpath, Prashad, Kumar, Gupta, Dube and Daniel20]. These layers included detail on features at their finest spatial unit; water bodies were represented as polygons, roads as either polygon or line, railway tracts as line features, landmarks as a point location (e.g. tube wells) and dwelling units or non-residential features as a land parcel. A land parcel was spatially depicted as a polygon feature representing an area of private or commercial ownership.

Each land parcel was subsequently assigned by the geospatial technician a landuse categorization according to an adapted system [Reference Dixit, Arora, Rahman, Howard, Singh, Vaswani, Das, Ahmed, Mathur, Tandon, Dasgupta, Chaturvedi, Jethwaney, Dalpath, Prashad, Kumar, Gupta, Dube and Daniel20]. The resulting landuse classification system included three levels. Level I representing ‘Built-Up Land’, ‘Agricultural Land’, ‘Water Bodies’, ‘Waste Land’ and ‘Vacant Land’. Built-Up (Level I) was further refined to Level II, to include the classifications of ‘Residential’, ‘Commercial’, ‘Industrial’, ‘Institutional’, ‘Utilities and Services’ and ‘Agricultural and Others’. Subsequently, each Level II category had been further refined into a Level III classification. The attribute table within the GIS village layer included all land parcels that were characterized within the village. Further information on methodology of SOMAARTH DDESS planning and implementation has been published elsewhere [Reference Dixit, Arora, Rahman, Howard, Singh, Vaswani, Das, Ahmed, Mathur, Tandon, Dasgupta, Chaturvedi, Jethwaney, Dalpath, Prashad, Kumar, Gupta, Dube and Daniel20].

Evaluation exercise of SOMAARTH DDESS

Following the SOMAARTH DDESS data creation, the objective of this study was to evaluate position and attribute accuracy for the existing geospatial elements (land parcel, road and landuse classifications) integrated into the geospatial health platform.

Erickson and Baker [Reference Erickson and Baker22] argue for the importance of assessing the ‘what’ as well as the ‘where’ with regards to the accuracy of data contained within a geospatial data system. Position accuracy entails an assessment of the location (or the ‘where’), for example, through observation of land parcels (e.g. size and shape). Attribute accuracy (or the ‘what’) has been described by Goodchild [Reference Goodchild, Guptill and Morrison23] to be one of the major contributors to the quality of geospatial data, and an attribute can be defined as ‘a fact about some location, set of locations, or feature on the surface of the earth (p. 59)’. Knowledge gained on the levels of error and ‘fitness for the user's need’ through assessing geospatial data element position and attribute accuracy [Reference Erickson and Baker22] will further interpretations of the relationship between environmental risk conditions and health (i.e. place–health relations). This study in collaboration with the SOMMARTH DDESS geospatial platform aimed to assess corresponding field locations for:

  1. (1) Position accuracy of attributes within a geospatial village data layer, including location of the land parcel, shape and size;

  2. (2) Attribute accuracy of landuse classifications and road surface types assigned to the geospatial village data layer; and

  3. (3) Land parcel changes observed over a period of 2 years.

Study selection

Villages already characterized within the GIS layer according to their built environment were eligible for selection, 32 of the 51 villages met this criteria. Eligible villages were assessed according to whether the village contained a small land parcel count (i.e. <1000 land parcels) or a large land parcel count (i.e. ≥1000 land parcels). Villages were assessed according to their settlement pattern as either linear or circular. A linear pattern of settlement has seen the development of dwellings and structures from a major highway, whereas a circular settlement begins in a central location radiating development in a circular pattern [Reference Sarkar24]. The final sampling frame, as indicated from satellite images in Fig. 2, included four groups for selection [(A) small linear, (B) small circular, (C) large linear, (D) large circular].

Fig. 2. Sample frame circular and linear settlement patterns.

A total of six villages were selected to undertake an assessment, equating to 18.8% of characterized and 11.8% of the total study villages. Two villages were selected randomly from each of the large linear and large circular samples and one from each of small linear and small circular. The selected villages were additionally assessed according to the initial date of data collection. A final visual inspection of the selected villages occurred to ensure a diverse spatial coverage for the field exercise across the study region.

Prior to commencing the field exercise, a 5% sample of land parcels was identified within each selected village, with oversampling in villages with few land parcels to attain a minimum of 30 land parcels. A random selection of 5% of land parcels across the categories was undertaken, with under-sampling of ‘Residential’ (targeted 1% representation) to ensure a focus was maintained on a range of landuse types. Table 1 outlines the selected Palwal District villages also describing the population characteristics and proportions of residential, non-residential and mixed residential land parcels.

Table 1. Study area and sample characteristics

Within the sampled villages, a total of 8901 land parcels had been characterized [median number of land parcels 1484; standard deviation (SD) 947.5]. Overall, 57.3% of land parcels represented a residential dwelling, with the lowest proportion (49.7) of residential within the small rural village of Garhi Vinoda, and the greatest proportion (64.0) of residential dwellings within Durgapur.

Table 1 also outlines a summary of sampled land parcels within each village, as well as summary statistics for each of the sampling frames and overall.

Field assessment

The sampled land parcels identified for assessment were highlighted on paper-based Sector-wise maps. Two assessors were involved, Assessor 1 (NH) was an independent place–health researcher and Assessor 2 (HRN) a trained geospatial researcher involved in the initial characterization. For this field assessment, a village check sheet, attribute table and marked paper-based Sector-wise map were utilized. A SOMAARTH DDESS village field worker was present during the assessment. These field workers were originally recruited from the local village and had knowledge of the region; moreover, for this study, it was desirable that the field worker was involved in the field-based participatory mapping. The SOMAARTH field team ensured that appropriate access was gained to the village, including notifying leaders (Gram Panchayat), and providing explanations to village members on the purpose of the field visit.

The two assessors oriented their direction within the village using physical features (e.g. rivers, water bodies) and major roads as landmarks to this positioning against a corresponding digitized paper-based map. Assistance was provided by a SOMAARTH field worker with input from local community members.

Using a defined check sheet of selected land parcels, the assessors systematically:

  1. (1) Undertook visual inspections of the sampled features and verified the presence or absence of the polygon feature on the digitized map.

  2. (2) Recorded whether the feature on the land parcel had been attributed (yes/no) and confirmed the location of this parcel along the road (e.g. fifth land parcel along road in an eastern direction).

  3. (3) Engaged with the village field worker, and if required assistance from a village member to reach consensus on the feature content.

Using the paper-based map as a field guide, position accuracy of the digitized land parcel was assessed for its relative size and shape using a step-out distance measure approach. The assessors used information associated with the land parcel (i.e. attribute table) to inform whether an environmental feature change had occurred since the participatory mapping exercise was undertaken. Photographs were taken of the features associated with the sampled land parcels and road surface confirming their presence and quality.

A post-field discussion was undertaken between the two assessors and field check sheets were entered into an Excel® (Microsoft, Washington, USA) spreadsheet and imported into an Access® (Microsoft) database for descriptive analyses, including proportion of missing attributes by village, and overall.

Results

Assessment of geospatial data accuracy

Table 2 highlights the descriptive results for position and attribute accuracy for each of the six sampled villages and summary by the sample frame (small linear/circular, large linear, large circular) and overall. Residential land parcels represented 31.9% of assessment sample, compared with 57.3% of land parcels represented in the study region, reflecting the under-sampling of residential locations to focus on adequate samples from a range of landuse types (e.g. commercial, industrial and agricultural).

Table 2. Position and attribute accuracy

Overall, 6.4% of land parcels were documented as having an error in the location, size or shape. The accuracy was least within a small village of Durgapur (21.2%), and there were no differences by sample frame. With respect to attribute accuracy, there were only 1.5% of land parcels that had an error in the building material type (metallic ‘pukka’, non-metallic ‘kutcha’ or mixed), and 2.9% relating to the building classification (as either residential/non-residential or mixed). During this exercise comparing existing geospatial data and participatory field observation with workers and village members, a 4.6% error in land parcel classifications was observed. The highest level of error was recorded within the sample frame ‘Large Circular’ (6.4%). Level II Landuse codes were misclassified for 4.2% of all land parcels. More prominent, however, was the misclassification observed for 6.4% (n = 29) of Level II Landuse codes.

The accuracy of the road surface type characterization ranged from complete accuracy (Durgapur and Garhi Vinoda) to the highest level of error observed within the village of Gahlab, with 10 incorrect classifications (11.8% of sampled parcels). The mean level of error for road surface type characterization was 8.2 (SD 6.85). Within the sample of parcels selected across the six villages, there were 10.8% (n = 49) that were incomplete.

In consultation with local community members, the researchers were able to determine any changes that had occurred within the villages since the time of the initial field survey (originally conducted 2011) and the Census taking place during 2013. Through this participatory approach, it was found that 17.4% of the land parcels had experienced a change in this 2-year period. Such results support the need to document the nature of land parcel changes due to rapid social and economic change within the region.

Discussion

This paper outlined the processes of evaluation undertaken on a sample of geospatial data elements within a comprehensive public health surveillance platform representing rural Indian villages. The knowledge gained from this evaluation led to a process of system refinement for ongoing monitoring and surveillance of environmental risk conditions, as well as, the development of a verification tool to further integrate data elements into the geospatial-health platform. The research further suggests directions for improving geospatial-health data systems within low- and middle-income countries, as outlined below.

The results from this field-based observation of land parcel locations, according to size and shape, indicated the SOMAARTH DDESS had a high level of position accuracy. The study did not seek to assess the position accuracy of vector lines according to the gold standard satellite imagery. The ability to assess the precision of land parcels for the rural Indian village context would be highly unlikely. Furthermore, it was believed that any inaccuracy due to precision would not have an influence on the quality of the environmental indicators to be derived from the geospatial-health surveillance platform. The SOMAARTH DDESS architecture has been designed to capture, store and harmonize comprehensive datasets pertaining to the built environment. Furthermore, the platform has been intended for undertaking temporal spatial epidemiological analyses which require indicators expressed as counts or aggregated indices to assess variations in health and behavioural risk factors according to social, built and physical environmental features, such as weather and air quality, education, water and sanitation, and health care services.

The field-based observations provided an understanding into the attribute accuracy of the assigned landuse classifications and road surface types. The assessment of geospatial data quality has also identified the need for considering a more nuanced system for identifying road types and quality. Notably, there were observed misclassification of road types as either a lane or driveway (i.e. private ownership, presence of gate and/or door). Other examples included the consideration of road width (i.e. public alleyway or lane) and quality as these aspects impede accessibility within the village, particularly by car or motorbike. The appropriateness of landuse code assignment was assessed (i.e. logical consistency) and there was an identified need to consider the socio-spatial and cultural contexts in environmental data capture.

The contemporaneous capture of environmental features within the surveillance site is a crucial aspect to the overall geospatial data element accuracy, and thus, the utility for assessing social and built environmental features against health outcomes. Given the rapid social and economic changes being experienced within the surveillance site, the initial environmental data collection witnessed changes in land parcels prior to the demographic and health data collection. The verification tool ensured that data elements were contemporaneous for both environmental indicators and health outcomes. The need for surveillance systems and verification tools for spatial data accuracy are evident through undertaking an evaluation of SOMAARTH DDESS. Dixit et al. have demonstrated for this study context preliminary community-level findings on built and physical environmental exposures being associated with individual household socio-economic status [Reference Dixit, Arora, Rahman, Howard, Singh, Vaswani, Das, Ahmed, Mathur, Tandon, Dasgupta, Chaturvedi, Jethwaney, Dalpath, Prashad, Kumar, Gupta, Dube and Daniel20]. The community and household-level exposures will be able to explain and quantify social determinants of health, as well as exploring associations with diverse individual health outcomes such as diabetes, cardiovascular and infectious disease [Reference Daniel, Moore and Kestens25]. Blakely and Woodward have outlined the importance of considering mismeasurement as a source of error affecting estimates of environmental exposures in relation to health outcomes [Reference Blakely and Woodward26]. Furthering discussion from Zhang et al. [Reference Zhang, Manjourides, Cohen, Hu and Jiang16], the assessment of attribute accuracy (e.g. the ‘what’ of land parcel use and classification of road type) are additional threats to the validity of spatial epidemiological analyses, such as the planned analyses from longitudinal surveillance activities implemented in the Palwal District.

The evaluation exercise allowed for a process of reflection on cultural interpretations on these environmental constructs. As part of the baseline data collection, the participatory research approach included ‘ground truthing’ and interaction between village members, field workers and GIS technicians. Field observations highlighted the need to consider socio-spatial and cultural contexts (e.g. religious, cultural community infrastructure) within the coding framework for data elements. Subsequently, the landuse classification system was reviewed as part of a refinement exercise to be executed within all villages captured within the platform. The refinement resulted in the development of a verification tool that reflected the complexity of land parcel use and incorporation of a multi-level classification system.

A form indicating the options that a field worker may encounter during this exercise was detailed, including assessment of size, shape and location error. The informed procedure saw the type of change recorded according to the following classifications: (1) New dwelling/feature, (2) Under construction (on vacant land), (3) Demolished dwelling/feature, (4) Parcel/dwelling split and (5) Parcel dwelling merged. The time period in which this change had occurred was recorded as: (1) 1–3 months, (2) 3–6 months, (3) 6–12 months, (4) 1 or more years and (5) Not available/Don't know. The field worker provided open-ended comments and photographs to assist the GIS technician in update of the base village map.

A detailed rule book was developed to indicate examples relating to feature content (i.e. definition of cattle shade includes the need for the presence of shelter on the land parcel for the cattle). Training of field assessors and pilot testing was undertaken in June 2013, and the tool was refined accordingly before employment across all villages. The geospatial attribute verification tool (see Appendix file 1) was implemented across all 51 SOMAARTH DDESS villages between July and November 2013.

It is well-established that any population health research with priority populations must be driven by participatory approaches [Reference Cargo and Mercer27]. This resonates with prevailing perspectives in social geography which are also strongly influenced by participatory underpinnings [Reference Pain28]. GIS decision-making tools that incorporate local people's spatial knowledge have mainly been employed for development activities, local planning, resource management and community advocacy [Reference Laituri29]. Such approaches to geospatial data collection do not privilege any one type of information but grant validity to all [Reference Dunn30]; an approach that allows for both insider and outsider perspectives on spatial relationships within local communities (e.g. cultural notions of place). A mixed-method approach is also reflective of the reciprocal nature of the interaction of people within their local communities.

A strength of the study was its use of critical reflexivity, allowing for the processes of research and the information collected to be socially constructed [Reference Hay31]. The researcher employed insider and outsider perspectives to reflect on geospatial data accuracy within the spatial-health data surveillance system under development. The lead author (NH) was an ‘outsider’ both to the Indian culture and language, and villages that were being assessed. The second assessor (HRN) was involved for a period of 6 months in the technical application of the base maps, visiting around three villages, living within the greater region, and speaking Hindi. This approach to the research is intended to provide an enriched understanding of the social and cultural construction of the data system and aid in the processes of refinement [Reference Laituri29].

An evaluation of the procedures to capture data indicated a high level of multi-disciplinary approaches integrated into the research, as reflected by the innovative nature to explore environmental risk conditions for cardiometabolic disease and its risk factors. There is however a crucial need to move from multi-disciplinary to trans-disciplinary perspectives in the assessment of transitions in health and lifestyle-related diseases. Such approaches include the integration of village members, field workers and desktop applications (i.e. geospatial technicians) to ensure digitization of environmental features was reflective of its position and content (i.e. attribute accuracy).

Conclusions

Capturing change is complex, added to this complexity are both the people and the places in which they live, it occurs at different speeds and across different spatial geographies. The need to explore both levels of the system has been recognized and is being captured within a novel spatial-health data surveillance system within rural Indian villages. Regardless of the spatial nature of data, accuracy is a crucial aspect to population health information. The dynamic nature of a surveillance system needs to systematically deal with the inevitable change to environmental conditions, and the reciprocal influence on the people that reside within these local regions. Furthermore, the considerations of the contemporaneous nature of data for all levels of the system are vital to the future exploration of place and health relationships. This study has informed the refinement of data elements for all SOMAARTH DDESS Palwal study villages. The evaluation exercise contributes to our understandings on construction of public health surveillance systems within low- and middle-income countries. Furthermore, findings provide insights into considerations for assessing social, built and cultural environmental risk conditions in relation to health outcomes, such as lifestyle-related disease, which is burdening these local contexts as the rapid social, economic and landscape changes are experienced.

Author ORCIDs

Natasha J. Howard 0000-0002-8099-3107.

Acknowledgements

Dr Natasha Howard was funded through a 2012–2013 Australian Academy of Science Fellowship, Australia–India Strategic Research Fund, 3-month Fellowship at the International Clinical Epidemiology Network (INCLEN) Trust. The SOMAARTH DDESS is partially funded by a Canadian Institutes of Health Research Collaborative Team Grant ‘Foundation for a brain-to-society diagnostic for prevention of childhood obesity and its chronic disease consequences (#85512)’. Professor NK Arora, M Daniel, A Rahman and C Paquet are Investigators of this research. We acknowledge the involvement of the local community, field workers and staff within the INCLEN Trust Executive Office, GIS Team and SOMAARTH DDESS, Palwal District.

Authorsʼ contributions

NH conceived the study and shaped its design together with the background, discussion and conclusions. NH, MD, NKA participated in the design of the study. NH and HRN conducted the field-based data collection for the evaluation of the SOMAARTH Demographic and Development Surveillance Site (DDESS) spatial data system. NH and SD developed the verification tool and refinement exercise. NKA and AR are responsible for the development and maintenance of the SOMAARTH DDESS spatial data system. MD and CP provided critical insight into manuscript preparation. All authors read and approved the final manuscript.

Conflict of interest

None.

Appendix

Footnotes

The notes appear after the main text.

1 ‘SOMAARTH’ is derived from Sanskrit: ‘Som’ meaning the highest form of physical, mental and spiritual health, and ‘Arth’ meaning money, wealth and resources. SOMAARTH envisions synergy between economic development, environment changes, social changes and health of the individual, family and community.

References

1.Daniel, M, Lekkas, P, Cargo, M, Stankov, I and Brown, A (2011) Environmental risk conditions and pathways to cardiometabolic diseases in indigenous populations. Annual Review of Public Health 32, 327347.Google Scholar
2.Chaix, B (2009) Geographic life environments and coronary heart disease: a literature review, theoretical contributions, methodological updates, and a research agenda. Annual Review of Public Health 30, 81105.Google Scholar
3.Diez Roux, AV and Mair, C (2010) Neighborhoods and health. Annals of the New York Academy of Sciences 1186, 125145.Google Scholar
4.Daar, AS, Singer, PA, Leah Persad, D, Pramming, SK, Matthews, DR, Beaglehole, R, Bernstein, A, Borysiewicz, LK, Colagiuri, S, Ganguly, N, Glass, RI, Finegood, DT, Koplan, J, Nabel, EG, Sarna, G, Sarrafzadegan, N, Smith, R, Yach, D and Bell, J (2007) Grand challenges in chronic non-communicable diseases. Nature 450, 494496.Google Scholar
5.Mohindra, KS, Mukherjee, S, Khan, S and Thresia, CU (2012) Towards the next generation of public health research in India: a call for a health equity lens. Journal of Epidemiology and Community Health 66, 839842.Google Scholar
6.Daniel, M, Kestens, Y and Paquet, C (2009) Demographic and urban form correlates of healthful and unhealthful food availability in Montréal, Canada. Canadian Journal of Public Health/Revue Canadienne de Santé Publique 100, 189193.Google Scholar
7.Paquet, C, Coffee, NT, Haren, MT, Howard, NJ, Adams, RJ, Taylor, AW and Daniel, M (2014) Food environment, walkability, and public open spaces are associated with incident development of cardio-metabolic risk factors in a biomedical cohort. Health & Place 28, 173176.Google Scholar
8.Chaix, B, Kestens, Y, Bean, K, Leal, C, Karusisi, N, Meghiref, K, Burban, J, Fon Sing, M, Perchoux, C, Thomas, F, Merlo, J and Pannier, B (2012) Cohort profile: residential and non-residential environments, individual activity spaces and cardiovascular risk factors and diseases – The RECORD cohort study†. International Journal of Epidemiology 41, 12831292.Google Scholar
9.Campostrini, S (2013) Surveillance for NCDs and health promotion: an issue of theory and method. In McQueen, DV (ed.), Global Handbook on Noncommunicable Diseases and Health Promotion. Tucker, GA, USA: Springer, pp. 5172.Google Scholar
10.Sankoh, O and Byass, P (2012) The INDEPTH network: filling vital gaps in global epidemiology. International Journal of Epidemiology 41, 579588.Google Scholar
11.Campostrini, S, McQueen, D, Taylor, A and Daly, A (2011) World alliance for risk factor surveillance white paper on surveillance and health promotion. AIMS Public Health 2, 1026.Google Scholar
12.Matthews, SA, Moudon, AV and Daniel, M (2009) Work group II: using geographic information systems for enhancing research relevant to policy on diet, physical activity, and weight. American Journal of Preventive Medicine 36(suppl. 4), S171S176.Google Scholar
13.Paquet, C, Daniel, M, Kestens, Y, Leger, K and Gauvin, L (2008) Field validation of listings of food stores and commercial physical activity establishments from secondary data. International Journal of Behavioral Nutrition and Physical Activity 5, 58.Google Scholar
14.Sharkey, JR and Horel, S (2008) Neighborhood socioeconomic deprivation and minority composition are associated with better potential spatial access to the ground-truthed food environment in a large rural area. The Journal of Nutrition 138, 620627.Google Scholar
15.Hooper, PL, Middleton, N, Knuiman, M and Giles-Corti, B (2012) Measurement error in studies of the built environment: validating commercial data as objective measures of neighborhood destinations. Journal of Physical Activity & Health 10, 792804.Google Scholar
16.Zhang, Z, Manjourides, J, Cohen, T, Hu, Y and Jiang, Q (2016) Spatial measurement errors in the field of spatial epidemiology. International Journal of Health Geographics 15, 21.Google Scholar
17.Rundle, AG, Bader, MDM, Richards, CA, Neckerman, KM and Teitler, JO (2011) Using google street view to audit neighborhood environments. American Journal of Preventive Medicine 40, 94100.Google Scholar
18.Clarke, P, Ailshire, J, Melendez, R, Bader, M and Morenoff, J (2010) Using google earth to conduct a neighborhood audit: reliability of a virtual audit instrument. Health & Place 16, 12241229.Google Scholar
19.McDonell, J and Waters, T (2011) Construction and validation of an observational scale of neighborhood characteristics. Social Indicators Research 104, 439457.Google Scholar
20.Dixit, S, Arora, NK, Rahman, A, Howard, NJ, Singh, RK, Vaswani, M, Das, MK, Ahmed, F, Mathur, P, Tandon, N, Dasgupta, R, Chaturvedi, S, Jethwaney, J, Dalpath, S, Prashad, R, Kumar, R, Gupta, R, Dube, L and Daniel, M (2018) Establishing a demographic, development and environmental geospatial surveillance platform in India: planning and implementation. JMIR Public Health Surveillance 4, e66.Google Scholar
21.Government of India, Indian National Census 2011: Table 1 District-wise Population of Haryana. 2011, Ministry of Home Affairs, Office of the Registrar General & Census Commissioner, India.Google Scholar
22.Erickson, RM and Baker, M (2010) Validating your geospatial data: Protecting your investment and yourself. White Paper ESRI.Google Scholar
23.Goodchild, MF (1995) Elements of Spatial Data Accuracy. Edited by Guptill, SC, Morrison, JL. Oxford, United Kingdom: Published on behalf of the International Cartographic Association by Elsevier Science.Google Scholar
24.Sarkar, A (2010) Analysis of human settlement patterns using RS and GIS in the plains of West Bengal. e-Traverse The On-Line Indian Journal of Spatial Science 1, 116.Google Scholar
25.Daniel, M, Moore, S and Kestens, Y (2008) Framing the biosocial pathways underlying associations between place and cardiometabolic disease. Health & Place 14, 117132.Google Scholar
26.Blakely, TA and Woodward, AJ (2000) Ecological effects in multi-level studies. Journal of Epidemiology and Community Health 54, 367374.Google Scholar
27.Cargo, M and Mercer, SL (2008) The value and challenges of participatory research: strengthening its practice. Annual Review of Public Health 29, 325350.Google Scholar
28.Pain, R (2004) Social geography: participatory research. Progress in Human Geography 28, 652663.Google Scholar
29.Laituri, M (2011) Indigenous People's issues and indigenous uses of GIS. In Nyerges TL, Couclelis H and McMaster R (eds), The SAGE Handbook of GIS and Society. London: SAGE Publications Ltd, pp. 202222. http://dx.doi.org/10.4135/9781446201046Google Scholar
30.Dunn, CE (2007) Participatory GIS — a people's GIS? Progress in Human Geography 31, 616637.Google Scholar
31.Hay, I (2005) Qualitative Research Methods in Human Geography. Oxford University Press.Google Scholar
Figure 0

Fig. 1. SOMAARTH Demographic and Development Environmental Surveillance Site, study region, Palwal District, Haryana State, India.

Figure 1

Fig. 2. Sample frame circular and linear settlement patterns.

Figure 2

Table 1. Study area and sample characteristics

Figure 3

Table 2. Position and attribute accuracy