Introduction
Non-pharmaceutical interventions (NPIs) play a critical role in the COVID-19 response, and are likely to remain important interventions for infection control despite high effectiveness of vaccines [Reference Hall1]. Governments and public health authorities require evidence on factors associated with transmission of SARS-CoV-2 to inform control measures, including restrictions on activities. Transmission is a continuous risk primarily determined by contact patterns, behavioural, environmental and socio-economic factors [Reference Cevik2, 3]. Some settings are likely to facilitate or amplify the risk of transmission due to a combination of behavioural and environmental factors [Reference Meyerowitz4]. Close-proximity contact, contact over a prolonged period of time and multiple contacts in a confined, poorly ventilated space pose the greatest risk for SARS-CoV-2 transmission [Reference Cevik2, 5]. The opportunities for being exposed in certain settings and through certain activities have changed over time with the implementation and lifting of NPIs.
The risk of transmission of SARS-CoV-2 is highest in household settings, particularly in multioccupancy households mainly due to the high number of close contacts [Reference Lee6]. Certain occupations such as working in healthcare [Reference Nguyen7–Reference Leclerc9] are known to be at high risk of exposure to SARS-CoV-2. Hospitality venues including restaurants, nightclubs and bars have been reported as common exposures in large outbreaks and clusters of COVID-19 [8, Reference Furuse10–Reference Fisher12]. Outbreaks have also been reported in some occupational settings including factories and warehouses [8, Reference Murti13] and in educational settings [Reference Ismail14, 15]. However, information on the role of community settings in facilitating transmission is still limited [5, Reference Lee6, 8, 16].
We conducted three a case-control study in England with three sampling periods between August 2020 and October 2020 with the aim of developing improved understanding of settings and activities potentially associated with transmission of COVID-19 in England. The studies focused on non-household activities.
Methods
Study design and setting
We conducted a retrospective case-control study, frequency-matched for age group (18–27, 28–37, 38–47, 48–57, ≥58 years) and geographical region (East Midlands, East of England, London, South East, South West, West Midlands, Yorkshire and Humber) to ensure a representative sample of controls and to account for regional variations in restrictions to activities. The study had three sampling periods: August, September and October 2020. The study population consisted of adults (aged 18 years or over) resident in England. Cases were individuals who had tested positive for SARS-CoV-2 and completed the NHS Test and Trace (NHS T&T) contact tracing questionnaire, and who were not reported to NHS T&T as household contacts of a confirmed case. Cases were randomly sampled based on these criteria from the NHS T&T database. Controls were members of the public, registered as volunteers for a market research panel. Controls were excluded if they had tested positive for SARS-CoV-2 or reported COVID-19 symptoms in the 7 days prior to completing the survey. A power calculation was performed for the initial sampling period and used for further samples without further refinement. With a false-positive error probability of 0.05, a total sample size of 4000 subjects, a 1:1 ratio of controls to cases and a minimum important odds ratio of 2, each sampling period had a power of ≥0.9 for exposure proportions of ≥0.02 in the control group.
We introduced pragmatic plausibility thresholds for numbers of different activities reported, and excluded individuals who reported high numbers of different types of workplace exposures that appeared unrealistic, likely representing reporting errors (>1 response: emergency services, military, immigration or border force services; >2 responses: social and home care, retail, hospitality, construction and manufacturing, warehouses, food production or agriculture, transport, work-related travel; >3 responses: education, arts and education, healthcare, close contact services).
A different sample of controls was obtained for each study period. The sampling periods were late August 2020, late September 2020 and late October 2020, for sampling periods 1, 2 and 3, respectively. Figure 1 outlines the recruitment process and study flow.
Data collection
Case data were collected as part of routine contact tracing, where cases provide the information either through a digital route (self-completed) or through telephone interview with a contact tracer. The contact tracing programme completion rate for cases during the three study periods was 82.7%, 76.8% and 85.0%, respectively [17].
Information on individuals' activities (exposures) was collected in a structured manner with any activity categorised by three nested levels providing increasing level of detail on activities. The highest level (Level 1) categorises the exposures to work and education activity, or leisure activities. The further information includes sectors, for example, educational, or hospitality setting. The most granular level (Level 3) of categorisation describes the workplace or the setting in more detail, for example, a pub or bar, or a secondary school. An example of this three-level structure for an exposure event associated with working in a warehouse is: Level 1 (least granular): ‘work or education’; Level 2: ‘warehouse or distribution’; Level 3 (most granular): ‘warehouse’.
Controls completed an online survey with the same exposure questions structured identically to the case questionnaire. We used the postcode of residence of the cases and controls to adjust for socio-economic deprivation using the Index for Multiple Deprivation (IMD) [18].
Statistical analysis
Cases and controls were described according to their demographic characteristics (age, sex, ethnicity, region of residence and IMD). Single variable (logistic regression) analyses were undertaken to estimate unadjusted odds ratios (uORs) as crude measures of association between exposures (non-household activities) and outcome (confirmed COVID-19). Confidence intervals (CI) and P values (Wald) were calculated.
Sampling periods had a number of settings in which the number of individuals reporting ‘exposure’ was small, leading to the phenomenon of separation. We used the Firth correction [Reference Heinze and Schemper19] to obtain finite parameter estimates through penalised maximum likelihood estimation, and to reduce the biases associated with separation. All work and leisure activities on a sector level were included in the regression model, unless there was evidence of multicollinearity. Demographic variables were considered as potential confounders and were only excluded if there was evidence of multicollinearity. Multicollinearity was assessed through exploring correlation between variables in the final model using Pearson's correlation statistic. A correlation statistic >0.80 was considered as evidence of multicollinearity.
Initially a random-effects meta-analysis was used to combine the estimated odds ratios from the multivariable analyses using sector-level exposures. Further meta-analyses explored the associations of the granular level workplace activities. The sector level exposures were broken down to their most granular level (Level 3) if the sectors showed evidence of a risk association (aOR >1.5, 95% CI not including 1) in all three study periods, or a trend of a substantial increase in the observed odds over the three periods. Other exposures were included as sector-level exposures.
Results
Descriptive analysis
Across the three study periods, a total of 12 338 individuals were included, of which 6000 were cases. Overall, 3214 (54%) of cases self-completed contact tracing using the digital route, with limited variation between study periods (period 1: 1129 (56%); period 2: 964 (48%); period 3: 1121 (56%)). Females were overrepresented: 6926 (56%) individuals, of which 3268 (47%) were cases. Across the study periods, cases reported fewer activities (median 2, range 1–20) than controls (median 4, range 1–68). There was a greater proportion of individuals in the control group that were of white ethnicity (83%) compared to the cases (65%), although ethnicity was not recorded in 9% of case respondents. A greater proportion of cases lived in areas of low deprivation (17%) than controls (12%), although deprivation score was unknown for a large proportion of control respondents (11%). Case and control distribution was largely similar for all other demographic variables (Table 1).
Single variable analysis
There was statistical evidence that working in social or domiciliary care, health care or hospitality was associated with being a COVID-19 case in all three study periods (Table 2). Additionally, study periods 2 and 3 showed evidence that working in a warehouse, emergency services and close contact services were each significantly associated with being a case (Table 2). Furthermore, in these two periods, working or attending education was associated with increased odds of illness (Table 2). The most granular work settings within each sector are included in the footnotes of Tables 2 and 3.
a Working in social care – including care homes, domiciliary care, in-home carers and health visitors.
b Working in healthcare – including hospitals, GPs, drop-in clinics, GPs, community hospitals, ambulance services and other healthcare professions.
c Working in hospitality – including restaurants, food and drink outlets and lodging.
d Working in close contact services – including barbers, hairdressers, beauty and nail salons, make-up studios, tattoo studios, spas and wellness business, tanning salons and any other services which require close contact.
e Working in manufacturing or construction – including construction labour or office-based roles, other construction professions and manufacturing jobs in textiles, electronics, cars, furniture, pharmaceuticals, chemical plants, printing, engineering.
f Working in warehouse settings – including working in warehouse, haulage, wholesale or food distribution.
g Working in food production – including farming and agriculture, any food manufacturing and food production.
h Working in retail – working in retail stores, supermarkets, newsagents, food stores and other retail related professions.
i Working in transport – including working in underground or tram, trains, buses and logistics and storage.
j Working in emergency services – including fire services, police and other emergency services.
k Working in arts, entertainment or recreation – including music, theatre, gyms, cinema, leisure centres.
l Work-related travel – including attending conferences, door-to-door sales, home care visits and visiting clients or sites.
m Education – working only: applicable to childminder, nursery, primary school, secondary school and sixth form and working or attending applicable to university, higher education and special needs educational settings.
n Working in immigration or border force services – including office-based and people facing professions.
o Working in the military – including the Navy, the Army and the Air Force.
a Working in warehouse settings – including working in warehouse, haulage, wholesale or food distribution.
b Working in social care – including care homes, domiciliary care, in-home carers and health visitors.
c Working in hospitality – including restaurants, food and drink outlets and lodging.
d Working in healthcare – including hospitals, GPs, drop-in clinics, GPs, community hospitals, ambulance services and other healthcare professions.
e Working in manufacturing or construction – including construction labour or office-based roles, other construction professions, and manufacturing jobs in textiles, electronics, cars, furniture, pharmaceuticals, chemical plants, printing, engineering.
f Education – working only: applicable to childminder, nursery, primary school, secondary school and sixth form and working or attending applicable to university, higher education and special needs educational settings.
g Working in transport – including working in underground or tram, trains, buses and logistics and storage.
h Working in emergency services – including fire services, police and other emergency services.
i Working in close contact services – including barbers, hairdressers, beauty and nail salons, make-up studios, tattoo studios, spas and wellness business, tanning salons and any other services which require close contact.
j Working in retail – working in retail stores, supermarkets, newsagents, food stores and other retail-related professions.
k Work-related travel – including attending conferences, door-to-door sales, home care visits and visiting clients or sites.
l Working in arts, entertainment or recreation – including music, theatre, gyms, cinema, leisure centres.
m Working in immigration or border force services – including office-based and people facing professions.
n Working in the military – including the Navy, the Army and the Air Force.
o Working in food production – including farming and agriculture, any food manufacturing and food production.
*Adjusted for age, sex, region, ethnicity, index of multiple deprivation and leisure activities.
We did not observe any substantial differences in uOR for point estimates obtained for level 2 workplace exposure groups using only cases completing contact tracing by the two different routes; across all three study periods: median difference = 0.29 (interquartile range 0.12–0.67) (full results not shown). Although some changes in statistical significance were seen when comparing these uOR, there was no change in the direction of the association (i.e. odds ratios both <1 or >1).
Multivariable analysis for occupational exposures
All multivariable analyses were adjusted for age, sex, ethnicity, socio-economic deprivation (IMD), geographical region and leisure activities. All three study periods showed an association between working in healthcare, social care or hospitality, and becoming a COVID-19 case (Table 3). Working in warehouse settings, construction and manufacturing and working or attending educational settings were all associated with increased odds of infection in the second and third study periods, with the strength of the observed association increasing substantially over the three study periods (Table 3).
Meta-analysis of sector-level work-related exposures
The estimated odds ratios for work-related exposures used in the meta-analyses were adjusted for age, sex, ethnicity, IMD, geographical region and leisure activities. There was evidence that working in healthcare (pooled odds ratio (pOR) 2.87; aORs 2.81, 2.72, 3.08, for studies 1–3 respectively), social care (pOR 4.14; aORs 5.41, 5.06, 2.46, for studies 1–3 respectively) or hospitality (pOR 2.36; aORs 2.53, 2.63, 2.01, for study periods 1–3 respectively) were associated with increased odds of being a COVID-19 case. There was also evidence that working in a warehouse setting was associated with increased odds (pOR 3.86; aORs 1.06, 3.93, 14.19, for study periods 1–3 respectively). The aORs for the association between warehouses and the odds of infection showed a substantial increase over the three study periods, also indicated by the evidence of heterogeneity between the observed aORs (I 2 = 78%, P < 0.01). A similar pattern was also observed in construction (pOR 2.03; aORs 1.37, 2.09, 2.67, for studies 1–3 respectively), and in educational settings (pOR 1.00; aORs 0.31, 1.53, 2.03, for studies 1–3 respectively) (Fig. 2).
Meta-analysis of granular level workplace-related exposures
We explored the association between work-related exposures and infection using more granular level information on work activities (Fig. 3). There was evidence across all study periods that working in bars, restaurants and pubs was associated with elevated odds of being a case (pOR 2.87; aORs 3.52, 2.92, 2.41, for studies 1–3 respectively). Cases also had higher odds of working in a warehouse than controls (pOR 5.58; aOR range 1.72, 4.89, 24.06, for study periods 1–3 respectively). Working or attending secondary school settings was also associated with increased odds of being a COVID-19 case (pOR 2.58; aORs 1.52, 3.02, 2.98, for study periods 1–3 respectively). Working in primary school settings (pOR 1.43; aORs 0.43, 2.23, 2.58, for study periods 1–3 respectively) was associated with increased odds of being a COVID-19 case, especially in the second and third study periods. There was also evidence that working in hospital was linked with increased odds of COVID-19 (pOR 3.19; aORs 4.06, 2.29, 3.53, for study periods 1–3 respectively). Working in general practice surgeries (GPs) was associated with elevated odds of being a COVID-19 case (pOR 1.47; aORs 0.51, 2.17, 2.71, for study periods 1–3 respectively), with an increase in observed aORs over the three studies, along with evidence of heterogeneity (I 2 = 69.4%, P = 0.04) (Fig. 3).
Discussion
We observed important associations between certain occupations and SARS-CoV-2 infection. The associations between being a COVID-19 case and working in health and social care, hospitality, education or warehouse settings persisted over the three study periods. The findings highlight that infection control measures may have only a partial effect on reducing transmission in some workplace settings.
The findings on hospitality, primarily working in bars, pubs or restaurants, are supported by the accumulating evidence on the risk of COVID-19 associated with these settings from other countries [8, Reference Furuse10, Reference Lu11, Reference Chau20, Reference Kang21]. Other studies have observed associations between exposure to hospitality venues and risk of infection that are similar to those observed in our study [Reference Fisher12, Reference Galmiche22]. These results are likely to reflect the ongoing risk in hospitality settings, especially among staff who work long hours and have multiple daily contacts, and may face multiple situations where adhering to social distancing is challenging [23].
The associations between certain settings and the odds of infection can only occur when there is an opportunity for the activities to take place. These opportunities are largely affected by NPIs, which in England have varied by geographical region and over time. It is therefore not surprising that, for example, the association between educational settings and increased odds of becoming a COVID-19 case was not observed in the first study period when most educational settings were closed. Educational settings have been associated with outbreaks [Reference Ismail14, 15], and there is evidence of potential transmission events taking place in these settings [Reference Larosa24]. However, the importance of educational settings in the context of community transmission of SARS-CoV-2 is still largely unclear [15].
Our findings on the associations between infection and working in manufacturing, construction and warehouse settings are consistent with the most commonly reported outbreak settings in European countries [8], Canada [Reference Murti13] and by other analytical studies [Reference Galmiche22]. It is plausible that sectors like construction may have different summer and winter working patterns with most of the work occurring indoors during the winter months, which may increase the opportunities for transmission in these settings. However, our study cannot account for this or other factors that may increase the opportunities for SARS-CoV-2 transmission outside workplaces. In general, use of shared facilities such as break rooms and kitchens, or high levels of car sharing or living in multioccupancy housing may create further opportunities for transmission, and while associated with workplace, they are not directly related to the occupation. Most of these factors, particularly those related to variation in individual behaviour, are still relatively poorly understood.
The findings regarding the association observed between working in healthcare (primarily associated with hospital exposure) and social care settings (primarily driven by exposure in care homes) and being a COVID-19 case were expected. These settings were closely associated with transmission in the first wave of the epidemic and, despite the appropriate use of personal protective equipment, healthcare workers experience high COVID-19 infection rates [Reference Nguyen7–Reference Leclerc9]. Our study identified that not only hospitals but also GP surgeries were associated with increased odds of infection.
Public health guidance and advice to reduce the risk of transmission in workplace settings have been published in England [25] and mitigation measures have been introduced. While compliance for these measures is generally high there is evidence that the effectiveness of these measures is only partial [16]. Furthermore, transmission of SARS-CoV-2 is driven by complex combinations of different environmental, social and behavioural factors that are challenging to fully quantify [Reference Cevik2]. Hence, it is likely that, within the same occupational sector, there will be examples of good and poor practice in the implementation of measures to mitigate transmission, including considerable variation in individual behaviour. The risk of transmission is therefore likely to vary from setting to setting within a sector and the impact of workers' behaviour, including social interaction of staff during breaks, or before and after work, on transmission is not well understood. However, while it is not possible to determine the exact locations where transmission occurred, the associations we have observed are likely to reflect that it is more challenging to mitigate risk in some sectors compared to others.
Our study has a number of limitations. Across all three study periods, cases reported fewer exposures than controls. Cases are often first identified as contacts of other cases and it is possible that people modify their behaviour as a result. It is also possible that multiple biases affect the responses provided. One plausible explanation for this is that the cases going through the NHS T&T questionnaire may be experiencing high levels of questionnaire fatigue due to the length of the questionnaire. While the structure of the control survey (for exposures) was designed to be as identical to the case questionnaire as was practical, cases were required, as part of routine contact tracing, to provide considerable additional non-exposure information (e.g. symptoms, details and settings for close contacts). It is also possible that cases may feel inclined to adhere to socially desirable reporting, and not report activities they perceive would not be socially acceptable in the context of the pandemic. This differential misclassification of exposure most likely biased the effect measures towards the null, hence the results described are more likely to be underestimates rather than overestimates. There is also a potential for selection bias. As of the end of the study period three, altogether 630 309 cases were reached by contact tracing, with 82% of cases transferred to NHS T&T being reached by contact tracing [17]. This varied between 76.8% and 85.0% during our three study periods [17]. There is likely to be some selection bias among cases if those who did not engage in contact tracing differed from the rest in terms of their exposures. Testing procedures and pathways have also varied by time, geographical region and by workplace settings, with some workplace settings having higher level of routine testing than others, likely leading to differential capture of asymptomatic cases. The controls were sampled from a pool of volunteers to a commercial market research panel, which most likely introduced some selection bias for controls [Reference Mook26]. Furthermore, people registered and who chose to participate in the present studies might differ from those registered but who did not engage. The present analyses could only control for confounding by the exposure and demographic variables available, and residual confounding is likely to persist. However, the study did adjust for important, known confounding factors, including social deprivation, age, sex, ethnicity and geographical region. The observations presented in our study may be further confounded by other factors that were not part of the study including living conditions, transport to work or socio-economic factors not explained by small area social deprivation.
This study took place in the context of COVID-19 pandemic response when there was an urgent need to provide timely evidence on the risks associated with community exposures. The findings from these sampling periods provided evidence which together with the wider pool of evidence was used to inform risk assessments, policies and targeted public health action in the operational context of the pandemic response. The study allowed rapid, concurrent data collection from both cases and controls, providing epidemiological insights with consistent estimates of associations reported over three study periods. To the best of our knowledge, this is the only case-control study on community risk factors for COVID-19 in England. As such, this study provides valuable insights into the risks associated with community settings and activities, and the findings are likely to be transferrable to other similar settings.
We conclude that several workplaces were associated with increased odds of being a COVID-19 case, primarily those in health and social care, hospitality, warehouse and educational settings. The results are aligned with the existing literature. While it was not possible to determine whether transmission occurs at workplaces or outside the workplace, for example, during commuting, our findings provide further evidence that certain sectors have been more affected, highlighting that the mitigation measures in these settings may only be partial. While the study has limitations, these are likely to result in underestimation, rather than overestimations of the associations. Therefore, our recommendation is that further mitigation measures are considered in these sectors to improve the control of COVID-19 in these settings.
Acknowledgments
We would like to thank the NHS Test and Trace team for their contribution, the members of UK's Scientific Advisory Group for Emergencies' sub-group Environmental Modelling Group (Transmission Group) for their comments and advice, and the members of the public who took part in this study.
Financial support
The studies were entirely funded by Public Health England (now UKHSA), and no external funding was received.
Conflict of interest
None.
Ethical standards
The study protocol was subject to an internal review by the Public Health England's (PHE's, UKHSA from 01 October 2021 onwards) Research Ethics and Governance Group, which is the PHE/UKHSA Research Ethics Committee, and was found to be fully compliant with all regulatory requirements, and was classified as surveillance that was being undertaken as part of PHE's responsibility to respond to the COVID-19 current pandemic. PHE has legal permission, provided by Regulation 3 of The Health Service (Control of Patient Information) Regulations 2002 to collect confidential patient information under Sections 3(i) (a) to (c), 3(i)(d) (i) and (ii) and 3(3) as part of its outbreak response activities. As such, this work fell outside the remit for ethical review and as no regulatory issues were identified the protocol was approved by the group.
Data availability statement
The data that support these studies were collected as part of a public health response, are considered sensitive and not made publicly available. Reasonable requests for access to anonymised data will be considered by the authors on request.