Article Text

Download PDFPDF

Protocol
Protocol for a longitudinal study examining the trajectory of COVID-19, post-COVID, multidimensional disadvantage and health-related quality of life in India: the IndiQol Project
  1. Kompal Sinha1,
  2. Nils Gutacker2,
  3. Yuanyuan Gu3,4,
  4. Juanita Haagsma5,
  5. Kaushalendra Kumar6,
  6. Mona Aghdaee3,4
  1. 1Department of Economics, Macquarie Business School, Macquarie University, North Ryde, New South Wales, Australia
  2. 2Centre for Health Economics, University of York, York, UK
  3. 3Macquarie University Centre for the Health Economy, Macquarie Business School, Macquarie University, Macquarie Park, New South Wales, Australia
  4. 4Australian Institute of Health Innovation, Sydney, South Wales, Australia
  5. 5Department of Public Health, Erasmus MC, Rotterdam, the Netherlands
  6. 6International Institute for Population Sciences, Mumbai, Maharashtra, India
  1. Correspondence to Dr Kompal Sinha; kompal.sinha{at}mq.edu.au

Abstract

Introduction The COVID-19 pandemic has raised concerns about the persistence of symptoms after infection, commonly referred to as ‘post-COVID’ or ‘long-COVID’. While countries in high-resource countries have highlighted the increased risk of disadvantaged communities, there is limited understanding of how COVID-19 and post-COVID conditions affect marginalised populations in low-income and middle-income countries. We study the longitudinal patterns of COVID-19, post-COVID symptoms and their impact on the health-related quality of life through the IndiQol Project.

Methods and analysis The IndiQol Project conducts household surveys across India to collect data on the incidence of COVID-19 and multidimensional well-being using a longitudinal design. We select a representative sample across six states surveyed over four waves. A two-stage sampling design was used to randomly select primary sampling units in rural and urban areas of each State. Using power analysis, we select an initial sample of 3000 household and survey all adult household members in each wave. The survey data will be analysed using limited dependent variable models and matching techniques to provide insights into the impact of COVID-19 pandemic and post-COVID on health and well-being of individuals in India.

Ethics and dissemination Ethics approval for the IndiQol Project was obtained from the Macquarie University Human Research Ethics Committee in Sydney, Australia and Institutional Review Board of Morsel in India. The project results will be published in peer-reviewed journals. Data collected from the IndiQol project will be deposited with the EuroQol group and will be available to use by eligible researchers on approval of request.

  • COVID-19
  • Health Equity
  • Health Literacy
  • HEALTH ECONOMICS
  • Post-Acute COVID-19 Syndrome
http://creativecommons.org/licenses/by-nc/4.0/

This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

STRENGTHS AND LIMITATIONS OF THIS STUDY

  • The project creates the first comprehensive longitudinal dataset on various aspects of well-being including health-related quality of life, socioeconomic conditions, COVID-19 history, post-COVID symptoms and beliefs about COVID-19 disease.

  • This population-based study surveys a representative sample of households from six states across India over four waves, thereby gathering evidence on the causal impact of COVID-19 and post-COVID on the health and well-being of individuals and their households.

  • The survey will be lengthy as it will collect household-level and individual-level self-reported data which can result in measurement error due to self-reporting bias.

  • The longitudinal nature of this survey would result in some attrition in later waves of the survey which is anticipated to affect the analysis and results of the study.

Introduction

Socioeconomic disadvantage is associated with poor life expectancy and overall health.1 The COVID-19 pandemic has disrupted lives and exacerbated health inequalities worldwide.2 3 In the regions with higher socioeconomic deprivation, COVID-19-related mortality rates are higher, and there are more cases reported than in less deprived regions.4–6 Despite clear evidence linking deprivation and mortality, there is limited understanding of the causal impact of socioeconomic deprivation on the health-related quality of life (HRQoL) in COVID-19 survivors. This is particularly important since the long-term health consequences of COVID-19 are likely to be influenced by individuals’ social circumstances, such as their occupation or their access to healthcare services. The COVID-19 pandemic has hit low-income and middle-income countries (LMICs) hard resulting in significant economic shocks that may further exacerbate the disadvantages associated with the pandemic.7 8 In India, where this study is situated, the pandemic has spread widely, with 98% of the country’s districts reported COVID-19 patients in 2020.9 India also has one of the highest rates of COVID-19 seroprevalence10 and rates of COVID-19 deaths globally. Recent reports highlight the socioeconomic impact of the pandemic, revealing that less educated COVID-19 patients experience greater disease severity and mortality in India.11

Among those who survive the acute COVID-19 infection, some individuals experience persistent symptoms that can significantly impact their HRQoL over an extended period.12 These persistent symptoms following a COVID-19 infection are termed as ‘long-COVID’ or ‘post-COVID’ (‘post-acute sequelae of COVID-19’). The WHO has defined post-COVID as prolonged symptoms occurring approximately 3 months after the onset of COVID-19 symptoms, lasting for at least 2 months and not attributable to an alternative diagnosis.13 Post-COVID symptoms can include a range of issues, including fatigue, shortness of breath and cognitive dysfunction, among others,14 which may also fluctuate or relapse over time. While post-COVID is estimated to occur in 10%–20% of COVID-19 cases, it affects people of all ages, including children, with most cases occurring in patients with mild acute illness.15 16 The incidence rates vary based on hospitalisation status and vaccination. For non-hospitalised cases, while the conservative estimate of post-COVID is 10%, the incidence is estimated to be 10%–30% for non-hospitalised cases; 50%–70% for hospitalised cases and 10%–12% for vaccinated cases.17 In India, it is estimated that 6%–10% of COVID-19 patients go on to develop post-COVID.16 Despite existing studies on post-COVID primarily focusing on patient populations in developed countries,5 18 19 there has been no specific study on post-COVID in India.20 India, as an LMIC country, has faced significant challenges due to the pandemic, with over 44.9 million COVID-19 patients reported as of October 2023 (according to the WHO website). Experts predict a potential intensification of COVID-19 cases in India.21 22

Socially disadvantaged communities have experienced higher COVID-19-related mortality and disease burden in less deprived areas2 3 in developed (UK, USA)4 and developing countries (India, China).6 In the UK, the Office of National Statistics estimated 2 million people experienced long-COVID in April 2024, with prevalence being highest among people aged 35–69 years, women and people living in deprived areas.23 In India, the economic shock, lockdown policies and mass migration amplified poverty and deprivation in the country leaving 121 million people out of work and 23 million people on streets. An issue of concern for public health is how the incidence and prevalence of disease symptoms among COVID-19 survivors living in deprivation. Persistent symptoms of COVID-19 among survivors can significantly diminish their quality of life for a long time. Moreover, with almost 40% of COVID-19 patients likely to experience post-COVID, the socioeconomic impact of the pandemic on multidimensional disadvantage and health inequalities will be substantial.

The condition of individuals facing hardships across multiple life domains, referred to as socioeconomic disadvantage, can influence health outcomes.24 Recent literature has considered disadvantage as a multidimensional concept,24–27 that is, multidimensional deprivation (MDD), based on the notion that deprived individuals should be identified using both income-based and non-income-based indicators. In India, a significant population of socioeconomically deprived people,28 29 with approximately 27.5% of the population reported to be multidimensionally deprived in 2015–2016.30 Specifically, 52% of households lacked proper sanitation, 12% lacked access to electricity, 10% did not have access to safe drinking water, 13% of individuals have less than 6 years of schooling and 37% had poor nutritional status during the same period.31 The importance of considering the chronicity and persistence deprivation across multiple domains has been emphasised in the recent literature resulting in measures of MDD that account for the duration and persistent deprivation in multiple domains. The measure proposed by Nicholas et al26 and Sinha et al24 accounts for the following factors: (a) deprivation across a wide variety of dimensions/domains of life in each period; (b) deprivation for the most periods in a specific dimension and (c) chronic deprivation characterised with uninterrupted periods of deprivation. The duration and persistence of deprivation across multiple domains can lead to a health-deprivation trap.32 To measure and understand the health inequalities ensuing from this disadvantage, it is important to untangle the contribution of each of the components of disadvantage, as well as their persistence and severity.26 Thus, it is important to understand how severe health conditions, as experienced during COVID-19 and post-COVID, impact the pattern of MDD and how it influences the HRQoL and well-being of individuals.

Aim of the research

The IndiQol Project aims to examine morbidity associated with COVID-19, that is, post-COVID symptoms, multidimensional deprivation and HRQoL in India. Conducting a large-scale longitudinal household survey in India, this project will select a representative sample of the general population in India across six states, surveyed over four waves. While existing literature has analysed COVID-19 patients only, this population-based study will interview all adult household members on their COVID-19 history, socioeconomic conditions, HRQoL and behaviour. Individuals currently affected by or with a history of COVID-19 will be asked questions on their long-COVID symptoms. The first wave of the survey will include a prepandemic recall questionnaire, asking respondents about their socioeconomic circumstances and HRQoL during the annual festival period of Diwali 2019. The novelty of the current analysis lies in investigating the long-term consequences of the COVID pandemic by exploring the association between COVID-19, post-COVID conditions, multidimensional deprivation, household demographics, socioeconomic status and HRQoL as measured by the EQ-5D-5L (using the recently developed value sets for India).33

Methods

Study population

India is a large country with six broad regions—North, North-East, South, West, East and Central India. To ensure representation of the regional diversity of the country, the study population was determined by grouping the 29 Indian states into six categories based on a set of economic and health indicators. One state from each category was chosen for our sample.1 Specifically, we have selected six states considering per capita net state domestic product,34 child mortality (under 5 years old), life expectancy at birth, schooling and the social subclass of scheduled caste and scheduled tribe (SC/ST) population. The sampled population is drawn from six states: Uttar Pradesh, Odisha, Tamil Nadu, Haryana, Rajasthan and Maharashtra (figure 1). These six states together have around half (48%) of the total population of India.35

Figure 1

The six states included in the sample design.

Table 1 shows the variation of our selection parameters of human development across the selected states and India including population growth, disability-adjusted life-years (DALYs), literacy rates and proportion of SC/ST, that is, socioeconomically disadvantaged groups. The six study states are at different stages of demographic and epidemiological transition. For instance, as per the Registrar General of India,36 the natural growth rate of the population varies from 18.7% (crude birth rate, CBR=25.1, crude death rate, CDR=6.1) (CBR: the number of births in a year per thousand population; CDR: the number of deaths in a year per thousand population.) in Uttar Pradesh to 7.7% (CBR=13.8, CDR=6.1) in Tamil Nadu. Additionally, the value for the ratio of DALYs due to communicable, maternal, neonatal and nutritional diseases over DALYs due to non-communicable disease and injuries varies across these states: 0.26 in Tamil Nadu, 0.33 in Maharashtra, 0.40 in Haryana, 0.66 in Rajasthan, 0.58 in Odisha and 0.68 in Uttar Pradesh.37

Table 1

Indicators of human development (2019–2020), by state

Household as the sampling unit

An important feature of this study is using the household as a sampling unit and interviewing all eligible (aged 18 years and older) members of the household. The motivation for considering a household rather than an eligible individual from the household lies in the contagious nature of the COVID-19 disease, post-COVID symptoms and the potential impact of changes in socioeconomic circumstances due to the pandemic. We categorise a household as a COVID-19 positive household (henceforth, COVID-positive household) if at least one member of the household reports having a history of COVID-19.

There are several advantages of the general population-based sampling approach. First, this approach allows us to assess the socioeconomic gradient in post-COVID conditions for both the overall population and the subgroup of COVID-positive households. Second, since we analyse the association between multidimensional deprivation and experiencing COVID-19 and post-COVID, having non-COVID households in the sample allows us to benchmark our analysis against the COVID households. Third, given the highly contagious nature and rapid transmissibility of this disease, there is a high likelihood of intrahousehold transmission of COVID-19. With households losing family members, jobs and income due to COVID-19, experiencing post-COVID by an individual and/or other members of households could have vicarious effects on the health and well-being of all members of the household. It is critical to understand the variation in experiencing post-COVID symptoms and HRQoL of the members within a COVID-19 household (ie, intrahousehold HRQoL). Our approach to studying HRQoL for each member allows us to explore the vicarious mental and physical health effects of post-COVID within the household. Fourth, the impact of the pandemic on household’s socioeconomic status would vary by the severity of disease incidence, that is, the number of household members experiencing post-COVID symptoms. For instance, the socioeconomic circumstances of a COVID-positive household where only one member lost their job due to COVID-19 and is now experiencing post-COVID would be different from that of a household where only one household member lost their job and multiple members of the household are experiencing post-COVID. Since the possibility of more than one family member of a COVID-positive household experiencing post-COVID is expected to be high, the impact on the HRQoL and well-being of the entire household will also be high. Finally, while most existing data on COVID-19 pandemic has collected data for COVID-19 survivors only, this population-based study will collect data on socioeconomic circumstances, COVID-19 history, post-COVID and HRQoL for the general population in India.

Sample size

The sample size is determined based on the level of confidence and statistical power analysis following Cohen’s38 guidelines, along with the effect sizes reported in recent literature.29 39–42 Our sample size accounts for the effect size (ie, the difference between the highest and the lowest level needed to detect a meaningful difference) between the HRQoL reported by deprived and not deprived groups with COVID-19 and post-COVID. The power calculation assumed a small effect size, 95% CI and a statistical power of 90%. Based on F-test analysis of variance (fixed effect, main and interaction effects, α err prob=0.05, power=0.90, effect size=0.1 (small)), a sample size of 2063 was suggested (The power analysis was carried out using the G*Power V.3.1.9.7 software). This provides sufficient power to detect statistical effects in multiple regression models.43–45 The sample size is also validated using the effect sizes reported in recent literature for Middle Eastern and North Africa (MENA) countries,39 Hong Kong,40 India,29 Asia41 and China.42 The project will recruit a sample of 3000 randomly selected households, representative across age, gender and education (≥12 000 individuals based on an average household size of 4.7 in India), through Morsel India, a field survey company in India. All members of the household (>18 years old) will be interviewed. Following the norm for survey data collection in household sample surveys in India,46 we aim for a 70% response rate across waves. Assuming a 30% attrition rate, which aims to collect a 70% response rate for household sample surveys in India, an initial sample of 3000 households will suffice for a final sample of at least 2063 households.

Sampling strategy

The sample design will follow the statistical approaches recommended for designing household surveys. Henderson and Sundaresan46 demonstrated that 30 clusters, that us, primary sampling units (PSUs), selected based on probability proportional to size (PPS) from strata (Henderson and Sundaresan46 created 12 hypothetical stratums by considering different levels of immunisation coverage in communities (ie, PSUs)) of a geographical unit (ie, state), provide an estimate of immunisation coverage with 95% confidence limits. Following Henderson and Sundaresan,46 we will adopt a two-stage sampling design strategy to select PSUs in both rural and urban areas of a state. A total of 30–32 PSUs in a state will be allocated and selected from rural and urban areas according to population proportions. Before selection, all PSUs with fewer than 40 households will be linked to the nearest PSU. We will form separate strata in rural and urban areas for household selection in a state.

Stratification and first stage selection of the sample

The purpose of stratification is to create groups within the population that have homogeneous characteristics within a stratum and heterogeneous characteristics between strata (population subgroups). For the selection of PSUs, we based our sampling frame on the 2011 census. In each state, we will create six explicit substrata by crossing three substrata based on the number of households in a PSU, and two substrata based on percentage of PSU’s population belonging to the social subgroups, that is, SC/ST. Within each state, the PSUs will be sorted by the number of households, and three explicit strata of PSUs will be formed with equal numbers of households in each. Additionally, we will sort PSUs within each substratum by the percentage of SC/ST population, resulting in two substrata in each of the three strata.

Implicit stratification

Within each explicit strata, the PSUs will be sorted by female literacy rate of females aged 6 years and above. The pattern of this sorting will alternate between an ascending and descending order for subsequent strata. We will then aggregate the number of households in each PSU to adopt a PPS sampling method to select the 30 PSUs in a state.

Second stage selection

Sample households will come from each selected PSU. Selected PSUs with more than 150 households will be divided into segments (We will follow geographical boundaries such as waterways, roads and streets) containing 60–100 households. One segment will be randomly selected for mapping and listing operation in the PSU. A complete mapping of listing of each household in the PSU will be done to select households for interview. From each of the selected PSUs or segment of a PSU, 20 households will be randomly selected with systematic sampling.

Survey design

The survey will be conducted four times over 2023 and 2024, at equally spaced intervals, starting with winter of 2023 and ending in winter of 2024. The survey data collection team at Morsel India will conduct the survey in consultation with the project team. The survey questionnaire will be developed and conducted using computer-assisted personal interview software SurveyCTO (https://www.surveycto.com/). All interviewers will be trained by researchers who directly work with us. Each survey will take approximately 40–50 min using the official EQ-5D-5L (EuroQol health-related quality of life questionnaire) local language translations approved by the EQ-5D legal team. To maximise participation across all four surveys, phone numbers of respondents will be collected for the purpose of scheduling follow-up interviews. Respondents will be given a gift voucher (worth €2), at the end of follow-up surveys.47 Attrition and non-response in the first follow-up will be appropriately tested following the best practices in the literature.48

The objective of collecting data over four waves (including one prepandemic recall) is twofold: (a) to monitor and evaluate the socioeconomic circumstances experienced by the population amid high rates of COVID-19 and post-COVID conditions and (b) to apply our theoretical framework for measuring of multidimensional deprivation,24 which necessitates multiple waves to accurately measure the persistence and severity of multidimensional deprivation. Furthermore, since most post-COVID symptoms last at least 3 months, collecting data at equally spaced intervals will allow us to closely track health patterns and disease symptoms.

Survey questionnaire

Each survey wave will collect data on the same set of individual and household-level socioeconomic circumstances, HRQoL, COVID-19 history and vaccination information. Detailed information on the data to be collected is presented in table 2. All members of the household (≥18 years old), including the elderly, disabled and low literacy or illiterate members, will be interviewed. The core dataset will include respondents’ age, gender, educational attainment, employment status, occupation, social caste, household size, income and material well-being. Material well-being encompasses housing conditions, access to electricity, fuel, clean water, sanitation, vehicles, land and house ownership. We will also inquire about schooling, financial hardship during COVID-19 and social engagement. Respondents will be asked to report their HRQoL using the EQ-5D-5L instrument.49 The resulting health profiles will be translated into index scores using the Indian EQ-5D-5L value set.50 Additionally, respondents will answer questions on chronic pre-existing health conditions, COVID history, smoking status (following the Fagerstrom test51), alcohol consumption (using the Alcohol Use Disorder Identification Test52), disability status (using the WHO disability assessment tool53) and anxiety (using the COVID-19 anxiety scale54). For respondents reporting a COVID-19 history, we will inquire about their post-COVID symptoms as established by the WHO.52 Focusing on the overall population, rather than only COVID-positive households, aligns with the sampling approach used in ongoing studies in the USA and the UK; specifically, studies such as the RECOVER project (USA)55 and the EuroQol-funded POPCORN.56 57 The POPCORN study investigated the direct and indirect effects of the COVID-19 pandemic on HRQoL in Greece, Italy, the Netherlands, the UK and the USA.

Table 2

Core questionnaire modules, respondents and objective of each survey wave

The first wave of the IndiQoL survey will include questions requiring respondents to recall their health and socioeconomic circumstances prior to the onset of the COVID-19 pandemic. To establish a common reference point, we will use Diwali 2019, a festival celebrated across the entire country, as the time point in the year before the COVID-19 pandemic began. To address potential bias associated with retrospective measurement, we will conduct a test–retest analysis using both retrospective and prospective data, following recent literature. Specifically, we will use the test for concordance and Bland-Altman test.

Ensuring data quality is crucial. The survey company and our research team will independently monitor data quality after every survey wave. We will record the number of households contacted, the acceptance or rejection of participation invitations, and, where known, the reason for non-participation. We will also quantify the rate of non-response for each of the items of the questionnaire.

Patient and public involvement

There is no patient and public involvement in the study.

Empirical analysis

The primary objective of estimating the average impact of COVID-19 and post-COVID conditions on HRQoL, as measured by EQ-5D-5L, will be achieved by comparing individuals with post-COVID symptoms to two reference groups: (a) those who have never had COVID-19 and (b) those who have recovered from an acute infection. Since individuals can experience multiple symptoms that may persist, fluctuate or relapse over time,52 longitudinal EQ-5D-5L data will be particularly valuable for characterising and capturing the longitudinal pattern of HRQoL effects. Our framework will analyse the individual EQ-5D-5L domain responses using ordered logistic regression. To account for observed respondent characteristics (such as age and gender) and prepandemic HRQoL (recall), we will employ either a matching approach or regression adjustments. The EQ-5D-5L index scores will use the value sets recently developed for India.33 The longitudinal nature of the data also allows us to control for some unobserved confounding effects through fixed effects models. To address potential missing values in multiple responses or covariates in longitudinal clustered data, we will use imputation methods.58 Additionally, to capture the heterogeneity of the impact of COVID-19,59–61 we will conduct stratified analysis by age, sex and across six states to provide further insights into the domain wise impact of the number of post-COVID symptoms on HRQoL.

The second objective of this study is to evaluate how disadvantage in multiple domains of well-being affects the HRQoL impact of COVID-19 and post-COVID. We consider several domains based on the global multidimensional deprivation indicators.6 Specifically, the domains include income, schooling, sanitation, electricity, housing conditions, unemployment, social interaction, social capital (trust, security), access to transport, access to healthcare, domestic violence and abuse, food insecurity, social isolation, indoor air pollution and clean drinking water. This objective will use an algorithm to measure persistence augmented multidimensional deprivation developed by Nicholas et al26 and Sinha et al.24 Specifically, we adopt a regression framework to estimate HRQoL values (index scores; domain responses) as a function of variables (including multidimensional deprivation, age, gender, household size, location, health behaviours, post-COVID status). We use generalised linear models (where the dependent variable is related to covariates through a specific link function) for the EQ-5D-5L value scores and ordered logit regressions for domains. We will not only evaluate how overall disadvantage impacts the HRQoL but also explore the specific role of severity and persistence of deprivation in each domain. Using the Shapley decomposition method,62 we will compute the proportional contribution of the components of multidimensional deprivation (ie, severity and persistence) on the HRQoL effect associated with post-COVID. Separately accounting for these components in our regression analyses will allow us to identify how severity and persistence in specific dimensions of deprivation have affected HRQoL for post-COVID. Further ranking of subgroups (gender, rural/urban, COVID-19 status) based on this decomposition technique will allow clear identification of COVID-sensitive dimensions of deprivation specific to each subgroup that policy could target.

Limitations

The survey will be lengthy as it will collect household-level and individual-level self-reported data, which can result in measurement error due to self-reporting bias. The longitudinal nature of this survey would result in some attrition in later waves of the survey, which is anticipated to affect the analysis and results of the study. The first wave of survey includes a module which requires respondents to recall their pre-COVID-19 HRQoL and socioeconomic circumstances. As these events are almost 3–4 years old, there is a possibility of recall bias.

Ethics and dissemination

Ethics approval for this study was given by the Macquarie University Human Research Ethics Committee in Sydney, Australia and by the Institutional Review Board of Morsel India. The results from this project will be published in peer-reviewed journals. Data collected from the IndiQol project will be deposited with the EuroQol group and will be available to use by eligible researchers upon approval of request.

Ethics statements

Patient consent for publication

Acknowledgments

We gratefully acknowledge the support of Elly Stolk, reviewers at the EuroQol Executive Committee, blinded reviewers of our grant application, editorial assistance by Rhonda Daniels and Nyamdavaa Byambadorj. We also thank the team at Morsel India for their partnership on this project.

References

Footnotes

  • X @kompalsinha

  • Contributors Guarantor: KS is the guarantor and takes full responsibility for the finished work and conduct of study, has access to data and controlled the decision to publish. Conception of study: KS. Study design: KS, NG, Y-YG and JH. Drafting the study: KS. Writing: NG, Y-YG and JH contributed to writing, reviewing and editing. Sample design: KS and KK. Questionnaire design and ethics approval: KS and MA. All authors agreed to be accountable for all aspects if the work. English language AI technology was used to improve the clarity of the text.

  • Funding This study is funded by the EuroQol Group, Netherlands under the grant number 430-RA and Macquarie University, Sydney, Australia.

  • Map disclaimer The inclusion of any map (including the depiction of any boundaries therein), or of any geographic or locational reference, does not imply the expression of any opinion whatsoever on the part of BMJ concerning the legal status of any country, territory, jurisdiction or area or of its authorities. Any such expression remains solely that of the relevant source and is not endorsed by BMJ. Maps are provided without any warranty of any kind, either express or implied.

  • Competing interests None declared.

  • Patient and public involvement Patients and/or the public were involved in the design, or conduct, or reporting, or dissemination plans of this research. Refer to the Methods section for further details.

  • Provenance and peer review Not commissioned; externally peer reviewed.