Article Text

Download PDFPDF

Protocol
Evaluation of the psychometric properties of patient-reported outcome measures of health-related quality of life across the European cancer continuum: a systematic review protocol using COSMIN methodology
  1. Laurence Leysen1,
  2. Karla Marticic Giljevic1,
  3. Claire Piccinin1,
  4. Morena Shkodra2,
  5. Madeline Pe1,
  6. Morten Petersen3,
  7. Giovanni Apolone2,
  8. Cinzia Brunelli2,
  9. Claudio Lombardo4,
  10. Galina Velikova5,
  11. Alexandra Gilbert5,
  12. Gabriella Pravettoni6,7,
  13. Gennaro Ciliberto8,
  14. Mogens Groenvold9,
  15. Montserrat Ferrer10,11,
  16. Nanne Bos12,
  17. Ricardo Pietrobon13,
  18. Norbert Couespel14,
  19. Augusto Caraceni15,
  20. Marianne Guren16,
  21. Aude Sirven17,
  22. Hugo Vachon1
  23. on behalf of the EUonQOL consortium
  1. 1Quality of Life Department, European Organisation for Research and Treatment of Cancer, Brussels, Belgium
  2. 2Fondazione IRCCS Istituto Nazionale dei Tumori, Milano, Italy
  3. 3The Research Unit, Department of Palliative Medicine, Bispebjerg Hospital, Copenhagen, Denmark
  4. 4Organisation of European Cancer Institutes – European Economic Interest Grouping, Rome, Italy
  5. 5Leeds Institute of Medical Research, University of Leeds, Leeds, UK
  6. 6Applied Research Division for Cognitive and Psychological Science, Istituto Europeo di Oncologia, Milano, Italy
  7. 7Department of Oncology and Hemato-oncology, Universita degli Studi di Milano, Milano, Lombardia, Italy
  8. 8Regina Elena Institute, Roma, Lazio, Italy
  9. 9Palliative Medicine, University of Copenhagen, Kobenhavn, Region Hovedstaden, Denmark
  10. 10Hospital del Mar Research Institute, Barcelona, Spain
  11. 11CIBERESP, Madrid, Spain
  12. 12, Netherlands Institute for Health Services Research (Nivel), Utrecht, The Netherlands
  13. 13SporeData OU, Tallinn, Estonia
  14. 14European Cancer Organisation, Brussels, Belgium
  15. 15Palliative Care, Pain Therapy and Rehabilitation, Fondazione IRCCS Istituto Nazionale dei Tumori, Milano, Italy
  16. 16Oncology, Oslo University Hospital, Oslo, Norway
  17. 17Unicancer, Paris, Île-de-France, France
  1. Correspondence to Hugo Vachon; hugo.vachon{at}eortc.org

Abstract

Introduction Over the past decades, there has been increasing recognition that assessing patients with cancer’s health-related quality of life (HRQoL) is pivotal to delivering optimal patient-centred healthcare. However, with the increasing number of patient-reported outcome measures (PROMs) available, it becomes more and more challenging to identify the most appropriate PROM to capture HRQoL. Therefore, the aim of this systematic review is to (1) identify all available PROMs assessing HRQoL across the European cancer continuum and (2) critically appraise, compare and summarise the psychometric properties of the identified PROMs.

Methods and analysis Bibliographic databases MEDLINE and PubMed Central (through PubMed) and EMBASE (through Scopus) will be comprehensively searched from database inception until March 2024. Studies reporting on the measurement properties of PROMs assessing HRQoL throughout the European cancer continuum will be included. The evaluation of the psychometric properties, data extraction and data synthesis will be conducted according to the Consensus-based Standards for the selection of health Measurement Instruments (COSMIN) methodology. Two reviewers will independently assess the methodological quality using the COSMIN risk of bias checklist and the COSMIN criteria for good measurement properties. Subsequently, findings will be qualitatively summarised. The Grading of Recommendations Assessment, Development and Evaluations (GRADE) guidelines will be used to grade and summarise the quality of the evidence.

Ethics and dissemination Ethical clearance for this research is not required, as the systematic review will only use information from previously published research. The results of this review will be submitted for publication in a peer-reviewed journal and will be used to provide a set of evidence-based recommendations for a European project (EUonQOL), which aims at developing a new PROM (EUonQOL toolkit) to assess HRQoL across the European cancer continuum. Moreover, findings will be disseminated to a clinical audience and policymakers through conferences, supporting researchers and clinicians in choosing the best measure to evaluate HRQoL in patients with cancer and survivors in Europe.

PROSPERO registration number CRD42023418616.

  • ONCOLOGY
  • Patient Reported Outcome Measures
  • Psychometrics
  • QUALITATIVE RESEARCH
  • Systematic Review
  • Quality of Life
http://creativecommons.org/licenses/by-nc/4.0/

This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

STRENGTHS AND LIMITATIONS OF THIS STUDY

  • This systematic review will report on the psychometric properties of the patient-reported outcome measures (PROMs) used for the assessment of health-related quality of life across the European cancer continuum at a subscale level.

  • This systematic review will follow the highest methodological standards, that is, the Preferred Reporting Items for Systematic Review and Meta-Analyses (PRISMA) Protocols, the Consensus-based Standards for the selection of health Measurement Instruments and the PRISMA 2020 statement.

  • Due to the expected heterogeneity in study samples and reporting of PROMs’ psychometric properties within the literature, no quantitative pooling of data (ie, meta-analysis) will be performed and the findings will be summarised qualitatively.

  • The systematic review will specifically include PROMs validated in European patients with cancer and survivors; thus, PROMs which have only been validated outside of Europe will not be covered.

  • The systematic review will only consider articles published in English, which may introduce an information bias and limit the comprehensiveness of the results.

Introduction

Health-related quality of life (HRQoL) can be defined as ‘how well a person functions in their life and his or her perceived well-being in physical, mental and social domains of health’.1 Functioning refers here to a patient’s ability to carry out some pre-defined activities and well-being to their subjective feelings.1 More specifically, the framework developed by Wilson and Cleary, which is currently the most applied theoretical model of HRQoL,2 conceives HRQoL as a multidimensional construct encompassing five components: symptom status, functional status, biological and psychological variables, general health perceptions and overall quality of life.

Over the past decades, there has been increasing recognition that assessing patients with cancer’s HRQoL is pivotal to delivering optimal patient-centred healthcare.3 4 HRQoL is now perceived as a meaningful endpoint throughout the cancer continuum5 6 and can serve as a valuable source of information to guide healthcare policies (eg, Europe’s Beating Cancer plan7). However, HRQoL is often inaccurately assessed by healthcare providers and poorly captured by medical procedures or tests, highlighting the need for patient involvement in reporting their outcomes.3 4 8 9

Patient-reported outcomes (PROs) are defined by the Food and Drug Administration as ‘a measurement based on a report that comes directly from the patient about the status of a patient’s health condition, without amendment or interpretation of the patient’s response by a clinician or anyone else’.10 Patient-reported outcome measures (PROMs) refer to the tools used to measure PROs and are now systematically used for the assessment of HRQoL in cancer care. To assess the HRQoL of patients with cancer, a wide array of PROMs is currently available, ranging from generic (eg, 36-Item Short-Form Survey Instrument [SF-36], 5-level EQ-5D [EQ-5D-5L]) to cancer-specific (eg, EORTC Core Quality of Life questionnaire [QLQ-C30], Functional Assessment of Cancer Therapy - General [FACT-G]) and tumour-specific tools (eg, EORTC Breast Cancer Module [QLQ-BR23], Functional Assessment of Cancer Therapy - Breast [FACT-B]). However, this diversity made it more and more challenging to select the most appropriate PROM. This choice should be made with regard to the target population, the target construct and, importantly, the PROM measurement properties.11

Over the past years, many systematic reviews comparing PROMs for the assessment of HRQoL in patients with cancer have been published. Most of them focused on PROMs measuring HRQoL in a specific type of cancer (eg, breast cancer, prostate cancer)12–23 or cancer population (eg, cancer survivors, advanced cancer, palliative patients).14 24–26 Several of these reviews focused on PROMs evaluating one specific HRQoL-related construct (eg, depression, fatigue, pain),12 13 27–29 and the majority did not report the psychometric properties of the PROMs under investigation per subscale.13–17 19–22 24 25 27 28 30 For the reviews reporting on the psychometric properties of PROMs, the methods used to assess both the quality of the studies and the results differed significantly.31

Currently, the highest methodological standards for the conduct of systematic reviews on the psychometric properties of PROMS are provided by the COnsensus-based Standards for the selection of health Measurement INstruments initiative (COSMIN32). However, among the reviews published to date, only half relied on the COSMIN methodology, and most of them did not apply it fully. For instance, in several reviews, the rating of the overall results per PROM was unclear or not performed,12 16 20 27 33 and the risk of bias assessment or the grading of the evidence was not conducted.12 13 24 27 30 33 As such, a comprehensive overview of the psychometric properties of PROMs used for the assessment of HRQoL across the cancer continuum is still needed and missing. Therefore, this study aims to systematically review the measurement properties of PROMs assessing the multidimensional construct of HRQoL in European patients with cancer and survivors to make objective recommendations on the most suitable PROM to use in these populations.

Methods and analysis

The protocol of this systematic review is based on the PRISMA-P (Preferred Reporting Items for Systematic Reviews and Meta-Analyses Protocols) guidelines34 and has been prospectively registered in the International Prospective Register of Systematic Reviews database (PROSPERO 2023, CRD42023418616). In case of protocol amendments, modifications will be reported in the publication reporting the results of the systematic review as supplementary material.

The systematic review will be conducted according to the COSMIN guidelines for systematic reviews32 and will use the COSMIN taxonomy of measurement properties (table 1). All steps of the screening process will be performed using Rayyan.35

Table 1

COSMIN definitions of measurement properties

Search strategy

A systematic search will be performed in the bibliographic databases MEDLINE and PubMed Central (through PubMed) and EMBASE (through Scopus) without a publication date restriction up to February 2023 (updated up to March 2024). The search strategy will be based on the Population Intervention Comparator acronym36 (PICOM) in which the population will be represented by patients with cancer and survivors, the outcome by health-related quality of life and the methods by psychometric properties. No comparator or intervention will be used. Both Medical Subject Headings (MeSH) terms and text words will be used.

Original research articles published in English (including erratum and correction articles) will be considered for inclusion. Reference lists of included articles will be manually searched by hand to ensure all relevant studies will be considered. Additionally, the exclusion filter of Terwee et al37 will be used. The grey literature will not be considered.

The respective search strategies that will be used for PubMed and Scopus are provided in online supplemental appendix 1.

Selection process

The selection process will be twofold. First, it will be determined whether the PROMs captured by the search should be included or excluded. Second, all titles and abstracts will be screened for eligibility in a blinded standardised manner. If the study seems relevant or in case of doubt, the full-text article will be retrieved and screened. Both the abstract and full-text screening will be done independently by a minimum of two reviewers. For both steps, a pilot screening will be performed on a random subsample of studies and the screening methodology will be clarified within the review team if deemed necessary. Discrepancies will be resolved by discussion and/or consultation of a third reviewer. Inter-rater reliability will be assessed and reported.

PROM selection

To be included, PROMs will need to meet the following criteria:

  1. PROMs must be self-administered questionnaires (paper-pencil or electronic). This excludes assessment tools based (fully or partially) on hetero-assessment, interactive voice response systems, talking touch screens, drawings or nomograms. An interview format is allowed when the study population is not able to complete the PROM independently.

  2. PROMs must assess HRQoL as a multi-domain concept (ie, based on a multidimensional model of HRQoL) and be applicable across cancer types. This excludes tools designed to assess a specific HRQoL subdomain (eg, exclusively assessing physical functioning) or cancer site (eg, assessing HRQoL following breast reconstruction). Preference-based measures that are used to calculate quality-adjusted life years within the field of health economics will not be considered for the scope of this review.

  3. PROMs must be validated (ie, evidence of content validity, structural validity or construct validity) for use in the target population of European patients with cancer or survivors (online supplemental appendix 2). In case no European validation can be found for a PROM identified through the initial search, an additional search will be performed in PubMed (online supplemental appendix 3). If no evidence of validity among European patients with cancer or survivors can be retrieved after the additional search, the PROM and its related articles will be excluded.

Study selection

Studies will be included when the following criteria are met:

  1. Studies must provide information on the measurement properties of the included PROMs. For this review, the development, content validity, structural validity/unidimensionality, internal consistency, cross-cultural validity, measurement invariance, reliability, measurement error and construct validity will be considered. Studies reporting on criterion validity will be considered to inform construct validity due to the absence of a gold standard for PROMs.32 Responsiveness will not be assessed in this review since the content and the number of hypotheses to assess responsiveness are inexhaustible and arbitrary, and the quality of comparator instruments (in the absence of gold standard) cannot be proven.38

  2. Studies must provide original research data (including erratum and correction articles) and be published in English. Articles written in other languages or case studies, protocols, conference abstracts, conference reports, commentaries, opinion articles and reviews will not be considered.

  3. Studies must be performed in adult European patients with cancer or survivors (mean age ≥21 years and not defined as adolescents and young adults). Articles including ‘mixed samples’ (ie, European patients with cancer and patients without cancer) will only be included if separate results are provided for the patients with cancer group. Studies involving both European and non-European patients with cancer will be included. Studies only reporting results within a non-European cancer sample will be excluded (except for development and content validity studies). Articles reporting on patients with benign tumours or including less than 15 patients with cancer will also be excluded.

Detailed information on the selection process will be reported in a PRISMA flowchart (PRISMA 2020 flow diagram39).

Data extraction

During the data extraction, it will be determined which measurement properties will be evaluated for every included study. Extracted data will be entered into a customised xls file using Microsoft Excel. Data extraction will be performed independently by two reviewers, and discrepancies will be resolved by discussion and/or consultation of a third reviewer. Data extraction will be piloted on a random subsample of studies, and the extraction methodology will be clarified within the review team if deemed necessary. When available, data will be extracted as follows:

  1. Study characteristics: authors, title, publication year, design.

  2. Study sample characteristics: sample size, age, gender, EU/non-EU, clinical status (general population, patients without cancer, patients with cancer undergoing curative treatment, patients with cancer undergoing palliative treatment, cancer survivors), cancer stage and cancer site.

  3. PROM characteristics: PROM specimen, original development paper, original language in which the PROM was developed, target population for whom the PROM was developed, number of subscales and items, content coverage, recall period, response options, type of scale(s), scoring and estimated duration of assessment. In case of missing data, additional information will be retrieved by searching Google and ePROVIDE (https://eprovide.mapi-trust.org) or by contacting the PROM developers.

  4. PROM measurement properties: development and content validity, structural validity/undimensionality, internal consistency, cross-cultural validity and measurement invariance, reliability, measurement error and construct validity. Detailed information on the data that will be extracted for these measurement properties is provided in online supplemental appendix 4.

Following data extraction, all PROMs and related studies will be included in the next phase of the review process for quality assessment.

PROM quality assessment

A scoring manual based on the procedures mentioned hereafter will be built and piloted on a random subsample of studies to enhance the inter-rater homogeneity of PROM quality assessment. The assessment will be performed independently by two reviewers. Discrepancies will be solved by consensus. In case of disagreement, a third reviewer will be involved to solve the discrepancy. As per COSMIN guidelines,32 quality assessment will be conducted sequentially for each PROM in the following order: development/content validity, internal structure (ie, structural validity, internal consistency and cross-cultural validity/measurement invariance), reliability, measurement error and construct validity (ie, criterion validity and hypotheses testing). The COSMIN group defines content validity as the most important measurement property and recommends assessing it first and excluding PROMs with high-quality evidence of inadequate content validity.32 40 However, studies that would report on the poor content validity of a PROM are unlikely to be published and this requirement is unlikely to be met, which does not allow for differentiating between PROMs based on the quality of content validity. Therefore, it was decided that the remaining psychometric properties will not be assessed if PROMs demonstrated inadequate content validity at any level of evidence or no evidence of content validity could be found, as PROMs should be relevant, comprehensive and comprehensible with respect to HRQoL and the European cancer population. Studies assessing structural validity based on a Multi-Trait Multimethod approach41 will be considered to inform construct validity as this method is not appropriate for the assessment of structural validity.32

For all psychometric properties, the assessment will be performed at a subscale level (when applicable). Quality assessment will be performed for each study and measurement property as follows:

Risk of bias assessment

The methodological quality of each study will be evaluated using the COSMIN Risk of Bias Checklist,42 which provides a set of standards for design requirements and preferred statistical analyses per measurement property. These standards provide a framework to assess whether the results, based on the methodological quality of a given study, are trustworthy. Each standard will be rated on a four-point rating scale as ‘very good’, ‘adequate’, ‘doubtful’ or ‘inadequate’. Each assessment of a measurement property is considered to be a separate study. For development/content validity, the quality of each standard will first be determined by retaining the highest rating across the identified studies before taking the lowest rating of each standard to determine the overall quality of the PROM development and content validity. For all other measurement properties, the overall rating of the quality of each study will be determined separately by taking the lowest rating of each standard. Several adjustments were made to the ratings of the COSMIN Risk of Bias Checklist, which are all listed in online supplemental appendix 5.

Criteria for good measurement properties

These criteria are recommendations from COSMIN for which PROMs are assessed as appropriate to be used in research or clinical practice.32

Development and content validity

The overall content validity scoring will comprise four steps.40 First, the results of both the PROM development and content validity studies will be rated by two reviewers independently (online supplemental appendix 6). Each criterion will be scored as ‘sufficient’ (+), ‘insufficient’ (−) or ‘indeterminate’ (?). Reviewers will rate the content of the PROM of interest with ‘sufficient’ (+) or ‘insufficient’ (−), using the same criteria. When there is no content validity study available, content validity criteria will be rated ‘insufficient’ (−). The scoring ‘indeterminate’ (?) will only be used when there is evidence that some aspects of content validity were assessed, but the authors did not provide enough information to score the criterion appropriately. Second, an overall ‘sufficient’ (+), ‘insufficient’ (−), ‘indeterminate’ (?) or ‘inconsistent’ (±) rating will be calculated for relevance, comprehensiveness and comprehensibility per study40 (online supplemental appendix 7). Third, an overall rating per PROM will be calculated for relevance, comprehensiveness and comprehensibility by jointly considering the results of the PROM development and content validity studies, and the reviewer’s ratings. The evidence from the content validity will be weighted higher than the evidence from the development study and the reviewer’s rating. Online supplemental appendix 8 provides a detailed overview of this overall rating process. Last, an overall ‘sufficient’ (+), ‘insufficient’ (−) or ‘inconsistent’ (±) content validity rating will be calculated by aggregating the overall relevance, comprehensiveness and comprehensibility rating. Online supplemental appendix 9 provides a detailed overview of the overall content validity rating process.

Other psychometric properties

Criteria for good measurement properties will be applied for each individual study, resulting in a ‘sufficient’ (+), ‘insufficient’ (−), or ‘indeterminate’ (?) rating. The evidence across studies will be summarised qualitatively, and it will be decided whether the results per psychometric property are consistent. Consistency is defined as at least 75% of individual studies being rated similarly for a given PROM and measurement property. If the threshold of 75% is not reached for any of the rating options and studies with exclusively ‘+’ or ‘−’ ratings are available in combination with ‘?’ ratings, studies with a ‘?’ will be ignored and not included when summarising the results. In all other cases, the overall rating will be scored as ‘inconsistent’ (±). If the results are inconsistent, possible explanations will be explored and the results will be summarised per subgroup when applicable. If no explanation for the inconsistency can be found, the overall rating will remain ‘inconsistent’ (±). A detailed overview of the criteria for good measurement properties, incorporating the inconsistency rating, can be found in table 2. For construct validity, a priori hypotheses were formulated to evaluate the results (table 3).

Table 2

COSMIN criteria for good measurement properties

Table 3

A priori hypotheses for construct validity

Quality of evidence

The quality of the evidence will be graded per measurement property using a modified Grading of Recommendations Assessment, Development and Evaluation approach (GRADE32 43) resulting in four quality levels: ‘high’, ‘moderate’, ‘low’ or ‘very low’. Starting from high-quality level, the quality of evidence will be downgraded if applicable according to the following factors: risk of bias (methodological quality of the studies), inconsistency (of results across studies), imprecision (total sample size of the studies) and indirectness (evidence comes from a different target population). For some factors, the original COSMIN modified GRADE approach does not provide clear guidance on the criteria to be used for the risk assessment; therefore, the GRADE approach was further adapted. The adapted GRADE approach that will be used is reported in tables 4 and 5 for development/content validity and the remaining psychometric properties respectively. The quality of evidence for internal consistency will start at the level of structural validity.32

Table 4

COSMIN adapted the GRADE approach for development/content validity

Table 5

COSMIN adapted the GRADE approach for other psychometric properties

Reporting of results

The reporting of the results will follow the PRISMA 2020 statement and a PRISMA checklist will be provided.44 Considering the expected high heterogeneity of the results, no quantitative pooling of the studies’ results per PROM will be performed and no meta-analysis will be planned. In line with the COSMIN guidelines,32 summary tables describing the PROMs’ characteristics, including feasibility and interpretability, and study populations will be produced. The reporting of the results will include the individual ratings on PROM development and content validity, PROM measurement properties and quality of evidence per study. The findings will then be qualitatively summarised as follows.

For content validity, an overall rating per PROM will be calculated for relevance, comprehensiveness and comprehensibility by jointly considering the results of the PROM development and content validity studies, and the reviewer’s ratings. The overall content validity will be rated as ‘sufficient’ (+), ‘insufficient’ (−) or ‘inconsistent’ (±), by aggregating the overall relevance, comprehensiveness and comprehensibility rating.

For the remaining psychometric properties, the evidence across studies will be summarised and it will be decided whether the results per psychometric property are consistent. Consistency will be defined as at least 75% of studies being rated similarly for a given PROM and measurement property. If the threshold of 75% is not reached for any of the rating options and studies with exclusively ‘+’ or ‘−’ ratings are available in combination with ‘?’ ratings, studies with a ‘?’ will be ignored and excluded from the summary. In all other cases, the overall rating will be scored as ‘inconsistent’ (±). For construct validity, a priori hypotheses will be formulated to evaluate the results.

Recommendations

PROMs with sufficient content validity (ie, rated ‘±’ or higher) and at least low-quality evidence (ie, GRADE)43 for sufficient structural validity and internal consistency will be recommended.32 On the other hand, PROMs will not be recommended when there is high-quality evidence for any insufficient measurement property. As with the quality assessment, the formulation of recommendations will be made at a subscale level.

Patient and public involvement

Currently, it is expected that researchers actively involve patients, healthcare professionals and the public in their research. Within systematic reviews, active patient and public involvement has been proposed as a way to enhance the actual and perceived usefulness of the summarised evidence, hence addressing barriers to the uptake of evidence in practice.45 Patient involvement will be ensured at key stages of the systematic review and peer reviewing the academic papers. The results of the review will be discussed with a representative panel of stakeholders, including patients and healthcare professionals to ensure the co-design approach throughout the entire EUonQoL project. It is essential that the PROMs selected to serve as a basis for the development of the EUonQOL toolkit are supported by evidence of content validity, that is, the items constituting these PROMs should be relevant, comprehensive and comprehensible with respect to HRQoL and the European cancer population.

Ethics and dissemination

Ethical clearance for this research is not required, as the systematic review will only use information from previously published research. The results will be disseminated to clinicians, researchers and health policymakers by presenting at relevant conferences and by publication in a peer-reviewed journal. Besides that, the findings will be used to identify the most appropriate PROMs for the assessment of HRQoL throughout the European cancer continuum, to serve as a basis for the development of the EUonQOL toolkit and to provide evidence-based recommendations to the EUonQOL consortium.

Ethics statements

Patient consent for publication

References

Footnotes

  • Contributors HV conceived the idea and is the guarantor. HV, KMG and LL planned and designed the study protocol, the data extraction and the data assessment. HV, KMG and LL wrote the first draft of the protocol. CP, MPe, MS and MPetersen provided critical insights. GA, CB, CL, GV, AG, GP, GC, MG, MF, NB, RP, NC, AC, MG and AS have approved and contributed to the final written manuscript. The EUonQOL consortium supported the conduct of this work.

  • Funding This work was supported by the European Union’s Horizon Europe Research and Innovation Programme under Grant Agreement Number 101096362.

  • Competing interests None declared.

  • Patient and public involvement Patients and/or the public were involved in the design, or conduct, or reporting, or dissemination plans of this research. Refer to the Methods section for further details.

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.