Article Text

Original research
Quickly identifying people at risk of opioid use disorder in emergency departments: trade-offs between a machine learning approach and a simple EHR flag strategy
  1. Izabela E Annis1,
  2. Robyn Jordan2,
  3. Kathleen C Thomas1
  1. 1Division of Pharmaceutical Outcomes and Policy, University of North Carolina at Chapel Hill Eshelman School of Pharmacy, Chapel Hill, North Carolina, USA
  2. 2Department of Psychiatry, University of North Carolina at Chapel Hill School of Medicine, Chapel Hill, North Carolina, USA
  1. Correspondence to Izabela E Annis; izabela{at}unc.edu

Abstract

Objectives Emergency departments (EDs) are an important point of contact for people with opioid use disorder (OUD). Universal screening for OUD is costly and often infeasible. Evidence on effective, selective screening is needed. We assessed the feasibility of using a risk factor-based machine learning model to identify OUD quickly among patients presenting in EDs.

Design/settings/participants In this cohort study, all ED visits between January 2016 and March 2018 for patients aged 12 years and older were identified from electronic health records (EHRs) data from a large university health system. First, logistic regression modelling was used to describe and elucidate the associations between patient demographic and clinical characteristics and diagnosis of OUD. Second, a Gradient Boosting Classifier was applied to develop a predictive model to identify patients at risk of OUD. The predictive performance of the Gradient Boosting algorithm was assessed using F1 scores and area under the curve (AUC).

Outcome The primary outcome was the diagnosis of OUD.

Results Among 345 728 patient ED visits (mean (SD) patient age, 49.4 (21.0) years; 210 045 (60.8%) female), 1.16% had a diagnosis of OUD. Bivariate analyses indicated that history of OUD was the strongest predictor of current OUD (OR=13.4, CI: 11.8 to 15.1). When history of OUD was excluded in multivariate models, baseline use of medications for OUD (OR=3.4, CI: 2.9 to 4.0) and white race (OR=2.9, CI: 2.6 to 3.3) were the strongest predictors. The best Gradient Boosting model achieved an AUC of 0.71, accuracy of 0.96 but only 0.45 sensitivity.

Conclusions Patients who present at the ED with OUD are high-need patients who are typically smokers with psychiatric, chronic pain and substance use disorders. A machine learning model did not improve predictive ability. A quick review of a patient’s EHR for history of OUD is an efficient strategy to identify those who are currently at greatest risk of OUD.

  • Substance misuse
  • STATISTICS & RESEARCH METHODS
  • ACCIDENT & EMERGENCY MEDICINE

Data availability statement

Data may be obtained from a third party and are not publicly available. The data used in this study cannot be made publicly available. These data were obtained from the University of North Carolina Data Warehouse for Health, and are subject to a data use agreement that does not allow release or sharing of these data.

http://creativecommons.org/licenses/by-nc/4.0/

This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

STRENGTHS AND LIMITATIONS OF THIS STUDY

  • Large cohort study analysing prevalence of opioid use disorder in emergency departments.

  • Multivariable analyses taking into consideration demographic information, diagnoses, procedures and prescription orders.

  • Use of advanced machine learning modelling in conjunction with remedial measures for outcome imbalance.

  • Detailed presentation of performance statistics, stratified by outcome, interpreted in the light of results’ suitability for use in clinical practice.

  • Data contain records from only one healthcare system with possible variable misclassification due to missing medications and diagnoses from encounters outside of the studied health system.

Introduction

The rate of opioid-related hospitalisation and emergency department (ED) visits between 2005 and 2014 increased twofold.1 ED visits for opioid overdose rose across the USA by 30% between 2016 and 2017.2 Risk from overdose due to opioid consumption has also increased. In one state, emergency medical services trips to the ED increased 17%.3 As a result, EDs are a crucial point of contact for people with opioid use disorder (OUD). In recent years, EDs have received more recognition as places where patients can be identified and connected to care for OUD.4–7 Nevertheless, evidence indicates that there is significant variation in screening and treatment of OUD in EDs.8 9 It is unknown how to best selectively identify patients with risk of OUD who visit EDs, in order to accomplish efficient screening and referral to treatment.

Use of previous diagnoses and procedures coded in existing patients’ electronic health records (EHRs) may be an efficient way to identify those at risk during their ED visit. Several screening tools have been developed for patients with chronic pain to assess their risk of OUD,10 11 and numerous studies have investigated the relationship between individual patient-level factors and OUD in the population.12 Risk factors found to be associated with OUD include history of OUD or other substance use disorders, psychiatric disorders, concurrent prescriptions of selected psychiatric medications and certain social settings that foster illicit substance use.12 13

Proliferation and wide availability of administrative EHR databases have propelled advances in predictive analytics, which yield accurate identification of patients at risk of acute outcomes and healthcare utilisation. Some of the efforts include prediction of hospital readmission,14 repeat ED visits15 and hospital admission at the time of ED triage.16 These successes point to the promise of using patients’ historical EHR data to predict who is at risk of OUD among patients presenting in EDs. While the Diagnostic and Statistical Manual of Mental Disorders (DSM) and identification using Structured Clinical Interviews is considered as the ‘gold standard’ for diagnosis of OUD,17 a readily available alternative, the International Classification of Diseases (ICD) diagnostic codes, could be used in the development of a predictive algorithm. A reliable, predictive model that could identify patients at high risk of OUD could serve as a decision support tool for triage team members and alert them when a patient should be screened for OUD.4

The primary objectives of this study were to (1) describe the association between patient demographic and clinical characteristics and diagnosis of OUD among patients presenting at EDs, and (2) investigate the feasibility of constructing a valid machine learning predictive model of OUD in the ED in order to identify the best strategy for EDs to identify patients at high risk of OUD to facilitate targeted screening and diagnosis. Variable selection was informed by the Andersen Behavioural Model.18

Methods

Data and study sample

We used administrative EHR data from a large university health system in North Carolina with seven EDs across separate counties, which included patient demographics, medical encounter details, procedures, diagnoses, vitals, laboratory results and prescription medication orders. The EDs are located in two ‘mostly rural’19 counties (Chatham, Johnston) and five urban counties (Caldwell, Guilford, Henderson, Orange, Wake). The main academic hospital is a safety net hospital that serves patients from the entire state. Our cohort included all ED encounters that occurred during January 2016–March 2018, for patients aged 12 years or older, who had at least one clinical encounter during the baseline period. The baseline period was defined as 12 months prior to the ED encounter. Informed presence bias, the idea that inclusion and number of records in an EHR are associated with patients’ health status and severity of illness, is a well-known problem affecting studies using EHRs.20 Typically, analyses using administrative databases impose various inclusion/exclusion criteria, for example, continuous enrolment or a requirement to have at least one prescription fill. While this approach allows for a more rigorous study design, it also limits the relevance of the results to a narrow population of patients. We aimed to make our findings generalisable to the wider population of patients seeking care in ED, and therefore did not impose stricter data sufficiency inclusion criteria.

Outcome

Our outcome, the ‘index’ OUD, was a dichotomous variable indicating whether a patient was diagnosed with OUD during the ED encounter. Because our goal was to identify patients at high risk of OUD to refer for screening and diagnosis, we used a broad definition of OUD which included a range of opioid-related disorders, adverse effects of opioids and opioid poisoning.21 We identified OUD using ICD-10 codes (F11, T40.0, T40.1, T40.2, T40.3, and T40.6) and ICD-9 codes (304.00, 304.01, 304.02, 304.03, 304.70, 304.71, 304.72, 304.73, 305.50, 305.51, 305.52, 305.53, 965.00, 965.01, 965.02, 965.09, E850.0, E850.1, E850.2, E935.0, E935.1, E935.2, E940.1.). While we acknowledge the results from previous studies22 23 that examined validity of ICD codes in the identification of OUD and found that ICD diagnoses did not perfectly align with the DSM OUD diagnostic criteria, we proceed with this generous definition with the intention of retaining everyone who might be at risk of OUD and potentially benefit from screening.

Predictors

From EHR patient demographics, we obtained Andersen Model predisposing and enabling characteristics: age, sex, race, ethnicity and marital status.24–26 Using the EHR’s encounter-level data, we constructed Andersen Model indicators for need factors. We included comorbidity levels measured by the Charlson Comorbidity Index,27 history of alcohol use disorder, tobacco use disorder, mental illness, domestic violence and abuse, chronic pain and an indication of previous hospitalisation.24–26 An indicator for previous diagnosis of OUD was also ascertained from the EHRs.28 All clinical conditions were assessed in the 12-month baseline period and defined using previously validated algorithms from the Centers for Medicare and Medicaid Chronic Conditions Warehouse.29 We also used the EHR’s prescription order data to assess previous use of analgesics, psychotropics, medications for OUD (MOUD) and opioids.26 Psychiatric disorder was comprised of any anxiety disorder, bipolar disorder, depression, personality disorder, other psychotic disorder, post-traumatic stress disorder and schizophrenia. Pain was captured by dorsopathies, migraine and chronic headache, other pain, fibromyalgia, and chronic pain and fatigue. Physical accidents were comprised of transportation accidents, burns and corrosions and injuries. In sensitivity analyses, we analysed our data using disaggregated list of variables (for instance, instead of psychiatric disorder, we used individual indicators for anxiety, depression, etc). Our EHR database did not capture information on previous OUD-specific screening and therefore we were unable to include such variables as predictors.

Statistical analysis and machine learning model development

We used absolute standardised differences to evaluate the extent of the difference in means and prevalence of encounter characteristics between OUD and non-OUD groups.30 To assess the magnitude of the association between individual patient characteristics and diagnosis of OUD, we used generalised estimating equations accounting for repeated measures. Previous diagnosis of OUD was the strongest predictor and related to many other risk factors. Therefore, we also explored the relationship between patient characteristics and OUD among patients with no previous record of OUD.

To explore the feasibility of building a predictive model that could be implemented and used in EDs, we applied the Gradient Boosting Classifier,31 XGBoost, as implemented in Python V.2.7.17. We set aside a 50% random sample of our dataset as an independent validation dataset and used the remainder for model development. All performance measures are reported for the independent validation dataset. To develop the prediction model, we used a grid search approach, with stratified fivefold cross-validation, to identify the values of the hyperparameters that optimise the model prediction performance. Online supplemental table 1 provides the list of parameters used in optimisation and the range of explored values.

As is the case with many real-world health outcomes, our outcome is infrequent. Unbalanced outcome distributions can result in poor sensitivity in machine learning models and several remedial measures have been proposed to address this issue.32 33 We evaluated and compared the performance of three approaches: (1) the naïve model, where no imbalance remedy was applied, (2) a simple oversampling of the minority class and (3) a synthetic minority oversampling technique (SMOTE).32 In this study, the minority class is comprised of the cases where the outcome, index OUD, is equal to 1. In simplified terms, SMOTE is an oversampling method where new cases of the minority class are generated through first randomly selecting a case, then finding its k minority class neighbours, and finally generating a synthetic case by randomly drawing from the distributions of the characteristics of its neighbours. SMOTE’s key feature differentiating it from a simple oversampling approach is the process of synthesising new cases instead of generating replicates. In our analyses, we set the parameter k, the number of nearest neighbours, to five. In our model optimisation process, we used an F1 score as the primary measure of the models’ performance.34 The F1 score is a harmonic mean of the precision (positive predictive value) and recall (sensitivity) and therefore is a more robust measure of model performance than other metrics.35–37 To comprehensively assess the performance of the final models, we also report area under the receiver operating characteristic curve (AUC), precision and recall. Because the process of selecting threshold values is a multifaceted and complex process,38 we present all evaluation metrics based on the default threshold of 0.5. We report these performance measures in total for the entire sample and stratified by outcome.

In order to produce findings useful for identifying new or not previously identified OUD, we additionally investigated how well a predictive model would perform among patients with no previous diagnosis of OUD. In primary analyses, we built two sets of machine learning models. We constructed (1) one model for all patients and (2) one for the subset of patients with no history of OUD diagnosis. Because of strong association between baseline use of medications to treat OUD and previous diagnosis of OUD, we also explored models that included both variables as well as one variable at a time and reported results from the model with the best performance metrics. In sensitivity analyses, we built the machine learning models using disaggregated risk factors. The predictive performance of the models based on the full sample was also stratified by presence/absence of previous OUD.

To understand the trade-offs of using the full sample machine learning model versus a simple indicator of history of OUD, we performed simple calculations for 10 000 hypothetical encounters in the ED. For the purpose of this exercise, we assumed that our outcome, index OUD, was a true reflection of patients’ OUD status. By applying rates and findings from our data, we evaluated how many patients would be correctly identified for screening, how many would be missed and how many would be screened unnecessarily by each of the two methods.

We used SAS V.9.4 for statistical analyses and Python V.2.7.17 for machine learning model implementation.

Patient and public involvement

In our practice and research, patients have told us that it is very difficult to find and engage with treatment for OUD. An emergency room visit can be a turning point that makes a person open to receiving treatment. For this reason, we designed this study to develop an efficient means of recognising OUD and need for treatment so that people could be connected with treatment at their ED visit. Because we used already existing administrative data, patients were not involved in the study design and implementation.

Results

The final analytical sample included 345 728 ED encounters. The prevalence of index OUD diagnosis in the full cohort was 1.16%. Most patients had no evidence of OUD history prior to index OUD diagnosis (N=327 369 visits, 95%). The prevalence of index OUD diagnosis in this group was 0.57%. OUD was coded as a primary concern in 1024 (25.6%) of the 3995 cases. When OUD was not noted as a chief complaint, the most frequently recorded primary reasons for the remaining 2971 ED visits were: abdominal and pelvic pain (4.4%), alcohol-related disorders (4.1%), symptoms and signs involving emotional state (3.4%), dorsalgias (3.3%) and other sepsis (2.8%). Table 1 presents sample characteristics for the full cohort and for the subcohort of patients with no history of OUD, stratified by index diagnosis of OUD. When compared with patients with no index diagnosis of OUD, patients with a diagnosis of OUD were more likely to be male, white, non-married, and have a history of chronic pain, mental health conditions, alcohol use and tobacco use disorders. Fifty-three per cent of patients with index OUD diagnosis also had a previous diagnosis of OUD in their EHRs, compared with only 4.8% of patients with no index OUD. Almost 15% of patients with index OUD were previously treated with medications for OUD (methadone, buprenorphine, naltrexone). In contrast, the group with no index OUD received MOUD in only 2.4% of their encounters. While the demographic characteristics in the full cohort and in the subcohort with no index OUD were similar, the prevalence of different clinical conditions (for instance, mental health conditions, alcohol and tobacco use disorders) in the subcohort was, on average, lower than in the full cohort. Online supplemental table 2 provides descriptive statistics for the disaggregated set of factors.

Table 1

Characteristics of the study cohort stratified by presence of OUD diagnosis prior to and at index ED visit

Table 2 shows ORs that quantify the association between OUD diagnosis in the ED and patient demographics and clinical characteristics. Previous diagnosis of OUD was the strongest predictor of index OUD (OR=13.4, 95% CI: 11.8 to 15.1). Given its high correlation with all other substance use disorders and many of the clinical conditions, it was excluded from the second model where baseline use of MOUD (OR=3.4, 95% CI: 2.9 to 4.0), white race (OR=2.9, 95% CI: 2.6 to 3.3), tobacco use disorder (OR=2.3, 95% CI: 2.1 to 2.6) and non-married marital status (OR=1.6, 95% CI: 1.5 to 1.8) were the strongest predictors. Results from the model based on encounters with no history of OUD indicate that white race (OR=2.6, 95% CI: 2.3 to 3.0), previous use of MOUD (OR=2.5, 95% CI: 1.9 to 3.3), prescription of opioids in the baseline (OR=1.8, 95% CI: 1.6 to 2.1) and tobacco use disorder (OR=1.7, 95% CI: 1.5 to 1.9) were the key risk factors for OUD diagnosis in the ED.

Table 2

Generalised estimating equations model of patient-level factors and diagnosis of OUD in the ED

Table 3 presents performance metrics from the machine learning models. The best model, the model built using all ED visits and employing the SMOTE algorithm for outcome imbalance, achieved an overall F1 score of 0.97 and AUC of 0.71. This model attained a specificity of 0.97 but sensitivity of only 0.45. Further, after stratifying the performance metrics based on previous diagnosis of OUD, the results show that, among ED visits with previous OUD, the model achieved a high sensitivity of 0.81 but a low specificity of 0.43. Among encounters with no previous OUD, the model reached a perfect specificity of 1.0 and a poor sensitivity of 0.02. Findings were similar for the model based on visits with no history of OUD; models achieved high specificity while sensitivity remained very low. Results from our sensitivity analyses, models based on the full cohort as well as the subcohort, indicate that the model using the disaggregated set of variables (results in the Online supplemental table 3) did not perform substantially better than the models using the aggregated set of factors.

Table 3

Performance metrics from XGBoost machine learning model using short list of predictors

Table 4 presents trade-off calculations of our two hypothetical scenarios for selecting patients for OUD screening. These calculations operate on the assumption that if the value of the outcome (OUD in the ED) was 0 but the patient had a history of OUD elsewhere in the record, then the patient was either successfully treated or recovered. The first method, based on a simple EHR indicator of history of OUD, would correctly identify 53.2% of patients with OUD but would require 469 patients to be screened unnecessarily. On the other hand, the method based on the machine learning predictive model would correctly identify 45.0% of the patients with OUD but would require only 296 patients to have an unwarranted screen.

Table 4

Trade-off calculations, for 10 000 hypothetical ER encounters and a true OUD rate of 1.15%, to show what would happen if we screened (1) everyone indicated by the records of history of OUD, and (2) everyone indicated by the machine model

Discussion

This study examines strategies to identify patients who are at risk of OUD, among individuals presenting at EDs in order to facilitate targeted screening by busy ED staff. Among ED encounters at a large healthcare system, 1.16% resulted in the diagnosis of OUD. Patients who present at the ED with OUD are typically white, smokers, and with psychiatric disorders, chronic pain, substance use disorders and use of prescription opioids. History of OUD is the single strongest predictor of OUD diagnosis at the ED and is simultaneously strongly correlated with many other clinical conditions such as substance use disorders and mental health conditions. Among patients with an OUD diagnosis at the ED, over 50% had a previous diagnosis of OUD, while among encounters with no OUD, only 4.8% had a record of earlier OUD diagnosis.

Even though our predictive model of OUD based on patients’ EHRs achieved a high overall accuracy of 0.96, weighted F1 score of 0.97 and AUC of 0.71, its suitability for use in clinical practice is debatable due to low sensitivity. Diagnosis of OUD, like many other health outcomes or adverse effects, is a rare event, posing a significant challenge in development of machine learning predictive models. Our application of remedial approaches, simple oversampling and SMOTE improved the performance of the models slightly; however, poor sensitivity remained. While we would be inclined to accept a low specificity of the models, as it would cause no harm to the patients to have them selected for an OUD screening questionnaire, low sensitivity makes us pause and assess these results in the light of other viable options.

Even though both of our modelling approaches, logistic regression and machine learning models, suggest that there are several clinical and demographic factors that are strongly associated with OUD, the machine learning model implies that other non-clinical variables are needed to make accurate prediction of risk of OUD. For example, inclusion of social context and history in the EHR has the potential to improve model outcomes.39 Currently, history of OUD stands out as the most valuable predictor to select patients for OUD screening in EDs. A flag in the medical record indicating that a patient was diagnosed with OUD in the past year could serve as a simple first step in a decision-making protocol to select patients for OUD screening. Our trade-off calculation of 10 000 hypothetical ED encounters suggests that having such a flag in the EHR would correctly identify 53% of patients with OUD for screening, while 469 out of the 10 000 would be screened unnecessarily. However, this calculation was based on the generous assumption that our outcome, index OUD in the ED, was recorded properly if the patient truly had a current OUD. Additionally, if OUD was not noted in the ED, and we assume that the patient had ongoing chronic OUD as indicated by historical diagnosis of OUD, then some, if not all, of the 469 screening tests might be justified. If the machine learning model were to be incorporated into the EHR system, the model would correctly identify 45% of the patients with true OUD and only 296 additional patients without OUD would be screened. Given the complexity of incorporating a machine learning model into an EHR system, a simple indicator of previous diagnosis of OUD that alerts triage nurses seems like a natural choice to help identify patients for OUD screening. On the other hand, since staff time is scarce, incorporation of a machine learning model would cut nearly in half the number of ED patients unnecessarily screened. If, instead, a patient had no previous record of OUD, then ED staff would have to rely heavily on patient report of OUD or a chief complaint related to substance use as their reason for coming to the ED.

Our investigation makes a critical point with regard to construction of machine learning models for infrequent outcome prediction. It is necessary to not only assess the overall predictive performance measures such as accuracy and AUC, but also conduct careful review of sensitivity, specificity and stratification of the results by key predictors to appropriately assess the model’s usefulness in practice and consequences of its use. For instance, in our study, the false-positive result has minimal consequences, but a false negative might have serious ramifications. With opioid overdoses increasing at an alarming rate in the setting of COVID-19, it becomes paramount that people with a risk of OUD are identified and offered treatment.

Our data show that among ED visits where no OUD was noted, 4.8% had a record of previous OUD. These cases may represent patients who recovered from OUD or missed diagnosis of OUD. The median time since the last OUD diagnosis among ED encounters with OUD was 59 days, while among encounters where no OUD was noted, the median time was 98 days, a significantly longer time. Nevertheless, OUD is a chronic disease, and it is unlikely that the patients with OUD recovered in this length of time. Because ED clinicians list only diagnoses for problems that were actively treated or acknowledged while in the ED, this suggests that OUD may not have been addressed during these subsequent ED encounters. This is a missed opportunity and further strengthens the argument that an automated screening approach would increase ED clinicians’ ability to help patients with OUD.

While this study portrays a solid picture of patients diagnosed with OUD in EDs, it has some important limitations. Like many studies using real-world data, the data available to us were limited and imperfect. First, the validity of using ICD codes to diagnose OUD has been found to be adequate at best. Second, the documentation of ICD diagnoses in EDs might be infrequent and inconsistent. To the extent that OUD was consistently missed in patients, our models may perform better than we indicate. Because our EHR data contain records from only one healthcare system, there is a possibility of variable misclassification due to missing medications and diagnoses from encounters outside of the studied health system. These limitations emphasise and draw attention to earlier calls for development of validated algorithms to accurately identify patients with OUD in observational data.40 Additionally, the protocols for OUD screening in an ED and resources to support them might be health system specific and not generalisable to other settings. Our study used EHRs coded as structured data. Since many medical data elements are coded in the notes as free text, future research should explore the potential of inclusion of unstructured data in model building to increase the accuracy of the predictions. These limitations emphasise why implementation of machine learning models in real-world clinical settings continues to be rare.41 42

Our findings have important implications for research and practice. First, while it is difficult to accurately predict who has OUD, it is clear that history of OUD should be considered when selecting patients for screening. Given that it is not feasible to screen every single individual who presents at the ED, ED conversations about chronic pain, source of injury, substance use and psychiatric disorders may yield important information about risk of OUD in order to connect patients to needed services.

Data availability statement

Data may be obtained from a third party and are not publicly available. The data used in this study cannot be made publicly available. These data were obtained from the University of North Carolina Data Warehouse for Health, and are subject to a data use agreement that does not allow release or sharing of these data.

Ethics statements

Patient consent for publication

Ethics approval

This study was reviewed by the Office of Human Research Ethics of the University of North Carolina at Chapel Hill, which determined that this submission did not constitute human subjects research as defined under federal regulations (45 CFR 46.102 (d or f) and 21 CFR 56.102(c)(e)(l)) and did not require IRB approval.

References

Supplementary materials

  • Supplementary Data

    This web only file has been produced by the BMJ Publishing Group from an electronic file supplied by the author(s) and has not been edited for content.

Footnotes

  • Contributors IA helped design the study, performed all analyses and wrote the draft of the manuscript. IA is the guarantor of the manuscript and accepts full responsibility for the work and conduct of the study. KT contributed to the study design and critically revised the article. RJ provided subject matter expertise in the area of addiction medicine and critically reviewed the manuscript. All authors discussed the results and contributed to the final manuscript.

  • Funding This work was supported by North Carolina Department of Health and Human Services.

  • Competing interests None declared.

  • Patient and public involvement Patients and/or the public were not involved in the design, or conduct, or reporting, or dissemination plans of this research.

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.