Article Text
Abstract
Introduction Although various sham acupuncture techniques have been employed to ensure blinding in randomised controlled trials (RCTs) of acupuncture, the effectiveness of blinding in these trials and its influence on trial effect size estimates remain unclear. The objectives of this study are the following: (1) to investigate the proportion and study characteristics of sham-controlled trials reporting on blinding assessment, (2) to assess the blinding effectiveness of different types of sham acupuncture, (3) to investigate the relationship between blinding effectiveness and effect sizes in acupuncture RCTs.
Methods and analysis We will search PubMed and EMBASE from inception to 1 January 2025 to identify RCTs that compared acupuncture with sham acupuncture in humans with any disease or symptom, with no restrictions on language. Paired investigators will independently determine eligibility and use pilot-tested standardised forms for data extraction. We will calculate the proportion of sham-controlled trials that assessed and reported blinding success and conduct descriptive analyses of general study characteristics, acupuncture treatment details, sham acupuncture details and blinding assessments for included trials. We will assess the effectiveness of blinding success using the James blinding index (BI) and Bang BI, and pool data from included trials using random-effects models. We will use Hedges’ g, a standardised mean difference, with its 95% CI, to calculate treatment effects. We will use Pearson’s r correlation coefficient to assess the relationship between blinding effectiveness and trial effect sizes when variable distributions meet the assumptions of normality and linearity; otherwise, we will consider employing non-parametric tests. When sufficient data are available, we will also use random-effects meta-regression to explore the relationship.
Ethics and dissemination Ethical approval is not required. The findings of this study will be disseminated through peer-reviewed publications, conference presentations and condensed summaries for clinicians, health policymakers and guideline developers regarding the design, conduct, analysis and interpretation of blinded assessment of sham acupuncture RCTs.
Study registration Open Science Framework (https://doi-org.ezproxy.u-pec.fr/10.17605/OSF.IO/B3U7K).
- Randomised Controlled Trial
- Acupuncture
- Systematic Review
This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.
Statistics from Altmetric.com
STRENGTHS AND LIMITATIONS OF THIS STUDY
This study will evaluate the effectiveness of blinding success and assess the relationship between blinding effectiveness and effect sizes in sham-controlled trials of acupuncture.
The study will use a rigorous review procedure with clearly defined criteria, comprehensive search strategies and pilot-tested data extraction forms.
The study will be based on published papers, which may have unreported blinding assessments due to journal constraints, which could affect the estimated proportion of sham-controlled trials reporting on blinding assessment.
This study will use only the pain outcomes of acupuncture for chronic pain trials to explore the relationship between blinding effectiveness and trial effect sizes.
Introduction
Randomised controlled trials (RCTs) are widely regarded as the gold standard for evaluating the efficacy of healthcare interventions. Blinding, a crucial methodological component in RCTs, involves keeping participants and/or researchers unaware of treatment allocation to minimise bias.1 It can be classified as single-blind, double-blind or triple-blind depending on who is blinded. Successful blinding can not only enhance the validity and credibility of RCTs but also assist readers in evaluating the quality of the results.2 Insufficient blinding may lead to an overestimation of patient-reported outcomes,3 with nonblinded patients exaggerating effect sizes by an average of 0.56 SDs (95% CI −0.71 to −0.41).4 The Template for Intervention Description and Replication-Placebo, a guide and checklist for reporting placebo and sham controls, recommends that trials should measure and report the success of blinding to aid in the interpretation and use of clinical study findings.5
Acupuncture is a traditional Chinese medicine practice that involves inserting thin needles into specific points on the body to treat various health conditions.6 It has gained worldwide popularity as a complementary and alternative therapy, particularly for various pain conditions, leading to numerous RCTs assessing its efficacy.7 Blinding is particularly crucial in acupuncture trials since it often involves subjective outcomes.8 However, blinding in acupuncture trials presents unique challenges due to the physical manipulation and subjective experiences involved.9 The nature of acupuncture makes it difficult to develop a placebo procedure that is physiologically inert and indistinguishable from true treatment.10 To achieve blinding, various types of sham acupuncture have been used in trials, such as inserting needles at non-acupuncture points or with minimal penetration to mimic the physical sensation of needling, which can potentially induce a therapeutic effect, as well as using non-penetrating placebo devices like Streitberger and Park sham needle, which may not fully achieve patient blinding.11
The effectiveness of the binding of sham acupuncture in acupuncture trials remains a topic of debate, with some studies suggesting that participants can still distinguish between real and sham acupuncture. For instance, an RCT on patients with neck pain found that 18% of participants in the non-penetrating acupuncture group and 19% in the shallow puncture group were convinced that they had received sham acupuncture treatment.12 An earlier survey, including published literature up to 2011, evaluated the blinding effectiveness of acupuncture trials using a blinding index (BI) and found that 61% of participants maintained ideal blinding.13 Moreover, insufficient attention has been given to assessing and reporting blinding in practice. Several studies have shown that the success of blinding was assessed or reported in less than 10% of trials,4 14 and methods for evaluating blinding were inconsistent and questionable;15 however, these results consisted mainly of trials in other fields of medicine, with only a few involving acupuncture trials. Additionally, a recent study proposed an exploratory principle and protocol for the blinding assessment without practical validation.16 Despite these efforts, it remains unclear in practice how many sham-controlled acupuncture trials report blinding assessment, what methods are used to evaluate the success of blinding and whether there are differences in blinding effectiveness of different types of sham acupuncture and whether the blinding effectiveness affects the trial effect size.
Given the importance of blinding in RCTs to ensure the validity and quality of study results, we aim to conduct a systematic survey on the blinding assessment of sham-controlled acupuncture trials. In this study, we have three main objectives: (1) estimating the proportion and study characteristics of sham-controlled trials reporting on blinding assessment, (2) assessing the blinding effectiveness of different types of sham acupuncture while exploring potential influencing factors, (3) investigating the relationship between blinding effectiveness and trial effect sizes.
Methods and analysis
We will conduct a systematic survey of randomised sham-controlled trials of acupuncture conducted in humans and published in PubMed and EMBASE. The protocol was registered in the Open Science Framework (https://doi-org.ezproxy.u-pec.fr/10.17605/OSF.IO/B3U7K).
Eligibility criteria
The inclusion criteria are:
The study is an RCT.
The participants are humans with any disease or symptom.
The study assesses the effect of acupuncture (eg, manual acupuncture, electroacupuncture).
The study uses sham acupuncture as a control.
The study is a main study report.
Acupuncture interventions are defined as the insertion of needles into the skin or muscle with or without stimulation, excluding non-inserted techniques such as laser acupuncture. Sham acupuncture refers to sham, placebo, fake or simulated treatments that differed from acupuncture in at least one aspect to skin penetration or point location. This systematic survey has no disease or condition restriction, but for investigating the relationship between blinding effectiveness and trial effect sizes, this analysis will be limited on trials involving pain-related disorders.
The exclusion criteria are:
The participants are healthy volunteers.
The study is to investigate the neurological mechanism of acupuncture.
The study is reported as an abstract, research letter, protocol and short report.
Literature search
We will search PubMed and EMBASE to identify randomised sham-controlled trials from inception to 1 January 2025, with no restrictions on language. The search for relevant studies will involve using database-specific subject headings (such as MeSH terms) and free texts associated with acupuncture, sham acupuncture and RCT. The search strategy will be developed by two experienced investigators (JL, LL) with reference to previous related studies17 18 (online supplemental appendix 1).
Supplemental material
Study procedures
Paired investigators, who have received training in research methods, will screen titles/abstracts and full texts to determine eligibility, and collect data from all included trials, independently and in duplicate. We will use electronic forms, developed with Microsoft Access, for study screening and data extraction. The forms will undergo standardisation and pilot-testing, accompanied by detailed written instructions to enhance reliability. Any disagreements will be settled through discussion, or consultation with a third researcher (JY or JL).
Data abstraction
We will develop a data extraction questionnaire, which will initially be created by three experienced investigators (JY, JL and LL) based on previous related studies.17 18 Subsequently, the data extraction form will undergo a pilot phase where data from 10 eligible RCTs will be collected. A discussion session will follow to assess the appropriateness and applicability of the listed items, resulting in necessary revisions. Based on the revised form, we will extract the following information from each included trial.
General characteristics of study
We will extract information on the first author, publication year, journal name, type of design, multi-nationality (ie, the trial was conducted in two or more countries), country of trial conducted, centre involved (single or multicentre), symptoms or diseases, sample size, number of arms, randomisation ratio, length of follow-up, registration information, availability of protocol, signed informed consent (ie, whether to inform the patients that they have the same opportunity to receive acupuncture or sham acupuncture treatment) and source of funding (private for profit, private not for profit, governmental, not funded). Additionally, we will record information regarding patients’ prior experience with acupuncture in the inclusion/exclusion criteria of each trial (ie, including patients who had never received acupuncture treatment, patients who had not recently received acupuncture, both patients with or without previous acupuncture experience, not reported).
Characteristics of acupuncture treatment
We will record the characteristics of acupuncture treatment according to the Standards for Reporting Interventions in Clinical Trials of Acupuncture (STRICTA) guideline,19 including the rationale or theory of acupuncture, type of acupuncture (manual acupuncture, electroacupuncture), number of needle insertions per subject per session (mean and range), names or location of points used, acupoint prescription (standardised point prescription, partially individualised prescription, fully individualised prescription), depth of insertion, response sought (de qi, muscle twitch response, other, not reported), needle retention time, frequency and duration of treatment, total treatment sessions and needle type (diameter, length). In addition, we will document information on participating acupuncturists, including the description of acupuncturists (qualification or professional affiliation, years of acupuncture practice, trained in acupuncture techniques), whether multiple acupuncturists were involved, whether multiple acupuncturists were randomly grouped and communication of acupuncturist-patient. For electroacupuncture, we will also record information on the current.
Characteristics of sham acupuncture
We will record characteristics of sham acupuncture, including whether the rationale or theory of sham acupuncture was described, type of acupoints (non-meridian and non-acupoints, non-disease-related acupoints, same acupoints as acupuncture group, not reported), insertion depth (non-penetrating, shallow/minimal needling, same insertion depth as acupuncture group, not reported), needle stimulation (no manipulation or without manual stimulation, same manipulation as acupuncture group, other, not reported), response sought (no de qi sensation or muscle response, same as acupuncture group, other, not reported), needle retention time (same as acupuncture group, other, not reported), frequency and duration of treatment (same as acupuncture group, other, not reported) and patients’ posture (same as acupuncture group, other, not reported). Furthermore, we will record the needle type (ie, diameter, material) used in sham acupuncture.
If the acupoints used in sham acupuncture were non-meridian and non-acupoints, we will record whether these points were predefined, the method of definition (near points, the midpoint between two acupoints, other, not reported) and whether the definition has a source or theoretical basis (based on Traditional Chinese Medicine (TCM) theory, expert/clinical experience, other, not reported). If the acupoints were non-disease-related acupoints, we will also record whether these points were predefined, the diseases associated with these points and rationale for selecting them.
If the insertion depth of sham acupuncture was non-penetrating, we will record whether a sham acupuncture device was used and the type of sham acupuncture device (eg, Foam needle, Streitberger needle, Park needle, Takakura needle). If the insertion depth of sham acupuncture was shallow/minimal needling, we will record whether the shallow/minimal needling was predefined and the insertion depth (less than 3 mm, 3–5 mm, more than 5 mm).
For electroacupuncture as an intervention, we will record the electrical stimulation information of sham acupuncture (no electrical stimulation or without current, turn off/interrupt the current after a few minutes of electrical stimulation, low current electrical stimulation, not reported) and the equipment of electroacupuncture. If low current is used, we will record whether it was predefined, and specify the range of current (less than 0.2 mA, 0.2–0.5 mA, more than 0.5 mA).
Assessment of blinding
We will record whether blind design was explicitly reported (single-blind, double-blind, not reported), the subject of binding (patients, acupuncturists, outcome assessors, not reported), whether the success of blinding was assessed, who assessed the success of blinding (patients, acupuncturists) and methods used to test for blinding (patients/acupuncturists were asked to guess their treatment allocation, other method). If the method was guessing treatment allocation, we will record when patients/acupuncturists were asked to guess (after the first treatment, at the end of trial, once during the trial, at multiple follow-up visits during trial, not reported), guessing options (acupuncture or sham acupuncture, acupuncture or sham acupuncture or uncertainty, other, not reported) and whether patients/acupuncturists rated the certainty of their guess using a continuous or Likert scale. Additionally, we will record whether before the trial volunteers were asked to evaluate the similarity between acupuncture and the sham acupuncture that will be used in the trial, and reported the similarity results.
We will also record the result and analysis of blinding assessment, including whether the results were reported, how the results were presented, statistical analysis methods for guessing treatment allocation (eg, χ2 or Fisher’s exact test, trend test, kappa value, James BI, Bang BI), conclusion of blinding (reported as successful blinding, reported as unsuccessful blinding, not reported), the reason for unsuccessful blinding (eg, lack of effect in sham acupuncture group, side effects in acupuncture group) and whether the implications of unsuccessful blinding on interpreting results were discussed. Additionally, we will extract the data on the number or proportion of patients in each arm who correctly guessed their treatment assignment (ie, contingency table).
Outcomes
To evaluate the relationship between blinding effectiveness and trial effect sizes, we will use the pain outcomes from acupuncture for chronic pain trials (eg, neck pain, shoulder pain, low back pain, osteoarthritis pain). We will record the name of pain outcome, mean and SD of changes from baseline pain intensity and number of patients included for analyses in each treatment group.
Statistical analysis
We will calculate the proportion of sham-controlled trials assessing and reporting the success of blinding. Then, we will conduct descriptive analyses of general study characteristics, details of acupuncture treatment, details of sham acupuncture and assessment of blinding for included trials. For all descriptive analyses, we will use frequencies (and percentages) for dichotomous variables, and mean (and SD) or median (and range) for continuous variables.
We will use James BI to assess the effectiveness of blinding success.20 James BI provides a comprehensive measure of blinding, with values ranging from 0 to 1, where 1 indicates complete blinding (successful blinding), 0 indicates complete unblinding (no blinding) and 0.5 indicates random guessing (ideal blinding).16 If the upper bound of the CI of James BI is below 0.5, unblinding may be claimed.16 Given the variability in study characteristics, we will use a random-effects model to pool James BI from included trials. We will also use Bang BI to assess the degree of blinding of each arm.21 Bang BI is directly interpreted as the extent of unblinding beyond chance and can capture different behaviours in different arms, with values ranging from −1 to 1, where 1 indicates all individuals guessed their assigned treatment correctly (complete unblinding), −1 indicates all individuals guessed incorrectly (opposite guessing) and 0 indicates random guessing (ideal blinding). We will calculate and pool the BI for each arm, that is, acupuncture BI and sham acupuncture BI. The statistical heterogeneity across the included studies will be examined using Cochrane’s Q test and I2 statistics. We will conduct several subgroup analyses to investigate heterogeneity when sufficient data are available, such as based on the type of disease, type of acupuncture, type of sham acupuncture, patients’ prior experience with acupuncture and subject of blinding.
We will calculate the treatment effect using Hedges’ g, a standardised mean difference (SMD), with its 95% CI.22 We will use Pearson’s r correlation coefficient to assess the relationship between blinding effectiveness and trial effect sizes when variable distributions meet the assumptions of normality and linearity; otherwise, we will consider employing non-parametric tests, such as Spearman’s rank correlation coefficient. When sufficient data are available, we will also use meta-regression to explore the relationship while accounting for several potential confounding variables, such as the type of sham acupuncture, type of sham acupoints, insertion depth, needle stimulation and patients’ prior experience with acupuncture. These variables will be incorporated as covariates within a random-effects model to accommodate inter-study heterogeneity and account for intra-study variability. All data analyses will be conducted using R software (V.4.3.2), with p≤0.05 considered statistically significant.
Patient and public involvement
None.
Ethics and dissemination
Ethical approval is not required. The findings of this study will be disseminated through peer-reviewed publications, conference presentations and condensed summaries for clinicians, health policymakers and guideline developers regarding the design, conduct, analysis and interpretation of blinded assessment of sham acupuncture RCTs.
Discussion
The protocol describes a methodological study aimed at analysing the blinding assessment in sham-controlled trials of acupuncture, including the blinding effectiveness of different types of sham acupuncture and its potential impact on trial effect sizes. By publishing the detailed study protocol, we aim to enhance transparency regarding our objectives and methods.
Our study has several strengths. First, we will use a rigorous review procedure that includes well-defined inclusion and exclusion criteria, comprehensive search strategies and the use of pilot-tested standardised forms for study selection and data extraction. Additionally, calibration exercises and pilot data extraction will be conducted to ensure consistency between reviewers before proceeding with data abstraction. Second, our study will employ broad eligibility criteria without restrictions on the journal or publication year, which enhances the generalisability of our findings. Finally, our study will address a series of unanswered questions regarding blinding assessment in sham-controlled trials of acupuncture that have not been adequately explored in prior research.
Our study also has some limitations. First, our study based on published papers may have unreported blind assessments by investigators due to journal constraints, which could affect the estimated proportion of sham-controlled trials reporting on blinding assessment. To address this concern, we plan to review the appendices and supplementary data files for any additional information provided. Second, there may be different methods to measure the success of blinding in practice. Guessing treatment allocation is a commonly used measurement method by researchers, and we will assess the blinding effectiveness of different types of sham acupuncture based on this measurement data. This may ignore data from other methods, but we will extract and describe the results of other blind measurements. Third, although we will use SMD to combine pain intensity data from different chronic pain disorders, there may still be heterogeneity between studies, such as variations in patient baseline characteristics, which may have compromised our findings. Finally, our search for eligible studies is limited to PubMed and EMBASE databases rather than including other search engines like China National Knowledge Infrastructure (CNKI) which may potentially result in missing some trials. However, studies obtained from PubMed and EMBASE are characterised by sufficient sample size and representativeness since these databases are considered comprehensive and reliable sources.
Blinding is a critical component in RCTs to minimise bias and ensure the validity of study results. However, blinding in acupuncture trials presents unique challenges due to the nature of the intervention. Regrettably, there has been insufficient emphasis on evaluating and reporting blinding effectiveness in practice.23 Although an earlier study examined the blinding effectiveness in sham acupuncture RCTs,13 it remains unclear regarding the status of blinding in sham-controlled acupuncture trials in practice. This study will investigate the extent of blinding evaluation and reporting in current sham-controlled acupuncture trials and analyse their study characteristics, so as to identify the strengths and limitations of different methods for assessing the success of blinding, highlight more appropriate practices and provide recommendations for enhancing blinding evaluation in future acupuncture trials.
Furthermore, diverse sham acupuncture techniques were employed in the trial to ensure blinding, such as needle insertion at non-acupoints or with minimal penetration, and the use of non-penetrating placebo devices.11 However, a standardised or universally accepted sham procedure for acupuncture trials remains elusive.24 While some studies have investigated the effectiveness of different types of sham acupuncture in terms of blinding,25 26 most have focused on non-penetrating placebo devices, leaving the blinding effectiveness of other forms such as shallow needling at non-acupoints unclear. This study will comprehensively summarise various types of sham acupuncture, provide detailed descriptions of their characteristics, evaluate the impact of different sham acupuncture on blinding effectiveness and explore the relationship between blinding effectiveness and trial effect sizes. The findings of this study will contribute to the comprehension of which sham procedure would be most appropriate for future acupuncture research.
By publishing the detailed study protocol, we aim to enhance transparency regarding our research objectives and methods. The findings of this study may have significant implications for the design, implementation, analysis and interpretation of blinding in acupuncture clinical trials, enhancing the methodological rigour and quality of acupuncture research.
Ethics statements
Patient consent for publication
References
Footnotes
JY and JL are joint first authors.
JY and JL contributed equally.
Contributors JY, JL, YF, LL and XS conceptualised the study. All authors contributed to the design of this protocol and approved the manuscript. YF is the guarantor for this study.
Funding This work is funded by the National Natural Science Fund for Distinguished Young Scholars (grant no. 82225049), National Natural Science Foundation of China (grant no. 82305045), China Postdoctoral Science Foundation (grant no. 2023M732494), Post-Doctor Research Project, West China Hospital, Sichuan University (grant no. 2023HXBH060), Jiangxi University of Chinese Medicine Science and Technology Innovation Team Development Program (No. CXTD-22012).
Competing interests None declared.
Patient and public involvement Patients and/or the public were not involved in the design, conduct, reporting or dissemination plans of this research.
Provenance and peer review Not commissioned; externally peer reviewed.
Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.