Intended for healthcare professionals

Research

Management of chronic pain secondary to temporomandibular disorders: a systematic review and network meta-analysis of randomised trials

BMJ 2023; 383 doi: https://doi.org/10.1136/bmj-2023-076226 (Published 15 December 2023) Cite this as: BMJ 2023;383:e076226

Linked Editorial

Chronic pain associated with temporomandibular disorders

Linked Practice

Management of chronic pain associated with temporomandibular disorders: a clinical practice guideline

  1. Liang Yao, postdoctoral researcher12,
  2. Behnam Sadeghirad, assistant professor12,
  3. Meixuan Li, doctoral candidate34,
  4. Jing Li, methodologist34,
  5. Qi Wang, doctoral candidate1,
  6. Holly N Crandon, MBiotech candidate56,
  7. Grace Martin, medical student7,
  8. Rebecca Morgan, assistant professor18,
  9. Ivan D Florez, associate professor91011,
  10. Birk Stokke Hunskaar, medical student12,
  11. Jeff Wells, otolaryngologist13,
  12. Sara Moradi, masters student114,
  13. Ying Zhu, methodologist1,
  14. Muhammad Muneeb Ahmed, medical student7,
  15. Ya Gao, doctoral candidate3,
  16. Liujiao Cao, doctoral candidate15,
  17. Kehu Yang, professor34,
  18. Jinhui Tian, professor3,
  19. Jialing Li, associate professor16,
  20. Linda Zhong, associate professor17,
  21. Rachel J Couban, librarian14,
  22. Gordon H Guyatt, distinguished professor1,
  23. Thomas Agoritsas, general internist and associate professor11819,
  24. Jason W Busse, professor1214
  1. 1Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, ON, Canada
  2. 2Department of Anesthesia, McMaster University, 1280 Main St. West, Hamilton, Ontario, Canada
  3. 3Evidence-Based Medicine Center, School of Basic Medical Sciences, Lanzhou University, Lanzhou, China
  4. 4Health Technology Assessment Centre, Evidence Based Social Science Research Center, School of Public Health, Lanzhou University, Lanzhou, China.
  5. 5Michael G. DeGroote Institute for Pain Research and Care, McMaster University, Hamilton, Ontario, Canada
  6. 6Institute for Management and Innovation, University of Toronto, Mississauga, Ontario, Canada
  7. 7Michael G. DeGroote School of Medicine, McMaster University, Hamilton, Ontario, Canada
  8. 8School of Medicine, Case Western Reserve University, Cleveland, OH, USA
  9. 9Department of Pediatrics, University of Antioquia, Medellin, Colombia.
  10. 10School of Rehabilitation Science, McMaster University, Hamilton, Canada.
  11. 11Pediatric Intensive Care Unit, Clínica Las Americas, Medellin, Colombia
  12. 12Institute of Health and Society, University of Oslo, Oslo, Norway
  13. 13Department of Otolaryngology-Head and Neck Surgery, Cleveland Clinic, Cleveland, OH, USA
  14. 14Michael G. DeGroote National Pain Centre, McMaster University, Hamilton, Ontario, Canada
  15. 15West China School of Nursing/West China Hospital, Sichuan University, Chengdu, China
  16. 16Department of Orthodontics, Nanjing Stomatological Hospital, Medical School of Nanjing University, China
  17. 17School of Biological Sciences, Nanyang Technological University, Singapore
  18. 18Division of General Internal Medicine, Department of Medicine, University Hospitals of Geneva, Geneva, Switzerland
  19. 19The MAGIC Evidence Ecosystem Foundation, Oslo, Norway
  1. Correspondence to: K Yang, Evidence-Based Medicine Center, School of Basic Medical Sciences, Lanzhou University, Lanzhou, China,730000; yangkh{at}lzu.edu.cn
  • Accepted 9 November 2023

Abstract

Objective We explored the comparative effectiveness of available therapies for chronic pain associated with temporomandibular disorders (TMD).

Design Systematic review and network meta-analysis of randomised clinical trials (RCTs).

Data sources MEDLINE, EMBASE, CINAHL, CENTRAL, and SCOPUS were searched to May 2021, and again in January 2023.

Study selection Interventional RCTs that enrolled patients presenting with chronic pain associated with TMD.

Data extraction and synthesis Pairs of reviewers independently identified eligible studies, extracted data, and assessed risk of bias. We captured all reported patient-important outcomes, including pain relief, physical functioning, emotional functioning, role functioning, social functioning, sleep quality, and adverse events. We conducted frequentist network meta-analyses to summarise the evidence and used the GRADE approach to rate the certainty of evidence and categorise interventions from most to least beneficial.

Results 233 trials proved eligible for review, of which 153—enrolling 8713 participants and exploring 59 interventions or combinations of interventions—were included in network meta-analyses. All subsequent effects refer to comparisons with placebo or sham procedures. Effects on pain for eight interventions were supported by high to moderate certainty evidence. The three therapies probably most effective for pain relief were cognitive behavioural therapy (CBT) augmented with biofeedback or relaxation therapy (risk difference (RD) for achieving the minimally important difference (MID) in pain relief of 1 cm on a 10 cm visual analogue scale: 36% (95% CI 33 to 39)), therapist-assisted jaw mobilisation (RD 36% (95% CI 31 to 40)), and manual trigger point therapy (RD 32% (29 to 34)). Five interventions were less effective, yet more effective than placebo, showing RDs ranging between 23% and 30%: CBT, supervised postural exercise, supervised jaw exercise and stretching, supervised jaw exercise and stretching with manual trigger point therapy, and usual care (such as home exercises, self stretching, reassurance).

Moderate certainty evidence showed four interventions probably improved physical functioning: supervised jaw exercise and stretching (RD for achieving the MID of 5 points on the short form-36 physical component summary score: 43% (95% CI 33 to 51)), manipulation (RD 43% (25 to 56)), acupuncture (RD 42% (33 to 50)), and supervised jaw exercise and mobilisation (RD 36% (19 to 51)). The evidence for pain relief or physical functioning among other interventions, and all evidence for adverse events, was low or very low certainty.

Conclusion When restricted to moderate or high certainty evidence, interventions that promote coping and encourage movement and activity were found to be most effective for reducing chronic TMD pain.

Registration PROSPERO (CRD42021258567)

What is already known on this topic

  • Temporomandibular disorders are the second most common cause of chronic musculoskeletal pain, after low back pain.

  • Several conservative, pharmacologic, and invasive management options are available to patients, but their comparative effectiveness is uncertain.

What this study adds

  • A BMJ Rapid Recommendation guideline panel including patients, clinical experts, and methodologists defined the scope of our review, informed outcome selection and importance, and interpretation of findings.

  • Moderate certainty evidence shows that, compared with placebo or sham procedures, cognitive behavioural therapy augmented with biofeedback or relaxation therapy, therapist-assisted jaw mobilisation, and manual trigger point therapy are probably among the most effective interventions for pain relief.

  • Moderate to high certainty evidence found that five interventions were less effective, yet more effective than placebo/sham procedures for pain relief: cognitive behavioural therapy, supervised postural exercise, supervised jaw exercise and stretching with or without manual trigger point therapy, and usual care (such as home exercises, stretching, reassurance, self massage, and education).

Introduction

Temporomandibular disorders (TMD) are a group of painful conditions affecting the muscles of mastication, temporomandibular joint, and associated structures.12 TMD are the second most common musculoskeletal chronic pain disorder, after low back pain; chronic TMD pain affects 6% to 9% of adults globally,3 with women reporting a higher prevalence than men.4 TMD have been proposed to include 12 subtypes: myalgia (local myalgia, myofascial pain, myofascial pain with referral), arthralgia, four types of disc displacement disorders, degenerative joint disease, subluxation, headache attributed to TMD, and others.5

Several conservative, invasive, and irreversible treatments are available for chronic TMD pain. Conservative therapies include jaw exercise, jaw mobilisation, cognitive behavioural therapy (CBT), self management (such as relaxation strategies), medication (such as anti-inflammatory drugs, analgesics, muscle relaxants), removable oral splints, low level laser therapy, and acupuncture. Invasive therapies include arthrocentesis, joint injection of local anaesthetic with or without steroids, and injection of purported regenerative substances (such as hyaluronic acid). Irreversible treatments include joint surgery, prosthodontics, orthodontics, and irreversible oral splints.

Several network meta-analyses (NMA) have previously summarised the efficacy and safety of interventions for chronic TMD pain but have important limitations.67891011121314 These include ranking treatments using the “surface under the cumulative ranking curves” (SUCRA) approach, which ignores the certainty of evidence and the absolute differences between treatment alternatives,67891011121314 addressing only certain categories of interventions or subtypes of TMD,67101314 suboptimal reporting of pooled estimates (such as pooling different instruments that measure a common domain as the standard mean difference,7811 only pooling trials that report the same continuous outcome measure67891011121314), focusing on statistical significance without considering whether the magnitude of effect is patient-important,67891011121314 and failure to assess the certainty of evidence.813

We conducted a systematic review and network meta-analysis comparing all available interventions for chronic TMD pain, as a group of conditions, addressing these limitations. This review is part of the BMJ Rapid Recommendations project, a collaborative effort from the MAGIC Evidence Ecosystem Foundation (https://magicevidence.org) and The BMJ.15 This systematic review informed a parallel guideline published in The BMJ and MAGICapp (box 1).16

Box 1

Linked articles in this BMJ Rapid Recommendation cluster

  • Busse JW, Casassus R, Carrasco-Labra A, et al. Management of chronic pain associated with temporomandibular disorders: a clinical practice guideline. BMJ 2023;383:e076227 doi:10.1136/bmj-2023-076227

    • Summary of results from the Rapid Recommendation process

  • Yao L, Sadeghirad B, Li M, et al. Management of chronic pain secondary to temporomandibular disorders: a systematic review and network meta-analysis of randomised trials. BMJ 2023;383:e076226 doi:10.1136/bmj-2023-076226

  • MAGICapp (https://matchit.magicevidence.org/230428dist-tmd)

    • Expanded version of the results with a multilayered recommendation, evidence summaries, and decision aids for use on all electronic devices

RETURN TO TEXT

Methods

We followed the Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) extension statement for reporting of systematic reviews incorporating NMAs17 and registered our review protocol with PROSPERO (CRD42021258567).

Guideline panel involvement

A BMJ Rapid Recommendations guideline panel provided critical oversight for our review, including: (1) defining the study question, (2) categorising interventions, (3) prioritising outcome measures, (4) proposing subgroup analyses, and (5) informing if measures of precision associated with pooled effect estimates were imprecise. The panel included eight clinical experts (two general dentists (AC-L, CP), an orofacial pain specialist (RC), an oral surgeon with expertise in orofacial pain (JD), an oral physician (JMZ), a general internist (MC), and a clinical pharmacologist (CS)), eight methodologists (four of whom were also front-line clinicians), a patient liaison expert, and three patient partners living with chronic TMD pain. All patients received training and support to optimise contributions throughout the guideline development process. The members of the guideline panel led the interpretation of the results based on what they expected the typical values and preferences of patients to be, as well as the variation between patients.

Patient and public involvement

Three patients were full members of the guideline panel and contributed to the selection and prioritisation of outcomes, values and preferences assessments, critical feedback to the protocol, and interpretation of findings for the systematic review and the associated BMJ Rapid Recommendation.

Data sources and searches

An experienced medical librarian (RJC) developed and refined database-specific search strategies (see appendix 1) and searched MEDLINE, EMBASE, Cumulative Index to Nursing and Allied Health Literature (CINAHL), Cochrane Central Register of Controlled Trials (CENTRAL), and SCOPUS from inception through May 2021. After formulation of the recommendations, we updated our search to January 2023. We reviewed reference lists of included trials and relevant reviews for additional eligible studies.

Study selection and eligibility criteria

Pairs of trained reviewers screened titles and abstracts of identified citations independently and in duplicate, using a standardised, pilot-tested form, and assessed full texts of all potentially eligible studies. Disagreements were resolved through discussion to achieve consensus. We used online systematic review software (DistillerSR, Evidence Partners, Ottawa, Canada; http://systematic-review.net) to facilitate literature screening.

We included English language, interventional trials that: (1) enrolled adults (≥18 years old) living with chronic pain associated with a TMD (≥3 months or defined by the authors as “chronic”); (2) randomised them to an active treatment or an alternative treatment, placebo, sham procedure, or usual care; and (3) included at least 10 participants in each treatment group. All interventions were reviewed and categorised by two or more clinical experts, independently and blinded to study results, and we excluded trials of interventions that our experts advised were not currently in use. We reviewed details of interventions labelled as “physiotherapy” and clustered them according to their reported components, as physiotherapy is a profession and not a single modality.

Data extraction

Pairs of reviewers independently abstracted the following data from each eligible article: study characteristics (such as bibliographic information, country of origin, funding source), participant characteristics (such as sample size, age and sex of participants, subtypes of TMD, pain severity, duration of pain), and characteristics of interventions and comparators. We classified countries where trials were conducted as high, upper-middle, lower-middle, or low income as designated by the World Bank.18 We also extracted data for all patient-important outcomes as guided by the Initiative on Methods, Measurement, and Pain Assessment in Clinical Trials (IMMPACT) recommendations,1920 including pain, physical functioning, emotional functioning, role functioning, social functioning, sleep quality, and adverse events. For trials with different follow-up lengths, we abstracted data from the longest follow-up reported.

Risk of bias assessment

Pairs of reviewers independently assessed the risk of bias among eligible studies using a modified Cochrane risk of bias instrument that included random sequence generation; allocation concealment; blinding of participants, healthcare providers, and outcome assessor/adjudicator; and incomplete outcome data (≥20% missing data was considered high risk of bias).21 We rated the risk of bias for each criterion as “low,” “probably low,” “high,” or “probably high.” We resolved disagreements between reviewers through discussion. When all the above domains were judged at low or probably low risk, we rated the overall risk of bias as “low,” otherwise we rated the overall risk of bias as “high.” For interventions in which blinding is not possible, and when blinding was the only criterion not met, we referred to prior meta-epidemiological studies which showed no systematic difference in estimated treatment effect between trials with and without blinded patients, healthcare providers, or outcome assessors,2223242526 and rated the overall risk of bias as “probably low.”

Data synthesis

We used DerSimonian-Laird random-effects models for meta-analysis of direct comparisons for all patient-important outcomes reported by more than one trial. For pain and function, when studies reported effect estimates using different measurement instruments that captured a common construct, we transformed treatment effects to a common instrument score on a domain-by-domain basis (appendix 2).27 Specifically, we converted pain intensity to a 10 cm visual analogue scale for pain, and physical functioning to the 100-point 36-item Short Form Survey (SF-36) physical component summary score. We then calculated the weighted mean difference and the associated 95% confidence interval using change scores from baseline to the end of follow-up to address interpatient variability. If authors did not report change scores, we estimated them using the baseline and end-of-study scores and the associated standard deviations and median correlation coefficient reported by low risk of bias trials. We used methods described in the Cochrane Handbook28 and by Hozo et al29 to impute means and standard deviations when the median, range, and sample size were reported, or to impute the standard deviation when the standard error or standard deviation for the differences was not reported.

We pooled dichotomous outcomes (that is, adverse events) as the odds ratio and 95% confidence interval. When at least 10 trials were available for a direct comparison, we assessed small-study effects using Harbord’s test for binary outcomes and Egger’s test for continuous outcomes.3031 Further, when only a single trial was available to inform the effectiveness of an intervention, and reported a large significant effect, we considered this evidence at high risk of bias due to small study effects.

We constructed networks for outcomes in which 10 or more trials contributed data. When networks are sparse, the contrast-based random-effects model may generate non-credible wide confidence intervals for network estimates, even when the direct and indirect estimates are coherent (that is, the confidence interval of the network estimate is wider than both the estimates of precision associated with the direct and indirect effects).32 We used a fixed-effect model for pooling in such cases. We used the “design-by-treatment” model (global test) to assess the coherence assumption for each network.33 We used the side-splitting method to evaluate local (loop-specific) incoherence in each closed loop of the network as the difference between direct and indirect evidence.3435 We performed all analyses in STATA 17.0 MP edition (StataCorp, College Station, TX, USA).

Mean differences are often misunderstood by knowledge users, including clinicians and patients.36 Therefore, for continuous outcomes, we used the network estimate of treatment effects to model the risk difference for achieving the minimally important difference (MID) to optimise interpretability.37 We used 1 cm on the 10 cm visual analogue scale as the MID for pain relief,38 and 5 points on the 100-point SF-36 physical component summary score as the MID for physical function.3940 We used the median and standard deviation of the reference treatment (placebo/sham), with the established MID for the outcome in question to model the probability of achieving ≥MID in the reference treatment. We then used the pooled mean difference to estimate the mean in the treatment group and calculate the probability of achieving ≥MID in the treatment group. Finally, we used the calculated probabilities for both groups to acquire the risk difference for achieving ≥MID.

Categorisation of interventions

We categorised interventions from most to least effective, using a minimally contextualised approach.41 The minimally contextualised framework is based on two principles: interventions should be grouped in categories, from the most to the least effective or harmful, and judgments that place interventions in such categories should simultaneously consider the estimates of effect and the certainty of evidence (box 2).

Box 2

Classification of interventions using a minimally contextualised framework41

  • Step 1. Choosing a reference intervention (“placebo/sham” was our reference).

  • Step 2. Classifying interventions into categories based on comparison with the reference: category 1, not convincingly different than placebo/sham; and category 2 or higher, more effective (or harmful for adverse events) than placebo/sham.

  • Step 3. Further classification of category 2 or higher interventions based on comparisons between pairs of interventions. If any intervention proved more effective than another category 2 intervention, that intervention was moved to a higher rated group (category 3). We implemented this same step to differentiate among interventions in category 3 (if there was an intervention in category 3 superior to at least one other, it would move to category 4) until no new groupings resulted.

  • Step 4. Separating interventions into two clusters according to certainty of evidence: high or moderate certainty of evidence, and low or very low certainty of evidence.

RETURN TO TEXT

For pain relief and functional improvement, we created groups of interventions as follows: (1) category 1, the reference intervention (placebo/sham procedures) and interventions no different from placebo, which we refer to as “among the least effective”; (2) category 2, interventions superior to placebo but inferior to a category 3 intervention; and (3) category 3, interventions that proved superior to at least one category 2 intervention. We used the same approach for adverse events but created groups of interventions as follows: (1) no more harmful than placebo; (2) less harmful than a category 3 intervention, but more harmful than placebo; and (3) more harmful than at least one category 2 intervention. We created additional categories for benefits or harms, as needed, using the same approach. For both benefits and harms, we categorised interventions as those supported by moderate or high certainty evidence, and those supported by low or very low certainty evidence relative to placebo/sham procedures.3442

Subgroup analysis

At the direction of the guideline panel, we explored four a priori subgroup hypotheses to explain variability between trials: (1) subtypes of TMD will show different treatment effects; (2) studies at higher versus lower risk of bias will show larger treatment effects; (3) trials with longer versus shorter follow-up will show smaller treatment effects; and (4) studies enrolling patients receiving disability benefits or engaged in litigation versus not will show smaller treatment effects. We only conducted subgroup analyses if there were two or more studies in each subgroup and used a test of interaction to establish whether subgroups differed significantly from one another. We assessed the credibility of statistically significant subgroup effects (P value for test of interaction <0.05) using Instrument to assess the Credibility of Effect Modification Analyse (ICEMAN) criteria.43

Certainty of evidence

We used the Grading of Recommendations Assessment, Development and Evaluation (GRADE) approach to assess the certainty of evidence for direct, indirect, and network estimates for all outcomes. With this approach, the certainty of direct evidence from randomised trials starts as high but may be rated down for risk of bias, indirectness, imprecision, inconsistency, or small study effects to moderate, low, or very low.44 Certainty ratings of indirect estimates start at the lowest GRADE rating of the direct comparisons that contributed the most weight to the dominant first-order loop in the network, with further consideration of rating down for intransitivity when present.454647

Our assessment of transitivity relied on two fundamental issues: (1) eligible trials are jointly randomisable, and (2) potential effect modifiers are equally distributed between each treatment comparison in the network. We addressed the first issue by exploring the similarity of patient populations in our networks and confirming with our clinical experts that patients across trials were eligible to receive any of interventions considered in the network. We addressed the second issue by generating graphs to explore if the distribution of effect modifiers (that is, age, sex, risk of bias) were similar across comparisons. We were unable to explore subtype of TMD as an effect modifier as most trials enrolled mixed subtypes and reported aggregate results or did not report which subtype(s) were enrolled.

For the certainty of network estimates, we started with the estimate—direct or indirect—that dominated (contribution >50%) the network estimate, or we used the higher certainty of the direct and indirect estimates if they both contributed equally to the network estimate. Discounting imprecision associated with the direct estimate, which was assessed at the network level, as was incoherence. We judged network estimates as imprecise if the associated 95% confidence interval included decision thresholds48 chosen by the guideline panel: half the minimally important difference for continuous outcomes and the null value (odds ratio=1) for adverse events. We did not rate down for imprecision if the confidence interval excluded the decision threshold unless the comparison was statistically significant and informed by fewer than 400 observations for continuous outcomes or 300 events for binary outcomes.49

When present, we addressed incoherence in the network estimate by using the higher certainty evidence (direct or indirect) instead of the network estimate, or when the certainty of evidence was the same for the direct and indirect evidence, we used the network estimate but rated down the certainty of evidence one level for incoherence. We did not rate down the certainty rating of the network estimate twice if both intransitivity and incoherence were present, as these are related issues. We followed GRADE guidance for communicating our findings50 and made our results available in the MAGICapp interactive decision support tool MATCH-IT (https://matchit.magicevidence.org/230428dist-tmd).

Results

Our search identified 9033 unique citations, of which 233 (reporting 210 unique trials) proved eligible for review (fig 1). We were unable to acquire the full texts of eight citations for review, and our attempts to contact study authors were unsuccessful (appendix 3). Twenty four trials reported only surrogate outcomes (such as passive jaw opening, pain on palpation), 23 trials compared similar interventions that our clinical experts advised should be collapsed (for example, different types of removable oral splints, two-needle arthrocentesis v single-needle arthrocentesis; none of which showed different treatment effects651) and thus became observational data, and 10 trials compared interventions that are not typically available in current practice (supplement table 1). Thus, our meta-analyses included 153 studies from 172 references, which enrolled 8713 patients with chronic TMD pain.

Fig 1
Fig 1

Study selection process in review of interventions for chronic pain associated with temporomandibular disorders

Our updated literature search in January 2023, identified 13 additional eligible trials that enrolled a total of 875 patients, four of which only reported treatment effects on surrogate outcomes (supplement table 2). We did not include the results of the nine trials that reported patient-important outcomes in our pooled estimates of treatment effects to maintain consistency with the evidence used by the guideline panel to formulate recommendations.

Study characteristics

Studies eligible for our review were parallel trials that evaluated 23 conservative interventions, 15 pharmacological interventions, seven combinations of pharmacological and conservative interventions, 13 surgical interventions with or without adjunct treatments, and placebo/sham procedures (supplement tables 3-5). The median sample size of included trials was 46 (interquartile range 35-63), and the median of the mean age among patients was 35 years (30-39). The median average pain score at baseline was 5.4 cm (4.3-6.6) on a 10 cm visual analogue scale, which equates to moderate severity pain,52 and the median average pain duration when reported was 44 months (13-65). The median follow-up was 12 weeks (5-52).

Most trials provided no details on funding (44%) or declared non-industry funding (41%), with the remainder reporting no funding (15%) (table 1). Half of trials (75/153, 49%) enrolled patients with a specific subtype of TMD (most often myofascial TMD (38/75, 51%)), whereas the remainder (51%) enrolled mixed subtypes or did not report the proportion of TMD subtypes among patients (table 1). No trial explicitly reported enrolment of veterans, or patients involved in litigation or receiving disability benefits. Almost half of all trials (47%) excluded TMD patients with comorbid mental illness, and only three trials (2%) reported enrolling such patients. Approximately a third of trials explicitly excluded TMD patients with comorbid rheumatoid arthritis (36%) or prior TMJ surgery (32%), and 20% excluded those with comorbid fibromyalgia; only a single trial reported enrolling any patients with these features (supplement table 6).

Table 1

Characteristics of included studies for network meta-analysis (n=153)

View this table:

Risk of bias

Most trials (133/153, 87%) were at risk for bias for at least one domain; 118 studies (77%) adequately generated their randomisation sequence; 78 (51%) concealed allocation; 65 (42%) blinded patients, 48 (31%) blinded healthcare providers, and 65 (42%) blinded outcome assessors. Most trials (126; 82%) reported <20% missing outcome data. We rated the overall risk as high in 96 (63%) studies, and low or probably low in 57 (37%) studies (supplement table 7).

Main findings per outcome

Pain relief

In total, 148 RCTs involving 7867 patients and evaluating 59 interventions reported effects on pain (fig 2). In the 31 direct comparisons with two or more studies available for conventional pairwise meta-analysis, there was substantial heterogeneity (I2 >70%) for 11 comparisons (supplement table 8, the expanded GRADE assessment table for pain relief can be found at https://zenodo.org/record/7607558#.Y9_P1uyZM-Q; the league table for pain relief can be found at https://zenodo.org/record/7604396#.Y93irOyZM-Q). We found no evidence of global or loop-specific incoherence (supplement table 9).

Fig 2
Fig 2

Network plot for pain relief. The figure shows lines between treatments that have direct comparisons; thicker lines indicate that more comparison studies have been conducted. This figure is based on data in supplementary table 8. ACBT = augmented cognitive behavioural therapy; ACT = acetaminophen; ACU = acupuncture; AR = arthrocentesis; ATP = antidepressant; BB = beta-blocker; BOX = botulinum toxin; BZP = benzodiazepine; CAP = capsaicin; CBT = cognitive behavioural therapy; CS = cartilage supplement; DIS = discectomy; GA = gabapentin; HA = hyaluronic acid injection; HE = home exercises; IOS = irreversible oral splint; JE = jaw exercise; JES = jaw exercise + stretching; JM = jaw mobilisation; laser = low-level laser therapy; MP = manipulation; MR = muscle relaxant; NSAID = non-steroidal anti-inflammatory drug; OD = opioid; PE = postural exercise; ROS = removeable occlusal splint; RT = relaxation therapy; SD = steroid; SJ = steroid injection; TENS = transcutaneous electrical nerve stimulation; TPI = trigger point injection; TPT = trigger point therapy; UAR = ultrasound-guided arthrocentesis; UC = usual care.

Compared with placebo, effects on pain for eight interventions were supported by moderate to high certainty evidence. The three therapies probably most effective for pain relief were cognitive behavioural therapy (CBT) augmented with biofeedback or relaxation therapy (modelled risk difference (RD) for achieving the minimally important difference in pain relief of 1 cm on a 10 cm scale 36% (95% CI 33 to 39)), therapist-assisted jaw mobilisation (RD 36% (31 to 40)), and manual trigger point therapy (RD 32% (29 to 34)) (fig 3).

Fig 3
Fig 3

Network meta-analysis results, sorted by GRADE certainty of evidence and effect estimate for the comparisons of active treatments versus placebo/sham procedures, for pain relief, physical function, and adverse events

Fig 3
Fig 3

contd Network meta-analysis results, sorted by GRADE certainty of evidence and effect estimate for the comparisons of active treatments versus placebo/sham procedures, for pain relief, physical function, and adverse events

Five interventions were less effective, yet more effective than placebo/sham procedures: cognitive behavioural therapy (RD 30% (23 to 35)), supervised postural exercise (RD 26% (14 to 34)), supervised jaw exercise with stretching (RD 26% (20 to 31)), usual care (such as home exercises, stretching, reassurance, self massage, and education) (RD 25% (22 to 28)), and supervised jaw exercise with stretching and manual trigger point therapy (RD 23% (11 to 31)). The certainty in effects for all other interventions was low or very low (fig 3).

Physical functioning

Thirty six trials with 2009 patients, evaluating 30 interventions, reported treatment effects on physical functioning; however, four interventions reported in three studies (that is, arthrocentesis, arthrocentesis with steroid injection, arthrocentesis with hyaluronic acid injection, and discectomy) were disconnected from the network (supplement table 10). As such, our network included 33 RCTs with 1910 patients that evaluated 26 interventions (fig 4). The 33 RCTs included 44 direct comparisons, of which 12 included two or more studies for conventional pairwise meta-analysis and one showed substantial heterogeneity (supplement table 11, the expanded GRADE assessment table for physical functioning can be found at https://zenodo.org/record/7607558#.Y9_P1uyZM-Q; the league table for physical functioning can be found at https://zenodo.org/record/7604396#.Y93irOyZM-Q). We found no evidence of global or loop-specific incoherence (supplement table 12).

Fig 4
Fig 4

Network plot of physical function. The figure shows lines between treatments that have direct comparisons; thicker lines indicate that more comparison studies have been conducted. This figure is based on data in supplementary table 11. ACBT = augmented cognitive behavioural therapy; ACT = acetaminophen; ACU = acupuncture; BB = beta-blocker; BOX = botulinum toxin; CBT = cognitive behavioural therapy; GA = gabapentin; HA = hyaluronic acid injection; HE = home exercises; IOS = irreversible oral splint; JE = jaw exercise; JES = jaw exercise + stretching; JM = jaw mobilisation; laser = low-level laser therapy; MP = manipulation; MR = muscle relaxant; NSAID = non-steroidal anti-inflammatory drug; PE = postural exercise; ROS = removeable occlusal splint; SJ = steroid injection; TENS = transcutaneous electrical nerve stimulation; TPT = trigger point therapy; UC = usual care.

Moderate certainty evidence showed that, compared with placebo, four interventions probably improved physical functioning: supervised jaw exercise with stretching (RD for achieving the minimally important difference of 5 points on the 100 point short form-36 physical component summary score 43% (95% CI 33 to 51)), manipulation (RD 43% (25 to 56)), acupuncture (RD 42% (33 to 50)), and supervised jaw exercise with mobilisation (RD 36% (19 to 51)). The certainty in effects for physical functioning among other interventions was low or very low (fig 3).

Adverse events

Adverse events were only reported in 31 studies involving 1987 patients with chronic TMD (fig 5, supplement table 13). Four studies evaluating arthrocentesis with or without co-interventions (arthrocentesis, arthrocentesis plus steroid, arthrocentesis plus opioid, ultrasound guided arthrocentesis, and arthrocentesis plus hyaluronic acid injection), and reporting no adverse events, were disconnected and thus not included in our network (supplement table 14).

Fig 5
Fig 5

Network plot of adverse events. The figure shows lines between treatments that have direct comparisons; thicker lines indicate that more comparison studies have been conducted. This figure is based on data in supplementary table 14. ACT = acetaminophen; ACU = acupuncture; CS = cartilage supplement; HA = hyaluronic acid injection; JE = jaw exercise; JM = jaw mobilisation; laser = low-level laser therapy; MP = manipulation; MR = muscle relaxant; NSAID = non-steroidal anti-inflammatory drug; OD = opioid; PE = postural exercise; ROS = removeable occlusal splint; SD = steroid; SJ = steroid injection; TENS = transcutaneous electrical nerve stimulation; UC = usual care.

Of the 20 direct comparisons that contributed to our network, four had two or more studies for pairwise meta-analysis, and none showed substantial heterogeneity (supplement table 15, the expanded GRADE assessment table for adverse events can be found at https://zenodo.org/record/7607558#.Y9_P1uyZM-Q; the league table for adverse events can be found at https://zenodo.org/record/7604396#.Y93irOyZM-Q). We found no evidence of global or loop-specific incoherence (supplement table 16). The certainty in effects on adverse events for all 19 interventions was low or very low (fig 3).

Additional analysis

We were unable to construct networks for mental functioning, role functioning, social functioning, or sleep quality due to the small numbers of trials reporting these outcomes. A summary of findings for these outcomes (all low certainty) is presented in supplement table 17.

We were unable to explore subgroup effects based on subtypes of TMD or receipt of disability benefits/engaged in litigation versus not because of insufficient variability among eligible trials. We performed subgroup analysis based on the duration of follow-up and risk of bias and found significant subgroup effects for two comparisons: 1) removeable occlusal splint versus placebo, which found trials at high risk of bias were associated with larger treatment effects for pain relief; and 2) removeable occlusal splint versus usual care, which showed longer follow-up was associated with smaller treatment effects for pain relief (supplement table 18). The credibility of these subgroup effects proved to be moderate (supplement table 19).

There was no evidence of intransitivity based on the distribution of effect modifiers across comparisons. We found no evidence of small study effects for the three comparisons in which there were ≥10 trials available (supplement table 20).

Discussion

This network meta-analysis of 153 RCTs that enrolled 8713 participants provided moderate certainty evidence that, compared with placebo/sham procedures, cognitive behavioural therapy (CBT) augmented with biofeedback or relaxation therapy, therapist-assisted jaw mobilisation, and manual trigger point therapy are among the most effective therapies for reducing pain in patients with chronic TMD pain. Moderate to high certainty evidence showed that five interventions were less effective, yet more effective than placebo for pain relief: CBT, supervised postural exercise, supervised jaw exercise and stretching with or without manual trigger point therapy, and usual care (such as home exercises, stretching, reassurance, self massage, and education). Moderate certainty evidence showed that, compared with placebo/sham procedures, supervised jaw exercise with stretching, manipulation, acupuncture, and supervised jaw exercise with mobilisation probably improve physical functioning. The certainty in effects, among other interventions, was low or very low.

Strengths and limitations

Strengths of our review include a comprehensive synthesis of evidence on the benefits and harms of available pharmacologic and nonpharmacologic interventions for chronic pain associated with TMD. Independent coding of interventions by clinical experts blinded to study results provides reassurances regarding the composition of our network treatment nodes. Further, we used the GRADE approach to evaluate the certainty of evidence and ranked interventions using a minimally contextualised approach that considers both the magnitude of effects and certainty of evidence. Finally, we were able to translate the findings from our NMA into absolute effects, which are more understandable for knowledge users.

Our review also has several limitations. First, despite the large overall number of trials and participants, we found limited direct evidence to inform the effectiveness of several interventions versus placebo/sham procedures, and the evidence to inform the effectiveness of most interventions proved to be of low or very low certainty, including all evidence to inform adverse events. Second, a key assumption of our review is that treatment effects would be similar across different TMD subtypes, and we were unable to explore for subgroup effects as most trials eligible for our review enrolled mixed subtypes or did not report the proportion of TMD subtypes among patients and reported aggregate results. However, we did not find any evidence of statistical variability across treatment effects in our outcome networks and in the assessments of between-study variances within the closed loops of evidence. Further, diagnostic criteria for TMD subtypes are largely subjective and patients often satisfy criteria for multiple subtypes.5354

Third, because most eligible trials for our reviews excluded patients with chronic TMD pain who also had comorbid mental illness, fibromyalgia, or rheumatoid arthritis, or those who had previously undergone TMD surgery, the generalisability of our findings to these populations is uncertain. Fourth, although litigation and wage replacement benefits may influence treatment effects,55 data in the included trials were insufficient to form conclusions regarding these issues. Fifth, based on meta-epidemiological studies22232425 that found little to no effect of blinding on treatment results among randomised trials (including a review of oral health trials26), we did not rate down trials for risk of bias for interventions in which blinding is not possible and unblinding was the only risk of bias criterion not met; however, we could not directly test this assumption. Sixth, modelling assumptions for estimating the risk difference of achieving the minimally important difference assume that the standard deviations of outcome measures are the same in both the treatment and control groups, and that change scores in both groups are normally distributed. It is possible these assumptions were not met for some interventions. Seventh, although we planned to report treatment effects on all patient-important outcomes, interventional trials for chronic TMD consistently reported effects only on pain and, to a lesser extent, physical function. Most trials did not report effects on adverse events, and the certainty of evidence for the 19 interventions reporting on adverse events was low to very low. This is particularly concerning for invasive procedures and long term treatment with non-steroidal anti-inflammatory drugs plus opioids, which may be associated with important adverse effects.

Relation to other studies

There have been nine prior network meta-analyses (NMAs) of interventions for chronic TMD pain.67891011121314 None of them, however, assessed all available interventions across all TMD subtypes (supplement table 21). As such, they included only nine to 57 RCTs, versus the 210 unique trials considered in our review. Individual NMAs assessed needling therapies,10 low level laser therapy,11 inter-articular injections,13 pharmacotherapy,9 and different types of oral splints.612 Of the three NMAs that considered a broader range of interventions, they limited patient populations to individuals with disc displacement,14 arthrogenous TMD,7 or masticatory muscle pain.8 Findings were highly variable across reviews, and each NMA identified their most promising interventions based on SUCRA rankings. This approach is often misleading, as it considers only the point estimate of effect and not the associated precision or certainty of evidence and ignores absolute differences between treatment alternatives.565758

A 2022 Cochrane review of 12 RCTs assessed treatment effects of psychological therapies for chronic TMD pain and found low certainty evidence from five trials that cognitive behavioural therapy (CBT) may reduce pain intensity at longest follow-up; however, the effect estimate was associated with substantial imprecision.59 Our NMA has improved the precision associated with the effect of CBT for chronic TMD pain by augmenting direct evidence with indirect evidence.

Conclusions

In this large NMA of randomised controlled trials, moderate certainty evidence or higher shows interventions that promote coping and encourage movement and activity were most effective for reducing chronic pain associated with TMD. Several interventions commonly in use for chronic TMD pain are supported by only low or very low certainty evidence. The accompanying BMJ Rapid Recommendation provides contextualised guidance based on this body of evidence (box 1).16

Abbreviations

  • TMD temporomandibular disorders

  • NMA network meta-analysis

  • RCT randomised controlled trials

  • CENTRAL Cochrane Central Register of Controlled Trials

  • CINAHL Cumulative Index to Nursing and Allied Health Literature

  • MID minimally important difference

  • RD risk difference

  • SD standard deviation

  • WMD weighted mean difference

  • OR odds ratio

  • PRISMA Preferred Reporting Items for Systematic reviews and Meta-Analyses

  • IMMPACT Initiative on Methods, Measurement, and Pain Assessment in Clinical Trials

  • VAS visual analogue scale

  • IQR interquartile range

  • CBT cognitive behavioural therapy

  • NSAIDs non-steroidal anti-inflammatory drugs

  • SUCRA surface under the cumulative ranking

Ethics statements

Ethical approval

Not required.

Data availability statement

Details of the characteristics of the included studies were shared in the supplementary materials. The study specific data included in the meta-analysis can be obtained from the first author at yaol12@mcmaster.ca.

Acknowledgments

We thank Naz Torabi of Unity Health Toronto for peer-review of our search strategy; Randi McCabe (McMaster University), David Mock (University of Toronto), Carolyn Palmer (Veterans Affairs Canada), Christine Goertz (Duke University), and Alonso Carrasco-Labra (University of North Carolina) for categorising interventions, blinded to treatment results; and Zijun Li, Mina Ma, Ke Guo, and Minyan Yang of Lanzhou University for reviewing extracted data. Clinical experts on the BMJ Rapid Recommendation guideline panel were Rodrigo Casassus (orofacial pain specialist), Alonso Carrasco-Labra (general dentist), David Mock (oral and maxillofacial pathologist), Matteo Coen (general internist), Carolyn Palmer (general dentist), Joanna M Zakrzewska (oral physician), Justin Durham (honorary consultant oral surgeon), and Caroline Samer (clinical pharmacologist). No financial compensation was provided to any of these individuals.

Footnotes

  • Contributors: LY, BS, and MXL contributed equally to the study and are joint first authors. LY, BS, TA, and JWB conceived and designed the study. RJC searched electronic databases for eligible trials. LY, MXL, JL, QW, HC, GM, RM, BSH, JW, SM, YZ, MMA, YG, LC, LDZ, and IDF screened the studies and abstracted data. LY carried out the statistical analysis. YL, BS, and JWB interpreted the data. LY, BS, and JWB drafted the manuscript. All authors critically revised the article for important intellectual content and gave final approval for the article. The corresponding author attests that all listed authors meet authorship criteria and that no others meeting the criteria have been omitted.

  • Funding: This study was supported by a grant from the Chronic Pain Centre of Excellence for Canadian Veterans. JWB is supported, in part, by a Canadian Institutes of Health Research Canada Research Chair in Prevention and Management of Chronic Pain. The funding organisations had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; or decision to submit the manuscript for publication.

  • Competing interests: All authors have completed the ICMJE uniform disclosure form and declare: no financial support from any industry for the submitted work; no other relationships or activities that could appear to have influenced the submitted work.

  • Transparency: All authors affirm that the manuscript is an honest, accurate, and transparent account of the study being reported; that no important aspects of the study have been omitted; and that any discrepancies from the study as planned (and, if relevant, registered) have been explained.

  • Dissemination to participants and related patient and public communities: We used MAGICapp decision aids (available at www.magicapp.org/) to facilitate conversations between healthcare providers and patients. The MAGICapp decision aids were co-created with people living with chronic pain. We also plan to use social media, the websites of our organisations and pain related associations or societies to disseminate our findings.

References