External Validation of the Assessment of Different NEoplasias in the adneXa Model Performance in Evaluating the Risk of Ovarian Carcinoma Before Surgery in China: A Tertiary Center Study
We would like to extend our sincere thanks to all patients included in this study.
This work was funded by the National Natural Science Foundation of China (31900889).
The authors declare that they have no competing interests.
Abstract
Objectives
The Assessment of Different NEoplasias in the adneXa (ADNEX) model was developed by the International Ovarian Tumor Analysis group to assess the risk of an ovarian mass being malignant. This study aimed to externally validate the ADNEX model performance in a tertiary center in China.
Methods
This retrospective, single-center university hospital study assessed the model diagnostic accuracy. All patients were examined by transvaginal ultrasonography, and serum CA125 levels were measured. Moreover, clinicopathological information was collected. The diagnostic performance of the ADNEX model was calculated with and without CA125 as a predictor.
Results
We retrieved data of 335 patients, of which 53 were excluded based on the exclusion criteria. Of the included 282 patients, 178 (63.1%) had benign tumors, and 104 (36.9%) had malignant tumors. When CA125 was factored in, the area under the receiver operating characteristic curve (AUC) for the distinction between benign and malignant tumors was 0.93 (95% confidence interval [CI], 0.90–0.96), whereas it was 0.91 (95% CI, 0.88–0.95) without CA125. The concordance between the predicted risk of malignancy and the proportion of observed malignancies was well demonstrated by the calibration plots.
Conclusions
The proper performance of the ADNEX model was verified externally in a tertiary center in China, showing a good distinction between tumour subtypes. Our findings suggest the ADNEX model is a valuable tool in clinical practice and may help in managing patients with adnexal masses.
Abbreviations
-
- ADNEX
-
- Assessment of Different NEoplasia in the adneXa
-
- AUC
-
- area under the curve
-
- BOTs
-
- borderline ovarian tumors
-
- CI
-
- confidence interval
-
- IOTA
-
- International Ovarian Tumor Analysis
-
- OC
-
- ovarian carcinoma
-
- ROC
-
- receiver operating characteristic curve
Ovarian cancer (OC) is the fourth most common cancer diagnosed in women worldwide.1 Globally, 313,959 new OC cases and 207,252 deaths were reported in 2020.2 The incidence rate in China increases with age after the age of 40 years and reaches its peak at the age of 55 to 59 years.3 Various studies indicated that most women with ovarian carcinoma were diagnosed with advanced disease, associated with a poor prognosis and a low 5-year survival rate of <50%. In comparison, the 5-year survival rate of patients with an early diagnosis of a localized disease could reach 90%.4, 5 A key factor in optimizing survival is the combination of early diagnosis and centralized management.6 Previous screening trials for OC were unsuccessful in achieving an early diagnosis.7 Several studies suggested that a first-line test to screen for OC had a limited role.8 However, the risk of OC algorithm, using a series of serum CA125 measurements, achieved higher detection rates with stage-dependent changes in detected cancers than the UK Collaborative Trial of Ovarian Cancer Screening, which used a CA125 cut-off value.9, 10 Moreover, recent reports have shown that a multi-marker longitudinal model for the early detection of OC was significantly better than CA125 alone.11
A crucial aspect of managing clinical situations is to make accurate diagnoses. It is vital that patients are referred to a specialist hospital for immediate treatment when they present with an abnormal ovarian mass and need specialist oncology services for OC.
In 2014, the International Ovarian Tumor Analysis (IOTA) team generated a new risk prediction model named the Assessment of Different NEoplasia in the adneXa (ADNEX).12 It comprises six ultrasonographic and three clinical indicators. The most significant advantage of the ADNEX model is it being the first multi-category model for adnexal masses. It can assess the overall risk of OC and evaluate each subtype. The experienced (level III) ultrasonographers who collected data for the model are equivalent to the level of a consultant with a special interest in gynecological ultrasonography in the UK.12 Additionally, the models and rules developed and validated by the IOTA group have been validated by less experienced (level II) ultrasonographers.13, 14 The correct classification of malignant tumour subtypes is crucial, as treatment differs between subtypes. It is essential to preserve fertility during surgery, especially in young women with borderline ovarian tumors (BOTs), whereas conventional lymphadenectomy is not recommended.15 Furthermore, treatment of metastatic OC should be based on the primary tumour.16
This study aimed to externally validate the ADNEX model performance in a tertiary center in China. The better diagnostic performance of the models allows clinicians to select the best treatment options for their patients, thus contributing to better patient prognosis.
Materials and Methods
Design and Study Setting
This retrospective study was conducted at the Department of Medical Ultrasound, the First Affiliated Hospital of Shandong First Medical University, and Shandong Provincial Qianfoshan Hospital, Jinan, Shandong Province, China, a tertiary care hospital. We reviewed the data of 335 consecutive patients diagnosed with adnexal lumps by ultrasonography between June 1, 2019 and June 30, 2020. The ultrasonographer was blinded to the final histological results. We followed the transparent reporting of a multivariable prediction model for Individual Prognosis or Diagnosis model development and validation guidelines.17 We used a standardized transvaginal ultrasonographic method previously published by the IOTA group.18, 19 Transabdominal ultrasonography was performed when the mass was too large or could not be fully assessed transvaginally.19
Patients
The inclusion criteria were as follows: 1) diagnosed by transvaginal ultrasonography with an adnexal mass that was not a physiological cyst (when bilateral or multiple adnexal masses were present, we included those with the most complicated sonographic signatures,18, 19 and if the ultrasonographic patterns of the two masses were similar, the greater mass or the one most easily detected by ultrasonography was included14, 19); 2) underwent surgery within 60 days of the ultrasonographic examination, and 3) had no known experience of OC and did not receive chemotherapy.
Participants were excluded from the study for any of the following reasons: 1) pregnancy at any point during the consultation; 2) refused transvaginal ultrasonography; 3) failed to undergo surgery within 60 days of the ultrasonographic examination, and 4) cytological rather than histological results were available. The final analysis included 282 patients.
The Medical Ethics Committee of the First Affiliated Hospital of Shandong First Medical University (Shandong Provincial Qian Foshan Hospital) approved this study. The need for informed consent was waived due to the retrospective nature of this study.
Data Collection
All examinations were conducted by two sonographers using the terminology and criteria proposed by the IOTA group.18 The first sonographer (YXZ; Level I) was a resident in the Department of Medical Ultrasound, and the second (LF; Level III) had >15 years of experience in gynecological ultrasonography, was a gynecological ultrasonography teacher, and was considered an expert in the field. The sonographers fulfilled, respectively, training and experience Levels I and III following the criteria recommended by the European Federation of Societies for Ultrasound in Medicine and Biology (EFSUMB).20 We estimated the risk of malignancy with and without CA125 using the ADNEX model, using the results for statistical analysis.12, 18 Voluson E8 and E10 sonographic scanners (GE Healthcare, Zipf, Austria) were used with 7.0 to 8.0 MHz transvaginal and 3.5 to 4.5 MHz transabdominal probes.
The serum CA125 level was measured 1 to 5 days before the operation with chemiluminescence microparticle immunoassays using an automatic analyzer (Architect i2000SN; Abbott AxSYM; Chicago, IL).
ADNEX Model
The ADNEX model is freely accessible online at https://www.iotagroup.org/adnexmodel/ and can be downloaded for use in portable applications. Three clinical and 6 ultrasonographic predictors were included in the ADNEX model: age (years), serum CA125 level (U/mL), type of center (an oncology center or other hospital type), maximal diameter of the lesion (mm), the proportion of solid tissue, over 10 cyst locules (yes or no), number of papillary projections (0, 1, 2, 3, and >3), acoustic shadows (yes or no), and ascites (yes or no; Figure 1).12 The proportion of solid tissue was defined as the ratio of the largest diameter of the maximum solid element to the largest diameter of the lesion. The oncology center was described as a “tertiary referral centre with a dedicated unit for gynecological oncology.”12 This model could also be used without the serum CA125 level. We calculated the precision performances of the ADNEX model with the serum CA125 level included or excluded.

With all predictive factors entered, the model outcome indicated the probability of a malignant tumour, presented graphically and numerically. Further, the risk of a borderline neoplasm, stages I and II to IV ovarian invasive neoplasm, or metastatic cancer was investigated. We followed the model as described in the initially published version,12 with no alterations to the equations or coefficients. We also obtained risk estimates using the online calculator on the IOTA website mentioned above.
Reference Standard
The histopathological diagnosis of the mass was the reference standard for all patients in this study. Histopathology testing of the excised tissue was performed by the pathology department after the operation. The pathological tumour types followed the World Health Organization categorization.21 The stages of malignancy were based on the International Federation of Gynecology and Obstetrics criteria.22 Histological classification was performed without knowledge of the sonographic information or ADNEX results. The diagnoses were categorized into five groups: benign neoplasm, borderline tumor, stage I OC, stage II to IV OC, and secondary metastatic carcinoma.
Statistical Analysis
Statistical analysis was performed using IBM SPSS Statistics for Macintosh, Version 26.0 (IBM Corp., Armonk, NY) and MedCalc for Windows, Version 20.0.9 (MedCalc Software, Ostend, Belgium). Borderline tumors were considered malignant for statistical analysis.
We used the chi-squared test and Fisher's exact test to compare categorical data such as the patients' clinical characteristics, several ultrasonographic features of the tumor, and the serum CA125 values. The Student's t-test compared means of continuous variables, and the Mann–Whitney U test was a nonparametric alternative for the t-test. Quantitative variables were presented as the mean and standard deviation if normally distributed and median and interquartile range if its distribution was skewed. Categorical variables were presented as frequencies.
The percentage of missing values for CA125 was very low (2%). Additionally, there were no missing values for other variables. Therefore, we used a single random interpolation based on logistic regression to deal with these missing values, as described in detail in the online supplemental Appendix 1. External validation of the ADNEX model was performed by assessing its discrimination performance with and without CA125 using the receiver operating characteristic curve. The area under the curve (AUC) was determined with its 95% confidence intervals (CIs) based on a thousand bootstraps, where the same patients were selected for each bootstrap sample in the interpolated dataset.23 Differences were considered statistically significant at P < .05. A comparison between the AUC values in the forecasting approaches was conducted using the methodology described by DeLong et al.24
The conditional risk approach was used to calculate the AUC values for each pair of tumor categories,25 10 pairs in total. Moreover, sensitivities and specificities were calculated for thresholds of 1, 3, 5, 10, 15, 20, and 30% of the definite malignancy risk. Several additional diagnostic performance measures were calculated with various cut-off points, including sensitivity and specificity, diagnostic odds ratios, positive and negative likelihood ratios, and predictive values. We assessed the calibration of the predictive probability using calibration plots that showed the relationship between the observed and predicted probabilities of malignant tumors.
Results
The level I and III examiners initially retrieved 335 consecutive patients with ultrasonographic adnexal masses managed by surgery. Based on the inclusion and exclusion criteria, 53 patients were excluded for the following reasons: 20 had no histology result (14 with only cytology, 6 without cytology and histology result); 10 underwent surgery >60 days after the ultrasonographic examination; 5 refused to undergo a transvaginal scan (only transabdominal scans were available), 2 were pregnant, and 16 did not undergo surgery because of a decline in the physical condition they could not sustain surgery (n = 2), refusal of surgery for personal reasons (n = 8), or postponement of surgery due to neoadjuvant chemotherapy (n = 10). Therefore, 282 patients were finally included in the analysis (Figure 2). Among them, approximately 100 patients were assessed by a level I operator, and the rest were assessed by a level III operator. Among them, serum CA125 level was missing in 6 (2%). Of the 282 patients, 178 (63.1%) had benign ovarian tumors, and 104 (36.9%) had malignant tumors. The most common benign diagnoses were teratoma (19.5%, 55/282) and endometriomas (11.3%, 32/282). The malignant lesions included 24 (8.5%) BOT, 27 (9.6%) stage I cancer, 46 (16.3%) stages II to IV cancer, and 7 (2.5%) secondary metastatic cancer (Table 1). Age was significantly higher in patients with malignant tumors than in those with benign tumors (P < .001). The median age was 45 years, with 181 (64.2%) premenopausal and 101 (35.8%) postmenopausal women. The serum CA125 level in postmenopausal women was higher than in premenopausal women (Z = −2.814, P = .005), with the ultrasonographic findings showing the former to have a larger malignant tumor diameter, broader solid tissue component, more papillary projections, and ascites (P < .05). Patients with a malignant tumor had almost no acoustic shadow. Table 2 shows the clinical characteristics and ultrasonographic features of the patients.

Histological Type of Masses | N (%) |
---|---|
Benign | 178 (63.1) |
Endometrioma | 32 (11.3) |
Serous cystadenoma | 15 (5.3) |
Mucinous cystadenoma | 26 (9.2) |
Serous—mucinous cystadenoma | 3 (1.1) |
Teratoma | 55 (19.5) |
Hydrosalpinx | 4 (1.4) |
Fibrothecoma | 2 (0.7) |
Mesosalpinx cyst | 11 (3.9) |
Paraovarian cyst | 2 (0.7) |
Cystadenofibroma | 2 (0.7) |
Fibroma | 9 (3.2) |
Adenofibroma | 5 (1.8) |
Brenner tumor | 1 (0.4) |
Sertoli-Leydig cell tumor | 1 (0.4) |
Other ovarian benign lesion | 10 (3.5) |
Borderline ovarian tumor | 24 (8.5) |
Serous | 9 (3.2) |
Mucinous | 8 (2.8) |
Serous—mucinous | 7 (2.5) |
Primary ovarian malignant | 73 (25.9) |
Serous adenocarcinoma | 44 (15.5) |
Clear cell carcinoma | 6 (2.1) |
Endometrioid adenocarcinoma | 7 (2.5) |
Mucinous adenocarcinoma | 3 (1.1) |
Sertoli-Leydig cell tumor | 1 (0.4) |
Carcinosarcoma | 2 (0.7) |
Granulosa cell tumor | 4 (1.4) |
Seromucinous adenocarcinoma | 2 (0.7) |
Diffuse large B cell lymphoma of ovary | 1 (0.4) |
Germinoma | 2 (0.7) |
Strumal carcinoid of ovary | 1 (0.4) |
Metastasis | 7 (2.5) |
Gastric cancer | 6 (2.1) |
Breast cancer | 1 (0.4) |
Characteristic | Group | Malignant (n = 104) | F/Z/χ2 Value | P Value | ||||
---|---|---|---|---|---|---|---|---|
Benign | Borderline | Stage I OC | Stage II–IV OC | Metastatic | ||||
(n = 178) | (n = 24) | (n = 27) | (n = 46) | (n = 7) | ||||
Age, y | ||||||||
Mean ± standard deviation | 40.17 ± 14.07 | 47.96 ± 17.89 | 48.85 ± 13.58 | 56.91 ± 8.37 | 61.71 ± 12.07 | 17.900 | <.001 | |
Menopausal status | Pre | 134 (75.3%) | 16 (66.7%) | 15 (55.6%) | 13 (28.3%) | 3 (42.9%) | 36.751 | <.001 |
Post | 44 (24.7%) | 8 (33.3%) | 12 (44.4%) | 33 (71.7%) | 4 (57.1%) | |||
CA 125 (U/mL) | 20.19 (13.25, 36.28) | 68.45 (14.95, 145.98) | 39.80 (20.30, 101.80) | 538.45 (189.95, 1275.48) | 148.00 (23.00, 171.10) | 101.578 | <.001 | |
Maximum diameter of lesion (mm) | 66.50 (44.00, 98.25) | 101.50 (54.50, 186.75) | 93.00 (60.00, 122.00) | 83.00 (52.50, 112.25) | 95.00 (41.00, 153.00) | 14.659 | .005 | |
Prescence of maximum solid part (mm) | 5.00 (5.00, 20.25) | 23.00 (11.25, 40.00) | 25.00 (20.00, 45.00) | 40.00 (20.00, 70.00) | 55.00 (35.00, 130.00) | 85.245 | <.001 | |
Papillary projections present (mm) | 0 | 132 (74.2%) | 13 (54.2%) | 13 (48.1%) | 17 (37.0%) | 4 (57.1%) | 47.285 | <.001 |
1 | 17 (9.6%) | 0 (0.0%) | 2 (7.4%) | 2 (4.3%) | 1 (14.3%) | |||
2 | 10 (5.6%) | 2 (8.3%) | 3 (11.1%) | 6 (13.0%) | 1 (14.3%) | |||
3 | 7 (3.9%) | 2 (8.3%) | 2 (7.4%) | 4 (8.7%) | 0 (0.0%) | |||
>3 | 12 (6.7%) | 7 (29.2%) | 7 (25.9%) | 17 (37.0%) | 1 (14.3%) | |||
Cyst locules | >10 | 3 (1.7%) | 2 (8.3%) | 1 (3.7%) | 0 (0.0%) | 0 (0.0%) | 5.478 | .176 |
<10 | 175 (98.3%) | 22 (91.7%) | 26 (96.3%) | 46 (100.0%) | 7 (100.0%) | |||
Acoustic shadows | Yes | 96 (53.9%) | 6 (25.0%) | 2 (7.4%) | 0 (0.0%) | 0 (0.0%) | 76.584 | <.001 |
No | 82 (46.1%) | 18 (75.0%) | 25 (92.6%) | 46 (100.0%) | 7 (100.0%) | |||
Ascites | Yes | 3 (1.7%) | 2 (8.3%) | 2 (7.4%) | 25 (54.3%) | 3 (42.9%) | 79.310 | <.001 |
No | 175 (98.3%) | 22 (91.7%) | 25 (92.6%) | 21 (45.7%) | 4 (57.1%) |
- Data are given as median (P25, P75) OR n (%).
The concordance between the total predicted risk of malignancy by the ADNEX model with and without CA125 and the observed proportion of malignant tumors is shown in the calibration plots (Figure 3).

The AUC for differentiating between benign and malignant masses by the ADNEX model with CA125 was 0.93 (95% CI, 0.90–0.96) and 0.91 (95% CI, 0.88–0.95) when CA125 was excluded (Figure 4), demonstrating a slightly better performance for the model that included the CA125 (AUC difference: 0.02, 95% CI: 0.002–0.04). Table 3 presents the model performance with risk cut-off points set from 1 to 30%, with and without CA125. The sensitivities of ADNEX models that included CA125 were 99, 95, and 88% at risk cut-off values of 1, 10, and 30%, respectively. The respective specificities were 24, 58, and 82%. Postmenopausal women had a larger AUC than premenopausal women: 0.94 (95% CI, 0.87–0.98) versus 0.91 (95% CI, 0.86–0.95) with CA125 and 0.90 (95% CI, 0.83–0.95) versus 0.88 (95% CI, 0.83–0.93) without.

Cut off | Sensitivity | Specificity | PPV | NPV | LR+ | LR− | DOR |
---|---|---|---|---|---|---|---|
(95% CI) | (95% CI) | (95% CI) | (95% CI) | (95% CI) | (95% CI) | ||
(%) | (%) | (%) | (%) | (%) | (%) | ||
With CA125 | |||||||
1% | 99.0 (94.8–100) | 23.6 (17.6–30.5) | 43.11 (41.1–45.2) | 97.58 (85.4–99.7) | 1.3 (1.2–1.4) | 0.04 (0.01–0.3) | 32.50 |
3% | 99.0 (94.8–100) | 38.8 (31.6–46.3) | 48.61 (45.6–51.5) | 98.52 (90.7–99.8) | 1.62 (1.4–1.8) | 0.03 (0–0.2) | 54.00 |
5% | 98.1 (93.2–99.8) | 46.6 (39.1–54.2) | 51.79 (48.3–55.3) | 97.67 (91.2–99.4) | 1.84 (1.6–2.1) | 0.04 (0.01–0.2) | 46.00 |
10% | 95.2 (89.1–98.4) | 57.9 (50.3–65.2) | 56.94 (52.5–61.2) | 95.38 (89.7–98.0) | 2.26 (1.9–2.7) | 0.08 (0.04–0.2) | 28.25 |
15% | 95.2 (89.1–98.4) | 68.0 (60.6–74.8) | 63.5 (58.3–68.4) | 96.04 (91.1–98.3) | 2.97 (2.4–3.7) | 0.07 (0.03–0.2) | 42.43 |
20% | 92.3 (85.4–96.6) | 71.9 (64.7–78.4) | 65.76 (60.1–71.0) | 94.11 (89.1–96.9) | 3.29 (2.6–4.2) | 0.11 (0.05–0.2) | 29.91 |
30% | 87.5 (79.6–93.2) | 82.0 (75.6–87.4) | 74.0 (67.3–79.7) | 91.8 (87.1–94.9) | 4.87 (3.5–6.7) | 0.15 (0.09–0.3) | 32.47 |
Without CA125 | |||||||
1% | 99.0 (94.8–100) | 19.1 (13.6–25.7) | 41.7 (39.9–43.5) | 97.1 (82.5–99.6) | 1.22 (1.1–1.3) | 0.05 (0.01–0.4) | 24.40 |
3% | 99.0 (94.8–100) | 32.6 (25.8–40.0) | 46.2 (43.6–48.8) | 98.3 (89.1–99.8) | 1.47 (1.3–1.6) | 0.03 (0–0.2) | 49.00 |
5% | 98.1 (93.2–99.8) | 40.5 (33.2–48.0) | 49.0 (45.9–52.1) | 97.3 (90.0–99.3) | 1.65 (1.5–1.9) | 0.05 (0.01–0.2) | 33.00 |
10% | 95.2 (89.1–98.4) | 54.5 (46.9–62.0) | 55.0 (50.9–59.1) | 95.1 (89.1–97.9) | 2.09 (1.8–2.5) | 0.09 (0.04–0.2) | 23.22 |
15% | 95.2 (85.4–96.6) | 61.8 (54.2–69.0) | 58.5 (53.7–63.2) | 93.2 (87.5–96.4) | 2.42 (2.0–2.9) | 0.12 (0.06–0.2) | 20.17 |
20% | 90.4 (83.0–95.3) | 70.0 (60.0–74.8) | 62.3 (56.9–67.3) | 92.4 (86.9–95.7) | 2.82 (2.3–3.5) | 0.14 (0.08–0.3) | 20.14 |
30% | 87.5 (79.6–93.2) | 77.0 (70.1–82.9) | 68.9 (62.7–74.6) | 91.3 (86.3–94.6) | 3.8 (2.9–5.0) | 0.16 (0.10–0.3) | 23.75 |
- DOR indicates diagnostic odds ratio; LR+, positive likelihood ration; LR−, negative likelihood ration; NPV, negative predictive value; PPV, positive predictive value.
The ADNEX model displayed good discrimination between the tumor subtypes when categorized as benign, borderline, stage I invasive, stages II to IV invasive, and secondary metastatic tumors (Table 4). For instance, near-perfect differentiation between benign and stage II to IV tumors was achieved in models that included CA125 (AUC, 0.99; 95% CI, 0.98–1.00). In contrast, performance was worse in differentiating borderline from phase I ovarian carcinoma (AUC, 0.53; 95% CI, 0.37–0.69). Compared to other groups, the ADNEX model without CA125 mainly had a lower AUC for stage II to IV tumors, particularly when compared to secondary metastatic cancers. The AUC for the model with CA125 was 0.82 (95% CI, 0.71–0.93), and that without CA125 was 0.51 (95% CI, 0.31–0.72).
Discrimination Measure | ADNEX Model With CA125 (95%CI) | ADNEX Model Without CA125 (95%CI) |
---|---|---|
AUC benign vs malignant | 0.93 (0.89–0.96) | 0.91 (0.87–0.94) |
AUC benign vs borderline | 0.85 (0.768–0.939) | 0.81 (0.72–0.91) |
AUC benign vs stage I OC | 0.89 (0.83–0.95) | 0.86 (0.80–0.93) |
AUC benign vs stage II–IV OC | 0.99 (0.98–1.00) | 0.98 (0.96–0.99) |
AUC benign vs metastasis | 0.99 (0.97–1.00) | 0.99 (0.97–1.00) |
AUC borderline vs stage I OC | 0.53 (0.37–0.69) | 0.53 (0.37–0.70) |
AUC borderline vs stage II–IV OC | 0.89 (0.81–0.96) | 0.86 (0.77–0.95) |
AUC borderline vs metastasis | 0.75 (0.57–0.93) | 0.89 (0.75–1.00) |
AUC stage I OC vs stage II–IV OC | 0.90 (0.83–0.97) | 0.84 (0.84–0.93) |
AUC stage I OC vs metastasis | 0.78 (0.60–0.96) | 0.86 (0.74–0.99) |
AUC stage II–IV OC vs metastasis | 0.82 (0.71–0.93) | 0.51 (0.31–0.72) |
- AUC indicates area under the receiver operating characteristic curve; CI, confidence interval; OC, ovarian cancer.
Discussion
Our study suggests that level I and III sonographers may help differentiate benign and malignant masses at presentation, achieving levels very similar to those obtained by the experienced sonographers in the original validation study by the IOTA group.12 The model sensitivity and specificity in our external validation study for defining malignancy with a 10% cut-off were 95.2% (95% CI, 89.1–98.4) and 57.9% (95% CI, 50.3–65.2), respectively, compared to 96.5 and 71.3% in the original study.12 It can be seen that the sensitivity was comparable to that of the original study; however, the specificity was lower than that of the primary research. When the cut-off value was 30%, the sensitivity and specificity were 87.5 and 82.0%, respectively. In contrast, a lower false-positive rate, and a much higher cut-off value for malignancy (30%) may be more appropriate at a tertiary center.26 Similarly, for some centers, a higher cut-off value (eg, 30%) was preferred, and they considered that having a high specificity may be more critical. This limits the number of false-positive results, where patients with benign lesions are referred to oncologists. Contrastingly, various hospitals tend to prefer lower cut-off values (eg, 5 or 10%), and having high sensitivity is probably the most crucial, which restricts the number of patients with false negatives. Thus, the optimal cut-off value might vary depending on the center type as different countries have different medical care systems, protocols for referrals, and clinical features of the patients.
It was impossible for us to determine the best cut-off value for conservative treatment as only patients treated with surgery were included in this study. In our study, based on the ultrasound diagnosis and the patient's symptoms, the sonographers recommend surgery or conservative treatment. However, ultimately, it is the treating clinician, together with the patient, who decides on the treatment strategy. Therefore, the recommended management and the actual may differ. Recent studies have been shown to evaluate all patients with adnexal masses, regardless of whether they were treated surgically or conservatively. The results suggest that the ADNEX model with or without CA125 is the best model for differentiating between benign and malignant adnexal masses.27 In our study, a total of 32 patients received advice on conservative treatment, and we will continue to follow up and pay attention to the changes in the condition of these patients. Therefore, in the future, we will carry out the long-term follow-up of conservatively treated patients to validate the model in a larger patient population to better distinguish between benign and malignant tumors. Moreover, we admit that this study was challenging, because there may be no information for patients who have been followed up, and various factors such as surgery during the follow-up period will be excluded.
This is an external ADNEX model validation study for OC in a tertiary center in China. Furthermore, the validation was performed by levels I and III ultrasonographers, whereas previous reports collected ultrasonographic scan data from experienced level II or III examiners.12, 13 Several limitations of this study could be summarised as follows. First, this was a single-center study. Such a design might lead to possible sampling variations between centres and limit the applicability of the outcomes for other fields. Second, this study included several patients with missing CA125 values. We used a single random interpolation to deal with the missing values, recognizing that these were not necessarily the true value. However, the 24 participating centers in the original ADNEX model study had malignancy prevalence between 0 and 66%,12 indicating that the results are likely to be scalable. Furthermore, models are expected to perform differently in different centers and countries. Nevertheless, it should be noted that this study had some strengths. The reference diagnoses were robustly selected, including only cases with histological features. However, this may also be viewed as potentially flawed as the option to treat the mass conservatively was not included in the study. The serum CA125 was measured in all patients using the same assay kit, thus avoiding potential bias. Our study relied on a centralized histopathology examination to avoid possible bias, especially in distinguishing borderline from benign or stage I tumors, which is an ongoing challenge for pathologists.
The levels of ultrasonography experience in this study (I and III) were categorization based on recommendations made by the EFSUMB.20 These findings were supported by the Royal College of Radiologists.28 It is worth noting that it is difficult to define precisely the levels, which might overlap at the boundaries. Therefore, we acknowledge that this approach had some limitations.
An individualized diagnosis of ovarian masses is possible with the ADNEX model by describing the type of malignancy in detail. The proper differentiation between benign and malignant adnexal masses is one of the most critical steps in choosing the appropriate treatment. Presurgical detailed diagnosis with the ADNEX model could assist in selecting the most appropriate surgical approach (laparoscopic or cesarean) or in focusing directly on the original site of the malignancy if metastases occur. The ADNEX model could be used to calculate relative risk ratios for comparison with the background risk of individual patients.29
Conclusions
To conclude, this study showed that the ADNEX model was the highly accurate performing model in discriminating, calibrating, and clinical applicability. External validation is a key step in the development of any diagnostic model. Undoubtedly, the ADNEX model could potentially change management decisions for patients with an adnexal mass and is a valuable tool for clinical applications. Further studies on the characteristics of the IOTA ADNEX model in a broader range of patients managed conservatively are needed.