Open Access

Risk of Ovarian Malignancy Algorithm and Pelvic Mass Score for the prediction of malignant ovarian tumors: a prospective comparative study


Cite

Introduction

Ovarian cancer is one of the most common female cancers worldwide. Frequently, ovarian malignancy in diagnosed at an advanced stage, which usually leads to poor prognosis. Early diagnosis of the malignant potential of ovarian masses allows prompt intervention, and referral if needed, which has a great impact on the overall outcome and prognosis(1). Still, there is no available universal screening method for malignant ovarian masses. Many researchers have tested various parameters aiming at an earlier diagnosis of ovarian masses, including clinical features, biological markers, and different imaging modalities; yet, they have failed to reach a consensus on the optimum screening method for ovarian cancer(2). Given the compelling need for an accurate test that can differentiate benign from malignant pelvic masses, various prediction models, scoring systems, and several markers have been analyzed. Despite these efforts, a universally accepted screening test is still unavailable(1).

Risk of Ovarian Malignancy Algorithm (ROMA) is a test that combines serum cancer antigen 125 (CA 125) and human epididymal protein 4 (HE4) together with the menopausal status, to obtain a numerical score(3). ROMA was approved by the FDA for distinguishing malignant from benign pelvic masses in 2011(4). Based on the morphological characteristics of ovarian masses in trans-vaginal ultrasonography, different scoring systems have been proposed(5). Doppler ultrasonography can also have a major role in distinguishing ovarian masses, their vasculature being devoid of muscles; malignant tumors show a higher diastolic flow and decreased impedance on Doppler examination(6). The scoring system called the Pelvic Mass Score (PMS), proposed by Rossi et al., takes into account all parameters, including Sassone score, vascular distribution, resistance index, and menopausal status, all of which have been proven by various studies to have a statistically significant association with the risk of malignancy(7).

This study compares the value of the ROMA and PMS scoring systems in diagnosing malignant ovarian masses.

Materials and methods

This prospective comparative study was conducted from March 2021 to April 2022. Ethical approval was obtained from the Research Ethics Committee (REC) for Human and Animal Research at the Faculty of Medicine, Helwan University (REC-FMHU, serial number: 20-2021). A total of 258 women diagnosed with ovarian mass and eligible for a surgical intervention according to applicable institutional guidelines were enrolled in the study. Pregnant women, as well as women refusing surgery, were excluded from the study; thus, a total of 225 women were ultimately recruited into the study (Fig. 1).

Fig. 1.

Flowchart of the study

A thorough history was taken, and physical examination and ultrasound assessment were done in every recruited woman before enrollment. After clarifying the study aim, all women matching the inclusion criteria were asked for an informed written consent.

Un ultrasound unit (Samsung Medison Co, LTD, Korea Model H60 or Toshiba Aplio 400, Toshiba Medical Systems, Japan) was used to perform pelvi-abdominal ultrasound (3.5 MHz convex probe) for the assessment of masses, ascites and metastases, and in virgins. Color flow Doppler was performed to measure the resistance index of the mass (Fig. 2, Fig. 3, Fig. 4, Fig. 5, Fig. 6). Trans-vaginal ultrasound (7.5 MHz transducer) was performed in non-virgin females. Preoperative venous blood samples were collected from each patient for a full laboratory panel and estimation of CA 125 level by solid phase two-site chemiluminescent immunometric assay using Immulite® 1000 OM-MA and HE4 serum level using human WAP four-disulfide core domain 2, WFDC2(HE4), ELISA kit by ElAab®. Each scoring system was calculated before surgery as follows:

PMS was calculated according to Rossi et al., 2011, with a cut-off value of 29(7). PMS=SASS×LOGCA125×VAS×MSRI {\rm{PMS}} = {{{\rm{SASS}} \times {\rm{LOG}}\left( {{\rm{CA}}125} \right) \times {\rm{VAS}} \times {\rm{MS}}} \over {{\rm{RI}}}}

SASS – numeric value of the Sassone score.

LOG (CA125) – base-10 logarithm of the CA 125 level.

VAS – type of vascularization (peripheral = 1; central/septal = 2).

MS – menopausal status (pre-menopausal = 1; post-menopausal = 2).

RI – numeric value of the resistance index of the pelvic mass.

Fig. 2.

Case of mucinous cystadenoma

Fig. 3.

Case of mature cystic teratoma

Fig. 4.

Case of endometrioid adenocarcinoma

Fig. 5.

Case of papillary serous carcinoma

Fig. 6.

Case of undifferentiated carcinoma

The Sassone scoring system was calculated as shown in Tab. 1 (cutoff value 9)(8).

Sassone scoring system (cut-off value: 9)

US findings 1 2 3 4 5
Inner wall Smooth Irregularity <3 mm Papillary >3 mm Not applicable, mostly solid _____
Structure
Septa No septa Thin <3 mm _____ _____ _____
Wall thickness Thin <3 mm Thick >3 mm Not applicable, mostly solid _____ _____
Echogenicity Sonolucent Low echogenicity Low echogenicity with echogenic core Mixed echogenicity High echogenicity

ROMA classifies women as being at a low or high risk for malignant disease using the following algorithms:

Pre-menopausal: Predictive Index (PI) = − 12.0 + [2.38 × LN (HE4)] + [0.0626 × LN (CA125)]

Post-menopausal: Predictive Index (PI) = − 8.09 + [1.04 × LN (HE4)] + [ 0.732 × LN (CA125)]

Predicted probability: (Pp) =100 exp (PI) / [1+ exp (PI)] (LN = coefficients A for the natural log)

The following thresholds will be selected for:

Pre-menopausal women:

Pp ≥12.5% = high risk of finding ovarian malignancy.

Pp >12.5% = low risk of finding ovarian malignancy.

Post-menopausal women:

Pp ≥14.4% = high risk of finding ovarian malignancy.

Pp <14.4% = low risk of finding ovarian malignancy.

The predictive probability algorithm (ROMA) was developed from two separate studies(3,9).

All women were subjected to a surgical intervention (according to the applicable institutional guidelines), using either open or laparoscopic technique. Histopathological examination of the removed specimens was done, and in line with the recognized gold standard, it was compared to the pre-operative diagnosis of both scoring systems.

Sample size justification: The required sample size was calculated using PASS© version 11 (NCSS, LLC. Kaysville, Utah, USA). The primary outcome measure was the accuracy of the scoring systems studied in the discrimination between women with benign or malignant ovarian masses. A previous study reported that the area under the receiver-operating characteristic curves (ROC) for the prediction of malignant pelvis masses was 0.898 for the ROMA(10). Another study reported that the area under the curve (AUC) for the PMS was 0.96(7). Thus, it is estimated that a sample size of 225 women would achieve a power of 80% (type II error = 0.2) and type I error of 0.05 (confidence level = 95%) to detect a difference of 0.062 between the AUCs for the ROC curves associated with the PMS and the ROMA. The AUCs are assumed to equal 0.898 under the null hypothesis of no difference, and to be 0.96 and 0.898 for the PMS and the ROMA, respectively, under the alternative hypothesis.

IBM© SPSS© Statistics version 21 (IBM© Corp., Armonk, NY, USA), was used for data analysis. The normality of numerical data distribution was checked by the D’Agostino-Pearson test. The ROMA value Pp of ≥12.5 for pre-menopausal women or ≥14.4 for post-menopausal women, and a PMS >29 were used as a cut-off value for the differentiation between benign and malignant ovarian masses, and were examined by the construction of two-by-two contingency tables with the histopathological diagnosis regarded as the gold-standard test. The following diagnostic indices were calculated for each diagnostic tool: sensitivity, specificity, correct classification rate, misclassification rate, false positive rate, false negative rate, positive predictive value, and negative predictive value.

Receiver-operating characteristic (ROC) curve analysis was used to examine the value of the diagnostic indices and biomarkers for the differentiation between benign and malignant ovarian masses. The best cut-off value was identified as that associated with the highest Youden’s J index.

Comparison of the sensitivities of diagnostic tools was done by compiling two-by-two contingency tables for the classification of women with malignant ovarian masses using either tool, and the discordance between both tools was tested using McNemar’s test. To compare the specificities of diagnostic tools, similar contingency tables was constructed for the classification of patients with benign ovarian masses using either tool, and the discordance between both tools was tested using McNemar’s test. To adjust for multiple pairwise comparisons of sensitivities or specificities with McNemar’s test, the Bonferroni method was used to adjust the level of significance for the number of multiple pairwise comparisons. This indicated that a two-sided p-value <0.017 was statistically significant to keep the final type I error at 0.05. For other analyses, a conventional two-tailed p-value <0.05 was considered statistically significant.

Results

Out of the 258 women primarily enrolled into the study, 33 were excluded according to the exclusion criteria, so ultimately a total of 225 patients with adnexal mass were included in the study. Their median age was 40 years (interquartile ranging from 23 to 65 years). A total of 151 women (67.1%) were pre-menopausal and 74 (32.9%) were post-menopausal. Benign lesions were identified in 133 cases, representing 59.1%, while malignant lesions were detected in 92 cases, representing 39.9% (Fig. 1). Various histopathological types of both malignant and benign lesions are shown in Tab. 2 and Tab. 3.

Prevalence of various histopathological types of malignant lesions

Histopathological diagnosis N (%) Histopathological diagnosis N (%)
Endometrioid adenocarcinoma 29 (32.2%) Clear cell carcinoma 9 (10%)
Papillary serous carcinoma 17 (18.9%) Mucinous cystadenocarcinoma 5 (5.6%)
Papillary tubal serous carcinoma 1 (1.1%) Germ cell tumor 3 (3.3%)
Undifferentiated carcinoma 4 (4.4%) Clear cell + endometrioid adenocarcinoma 2 (2.2%)
Mixed epithelial tumor 4 (4.4%) Immature teratoma 2 (2.2%)
Granulosa cell tumor 2 (2.2%) Sclerosing stromal tumor 1 (1.1%)
Sertoli-Leydig tumor 1 (1.1%) Borderline serous tumor 5 (5.6%)
Borderline mucinous tumor 2 (2.2%) Lymphoma 1 (1.1%)
Krukenberg tumor 1 (1.1%) Pseudomyxoma peritonei 1 (1.1%)

Prevalence of various histopathological types of benign lesions

Histopathological diagnosis N (%) Histopathological diagnosis N (%)
Endometriotic cyst 35 (25.9%) Hemorrhagic corpus luteum 4 (3%)
Mucinous cystadenoma 13 (9.6%) Simple follicular cyst 8 (5.9%)
Tubo-ovarian abscess 17 (12.6%) Fibroma 7 (5.2%)
Mature cystic teratoma 24 (17.8%) Serous cyst 10 (7.4%)
Serous cystadenoma 10 (7.4%) Follicular cyst 3 (2.2%)
Hemorrhagic follicular cyst 2 (1.5%) Fibrothecoma 2 (1.5%)

Age was found to be a significant factor associated with malignancy (p <0.0001), with the median age for benign cases being 35 years (23–52), and that for malignant − 50.5 years (34–65). As for the menopausal status, malignancies were more prevalent in post-menopausal women (p <0.001). Among benign cases, 115 women (85.2%) were pre-menopausal in comparison to 20 (14.8%) post-menopausal, while among malignant cases, 54 women (60.0%) were post-menopausal, while 36 (40.0%) were pre-menopausal.

An analysis of different tumor markers and indices in both benign and malignant cases, including HE4, CA 125 as well as the ROMA, PMS and Sassone scores, showed a significant difference between benign and malignant cases (Tab. 4). Receiver-operating characteristic (ROC) curve analysis for the value of various indices and biomarkers in the differentiation between benign and malignant ovarian masses showed that PMS had the greatest area under the curve (AUC) − 0.93, followed by ROMA − 0.91 (Tab. 5). Comparison of the areas under the ROC curves for the differentiation between benign and malignant ovarian masses using various diagnostic indices and biomarkers showed PMS and ROMA to have the highest predictive probability compared to other diagnostic indices; yet, no statistically significant difference was found between the two scoring systems (p = 0.353, 95% CI −0.025 to 0.070) (Fig. 7). Comparing the sensitivities of different scoring systems showed ROMA to be more sensitive than the Sassone score (p = 0.012). PMS is also more sensitive than the Sassone score (p = 0.007), while a comparison of ROMA and PMS showed no significant difference (p = 1.000) (Tab. 6).

Results of biomarker assays and estimated values of predictive indices in women with benign or malignant ovarian masses

Variable Benign (n = 135) Malignant (n = 90) p-value
HE4 (pmol/l) 72.8 (52.90–94.3) 369.6 (80.8–909.1) <0.0001
CA 125 (IU/ml) 25 (11.0–69.8) 341.5 (117.0–640.0) <0.0001
ROMA predicted probability (%) 17.5 (9.1–28.9) 87.1 (67.7–97.9) <0.0001
Sassone score 8 (6–9) 11 (9.0–12.0) <0.0001
PMS 18.1 (10.7–31.6) 106.7 (59.7–198.9) <0.0001

Data are presented as median (interquartile range).

Receiver-operating characteristic (ROC) curve analysis for the value of various indices and biomarkers for differentiation between benign and malignant ovarian masses

Predictor
Index ROMA Sassone PMS CA 125 HE4 RI
AUC 0.91 0.80 0.93 0.86 0.81 0.72
95% CI 0.87 to 0.95 0.74 to 0.85 0.89 to 0.96 0.81 to 0.900 0.76 to 0.86 0.66 to 0.78
p-value <0.0001 <0.0001 <0.0001 <0.0001 <0.0001 <0.0001
J index 0.8 0.47 0.74 0.61 0.64 0.46
Cut-off criterion >48.1 >9 >43.1 >130 >143 ≤0.56
Sensitivity 84.4 67.8 83.3 73.3 67.8 58.9
95% CI 75.3–91.2 57.1–77.2 74.0–90.4 63.0–82.1 57.1–77.2 48.0–69.2
Specificity 95.56 79.26 90.37 87.41 96.3 86.67
95% CI 90.6–98.4 71.4–85.8 84.1–94.8 80.6–92.5 91.6–98.8 79.7–91.9
+LR 19 3.27 8.65 5.82 18.3 4.42
−LR 0.16 0.41 0.18 0.31 0.33 0.47
+PV 92.7 68.5 85.2 79.5 92.4 74.6
95% CI 84.8–97.3 57.8–78.0 76.1–91.9 69.2–87.6 83.2–97.5 62.9–84.2
−PV 90.2 78.7 89.1 83.1 81.8 76
95% CI 84.1–94.5 70.8–85.2 82.6–93.7 75.9–88.9 74.9–87.4 68.4–82.5

AUC – area under the ROC curve; +LR – positive likelihood ratio; −LR – negative likelihood ratio; +PV – positive predictive value; −PV – negative predictive value

Fig. 7.

Comparison of the receiver-operating characteristic (ROC) curves for the differentiation between benign and malignant ovarian masses using various diagnostic indices and biomarkers

Comparison of sensitivity of ROMA ≥12.5 (pre-menopausal) or ≥14.4 (post-menopausal) scores, Sassone score ≥9, and PMS ≥29 for the identification of malignant lesions

ROMA ROMA Sassone
Sass-one <12.5 or <14.4 ≥12.5 or ≥14.4 Σ PMS <12.5 or <14.4 ≥12.5 or ≥14.4 Σ PMS <9 ≥9 Σ
<9 2 16 18 (20%) <29 2 3 5 (5.6%) <29 1 4 5 (5.6%)
≥9 4 68 72 (80%) >29 4 81 85 (94.4%) >29 17 68 85 (94.4%)
Σ 6 (6.7%) 84 (93.3%) 90 Σ 6 (6.7%) 84 (93.3%) 90 Σ 18 (20%) 72 (80%) 90
Diff. 13.3% (95% CI, 2.8 to 19.7) Diff. 1.11% (95% CI, −4.9 to 6.2) Diff. 14.4% (95% CI, 3.8 to 20.8)
p-value 0.012 p-value 1.000 p-value 0.007

Data in the contingency table represent the number of patients; Σ – total;

McNemar test (exact binomial probability)

Discussion

The frequently encountered late diagnosis of ovarian cancers with its resultant poor prognosis highlight the importance of early diagnosis of ovarian masses to enable early management, which has a great impact on the patient’s overall prognosis(1). Prompt diagnosis of ovarian malignancies and early referral to a gynecologic oncologist can improve the patient survival rates(11). However, an accurate method for the prediction of ovarian malignancy is still unavailable(2). The menopausal status, age, sonographic parameters, and tumor markers, separately, might be valuable for distinguishing malignant from benign adnexal masses. Nevertheless, a scoring system integrating all these variables would be probably more accurate in diagnosing malignant masses(6).

The patient’s pre- or post-menopausal status is a fundamental clinical parameter for determining the risk of malignancy(7). In ovarian neoplasms, the post-menopausal status is associated with a statistically significant risk of malignancy(7). This study found the menopausal status to be associated with the malignant potential of adnexal masses. Malignancy was more prevalent in post-menopausal women (p <0.001), with 60.0% of malignant cases present in post-menopausal patients.

Nevertheless, no universal test discriminating benign from malignant adnexal masses is available. A scoring system predicting ovarian malignancy would allow better preoperative counseling and preparation as well as prompt referral of patients to a specialized center(2). CA 125 is a widely used marker for epithelial ovarian cancer (EOC). CA 125 is considered to be the best marker for the clinical follow-up of women with ovarian cancer(6). CA 125 is valuable in differentiating benign from malignant ovarian masses, diagnosis of post-menopausal ovarian masses, follow-up of ovarian cancer, and detection of recurrence following surgical treatment(6). However, inadequate specificity remains a problem, as many benign diseases, both gynecological and non-gynecological, are associated with elevated serum CA 125 levels. Also, it shows low sensitivity in early ovarian cancer stages. Regarding HE4, another proposed tumor marker for ovarian cancer, its levels can also be elevated due to smoking and use of oral contraceptives, and in several benign diseases such as renal failure, effusion, liver disease, lung disease, and chronic heart failure(12). Compared to single marker assays, ROMA has a better diagnostic performance for EOC, with higher sensitivity and greater accuracy compared to HE4 in post-menopausal women(13,14).

The results of the current study support the hypothesis that ROMA has a higher diagnostic accuracy compared to CA 125 or HE4 alone. Still, several cases with false-positive ROMA scores were encountered, one study suggested that patients with a false-positive ROMA score had significantly lower median serum T3 levels than those with a true-negative ROMA score, and the true-positive ROMA group also showed significantly lower serum T3 levels than in false-negative ROMA group(12). This correlation between serum T3 levels and the diagnosis of malignant ovarian masses needs to be studied more thoroughly to be validated.

Sonographic features of adnexal masses have been utilized by various scoring systems to help in predicting the malignant potential of these masses(5). Doppler ultrasonography also has an essential role in differentiating ovarian masses. Malignant tumors have defective muscle layers in their vasculature, and thus have a greater diastolic flow and reduced impedance. These dissimilarities from the vasculature patterns of benign masses are very useful in the diagnostic process(6). One case series, involving 14,317 women and adopting a cut-off point for RI of 0.40, showed only one false-positive and two false-negative results(15). In another study with the same cut-off point, the sensitivity was only 25% and specificity 89%(16). A number of studies in the literature have proposed other cut-off values for RI: one study showed 58.9% sensitivity and 86.7% specificity after adopting a cut-off level of 0.56. Consequently, a standardized Doppler flowmetry with a generally accepted RI cutoff point is difficult to achieve(6).

The PMS is a composite of multiple parameters having statistically a significant association with the risk of ovarian malignancy(17). Rossi et al. detected 100% sensitivity, 93.8% specificity, 70% positive predictive value (+PV), and 100% negative predictive value (−PV) when adopting 29 as the cut-off value(7), and other authors analyzing the same cut-off point showed sensitivity, specificity, +PV and −PV of 70%, 100%, 100% and 76.9%, respectively(6). There are limited data concerning the PMS scoring system available in the literature. After adopting a cut-off point of 14, 95% sensitivity, 80% specificity, 82.6% +PV, and 94.1% −PV were encountered(6). The current study postulated a higher cut-off value, where sensitivity, specificity, +PV and −PV were found to be 83.3%, 90.37%, 85.2%, and 89.1%.

The study showed no statistical difference between the two scoring systems, ROMA and PMS, and found both systems to have an acceptable diagnostic accuracy. Other proposed systems for the diagnosis of malignant masses including ultrasound parameters, CA 125, and the menopausal status are studied to define the malignant potential of adnexal masses. Risk of Malignancy Index (RMI) is one of these systems. In fact, the sample recruited in this study is included as a part of another ongoing study comparing the three scoring systems, with a calculated sample size of 792 participants. The study has the strength point of being prospective, and included a relatively acceptable sample size. Still, including all types of ovarian masses including both EOC and other types of ovarian cancers may be considered as a limitation due to the different nature of these tumors, and might explain the relatively high proposed cut-off values.

Conclusions

Both PMS and ROMA seem to be promising scoring systems for discriminating benign from malignant ovarian masses, but more research is needed to determine the optimum diagnostic pathway, especially one yielding the least false-negative results.

eISSN:
2451-070X
Language:
English
Publication timeframe:
4 times per year
Journal Subjects:
Medicine, Basic Medical Science, other