radiomics is an advanced quantitative image features analysis defined as the conversion of clinical images to higher dimensional data and the subsequent mining of these data for improved decision support in research and clinical practice.1 The majority of clinically available medical images can potentially be evaluated with radiomics analysis. In this perspective, images of computed tomography, Magnetic Resonance Imaging (MRI), ultrasound, mammography or digital breast tomosynthesis and Fluorodesoxyglucose Positron-emission tomography-computed tomography (FDG PET/CT) include more data than what is visible on human eyes.1, 2, 3 Indeed, mathematical algorithms of radiomics examine hundreds of quantitative images considering medical images as data sources. The extracted imaging data could be the product of the mechanisms occurring at a genetic and molecular level linked to the genotypic and phenotypic characteristics of the tissue.4, 5, 6, 7 Among its applications, radiomics has been evaluated to differentiate normal and pathological tissue.1, 2, 3, 5 Hence, we thought about the potential use of radiomics in local recurrence surveillance of soft tissue sarcoma. The incidence of local recurrence (LR) of STS is about 6,5%–25% and is associated with poor outcome for patients.7 According to the American College of Radiology (ACR) Appropriateness Criteria guidelines, MRI is the most appropriate imaging test for LR surveillance of malignant or aggressive musculoskeletal soft-tissue tumors.8 However, data informing the appropriate use of MRI in the surveillance setting are conflicting.5,8,9 Indeed, MRI can differentiate local recurrence from post-surgical seroma, hematoma, inflammation and scarring, but some post-operative changes in the surgical bed can be similar to those of recurrence with conventional T1-weighted, T2- weighted, and post-contrast sequences posing diagnostic dilemmas especially when sarcoma recurrence has low signal intensity on fluid-sensitive images.1,8, 9, 10, 11 We hypothesized that radiomics analysis of MRI of patients undergoing follow-up for STS allows to differentiate normal tissue from pathological tissue of LR. Therefore, the aim of our study was to perform a radiomics analysis of MRI of patients undergoing local surveillance for Soft-tissue sarcoma recurrence.
This is an exploratory study of an ongoing Italian prospective (blind) multicenter study with institutional review board approval (blind). Written informed consent was obtained from participants. This study is endorsed by ESSR (European Society of Musculoskeletal Radiology). Prospective recruitment of patients, as per protocol, includes MRI and US with commercially available equipment. MRI parameters of sequences included in the radiomics analysis are reported in Table 1.
MRI Parameters
Manufacturer | Siemens Healthcare, Erlangen, Germany |
---|---|
Repetition time / echo time (TR/TE) | 500/8 |
Acquisition voxel size (mm3) | 0.6x0.7x3.0 |
Repetition time / echo time (TR/TE) | 6200/110 |
Acquisition voxel size (mm3) | 0.6x0.7x3.0 |
Repetition time / echo time (TR/TE) | 5/3 |
Acquisition voxel size (mm3) | 0.6x0.7x3.0 |
* T2-weighted MR imaging and T1- weighted MR imaging with Gadolinium are acquired with fat-saturation
All consecutive MRI Images of follow-up events acquired between March 2016 to September 2018 were included. A follow-up event was considered a complete MRI assessment to exclude STT LR. Inclusion criteria were: patients included were 18 years and older operated on for localized soft tissue sarcomas of the limb. Exclusion criteria were patients unable to understand or execute written informed consent, unable or unwilling to agree to follow-up during observation period and patients with metastatic disease.
This study focused on finding if some radiomics features could discriminate on MRI patients who had confirmed significant disease at histology (LR) from those who did not. To ensure unbiased assessment, all MRI annotations were performed blinded to the biopsy findings. Only cases with matched locations (surgery, biopsy, reports, and radiologist’s delineations) were considered for inclusion in this study.
Two data-sets were created: pathological and control.
Each data set includes MRI images each with distinctive Regions of interest (ROIs). ROIs were positioned by two researchers (blind and blind; R1 and R2 respectively) expert in quantitative image analysis (8 and 4 years of experience) blindly one from each other. R1 and R2 ROIs data were used separately for intra- and inter-observer agreement estimation while the mean value was used for other estimations. Discrepancies higher that 15% between R1 ROI and R2 ROI were handled with arbitration.
Regions of interest including all the visible tumor or suspicious area represented the pathological data-set. Regions of interest in the same slices of tissue with no imaging evidence of recurrence according radiological assessment represented the control data-set. ROIs of control-data set were positioned not before 20 mm to the tumor to exclude inclusion of tumoral tissue. We did not include cancer cases with recurrences detected only with US. Image analysis was done per-lesion and not per-patient (Figure 1).
Radiomics analysis was performed on all MRI images included in pathological and control data-set within manually selected ROIs (Figure 1) as previously done for other 3D radiological techniques.2 From each image, we extracted 104 image features using an open-source software platform for medical image informatics, image processing, and three-dimensional visualization (3D Slicer 4.7;
All patients were examined by using a 1.5-Tesla equipment (Magnetom Avanto, Siemens Heathcare) with a standard protocol. For each patient, T1-weighted (T1w), Gd-enhanced T1-weighted (T1WGd) with 0.1-mmol/kg doses of gadoteric acid and T2-weighted with fat saturation (T2w fs) volume images were resampled to an uniform pixel spacing of 0.5 x 0.5 x 3mm. Then they were cropped to the lesion region of interest as delineated by R1 and R2. Inhomogeneity correction was applied to T1w and T2w fat saturated images to account for the presence of bias field artifacts. Next, T1w and T2w fs images were corrected for inherent acquisition-to acquisition signal intensity variations (non standardness) using scale-based standardization. This procedure was applied to mitigate the inherent drift phenomenon that accompanies MRI intensities as previously done in literature.12
Intra-observer agreement resulted to be 0.62 (95% CI: 0.52–0.67) for single measurements and 0.75 (95% CI: 0.69–0.80) for the average measure and was deemed acceptable for the purpose of the study. There were no discrepancies higher that 15% between R1 ROI and R2 ROI requesting arbitration.
N = 11 adult patients (6 men and 5 women) with suspicious STS LR were included for a total of 33 follow-up events on MRI. A total of 198 data-sets per patients of both pathological and normal tissue were analyzed (99 pathological and 99 control data-sets). Characteristics of the 19 pathological findings in 33 follow-up events are reported in Table 2. N = 3 patients had multiple lesions.
Distribution of the extremity soft tissue sarcoma patients’ clinical characteristics in 19 pathological findings of 11 patients in 33 follow-up events. N = 3 patients had multiple lesions
Clinical Characteristic | |
---|---|
Age (years) | 57.8 ± 17.8 |
Tumor size (mm) | 26,2 ± 16.9 |
G1 | 4 (21) |
G2 | 6 (37) |
G3 | 8 (42) |
Unassigned | 1 (5) |
Superficial | 6 (32) |
Deep | 13 (68) |
Upper extremity | 5 (26) |
Lower extremity | 14 (74) |
Pleomorphic liposarcoma | 6 (33) |
Myxofibrosarcoma | 5 (27) |
Myxoid liposarcoma | 2 (10) |
Leiomyosarcoma | 2 (10) |
Nerve sheath tumors | 2 (10) |
Synovial sarcoma | 2 (10) |
After feature number reduction to avoid overfitting, Mann-Whitney U test identified n = 7, n = 13 and n = 12 features able to differentiate pathological tissue from normal tissue on T1w MRI images, T2w fat saturated MRI images and T1wGd respectively (p < 0.001). Table 3 shows feature domain according to different MRI sequences.
Feature domain according to different MRI sequences
Feature | Description | Significance | T1-weighted MR imaging | T2-weighted MR imaging* | T1-weighted MR imaging with Gadolinium |
---|---|---|---|---|---|
Shape domain | descriptors of the three-dimensional size and shape of the ROI. | These features are independent from distribution the gray in level the ROI intensity and are therefore only calculated on the non-derived image and mask | 1 | 1 | 2 |
First order | Mean, standard deviation, median, and range; first-order differentials computed using Sobel operators | Localize hypo- and hyperintense regions; gradients detect edges and quantify region boundaries | 1 | 1 | 1 |
Gray level co-occurrence matrix (GLCM) | Localization of regions with significant intensity changes; gradients detect edges and quantify region boundaries | Localizes regions based on underlying heterogeneity of voxel intensities | 3 | 7 | 6 |
Gray level run lenght matrix (glrlm) | quantifies gray level runs, which are defined as the length in number of pixels, of consecutive pixels that have the same gray level value. | In a gray level run length matrix the element describes the number of runs with gray level and length occur in the image (ROI) along angle | 2 | 3 | 6 |
Gray level size zone matrix (glszm) domain | It is an advanced statistical matrix used for texture characterization. It estimates bivariate conditional probability density function of the image distribution values | represent the count of how many times a given size of given grey level occur | 0 | 0 | 0 |
Some radiomics features were significantly correlated to tumor size: 4 features (Compactness, MajorAxis, Flatness, Mean) with r = 0.75 with p < 0.02, on T1w MRI images, 2 features (MajorAxis, RootMeanSquared) on T2w fat saturated MRI images with r = 0.65 with p < 0.01, and 2 features (MajorAxis, Maximum) on T1wGd with r = 0.65 with p < 0.01. Four radiomics features (Sum entropy, difference entropy, energy, lmc2) were correlated with grading (4 features with r = 0.74 and p < 0.05) on T1w MRI images and none on T2w fat saturated MRI images and T1wGd respectively.
ROC analysis performed on T1w images, T2w fat saturated images and T1wGd showed an AUC between 0.71 (95%CI: 0.55–0.87) and 0.96 (95%CI: 0.87–1.00). Detailed results are reported in Table 4 and Figure 2. T2w fat saturated with fat-saturation and T1w post-Gadolinium showed a better performance than T1w images (p < 0.05).
ROC results according to different MRI sequences of the selected features
Sequence | Minimum AUC | 95%CI | Maximum AUC | 95%CI |
---|---|---|---|---|
T1-weighted MR imaging | 0.71 | 0.54–0.88 | 0.84 | 0.77–0.96 |
T2-weighted MR imaging* | 0.81 | 0.67–0.95 | 0.91 | 0.83–1.00 |
Twith 1-weighted Gadolinium MR imaging* | 0.87 | 0.69–1.00 | 0.96 | 0.87–1.00 |
Mean time to perform radiomics analysis was at least 5 h per patient including the creation of a data set of 19 patients with suspicious cancer visible at MRI, therefore the total time to perform analysis, excluding the time to recruit and select patients was at least 100 h.
In this study we performed a radiomics analysis on MRI images of patients suspected of having STS LR belonging to an ongoing prospective trial. The aim was to investigate whether radiomics features derived from standard clinical MRI sequences could be used to differentiate normal tissue from cancerous tissue of LR. In clinical practice, differentiating normal from pathological tissue on MRI in patients suspected of having LR is relevant especially when LR are not nodules, but plaque-like “tails” of tumor on MRI, especially for both undifferentiated pleomorphic sarcoma and myxofibrosarcoma.5,11 Despite the relatively low number of patients included in this study, we had a high percentage of pleomorphic sarcoma/liposarcoma and/or myxo-fibrosarcoma accounting for 11/19 of the lesions evaluated. The number of lesions evaluated in this study is similar to the number of pleomorphic sarcoma/liposarcoma and/or myxofibrosarcoma evaluted in the study by Corino
This study has several limitations. First, radiomics features were not extracted on ADC maps, that have been demonstrated to assess tumor cellularity even when different scanners are used.14 However, this is not a multicentric study and the MRI image sample was included from only one center. Second, the study population was relatively small. However, considering the incidence of LR in patients who underwent surgery for STT (reported rates of LR range from 6.5% to approximately 25%)5, the number of patients and images evaluated seem to be sufficient for the purpose of the study. The presence of a relatively high number of pleomorphic sarcoma and myxofibrosarcoma increases the clinical significance of the study because they are tumors difficult to be evaluated on MRI. Selection biases are excluded due to consecutive patient enrollment in the present study. Moreover, three MRI sequences per lesion were analyzed increasing data robustness. Finally, we also acknowledge that future software developments will reduce the necessity of freehand ROIs positioning.19 We also believe that further research will be critical to fully unveil the potential or radiomics in STT evaluation, indeed, it has been studied that radiomics features extracted from MR images are independently associated with survival when accounting for age and tumor grade and helps in differential diagnosis.20,21
In conclusion, radiomics features allow to detect LR on MRI images in STS local surveillance. radiomics in STS evaluation is useful not only for detection purposes but also for lesion biological characterization.