Safety margin assessment after microwave ablation of liver tumors: inter- and intrareader variability

Thermal ablation methods, such as microwave ablation (MWA), have established themselves in recent years as a suitable therapy for various malignancies.¹ The execution of a tumor ablation involves many challenges. One crucial factor for a successful ablation and the prevention of residual tumor tissue or the onset of a tumor recurrence is maintaining a sufficient safety margin. There is currently no clear consensus on what a sufficient safety distance is.² Most authors recommend a safety margin between 5-10 mm.^1,3,4,5,6,7 The precondition for the determination of a suitable safety distance, however, would initially be a proper measurement method. So far, this has been proved another major challenge. Several studies have investigated postinterventional methods and measurement techniques, such as computed tomography (CT) or magnetic resonance imaging (MRI), to be able to make a valid decision about a complete ablation.^8,9,10 Other authors try to improve the intraprocedural tumor detection and the assessment of the ablation margin as a recently published study used a FDG PET/CT guided ablation for the intraprocedural determination of the safety distance and achieved good results.¹¹ Many authors favour the MRI for ablation margin control.^10,12 However, in most cases tumor ablation is performed under CT guidance and the safety margin is assessed in the CT images, at least in the periinterventional setting. For best treatment, the decision whether ablation is complete or not should be made as immediately as possible. Therefore, in most cases, native or contrast-enhanced CT scans are performed, and the extent of the ablation is decided by side-by-side comparison of the pre- and post-interventional images or by simple and fast measuring techniques during the intervention like measurement with a simple distance measurement tool. Unfortunately, we do not have reliable data on the consistency and reproducibility of these subjective estimations of a sufficient safety distance in a real-world setting. For this reason, in this study we investigated the inter- and intrareader variability of the safety margin assessment after microwave ablation of liver malignancies.

Patients and methods

Study design and participant selection

The local ethics committee approved this retrospective study. Written informed consent was obtained from all patients. A total of 58 patients were included in this study, who were treated with microwave ablation between September 2017 and June 2019. Tumor entities were hepatocellular carcinoma (HCC) and metastases of colorectal and pancreatic cancer. Exclusion criteria were the patient’s refusal to participate in the study and other tumors than those mentioned above. All patients received a CT-scan one day before ablation and on the first postinterventional day (Figure 1). Subsequently, all patients were independently assessed by three interventional radiologists regarding the safety margin between tumor and healthy liver tissue using side-by-side measurement. No special evaluation software was used to simulate the procedure in everyday practice as accurately as possible. The orientation was based on reference points that could be reproduced exactly, e.g. prominent vessel outlets, foreign material or calcifications. Of course, the different breathing positions had to be taken into consideration as well. One of the three readers re-evaluated the patients after six weeks to detect possible intraindividual variability. The six weeks period of time between the two readings should ensure, that the reader could not remember the patients and avoid a bias. The minimum safety distance, the maximum safety distance and whether the ablation was considered complete or incomplete, i.e. technical efficacy, were estimated. The 6 weeks follow-up MRI was regarded as the gold standard for technical efficacy.

(A) Pre-interventional arterial phase in which the tumor is almost invisible. This not only complicates ablation but also post-interventional detection of residual tumor tissue. (B) The result immediately postinterventionally with a corresponding ablation defect. (C) Axial and (D) coronal show the situation one day postinterventionally. Due to the different breathing position, the tumor was more peripheral the day before, whereas on the following day healthy liver tissue around the ablation defect is visible. Measuring the safety distance is particularly difficult in these cases.

Statistics

Intraclass correlation (ICC) estimates and their 95% confident intervals were calculated using R irr statistical package version 3.5.1 based on a mean-rating (k = 3), absolute-agreement, 2-way mixed-effects model. The intraclass correlation coefficient (ICC) was calculated for the estimation of the minimal safety margin. ICC values less than 0.5 are considered indicative of poor reliability, values between 0.5 and 0.75 indicate moderate reliability, values between 0.75 and 0.9 indicate good reliability, and values greater than 0.90 indicate excellent reliability. Bland-Altman analyses were used to assess agreement in the side-by-side measurements between the two blinded readings (minimal safety margin) by the same radiologist and between the

readings (minimal safety margin) by the three independent radiologists.

Results

Patient and tumor characteristics

58 patients were included and evaluated. The mean age was 62.84 (10.85) years. 53 patients (91%) were male. All 58 patients were treated with MWA. Most tumors (n = 12) were located in liver segment VII, followed by segments VIII (n = 9) and IV a and V with n = 7 each. The minority of tumors were found in segments I and IV b with n = 3 each. The baseline data are shown in Table 1.

Table 1

Baseline characteristics

Number of patients	N = 58
Age
mean (years)	62.84 (10.85)
range (years)	36–83
Gender
male (%)	53 (91)
Ablation method (%)
microwave ablation	58 (100)
Liver segments
I	3 (5)
II	4 (7)
III	6 (10)
IVa	7 (12)
IVb	3 (5)
V	7 (12)
VI	4 (7)
VII	12 (21)
VIII	9 (16)
Tumor entity
Hepatocellular carcinoma	46 (79)
Metastasis colorectal cancer	9 (16)
Metastasis pancreatic cancer	3 (5)

Inter- and intrareader variability

The intraclass correlation coefficient (ICC) for estimation of the interindividual variability of the assessment of the minimal safety margin for all three readers was 0.357 (95%-confidence interval 0.194–0.522), indicating a poor reliability. The ICC for estimation of the variability of two repeated estimations of reader 1 was 0.774 (95%-confidence interval 0.645–0.860), indicating a good reliability for repeated measurements.

Bland–Altman plots were calculated to show intra- and interindividual variability (Figure 2). A systematic error was not detectable. The standard deviation in the intrareader-result was smaller compared to the interindividual evaluations. Nevertheless, deviations of more than 5 mm can be detected in

Bland-Altman plots: intra- (A) and inter-reader (B) = reader 1 vs. reader 2, (C) = reader 2 vs. reader 3, (D) = reader 1 vs. 3) agreement for minimum safety margin measurements. The middle line shows the mean percentage difference in measurements and the dashed lines above and below show the 95% reference range. Measurements within the 95% reference range can be considered as intrinsic measurement errors (or variations) that are associated with the given measurement tools and imaging techniques. Therefore, a narrower reference range indicates a lower measurement error/variation.

some measurements. The differences of the safety margins measured by the two readers are clearly larger in comparison to the deviations between both measurements performed by one reader.

Assessment of complete ablation

The readers achieved a sensitivity and specificity of 93%/82%/82% and 33%/17%/83%, respectively. The positive predictive value (PPV) was 91%/88%/97%. The negative predictive value (NPV) was 40%/10%/39%. The results are shown in Table 2 and 3.

Table 2

Contingency table of all the three independent readings compared with the six weeks follow-up MRI as gold standard

	Incomplete (6 weeks MRI)	Complete (6 weeks MRI)
Reader 1
Incomplete	2	3
Complete	4	41
Reader 2
Incomplete	1	8
Complete	5	36
Reader 3
Incomplete	5	8
Complete	1	36

Table 3

Sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) of the three independent readings (R 1, 2, 3)) compared with the six weeks follow-up MRI as gold standard

	R 1	R 2	R 3
Sensitivity	93%	82 %	82 %
Specificity	33%	17 %	83 %
PPV	91 %	88 %	97 %
NPV	40 %	10 %	39%

Discussion

There is agreement that a safety distance is necessary after ablation of a liver tumor to prevent local tumor recurrence. When defining the optimal safety distance, there are already different approaches and no general definition. Most authors favour a minimum distance of 5 mm (1,3–7).

We agree with this in principle. In our opinion, however, the measurement methods are rarely described or questioned. Therefore, our approach was to question the measurement of the safety distance in the daily routine (Figure 3 and 4).

The HCC in segment VIII (A) was pre-interventionally localized using landmarks such as metal clips (arrow) and anatomical landmarks like the kidney (arrow) (B). Post-interventionally, the same landmarks are used and the target area is localized by distance measurements (dashed lines) from different angles (C). The MRI follow-up after 6 weeks confirmed complete ablation (D).

Metastasis was best seen in the portal venous phase (B). In this case, clips after hemihepatectomy serve as orientation. A line is drawn (solid line) and the distance (dashed line) is measured by means of a clip at an angle of 90 degrees. The same fixed points are used postinterventionally. This already shows only a small safety distance in the peripheral area. In the 6 weeks follow-up MRI residual tumor tissue (circle) was detected.

This confirmed our impression that measurement with the standard tools provided by the CT software can lead to difficulties in measurement and thus to considerable intraindividual differences. Although the reading was performed by three experienced interventional radiologists, the inter-reader variability was poor.

One reason could be the localization of the tumor. Subcapsular tumors represent a special measuring challenge. The same applies for tumors in the immediate vicinity of other organs or vessels that are also difficult to measure.

Another aspect that can lead to considerable differences in the evaluation of the distance is the choice of the reconstruction planes and the layer thickness. Zhao et al. claims to achieve best results with 1D or 3D 2.5 mm slices compared to 2D. From our point of view, an evaluation of the ablation zone in three planes is absolutely but also leads to a higher interreader variability.

A contentious aspect is always the experience of the interventionalist. Therefore, in our study the reading was carried out by an experienced radiologist (5 years experience), a specialist radiologist (7 years experience) and the head of the Centre for Interventional Oncological Radiology. The aim was to rule out diagnostic errors due to inexperience. Nevertheless, there were considerable differences between all three readers, which called the measuring method into question.

In our opinion, the fact that the intraindividual differences were smaller shows that there is no systematic error. The measurement results are interin-dividually different but not random. In our opinion, this indicates that our study results are reliable and meaningful.

New measurement methods or software for tumor segmentation are already being investigated in some studies.^{5,8,9,12,13,14,15,16} The results were promising and improved the assessment of ablation success. Our study was able to show that conventional measurement methods are inaccurate and can lead to large interindividual differences. We therefore support the development of new measurement methods to achieve more reliable measurement results.

eISSN:: 1581-3207
Langue:: Anglais

Périodicité:: 4 fois par an
Sujets de la revue:: Medicine, Clinical Medicine, Radiology, Internal Medicine, Haematology, Oncology

RSS Feed de la revue

Safety margin assessment after microwave ablation of liver tumors: inter- and intrareader variability

Article Category: Research Article

Publié en ligne: 12 févr. 2020

Pages: 57 - 61

Reçu: 16 sept. 2019

Accepté: 10 janv. 2020

DOI: https://doi.org/10.2478/raon-2020-0004

© 2020 Jan Schaible, Benedikt Pregler, Wolf Bäumler, Ingo Einspieler, Ernst-Michael Jung, Christian Stroszczynski, Lukas Philipp Beyer, published by Sciendo

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.

Figure 1

Figure 2

Figure 3

Figure 4