Variability in prostate and seminal vesicle delineations defined on magnetic resonance images, a multi-observer, -center and -sequence study
- Tufve Nyholm^{1},
- Joakim Jonsson^{1},
- Karin Söderström^{2},
- Per Bergström^{2},
- Andreas Carlberg^{3},
- Gunilla Frykholm^{4},
- Claus F Behrens^{5},
- Poul Flemming Geertsen^{5},
- Redas Trepiakas^{5, 6},
- Scott Hanvey^{7},
- Azmat Sadozye^{8},
- Jawaher Ansari^{8},
- Hazel McCallum^{9},
- John Frew^{9},
- Rhona McMenemin^{9} and
- Björn Zackrisson^{2}
https://doi.org/10.1186/1748-717X-8-126
© Nyholm et al.; licensee BioMed Central Ltd. 2013
Received: 31 October 2012
Accepted: 28 March 2013
Published: 24 May 2013
Abstract
Background
The use of magnetic resonance (MR) imaging as a part of preparation for radiotherapy is increasing. For delineation of the prostate several publications have shown decreased delineation variability using MR compared to computed tomography (CT). The purpose of the present work was to investigate the intra- and inter-physician delineation variability for prostate and seminal vesicles, and to investigate the influence of different MR sequence settings used clinically at the five centers participating in the study.
Methods
MR series from five centers, each providing five patients, were used. Two physicians from each center delineated the prostate and the seminal vesicles on each of the 25 image sets. The variability between the delineations was analyzed with respect to overall, intra- and inter-physician variability, and dependence between variability and origin of the MR images, i.e. the MR sequence used to acquire the data.
Results
The intra-physician variability in different directions was 1.3–1.9 mm for the prostate and 3–4 mm for the seminal vesicles (1 std). The inter-physician variability in different directions was 0.7–1.7 mm and approximately equal for the prostate and seminal vesicles. Large differences in variability were observed for individual patients, and also for individual imaging sequences used at the different centers. There was, however, no indication of decreased variability with higher field strength.
Conclusion
The overall delineation variability is larger for the seminal vesicles compared to the prostate, due to a larger intra-physician variability. The imaging sequence appears to have a large influence on the variability, even for different variants of the T2-weighted spin-echo based sequences, which were used by all centers in the study.
Introduction
Successful radiotherapy depends on high geometric and dosimetric accuracy and precision. The introduction of treatment planning and dose calculation in 3D, more than two decades ago, has provided clinicians with very good control over the dosimetric aspects of the treatment, with typical relative errors on the order of a few percent. The more recent introduction of intensity modulated radiotherapy (IMRT) [1] has made it possible to shape the dose distribution to closely match the target volume, and the use of image guided radiotherapy (IGRT) [2] enables reproducible patient positioning at every treatment fraction. At present, we have come close to a point where we can “hit the target” with the right dose every time with minimal dose deposition outside the intended volume. Hence, treatment precision has dramatically improved. However, there are still problems to be solved, notably the uncertainty in the definition of the target described by Njeh [3]. Sharp dose gradients are more a hazard than a benefit if the geometric uncertainty in delineation is large.
The use of magnetic resonance (MR) imaging alone, or together with computed tomography (CT), improves the target delineation accuracy for many diagnoses [4, 5], and MR imaging is today in routine clinical use at many centers as a part of the preparation for radiotherapy. The dedicated MR examination for radiotherapy treatment planning involves issues not present in the diagnostic setting. The patient should ideally be imaged in the same position as during treatment, including fixations [6], which influence both the coil setup and image quality [7, 8]. The geometric accuracy of the images is crucial, which increases the demands on the choice of sequences and bandwidth [9], and the sequences and image planes should be optimized for determination of the precise geometrical extent of an already known pathology.
There are two alternative ways of incorporating MR into the radiotherapy workflow: either the MR images are seen as a complement to the CT for target definition, or the MR replaces the CT throughout the entire treatment process. The CT/MR workflow is already established in many centers, but suffers from drawbacks in terms of increased workload and potential introduction of geometric errors resulting from the image registration procedure [10, 11]. Fully MR based workflows have been described in the literature [12–15] and are considered feasible.
For prostate cancer patients, the use of MR alone or in combination with CT has been shown to reduce inter-observer variability in target definition and reduce the treatment volume [16–19]. The treatment of prostate cancer has been considered one of the most straightforward diagnoses for an MR only workflow, as the dose calculation accuracy in the pelvic region is adequate with bulk density assignments [20, 21] and the commonly used gold markers are visible with reliable geometric accuracy [22]. In addition to the technical challenges with the MR based workflow, one must also consider that the physicians need to adapt to a target definition process without CT information, and that the MR sequences need to be optimized for target definition purposes.
The aim of the present multi-center study is to evaluate the intra- and inter-physician variability of prostate and seminal vesicle volume delineations based on MR sequences from five different radiotherapy centers in the clinical setting. All centers participating in the study were at the time investigating the use of an MR based workflow for the treatment of prostate cancer. As part of this process it was considered important to perform an inter-clinic comparison of both the standard clinical MR images and the interpretation of the images by the physicians. The observed variations can be assumed to reflect the clinical reality as the images were acquired with the standard clinical protocol and the physicians were instructed to perform the delineation as for an ordinary clinical case.
Methods and materials
Five centers were involved in the study: Umeå University Hospital (Umeå, Sweden), Karolinska Hospital (Stockholm, Sweden), Herlev Hospital (Copenhagen, Denmark), Newcastle Upon Tyne Hospitals NHS Trust (Newcastle, United Kingdom) and Beatson West of Scotland Cancer Centre (Glasgow, United Kingdom). All centers were, at the time of the study, routinely using MRI data in their clinical practice for target definition for prostate cancer patients, except Karolinska, which was in the startup process. Both participating physicians from Karolinska did, however, have extensive previous experience (>5 years) of prostate delineations on MR images from other hospitals. The different scanners and sequences used in the study are listed in Table 1. All centers had chosen to use spin-echo based T2 weighted images as the primary basis for target delineation.
Imaging and preparation of data
Five consecutive patients scheduled for radiotherapy of the prostate were selected from each site. All patients had MR examinations as part of their standard preparation for radiotherapy. The axial images which were typically used for target delineation were anonymized and sent to the study coordinator. The 5 image series from each of the 5 sites were tagged as CT studies in the DICOM files to enable delineations to be performed directly on the MR data in all oncology delineation software applications. The set of 25 image series were then sent to each site and imported into the clinically used treatment planning systems or dedicated delineation software.
Delineations
Two physicians from each site independently delineated the prostate volume and the seminal vesicles. The instruction was: “Both prostate and vesicles should be delineated as if a clinical case with high risk for vesicle involvement”. The prostate and the vesicle delineations were stored as separate structure sets. After finalizing the delineations, the structure sets were returned to the study coordinator as DICOM RTstruct files for analysis.
Analysis
Table 1: Scanners and sequences used at the participating centers. The sequence used at center C was a 3D sequence (Siemens, SPACE), while the other clinics used 2D sequences.
Center | Delineation software | Scanner | Field strength | Echo time (ms) | Rep. time (ms) | Slice thickness (mm) | Pixel size (mm^{2}) |
---|---|---|---|---|---|---|---|
A | Eclipse | Philips Panorama | 1.0 T | 110 | 4471 | 2 | 0.91 × 0.91 |
B | Eclipse | Siemens Verio | 3 T | 92 | 3440 | 3.6 | 0.52 × 0.52 |
C | ProSoma[MedCom] | Siemens Espree | 1.5 T | 125 | 3000 | 1.7 | 0.78 × 0.78 |
D | Oncentra | Siemens Espree | 1.5 T | 115 | 10200 | 3.3 | 1.17 × 1.17 |
E | Eclipse | GE Signa HDxt | 1.5 T | 90 | 2520 | 2.5 | 0.82 × 0.82 |
For the prostate, the first step was to calculate the joint center of mass for all delineations for each patient. The distance from the center was calculated for each delineation in the directions right, left, anterior, posterior, superior, inferior, right-posterior and left-posterior. To reduce the influence of small scale variations in the structure sets and create a representative measure for the distance, the average over a solid angle Ω = 0.49 sr was used (Figure 1). This procedure provides a single numerical measure for the distance in the different directions for each patient and delineation.
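The directional measure described above can be illustrated with a minimal sketch. It assumes that the solid angle is a circular cone centered on the direction of interest (so that Ω = 2π(1 − cos θ) gives a half-angle θ of roughly 23° for 0.49 sr) and that the delineated surface is available as a list of 3D points; neither detail is stated in the text. The function names and the sample points are hypothetical, and the plain mean over points is a simplification of a true average over the solid angle.

```python
import math

def cone_half_angle(solid_angle_sr):
    """Half-angle (radians) of a circular cone subtending the given solid angle."""
    return math.acos(1.0 - solid_angle_sr / (2.0 * math.pi))

def mean_distance_in_cone(points, center, direction, solid_angle_sr=0.49):
    """Mean distance from center to the surface points that fall inside the cone."""
    cos_limit = math.cos(cone_half_angle(solid_angle_sr))
    norm = math.sqrt(sum(d * d for d in direction))
    unit = [d / norm for d in direction]
    distances = []
    for point in points:
        offset = [a - b for a, b in zip(point, center)]
        r = math.sqrt(sum(o * o for o in offset))
        if r == 0.0:
            continue  # a point at the center has no direction
        cos_angle = sum(o * u for o, u in zip(offset, unit)) / r
        if cos_angle >= cos_limit:
            distances.append(r)
    if not distances:
        raise ValueError("no surface points inside the cone")
    return sum(distances) / len(distances)

# Hypothetical surface points (mm); only the first two lie inside the cone.
surface = [(10.0, 0.0, 0.0), (9.0, 3.0, 0.0), (9.0, 4.5, 0.0),
           (0.0, 10.0, 0.0), (-10.0, 0.0, 0.0)]
half_angle = cone_half_angle(0.49)
distance = mean_distance_in_cone(surface, (0.0, 0.0, 0.0), (1.0, 0.0, 0.0))
```

Averaging over a cone rather than reading a single surface point makes the measure less sensitive to local contour irregularities, which is the motivation given in the text.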
The joint center of mass for each patient was also used as a starting point for the analysis of the vesicle delineations. The shape of the vesicles does not, however, allow the same analysis approach, because the surface is sometimes concave. Instead, the maximum distances from the center of mass in the right, left, anterior, posterior, superior and inferior directions were calculated for each delineation.
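A possible reading of the vesicle measure is the largest projection of the delineated surface onto each anatomical axis, measured from the joint center of mass. The sketch below makes that assumption and also assumes the DICOM patient coordinate convention (x increases toward the patient's left, y toward posterior, z toward superior); the function name and sample points are hypothetical.

```python
def max_extents(points, center):
    """Largest extent (mm) of the surface points from center along each axis.

    Assumes DICOM patient coordinates: +x left, +y posterior, +z superior.
    """
    offsets = [[a - b for a, b in zip(point, center)] for point in points]
    xs, ys, zs = zip(*offsets)
    return {
        "left": max(xs), "right": -min(xs),
        "posterior": max(ys), "anterior": -min(ys),
        "superior": max(zs), "inferior": -min(zs),
    }

# Hypothetical surface points (mm) relative to a center at the origin.
extents = max_extents(
    [(12.0, 3.0, -4.0), (-8.0, 5.0, 6.0), (1.0, -7.0, 2.0)],
    (0.0, 0.0, 0.0),
)
```

Because only the extreme point along each axis matters, this measure is well defined even for concave shapes, at the cost of being more sensitive to single outlying contour points.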
Nomenclature
i.e. the difference between an individual physician (q) delineation on a specific patient (p) and the average delineation over all physicians, for the delineation characteristic var.
use the notation $\mathrm{q}\in \mathrm{DC}$. In the results section we use the notation A[variable]_{parameter} for the average, and S[variable]_{parameter} for the standard deviation, where the parameter defines the group. For example $A{\left[{\overline{x}}_{\mathrm{p},*}^{\mathrm{var}}\right]}_{\mathrm{p}\in \mathrm{IC}}$ refers to the average measure of the delineation property var for all patients coming from imaging center IC.
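The displayed equations that introduce this notation are not reproduced in this copy of the text. Judging from the surrounding definitions, they plausibly take the following form, where $x_{\mathrm{p},\mathrm{q}}^{\mathrm{var}}$ is the value of the characteristic var for patient p and physician q, and $N_{Q}$ is the number of physicians. This is a reconstruction under those assumptions, not the authors' typeset equations.

```latex
\overline{x}_{\mathrm{p},*}^{\mathrm{var}}
  = \frac{1}{N_{Q}} \sum_{\mathrm{q} \in \mathrm{Q}} x_{\mathrm{p},\mathrm{q}}^{\mathrm{var}},
\qquad
y_{\mathrm{p},\mathrm{q}}^{\mathrm{var}}
  = x_{\mathrm{p},\mathrm{q}}^{\mathrm{var}} - \overline{x}_{\mathrm{p},*}^{\mathrm{var}}
```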
Statistical analysis
The normality of the data was checked through visual inspection of Q-Q plots. Most reported significant differences use a Bonferroni-corrected significance level of 0.01. The strict significance level was chosen because the main purpose of the tests was to highlight the most pronounced effects in the dataset, where most factors can be expected to have an influence. Two-sided F-tests were used to compare distributions and t-tests to compare averages, unless otherwise indicated.
Intra and inter-observer variation
where ${w}_{\mathrm{q}}^{\mathrm{var}}$ is a factor that depends only on the physician, with expectation value $E{\left[{w}_{\mathrm{q}}^{\mathrm{var}}\right]}_{\mathrm{q}\in \mathrm{Q}}\equiv 0$ and standard deviation ${\sigma}_{w}^{\mathrm{var}}$. ${z}_{\mathrm{p},\mathrm{q}}^{\mathrm{var}}$ is a factor dependent on both patient and physician, also with expectation value $E{\left[{z}_{\mathrm{p},\mathrm{q}}^{\mathrm{var}}\right]}_{\mathrm{p}\in \mathrm{P},\mathrm{q}\in \mathrm{Q}}\equiv 0$, and standard deviation ${\sigma}_{z}^{\mathrm{var}}$. Equation 3 is an ordinary one-way random-effects ANOVA model [23], where ${w}_{\mathrm{q}}^{\mathrm{var}}$ is the effect of the physician, hence ${\sigma}_{w}^{\mathrm{var}}$ is interpreted as the inter-physician variation, and ${z}_{\mathrm{p},\mathrm{q}}^{\mathrm{var}}$ accounts for the residual variation, hence ${\sigma}_{z}^{\mathrm{var}}$ is interpreted as the intra-physician variation (Figure 2).
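The separation into inter- and intra-physician components can be sketched with the textbook method-of-moments estimators for a balanced one-way random-effects model. The sketch assumes that the model has the form y = w_q + z_pq, which is what the description above implies. The function name and the toy data are hypothetical, and the degrees of freedom follow the textbook convention, which may differ from the authors' implementation.

```python
import math

def separate_variability(y):
    """Estimate (s_w, s_z) from y[q][p], the deviation for physician q on patient p.

    s_w estimates the inter-physician and s_z the intra-physician standard deviation.
    """
    n_q = len(y)
    n_p = len(y[0])
    physician_means = [sum(row) / n_p for row in y]
    grand_mean = sum(physician_means) / n_q
    # Residual (within-physician) mean square estimates sigma_z squared.
    ss_within = sum((value - mean) ** 2
                    for row, mean in zip(y, physician_means) for value in row)
    ms_within = ss_within / (n_q * (n_p - 1))
    # Between-physician mean square estimates sigma_z^2 + n_p * sigma_w^2.
    ms_between = n_p * sum((mean - grand_mean) ** 2
                           for mean in physician_means) / (n_q - 1)
    var_w = max(0.0, (ms_between - ms_within) / n_p)  # clip negative estimates
    return math.sqrt(var_w), math.sqrt(ms_within)

# Toy data: two physicians, two patients each.
s_w, s_z = separate_variability([[1.0, 3.0], [5.0, 7.0]])
```

The clipping at zero reflects that the moment estimator for the between-physician variance can come out negative when the residual variation dominates.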
Confidence intervals for the true variabilities were found using simulation. The simulation was performed in a custom written Matlab™ Monte Carlo script. The script searched the ${\sigma}_{z}^{\mathrm{var}}$ and ${\sigma}_{w}^{\mathrm{var}}$ space to identify the area where the probability to get the observed ${s}_{z}^{\mathrm{var}}$ and ${s}_{w}^{\mathrm{var}}$ or more extreme values was below 5%.
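The authors' Matlab script is not available in the text, so the following is only a sketch of the general idea in Python. For one candidate pair of true values it simulates the random-effects model and reports how often the estimated inter-physician standard deviation is at least as large as the observed one. Repeating this over a grid of candidate values and keeping those where the probability stays above the chosen threshold would trace out a confidence region. The defaults of 10 physicians and 25 patients follow the study design; the function names, the estimator and the one-sided definition of "more extreme" are assumptions.

```python
import math
import random

def estimate_s_w(y):
    """Method-of-moments estimate of the inter-physician standard deviation."""
    n_q, n_p = len(y), len(y[0])
    means = [sum(row) / n_p for row in y]
    grand = sum(means) / n_q
    ss_within = sum((v - m) ** 2 for row, m in zip(y, means) for v in row)
    ms_within = ss_within / (n_q * (n_p - 1))
    ms_between = n_p * sum((m - grand) ** 2 for m in means) / (n_q - 1)
    return math.sqrt(max(0.0, (ms_between - ms_within) / n_p))

def tail_probability(sigma_w, sigma_z, s_w_obs, n_q=10, n_p=25, n_sim=2000, seed=0):
    """Fraction of simulated studies whose estimated s_w is >= the observed value."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(n_sim):
        y = []
        for _q in range(n_q):
            w = rng.gauss(0.0, sigma_w)  # physician effect, shared over patients
            y.append([w + rng.gauss(0.0, sigma_z) for _p in range(n_p)])
        if estimate_s_w(y) >= s_w_obs:
            hits += 1
    return hits / n_sim

# Sanity checks at the extremes of the observed value.
p_impossible = tail_probability(0.0, 1.0, s_w_obs=5.0, n_sim=200)
p_certain = tail_probability(1.0, 1.0, s_w_obs=0.0, n_sim=200)
```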
Results
Normality
The differences between delineations from individual physicians and the average, i.e. ${y}_{\mathrm{p},\mathrm{q}}^{\mathrm{var}}$, were approximately normally distributed for all scored variables. There was however a tendency that the largest deviations were larger than predicted with a Gaussian model, especially pronounced for the posterior, right-posterior and left-posterior directions for the prostate and for the volume and anterior direction for the vesicles.
Delineation summary
Table 2: Average delineated prostate and vesicle volumes for patients from the different centers, and the mean relative standard deviations between the physicians
Imaging center (IC) | Prostate: average volume (cm^{3}) | Prostate: mean relative standard deviation | Vesicles: average volume (cm^{3}) | Vesicles: mean relative standard deviation |
---|---|---|---|---|
A | 44 | 18% | 12 | 22% |
B | 43 | 18% | 14 | 33% |
C | 43 | 18% | 9 | 33% |
D | 63^{(1)} | 17% | 24^{(1)} | 44% |
E | 37 | 17% | 9 | 37% |
approximately the same for the different imaging centers (Table 2).
Variability for different patients
The variability among the physicians differed for different patients, as can be seen in Tables 3 and 4 giving the median, max and min of the standard deviation for individual patients, i.e. $\mathit{max};\mathit{min};\mathit{median}{\left[S{\left[{y}_{\mathrm{p},\mathrm{q}}^{\mathrm{var}}\right]}_{\mathrm{q}}\right]}_{\mathrm{p}}$.
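The per-patient summary can be illustrated with a short sketch: compute the standard deviation over physicians for each patient, then take the median, maximum and minimum over patients. The use of the sample standard deviation (n − 1 in the denominator), the function name and the toy data are assumptions.

```python
import statistics

def patient_variability_summary(y_by_patient):
    """Median, max and min over patients of the per-patient standard deviation.

    y_by_patient maps a patient label to the deviations y for all physicians.
    """
    per_patient_sd = [statistics.stdev(values) for values in y_by_patient.values()]
    return {
        "median": statistics.median(per_patient_sd),
        "max": max(per_patient_sd),
        "min": min(per_patient_sd),
    }

# Toy deviations (mm) for three hypothetical patients and three physicians.
summary = patient_variability_summary({
    "p1": [-1.0, 0.0, 1.0],
    "p2": [-2.0, 0.0, 2.0],
    "p3": [-3.0, 0.0, 3.0],
})
```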
For the prostate, the highest frequency of large deviations $\left({y}_{\mathrm{p},\mathrm{q}}^{\mathrm{var}}>4\phantom{\rule{0.25em}{0ex}}\mathrm{mm}\right)$ was found in the inferior and anterior directions (8% each), followed by the superior direction (6%). The lowest frequency was found in the right, left and posterior directions (below 3%), while the frequency was around 5% in the posterior-left and posterior-right directions.
For the vesicles the variability was larger as can be seen by comparing Tables 3 and 4. The highest frequency of large deviations $\left({y}_{\mathrm{p},\mathrm{q}}^{\mathrm{var}}>8\phantom{\rule{0.25em}{0ex}}\mathrm{mm}\right)$ was found in the right and left directions (6%), while the frequency was around 2% in the other directions.
Physician variability
The influence of the delineating physician was large and significant for all investigated variables (p < 0.01, Kruskal-Wallis test, SPSS). Tables 5 and 6 give the intra- and inter-physician variability (1 std) for the different delineation variables, for the prostate and the seminal vesicles, together with the 95% confidence interval for the true variability.
Variability for different Sequences
All centers participating in the study used T2 weighted images for target delineation. There were, however, noticeable differences in the image contrast, as can be appreciated in Figure 4. Tables 7 and 8 give the variability scored for different imaging centers, i.e. $S{\left[{y}_{\mathrm{p},\mathrm{q}}^{\mathrm{var}}\right]}_{\mathrm{p}\in \mathrm{IC},\mathrm{q}\in \mathrm{Q}}$, together with the variability for physicians delineating on images from their home center, i.e. $S{\left[{y}_{\mathrm{p},\mathrm{q}}^{\mathrm{var}}\right]}_{\left\{\mathrm{p},\mathrm{q}\left|\mathrm{IC}=\mathrm{DC}\right.\right\}}$, and images from the other (foreign) centers, i.e. $S{\left[{y}_{\mathrm{p},\mathrm{q}}^{\mathrm{var}}\right]}_{\left\{\mathrm{p},\mathrm{q}\left|\mathrm{IC}\ne \mathrm{DC}\right.\right\}}$.
Table 3: The median, maximum and minimum observed variability for prostate delineation for an individual patient (1 std)
Statistic | Anterior (mm) | Posterior (mm) | RightPost (mm) | Right (mm) | LeftPost (mm) | Left (mm) | Superior (mm) | Inferior (mm) | Volume (cm^{3}) |
---|---|---|---|---|---|---|---|---|---|
Median | 1.9 | 0.8 | 1.5 | 1.4 | 1.7 | 1.4 | 1.8 | 2.5 | 7.4 |
Max | 4.4 | 4.6 | 4.0 | 3.3 | 4.0 | 2.8 | 4.0 | 4.0 | 13.6 |
Min | 0.9 | 0.4 | 0.9 | 0.9 | 0.7 | 0.6 | 1.1 | 1.0 | 2.8 |
This was the pattern for all scored variables for the prostate, and was significant for the posterior and right-posterior directions (2-sided F-test, p < 0.01, Bonferroni corrected). For the vesicles the pattern was similar. The variability was significantly lower in the right and left directions for images from the physicians’ home center than for images from foreign centers (2-sided F-test, p < 0.01, Bonferroni corrected).
Discussion
Delineation errors have a direct effect on the quality of the treatment. An excessive target volume entails unnecessary risk of complications, while an undersized target reduces the chance of cure. The relationship between the target definition variability and the extent of the optimal margin to compensate for geometrical uncertainties is not completely clear. From a local control perspective the target definition variability should be considered a systematic uncertainty affecting the entire treatment, and should therefore be reflected in the employed margins. However, the uncertainty in the delineation is heavily dependent on both physician (Tables 5 and 6) and patient (Tables 3 and 4), which makes it inadequate to employ generalized margins to account for the variability. The opinion of the authors of the present work is that the responsibility to account for the delineation uncertainty should be placed on the physicians. The target volume should be delineated to cover the volume that the physician wants to treat: actively including volumes that are of benefit for the patient, taking both local control probability and risk of side effects into account, and actively excluding volumes that, for example, are close to sensitive healthy tissues and where the probability of tumor growth is considered small. When only one physician delineates the target, the delineation from that physician is the best available estimate of the correctly defined target volume and should therefore be used without any additional generic margin accounting for the variability. Hence, the primary effect of improved imaging leading to decreased variability will not be a general possibility to reduce the standard margins, but will rather be reflected in a more uniform and generally increased treatment quality. Improved consistency will, as a secondary effect, improve the statistical power when evaluating and optimizing treatment protocols in clinical trials.
A way to decrease variability is through training and experience [24]. In Tables 7 and 8 it can be seen that physicians’ delineations on images from their home center were generally closer to the average than their delineations on images from foreign centers. In some directions the difference was up to 40% (for example the posterior direction for the prostate). This effect may be attributed to familiarity and experience with the local MR sequence. Another way to potentially decrease the variability is to optimize the MR sequence. It is, however, ambitious to optimize with respect to the delineation variability. Data from the present work do not indicate reduced variability when using a 3 T scanner (center B), but the observations for a single 3 T scanner and only one sequence may not be representative.
Table 4: The median, maximum and minimum observed variability for seminal vesicle delineation for an individual patient (1 std)
Statistic | Anterior (mm) | Posterior (mm) | Right (mm) | Left (mm) | Superior (mm) | Inferior (mm) | Volume (cm^{3}) |
---|---|---|---|---|---|---|---|
Median | 3.2 | 2.0 | 2.9 | 3.7 | 2.8 | 3.3 | 3.0 |
Max | 7.2 | 7.5 | 8.5 | 7.6 | 5.8 | 8.0 | 16.8 |
Min | 1.5 | 0.8 | 1.3 | 0.7 | 1.3 | 1.5 | 1.5 |
Table 5: Separation between intra- and inter-physician variability (1 std) for the prostate in different directions and for the volume, with 95% confidence intervals (CI) in parentheses
Component | Anterior (mm) | Posterior (mm) | RightPost (mm) | Right (mm) | LeftPost (mm) | Left (mm) | Superior (mm) | Inferior (mm) | Volume (cm^{3}) |
---|---|---|---|---|---|---|---|---|---|
Intra | 1.7 | 1.4 | 1.6 | 1.4 | 1.6 | 1.3 | 1.5 | 2.0 | 5.5 |
Intra, 95% CI | (1.9-1.6) | (1.5-1.3) | (1.7-1.4) | (1.6-1.3) | (1.8-1.5) | (1.5-1.2) | (1.6-1.3) | (2.2-1.8) | (6.1-5.0) |
Inter | 1.5 | 0.7 | 1.3 | 1.0 | 1.4 | 0.9 | 1.6 | 1.7 | 6.03 |
Inter, 95% CI | (2.8-1.0) | (1.3-0.4) | (2.4-0.9) | (1.75-0.6) | (2.5-0.9) | (1.7-0.6) | (2.9-1.1) | (3.1-1.1) | (11.0-4.0) |
Tables 7 and 8 show that the variability depends on the origin of the images (home vs. foreign center), as mentioned above. This phenomenon was not accounted for in the separation model. The numbers in Tables 5 and 6 are representative values describing the observations in the present study, but should be interpreted with these reservations in mind.
A concern when setting up the study was that the observed overall variability would primarily reflect the use of different clinical routines and traditions at the different centers. The separation of the variability into inter- and intra-physician components did, however, reveal that the intra-physician variability dominated both for the prostate and especially for the seminal vesicles. There were significant differences between delineation centers: the physicians from centers A and D on average delineated 20-30% larger prostate volumes than the physicians from centers B, C and E. The dominating source of variability in individual directions was nevertheless the intra-physician variability. The increase in overall variability for the seminal vesicle delineation compared to the prostate delineation could be fully attributed to the larger intra-physician variability.
The inter-physician variability observed in the present study, summarized in Table 5, is approximately in line with the observations described in the literature. Rasch et al. found an inter-physician variability in the inferior region (apex) and superior region of around 1 mm (1.7 mm and 1.6 mm in the present study) using axial MR images for 18 patients and with 3 observers [16]. The intra-observer variability was around 3 mm in both regions (2.0 mm and 1.5 mm in the present study). It should be noted that Rasch et al. used a similar separation of variance as utilized in the present work, but the low number of physicians makes the estimates for the inter-physician variability uncertain. Smith et al. reported an inter-observer volume variability of 4.6 cm^{3} (6.1 cm^{3} in the present study), and an intra-physician volume variability of 2.7 cm^{3} based on repeated observations on the same patient (5.1 cm^{3} in the present study), in a study with 10 patients and 7 observers [26]. The large difference between the intra-physician variability in the present study and that in the study by Smith et al. could be due to their use of repeated delineations on the same image to estimate the intra-physician variability, as opposed to separation of variances.
Table 6: Separation between intra- and inter-physician variability (1 std) for the seminal vesicles, with 95% confidence intervals (CI) in parentheses
Component | Anterior (mm) | Posterior (mm) | Right (mm) | Left (mm) | Superior (mm) | Inferior (mm) | Volume (cm^{3}) |
---|---|---|---|---|---|---|---|
Intra | 3.9 | 2.9 | 3.6 | 3.8 | 2.9 | 3.5 | 5.7 |
Intra, 95% CI | (4.3-3.6) | (3.2-2.6) | (4.0-3.3) | (4.2-3.5) | (3.2-2.7) | (3.8-3.1) | (6.3-5.2) |
Inter | 1.6 | 1.0 | 0.8 | 1.3 | 1.6 | 1.7 | 2.8 |
Inter, 95% CI | (3.1-0.9) | (2.1-0.6) | (1.8-0.2) | (2.7-0.7) | (3.0-1.0) | (3.2-1.0) | (5.4-1.7) |
comparison with Fiorino et al. does not reveal any substantial decrease in the variability when using MR compared to CT. This indicates that the benefit of MR is more in terms of accuracy than precision. To enable comparison with the results from Fiorino et al. in the right-left direction, the variability in the right and left directions from the present work was added together, assuming these are independent variables.
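For independent variables, standard deviations combine in quadrature, which is presumably what "added together" means here. A minimal sketch under that assumption, using the intra-physician values for the right and left directions of the prostate from Table 5 as an illustration (the function name is hypothetical):

```python
import math

def combine_independent(sd_a, sd_b):
    """Standard deviation of the sum of two independent variables."""
    return math.hypot(sd_a, sd_b)

# Right (1.4 mm) and left (1.3 mm) intra-physician variability, prostate.
right_left_sd = combine_independent(1.4, 1.3)
```

With these inputs the combined right-left variability is about 1.9 mm, smaller than the plain sum of 2.7 mm that would apply if the two directions were perfectly correlated.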
Table 7: Mean standard deviation for imaging sites (prostate)
Imaging center (IC) | Anterior (mm) | Posterior (mm) | RightPost (mm) | Right (mm) | LeftPost (mm) | Left (mm) | Superior (mm) | Inferior (mm) | Volume (cm^{3}) |
---|---|---|---|---|---|---|---|---|---|
A | 1.6 | 0.9 | 1.8 | 1.8 | 1.7 | 1.5 | 1.9 | 2.3 | 6.6 |
B | 3.3 | 0.9 | 1.6 | 1.5 | 1.5 | 1.2 | 2.7 | 2.5 | 7.7 |
C | 1.8 | 0.9 | 1.3 | 1.5 | 1.6 | 1.2 | 2.0 | 2.3 | 6.7 |
D | 2.4 | 2.5 | 2.7 | 1.4 | 2.7 | 1.6 | 2.0 | 2.9 | 10.8 |
E | 1.4 | 1.4 | 2.2 | 2.0 | 2.4 | 2.0 | 1.7 | 2.3 | 6.4 |
Home | 1.7 | 0.9 | 1.4 | 1.3 | 1.6 | 1.4 | 1.8 | 2.5 | 6.1 |
Foreign | 2.3 | 1.6 | 2.1 | 1.7 | 2.1 | 1.6 | 2.1 | 2.5 | 8.1 |
where DF is the degrees of freedom. For the intra-physician variation in the present study DF = (N_{Q} - 1)(N_{P} - 1). The confidence interval for ${\sigma}_{w}^{\mathrm{var}}$ can also be estimated using equation 6 (DF = N_{Q} - 1), but especially when ${\sigma}_{w}^{\mathrm{var}}\ll {\sigma}_{z}^{\mathrm{var}}$ this estimation will lead to an underestimation of the confidence interval. For the prostate, where the intra-physician and inter-physician variability were of the same magnitude, the use of equation 6 gives approximately the same results as simulation; in the inferior direction, for example, the simulations and equation 6 gave equivalent results to within 0.1 mm. For the vesicles, however, where the intra-physician variability was larger than the inter-physician variability, the use of equation 6 underestimates the confidence interval. For example, the confidence interval for the inter-physician variability in the posterior direction was simulated to 2.1-0.6 mm, while equation 6 gave 1.9-0.7 mm. This can be understood by considering a scenario with a very large intra-observer variation, which creates random variations between physicians and makes the inter-observer variation difficult to quantify. The reporting of confidence intervals is very important, especially when using small sample sizes and/or separation of variances into components.
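Equation 6 itself is not reproduced in this copy of the text. A reasonable assumption is that it is the standard chi-square interval for a standard deviation, in which the bounds are s·sqrt(DF/χ²) evaluated at the upper and lower chi-square quantiles. The sketch below implements that interval using only the Python standard library; the chi-square quantile uses the Wilson-Hilferty approximation rather than an exact inverse, and all function names are hypothetical.

```python
import math
from statistics import NormalDist

def chi2_quantile(p, df):
    """Wilson-Hilferty approximation to the chi-square quantile function."""
    z = NormalDist().inv_cdf(p)
    a = 2.0 / (9.0 * df)
    return df * (1.0 - a + z * math.sqrt(a)) ** 3

def sigma_confidence_interval(s, df, level=0.95):
    """Approximate confidence interval (lower, upper) for a true standard deviation."""
    alpha = 1.0 - level
    lower = s * math.sqrt(df / chi2_quantile(1.0 - alpha / 2.0, df))
    upper = s * math.sqrt(df / chi2_quantile(alpha / 2.0, df))
    return lower, upper

# Intra-physician variability, prostate, inferior direction: s = 2.0 mm,
# with 10 physicians and 25 patients, so DF = (10 - 1) * (25 - 1) = 216.
lower, upper = sigma_confidence_interval(2.0, df=(10 - 1) * (25 - 1))
```

Under these assumptions the interval comes out at roughly 1.8 to 2.2 mm, which agrees with the simulated interval reported for that direction in Table 5.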
Table 8: Mean standard deviation for imaging sites (vesicles)
Imaging center (IC) | Anterior (mm) | Posterior (mm) | Right (mm) | Left (mm) | Superior (mm) | Inferior (mm) | Volume (cm^{3}) |
---|---|---|---|---|---|---|---|
A | 3.2 | 1.2 | 2.6 | 3.0 | 2.3 | 2.2 | 2.7 |
B | 3.0 | 3.4 | 3.1 | 4.2 | 3.8 | 3.6 | 5.2 |
C | 4.0 | 1.8 | 2.9 | 2.9 | 2.9 | 2.9 | 3.7 |
D | 6.2 | 4.0 | 5.6 | 5.3 | 4.0 | 5.1 | 11.3 |
E | 2.8 | 3.4 | 3.6 | 3.4 | 2.7 | 3.9 | 2.8 |
Home | 3.9 | 2.4 | 2.9 | 2.7 | 3.2 | 3.9 | 5.2 |
Foreign | 4.0 | 3.1 | 5.0 | 4.1 | 3.2 | 3.6 | 6.2 |
prostate from the 10 radiation oncologists was close to the opinion of the radiologists, while the delineations of the vesicles performed by the radiation oncologists tended to overestimate the extent of the seminal vesicles for some patients, especially in the anterior and right-left directions. The radiologists preferred the image quality provided by center B, followed by the image quality from center D. It is interesting to note that the images from these sites were associated with the largest delineation variability. Our interpretation is that an increased amount of information increases the scope for interpretation and hence the importance of training and experience. It also highlights the complexity of the optimization procedure and the importance of a well-defined objective for the optimization. If the objective is to reduce the delineation variability of the prostate or the seminal vesicles, it could be counter-productive to use sequences optimized to visualize pathology. Our opinion is that recommendations on specific sequence settings are difficult to make because of the different needs and possibilities at different centers. For example, if high quality diagnostic images are already available for a patient, there is less need to acquire images optimized for pathology.
Conclusion
The overall intra- and inter-physician variability for prostate and seminal vesicle delineations was determined for clinically used MR sequences optimized for target volume determination at 5 different radiotherapy centers in Europe. Large differences in variability were observed between different patients, but also between different MR sequences, even though all centers used T2-weighted spin-echo based sequences. The intra-physician variability was significantly larger for the seminal vesicles compared to the prostate, while the inter-physician variability was approximately the same.
References
- Bortfeld T: IMRT: a review and preview. Phys Med Biol 2006, 51: R363-R379. 10.1088/0031-9155/51/13/R21
- Chen GTY, Sharp GC, Mori S: A review of image-guided radiotherapy. Radiol Phys Technol 2009, 2: 1-12. 10.1007/s12194-008-0045-y
- Njeh CF: Tumor delineation: The weakest link in the search for accuracy in radiotherapy. J Med Phys 2008, 33: 136-140. 10.4103/0971-6203.44472
- Rasch C, Steenbakkers R, van Herk M: Target Definition in Prostate, Head, and Neck. Semin Radiat Oncol 2005, 15: 136-145. 10.1016/j.semradonc.2005.01.005
- Khoo VS, Joon DL: New developments in MRI for target volume delineation in radiotherapy. Br J Radiol 2006, 79 Spec No: S2-S15.
- Karlsson M, Karlsson MG, Nyholm T, Amies C, Zackrisson B: Dedicated magnetic resonance imaging in the radiotherapy clinic. Int J Radiat Oncol Biol Phys 2009, 74: 644-651. 10.1016/j.ijrobp.2009.01.065
- Hanvey S, Glegg M, Foster J: Magnetic resonance imaging for radiotherapy planning of brain cancer patients using immobilization and surface coils. Phys Med Biol 2009, 54: 5381-5394. 10.1088/0031-9155/54/18/002
- McJury M, O’Neill A, Lawson M, McGrath C, Grey A, Page W, O’Sullivan JM: Assessing the image quality of pelvic MR images acquired with a flat couch for radiotherapy treatment planning. Br J Radiol 2011, 84: 750-755. 10.1259/bjr/27295679
- Fransson A, Andreo P, Pötter R: Aspects of MR Image Distortions in Radiotherapy Treatment Planning. Strahlentherapie 2001, 177: 59-73. 10.1007/PL00002385
- Nyholm T, Nyberg M, Karlsson MG, Karlsson M: Systematisation of spatial uncertainties for comparison between a MR and a CT-based radiotherapy workflow for prostate treatments. Radiat Oncol 2009, 4: 54. 10.1186/1748-717X-4-54
- Ulin K, Urie MM, Cherlow JM: Results of a multi-institutional benchmark test for cranial CT/MR image registration. Int J Radiat Oncol Biol Phys 2010, 77: 1584-1589. 10.1016/j.ijrobp.2009.10.017
- Prabhakar R, Julka PK, Ganesh T, Munshi A, Joshi RC, Rath GK: Feasibility of using MRI alone for 3D radiation treatment planning in brain tumors. Jpn J Clin Oncol 2007, 37: 405-411. 10.1093/jjco/hym050
- Beavis AW, Gibbs P, Dealey RA, Whitton VJ: Radiotherapy treatment planning of brain tumours using MRI alone. Br J Radiol 1998, 71: 544-548.
- Lee YK, Bollet M, Charles-Edwards G, Flower MA, Leach MO, McNair H, Moore E, Rowbottom C, Webb S: Radiotherapy treatment planning of prostate cancer using magnetic resonance imaging alone. Radiother Oncol 2003, 66: 203-216.
- Buhl SK, Duun-Christensen AK, Kristensen BH, Behrens CF: Clinical evaluation of 3D/3D MRI-CBCT automatching on brain tumors for online patient setup verification - A step towards MRI-based treatment planning. Acta Oncol 2010, 49: 1085-1091. 10.3109/0284186X.2010.498442
- Rasch C, Barillot I, Remeijer P, Touw A, van Herk M, Lebesque JV: Definition of the prostate in CT and MRI: a multi-observer study. Int J Radiat Oncol Biol Phys 1999, 43: 57-66. 10.1016/S0360-3016(98)00351-4
- Villeirs GM, Van Vaerenbergh K, Vakaet L, Bral S, Claus F, De Neve WJ, Verstraete KL, De Meerleer GO: Interobserver delineation variation using CT versus combined CT + MRI in intensity-modulated radiotherapy for prostate cancer. Strahlenther Onkol 2005, 181: 424-430. 10.1007/s00066-005-1383-x
- Debois M, Oyen R, Maes F, Verswijvel G, Gatti G, Bosmans H, Feron M, Bellon E, Kutcher G, van Poppel H, Vanuytsel L: The contribution of magnetic resonance imaging to the three-dimensional treatment planning of localized prostate cancer. Int J Radiat Oncol Biol Phys 1999, 45: 857-865. 10.1016/S0360-3016(99)00288-6
- Wachter S, Wachter-Gerstner N, Bock T, Goldner G, Kovacs G, Fransson A, Pötter R: Interobserver Comparison of CT and MRI-Based Prostate Apex Definition. Strahlenther Onkol 2002, 178: 263-268. 10.1007/s00066-002-0907-x
- Jonsson JH, Karlsson MG, Karlsson M, Nyholm T: Treatment planning using MRI data: an analysis of the dose calculation accuracy for different treatment regions. Radiat Oncol 2010, 5: 62. 10.1186/1748-717X-5-62
- Lambert J, Greer PB, Menk F, Patterson J, Parker J, Dahl K, Gupta S, Capp A, Wratten C, Tang C, Kumar M, Dowling J, Hauville S, Hughes C, Fisher K, Lau P, Denham JW, Salvado O: MRI-guided prostate radiation therapy planning: Investigation of dosimetric accuracy of MRI-based dose planning. Radiother Oncol 2011, 98: 330-334. 10.1016/j.radonc.2011.01.012
- Jonsson JH, Garpebring A, Karlsson MG, Nyholm T: Internal Fiducial Markers and Susceptibility Effects in MRI-Simulation and Measurement of Spatial Accuracy. Int J Radiat Oncol Biol Phys 2012, 82(5): 1612-1618. 10.1016/j.ijrobp.2011.01.046
- Fisher RA: Statistical Methods for Research Workers. Edinburgh: Oliver and Boyd; 1925.
- Khoo ELH, Schick K, Plank AW, Poulsen M, Wong WWG, Middleton M, Martin JM: Prostate Contouring Variation: Can It Be Fixed? Int J Radiat Oncol Biol Phys 2012, 82(5): 1923-1929. 10.1016/j.ijrobp.2011.02.050
- Remeijer P, Rasch C, Lebesque JV, van Herk M: A general methodology for three-dimensional analysis of variation in target volume delineation. Med Phys 1999, 26: 931-940. 10.1118/1.598485
- Smith WL, Lewis C, Bauman G, Rodrigues G, D’Souza D, Ash R, Ho D, Venkatesan V, Downey D, Fenster A: Prostate volume contouring: a 3D analysis of segmentation using 3DTRUS, CT, and MR. Int J Radiat Oncol Biol Phys 2007, 67: 1238-1247. 10.1016/j.ijrobp.2006.11.027
- Fiorino C, Reni M, Bolognesi A, Cattaneo GM, Calandrino R: Intra- and inter-observer variability in contouring prostate and seminal vesicles: implications for conformal treatment planning. Radiother Oncol 1998, 47: 285-292. 10.1016/S0167-8140(98)00021-8
Copyright
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.