Abstract
Purpose
The manual delineation of organs at risk is a process that requires a great deal of time both for the technician and for the physician. Availability of validated software tools assisted by artificial intelligence would be of great benefit, as it would significantly improve the radiation therapy workflow, reducing the time required for segmentation. The purpose of this article is to validate the deep learning–based autocontouring solution integrated in syngo.via RT Image Suite VB40 (Siemens Healthineers, Forchheim, Germany).
Methods and Materials
For this purpose, we have used our own specific qualitative classification system, RANK, to evaluate more than 600 contours corresponding to 18 different automatically delineated organs at risk. Computed tomography data sets of 95 different patients were included: 30 patients with lung, 30 patients with breast, and 35 male patients with pelvic cancer. The automatically generated structures were reviewed in the Eclipse Contouring module independently by 3 observers: an expert physician, an expert technician, and a junior physician.
Results
There is a statistically significant difference between the Dice coefficient associated with RANK 4 compared with the coefficient associated with RANKs 2 and 3 (P < .001). In total, 64% of the evaluated structures received the maximum score, 4. Only 1% of the structures were classified with the lowest score, 1. The time savings for breast, thorax, and pelvis were 87.6%, 93.5%, and 82.2%, respectively.
Conclusions
Siemens’ syngo.via RT Image Suite offers good autocontouring results and significant time savings.
Introduction
The development of a radiation therapy (RT) plan requires images of the tumor and the normal structures surrounding it, and it must be possible to delineate both the tumor (gross tumor volume) and the organs at risk (OAR). These delineations are made manually by the RT oncologist or technician, with the precise segmentation of the gross tumor volumes and OARs forming the basis for good RT. To be able to increase reproducibility and reduce uncertainty and errors, international or institutional guidelines, contouring atlases, case libraries, and numerous recommendations exist.
1- Vinod SK
- Jameson MG
- Min M
- Holloway LC.
Uncertainties in volume delineation in radiation oncology: A systematic review and recommendations for future studies.
, 2- Vinod SK
- Min M
- Jameson MG
- Holloway LC.
A review of interventions to reduce inter-observer variability in volume delineation in radiation oncology.
, 3- Wright JL
- Yom SS
- Awan MJ
- et al.
Standardizing normal tissue contouring for radiation therapy treatment planning: An ASTRO Consensus Paper.
The manual delineation of critical organs is a process that requires a great deal of time both for the technician and for the physician. Having validated software tools assisted by artificial intelligence would be of great benefit, as these tools would significantly improve the RT workflow, reducing the time required for segmentation.
4- Boldrini L
- Bibault J
- Masciocchi C
- Shen Y
- Bittner M.
Deep learning: A review for the radiation oncologist.
, 5- Giraud P
- Giraud P
- Gasnier A
- et al.
Radiomics and machine learning for radiotherapy in head and neck cancers.
, 6- Oktay O
- Nanavati J
- Schwaighofer A
- et al.
Evaluation of deep learning to augment image-guided radiotherapy for head and neck and prostate cancers.
These tools also may contribute toward reducing interobserver variability, standardizing the critical organs delineated in different clinical pathologies, and reducing uncertainty when comparing results between them.
7- Men K
- Geng H
- Biswas T
- Liao Z
- Xiao Y.
Automated quality assurance of OAR contouring for lung cancer based on segmentation with deep active learning.
Syngo.via RT Image Suite is the dedicated multimodality imaging software solution for RT offered by Siemens Healthineers (Forchheim, Germany) including automatic contouring of organs required for RT planning. The automatic contouring algorithm is based on deep learning, and it has been trained and validated with patient images from multiple institutions across the world.
To use the contours generated automatically by a software program for assistance, they must be validated. The Dice index is often used to perform this type of validation. Also known as similarity coefficient, the Dice index is a statistic widely used in different specialties
to gauge the similarity of 2 samples. However, it has limitations, such as the effect of the size of the structure and clinical interpretability, because it does not provide local information. A qualitative classification system with a clear and straightforward clinical interpretation may be more suitable.
, 9- Liu C
- Gardner SJ
- Wen N
- et al.
Automatic segmentation of the prostate on CT images using deep neural networks (DNN).
, 10- Cardenas CE
- McCarroll RE
- Court LE
- et al.
Deep learning algorithm for auto-delineation of high-risk oropharyngeal clinical target volumes with built-in Dice similarity coefficient parameter optimization function.
In this work, we present the validation of 600 contours corresponding to 18 different OARs delineated automatically by syngo.via RT Image Suite VB40. For this purpose, we have used a specific qualitative classification system, RANK.
Methods and Materials
Algorithm description
The autocontouring algorithm, developed by Siemens Healthineers for syngo.via RT Image Suite, computes segmentation masks of an organ or anatomic structure for a given computed tomography (CT) data set.
11- Marschner S
- Datar M
- Gaasch A
- et al.
A deep image-to-image network organ segmentation algorithm for radiation treatment planning: Principles and evaluation.
The segmentation is performed on the region of interest of individual organs instead of on the entire image volume. To locate the regions of interest, anatomic landmarks (including vessel bifurcations, bony structures, organ center, and boundary points) are detected in the input image based on Ghesu et al.
12- Ghesu F-C
- Georgescu B
- Zheng Y
- et al.
Multi-scale deep reinforcement learning for real-time 3D-landmark detection in CT scans.
A Deep Image-to-Image Network (DI2IN) is employed for organ segmentation according to the technique described by Yang et al.
13- Yang D
- Xu D
- Zhou K
- et al.
Automatic liver segmentation using an adversarial image-to-image network.
It consists of a convolutional encoder–decoder architecture combined with multilevel feature concatenation. During the training process, the network is driven by a cross-entropy loss based on a learning rate of 0.001 using the Adam optimization. An adversarial network is selectively used to regularize the training process of DI2IN by discriminating the output of DI2IN from the ground truth in a patch-by-patch manner using binary cross-entropy.
Radiation therapy images from multiple institutions across Europe, Asia, and North and South America were used for training and validation. Because the application only segments normal anatomy, no ethnicity-based dependency is expected. Depending on the segmented organ, the number of annotated patients in the training set ranges from few hundreds to many thousands. The algorithm is trained with a fixed number of cases during the process of product development; it does not learn from new cases after product release and cannot be retrained in the hospital itself.
The training, validation, and testing data were annotated based on predefined clinical annotation protocols. These protocols were defined based on guidelines from the Radiation Therapy Oncology Group.
A team of experienced radiologists and a radiation oncologist mentored and supervised the entire process. For each organ, a detailed protocol was established based on the aforementioned international guidelines. A quality assessment based on peer review was performed for each data set before further use in model training and validation.
Patient selection
Computed tomography images from 105 different patients were included: 30 patients with lung, 30 patients with breast, and 35 patients with prostate cancer. All the CT scans were performed with a 64-slice, single-source, flat-table SOMATOM Confidence scanner (Siemens Healthineers). The characteristics of the CT scanning protocols are described in the Appendix. The CT scans of patients with lung cancer were acquired in the supine position with the stereotactic body RT Conchest Supra (Lorca Marin S.A., Murcia, Spain) immobilization system and respiratory control using the RGSC (Respiratory Gating for Scanners) system (Varian Medical Systems, Palo Alto, CA). For the contouring of OARs, the “Average” series was used. Patients with breast cancer were positioned with their arms abducted and were immobilized with the same system as those with lung cancer. The CT scan was performed with deep breaths and with respiratory control for all left breasts, as well as for right breasts requiring radiation of the ganglia regions, provided the patient could complete it. In total, there were 15 patients with the right breast affected (3 breath hold and 12 free breathing) and 15 patients with the left breast affected (12 breath hold and 3 free breathing). All CT scans were performed with the patient in the supine position.
Workflow
The CT images were sent from our CT scanner to syngo.via RT Image Suite VB40 software and ARIA (Varian Medical Systems). The structures were manually segmented in ARIA following the usual clinical workflow established in our service: an expert technician segments the structures and then a radiation oncologist reviews and validates them. In parallel, all studies were automatically contoured with the syngo.via RT Image Suite VB40 software, which is based on a server-client system.
The autocontoured organs that were evaluated in this study are shown in
Table 1. The heart was not taken into consideration, because our own protocol has changed in relation to the craniocaudal limits and presented discrepancies with the guidelines used by Siemens Healthineers. The bilateral organs (lung and breast) were delineated separately. The body structure corresponds to the patient's external contour.
Once the CT had been performed and the study concluded, the images were sent to syngo.via, which automatically contoured the structures assigned in the model. Approximately 2 minutes later, the images together with the set of structures were filed in a network folder without having to open syngo.via. The autocontoured structures from syngo.via were then reviewed in the Eclipse Contouring module independently by 3 observers: an expert physician, an expert technician, and a junior physician (completely independent of the clinical course of the patient and their manually contoured structures). The evaluation was done based on a categorical system of 4 values (
Table 2). The specifics of the respective RT treatment were taken into account. In patients with prostate cancer, for example, the upper parts of the abdominal cavity were not considered. In cases, for which the contouring of an additional organ was required for treatment, this was performed manually by the technician or physician responsible. Subsequently, all required structures were reviewed and edited. The responsible physician then delineated the planning target volume, checked the OARs, and approved all the structures. Before the dosimetric planning, all the patients underwent a clinical session in which a multidisciplinary group consisting of physicians, physicists, and technicians rechecked the correct delineation of the structures. Images from other modalities such as magnetic resonance imaging or positron emission tomography were not required in the process.
Table 2Categories of the specific qualitative evaluation system, RANK
Abbreviation: RT = radiation therapy.
Evaluations and measurements
The Dice coefficient was measured by means of the Varian Eclipse 16.1 software for each of the autosegmented organs, considering manual segmentation as ground truth. Each segmented structure, together with its corresponding Dice value, was assigned to a group based on its RANK value. The RANK 1 category was excluded for this part of the analysis because it corresponds to segmentations that could not be used and had to be completely redone manually.
The time invested in delineating the OARs was self-timed, after opening the CT series, manually for 19 cases (6 breast, 6 thorax, and 7 pelvis) in the Eclipse “Contouring” module. The contouring time of OARs not included in the study has not been considered. Similarly, the time taken by the expert RT technician to review and correct the autocontoured OARs was recorded. With Excel 365 (Microsoft, Redmond, WA), a t test analysis was performed to investigate the correlation between the 4-level categorical system and the Dice coefficient.
Discussion
Current techniques such as intensity modulated RT or volumetric modulated arc therapy require the contouring of various critical OARs for treatment planning. The definition and reliable segmentation of the normal tissues also have significant clinical and dosimetric consequences. For OARs that limit the dose in particular, the more information is provided to the system, the better the result will be. However, if performed manually, this process requires a significant amount of time and effort of the expert clinician. Autocontouring software plays an important role in the future of RT and will be an essential tool to manage the increasing patient volumes in cancer.
Artificial intelligence is demonstrating its importance with automatic contouring tools and autoplanning systems.
15- Brouwer CL
- Steenbakkers RJ
- Bourhis J
- et al.
CT-based delineation of organs at risk in the head and neck region: DAHANCA, EORTC, GORTEC, HKNPCSG, NCIC CTG, NCRI, NRG Oncology and TROG consensus guidelines.
, 16- Kaderka R
- Hild SJ
- Bry VN
- et al.
Wide-scale clinical implementation of knowledge-based planning: An investigation of workforce efficiency, need for post-automation refinement, and data-driven model maintenance.
, 17- Chen X
- Men K
- Li Y
- Yi J
- Dai J.
A feasibility study on an automated method to generate patient-specific dose distributions for radiotherapy using deep learning.
, 18- Nguyen D
- Jia X
- Sher D
- et al.
3D radiotherapy dose prediction on head and neck cancer patients with a hierarchically densely connected U-Net deep learning architecture.
New technologies will appear in the short term
19- Poortmans PM
- Takanen S
- Marta GN
- Meattini I
- Kaidar-Person O.
Winter is over: The use of artificial intelligence to individualise radiation therapy for breast cancer.
that will change the way of working, as happened with the appearance of 3-dimensional RT.
The atlas-based automatic contouring systems
20- Casati M
- Piffer S
- Calusi S
- et al.
Methodological approach to create an atlas using a commercial auto-contouring software.
significantly improve precision and reduce variability, which is further improved by systems based on artificial intelligence. Both can reduce the time taken and improve the workflow. Accordingly, we have verified the clinical applicability of syngo.via RT Image Suite in terms of the time invested and the clinical benefit of deep learning– based autocontouring.
Comparisons between contouring systems by means of geometric overlap metrics may not lead to meaningful results. Therefore, as proposed by Brouwer et al,
21- Brouwer CL
- Boukerroui D
- Oliveira J
- et al.
Assessment of manual adjustment performed in clinical practice following deep learning contouring for head and neck organs at risk in radiotherapy.
we decided to evaluate the manual adjustments made to the automatic contours to be able to establish the applicability of the system.
The performance of the autosegmentation software may be evaluated by the contour similarity indices as well as possible time savings. Different parameters exist, such as the Jaccard index, the Dice coefficient, or the Hausdorff distance,
6- Oktay O
- Nanavati J
- Schwaighofer A
- et al.
Evaluation of deep learning to augment image-guided radiotherapy for head and neck and prostate cancers.
, 7- Men K
- Geng H
- Biswas T
- Liao Z
- Xiao Y.
Automated quality assurance of OAR contouring for lung cancer based on segmentation with deep active learning.
, , 9- Liu C
- Gardner SJ
- Wen N
- et al.
Automatic segmentation of the prostate on CT images using deep neural networks (DNN).
, 10- Cardenas CE
- McCarroll RE
- Court LE
- et al.
Deep learning algorithm for auto-delineation of high-risk oropharyngeal clinical target volumes with built-in Dice similarity coefficient parameter optimization function.
which are used to compare the results of the tested delineation to a ground-truth delineation. However, aiming for a complete overlap between 2 structures contoured manually by 2 observers or by 1 observer and the software may not lead to any insights, as there will always be a certain degree of variability, which is inevitable in everyday clinical practice. Moreover, given that artificial intelligence is still at the developmental stage, the contours generated by computer tools must always be subject to revision and manual correction by the radiation oncologist.
22- McCarroll RE
- Beadle BM
- Balter PA
- et al.
Retrospective validation and clinical implementation of automated contouring of organs at risk in the head and neck: A step toward automated radiation treatment planning for low- and middle-income countries.
For that reason, we decided to perform a clinical applicability analysis, based on the possible changes that the technician would make to the automatically generated contour, and we performed a usability analysis.
Walker et al
23- Walker GV
- Awan M
- Tao R
- et al.
Prospective randomized double-blind study of atlas-based organ-at-risk autosegmentation-assisted radiation planning in head and neck cancer.
published the results of a randomized double-blind study in which the manually drawn contours were compared with the automatically created contours in 40 patients with head and neck cancer. The automatic system allowed a time savings of 30%, with a similar Dice coefficient between manual and autosegmented series. However, in their conclusions, the authors affirm that autocontouring must be validated by a physician. Choi et al
24- Choi MS
- Choi BS
- Chung SY
- et al.
Clinical evaluation of atlas- and deep learning-based automatic segmentation of multiple organs and clinical target volumes for breast cancer.
compared the contours that were drawn manually by an expert to those generated automatically by 2 algorithms, one based on atlas and the other based on deep learning in patients with breast cancer. They concluded that the algorithm based on deep learning is more reliable than the one based on atlas. Zabel et al
25- Zabel WJ
- Conway JL
- Gladwish A
- et al.
Clinical evaluation of deep learning and atlas-based auto-contouring of bladder and rectum for prostate radiation therapy.
performed a similar study on the bladder, prostate, and rectum contours, reaching the same conclusion; similar results have been demonstrated by Van Dijk et al.
26- van Dijk LV
- Van den Bosch L
- Aljabar P
- et al.
Improving automatic delineation for head and neck organs at risk by deep learning contouring.
The advantage of deep learning methods was also shown for delineations of greater complexity such as the parotid glands
27- Hänsch A
- Schwier M
- Gass T
- et al.
Evaluation of deep learning methods for parotid gland segmentation from CT images.
and masticatory muscles,
28- Chen W
- Li Y
- Dyer BA
- et al.
Deep learning vs. atlas-based models for fast auto-segmentation of the masticatory muscles on head and neck CT images.
as well as in the delineation of the hippocampus on magnetic resonance images.
29Deep learning-based automatic delineation of the hippocampus by MRI: Geometric and dosimetric evaluation.
In the present study, we quantified the time saved with autosegmentation. It is important to measure the total time for all parts of the process and to include the time invested in the definition of the atlas cases and in the choice of the correct atlas case (manual or automatic).
Several benefits are expected from the use of autocontouring systems. However, they must be implemented and carefully validated in everyday clinical practice.
To conclude, autosegmentation tools provide an attractive opportunity with the potential both to reduce segmentation time and to increase adherence to existing guidelines. By generating contours that require minimal editing time, these systems can be easily incorporated into the routine workflows of a radiation oncology department.
Article info
Publication history
Published online: January 15, 2023
Accepted:
January 5,
2023
Received:
July 28,
2022
Footnotes
Sources of support: Siemens provided the software and hardware necessary for the operation of syngo.via, as well as proofreading and translation services.
Disclosures: This research was conducted in collaboration with Siemens, but it were not involved in any way during the clinical phase of the study or in the data collection. The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Research data are available upon request from the corresponding author. The raw data were generated in PSMar. Because of commercial restrictions, model data supporting the conclusions will not be available.
Copyright
© 2023 The Authors. Published by Elsevier Inc. on behalf of American Society for Radiation Oncology.