Automated contouring of CTV and OARs in planning CT scans using novel hybrid convolution-transformer networks for prostate cancer radiotherapy

Najmeh Arjmandi; Shahrokh Nasseri; Mehdi Momennezhad; Alireza Mehdizadeh; Sare Hosseini; Shokoufeh Mohebbi; Amin Amiri Tehranizadeh; Zohreh Pishevar

doi:10.1007/s12672-024-01177-9

Automated contouring of CTV and OARs in planning CT scans using novel hybrid convolution-transformer networks for prostate cancer radiotherapy

Discov Oncol. 2024 Jul 31;15(1):323. doi: 10.1007/s12672-024-01177-9.

Authors

Najmeh Arjmandi¹, Shahrokh Nasseri^{1

2}, Mehdi Momennezhad^{1

2}, Alireza Mehdizadeh³, Sare Hosseini^{4

5}, Shokoufeh Mohebbi⁶, Amin Amiri Tehranizadeh⁷, Zohreh Pishevar⁸

Affiliations

¹ Department of Medical Physics, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran.
² Medical Physics Research Center, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran.
³ Ionizing and Non-Ionizing Radiation Protection Research Center, School of Paramedical Sciences, Shiraz University of Medical Sciences, Shiraz, Iran.
⁴ Department of Radiation Oncology, Mashhad University of Medical Sciences, Mashhad, Iran.
⁵ Cancer Research Center, Mashhad University of Medical Sciences, Mashhad, Iran.
⁶ Medical Physics Department, Reza Radiation Oncology Center, Mashhad, Iran.
⁷ Department of Medical Informatics, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran. amin.amiriteh@gmail.com.
⁸ Department of Radiation Oncology, Mashhad University of Medical Sciences, Mashhad, Iran. zohreh.pishevar@gmail.com.

Abstract

Purpose objective(s): Manual contouring of the prostate region in planning computed tomography (CT) images is a challenging task due to factors such as low contrast in soft tissues, inter- and intra-observer variability, and variations in organ size and shape. Consequently, the use of automated contouring methods can offer significant advantages. In this study, we aimed to investigate automated male pelvic multi-organ contouring in multi-center planning CT images using a hybrid convolutional neural network-vision transformer (CNN-ViT) that combines convolutional and ViT techniques.

Materials/methods: We used retrospective data from 104 localized prostate cancer patients, with delineations of the clinical target volume (CTV) and critical organs at risk (OAR) for external beam radiotherapy. We introduced a novel attention-based fusion module that merges detailed features extracted through convolution with the global features obtained through the ViT.

Results: The average dice similarity coefficients (DSCs) achieved by VGG16-UNet-ViT for the prostate, bladder, rectum, right femoral head (RFH), and left femoral head (LFH) were 91.75%, 95.32%, 87.00%, 96.30%, and 96.34%, respectively. Experiments conducted on multi-center planning CT images indicate that combining the ViT structure with the CNN network resulted in superior performance for all organs compared to pure CNN and transformer architectures. Furthermore, the proposed method achieves more precise contours compared to state-of-the-art techniques.

Conclusion: Results demonstrate that integrating ViT into CNN architectures significantly improves segmentation performance. These results show promise as a reliable and efficient tool to facilitate prostate radiotherapy treatment planning.

Keywords: CT images; Convolutional neural network; Deep learning; Male pelvic radiotherapy; Prostate segmentation; Vision transformer.