Enhanced Cross-stage-attention U-Net for esophageal target volume segmentation

Xiao Lou; Juan Zhu; Jian Yang; Youzhe Zhu; Huazhong Shu; Baosheng Li

doi:10.1186/s12880-024-01515-x

BMC Med Imaging. 2024 Dec 18;24(1):339. doi: 10.1186/s12880-024-01515-x.

Authors

Xiao Lou¹,², Juan Zhu³, Jian Yang⁴, Youzhe Zhu⁵,⁶, Huazhong Shu⁷, Baosheng Li⁸,⁹

Affiliations

¹ Laboratory of Image Science and Technology, Key Laboratory of New Generation Artificial Intelligence Technology and Its Interdisciplinary Applications, Ministry of Education, Southeast University, Sipailou 2, Nanjing, P.R. China.
² Department of Radiotherapy, Lishui People's Hospital, No. 1188, Liyang Street, Lishui, P.R. China.
³ Department of Respiratory Medicine, The People's Hospital of Zhangqiuqu Area, No. 1920, Huiquan Street, Jinan, P.R. China.
⁴ Department of Clinical Laboratory, The People's Hospital of Zhangqiuqu Area, No. 1920, Huiquan Street, Jinan, P.R. China.
⁵ Laboratory of Image Science and Technology, Key Laboratory of New Generation Artificial Intelligence Technology and Its Interdisciplinary Applications, Ministry of Education, Southeast University, Sipailou 2, Nanjing, P.R. China. zhuyouzhe@yeah.net.
⁶ Department of Radiotherapy, Lishui People's Hospital, No. 1188, Liyang Street, Lishui, P.R. China. zhuyouzhe@yeah.net.
⁷ Laboratory of Image Science and Technology, Key Laboratory of New Generation Artificial Intelligence Technology and Its Interdisciplinary Applications, Ministry of Education, Southeast University, Sipailou 2, Nanjing, P.R. China. shu.list@seu.edu.cn.
⁸ Laboratory of Image Science and Technology, Key Laboratory of New Generation Artificial Intelligence Technology and Its Interdisciplinary Applications, Ministry of Education, Southeast University, Sipailou 2, Nanjing, P.R. China. bsli@sdfmu.edu.cn.
⁹ Shandong First Medical University and Shandong Academy of Medical Sciences, Shandong Cancer Hospital and Institute, No. 440, Jiyan Street, Jinan, P.R. China. bsli@sdfmu.edu.cn.

PMID: 39696039
DOI: 10.1186/s12880-024-01515-x

Abstract

Purpose: Segmentation of the target volume and organs at risk (OAR) is a significant part of radiotherapy. Determining the location and extent of the esophagus in simulated computed tomography (CT) images is difficult and time-consuming, primarily because of its complex structure and low contrast with the surrounding tissues. In this study, an Enhanced Cross-stage-attention U-Net was proposed to segment the esophageal gross tumor volume (GTV) and clinical target volume (CTV) in CT images.

Methods: First, a module based on principal component analysis (PCA) was constructed to pre-extract features from the input image. Then, a cross-stage feature fusion model was designed to replace the skip concatenation of the original UNet. This model was composed of a Wide Range Attention (WRA) unit, a Small-kernel Local Attention (SLA) unit, and an Inverted Bottleneck (IBN) unit. WRA was employed to capture global attention, and its large convolution kernel was decomposed to reduce computation. SLA was used to complement WRA with local attention. IBN was constructed to fuse the extracted features, and it included a global frequency response layer that redistributed the frequency response of the fused feature maps.
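
The abstract does not say how the large WRA kernel was decomposed. As a rough illustration only, the sketch below assumes one commonly used scheme: a K x K depthwise convolution is approximated by a small depthwise convolution followed by a dilated depthwise convolution. The function names, the kernel size 21, and the dilation 3 are hypothetical values chosen for the example, not values taken from the paper.

```python
import math


def decompose_large_kernel(kernel_size: int, dilation: int) -> dict:
    """Kernel sizes for an assumed two-stage decomposition of a K x K depthwise conv.

    Assumption (not from the paper): the large kernel is replaced by
      1. a (2d - 1) x (2d - 1) depthwise convolution, then
      2. a ceil(K / d) x ceil(K / d) depthwise convolution with dilation d.
    """
    local_kernel = 2 * dilation - 1
    dilated_kernel = math.ceil(kernel_size / dilation)
    # Receptive field of stacked stride-1 convolutions:
    # 1 + sum over layers of (kernel - 1) * dilation.
    receptive_field = 1 + (local_kernel - 1) + (dilated_kernel - 1) * dilation
    return {
        "local_kernel": local_kernel,
        "dilated_kernel": dilated_kernel,
        "receptive_field": receptive_field,
    }


def depthwise_weights_per_channel(kernel_sizes) -> int:
    """Number of depthwise weights per channel for a stack of square kernels."""
    return sum(k * k for k in kernel_sizes)


sizes = decompose_large_kernel(kernel_size=21, dilation=3)
direct = depthwise_weights_per_channel([21])
decomposed = depthwise_weights_per_channel(
    [sizes["local_kernel"], sizes["dilated_kernel"]]
)
print(sizes)
print("direct:", direct, "decomposed:", decomposed)
```

Under these assumptions the two smaller kernels cover at least the same receptive field as the single large kernel while using far fewer depthwise weights per channel, which is the kind of saving the Methods paragraph alludes to.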

Results: The proposed method was compared with relevant published esophageal segmentation methods. For GTV, the proposed network achieved MSD = 2.83 (1.62, 4.76) mm, HD = 11.79 ± 6.02 mm, and DC = 72.45 ± 19.18%. For CTV, it achieved MSD = 5.26 (2.18, 8.82) mm, HD = 16.22 ± 10.01 mm, and DC = 71.06 ± 17.72%.
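
The abstract does not expand the metric abbreviations. Assuming DC denotes the Dice coefficient and HD the Hausdorff distance, which is the usual reading in segmentation work, the following minimal sketch shows how the two could be computed for a pair of toy 2D masks given as sets of pixel coordinates. The function names and the example masks are hypothetical, and for simplicity the distance is taken over all mask pixels rather than extracted surface points.

```python
import math


def dice_coefficient(pred, truth) -> float:
    """Dice = 2 * |A intersect B| / (|A| + |B|) for two sets of pixel coordinates."""
    pred, truth = set(pred), set(truth)
    if not pred and not truth:
        return 1.0  # two empty masks agree perfectly
    return 2 * len(pred & truth) / (len(pred) + len(truth))


def directed_hausdorff(a, b) -> float:
    """Largest distance from any point in a to its nearest point in b."""
    return max(min(math.dist(p, q) for q in b) for p in a)


def hausdorff_distance(a, b) -> float:
    """Symmetric Hausdorff distance between two non-empty point sets."""
    return max(directed_hausdorff(a, b), directed_hausdorff(b, a))


# Toy masks: two 2 x 2 squares that overlap in one column.
pred_mask = {(0, 0), (0, 1), (1, 0), (1, 1)}
true_mask = {(0, 1), (1, 1), (0, 2), (1, 2)}

dice = dice_coefficient(pred_mask, true_mask)
hd = hausdorff_distance(pred_mask, true_mask)
print(f"Dice = {dice:.2f}, Hausdorff = {hd:.2f} pixels")
```

To report distances in millimetres, as the Results do, the pixel coordinates would first need to be scaled by the image spacing.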

Conclusion: Reconstructing the skip concatenation in UNet improved esophageal segmentation performance. The results showed that the proposed network segmented the esophageal GTV and CTV more effectively than the compared methods.

Keywords: Attention; CNN; Esophageal carcinoma; Esophageal segmentation; Simulated CT; UNet.

MeSH terms

Algorithms
Esophageal Neoplasms* / diagnostic imaging
Esophageal Neoplasms* / radiotherapy
Esophagus / diagnostic imaging
Humans
Organs at Risk
Principal Component Analysis
Radiotherapy Planning, Computer-Assisted / methods
Tomography, X-Ray Computed* / methods
Tumor Burden
