ST-V-Net: incorporating shape prior into convolutional neural networks for proximal femur segmentation

Chen Zhao; Joyce H Keyak; Jinshan Tang; Tadashi S Kaneko; Sundeep Khosla; Shreyasee Amin; Elizabeth J Atkinson; Lan-Juan Zhao; Michael J Serou; Chaoyang Zhang; Hui Shen; Hong-Wen Deng; Weihua Zhou

doi:10.1007/s40747-021-00427-5

ST-V-Net: incorporating shape prior into convolutional neural networks for proximal femur segmentation

Complex Intell Systems. 2023;9(3):2747-2758. doi: 10.1007/s40747-021-00427-5. Epub 2021 Jun 16.

Authors

Chen Zhao^#¹, Joyce H Keyak^#², Jinshan Tang^{1

3}, Tadashi S Kaneko⁴, Sundeep Khosla⁵, Shreyasee Amin⁶, Elizabeth J Atkinson⁷, Lan-Juan Zhao⁸, Michael J Serou⁹, Chaoyang Zhang¹⁰, Hui Shen⁸, Hong-Wen Deng⁸, Weihua Zhou^{1

3}

Affiliations

¹ Department of Applied Computing, Michigan Technological University, 1400 Townsend Dr, Houghton, MI 49931 USA.
² Department of Radiological Sciences, Department of Mechanical and Aerospace Engineering, Department of Biomedical Engineering, and Chao Family Comprehensive Cancer Center, University of California, Irvine, Irvine, CA 92697 USA.
³ Center of Biocomputing and Digital Health, Institute of Computing and Cybersystems, and Health Research Institute, Michigan Technological University, Houghton, MI 49931 USA.
⁴ Department of Radiological Sciences, University of California, Irvine, Irvine, CA 92697 USA.
⁵ Division of Endocrinology, Department of Medicine, Mayo Clinic, Rochester, MN USA.
⁶ Division of Epidemiology, Department of Health Sciences Research, and Division of Rheumatology, Department of Medicine, Mayo Clinic, Rochester, MN USA.
⁷ Division of Biomedical Statistics and Informatics, Department of Health Sciences Research, Mayo Clinic, Rochester, MN USA.
⁸ Division of Biomedical Informatics and Genomics, Tulane Center of Biomedical Informatics and Genomics, Deming Department of Medicine, Tulane University, School of Medicine, 1440 Canal Street, Suite 1610, New Orleans, LA 70112 USA.
⁹ Department of Radiology, Tulane University School of Medicine, New Orleans, LA 70112 USA.
¹⁰ School of Computing Sciences and Computer Engineering, University of Southern Mississippi, Hattiesburg, MS 39406 USA.

^# Contributed equally.

Abstract

We aim to develop a deep-learning-based method for automatic proximal femur segmentation in quantitative computed tomography (QCT) images. We proposed a spatial transformation V-Net (ST-V-Net), which contains a V-Net and a spatial transform network (STN) to extract the proximal femur from QCT images. The STN incorporates a shape prior into the segmentation network as a constraint and guidance for model training, which improves model performance and accelerates model convergence. Meanwhile, a multi-stage training strategy is adopted to fine-tune the weights of the ST-V-Net. We performed experiments using a QCT dataset which included 397 QCT subjects. During the experiments for the entire cohort and then for male and female subjects separately, 90% of the subjects were used in ten-fold stratified cross-validation for training and the rest of the subjects were used to evaluate the performance of models. In the entire cohort, the proposed model achieved a Dice similarity coefficient (DSC) of 0.9888, a sensitivity of 0.9966 and a specificity of 0.9988. Compared with V-Net, the Hausdorff distance was reduced from 9.144 to 5.917 mm, and the average surface distance was reduced from 0.012 to 0.009 mm using the proposed ST-V-Net. Quantitative evaluation demonstrated excellent performance of the proposed ST-V-Net for automatic proximal femur segmentation in QCT images. In addition, the proposed ST-V-Net sheds light on incorporating shape prior to segmentation to further improve the model performance.

Keywords: Convolutional neural networks; Deep learning; Proximal femur; Quantitative computed tomography; Segmentation.

Abstract

Grants and funding