Machine learning techniques for biomedical image segmentation: An overview of technical aspects and introduction to state-of-art applications

doi:10.1002/mp.13649

Review

Med Phys. 2020 Jun;47(5):e148-e167.

Hyunseok Seo¹, Masoud Badiei Khuzani¹, Varun Vasudevan², Charles Huang³, Hongyi Ren¹, Ruoxiu Xiao¹, Xiao Jia¹, Lei Xing¹

Affiliations

¹ Medical Physics Division in the Department of Radiation Oncology, School of Medicine, Stanford University, Stanford, CA, 94305-5847, USA.
² Institute for Computational and Mathematical Engineering, School of Engineering, Stanford University, Stanford, CA, 94305-4042, USA.
³ Department of Bioengineering, School of Engineering and Medicine, Stanford University, Stanford, CA, 94305-4245, USA.

PMID: 32418337
PMCID: PMC7338207
DOI: 10.1002/mp.13649

Hyunseok Seo et al. Med Phys. 2020 Jun.

Abstract

In recent years, significant progress has been made in developing more accurate and efficient machine learning algorithms for segmentation of medical and natural images. In this review article, we highlight the essential role of machine learning algorithms in enabling efficient and accurate segmentation in the field of medical imaging. We specifically focus on several key studies pertaining to the application of machine learning methods to biomedical image segmentation. We review classical machine learning algorithms such as Markov random fields, k-means clustering, and random forests. Although such classical learning models are often less accurate than deep-learning techniques, they are often more sample efficient and have a less complex structure. We also review different deep-learning architectures, such as artificial neural networks (ANNs), convolutional neural networks (CNNs), and recurrent neural networks (RNNs), and present the segmentation results attained by those learning models that were published in the past 3 years. We highlight the successes and limitations of each machine learning paradigm. In addition, we discuss several challenges related to the training of different machine learning models, and we present some heuristics to address those challenges.

Keywords: deep learning; machine learning; medical image; overview; segmentation.

Conflict of interest statement

“The authors have no conflicts to disclose.”

Figures

**Figure 1.**
The architecture of the segmentation network based on kernel SVMs, using a filter bank in conjunction with the kernel feature selection to generate semantic representations. Random feature maps φ₁, ⋯, φ_D capture the non-linear relationship between the representations and the class labels.

**Figure 2.**
Visualization of the random feature maps in three dimensions using t-SNE plots, for different bandwidth parameters γ ≡ 1/(2σ²) of the Gaussian RBF kernel $k_{X}(x, y) = \exp(-\gamma \|x - y\|_{2}^{2})$. To generate the feature maps, the pre-trained VGG network is used. The red and blue regions correspond to the random feature maps generated by the pixels from each of the two class labels in a sampled colonoscopy image. To enhance the visualization, we have cropped the selected image and retained a balanced number of pixels from each class label. (a): γ = 10⁻⁶, (b): γ = 10⁻³, (c): γ = 0.1, and (d): γ = 1.
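
The kernel in this caption can be made concrete with a short sketch. The caption does not say how the random feature maps are constructed, so the example below assumes random Fourier features (cosine features with Gaussian frequencies), one common way to approximate a Gaussian RBF kernel. The names `rbf_kernel` and `make_random_features`, the feature count, and the sample vectors are all hypothetical.

```python
import math
import random


def rbf_kernel(x, y, gamma):
    """Exact Gaussian RBF kernel k(x, y) = exp(-gamma * ||x - y||_2^2)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)


def make_random_features(dim, num_features, gamma, seed=0):
    """Build a random feature map phi (assumed: random Fourier features).

    Frequencies are drawn from N(0, 2*gamma*I) and phases from U[0, 2*pi],
    so that the inner product phi(x) . phi(y) approximates rbf_kernel(x, y).
    """
    rng = random.Random(seed)
    std = math.sqrt(2.0 * gamma)
    weights = [[rng.gauss(0.0, std) for _ in range(dim)]
               for _ in range(num_features)]
    phases = [rng.uniform(0.0, 2.0 * math.pi) for _ in range(num_features)]
    scale = math.sqrt(2.0 / num_features)

    def phi(x):
        return [scale * math.cos(sum(w_i * x_i for w_i, x_i in zip(w, x)) + b)
                for w, b in zip(weights, phases)]

    return phi


# Hypothetical 3-dimensional inputs standing in for per-pixel descriptors.
x = [0.2, -0.1, 0.4]
y = [0.0, 0.3, 0.1]
gamma = 0.5

phi = make_random_features(dim=3, num_features=5000, gamma=gamma)
exact = rbf_kernel(x, y, gamma)
approx = sum(a * b for a, b in zip(phi(x), phi(y)))
print(exact, approx)  # the two values should be close, not identical
```

Under this assumption, a larger γ means higher-frequency features and a kernel that treats only very close points as similar, which is consistent with the panels showing different structure for different γ.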

**Figure 3.**
Segmentation of angiodysplasia colonoscopy images generated by FCN on sampled test images from the GIANA challenge dataset. Top: the colonoscopy images obtained using Wireless Capsule Endoscopy (WCE). Middle: the heat maps depicting the soft-max output of FCN. Bottom: the heat map of the residual image computed as the absolute difference between the proposed segmentation and the ground truth. Due to training on a small dataset, FCN tends to overfit and does not generalize well to unseen data.

**Figure 4.**
Segmentation of angiodysplasia colonoscopy images on sampled test images from the GIANA challenge dataset, generated via the kernel SVM using the VGG filter bank with the kernel feature selection. The bandwidth parameter 1/(2σ²) of the RBF kernel is selected via maximum mean discrepancy optimization. Top: the colonoscopy images obtained using Wireless Capsule Endoscopy (WCE). Middle: the heat maps depicting the soft-max output of the kernel SVM classifier. Bottom: the heat map of the residual image computed as the absolute difference between the proposed segmentation and the ground truth. Despite training on a small dataset, the kernel SVM performs well on the test dataset.

**Figure 5.**
Comparison of the mean IoU score M_IoU for FCN (red), the kernel SVM with Mallat’s scattering network as the filter bank (green), and the kernel SVM with a pre-trained VGG network as the filter bank (blue) on the test dataset. To tune the parameters of the Gaussian RBF kernel, a two-sample test is performed. Each plot corresponds to the performance of networks that are trained on a different sample size. Panel (a): 76800 pixels (1 image), Panel (b): 153600 pixels (2 images), Panel (c): 1% of the dataset (3 images), Panel (d): 5% of the dataset (15 images).
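
For readers unfamiliar with the metric, the following is a minimal sketch of a mean IoU computation. The caption does not define how M_IoU is averaged, so this sketch assumes the per-class intersection-over-union is averaged over the classes present in either mask. The function name `mean_iou` and the toy label lists are hypothetical.

```python
def mean_iou(pred, truth, num_classes):
    """Mean intersection-over-union for flat lists of integer class labels.

    Assumption: IoU is averaged over classes that appear in pred or truth;
    classes absent from both are skipped.
    """
    ious = []
    for c in range(num_classes):
        inter = sum(1 for p, t in zip(pred, truth) if p == c and t == c)
        union = sum(1 for p, t in zip(pred, truth) if p == c or t == c)
        if union > 0:
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else float("nan")


# Toy example: six pixels, two classes (0 = background, 1 = lesion).
truth = [0, 0, 1, 1, 1, 0]
pred = [0, 1, 1, 1, 0, 0]
score = mean_iou(pred, truth, num_classes=2)
print(score)  # 0.5: each class has 2 pixels in the intersection, 4 in the union
```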

**Figure 6.**
The architecture of the artificial neural network (ANN). (a) Mathematical model of a perceptron (node). (b) Multi-layer perceptron (MLP) structure for ANN. Each node in the hidden layer of (b) is described mathematically in (a). (c) An example of back-propagation. The loss is minimized by updating the weight w based on the gradient of the loss function with respect to w, computed via the chain rule; b is the constant bias. (d) An example of the convolution operation in a CNN. The same kernel weights are applied at every position to compute the output.
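
The weight update described in panel (c) can be illustrated with a single node. This is only a sketch under stated assumptions: the caption does not name the activation or the loss, so a sigmoid activation and a squared-error loss are assumed here, and the bias b is held constant as in the caption. All names and numeric values are hypothetical.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def forward(w, b, x):
    """Perceptron output y = sigmoid(w . x + b)."""
    return sigmoid(sum(w_i * x_i for w_i, x_i in zip(w, x)) + b)


def loss(w, b, x, target):
    """Assumed squared-error loss L = 0.5 * (y - target)^2."""
    return 0.5 * (forward(w, b, x) - target) ** 2


def grad_w(w, b, x, target):
    """Chain rule: dL/dw_i = dL/dy * dy/dz * dz/dw_i."""
    y = forward(w, b, x)
    dL_dy = y - target        # derivative of the squared-error loss
    dy_dz = y * (1.0 - y)     # derivative of the sigmoid
    return [dL_dy * dy_dz * x_i for x_i in x]  # dz/dw_i = x_i


w = [0.5, -0.3]
b = 0.1                # constant bias, not updated
x = [1.0, 2.0]
target = 1.0
learning_rate = 0.5

loss_before = loss(w, b, x, target)
grad = grad_w(w, b, x, target)
w_new = [w_i - learning_rate * g_i for w_i, g_i in zip(w, grad)]
loss_after = loss(w_new, b, x, target)
print(loss_before, loss_after)  # the loss decreases after one update
```

In a multi-layer network the same chain rule is applied layer by layer, with each layer's gradient reusing the factors computed for the layer after it.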

**Figure 7.**
The architecture of the recurrent neural network (RNN).

**Figure 8.**
Network architecture of the patch-wise CNN for liver/liver-tumor segmentation.

**Figure 9.**
Network architecture of (a) FCN and (b) U-Net.

**Figure 10.**
(a) The results of the liver and liver-tumor segmentation. Yellow, purple, red, green, and blue lines are acquired from SBBS-CNN, dual-frame U-Net, atrous pyramid pooling, the proposed network, and ground truth, respectively. (b) and (c) show the contours of the segmentation results in (a).

**Figure 11.**
Network architecture of a cascaded CNN (example of patch-wise CNN and FCN) for tumor segmentation. The first network is trained for ROI or rough classification and the second network is further tuned for final segmentation.

**Figure 12.**
Illustrations of (a) stride and (b) atrous (dilated) convolution. Stride is the amount by which the convolution kernel shifts, and the atrous rate is the spacing between kernel elements (weights). (c) Structure of atrous pyramid pooling. Pyramid pooling forms a feature map that contains both local and global context information by applying different sub-region representations followed by upsampling and concatenation layers.
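
The two parameters in panels (a) and (b) can be shown in one dimension. The sketch below is an illustrative, unpadded 1D version rather than the 2D layers used in the networks reviewed; the function name `conv1d` and the sample signal are hypothetical.

```python
def conv1d(signal, kernel, stride=1, dilation=1):
    """Unpadded 1D cross-correlation with stride and dilation (atrous rate).

    stride:   how far the kernel shifts between consecutive outputs.
    dilation: spacing between the input samples that the kernel weights touch.
    """
    span = (len(kernel) - 1) * dilation + 1  # input width covered by one output
    outputs = []
    for start in range(0, len(signal) - span + 1, stride):
        outputs.append(sum(k * signal[start + j * dilation]
                           for j, k in enumerate(kernel)))
    return outputs


signal = [1, 4, 9, 16, 25, 36, 49]
kernel = [1, 0, -1]

dense = conv1d(signal, kernel)                # [-8, -12, -16, -20, -24]
strided = conv1d(signal, kernel, stride=2)    # [-8, -16, -24]
atrous = conv1d(signal, kernel, dilation=2)   # [-24, -32, -40]
print(dense, strided, atrous)
```

Stride reduces the number of outputs while keeping the same three-sample window, whereas dilation widens the window from three to five samples without adding weights.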

**Figure 13.**
The network architecture that ranked 1st in the BRATS challenge in 2018.

**Figure 14.**
Structure of the Generative Adversarial Network (GAN).

Cited by

Automated Field of Interest Determination for Quantitative Ultrasound Analyses of Cervical Tissues: Toward Real-time Clinical Translation in Spontaneous Preterm Birth Risk Assessment.
Zuo J, Simpson DG, O'Brien WD Jr, McFarlin BL, Han A. Ultrasound Med Biol. 2024 Dec;50(12):1861-1867. doi: 10.1016/j.ultrasmedbio.2024.08.011. Epub 2024 Sep 12. PMID: 39271408

On-board MRI image compression using video encoder for MR-guided radiotherapy.
Shang J, Huang P, Zhang K, Dai J, Yan H. Quant Imaging Med Surg. 2023 Aug 1;13(8):5207-5217. doi: 10.21037/qims-22-1378. Epub 2023 Jun 20. PMID: 37581063 Free PMC article.

Automation and artificial intelligence in radiation therapy treatment planning.
Jones S, Thompson K, Porter B, Shepherd M, Sapkaroski D, Grimshaw A, Hargrave C. J Med Radiat Sci. 2024 Jun;71(2):290-298. doi: 10.1002/jmrs.729. Epub 2023 Oct 4. PMID: 37794690 Free PMC article.

Optical coherence tomography confirms non-malignant pigmented lesions in phacomatosis pigmentokeratotica using a support vector machine learning algorithm.
Lee J, Beirami MJ, Ebrahimpour R, Puyana C, Tsoukas M, Avanaki K. Skin Res Technol. 2023 Jun;29(6):e13377. doi: 10.1111/srt.13377. PMID: 37357662 Free PMC article.

Automated, high-throughput quantification of EGFP-expressing neutrophils in zebrafish by machine learning and a highly-parallelized microscope.
Efromson J, Ferrero G, Bègue A, Doman TJJ, Dugo C, Barker A, Saliu V, Reamey P, Kim K, Harfouche M, Yoder JA. PLoS One. 2023 Dec 7;18(12):e0295711. doi: 10.1371/journal.pone.0295711. eCollection 2023. PMID: 38060605 Free PMC article.
