Article - Semantic point cloud segmentation with deep learning-based approaches for the construction industry: A Survey

Figure 13: Reported results for semantic segmentation task on the large-scale indoor S3DIS benchmark.

Authors: Lukas Rauch, Thomas Braml
Correspondence: lukas.rauch@unibw.de
DOI: https://doi.org/10.3390/app13169146

Abstract

Point cloud learning has recently gained strong attention due to its applications in various fields, like computer vision, robotics, and autonomous driving. Point cloud semantic segmentation (PCSS) enables the automatic extraction of semantic information from 3D point cloud data, which makes it a desirable task for construction-related applications as well. Yet, only a limited number of publications have applied deep-learning-based methods to address point cloud understanding for civil engineering problems, and there is still a lack of comprehensive reviews and evaluations of PCSS methods tailored to such use cases. This paper aims to address this gap by providing a survey of recent advances in deep-learning-based PCSS methods and relating them to the challenges of the construction industry. We introduce the significance of PCSS for the industry and provide a comprehensive look-up table of publicly available datasets for point cloud understanding, with evaluations based on data scene type, sensors, and point features. We address the problem of class imbalance in 3D data for machine learning, provide a compendium of commonly used evaluation metrics for PCSS, and summarize the most significant deep learning methods developed for PCSS. Finally, we discuss the advantages and disadvantages of the methods for specific industry challenges. Our contribution, to the best of our knowledge, is the first survey paper that comprehensively covers deep-learning-based methods for semantic segmentation tasks tailored to construction applications. This paper serves as a useful reference for prospective researchers and practitioners seeking to develop more accurate and efficient PCSS methods.

Keywords: point cloud; semantic segmentation; deep learning; machine learning; construction; automation; open source; dataset; survey

BibTeX

@article{rauch:2023,
  title={Semantic Point Cloud Segmentation with Deep-Learning-Based Approaches for the Construction Industry: A Survey},
  author={Rauch, Lukas and Braml, Thomas},
  journal={Applied Sciences},
  year={2023},
  publisher={MDPI},
  doi={10.3390/app13169146}
}

Figure 2: A tree structure to summarize the variety of common dataset configurations for 3D scene understanding tasks.

Supplementary Material

1. List of publicly available datasets for 3D-scene understanding (Preview)

TABLE I. List of publicly available datasets for 3D-scene understanding, categorized by data acquisition method, the content of the dataset, used hardware, data representation, and extent of available annotation classes.

Data type: real-world (R) or synthetic (S).

Nr.	Year	Name	Resource	Data type	Objects	Indoor sites	Urban (S)	Urban (D)	Panoramic cameras	Stereo camera	RGB-D	TLS	MLS	IMU	GPS	RGB sequence	Depth sequence	Point cloud	RGB	Intensity	# Sem. classes	Object detection	Pose estimation	Shape classification	Object tracking	Semantic segmentation	Instance sem. segmentation	Scene reconstruction	SLAM	# Points	# Frames	# Scenes	# Scans
1	2009	Oakland 3-D	link	R			1					1						1			5					1				1.6M
2	2011	Ford Campus Vision and Lidar Data Set	link	R				1	1				1	1	1	1		1	1	1		1				1			1			2
3	2012	KITTI stereo evaluation 2012	link	R				1		1			1	1	1	1	1		1		8	1	1		1	1	1		1		1.5K	22
4	2013	NYUv2	link	R		1					1					1	1				14					1					407.0K	464
5	2013	SUN3D	link	R		1					1					1	1									1	1	1				254	415
6	2013	Sydney Urban Objects	link	R	1								1					1			14			1									613
7	2014	Paris-rue-Madame database	link	R				1					1					1		1	17					1	1			2.0M		1	2
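
The look-up table is meant for narrowing down candidate datasets by task and annotation extent. The following is a minimal sketch of such a filter, assuming the table is available as tab-separated text. The four-column excerpt `PREVIEW_TSV` and the helper name `datasets_with_semantic_labels` are illustrative assumptions and not part of the repository; the values are copied from the preview rows above.

```python
import csv
import io

# Excerpt of the preview table above, reduced to four columns (tab-separated).
# An empty cell means the property is not listed for that dataset.
PREVIEW_TSV = (
    "Year\tName\t# Sem. classes\tSemantic segmentation\n"
    "2009\tOakland 3-D\t5\t1\n"
    "2012\tKITTI stereo evaluation 2012\t8\t1\n"
    "2013\tNYUv2\t14\t1\n"
    "2013\tSUN3D\t\t1\n"
    "2013\tSydney Urban Objects\t14\t\n"
)


def datasets_with_semantic_labels(tsv_text, min_classes=1):
    """Return (name, class count) for datasets that are flagged for semantic
    segmentation and list at least `min_classes` semantic classes."""
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    selected = []
    for row in reader:
        has_task = row["Semantic segmentation"].strip() == "1"
        n_classes = row["# Sem. classes"].strip()
        # Skip rows without the task flag or without a class count.
        if has_task and n_classes and int(n_classes) >= min_classes:
            selected.append((row["Name"], int(n_classes)))
    return selected


if __name__ == "__main__":
    for name, n_classes in datasets_with_semantic_labels(PREVIEW_TSV, min_classes=8):
        print(f"{name}: {n_classes} classes")
```

The same pattern extends to the sensor and scene-type columns of the full table by testing additional flags per row.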

2. Rankings on S3DIS Semantic Segmentation Benchmark (Preview)

Reported results for semantic segmentation task on the large-scale indoor S3DIS benchmark (including all 6 areas, 6-fold cross-validation). Ranked in descending order based on mIoU performance.

Declaration: C: convolution-based, G: graph-based, H: hybrid, MLP: multilayer-perceptron-based, P: pooling-based, R: RNN-based, T: Transformer-based, V: voxel-based.

Rank	Year	Model Name	Link	Method	mIoU	mAcc
1	2022	WindowNorm+StratifiedTransformer	link	T	77.60	85.8
2	2022	PointMetaBase-XXL	link	MLP	77.00	-
3	2022	PointNeXt-XL	link	MLP	74.90	83.0
4	2022	DeepViewAgg	link	H	74.70	83.8
5	2022	RepSurf-U	link	MLP	74.30	82.6
6	2022	WindowNorm+PointTransformer	link	T	74.10	82.5
7	2022	PointNeXt-L	link	MLP	73.90	82.2
8	2020	PointTransformer	link	T	73.50	81.9
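
For readers unfamiliar with the two score columns, the sketch below shows how mIoU (mean intersection over union) and mAcc (mean per-class accuracy) are commonly derived from a confusion matrix of per-point labels. It uses the standard definitions and is not taken from any of the ranked models' evaluation code; the function names and the toy labels are assumptions for illustration.

```python
def confusion_matrix(y_true, y_pred, num_classes):
    """Count points per (ground-truth class, predicted class) pair.
    Rows index the ground-truth class, columns the predicted class."""
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def miou_and_macc(cm):
    """Return (mIoU, mAcc) in percent, averaged over the classes that occur."""
    n = len(cm)
    ious, accs = [], []
    for c in range(n):
        tp = cm[c][c]                            # correctly labelled points
        gt = sum(cm[c])                          # points that truly are class c
        pred = sum(cm[r][c] for r in range(n))   # points predicted as class c
        union = gt + pred - tp
        if union > 0:
            ious.append(tp / union)              # IoU = TP / (TP + FP + FN)
        if gt > 0:
            accs.append(tp / gt)                 # per-class accuracy (recall)
    return 100 * sum(ious) / len(ious), 100 * sum(accs) / len(accs)


if __name__ == "__main__":
    # Toy example: six points, three classes.
    y_true = [0, 0, 1, 1, 2, 2]
    y_pred = [0, 1, 1, 1, 2, 0]
    miou, macc = miou_and_macc(confusion_matrix(y_true, y_pred, 3))
    print(f"mIoU = {miou:.2f}, mAcc = {macc:.2f}")
```

Because mIoU penalizes false positives as well as false negatives, it is usually lower than mAcc for the same predictions, which matches the pattern in the table.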
