Large-scale Point Cloud Segmentation based on Multi-feature Local Enhanced Fusion

Zongshun Wang

doi:10.62051/ijcsit.v2n1.19

Keywords:

3D scene understanding; Point cloud segmentation; Feature extraction; Manhattan distance

Abstract

This paper introduces MLEF-Net, a framework for semantic segmentation of large-scale 3D point clouds. The model combines a k-nearest-neighbor (KNN) neighborhood search based on Manhattan distance with feature aggregation, and it handles spatial, color, and normal vector attributes in order to improve segmentation accuracy. Experiments on the SemanticKITTI and nuScenes datasets validate the model's performance and indicate the potential of feature fusion strategies for point cloud segmentation.
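The neighborhood search and aggregation named in the abstract can be illustrated with a minimal sketch. It is not the MLEF-Net implementation: the function names `manhattan_knn` and `aggregate_features` are hypothetical, the search is brute force (suitable for small inputs only), and max pooling is assumed as the aggregation operator because the abstract does not specify one.

```python
import numpy as np


def manhattan_knn(points: np.ndarray, k: int) -> np.ndarray:
    """Return, for each point, the indices of its k nearest neighbors
    under the Manhattan (L1) distance. Brute force, O(N^2) memory."""
    # Pairwise coordinate differences, shape (N, N, D), summed to (N, N).
    dist = np.abs(points[:, None, :] - points[None, :, :]).sum(axis=-1)
    # Exclude each point from its own neighborhood.
    np.fill_diagonal(dist, np.inf)
    # A stable sort keeps ties in index order, so results are deterministic.
    return np.argsort(dist, axis=1, kind="stable")[:, :k]


def aggregate_features(features: np.ndarray, neighbors: np.ndarray) -> np.ndarray:
    """Max-pool per-point features over each neighborhood.
    Max pooling is an assumption here, not the paper's stated operator."""
    # features[neighbors] has shape (N, k, C); reduce over the k axis.
    return features[neighbors].max(axis=1)


if __name__ == "__main__":
    # Four toy points; a real input would also carry color and normals.
    points = np.array([[0, 0, 0], [1, 0, 0], [0, 2, 0], [3, 3, 3]], dtype=float)
    features = np.array([[0.0], [1.0], [2.0], [3.0]])
    neighbors = manhattan_knn(points, k=2)
    print(neighbors)
    print(aggregate_features(features, neighbors))
```

In this sketch the per-point feature array could equally hold a concatenation of coordinates, color, and normal vectors; the aggregation step is unchanged because it only pools along the neighbor axis.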
