A Study on Facial Landmark Detection and Image Processing

Authors

  • Xiyi Xiong

DOI:

https://doi.org/10.62051/60c6xk73

Keywords:

Face Alignment, ResNet, Convolutional Neural Networks, Cascade Regression, Facial Landmark Detection.

Abstract

This research focuses on developing a robust face alignment system crucial for applications in facial recognition, expression analysis, and augmented reality. Utilizing a dataset of 2811 training images and 554 test images each annotated with 44 facial landmarks, the study explores several models including Convolutional Neural Networks (CNN), Cascade Regression, and Residual Networks (ResNet). Through rigorous experimentation, it was observed that while CNNs and Cascade Regression models provided substantial accuracy, the ResNet model displayed superior performance, especially in complex scenarios involving diverse expressions and occlusions. Additionally, the project also developed a simple algorithm for lip and eye color modification, further broadening the applicability of the system in image processing. Results indicate that while traditional models like CNN and Cascade Regression perform adequately, ResNet offers superior accuracy and robustness, particularly in challenging conditions. This research confirms the effectiveness of ResNet in enhancing face alignment technologies and suggests potential areas for future improvements in the field.

Downloads

Download data is not yet available.

References

[1] Sánchez-Lozano E, Martinez B, Vaistar M F. Cascaded regression with sparsiticd feature covariance matrix for facial landmark detection. Pattern Recognition Letters, 2016, 73: 19-25. DOI: https://doi.org/10.1016/j.patrec.2015.11.014

[2] Zhang Z, Zhang W, Liu J, et al. Facial landmark localization based on hierarchical pose regression with cascaded random fcms. Procecdings of the 21st ACM international conference on Multimedia. Barcelona: IEEE, 2013:561—564. DOI: https://doi.org/10.1145/2502081.2502148

[3] Wu F, Li S. Zhao T, et al. Cascaded regression using landmark displacement for 3D face reconstruction. Pattern Recognition Letters, 2019, 125:7 66-772. DOI: https://doi.org/10.1016/j.patrec.2019.07.017

[4] Hinton G E, Salakhutdinov R R. Reducing the dimensionality of data with neural networks. Science, 2006, 31 3(5786): 504-507. DOI: https://doi.org/10.1126/science.1127647

[5] Hinton G E, Osindero S. Teh Y W. A fast-learning algorithm for deep belief nets. Neural Computation, 2006, 18(7):1527-1554. DOI: https://doi.org/10.1162/neco.2006.18.7.1527

[6] Deng J, Dong W, Socher R, et al. Imagenet: A large-scale hierarchical image database. IEEE Conference on Computer Vision & Pattern Recognition. Miami: IEEE, 2009: 248-255. DOI: https://doi.org/10.1109/CVPR.2009.5206848

[7] Marmanis D, Datcu M, Esch T, et al. Deep learning earth observation classification using ImageNet pretrained networks. IEEE Geoscience & Remote Sensing Letters, 2015, 13(1): 105-109. DOI: https://doi.org/10.1109/LGRS.2015.2499239

[8] Stadelmann T, Toikachev V, Sick B, et al. Beyond ImageNet: Deep learning in industrial practice. Applied Data Science. Anchorage: Springer, 2019: 205-232. DOI: https://doi.org/10.1007/978-3-030-11821-1_12

[9] Krizhevsky A, Sutskever I, Hinton G E. Imagenet classification with deep convolutional neural networks. Communications of the ACM, 2017, 60(6): 84-90. DOI: https://doi.org/10.1145/3065386

[10] He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016: 770-778. DOI: https://doi.org/10.1109/CVPR.2016.90

Downloads

Published

12-08-2024

How to Cite

Xiong , X. (2024) “A Study on Facial Landmark Detection and Image Processing”, Transactions on Computer Science and Intelligent Systems Research, 5, pp. 1640–1646. doi:10.62051/60c6xk73.