Cracks Semantic Segmentation

Published:

Cracks are the defects formed by cyclic loading, fatigue, shrinkage, creep, and so on. In addition, they represent the deterioration of the structures over some time. Therefore, it is essential to detect and classify them according to the condition grade at the early stages to prevent the collapse of structures. Deep learning and machine learning-based supervised semantic segmentation methods requires carefully annotated images of cracks. This is a repository of human annotated semantic segmentation datasets of concrete, pavement, walls/mixed and masonry/bricks cracks surface. I collected these datasets during my Ph.D. days and used many for my dissertation.

Datasets examples

No.Dataset nameCracks imagesGround-truth imagesRemarks
1Cracks-200Concrete surfaces, I won’t say easy but still naive looking images of cracks and a low-resolution dataset. Low in the textural details. I call it Cracks-101 dataset.
2Cracks-1K (644 x 483)Concrete and pavement surfaces, quite challenging dataset. It has cracks of thin, medium and thick varieties. This dataset is texturous and some pavement cracks are filled with debris and may not be darker in constrast compared to the background. Some pavement cracks have lighter shades.
3Cracks-676Concrete, wall and pavement surfaces, quite challenging dataset. It has cracks of thin, medium and thick varieties. Highly textureous and cracks width varies largely. I call this dataset as The Wild Wild West because of its diversity and different images size.
4Cracks-4K (512 x 512)Concrete surfaces dataset. It has medium and large-sized cracks. Overall, clean and high-resolution dataset.
5Cracks-1K (1280 x 720)Concrete surfaces dataset. It has some thin, medium, and large-sized cracks. Overall, clean and high-resolution dataset.
6Cracks-1K (448 x 252)Concrete surfaces dataset. It has thin, medium, and large-sized cracks. Same as Cracks-1K (1280 x 720), but with a different resolution manageable for Deep Learning and Machine Learning methods. Overall, clean and high-resolution dataset.
7Cracks-3K (512 x 512)Concrete and pavement surfaces. It has thin, medium, and large-sized cracks. It has shadows in the middle. Overall, the most texture-diverse and challenging dataset.
8Masonry-Brick Cracks 81 (5152x3864)Masonry, bricks, and concrete joints dataset. Includes thin, medium, and large-sized cracks. Clean and very high resolution. Use Cracks cropper algorithm to split large resolution images for Deep Learning and Machine Learning methods.

Citation

All cracks semantic segmentation datasets are available to the public. If you use any of these datasets in your research, please use the following BibTeX entry to cite:

@PhdThesis{preetham2021vision,
	author  = {Aghalaya Manjunatha, Preetham},
	title   = {Vision-Based and Data-Driven Analytical and Experimental Studies into Condition Assessment and Change Detection of Evolving Civil, Mechanical and Aerospace Infrastructures},
	school  = {University of Southern California},
	year    = {2021},
	type    = {Dissertations & Theses},
	address = {3550 Trousdale Parkway Los Angeles, CA 90089},
	month   = {December},
	note    = {Condition assessment, Crack localization, Crack change detection, Synthetic crack generation, Sewer pipe condition assessment, Mechanical systems defect detection and quantification}
}

Acknowledgements

I thank Shravan Ravi, Vinay Hegde, and Milind Bhat (chronological order) for their efforts in the collection and preparation of the crack and non-crack image database around the University of Southern California (USC) campus at Los Angeles, USA. I thank Professor Mohammad Reza Jahanshahi for providing the Cracks-200 raw crack and non-crack images dataset. I thank Dr. Azarang Golmohammadi who web harvested cracks images of concrete, pavement, and walls in Cracks-676 dataset. In addition, I thank Ryan, Youngseok, Wisepl Private Limited, Zhenghao Li, and Arjun Sridhar (chronological order) for their conscientious efforts in creating for crack semantic segmentation annotations.