CN117237639A - Pavement crack detection system based on YOLOv8 semantic segmentation - Google Patents

Pavement crack detection system based on YOLOv8 semantic segmentation Download PDF

Info

Publication number
CN117237639A
CN117237639A CN202311289399.5A CN202311289399A CN117237639A CN 117237639 A CN117237639 A CN 117237639A CN 202311289399 A CN202311289399 A CN 202311289399A CN 117237639 A CN117237639 A CN 117237639A
Authority
CN
China
Prior art keywords
data
crack
module
group
detection
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202311289399.5A
Other languages
Chinese (zh)
Inventor
曹恒睿
斛毓华
赵士博
谢悦涵
谢阳彦哲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhengzhou University
Original Assignee
Zhengzhou University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhengzhou University filed Critical Zhengzhou University
Priority to CN202311289399.5A priority Critical patent/CN117237639A/en
Publication of CN117237639A publication Critical patent/CN117237639A/en
Pending legal-status Critical Current

Links

Landscapes

  • Image Processing (AREA)

Abstract

The invention discloses a pavement crack detection system based on YOLOv8 semantic segmentation, which belongs to the technical field of crack detection and comprises an acquisition processing module, a target detection module, a semantic segmentation module, a calibration positioning module, a monitoring alarm module, a storage management module, a training update module, a monitoring platform, a report analysis module, a maintenance scheduling module and an integrated deployment module; the invention can more accurately classify and identify cracks with different types and severity, reduce false alarm rate, enable the system to perform more intelligent data analysis, help maintainers to better know the conditions of the cracks, and thus maintain the cracks more pertinently, improve the usability and fault tolerance of the system, ensure the safety and durability of the data, avoid the risk of data loss, improve the performance of the system, and ensure that the system can still normally operate under high load.

Description

Pavement crack detection system based on YOLOv8 semantic segmentation
Technical Field
The invention relates to the technical field of crack detection, in particular to a pavement crack detection system based on YOLOv8 semantic segmentation.
Background
Road networks, which are an important component of urban infrastructure, carry traffic for a large number of vehicles and pedestrians. Over time, however, the road surface inevitably becomes cracked, pitted, and damaged, which threatens traffic safety and also leads to an increase in maintenance costs. Therefore, it is important to detect and maintain road cracks in time. Conventional road inspection typically relies on manual inspection, which is time consuming, labor intensive and error prone. In recent years, the development of computer vision and deep learning techniques has provided new solutions for automated crack detection. An automatic crack detection system becomes an important tool for improving road maintenance efficiency and traffic safety.
The existing pavement crack detection system has high false alarm rate, and is not beneficial to help maintenance personnel to better know the state of cracks; in addition, the availability and fault tolerance of the existing pavement crack detection system are low, the safety and durability of data cannot be guaranteed, and the system cannot normally operate under the condition of high load, so that the pavement crack detection system based on the YOLOv8 semantic segmentation is provided.
Disclosure of Invention
The invention aims to solve the defects in the prior art, and provides a pavement crack detection system based on YOLOv8 semantic segmentation.
In order to achieve the above purpose, the present invention adopts the following technical scheme:
a pavement crack detection system based on YOLOv8 semantic segmentation comprises an acquisition processing module, a target detection module, a semantic segmentation module, a calibration positioning module, a monitoring alarm module, a storage management module, a training update module, a monitoring platform, a report analysis module, a maintenance scheduling module and an integrated deployment module;
the acquisition processing module is used for acquiring pavement information from each group of data sources and processing and labeling the acquired data;
the target detection module is used for detecting the position of a crack in the image and generating a boundary box;
the semantic segmentation module is used for further identifying and classifying the cracks and determining the type and severity of the cracks;
the calibration positioning module is used for correcting geographical coordinates of the crack by using a GPS positioning technology;
the monitoring alarm module is used for monitoring the road surface state in real time, and immediately sending an alarm notice if a crack is found;
the storage management module is used for storing and managing the collected data;
the training updating module is used for adjusting the optimization target detection model and the semantic segmentation model;
the monitoring platform is used for providing a visual user interface for maintenance personnel and administrators, and is used for monitoring the road surface state, checking detection results and alarming information;
the report analysis module is used for generating a report and analyzing data;
the maintenance scheduling module is used for making a maintenance plan and scheduling maintenance team modules;
the integrated deployment module is used for deploying the whole system to different hardware platforms and ensuring the normal operation of the system.
As a further scheme of the invention, the specific steps of the acquisition processing module for processing labels are as follows:
step one: after the acquisition processing module acquires original road surface images from each group of data sources, confirming the required image input width and input height according to preset data, and then adjusting each group of original road surface images by an equal proportion adjustment method;
step two: calculating scaling factors in the horizontal direction and the vertical direction, firstly creating a group of blank images, keeping the size of the blank images consistent with the preset size, then determining the position of each group of pixels in the adjusted pavement image in the original pavement image, calculating the gray value of each pixel through a bilinear interpolation method, filling the gray value into the corresponding position in the adjusted image to obtain a final pavement image, and storing the final pavement image on the blank image;
step three: and carrying out normalization processing on the adjusted pavement images, obtaining the position information of the corresponding cracks in each pavement image, distributing a group of unique integer labels for each group of crack class boundary boxes, calculating the information of the boundary boxes, and generating labels for each group of cracks, wherein the labels comprise class labels and boundary box information.
As a further scheme of the invention, the specific calculation formula of the equal proportion adjustment method in the step one is as follows:
W new type =H Order of (A) ×M Original source
(2)
H New type =W Order of (A) /M Original source
(3)
Wherein M is Original source Representing the original aspect ratio case; w (W) Original source Representing the original width; h Original source Representing the original height; w (W) New type Representing a new width; h New type Representing a new width;
the specific calculation formulas of the scaling factors in the horizontal direction and the vertical direction are as follows:
wherein α represents a horizontal scaling factor; beta represents a vertical scaling factor;
the specific calculation formula of the bilinear interpolation method is as follows:
wherein I is Order of (A) Representing a target pixel value; i Left upper part 、I Lower left 、I Upper right I Lower right Representing the most recent four raw pixel values; (delta) xy ) Representing the relative position of the target pixel in the nearest four pixel coordinates; c represents a normalization coefficient.
As a further scheme of the invention, the specific steps of crack detection of the target detection module are as follows:
step (1): the target detection module builds a group of YOLOv8 models, uses the weight trained by the COCO data set as the initial weight of the model, then collects the past road surface image data with labels, and then carries out data enhancement on each group of image data and integrates the image data into a road surface crack data set;
step (2): randomly dividing a crack data set into a training set, a verification set and a test set, then training a YOLO v8 model by using training data in the training set, and gradually adjusting weights of the model through a YOLO loss function in the training process to minimize a target detection error;
step (3): after each training is finished, evaluating the performance of the model through a verification set, checking the generalization capability of the model on unseen data, adjusting the fine tuning and super-parameters of the model parameters according to the verification result, evaluating the finally trained model through a test set after the training is finished, acquiring performance indexes, deploying the model into an API or an embedded system through an integrated deployment module, and continuously monitoring the performance of the model in practical application;
step (4): inputting the pavement image subjected to data preprocessing into a YOLOv8 model, wherein the YOLOv8 model extracts input image features through a plurality of convolution layers, pooling layers and full-connection layers and performs target detection to obtain a target detection result;
step (5): performing target detection on feature graphs with different scales, sending the detected features into a bidirectional feature pyramid, performing feature fusion, performing classification regression on fusion results to output a detection frame, and decoding the detection results in the feature graphs into coordinate and size information and category probability of a boundary frame by a model;
step (6): matching each group of detection frames with the detection result in the feature map to determine the best matched detection frame, predicting the object category probability contained in each group of detection frames by using a softmax activation function, and adjusting the position and the size of the boundary frame according to the dimension of the feature map and the dimension of the detection frame;
step (7): after the detection is completed, the detection results of the overlapping and low confidence are removed through non-maximum suppression so as to screen out the final crack detection results, and the YOLOv8 target detection module outputs the detection results containing the crack position and the category information in the form of a boundary frame.
As a further scheme of the invention, the semantic segmentation module identifies and classifies the specific steps as follows:
step I: the semantic segmentation module extracts crack areas from the detection result, extracts corresponding crack images from the original images for each group of crack areas, and then performs feature extraction on the crack images through the convolution and pooling layers by the semantic segmentation model so as to capture detail and texture information of cracks;
step II: decoding the acquired data through a decoder to acquire a pixel-level segmentation result, distributing each group of pixels in the crack image to a crack type or a background type, removing small segmentation areas, filling the segmentation areas to connect broken parts, distributing the crack type to each crack area through counting the number of the crack pixels or calculating the proportion of the crack pixels, and outputting a high-resolution crack segmentation mask;
step III: collecting various knowledge and information related to pavement cracks from the Internet and a crack database, classifying, de-duplicating and screening the collected pavement crack knowledge, identifying and extracting entities in the processed pavement crack knowledge through an NLP technology, extracting corresponding attributes of each entity from the related knowledge information, and establishing a relation among the entities to form connection of pavement crack knowledge maps;
step IV: processing the entity, attribute and relation into a corresponding graph structure in a triplet mode, selecting a proper graph database to store and manage the pavement crack knowledge graph, continuously updating and maintaining the pavement crack knowledge graph, matching the crack entity in the crack area with the corresponding entity in the knowledge graph to analyze and inquire, analyzing and inquiring the crack corresponding type information according to the analysis result, checking with a crack segmentation mask, and if deviation exists, feeding back to a worker for manual checking.
As a further scheme of the invention, the training update module adjusts and optimizes the concrete steps as follows:
step (1): the training updating module collects detection data output by a plurality of groups of YOLOv8 model models or semantic segmentation models, selects a group of observation data from the detection data as verification data, uses the rest of the observation data to fit a group of test models, uses the verification data to verify the accuracy of the test models, and repeatedly calculates the prediction capacity of the prediction models through root mean square errors;
step (2): initializing a parameter range, establishing a data sample, listing all possible data results, dividing the sample, selecting any subset of each group of data as a test set, selecting other subsets as training sets, predicting the test set after training a model, and counting the root mean square error of the test result;
step (3): and replacing the test set with another subset, taking the rest subset as a training set, counting root mean square errors again until all data are predicted once, selecting the corresponding combined parameter with the minimum root mean square error as the optimal parameter in the data interval, and replacing the original parameter of the corresponding model.
As a further scheme of the invention, the data storage of the storage management module comprises the following specific steps:
the first step: dividing each group of system data according to a preset time interval to obtain a plurality of groups of data blocks, generating the identification of each group of data blocks through a hash algorithm, and collecting each group of node information;
and a second step of: selecting proper nodes to store each group of data blocks according to the data block dividing rule and the node load condition through a load balancing algorithm, and after the data block storage is completed, configuring and copying a specified number of data blocks to a plurality of groups of nodes according to the requirements of a system and available resources;
and a third step of: when the data stored by the nodes changes, the data update is transmitted from one node to other nodes through a data synchronization algorithm, then the node operation condition is automatically detected, and the data migration or repair is carried out on the fault node.
Compared with the prior art, the invention has the beneficial effects that:
1. according to the invention, input image features are extracted through a trained YOLOv8 model and target detection is carried out to obtain target detection results, then the model decodes the detection results in the feature map into coordinates and size information of boundary boxes and class probability, each group of detection boxes is matched with the detection results in the feature map to determine the best matched detection boxes, then the final crack detection results are screened out, after a semantic segmentation module extracts crack regions from the detection results, a decoder is used for decoding acquired data to obtain pixel-level segmentation results, each group of pixels in the crack image is allocated to a crack class or background class, then small segmentation regions are removed, the segmentation regions are filled to be connected with broken parts, the crack class is allocated to each crack region through statistics of the number of crack pixels or calculation of the proportion of the crack pixels, meanwhile, a group of crack knowledge is constructed, a crack entity in the crack region is matched with a corresponding entity in the knowledge map to inquire, the crack corresponding type information is inquired according to the result mask, if deviation exists, the crack is fed back manually, the analysis is carried out to the analysis of the crack class information, the crack class is more accurately and the analysis is carried out, the analysis is better, the analysis is carried out, and the analysis is better is carried out on the analysis is carried out on the crack analysis state, and the analysis is better, and the system is better in terms of the analysis is better, and the analysis is better in terms of the conditions.
2. The invention divides each group of system data according to a preset time interval to obtain a plurality of groups of data blocks, then generates the identification of each group of data blocks through a hash algorithm, collects each group of node information, selects proper nodes to store each group of data blocks according to a data block dividing rule and a node load condition, configures and copies a specified number of data blocks to a plurality of groups of nodes according to the requirement and available resources of the system after the data blocks are stored, when the data stored by the nodes are changed, the data updating is transmitted from one node to other nodes through a data synchronization algorithm, then automatically detects the node operation condition, and performs data migration or repair on the fault node, thereby improving the availability and fault tolerance of the system, ensuring the safety and durability of the data, avoiding the risk of data loss, improving the system performance and ensuring that the system can still normally run under the high load condition.
Drawings
The accompanying drawings are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate the invention and together with the embodiments of the invention, serve to explain the invention.
Fig. 1 is a system block diagram of a pavement crack detection system based on YOLOv8 semantic segmentation.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments.
Example 1
Referring to fig. 1, a pavement crack detection system based on YOLOv8 semantic segmentation includes an acquisition processing module, a target detection module, a semantic segmentation module, a calibration positioning module, a monitoring alarm module, a storage management module, a training update module, a monitoring platform, a report analysis module, a maintenance scheduling module and an integrated deployment module.
The acquisition processing module is used for acquiring pavement information from each group of data sources and processing and labeling the acquired data.
Specifically, after acquiring an original pavement image from each group of data sources, an acquisition processing module confirms the required image input width and input height according to preset data, adjusts each group of original pavement images through an equal proportion adjustment method, calculates scaling factors in the horizontal direction and the vertical direction, creates a group of blank images, keeps the blank image size consistent with the preset size, determines the position of each group of pixels in the adjusted pavement image in the original pavement image, calculates the gray value of each pixel through a bilinear interpolation method, fills the gray value of each pixel in the adjusted image to the corresponding position in the adjusted image to acquire a final pavement image, stores the final pavement image in the blank image, normalizes the adjusted pavement image, acquires corresponding crack position information in each pavement image, distributes a group of unique integer labels for each group of crack type boundary boxes, calculates the information of the boundary boxes, and generates labels for each group of cracks, including the type labels and the boundary box information.
In this embodiment, the specific calculation formula of the equal proportion adjustment method is as follows:
W new type =H Order of (A) ×M Original source
(2)
H New type =W Order of (A) /M Original source
(3)
Wherein M is Original source Representing the original aspect ratio case; w (W) Original source Representing the original width; h Original source Representing the original height; w (W) New type Representing a new width; h New type Representing a new width;
the scaling factors in the horizontal and vertical directions are specifically calculated as follows:
wherein α represents a horizontal scaling factor; beta represents a vertical scaling factor;
the specific calculation formula of the bilinear interpolation method is as follows:
wherein I is Order of (A) Representing a target pixel value; i Left upper part 、I Lower left 、I Upper right I Lower right Representing the most recent four raw pixel values; (delta) xy ) Representing the relative position of the target pixel in the nearest four pixel coordinates; c represents a normalization coefficient.
The object detection module is used for detecting the position of a crack in the image and generating a boundary box.
Specifically, the target detection module builds a group of YOLOv8 models, uses the weight trained by the COCO data set as the initial weight of the model, then collects the past pavement image data with labels, performs data enhancement on each group of image data and integrates the image data into pavement crack data sets, randomly divides the crack data sets into a training set, a verification set and a test set, then uses training data in the training set to train the YOLOv8 models, gradually adjusts the weight by a YOLO loss function to minimize target detection errors in the training process, evaluates the model performance by the verification set after each training, checks the generalization capability of the model on unseen data, performs fine adjustment of model parameters and super parameter adjustment according to the verification result, evaluates the final trained model by the test set after the training is finished to obtain performance indexes, the model is deployed as an API or an embedded system through an integrated deployment module, the performance of the model in practical application is continuously monitored, a pavement image subjected to data preprocessing is input into a YOLOv8 model, the YOLOv8 model extracts input image features through a plurality of convolution layers, pooling layers and full connection layers and performs target detection to obtain target detection results, target detection is performed on feature images with different scales, the detected features are sent into a bidirectional feature pyramid to perform feature fusion, the fusion results are subjected to classification regression to output detection frames, the model decodes the detection results in the feature images into coordinates and size information of boundary frames and class probabilities, each group of detection frames is matched with the detection results in the feature images to determine the best matched detection frames, the class probabilities of objects contained in each group of detection frames are predicted by using a softmax activation function, and then the position and the size of the boundary frame are adjusted according to the dimension of the feature map and the dimension of the detection frame, after detection is completed, the detection result of removing the overlapping and the detection result of low confidence coefficient are restrained by a non-maximum value so as to screen out a final crack detection result, and the YOLOv8 target detection module outputs the detection result containing the crack position and the category information in the form of the boundary frame.
The semantic segmentation module is used for further identifying and classifying the cracks and determining the type and severity of the cracks.
Specifically, the semantic segmentation module extracts crack areas from detection results, extracts corresponding crack images from original images for each group of crack areas, then the semantic segmentation model performs feature extraction on the crack images through a convolution layer and a pooling layer to capture details and texture information of cracks, decodes the obtained data through a decoder to obtain pixel-level segmentation results, distributes each group of pixels in the crack images to crack categories or background categories, removes small segmentation areas, fills the segmentation areas to connect broken parts, distributes the crack categories to each crack area through counting the number of the crack pixels or calculating the proportion of the crack pixels, simultaneously outputs high-resolution crack segmentation masks, collects various knowledge and information related to the pavement cracks from the Internet and a crack database, classifies, de-duplicates and screens the collected pavement crack knowledge, identifies and extracts entities in the pavement crack knowledge after processing through an NLP technology, extracts corresponding attributes of each entity from the related knowledge information, establishes a relation between the entities to form connection of the pavement crack knowledge, analyzes the entity, manages the corresponding relation between the entity and the map, analyzes the crack map by adopting a form of a triplet, analyzes the corresponding relation between the map and the crack map, and the map is properly analyzed and the map is analyzed by a map, and the map is analyzed and the map is then the map is analyzed and the map is not matched with the map and the map is analyzed if the map is matched with the map and the map is analyzed.
The calibration positioning module is used for correcting geographical coordinates of the crack by using a GPS positioning technology; the monitoring alarm module is used for monitoring the road surface state in real time, and if cracks are found, an alarm notification is immediately sent.
Example 2
Referring to fig. 1, a pavement crack detection system based on YOLOv8 semantic segmentation includes an acquisition processing module, a target detection module, a semantic segmentation module, a calibration positioning module, a monitoring alarm module, a storage management module, a training update module, a monitoring platform, a report analysis module, a maintenance scheduling module and an integrated deployment module.
The storage management module is used for storing and managing the collected data.
Specifically, the training update module collects detection data output by a plurality of groups of YOLOv8 model models or semantic segmentation models, selects one group of observation data from the detection data as verification data, uses the rest of observation data to simulate a group of test models, uses the verification data to verify the accuracy of the test models, repeatedly calculates the prediction capacity of the prediction models through root mean square errors for a plurality of times, initializes a parameter range, establishes data samples, lists all possible data results, divides the samples, selects any subset as a test set for each group of data, predicts the test set after the test model is trained, counts root mean square errors of the test results, replaces the test set with another subset, then takes the rest subset as a training set, counts root mean square errors again until all data are predicted once, and selects corresponding combination parameters, namely optimal parameters in a data interval, and replaces original parameters of the corresponding models.
The training updating module is used for adjusting the optimization target detection model and the semantic segmentation model.
Specifically, each group of system data is segmented according to a preset time interval to obtain a plurality of groups of data blocks, then the identifications of each group of data blocks are generated through a hash algorithm, each group of node information is collected, each group of data blocks are stored through selecting proper nodes according to a data block division rule and a node load condition and through a load balancing algorithm, after the data blocks are stored, a specified number of data blocks are configured and copied to a plurality of groups of nodes according to the requirements of the system and available resources, when the data stored by the nodes are changed, the data update is transmitted to other nodes from one node through a data synchronization algorithm, then the node operation condition is automatically detected, and the data migration or repair is carried out on the fault nodes.
The monitoring platform is used for providing a visual user interface for maintenance personnel and administrators, and is used for monitoring the road surface state, checking detection results and alarming information; the report analysis module is used for generating a report and analyzing data; the maintenance scheduling module is used for making a maintenance plan and scheduling maintenance team; the integrated deployment module is used for deploying the whole system to different hardware platforms and ensuring the normal operation of the system.

Claims (7)

1. The pavement crack detection system based on the YOLOv8 semantic segmentation is characterized by comprising an acquisition processing module, a target detection module, a semantic segmentation module, a calibration positioning module, a monitoring alarm module, a storage management module, a training update module, a monitoring platform, a report analysis module, a maintenance scheduling module and an integrated deployment module;
the acquisition processing module is used for acquiring pavement information from each group of data sources and processing and labeling the acquired data;
the target detection module is used for detecting the position of a crack in the image and generating a boundary box;
the semantic segmentation module is used for further identifying and classifying the cracks and determining the type and severity of the cracks;
the calibration positioning module is used for correcting geographical coordinates of the crack by using a GPS positioning technology;
the monitoring alarm module is used for monitoring the road surface state in real time, and immediately sending an alarm notice if a crack is found;
the storage management module is used for storing and managing the collected data;
the training updating module is used for adjusting the optimization target detection model and the semantic segmentation model;
the monitoring platform is used for providing a visual user interface for maintenance personnel and administrators, and is used for monitoring the road surface state, checking detection results and alarming information;
the report analysis module is used for generating a report and analyzing data;
the maintenance scheduling module is used for making a maintenance plan and scheduling maintenance team modules;
the integrated deployment module is used for deploying the whole system to different hardware platforms and ensuring the normal operation of the system.
2. The pavement crack detection system based on YOLOv8 semantic segmentation according to claim 1, wherein the specific steps of the acquisition processing module processing labels are as follows:
step one: after the acquisition processing module acquires original road surface images from each group of data sources, confirming the required image input width and input height according to preset data, and then adjusting each group of original road surface images by an equal proportion adjustment method;
step two: calculating scaling factors in the horizontal direction and the vertical direction, firstly creating a group of blank images, keeping the size of the blank images consistent with the preset size, then determining the position of each group of pixels in the adjusted pavement image in the original pavement image, calculating the gray value of each pixel through a bilinear interpolation method, filling the gray value into the corresponding position in the adjusted image to obtain a final pavement image, and storing the final pavement image on the blank image;
step three: and carrying out normalization processing on the adjusted pavement images, obtaining the position information of the corresponding cracks in each pavement image, distributing a group of unique integer labels for each group of crack class boundary boxes, calculating the information of the boundary boxes, and generating labels for each group of cracks, wherein the labels comprise class labels and boundary box information.
3. The pavement crack detection system based on YOLOv8 semantic segmentation according to claim 2, wherein the specific calculation formula of the equal proportion adjustment method in the step one is as follows:
W new type =H Order of (A) ×M Original source
(2)
H New type =W Order of (A) /M Original source
(3)
Wherein M is Original source Representing the original aspect ratio case; w (W) Original source Representing the original width; h Original source Representing the original height; w (W) New type Representing a new width; h New type Representing a new width;
the specific calculation formulas of the scaling factors in the horizontal direction and the vertical direction are as follows:
wherein α represents a horizontal scaling factor; beta represents a vertical scaling factor;
the specific calculation formula of the bilinear interpolation method is as follows:
wherein I is Order of (A) Representing a target pixel value; i Left upper part 、I Lower left 、I Upper right I Lower right Representing the most recent four raw pixel values; (delta) xy ) Representing the relative position of the target pixel in the nearest four pixel coordinates; c represents a normalization coefficient.
4. The pavement crack detection system based on YOLOv8 semantic segmentation according to claim 2, wherein the specific steps of the target detection module crack detection are as follows:
step (1): the target detection module builds a group of YOLOv8 models, uses the weight trained by the COCO data set as the initial weight of the model, then collects the past road surface image data with labels, and then carries out data enhancement on each group of image data and integrates the image data into a road surface crack data set;
step (2): randomly dividing a crack data set into a training set, a verification set and a test set, then training a YOLO v8 model by using training data in the training set, and gradually adjusting weights of the model through a YOLO loss function in the training process to minimize a target detection error;
step (3): after each training is finished, evaluating the performance of the model through a verification set, checking the generalization capability of the model on unseen data, adjusting the fine tuning and super-parameters of the model parameters according to the verification result, evaluating the finally trained model through a test set after the training is finished, acquiring performance indexes, deploying the model into an API or an embedded system through an integrated deployment module, and continuously monitoring the performance of the model in practical application;
step (4): inputting the pavement image subjected to data preprocessing into a YOLOv8 model, wherein the YOLOv8 model extracts input image features through a plurality of convolution layers, pooling layers and full-connection layers and performs target detection to obtain a target detection result;
step (5): performing target detection on feature graphs with different scales, sending the detected features into a bidirectional feature pyramid, performing feature fusion, performing classification regression on fusion results to output a detection frame, and decoding the detection results in the feature graphs into coordinate and size information and category probability of a boundary frame by a model;
step (6): matching each group of detection frames with the detection result in the feature map to determine the best matched detection frame, predicting the object category probability contained in each group of detection frames by using a softmax activation function, and adjusting the position and the size of the boundary frame according to the dimension of the feature map and the dimension of the detection frame;
step (7): after the detection is completed, the detection results of the overlapping and low confidence are removed through non-maximum suppression so as to screen out the final crack detection results, and the YOLOv8 target detection module outputs the detection results containing the crack position and the category information in the form of a boundary frame.
5. The pavement crack detection system based on YOLOv8 semantic segmentation according to claim 4, wherein the semantic segmentation module identifies and classifies the specific steps as follows:
step I: the semantic segmentation module extracts crack areas from the detection result, extracts corresponding crack images from the original images for each group of crack areas, and then performs feature extraction on the crack images through the convolution and pooling layers by the semantic segmentation model so as to capture detail and texture information of cracks;
step II: decoding the acquired data through a decoder to acquire a pixel-level segmentation result, distributing each group of pixels in the crack image to a crack type or a background type, removing small segmentation areas, filling the segmentation areas to connect broken parts, distributing the crack type to each crack area through counting the number of the crack pixels or calculating the proportion of the crack pixels, and outputting a high-resolution crack segmentation mask;
step III: collecting various knowledge and information related to pavement cracks from the Internet and a crack database, classifying, de-duplicating and screening the collected pavement crack knowledge, identifying and extracting entities in the processed pavement crack knowledge through an NLP technology, extracting corresponding attributes of each entity from the related knowledge information, and establishing a relation among the entities to form connection of pavement crack knowledge maps;
step IV: processing the entity, attribute and relation into a corresponding graph structure in a triplet mode, selecting a proper graph database to store and manage the pavement crack knowledge graph, continuously updating and maintaining the pavement crack knowledge graph, matching the crack entity in the crack area with the corresponding entity in the knowledge graph to analyze and inquire, analyzing and inquiring the crack corresponding type information according to the analysis result, checking with a crack segmentation mask, and if deviation exists, feeding back to a worker for manual checking.
6. The pavement crack detection system based on YOLOv8 semantic segmentation according to claim 5, wherein the training update module adjusts and optimizes the concrete steps as follows:
step (1): the training updating module collects detection data output by a plurality of groups of YOLOv8 model models or semantic segmentation models, selects a group of observation data from the detection data as verification data, uses the rest of the observation data to fit a group of test models, uses the verification data to verify the accuracy of the test models, and repeatedly calculates the prediction capacity of the prediction models through root mean square errors;
step (2): initializing a parameter range, establishing a data sample, listing all possible data results, dividing the sample, selecting any subset of each group of data as a test set, selecting other subsets as training sets, predicting the test set after training a model, and counting the root mean square error of the test result;
step (3): and replacing the test set with another subset, taking the rest subset as a training set, counting root mean square errors again until all data are predicted once, selecting the corresponding combined parameter with the minimum root mean square error as the optimal parameter in the data interval, and replacing the original parameter of the corresponding model.
7. The pavement crack detection system based on YOLOv8 semantic segmentation according to claim 1, wherein the storage management module data storage specifically comprises the following steps:
the first step: dividing each group of system data according to a preset time interval to obtain a plurality of groups of data blocks, generating the identification of each group of data blocks through a hash algorithm, and collecting each group of node information;
and a second step of: selecting proper nodes to store each group of data blocks according to the data block dividing rule and the node load condition through a load balancing algorithm, and after the data block storage is completed, configuring and copying a specified number of data blocks to a plurality of groups of nodes according to the requirements of a system and available resources;
and a third step of: when the data stored by the nodes changes, the data update is transmitted from one node to other nodes through a data synchronization algorithm, then the node operation condition is automatically detected, and the data migration or repair is carried out on the fault node.
CN202311289399.5A 2023-10-07 2023-10-07 Pavement crack detection system based on YOLOv8 semantic segmentation Pending CN117237639A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311289399.5A CN117237639A (en) 2023-10-07 2023-10-07 Pavement crack detection system based on YOLOv8 semantic segmentation

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311289399.5A CN117237639A (en) 2023-10-07 2023-10-07 Pavement crack detection system based on YOLOv8 semantic segmentation

Publications (1)

Publication Number Publication Date
CN117237639A true CN117237639A (en) 2023-12-15

Family

ID=89087824

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311289399.5A Pending CN117237639A (en) 2023-10-07 2023-10-07 Pavement crack detection system based on YOLOv8 semantic segmentation

Country Status (1)

Country Link
CN (1) CN117237639A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117952977A (en) * 2024-03-27 2024-04-30 山东泉海汽车科技有限公司 Pavement crack identification method, device and medium based on improvement yolov s

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117952977A (en) * 2024-03-27 2024-04-30 山东泉海汽车科技有限公司 Pavement crack identification method, device and medium based on improvement yolov s
CN117952977B (en) * 2024-03-27 2024-06-04 山东泉海汽车科技有限公司 Pavement crack identification method, device and medium based on improvement yolov s

Similar Documents

Publication Publication Date Title
CN111353413B (en) Low-missing-report-rate defect identification method for power transmission equipment
US20210319561A1 (en) Image segmentation method and system for pavement disease based on deep learning
CN109165582B (en) Urban street garbage detection and cleanliness assessment method
CN112287807B (en) Remote sensing image road extraction method based on multi-branch pyramid neural network
CN109859163A (en) A kind of LCD defect inspection method based on feature pyramid convolutional neural networks
CN108961238A (en) Display screen quality determining method, device, electronic equipment and storage medium
CN108921839A (en) Continuous casting billet quality detection method, device, electronic equipment and storage medium
CN117237639A (en) Pavement crack detection system based on YOLOv8 semantic segmentation
CN111160432A (en) Automatic classification method and system for panel production defects
CN111539432A (en) Method for extracting urban road by using multi-source data to assist remote sensing image
CN117271683A (en) Intelligent analysis and evaluation method for mapping data
CN114862832A (en) Method, device and equipment for optimizing defect detection model and storage medium
CN115205255A (en) Stone automatic grading method and system based on deep learning
CN117409341B (en) Unmanned aerial vehicle illumination-based image analysis method and system
CN113971666A (en) Power transmission line machine inspection image self-adaptive identification method based on depth target detection
CN115830302B (en) Multi-scale feature extraction fusion power distribution network equipment positioning identification method
CN117197085A (en) Road rapid-inspection image pavement disease detection method based on improved YOLOv8 network
US20230314169A1 (en) Method and apparatus for generating map data, and non-transitory computer-readable storage medium
CN115170783A (en) Expressway pavement crack detection method using street view image
CN110163081A (en) SSD-based real-time regional intrusion detection method, system and storage medium
CN114140707A (en) Power grid fault inspection method
Yan et al. An automatic pavement crack detection system with FocusCrack Dataset
Ferrer-Espinoza et al. Evaluation of the use of Cascade Detection algorithms based on Machine Learning for Crack Detection in asphalt Pavements
CN113361968A (en) Power grid infrastructure worker safety risk assessment method based on artificial intelligence and big data
CN116188973B (en) Crack detection method based on cognitive generation mechanism

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination