CN115170971A - Construction safety-oriented self-supervision monocular depth estimation transfer learning method and system - Google Patents

Construction safety-oriented self-supervision monocular depth estimation transfer learning method and system Download PDF

Info

Publication number
CN115170971A
CN115170971A CN202210924807.9A CN202210924807A CN115170971A CN 115170971 A CN115170971 A CN 115170971A CN 202210924807 A CN202210924807 A CN 202210924807A CN 115170971 A CN115170971 A CN 115170971A
Authority
CN
China
Prior art keywords
depth estimation
depth
network model
monocular depth
supervision
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210924807.9A
Other languages
Chinese (zh)
Inventor
郑小玉
刘自强
童宁波
朱华蓉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
PowerChina Chengdu Engineering Co Ltd
Original Assignee
PowerChina Chengdu Engineering Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by PowerChina Chengdu Engineering Co Ltd filed Critical PowerChina Chengdu Engineering Co Ltd
Priority to CN202210924807.9A priority Critical patent/CN115170971A/en
Publication of CN115170971A publication Critical patent/CN115170971A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/10Terrestrial scenes
    • G06V20/176Urban or other man-made structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/774Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks

Abstract

The invention discloses a construction safety-oriented self-supervision monocular depth estimation transfer learning method and a system, which relate to the technical field of artificial intelligence, and the technical scheme key points are as follows: acquiring construction site images including construction equipment and workers, marking and dividing all the construction site images, and constructing a migration data set; establishing a self-supervision monocular depth estimation network model, and loading pre-training parameters of the monocular depth estimation network model; carrying out migration training on the monocular depth estimation network model according to the migration data set, wherein loss function constraint is adopted in the training process; and evaluating the performance of the monocular depth estimation network model after each training by using the quantitative evaluation indexes, and screening to obtain the optimal monocular depth estimation network model. The invention combines the structure characteristics of the coder and the decoder of the self-supervision monocular depth estimation model, and realizes the migration of the self-supervision monocular depth estimation network model from the scenes of automatic driving and the like to the construction scene by utilizing the hierarchical distinction of the scene depth.

Description

Construction safety-oriented self-supervision monocular depth estimation transfer learning method and system
Technical Field
The invention relates to the technical field of artificial intelligence, in particular to a construction safety-oriented self-supervision monocular depth estimation transfer learning method and system.
Background
Estimating the depth of a construction scene from a single RGB image is a key prerequisite for various applications, including workspace security, localization, productivity analysis, activity recognition, and scene understanding. The self-supervised monocular depth estimation approach utilizes a large amount of unlabeled data to train the depth metric and the method, even better than some supervised approaches.
Currently, the practical application of this type of method to construction sites has inevitable limitations. In the field of self-supervised monocular depth estimation, the main datasets used for training and evaluation, such as the KITTI dataset and the DDAD dataset, are for the autopilot task, which includes true depth data for training, validation and testing of the model. Knowledge migration between the autonomous driving and construction scenario analyses will improve characterization learning in downstream tasks due to differences in feature space and data distribution. However, although we can easily solve the migration learning problem between autonomous driving tasks, such as model migration from the KITTI data set to the DDAD data set, migrating tasks to a building construction scenario remains a huge challenge, because true depth values like lidar data are not available at the construction site, migrating and evaluating models is a challenging problem.
Therefore, how to research and design a self-supervision monocular depth estimation migration learning method and system for construction safety, which can overcome the above defects, is a problem that needs to be solved urgently at present.
Disclosure of Invention
In order to solve the defects in the prior art, the invention aims to provide a construction safety-oriented self-supervision monocular depth estimation migration learning method and system, which are combined with the structural characteristics of a coder and a decoder of a self-supervision monocular depth estimation model, realize the migration of the self-supervision monocular depth estimation network model from automatic driving and other scenes to construction scenes by utilizing the hierarchical distinction of scene depth, and provide the reliable evaluation index estimation migration effect.
The technical purpose of the invention is realized by the following technical scheme:
in a first aspect, a construction safety-oriented self-supervision monocular depth estimation transfer learning method is provided, which comprises the following steps:
acquiring construction site images including construction equipment and workers, marking and dividing all the construction site images, and constructing a migration data set;
establishing a self-supervision monocular depth estimation network model, and loading pre-training parameters of the monocular depth estimation network model;
carrying out migration training on the monocular depth estimation network model according to the migration data set, wherein loss function constraint is adopted in the training process;
and evaluating the performance of the monocular depth estimation network model after each training by using the quantitative evaluation index, and screening to obtain the optimal monocular depth estimation network model.
Furthermore, a marking frame in the construction site image is marked by adopting a rectangular boundary frame;
the rectangular bounding box is marked in the region with clear depth level and equal internal depth;
the labels of the rectangular bounding boxes are the serial numbers of all the selected rectangular bounding boxes from small to large according to the depth.
Further, the depth loss calculated by the loss function includes:
loss due to depth deviation between the random variable and the corresponding average value in each labeling box;
and the loss corresponding to the depth sequence among all the labeling frames.
Further, a calculation formula of the loss caused by the depth deviation is specifically as follows:
Figure BDA0003778541400000021
wherein L is dd Represents the loss caused by the depth deviation; n represents the number of the labeling frames in the current image; d' i Then representing D' of the ith labeling frame area in the current image; d' represents a depth map obtained by normalizing the depth map predicted by the depth estimation network in the current image; sigma (D' i ) Denotes a random variable D' i The standard deviation of (a);
the calculation formula of the depth map is specifically as follows:
Figure BDA0003778541400000022
d represents a depth map which is obtained by prediction of a depth estimation network and corresponds to the current image; d max 、D min Respectively representing the maximum and minimum values in D.
Further, the loss calculation formula corresponding to the depth order is specifically:
Figure BDA0003778541400000023
wherein L is do Representing the loss corresponding to the depth sequence; n represents the number of the labeling frames in the current image; v ij Representing the inter-frame distance estimation loss between the ith standard frame and the jth standard frame; i and j are integers, which represent the serial number of the labeling frame in the image, and the larger the serial number is, the larger the average depth value of the labeling frame is;
the calculation formula of the distance estimation loss is specifically as follows:
Figure BDA0003778541400000024
wherein μ (·) represents a function for solving the mean of the corresponding random variable;
Figure BDA0003778541400000025
to avoid the perturbation that trivial solution adds.
Further, the migration training process of the monocular depth estimation network model specifically includes:
loading a pre-trained PackNet network model on a Kitti automatic driving data set, and freezing an Encoder module of the model;
loading a training picture and a label of a data set;
training a PackNet model, and constraining the Decoder module parameters of the model by adopting a migration loss function;
the model after each round of training was tested using the validation set and evaluation index of the data set.
Further, the performance evaluation process of the monocular depth estimation network model specifically comprises the following steps:
carrying out comprehensive evaluation according to the relative average error of the depth deviation and the sequence accuracy of the depth sequence;
and if the relative average error is lower and the sequence accuracy is higher, the performance of the monocular depth estimation network model is good.
In a second aspect, a construction safety-oriented self-supervision monocular depth estimation transfer learning system is provided, which includes:
the data acquisition module is used for acquiring construction site images including construction equipment and workers, marking and dividing all the construction site images and constructing a migration data set;
the model building module is used for building a self-supervision monocular depth estimation network model and loading pre-training parameters of the monocular depth estimation network model;
the migration training module is used for carrying out migration training on the monocular depth estimation network model according to the migration data set, and loss function constraint is adopted in the training process;
and the evaluation screening module is used for evaluating the performance of the monocular depth estimation network model after each training by utilizing the quantitative evaluation indexes and screening to obtain the optimal monocular depth estimation network model.
In a third aspect, a computer terminal is provided, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, and when the processor executes the program, the method for self-supervision monocular depth estimation migration learning for construction safety is implemented as described in any one of the first aspect.
In a fourth aspect, a computer-readable medium is provided, on which a computer program is stored, where the computer program is executed by a processor, and the construction safety-oriented self-supervision monocular depth estimation migration learning method according to any one of the first aspect may be implemented.
Compared with the prior art, the invention has the following beneficial effects:
1. the construction safety-oriented self-supervision monocular depth estimation migration learning method provided by the invention combines the structural characteristics of a coder and a decoder of a self-supervision monocular depth estimation model, realizes the migration of the self-supervision monocular depth estimation network model from scenes such as automatic driving and the like to construction scenes by utilizing the hierarchical distinction of scene depth, and provides a reliable evaluation index estimation migration effect;
2. according to the method, a new learning paradigm comprising a self-supervision monocular depth estimation model migration method, a loss function, an evaluation index and a depth marking method is established, so that the problem of model domain drift caused by the fact that a construction scene of a self-supervision monocular depth prediction model has no real depth data is solved, and the depth estimation precision of a downstream task is improved.
Drawings
The accompanying drawings, which are included to provide a further understanding of the embodiments of the invention and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the invention and together with the description serve to explain the principles of the invention. In the drawings:
FIG. 1 is a flow chart in an embodiment of the invention;
fig. 2 is a block diagram of a system in an embodiment of the invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is further described in detail below with reference to examples and accompanying drawings, and the exemplary embodiments and descriptions thereof are only used for explaining the present invention and are not meant to limit the present invention.
Example 1: the construction safety-oriented self-supervision monocular depth estimation transfer learning method comprises the following steps as shown in figure 1:
s1: acquiring construction site images including construction equipment and workers, marking and dividing all the construction site images, and constructing a migration data set;
s2: establishing a self-supervision monocular depth estimation network model, and loading pre-training parameters of the monocular depth estimation network model;
s3: carrying out migration training on the monocular depth estimation network model according to the migration data set, wherein loss function constraint is adopted in the training process;
s4: and evaluating the performance of the monocular depth estimation network model after each training by using the quantitative evaluation index, and screening to obtain the optimal monocular depth estimation network model.
Labeling a data set picture by adopting a method similar to the commonly used object rectangular bounding box labeling method in the field of target detection, wherein the difference is that the rectangular bounding box is a region which is labeled at any depth level and has the same internal depth; the labels of the rectangular bounding boxes are the serial numbers of all the selected rectangular bounding boxes from small to large according to the depth.
The migration loss function, instead of directly calculating the depth loss of each pixel, compares the depth deviation of the random variable from its mean value in each labeled box and the depth order among all labeled boxes, which can be directly applied to the depth loss.
The calculation formula of the loss function is specifically as follows:
L=αL do +L dd
wherein L represents the depth loss calculated by the loss function; l is dd Is a loss due to depth variation; l is d o is the loss of depth order; α is a weight adjustment parameter, α =1/3.
The formula for calculating the loss caused by the depth deviation is specifically as follows:
Figure BDA0003778541400000041
wherein L is dd Represents the loss due to depth deviation; n represents the number of the labeling frames in the current image; d' i Then representing D' of the ith label frame area in the current image; d' represents a depth map obtained by normalizing the depth map predicted by the depth estimation network and corresponding to the current image; sigma (D' i ) Denotes a random variable D' i Standard deviation of (d).
The calculation formula of the depth map is specifically as follows:
Figure BDA0003778541400000051
d represents a depth map which is obtained by prediction of a depth estimation network and corresponds to the current image; d max 、D min Respectively representing the maximum and minimum values in D.
The loss calculation formula corresponding to the depth order is specifically as follows:
Figure BDA0003778541400000052
wherein L is do Representing the loss corresponding to the depth sequence; n represents the number of the labeling frames in the current image; v ij Representing the inter-frame distance estimation loss between the ith standard frame and the jth standard frame; i and j are integers, represent the number of the labeling frame in the image, and the larger the number is, the larger the average depth value of the labeling frame is.
The calculation formula of the distance estimation loss is specifically as follows:
Figure BDA0003778541400000053
wherein μ (·) represents a function for solving the mean of the corresponding random variable;
Figure BDA0003778541400000054
to avoid the perturbation that trivial solution adds.
The transfer training process of the monocular depth estimation network model specifically comprises the following steps: loading a pre-trained PackNet network model on a Kitti automatic driving data set, and freezing an Encoder module of the model; loading a training picture and a label of a data set; training a PackNet model, and constraining Decoder module parameters of the model by adopting a migration loss function; the model after each round of training was tested using the validation set and evaluation index of the data set.
The evaluation index includes two accuracy indexes, and each test model is evaluated according to two losses of depth deviation and depth sequence. The first indicator is a Relative Mean Error (RME) indicator, which is defined as:
Figure BDA0003778541400000055
wherein, d j Is the annotation box Y in the corresponding predicted depth image i Pixels of the region; μ X i Is the comment box Y i A depth average of the region; n is the comment box Y i The number of pixels of a region.
Another indicator is the Order Accuracy (OAcc) indicator, which is defined as:
Figure BDA0003778541400000056
where K is the total number of comment boxes in the picture.
δ ij The calculation formula of (c) is:
Figure BDA0003778541400000057
Figure BDA0003778541400000061
wherein the content of the first and second substances,
Figure BDA0003778541400000062
representing the real depth relation of two labeling boxes (IO represents ascending order and DO represents descending order);
Figure BDA0003778541400000063
representing a normal difference distribution of two random variables; x i The distribution of the depth values of the ith mark box area is represented; r (X) i ,X j ) Indicating that the depth relation function of the two annotation boxes is calculated based on the predicted depth.
The formula for the Label (x) is:
Figure BDA0003778541400000064
in evaluating the depth estimation model, the higher the OAcc, the better the model with lower RME.
Example 2: the construction safety-oriented self-supervision monocular depth estimation transfer learning system is used for realizing the self-supervision monocular depth estimation transfer learning method described in the embodiment 1, and comprises a data acquisition module, a model construction module, a transfer training module and an evaluation screening module as shown in fig. 2.
The data acquisition module is used for acquiring construction site images including construction equipment and workers, marking and dividing all the construction site images and constructing a migration data set; the model building module is used for building a self-supervision monocular depth estimation network model and loading pre-training parameters of the monocular depth estimation network model; the migration training module is used for carrying out migration training on the monocular depth estimation network model according to the migration data set, and loss function constraint is adopted in the training process; and the evaluation screening module is used for evaluating the performance of the monocular depth estimation network model after each training by utilizing the quantitative evaluation indexes and screening to obtain the optimal monocular depth estimation network model.
The working principle is as follows: the invention combines the structural characteristics of the coder and the decoder of the self-supervision monocular depth estimation model, realizes the migration of the self-supervision monocular depth estimation network model from the scenes of automatic driving and the like to the construction scene by utilizing the hierarchical distinction of the scene depth, and provides the reliable evaluation index evaluation migration effect; by constructing a new learning paradigm including an auto-supervision monocular depth estimation model migration method, a loss function, an evaluation index and a depth marking method, the problem of model domain drift caused by the fact that the auto-supervision monocular depth prediction model has no real depth data in a construction scene is solved, and the depth estimation precision of a downstream task is improved.
As will be appreciated by one skilled in the art, embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the application. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
The above embodiments, objects, technical solutions and advantages of the present invention are further described in detail, it should be understood that the above embodiments are only examples of the present invention, and are not intended to limit the scope of the present invention, and any modifications, equivalent substitutions, improvements and the like made within the spirit and principle of the present invention should be included in the scope of the present invention.

Claims (10)

1. The construction safety-oriented self-supervision monocular depth estimation transfer learning method is characterized by comprising the following steps of:
acquiring construction site images including construction equipment and workers, marking and dividing all the construction site images, and constructing a migration data set;
establishing a self-supervision monocular depth estimation network model, and loading pre-training parameters of the monocular depth estimation network model;
carrying out migration training on the monocular depth estimation network model according to the migration data set, wherein loss function constraint is adopted in the training process;
and evaluating the performance of the monocular depth estimation network model after each training by using the quantitative evaluation indexes, and screening to obtain the optimal monocular depth estimation network model.
2. The construction safety-oriented self-supervision monocular depth estimation migration learning method of claim 1, characterized in that a labeling frame in the construction site image is labeled by a rectangular bounding box;
the rectangular bounding box is marked in the region with clear depth level and equal internal depth;
the labels of the rectangular bounding boxes are the serial numbers of all the selected rectangular bounding boxes from small to large according to the depth.
3. The construction safety-oriented self-supervision monocular depth estimation transfer learning method of claim 1, wherein the depth loss calculated by the loss function comprises:
loss due to depth deviation between the random variable and the corresponding average value in each labeling box;
and the loss corresponding to the depth sequence among all the labeling frames.
4. The construction safety-oriented self-supervision monocular depth estimation transfer learning method of claim 3, wherein the loss calculation formula caused by the depth deviation is specifically:
Figure FDA0003778541390000011
wherein L is dd Represents the loss caused by the depth deviation; n represents the number of the labeling frames in the current image; d' i Then representing D' of the ith label frame area in the current image; d' represents a depth map obtained by normalizing the depth map predicted by the depth estimation network in the current image; sigma (D' i ) Denotes a random variable D' i Standard deviation of (d);
the calculation formula of the depth map is specifically as follows:
Figure FDA0003778541390000012
d represents a depth map which is obtained by prediction of a depth estimation network and corresponds to the current image; d max 、D min Respectively representing the maximum and minimum values in D.
5. The construction safety-oriented self-supervision monocular depth estimation migration learning method of claim 3, wherein the loss calculation formula corresponding to the depth order is specifically:
Figure FDA0003778541390000013
wherein L is do Representing the loss corresponding to the depth sequence; n represents the number of the labeling frames in the current image; v ij Representing the inter-frame distance estimation loss between the ith standard frame and the jth standard frame; i and j are integers, represent the serial number of the labeling frame in the image, and the larger the serial number is, the larger the average depth value of the labeling frame is;
the calculation formula of the distance estimation loss is specifically as follows:
Figure FDA0003778541390000021
wherein μ (·) represents a function for solving the mean of the corresponding random variable;
Figure FDA0003778541390000022
in order to avoid perturbations added by trivial solutions.
6. The construction safety-oriented self-supervision monocular depth estimation migration learning method according to any one of claims 1 to 5, characterized in that the migration training process of the monocular depth estimation network model specifically comprises:
loading a pre-trained PackNet network model on a Kitti automatic driving data set, and freezing an Encoder module of the model;
loading a training picture and a label of a data set;
training a PackNet model, and constraining Decoder module parameters of the model by adopting a migration loss function;
the model after each round of training was tested using the validation set and evaluation index of the data set.
7. The construction safety-oriented self-supervision monocular depth estimation migration learning method according to any one of claims 1 to 5, characterized in that the performance evaluation process of the monocular depth estimation network model specifically comprises:
carrying out comprehensive evaluation according to the relative average error of the depth deviation and the sequence accuracy of the depth sequence;
and if the relative average error is lower and the sequence accuracy is higher, the performance of the monocular depth estimation network model is good.
8. A construction safety-oriented self-supervision monocular depth estimation transfer learning system is characterized by comprising:
the data acquisition module is used for acquiring construction site images including construction equipment and workers, marking and dividing all the construction site images and constructing a migration data set;
the model building module is used for building a self-supervision monocular depth estimation network model and loading pre-training parameters of the monocular depth estimation network model;
the migration training module is used for carrying out migration training on the monocular depth estimation network model according to the migration data set, and loss function constraint is adopted in the training process;
and the evaluation screening module is used for evaluating the performance of the monocular depth estimation network model after each training by utilizing the quantitative evaluation indexes and screening to obtain the optimal monocular depth estimation network model.
9. A computer terminal comprising a memory, a processor and a computer program stored in the memory and executable on the processor, wherein the processor implements the method for construction safety oriented self-supervision monocular depth estimation migration learning according to any one of claims 1-7 when executing the program.
10. A computer-readable medium, on which a computer program is stored, the computer program being executable by a processor to implement the method for self-supervised monocular depth estimation migration learning for construction safety according to any one of claims 1 to 7.
CN202210924807.9A 2022-08-02 2022-08-02 Construction safety-oriented self-supervision monocular depth estimation transfer learning method and system Pending CN115170971A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210924807.9A CN115170971A (en) 2022-08-02 2022-08-02 Construction safety-oriented self-supervision monocular depth estimation transfer learning method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210924807.9A CN115170971A (en) 2022-08-02 2022-08-02 Construction safety-oriented self-supervision monocular depth estimation transfer learning method and system

Publications (1)

Publication Number Publication Date
CN115170971A true CN115170971A (en) 2022-10-11

Family

ID=83476771

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210924807.9A Pending CN115170971A (en) 2022-08-02 2022-08-02 Construction safety-oriented self-supervision monocular depth estimation transfer learning method and system

Country Status (1)

Country Link
CN (1) CN115170971A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117268345A (en) * 2023-11-20 2023-12-22 启元实验室 High-real-time monocular depth estimation measurement method and device and electronic equipment
CN117372492A (en) * 2023-12-04 2024-01-09 浙江优众新材料科技有限公司 Method and device for generating depth estimation model based on semi-supervision and electronic equipment

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117268345A (en) * 2023-11-20 2023-12-22 启元实验室 High-real-time monocular depth estimation measurement method and device and electronic equipment
CN117268345B (en) * 2023-11-20 2024-03-29 启元实验室 High-real-time monocular depth estimation measurement method and device and electronic equipment
CN117372492A (en) * 2023-12-04 2024-01-09 浙江优众新材料科技有限公司 Method and device for generating depth estimation model based on semi-supervision and electronic equipment

Similar Documents

Publication Publication Date Title
CN115170971A (en) Construction safety-oriented self-supervision monocular depth estimation transfer learning method and system
CN107872644A (en) Video frequency monitoring method and device
CN102810161B (en) Method for detecting pedestrians in crowding scene
CN108171217A (en) A kind of three-dimension object detection method based on converged network
CN110210398A (en) A kind of three-dimensional point cloud semantic segmentation mask method
CN109580004A (en) A kind of temperature checking method and device
CN110956646B (en) Target tracking method, device, equipment and storage medium
CN112990222B (en) Image boundary knowledge migration-based guided semantic segmentation method
Guo et al. Evaluation-oriented façade defects detection using rule-based deep learning method
CN112070071B (en) Method and device for labeling objects in video, computer equipment and storage medium
CN105046714A (en) Unsupervised image segmentation method based on super pixels and target discovering mechanism
CN115471625A (en) Cloud robot platform big data intelligent decision method and system
CN101908214A (en) Moving object detection method with background reconstruction based on neighborhood correlation
CN114565803A (en) Method, device and mechanical equipment for extracting difficult sample
CN110188495A (en) A method of the two-dimentional floor plan based on deep learning generates three-dimensional floor plan
CN114463932A (en) Non-contact construction safety distance active dynamic recognition early warning system and method
CN113052103A (en) Electrical equipment defect detection method and device based on neural network
CN112418149A (en) Abnormal behavior detection method based on deep convolutional neural network
Flotzinger et al. Building inspection toolkit: Unified evaluation and strong baselines for damage recognition
CN114998570B (en) Method and device for determining object detection frame, storage medium and electronic device
CN114299012A (en) Object surface defect detection method and system based on convolutional neural network
CN104050665A (en) Method and device for estimating foreground dwell time in video image
CN115631457A (en) Man-machine cooperation abnormity detection method and system in building construction monitoring video
CN115187884A (en) High-altitude parabolic identification method and device, electronic equipment and storage medium
CN111898422A (en) Power equipment identification method, system and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination