CN113673484A - Road condition identification and decision-making method in unmanned driving scene - Google Patents
- Publication number
- CN113673484A (application CN202111058219.3A)
- Authority
- CN
- China
- Prior art keywords
- vehicle
- map
- environment
- decision
- road condition
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
- G06F18/24 — Pattern recognition; classification techniques
- G06N3/045 — Neural networks; combinations of networks
- G06N3/08 — Neural networks; learning methods
- G06T17/05 — 3D modelling; geographic models
- G06T17/20 — 3D modelling; finite element generation, e.g. wire-frame surface description, tessellation
- G06T3/4038 — Geometric image transformations; image mosaicing
- G06T5/50 — Image enhancement or restoration using two or more images
- G06T2200/32 — Indexing scheme involving image mosaicing
- G06T2207/10028 — Range image; depth image; 3D point clouds
- G06T2207/20221 — Image fusion; image merging
- G06T2207/30252 — Vehicle exterior; vicinity of vehicle
Abstract
The invention discloses a road condition identification and decision-making method in an unmanned driving scene, comprising the following steps: first, the vehicle builds a deep convolutional neural network to train a picture expansion model and obtains the trained model; second, the vehicle performs relative positioning of its own position in an unknown environment and captures pictures through a sensor; third, the vehicle identifies and classifies the environment by matching the pictures captured by the sensor against the trained model. With the technology of the invention, given the same data set of road-condition pictures in an unmanned driving scene, the road condition types in a picture can be identified while occupying less storage space; the algorithm achieves better accuracy in a dynamic environment; decision opinions guiding unmanned driving can be generated and returned to the unmanned control system; and the global semantic information obtained by semantic segmentation helps the robot navigate and plan paths, thereby significantly improving the intelligence level of the mobile vehicle.
Description
Technical Field
The invention belongs to the technical field of computer image processing, and particularly relates to a road condition identification and decision-making method in an unmanned driving scene.
Background
In the field of computer image processing, a map representation based on feature descriptors can accomplish the visual positioning task of a vehicle. When applied to road condition recognition and exploration in an unmanned driving scene, however, this approach has the following drawbacks: a traditional feature-descriptor-based map occupies more storage space and has a larger estimation error, so it cannot be used directly in the development of an unmanned control system.
Disclosure of Invention
The present invention is directed to a road condition identification and decision-making method in an unmanned driving scene, so as to solve the problems raised in the background art.
In order to achieve the purpose, the invention provides the following technical scheme: a road condition identification and decision-making method in an unmanned driving scene is characterized in that: the method comprises the following steps:
step one, the vehicle builds a deep convolutional neural network to train a picture expansion model and obtains the trained model;
step two, the vehicle performs relative positioning of its own position in an unknown environment and captures pictures through a sensor;
step three, the vehicle identifies and classifies the environment by matching the pictures captured by the sensor against the trained model;
step four, the vehicle starts mobile exploration through an unknown-map path exploration algorithm, performing self-positioning by observing the environment while building an incremental map;
step five, the vehicle fuses multi-sensor data to construct and update the map in real time during exploration until the map construction is completed;
step six, the vehicle control system makes a decision according to the sensed information.
Preferably, in the third step, the pose estimation of the visual SLAM algorithm is adopted.
Preferably, in the fourth step, local map splicing and global map model construction are realized through point cloud splicing and filtering technology.
Preferably, in the fifth step, the semantic map in the dynamic environment is constructed by combining semantic information with the SLAM algorithm adopted by the invention.
Compared with the prior art, the invention has the following beneficial effects: given the same data set of road-condition pictures in an unmanned driving scene, the road condition types in a picture can be identified while occupying less storage space; the algorithm achieves better accuracy in a dynamic environment; decision opinions guiding unmanned driving can be generated and returned to the unmanned control system; and the global semantic information obtained by semantic segmentation helps the robot navigate and plan paths, thereby significantly improving the intelligence level of the mobile vehicle.
Drawings
FIG. 1 is a schematic diagram of the principles of the present invention;
FIG. 2 is a schematic view of the workflow of pose estimation of the visual SLAM algorithm of the present invention;
FIG. 3 is a schematic diagram of a method for local map stitching and global map model construction according to the present invention;
FIG. 4 is a schematic diagram of the single-frame semantic point generation algorithm of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Example 1
Referring to fig. 1 to 4, the present invention provides a technical solution: a method for road condition identification and decision-making in an unmanned driving scene, comprising: 1) Convolutional-neural-network-based image semantic segmentation: a fully convolutional neural network model for semantic segmentation is built and trained on a data set containing 11335 semantic segmentation images covering 20 object categories, at a resolution of about 500 x 300. For the semantic segmentation task, both the training set and the validation set carry corresponding labels, which store the semantic category of each pixel in the image. If other categories need to be recognized, the trained network can be fine-tuned with an additional data set. Pictures are cropped so that the input resolution is unified to 360 x 480. A multi-step learning rate schedule is adopted: a larger learning rate is used early in neural network training and, each time a set number of training iterations is reached, the learning rate is decayed to 0.1 times its previous value. This lets the network approach the optimum quickly without oscillating back and forth around it.
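The step-decay schedule described above (a larger rate early, decayed to 0.1 times at set intervals) can be sketched in plain Python. The base rate of 0.01 and the 30-epoch decay interval are illustrative assumptions, not values from the patent.

```python
def step_decay_lr(base_lr, epoch, step_size, gamma=0.1):
    """Learning rate for `epoch`: the base rate is multiplied by `gamma`
    (0.1, per the text) each time `step_size` epochs have elapsed."""
    return base_lr * (gamma ** (epoch // step_size))

# A larger rate early in training, decayed stepwise as training proceeds.
schedule = [step_decay_lr(0.01, e, step_size=30) for e in (0, 30, 60)]
```

With a deep learning framework, the same behaviour would typically come from a built-in step scheduler rather than hand-rolled code; this sketch only illustrates the decay rule itself.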
2) Pose estimation with the visual SLAM algorithm: following the workflow of fig. 2, the position and orientation of the camera are estimated with a feature-point-based SLAM algorithm; projections of the same spatial point in different images are obtained by feature matching, the relative position and attitude between images are computed, and the motion trajectory of the camera-equipped moving vehicle is obtained.
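The core of the step above — recovering relative motion from matched features — can be illustrated with a minimal 2-D rigid-alignment (Kabsch-style) estimator. This is a toy sketch, not the full visual SLAM front end the patent refers to, and the function names are illustrative.

```python
import math

def estimate_rigid_2d(src, dst):
    """Estimate rotation theta and translation (tx, ty) such that
    R(theta) * src + t best aligns with dst, from matched 2-D points."""
    n = len(src)
    csx = sum(p[0] for p in src) / n; csy = sum(p[1] for p in src) / n
    cdx = sum(p[0] for p in dst) / n; cdy = sum(p[1] for p in dst) / n
    # Cross-covariance terms of the centred point sets.
    s_dot = s_cross = 0.0
    for (x, y), (u, v) in zip(src, dst):
        ax, ay = x - csx, y - csy
        bx, by = u - cdx, v - cdy
        s_dot += ax * bx + ay * by
        s_cross += ax * by - ay * bx
    theta = math.atan2(s_cross, s_dot)
    c, s = math.cos(theta), math.sin(theta)
    # Translation maps the rotated source centroid onto the target centroid.
    tx = cdx - (c * csx - s * csy)
    ty = cdy - (s * csx + c * csy)
    return theta, (tx, ty)
```

A real system works in 3-D from camera projections (e.g. essential-matrix or PnP solvers) and must reject bad matches, but the least-squares alignment idea is the same.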
3) Local map splicing and global map model construction via point cloud splicing and filtering: after the SLAM algorithm handles moving objects in the dynamic scene and the pose graph with poses as variables is optimized, the optimized, globally consistent key-frame poses of the mobile robot are obtained. The point clouds of the key frames are then spliced to obtain a local map of the environment, as shown in fig. 3: the point cloud corresponding to each key frame is transformed into the world coordinate system using that key frame's pose, and the clouds are concatenated into a local and global point cloud map. A filtering technique then removes outliers from the point cloud map, optimizing the map and completing its construction. The mathematical expression is as follows:
in the formula:
Ck-key(s)A single frame point cloud formed by the frame images,
Tkthe pose of the camera at the moment of the frame,
and (4) local point cloud under the m-camera coordinate system.
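The splicing-and-filtering step described above — transform each key frame's cloud by its pose, concatenate, then drop outliers — can be sketched as follows. The 4x4 row-major pose format and the O(n²) radius outlier filter are simplifying assumptions; a real system would use an optimized filter over a spatial index (e.g. PCL's radius outlier removal on a KD-tree).

```python
import math

def transform_point(T, p):
    """Apply a 4x4 homogeneous pose T (row-major nested lists) to a 3-D point."""
    x, y, z = p
    return tuple(T[i][0] * x + T[i][1] * y + T[i][2] * z + T[i][3] for i in range(3))

def stitch_clouds(poses, clouds):
    """Transform each key frame's cloud into the world frame with its pose
    T_k and concatenate, i.e. the union of T_k * C_k over key frames k."""
    world = []
    for T, cloud in zip(poses, clouds):
        world.extend(transform_point(T, p) for p in cloud)
    return world

def radius_filter(points, radius, min_neighbors):
    """Drop outliers: keep only points with at least `min_neighbors`
    other points within `radius` (naive O(n^2) radius outlier filter)."""
    kept = []
    for p in points:
        n = sum(1 for q in points if q is not p and math.dist(p, q) <= radius)
        if n >= min_neighbors:
            kept.append(p)
    return kept
```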
4) Combining the semantic information with the SLAM algorithm adopted by the invention to complete the construction of the semantic map under the dynamic environment: adding semantic information to the acquired point cloud information, identifying the class information of an object in the image by semantic segmentation of the image, fusing the segmentation result of the two-dimensional image into the point cloud, and giving semantic information to each point in the point cloud so as to obtain a semantic point cloud map with rich information; adopting different colors to correspond to objects of a specific category, namely semantic information of scenes; the single frame semantic point generation algorithm is shown in fig. 4.
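Single-frame semantic point generation can be sketched as back-projecting each labelled pixel through a pinhole camera model and attaching the class and its display colour to the resulting 3-D point. The palette, class ids, and camera intrinsics below are illustrative assumptions, not values from the patent.

```python
# Hypothetical palette: class id -> display colour (one colour per category).
PALETTE = {0: (128, 64, 128),   # road
           1: (220, 20, 60),    # person
           2: (0, 0, 142)}      # vehicle

def semantic_points(depth, labels, fx, fy, cx, cy):
    """Back-project each labelled pixel with valid depth into a semantic
    point (x, y, z, class_id, rgb) using the pinhole camera model."""
    points = []
    for v, row in enumerate(depth):
        for u, z in enumerate(row):
            if z <= 0:          # no depth measurement at this pixel
                continue
            x = (u - cx) * z / fx
            y = (v - cy) * z / fy
            cls = labels[v][u]
            points.append((x, y, z, cls, PALETTE.get(cls)))
    return points
```

Fusing these per-frame semantic points into the stitched map yields the information-rich semantic point cloud the text describes.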
5) Decision generation: unmanned-driving decision opinions are generated according to the obtained road condition types and returned to the unmanned control system.
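The decision step can be sketched as a lookup from recognised road-condition type to the decision opinion returned to the control system. The condition names, actions, and fallback behaviour are assumptions for this sketch, since the patent does not enumerate them.

```python
# Illustrative mapping from road-condition class to a decision opinion.
DECISIONS = {
    "clear_road": {"action": "proceed", "speed_limit_kmh": 60},
    "pedestrian": {"action": "yield",   "speed_limit_kmh": 10},
    "obstacle":   {"action": "replan",  "speed_limit_kmh": 0},
}

def decision_for(road_condition):
    """Return the decision opinion for a recognised road condition;
    unknown conditions fall back to a conservative stop."""
    return DECISIONS.get(road_condition, {"action": "stop", "speed_limit_kmh": 0})
```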
Although embodiments of the present invention have been shown and described, it will be appreciated by those skilled in the art that changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.
Claims (4)
1. A road condition identification and decision-making method in an unmanned driving scene is characterized in that: the method comprises the following steps:
step one, the vehicle builds a deep convolutional neural network to train a picture expansion model and obtains the trained model;
step two, the vehicle performs relative positioning of its own position in an unknown environment and captures pictures through a sensor;
step three, the vehicle identifies and classifies the environment by matching the pictures captured by the sensor against the trained model;
step four, the vehicle starts mobile exploration through an unknown-map path exploration algorithm, performing self-positioning by observing the environment while building an incremental map;
step five, the vehicle fuses multi-sensor data to construct and update the map in real time during exploration until the map construction is completed;
step six, the vehicle control system makes a decision according to the sensed information.
2. The method according to claim 1, characterized in that: in the third step, the pose estimation of the visual SLAM algorithm is adopted.
3. The method according to claim 1, characterized in that: in the fourth step, local map splicing and global map model construction are realized through point cloud splicing and filtering technology.
4. The method according to claim 1, characterized in that: in the fifth step, the semantic map in the dynamic environment is constructed by combining semantic information with the SLAM algorithm adopted by the invention.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111058219.3A CN113673484A (en) | 2021-09-09 | 2021-09-09 | Road condition identification and decision-making method in unmanned driving scene |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111058219.3A CN113673484A (en) | 2021-09-09 | 2021-09-09 | Road condition identification and decision-making method in unmanned driving scene |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113673484A true CN113673484A (en) | 2021-11-19 |
Family
ID=78549224
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111058219.3A Pending CN113673484A (en) | 2021-09-09 | 2021-09-09 | Road condition identification and decision-making method in unmanned driving scene |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113673484A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114445593A (en) * | 2022-01-30 | 2022-05-06 | 重庆长安汽车股份有限公司 | Aerial view semantic segmentation label generation method based on multi-frame semantic point cloud splicing |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109117718A (en) * | 2018-07-02 | 2019-01-01 | 东南大学 | A kind of semantic map structuring of three-dimensional towards road scene and storage method |
CN110363816A (en) * | 2019-06-25 | 2019-10-22 | 广东工业大学 | A kind of mobile robot environment semanteme based on deep learning builds drawing method |
CN111368759A (en) * | 2020-03-09 | 2020-07-03 | 河海大学常州校区 | Monocular vision-based semantic map construction system for mobile robot |
US20200364554A1 (en) * | 2018-02-09 | 2020-11-19 | Baidu Usa Llc | Systems and methods for deep localization and segmentation with a 3d semantic map |
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20200364554A1 (en) * | 2018-02-09 | 2020-11-19 | Baidu Usa Llc | Systems and methods for deep localization and segmentation with a 3d semantic map |
CN109117718A (en) * | 2018-07-02 | 2019-01-01 | 东南大学 | A kind of semantic map structuring of three-dimensional towards road scene and storage method |
CN110363816A (en) * | 2019-06-25 | 2019-10-22 | 广东工业大学 | A kind of mobile robot environment semanteme based on deep learning builds drawing method |
CN111368759A (en) * | 2020-03-09 | 2020-07-03 | 河海大学常州校区 | Monocular vision-based semantic map construction system for mobile robot |
Non-Patent Citations (1)
Title |
---|
HE Song et al., "Semantic map construction based on laser SLAM and deep learning" (基于激光SLAM和深度学习的语义地图构建), Computer Technology and Development (《计算机技术与发展》), vol. 30, no. 9, pages 88-94 *
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114445593A (en) * | 2022-01-30 | 2022-05-06 | 重庆长安汽车股份有限公司 | Aerial view semantic segmentation label generation method based on multi-frame semantic point cloud splicing |
CN114445593B (en) * | 2022-01-30 | 2024-05-10 | 重庆长安汽车股份有限公司 | Bird's eye view semantic segmentation label generation method based on multi-frame semantic point cloud splicing |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Toft et al. | Long-term visual localization revisited | |
Sahu et al. | Artificial intelligence (AI) in augmented reality (AR)-assisted manufacturing applications: a review | |
US10854006B2 (en) | AR-enabled labeling using aligned CAD models | |
CN110135249B (en) | Human behavior identification method based on time attention mechanism and LSTM (least Square TM) | |
CN110119148B (en) | Six-degree-of-freedom attitude estimation method and device and computer readable storage medium | |
CN109815847B (en) | Visual SLAM method based on semantic constraint | |
Liu et al. | RGB-D inertial odometry for a resource-restricted robot in dynamic environments | |
CN107967457A (en) | A kind of place identification for adapting to visual signature change and relative positioning method and system | |
Wen et al. | Hybrid semi-dense 3D semantic-topological mapping from stereo visual-inertial odometry SLAM with loop closure detection | |
Li et al. | Overview of deep learning application on visual SLAM | |
WO2019175532A1 (en) | Urban environment labelling | |
Cheng et al. | Improving dense mapping for mobile robots in dynamic environments based on semantic information | |
Liu et al. | D-lc-nets: Robust denoising and loop closing networks for lidar slam in complicated circumstances with noisy point clouds | |
Civera et al. | Rgb-d odometry and slam | |
Li et al. | Multi-modal neural feature fusion for automatic driving through perception-aware path planning | |
Dai et al. | A review of common techniques for visual simultaneous localization and mapping | |
CN113673484A (en) | Road condition identification and decision-making method in unmanned driving scene | |
CN114399515A (en) | Language description-based class-level target object 6D pose acquisition method and storage medium | |
CN117788544A (en) | Image depth estimation method based on lightweight attention mechanism | |
Zhang et al. | Diff-tracker: Text-to-image diffusion models are unsupervised trackers | |
CN112348854A (en) | Visual inertial mileage detection method based on deep learning | |
Liedtke et al. | Use of explicit knowledge for the reconstruction of 3-D object geometry | |
CN116721139A (en) | Generating depth images of image data | |
CN114943785A (en) | Map construction method, map construction device, map construction equipment and storage medium | |
Liu et al. | An efficient edge-feature constraint visual SLAM |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||