CN115661796A

CN115661796A - Guideboard identification method and device and vehicle

Info

Publication number: CN115661796A
Application number: CN202211394445.3A
Authority: CN
Inventors: 孟鹏飞; 贾双成; 朱磊; 郭杏荣
Original assignee: Zhidao Network Technology Beijing Co Ltd
Current assignee: Zhidao Network Technology Beijing Co Ltd
Priority date: 2022-11-08
Filing date: 2022-11-08
Publication date: 2023-01-31

Abstract

The invention provides a method, a device and a vehicle for identifying a guideboard, wherein the method comprises the following steps: acquiring an image to be identified, wherein the image to be identified carries the shielded guideboard elements; and inputting the image to be recognized into a recognition model, and obtaining the guideboard corner point category which is output by the recognition model and corresponds to the image to be recognized. According to the method, the device and the vehicle for identifying the guideboard, provided by the invention, based on the fact that the image carrying the shielded guideboard element is used as the input of the identification model, the output result is that the angular points and the angular point categories thereof contained in the shielded part and the shielded part in the corresponding image are identified and calculated automatically through the multi-layer neural network, the logicality and the relevance in the picture containing the incomplete information are fully mined, the accurate identification of the shielded guideboard is realized, the fineness and the accuracy of the guideboard identification can be improved, and the generation efficiency of a high-precision map is further improved.

Description

Road sign recognition method, device and vehicle

技术领域technical field

本发明涉及自动驾驶技术领域，尤其涉及一种路牌的识别方法、装置及车辆。The invention relates to the technical field of automatic driving, in particular to a road sign recognition method, device and vehicle.

背景技术Background technique

随着人工智能、自动驾驶等技术的发展，构建智慧交通也成为了研究热点，而高精地图是智慧交通数据构建中必不可少的部分。高精地图中可以包含多种交通标识，例如能够通过详细的地图来表达现实世界中诸如车道线、行车停止线、人行横道线等地面特征要素以及路牌、红绿灯等高空特征要素，以便为自动驾驶等应用场景时的导航提供数据支撑。With the development of technologies such as artificial intelligence and autonomous driving, the construction of intelligent transportation has become a research hotspot, and high-precision maps are an indispensable part of the construction of intelligent transportation data. High-precision maps can contain a variety of traffic signs. For example, ground features such as lane lines, stop lines, and pedestrian crossing lines in the real world can be expressed through detailed maps, as well as high-altitude feature elements such as road signs and traffic lights, so as to provide information for autonomous driving, etc. Navigation in application scenarios provides data support.

交通标识中路牌作为城市地理实体的信息承载载体，具备地名、路线、距离和方向等信息导航功能，同时作为分布于城市道路交叉口的基础设施，在空间上具有其特殊性，是城市基础物联网的良好载体。正确且高效地进行交通路牌的生成工作，对于高精地图的绘制十分关键。As the information carrier of urban geographical entities, road signs in traffic signs have information navigation functions such as place names, routes, distances and directions. Good carrier for networking. Correct and efficient generation of traffic signs is critical to the drawing of high-precision maps.

然而，当路牌被遮挡时，由于不能从车辆的相机所获取得到的图像中获取足够的可用信息，所以路牌不能够被识别，进而影响了高精地图绘制的效率。However, when the street signs are blocked, the street signs cannot be recognized because sufficient information cannot be obtained from the images acquired by the vehicle's camera, which affects the efficiency of high-precision map drawing.

发明内容Contents of the invention

本发明提供一种路牌的识别方法、装置及车辆，用以解决现有技术中不能够识别被遮挡住的路牌的缺陷。The invention provides a street sign recognition method, device and vehicle to solve the defect that the blocked street sign cannot be recognized in the prior art.

本发明提供一种路牌的识别方法，包括：The invention provides a road sign recognition method, comprising:

获取待识别图像，所述待识别图像携带有被遮挡住的路牌元素；Acquiring the image to be recognized, the image to be recognized carries the blocked street sign element;

将所述待识别图像输入至识别模型，获得所述识别模型输出的与所述待识别图像对应的路牌角点类别；inputting the image to be recognized into a recognition model, and obtaining the street sign corner category output by the recognition model corresponding to the image to be recognized;

其中，所述识别模型是基于样本路牌图像，以及与所述样本路牌图像对应标注的类别标签得到的；所述识别模型包括特征提取层、通道分离层和类别识别层；Wherein, the recognition model is obtained based on a sample street sign image and a category label corresponding to the sample street sign image; the recognition model includes a feature extraction layer, a channel separation layer, and a category recognition layer;

所述将所述待识别图像输入至识别模型，获得所述识别模型输出的与所述待识别图像对应的路牌角点类别，具体包括：The step of inputting the image to be recognized into the recognition model and obtaining the street sign corner category output by the recognition model corresponding to the image to be recognized specifically includes:

将所述待识别图像输入至所述特征提取层，获取所述特征提取层输出的融合特征图像；Inputting the image to be identified into the feature extraction layer, and obtaining a fusion feature image output by the feature extraction layer;

将所述融合特征图像输入至所述通道分离层，获取所述通道分离层输出的通道特征图像；input the fusion feature image to the channel separation layer, and obtain the channel feature image output by the channel separation layer;

将所述通道特征图像输入至所述类别识别层，获取所述类别识别层输出的所述路牌角点类别。The channel feature image is input to the category recognition layer, and the street sign corner category output by the category recognition layer is obtained.

根据本发明提供的一种路牌的识别方法，所述将所述待识别图像输入至所述特征提取层，获取所述特征提取层输出的融合特征图像，包括：According to a street sign recognition method provided by the present invention, the inputting the image to be recognized into the feature extraction layer and obtaining the fused feature image output by the feature extraction layer includes:

将所述待识别图像，进行不同尺度的下采样操作和卷积操作，获取不同尺度的特征图像；Performing downsampling operations and convolution operations of different scales on the image to be identified to obtain feature images of different scales;

基于各尺度的特征图像，进行特征融合，获取所述融合特征图像；performing feature fusion based on the feature images of each scale, and obtaining the fused feature image;

其中，每进行一次特征融合后会进行一次多尺度卷积后再进行下一次特征融合。Among them, after each feature fusion, a multi-scale convolution is performed before the next feature fusion.

根据本发明提供的一种路牌的识别方法，所述将所述融合特征图像输入至所述通道分离层，获取所述通道分离层输出的通道特征图像，包括：According to a street sign recognition method provided by the present invention, the input of the fusion feature image to the channel separation layer, and the acquisition of the channel feature image output by the channel separation layer include:

对所述融合特征图像进行卷积计算，获取各通道对应的通道特征图像；Carrying out convolution calculation on the fusion feature image to obtain the channel feature image corresponding to each channel;

其中，所述通道的数量是根据完整的路牌元素所包含的路牌角点数量确定的。Wherein, the number of passages is determined according to the number of street sign corners contained in a complete street sign element.

根据本发明提供的一种路牌的识别方法，所述识别模型是基于样本路牌图像，以及与所述样本路牌图像对应标注的类别标签和角点坐标信息得到的；According to a street sign recognition method provided by the present invention, the recognition model is obtained based on a sample street sign image, and category labels and corner point coordinate information corresponding to the sample street sign image;

所述将所述通道特征图像输入至所述类别识别层，获取所述类别识别层输出的所述路牌角点类别，具体包括：The step of inputting the channel feature image into the category identification layer, and obtaining the street sign corner category output by the category identification layer specifically includes:

对各通道对应的通道特征图像，进行类别的识别抽取，获取各通道对应的类别概率集合；For the channel feature image corresponding to each channel, perform category identification and extraction, and obtain the category probability set corresponding to each channel;

利用所述通道对应的类别概率集合，确定所述通道特征图像所对应的路牌角点类别，以及路牌角点坐标。Using the class probability set corresponding to the channel, the street sign corner category corresponding to the channel feature image and the street sign corner point coordinates are determined.

根据本发明提供的一种路牌的识别方法，所述获取待识别图像，包括：According to a recognition method of street signs provided by the present invention, the acquisition of an image to be recognized includes:

在确定所述待识别图像未携带有被遮挡住的路牌元素的情况下，基于所述待识别图像中的目标角点，截取非角点区域图像；In the case where it is determined that the image to be recognized does not carry a blocked street sign element, based on the target corner point in the image to be recognized, intercept the non-corner area image;

将所述非角点区域图像覆盖在所述目标角点对应的区域，生成新的待识别图像；Overlay the non-corner area image on the area corresponding to the target corner to generate a new image to be recognized;

将所述新的待识别图像输入至所述识别模型，获得所述识别模型输出的与所述新的待识别图像对应的路牌角点类别。The new image to be recognized is input into the recognition model, and the street sign corner category corresponding to the new image to be recognized output by the recognition model is obtained.

本发明还提供一种路牌的识别装置，包括：The present invention also provides a street sign recognition device, comprising:

图像获取模块，用于获取待识别图像，所述待识别图像携带有被遮挡住的路牌元素；An image acquisition module, configured to acquire an image to be recognized, the image to be recognized carries a blocked street sign element;

角点识别模块，用于将所述待识别图像输入至识别模型，获得所述识别模型输出的与所述待识别图像对应的路牌角点类别；A corner point recognition module, configured to input the image to be recognized into a recognition model, and obtain the street sign corner category output by the recognition model corresponding to the image to be recognized;

所述角点识别模块，具体包括特征提取单元、通道分离单元和类别识别单元，其中：The corner recognition module specifically includes a feature extraction unit, a channel separation unit and a category recognition unit, wherein:

所述特征提取单元，用于将所述待识别图像输入至所述特征提取层，获取所述特征提取层输出的融合特征图像；The feature extraction unit is configured to input the image to be recognized to the feature extraction layer, and obtain a fusion feature image output by the feature extraction layer;

所述通道分离单元，用于将所述融合特征图像输入至所述通道分离层，获取所述通道分离层输出的通道特征图像；The channel separation unit is configured to input the fusion feature image to the channel separation layer, and obtain the channel feature image output by the channel separation layer;

所述类别识别单元，用于将所述通道特征图像输入至所述类别识别层，获取所述类别识别层输出的所述路牌角点类别。The category identification unit is configured to input the channel feature image to the category identification layer, and obtain the street sign corner category output by the category identification layer.

本发明还提供一种车辆，包括车辆本体，还包括设置在所述车辆本体的识别装置，所述识别装置用于执行如上任一所述的路牌的识别方法。The present invention also provides a vehicle, including a vehicle body, and a recognition device disposed on the vehicle body, the recognition device being used to execute the street sign recognition method as described above.

本发明还提供一种电子设备，包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序，所述处理器执行所述程序时实现如上述任一种所述路牌的识别方法。The present invention also provides an electronic device, which includes a memory, a processor, and a computer program stored on the memory and operable on the processor. When the processor executes the program, it realizes the identification of any road sign described above. method.

本发明还提供一种非暂态计算机可读存储介质，其上存储有计算机程序，该计算机程序被处理器执行时实现如上述任一种所述路牌的识别方法。The present invention also provides a non-transitory computer-readable storage medium, on which a computer program is stored, and when the computer program is executed by a processor, the identification method of any one of the above-mentioned street signs is realized.

本发明还提供一种计算机程序产品，包括计算机程序，所述计算机程序被处理器执行时实现如上述任一种所述路牌的识别方法。The present invention also provides a computer program product, including a computer program. When the computer program is executed by a processor, any one of the street sign recognition methods described above is realized.

本发明提供的路牌的识别方法、装置及车辆，基于携带有被遮挡住的路牌元素的图像作为识别模型的输入，输出的结果为对应图像中被遮挡部分和遮挡部分中所包含的角点及其角点类别，通过多层的神经网络对不完整的图像信息自动识别计算，实现被遮挡路牌的准确识别，能提高路牌识别的精细性和准确性，进而提高高精度地图的生成效率。The road sign recognition method, device and vehicle provided by the present invention are based on the image carrying the occluded street sign elements as the input of the recognition model, and the output result is the corner points and the occluded parts contained in the corresponding image. Its corner point category automatically recognizes and calculates incomplete image information through a multi-layer neural network to realize accurate recognition of blocked street signs, which can improve the fineness and accuracy of street sign recognition, thereby improving the generation efficiency of high-precision maps.

附图说明Description of drawings

为了更清楚地说明本发明或现有技术中的技术方案，下面将对实施例或现有技术描述中所需要使用的附图作一简单地介绍，显而易见地，下面描述中的附图是本发明的一些实施例，对于本领域普通技术人员来讲，在不付出创造性劳动的前提下，还可以根据这些附图获得其他的附图。In order to more clearly illustrate the present invention or the technical solutions in the prior art, the accompanying drawings that need to be used in the description of the embodiments or the prior art will be briefly introduced below. Obviously, the accompanying drawings in the following description are the present invention. For some embodiments of the invention, those skilled in the art can also obtain other drawings based on these drawings without creative effort.

图1是本发明提供的路牌的识别方法的流程示意图之一；Fig. 1 is one of the schematic flow charts of the recognition method of street signs provided by the present invention;

图2是本发明提供的路牌的识别方法的流程示意图之二；Fig. 2 is the second schematic flow diagram of the identification method of street signs provided by the present invention;

图3是本发明提供的路牌的识别装置的结构示意图；Fig. 3 is a structural schematic diagram of a street sign recognition device provided by the present invention;

图4是本发明提供的车辆的结构示意图；Fig. 4 is a schematic structural view of a vehicle provided by the present invention;

图5是本发明提供的电子设备的结构示意图。Fig. 5 is a schematic structural diagram of an electronic device provided by the present invention.

具体实施方式Detailed ways

为使本发明的目的、技术方案和优点更加清楚，下面将结合本发明中的附图，对本发明中的技术方案进行清楚、完整地描述，显然，所描述的实施例是本发明一部分实施例，而不是全部的实施例。基于本发明中的实施例，本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例，都属于本发明保护的范围。In order to make the purpose, technical solutions and advantages of the present invention clearer, the technical solutions in the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the present invention. Obviously, the described embodiments are part of the embodiments of the present invention , but not all examples. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

本申请的说明书中的术语“第一”、“第二”等是用于区别类似的对象，而不用于描述特定的顺序或先后次序。应该理解这样使用的数据在适当情况下可以互换，以便本申请的实施例能够以除了在这里图示或描述的那些以外的顺序实施，且“第一”、“第二”等所区分的对象通常为一类，并不限定对象的个数，例如第一对象可以是一个，也可以是多个。The terms "first", "second" and the like in the description of the present application are used to distinguish similar objects, and are not used to describe a specific order or sequence. It should be understood that the terms so used are interchangeable under appropriate circumstances such that the embodiments of the application can be practiced in sequences other than those illustrated or described herein, and that references to "first," "second," etc. distinguish Objects are generally of one type, and the number of objects is not limited. For example, there may be one or more first objects.

应当理解，在本发明说明书中所使用的术语仅仅是出于描述特定实施例的目的而并不意在限制本发明。如在本发明说明书中所使用的那样，除非上下文清楚地指明其它情况，否则单数形式的“一”、“一个”及“该”意在包括复数形式。It should be understood that the terminology used in the description of the present invention is for the purpose of describing particular embodiments only and is not intended to limit the present invention. As used in the present specification, the singular forms "a", "an" and "the" are intended to include the plural forms unless the context clearly dictates otherwise.

术语“包括”和“包含”指示所描述特征、整体、步骤、操作、元素和/或组件的存在，但并不排除一个或多个其它特征、整体、步骤、操作、元素、组件和/或其集合的存在或添加。The terms "comprising" and "comprising" indicate the presence of described features, integers, steps, operations, elements and/or components, but do not exclude the presence of one or more other features, integers, steps, operations, elements, components and/or The presence or addition of its collection.

图1是本发明提供的路牌的识别方法的流程示意图之一。如图1所示，本发明实施例提供的路牌的识别方法，包括：步骤101、获取待识别图像，所述待识别图像携带有被遮挡住的路牌元素。Fig. 1 is one of the schematic flowcharts of the street sign recognition method provided by the present invention. As shown in FIG. 1 , the street sign recognition method provided by the embodiment of the present invention includes: Step 101 , acquiring an image to be recognized, and the image to be recognized carries a blocked street sign element.

需要说明的是，本发明实施例提供的路牌的识别方法的执行主体是路牌的识别装置。路牌的识别装置可以是车辆内置的中央处理器(Central Processing Unit，CPU)，或者是基于CPU集成的开发板，以进行信息处理和程序运行。It should be noted that, the road sign recognition method provided by the embodiment of the present invention is executed by a street sign recognition device. The street sign recognition device may be a central processing unit (Central Processing Unit, CPU) built into the vehicle, or a development board integrated based on the CPU for information processing and program operation.

本发明实施例提供的路牌的识别方法的应用场景为，对含有被遮挡路牌的图像进行识别，得到该路牌中未被遮挡部分和遮挡部分所包含的所有角点，及其角点的类别。The application scenario of the street sign recognition method provided by the embodiment of the present invention is to identify an image containing an occluded street sign, and obtain all corner points contained in the unoccluded part and the occluded part of the street sign, and the types of the corner points.

具体地，在步骤101中，路牌的识别装置实时接收由安装在车辆的摄像头所拍摄的图片，以作为待识别图片，并且，待识别图片携带有被遮挡住的路牌元素。Specifically, in step 101, the street sign recognition device receives in real time a picture taken by a camera installed in a vehicle as a picture to be recognized, and the picture to be recognized carries a blocked street sign element.

路牌元素，是在车辆行驶过程中由相机捕获到的属于交通标识的图形元素。The street sign element is a graphic element belonging to the traffic sign captured by the camera during the driving process of the vehicle.

其中，待识别图像内携带有路牌元素并不局限为一种，可以为多种交通标识对应的路牌。交通标识所对应的路牌通常为规则图形，本发明实施例对此不做具体限定。Wherein, the street sign element carried in the image to be recognized is not limited to one kind, and may be street signs corresponding to various traffic signs. The street sign corresponding to the traffic sign is usually a regular figure, which is not specifically limited in the embodiment of the present invention.

示例性地，交通标识所对应的路牌形状可以为方形，方形路牌具有四个直角点，每一方位上的直角点即为对应类别的路牌角点，继而在识别过程中，可以通过对被遮挡住的路牌元素的识别，得到路牌元素中的角点类别。Exemplarily, the shape of the street sign corresponding to the traffic sign can be square, and the square street sign has four right-angle points, and the right-angle points on each orientation are the street sign corner points of the corresponding category, and then in the recognition process, it can be blocked by The identification of the street sign element to get the corner point category in the street sign element.

步骤102、将所述待识别图像输入至识别模型，获得所述识别模型输出的与所述待识别图像对应的路牌角点类别。Step 102: Input the image to be recognized into a recognition model, and obtain the street sign corner category output by the recognition model corresponding to the image to be recognized.

其中，所述识别模型是基于样本路牌图像，以及与所述样本路牌图像对应标注的类别标签得到的。识别模型包括特征提取层、通道分离层和类别识别层。Wherein, the recognition model is obtained based on a sample street sign image and a category label corresponding to the sample street sign image. The recognition model includes feature extraction layer, channel separation layer and category recognition layer.

将所述待识别图像输入至所述特征提取层，获取所述特征提取层输出的融合特征图像。The image to be recognized is input to the feature extraction layer, and a fused feature image output by the feature extraction layer is obtained.

将所述融合特征图像输入至所述通道分离层，获取所述通道分离层输出的通道特征图像。The fusion feature image is input to the channel separation layer, and the channel feature image output by the channel separation layer is obtained.

需要说明的是，识别模型可以是一种神经网络模型，神经网络的结构和参数包括但不限于神经网络的输入层，隐含层和输出层的层数，以及每一层的权重参数等。本发明实施例对神经网络的种类和结构不作具体限定。It should be noted that the recognition model may be a neural network model, and the structure and parameters of the neural network include but are not limited to the input layer, hidden layer and output layer of the neural network, and the weight parameters of each layer. The embodiment of the present invention does not specifically limit the type and structure of the neural network.

例如，识别模型可以是一种前馈神经网络，该模型由输入层、隐藏层和输出层构成，其中：For example, a recognition model could be a feed-forward neural network consisting of an input layer, a hidden layer, and an output layer, where:

输入层在整个网络的最前端部分，直接接收携带有被遮挡住的路牌元素的图像数据。The input layer is at the front end of the entire network, and directly receives the image data carrying the occluded street sign elements.

隐藏层可以有一层或多层，通过自身的神经元对输入向量以加权求和的方式来进行运算，计算公式可以表达为：The hidden layer can have one or more layers, and its own neurons perform weighted summation on the input vector. The calculation formula can be expressed as:

z＝b+w1*x1+w2*x2+…+wm*xmz=b+w1*x1+w2*x2+...+wm*xm

其中，z是隐藏层输出的权重加和值，x1、x2、x3……xm是每个样本的m个特征向量，b为偏置，w1、w2……wm为每个特征向量对应的权重。Among them, z is the sum of the weights output by the hidden layer, x1, x2, x3...xm are the m feature vectors of each sample, b is the bias, w1, w2...wm are the weights corresponding to each feature vector .

输出层是最后一层，用来输出角点类别的识别结果，根据不同的需求输出识别结果的类型，这个值可以是一个类别向量值，也可以是一个类似线性回归那样产生的连续的值，还可以是别的复杂类型的值或者向量，本发明实施例对此不作具体限定。The output layer is the last layer, which is used to output the recognition result of the corner point category, and output the type of the recognition result according to different requirements. This value can be a category vector value, or a continuous value similar to linear regression. It may also be other complex types of values or vectors, which are not specifically limited in this embodiment of the present invention.

激励函数是在人工神经网络的神经元上运行的函数，负责将神经元的输入映射到输出端，采用激活函数进行逻辑回归处理，即将隐藏层输出的权重加和值转换为非线性的识别结果，本发明实施例对激活函数的种类不作具体限定。The activation function is a function that runs on the neurons of the artificial neural network. It is responsible for mapping the input of the neurons to the output. The activation function is used for logistic regression processing, that is, the sum of the weights output by the hidden layer is converted into a non-linear recognition result. , the embodiment of the present invention does not specifically limit the type of activation function.

优选地，采用Softmax函数进行逻辑回归处理，将多个神经元的输出，映射到(0,1)区间内，可以看成概率来理解，从而来进行多分类。Preferably, the Softmax function is used for logistic regression processing, and the outputs of multiple neurons are mapped to the (0,1) interval, which can be understood as a probability, so as to perform multi-classification.

需要说明的是，样本数据包含与样本数据对应的样本路牌图像，以及在样本路牌图像中对角点标注的类别标签。将样本数据按照一定比例，划分为训练集和测试集。It should be noted that the sample data includes a sample street sign image corresponding to the sample data, and category labels marked on diagonal points in the sample street sign image. Divide the sample data into a training set and a test set according to a certain ratio.

示例性地，训练集和测试集在样本数据中的占比包括但不限于9：1、8：2等，本发明实施例对此不作具体限定。Exemplarily, the proportion of the training set and the test set in the sample data includes but is not limited to 9:1, 8:2, etc., which is not specifically limited in this embodiment of the present invention.

具体地，在步骤102中，路牌的识别装置对构建好的识别模型各层间的权值系数初始化，再将训练集中的一组样本问题数据和样本答案数据的标注内容输入到当前权值系数下的神经网络，依次计算输入层、隐藏层和输出层的各节点的输出。输出层最后的输出结果与其实际连接位置状态类型之间的累积误差，根据梯度下降法，修正输入层与隐藏层各节点间的权值系数。依照上述过程，直至遍历训练集中的所有样本，可以得到输入层与隐藏层的权值系数。Specifically, in step 102, the road sign recognition device initializes the weight coefficients between the layers of the constructed recognition model, and then inputs a set of sample question data and sample answer data in the training set into the current weight coefficient The neural network below calculates the output of each node in the input layer, hidden layer and output layer in turn. According to the cumulative error between the final output result of the output layer and the actual connection position state type, the weight coefficient between each node of the input layer and the hidden layer is corrected according to the gradient descent method. According to the above process, until all the samples in the training set are traversed, the weight coefficients of the input layer and the hidden layer can be obtained.

路牌的识别装置根据神经网络输入层与隐藏层的权值系数，还原步骤102中的识别模型，并将测试集中的每一待识别图像输入到训练好的识别模型，可以得到该图像对应的识别结果。The street sign recognition device restores the recognition model in step 102 according to the weight coefficients of the neural network input layer and the hidden layer, and inputs each image to be recognized in the test set into the trained recognition model to obtain the corresponding recognition of the image. result.

识别结果可以是一个概率值或者标签结果，本发明实施例对行为识别结果的形式不作具体限定。The recognition result may be a probability value or a label result, and the embodiment of the present invention does not specifically limit the form of the behavior recognition result.

若识别结果可以是一个概率值，则可以通过概率值说明待识别图像中所包含的角点分别属于各方位所对应的角点类别的概率。If the recognition result can be a probability value, the probability value can be used to describe the probability that the corner points included in the image to be recognized belong to the corner point categories corresponding to each orientation.

若识别结果可以是一个标签结果，则可以通过模型获取一个中间数值结果，将数值结果符合预先设置的目标条件，则为该数值结果对应的角点分配对应方位的角点类别标签。If the recognition result can be a label result, an intermediate numerical result can be obtained through the model, and the numerical result meets the preset target conditions, and the corner point corresponding to the numerical result is assigned the corner point category label of the corresponding orientation.

优选地，路牌的识别装置中内置的识别模型由输入层、隐藏层和输出层组成。隐藏层的作用是通过自身的神经元，对输入的具有不完整信息的图像进行特征提取，抽取出有利于识别的特征信息。Preferably, the recognition model built in the street sign recognition device consists of an input layer, a hidden layer and an output layer. The role of the hidden layer is to extract the features of the input image with incomplete information through its own neurons, and extract the feature information that is beneficial to recognition.

本发明实施例对隐藏层的结构不作具体限定。The embodiment of the present invention does not specifically limit the structure of the hidden layer.

优选地，隐藏层至少包含三层，分别是特征提取层、通道分离层和类别识别层，其中：Preferably, the hidden layer includes at least three layers, which are feature extraction layer, channel separation layer and category recognition layer, wherein:

特征提取层可以采用卷积神经网络(Convolutional Neural Network，CNN)对待识别图像进行降维，压缩向量的同时进行特征提取和融合，得到融合特征图像。The feature extraction layer can use Convolutional Neural Network (CNN) to reduce the dimension of the image to be recognized, compress the vector and perform feature extraction and fusion at the same time to obtain the fusion feature image.

其中，在卷积过程中，也可以采用空洞卷积或者扩张卷积在不进行pooling损失信息的情况下，加大了感受野，让每个卷积输出都包含较大范围的信息。Among them, in the convolution process, dilated convolution or dilated convolution can also be used to increase the receptive field without performing pooling loss information, so that each convolution output contains a larger range of information.

通道分离层可以通过路牌所包含的角点数量设置卷积核的通道数，对融合特征图像再次进行卷积计算，得到各通道对应的二维通道特征图像。The channel separation layer can set the number of channels of the convolution kernel according to the number of corner points contained in the street signs, and perform convolution calculation on the fusion feature image again to obtain the two-dimensional channel feature image corresponding to each channel.

类别识别层可以将各通道对应的二维通道特征图像依次采用全连接处理和Softmax处理将其映射成二维向量，并根据这个二维向量进行分类处理，得到路牌角点类别，即可获知待识别图像中包含的所有角点，以及对应的角点类别。The category recognition layer can map the two-dimensional channel feature images corresponding to each channel into two-dimensional vectors by using full connection processing and Softmax processing in turn, and classify them according to the two-dimensional vectors to obtain the street sign corner category, and then we can know the Identify all corner points contained in the image, and the corresponding corner class.

本发明实施例基于携带有被遮挡住的路牌元素的图像作为识别模型的输入，通过特征提取层对将待识别图像进行特征提取，通过通道分离层对特征提取层输出的融合特征图像进行通道分离后，再经由类别识别层将各通道的通道特征图像进行全连接处理，输出对应图像中被遮挡部分和遮挡部分中所包含的角点及其角点类别，通过多层的神经网络对不完整的图像信息自动识别计算，充分挖掘包含不完整信息的图片中的逻辑性和关联性，实现被遮挡路牌的准确识别，能提高路牌识别的精细性和准确性，进而提高高精度地图的生成效率。In the embodiment of the present invention, based on the image carrying the occluded street sign elements as the input of the recognition model, feature extraction is performed on the image to be recognized through the feature extraction layer, and channel separation is performed on the fused feature image output by the feature extraction layer through the channel separation layer After that, the channel feature images of each channel are fully connected through the category recognition layer, and the corner points and their corner point categories contained in the occluded part and occluded part of the corresponding image are output. The automatic recognition and calculation of image information fully excavates the logic and relevance of pictures containing incomplete information, realizes the accurate recognition of blocked street signs, improves the fineness and accuracy of street sign recognition, and then improves the generation efficiency of high-precision maps.

在上述任一实施例的基础上，将所述待识别图像输入至所述特征提取层，获取所述特征提取层输出的融合特征图像，包括：将所述待识别图像，进行不同尺度的下采样操作和卷积操作，获取不同尺度的特征图像。On the basis of any of the above embodiments, inputting the image to be recognized into the feature extraction layer, and obtaining the fused feature image output by the feature extraction layer includes: downscaling the image to be recognized at different scales Sampling operation and convolution operation to obtain feature images of different scales.

基于各尺度的特征图像，进行特征融合，获取所述融合特征图像。Based on the feature images of each scale, feature fusion is performed to obtain the fused feature image.

需要说明的是，路牌的识别装置利用大量卷积核从遥感图像所对应的矩阵中提取高维特征图，可以提取出多张特征图，每张特征图是从图片中提取出来的局部感知，综合这些特征图可以提取出图片中感兴趣的部分。It should be noted that the road sign recognition device uses a large number of convolution kernels to extract high-dimensional feature maps from the matrix corresponding to the remote sensing image, and can extract multiple feature maps. Each feature map is a local perception extracted from the picture. Combining these feature maps can extract the interesting parts of the picture.

不同大小的卷积核对不同种类的路牌原始的提取效果不同，本发明实施例进行多尺度下采样的卷积核数量和尺寸不做具体限定。Convolution kernels of different sizes have different effects on the original extraction of different types of road signs, and the number and size of convolution kernels for multi-scale downsampling in the embodiment of the present invention are not specifically limited.

其中，采用小卷积核1*1可以提取识别小路牌的相关特征数据。采用中卷积核3*3可以提取识别中路牌的相关特征信息。采用大卷积核5*5可以提取识别大路牌的相关特征信息。继而在融合不同尺度的特征数据时，就能够保留带有三种不同规格路牌的相关信息，从而防止原始有效数据的丢失。Among them, the use of a small convolution kernel 1*1 can extract relevant feature data for identifying small road signs. Using the middle convolution kernel 3*3 can extract the relevant feature information for identifying middle road signs. Using a large convolution kernel 5*5 can extract the relevant feature information for identifying road signs. Then, when the feature data of different scales are fused, the relevant information of street signs with three different specifications can be retained, thereby preventing the loss of original valid data.

需要注意的是，每次融合后还需要进行卷积计算，以防止引入噪声。It should be noted that convolution calculation is required after each fusion to prevent the introduction of noise.

具体地，路牌的识别装置将待识别图像输入至级联的卷积层中，先同时进入至尺寸不同的卷积核进行卷积计算，分别得到不同维度的特征图像，通过将本次卷积得到的多个维度的特征图像进行线性相加，获取融合特征图像后，再同时进入到下一级尺寸的卷积核重复上述过程，不断进行特征提取和特征融合过程，更新融合特征图像。Specifically, the road sign recognition device inputs the image to be recognized into the cascaded convolutional layer, and first enters the convolution cores of different sizes to perform convolution calculations to obtain feature images of different dimensions respectively. By convolving this time The obtained multi-dimensional feature images are linearly added, and after the fusion feature image is obtained, it enters the next-level convolution kernel at the same time to repeat the above process, continuously performs feature extraction and feature fusion processes, and updates the fusion feature image.

示例性地，可以依次设置5个大小为3*3，且通道数为128、64、32、16和8的卷积核，相应地，再对应设置5个大小为5*5，且通道数为128、64、32、16和8的卷积核进行卷积处理，使得图像经过每一次卷积处理后特征图的高度和宽度都会缩短一半，通道数减少一半，执行五次卷积和线性相加之后，可以得到一个通道数为8的融合特征图像。Exemplarily, five convolution kernels with a size of 3*3 and a number of channels of 128, 64, 32, 16, and 8 can be set in sequence, and correspondingly, five convolution kernels with a size of 5*5 and a number of channels can be set correspondingly Perform convolution processing for convolution kernels of 128, 64, 32, 16, and 8, so that the height and width of the feature map of the image will be reduced by half after each convolution processing, and the number of channels will be reduced by half, performing five convolutions and linear After the addition, a fusion feature image with 8 channels can be obtained.

需要说明的是，为了能够让多个尺度的特征图像进行线性相加，需要采用填充Padding的方式保证卷积计算出来的特征图像的行列数相同。It should be noted that in order to linearly add feature images of multiple scales, it is necessary to use padding to ensure that the number of rows and columns of feature images calculated by convolution is the same.

本发明实施例基于对待识别图像进行不同尺度的卷积计算，将卷积得到的多尺度的特征图像进行多次的融合和卷积降维后，获取融合特征图像。能够使图像同时包含低层级的细节特征和高层级的语义特征，能提高图像语义分割的精细性，进而提高图像识别的准确性。The embodiment of the present invention is based on performing convolution calculations of different scales on the image to be recognized, and performing multiple fusion and convolution dimensionality reduction on multi-scale feature images obtained by convolution to obtain the fused feature image. It can make the image contain low-level detail features and high-level semantic features at the same time, which can improve the fineness of image semantic segmentation, and then improve the accuracy of image recognition.

在上述任一实施例的基础上，将所述融合特征图像输入至所述通道分离层，获取所述通道分离层输出的通道特征图像，包括：对所述融合特征图像进行卷积计算，获取各通道对应的通道特征图像。On the basis of any of the above embodiments, inputting the fused feature image to the channel separation layer, and obtaining the channel feature image output by the channel separation layer includes: performing convolution calculation on the fused feature image, obtaining The channel feature image corresponding to each channel.

需要说明的是，在步骤102之前，需要根据不同规格的路牌元素所包含的角点数量，设置通道的数量，以使得每一通道对应每一个角点对应的特征。It should be noted that before step 102, the number of channels needs to be set according to the number of corner points included in street sign elements of different specifications, so that each channel corresponds to the feature corresponding to each corner point.

具体地，路牌的识别装置对融合特征图像先后采用一个通道数为4的卷积核进行卷积计算，得到各通道的特征数据，再采用尺寸为1x1的卷积核进行全连接降维，得到宽高尺寸和原图一样的多个通道的通道特征图像。Specifically, the road sign recognition device uses a convolution kernel with a channel number of 4 to perform convolution calculations on the fused feature image successively to obtain the feature data of each channel, and then uses a convolution kernel with a size of 1x1 to perform full-connection dimensionality reduction to obtain The channel feature image of multiple channels with the same width and height dimensions as the original image.

本发明实施例基于路牌元素所包含的路牌角点数量确定通道数量后，再对融合特征图像进行对应通道数的卷积计算，以分离出各通道的通道特征图像。能够提高图像语义分割的精细性，进而提高图像识别的准确性。In the embodiment of the present invention, after the number of channels is determined based on the number of street sign corners included in the street sign elements, convolution calculation is performed on the fusion feature image corresponding to the number of channels to separate the channel feature images of each channel. It can improve the fineness of image semantic segmentation, and then improve the accuracy of image recognition.

在上述任一实施例的基础上，所述识别模型是基于样本路牌图像，以及与所述样本路牌图像对应标注的类别标签和角点坐标信息得到的。On the basis of any of the above embodiments, the recognition model is obtained based on a sample street sign image, and category labels and corner point coordinate information marked corresponding to the sample street sign image.

所述将所述通道特征图像输入至所述类别识别层，获取所述类别识别层输出的所述路牌角点类别，具体包括：对各通道对应的通道特征图像，进行类别的识别抽取，获取各通道对应的类别概率集合。The step of inputting the channel characteristic image into the category recognition layer and obtaining the street sign corner category output by the category recognition layer specifically includes: performing category identification and extraction on the channel characteristic image corresponding to each channel, and obtaining The set of category probabilities corresponding to each channel.

具体地，路牌的识别装置在训练识别模型的过程中，将各通道所对应的角点类别，以及标识类别的角点所对应的角点坐标信息为真值进行训练，对各通道特征图像进行识别，将通道特征图像中每一个像素点的像素值映射成不同通道对应的角点类别的概率值，整合成类别概率集合。Specifically, in the process of training the recognition model, the street sign recognition device trains the corner point categories corresponding to each channel and the corner point coordinate information corresponding to the corner points of the identification category as true values, and performs training on the feature images of each channel Recognition, the pixel value of each pixel in the channel feature image is mapped to the probability value of the corner point category corresponding to different channels, and integrated into a category probability set.

其中，对于任一通道特征图像，每一像素点对应的多种类别概率值中，与其通道对应的类别概率应当为最大值。Wherein, for any channel feature image, among various category probability values corresponding to each pixel, the category probability corresponding to its channel should be the maximum value.

具体地，路牌的识别装置从每一通道对应的类别概率集合中，对于每一像素点对应的多个概率值中的最大值对应的角点类别进行输出，并根据其对应的角点类别确定其路牌角点坐标。Specifically, the street sign recognition device outputs the corner point category corresponding to the maximum value among the multiple probability values corresponding to each pixel from the category probability set corresponding to each channel, and determines according to the corresponding corner point category The coordinates of the street sign corners.

例如，若要识别待识别图像的方形路牌时，不管是小型方形路牌、中型方形路牌还是大型方形路牌，这三种规格的方形路牌都是4个路牌角点，并定义路牌元素的左上角，右上角，右下角，左下角对应的角点类别设置编号为0、1、2、3。For example, if you want to recognize a square street sign in an image to be recognized, whether it is a small square street sign, a medium square street sign or a large square street sign, these three sizes of square street signs all have four street sign corners, and define the upper left corner of the street sign element, The upper right corner, lower right corner, and lower left corner correspond to the corner category setting numbers 0, 1, 2, and 3.

因此，将对待处理的融合特征图像进行普通卷积计算的普通卷积通道数等于4，可以得到4个通道特征图像。在对类别编号为0的通道所对应的通道特征图像进行识别时，将类别概率集合中大于预设阈值的概率值所对应的像素点识别为0号角点类别，并将对应像素坐标作为路牌左上角点坐标输出。Therefore, the number of ordinary convolution channels for performing ordinary convolution calculation on the fusion feature image to be processed is equal to 4, and 4 channel feature images can be obtained. When identifying the channel feature image corresponding to the channel whose category number is 0, the pixel point corresponding to the probability value greater than the preset threshold in the category probability set is identified as the corner point category 0, and the corresponding pixel coordinates are used as the upper left of the road sign Corner coordinate output.

在对类别编号为1的通道所对应的通道特征图像进行识别时，将类别概率集合中大于预设阈值的概率值所对应的像素点识别为1号角点类别，并将对应像素坐标作为路牌右上角点坐标输出。When identifying the channel feature image corresponding to the channel with category number 1, identify the pixel corresponding to the probability value greater than the preset threshold in the category probability set as the No. 1 corner point category, and use the corresponding pixel coordinates as the upper right corner of the street sign Corner coordinate output.

在对类别编号为2的通道所对应的通道特征图像进行识别时，将类别概率集合中大于预设阈值的概率值所对应的像素点识别为2号角点类别，并将对应像素坐标作为路牌右下角点坐标输出。When identifying the channel feature image corresponding to the channel whose category number is 2, the pixel corresponding to the probability value greater than the preset threshold in the category probability set is identified as the No. 2 corner point category, and the corresponding pixel coordinate is used as the street sign Output the coordinates of the lower corner point.

在对类别编号为3的通道所对应的通道特征图像进行识别时，将类别概率集合中大于预设阈值的概率值所对应的像素点识别为3号角点类别，并将对应像素坐标作为路牌左下角点坐标输出。When identifying the channel feature image corresponding to the channel with category number 3, the pixel corresponding to the probability value greater than the preset threshold in the category probability set is identified as the No. 3 corner point category, and the corresponding pixel coordinates are used as the lower left of the street sign Corner coordinate output.

本发明实施例基于携带有被遮挡住的路牌元素的图像作为识别模型的输入，输出的结果为对应图像中被遮挡部分和遮挡部分中所包含的角点、其角点类别及其角点坐标，通过多层的神经网络对不完整的图像信息自动识别计算，实现被遮挡路牌的准确识别，能提高路牌识别的精细性和准确性，进而提高高精度地图的生成效率。The embodiment of the present invention is based on the image carrying the occluded street sign elements as the input of the recognition model, and the output result is the corner points contained in the occluded part and the occluded part of the corresponding image, its corner point category and its corner point coordinates, through The multi-layer neural network automatically recognizes and calculates incomplete image information, realizes accurate recognition of blocked street signs, improves the fineness and accuracy of street sign recognition, and then improves the generation efficiency of high-precision maps.

在上述任一实施例的基础上，所述获取待识别图像，包括：在确定所述待识别图像未携带有被遮挡住的路牌元素的情况下，基于所述待识别图像中的目标角点，截取非角点区域图像。On the basis of any of the above-mentioned embodiments, the acquiring the image to be recognized includes: when it is determined that the image to be recognized does not carry an occluded street sign element, based on the target corner point in the image to be recognized , intercept the non-corner area image.

具体地，在步骤101中，路牌的识别装置对摄像头采集到的待识别图像进行初步的筛查识别，若识别到待识别图像中未含有被遮挡住的路牌元素，则从图像中抽取一个或者多个目标角点，将距离目标角点一定像素距离的像素点作为非角点区域的起点，在远离目标角点像素方向上选取任一像素点作为非角点区域的终点，以截取出限定好的非角点区域内的非角点区域图像。Specifically, in step 101, the street sign recognition device conducts preliminary screening and recognition on the image to be recognized collected by the camera, and if it is recognized that the image to be recognized does not contain any blocked street sign elements, it extracts one or For multiple target corners, a pixel point that is a certain pixel distance from the target corner point is used as the starting point of the non-corner point area, and any pixel point in the direction away from the target corner point pixel is selected as the end point of the non-corner point area to intercept the limited A non-corner area image within a good non-corner area.

将所述非角点区域图像覆盖在所述目标角点对应的区域，生成新的待识别图像。Overlay the non-corner region image on the region corresponding to the target corner to generate a new image to be recognized.

具体地，路牌的识别装置先确定非角点区域图像的尺寸，在以目标角点与非角点区域的起点重合，使对应尺寸的非角点区域图像覆盖在目标角点所处的区域中，生成新的待识别图像。以供识别模型从新的待识别图像中被覆盖的区域识别出目标角点的类别。Specifically, the street sign recognition device first determines the size of the non-corner area image, and then overlaps the starting point of the target corner with the non-corner area, so that the non-corner area image of the corresponding size is covered in the area where the target corner is located , to generate a new image to be recognized. It is used for the recognition model to recognize the category of the target corner from the covered area in the new image to be recognized.

本发明实施例对识别过程的具体实施方式不作具体限定。The embodiment of the present invention does not specifically limit the specific implementation manner of the identification process.

优选地，图2是本发明提供的路牌的识别方法的流程示意图之二。如图2所示，整个识别过程包括训练过程和测试过程。Preferably, FIG. 2 is the second schematic flow diagram of the street sign recognition method provided by the present invention. As shown in Figure 2, the entire recognition process includes a training process and a testing process.

(一)训练过程是将样本路牌图像依次输入至特征提取层、通道分离层和类别识别层进行处理，将所得到的样本路牌角点类别和与样本路牌图像对应标注的类别标签进行梯度下降法，实现识别模型的训练。(1) The training process is to sequentially input the sample street sign image to the feature extraction layer, channel separation layer and category recognition layer for processing, and perform the gradient descent method on the obtained sample street sign corner category and the category label corresponding to the sample street sign image , to realize the training of the recognition model.

(二)测试过程中，首先要对相机采集到的待识别图像进行预处理。(2) During the testing process, the image to be recognized collected by the camera should be preprocessed first.

若待识别图像中含有被遮挡的角点部分，则直接将待识别图像输入至训练好的识别模型中，依次经由特征提取层、通道分离层和类别识别层进行处理，最后从待识别图像中被遮挡的区域识别出其对应的角点类别。If the image to be recognized contains occluded corners, the image to be recognized is directly input into the trained recognition model, and then processed through the feature extraction layer, channel separation layer and category recognition layer in turn, and finally from the image to be recognized The occluded regions identify their corresponding corner categories.

若待识别图像中未含有被遮挡的角点部分，则需要将待识别图像中非角点区域覆盖在角点区域，以形成被遮挡的角点部分，将新的带识别图像输入至训练好的识别模型中，依次经由特征提取层、通道分离层和类别识别层进行处理，最后从新的待识别图像中被覆盖的区域识别出其对应的角点类别。If the image to be recognized does not contain the occluded corner part, it is necessary to cover the non-corner area in the image to be recognized on the corner area to form the occluded corner part, and input the new image with recognition to the trained In the recognition model, it is processed sequentially through the feature extraction layer, channel separation layer and category recognition layer, and finally the corresponding corner category is identified from the covered area in the new image to be recognized.

本发明实施例基于对待识别图像是否含有被遮挡角点进行预判，实现对完整的图片信息进行主动角点遮挡，能提高识别模型的鲁棒性，进而提高高精度地图的生成效率。The embodiment of the present invention is based on predicting whether the image to be recognized contains occluded corners, and realizes active corner occlusion of complete picture information, which can improve the robustness of the recognition model, and further improve the generation efficiency of high-precision maps.

图3是本发明提供的路牌的识别装置的结构示意图。在上述任一实施例的基础上，如图3所示，该装置包括图像获取模块310和角点识别模块320，其中：Fig. 3 is a schematic structural diagram of a street sign recognition device provided by the present invention. On the basis of any of the above embodiments, as shown in FIG. 3 , the device includes an image acquisition module 310 and a corner recognition module 320, wherein:

图像获取模块310，用于获取待识别图像，所述待识别图像携带有被遮挡住的路牌元素。The image acquiring module 310 is configured to acquire an image to be recognized, and the image to be recognized carries a blocked street sign element.

角点识别模块220，用于将所述待识别图像输入至识别模型，获得所述识别模型输出的与所述待识别图像对应的路牌角点类别。The corner recognition module 220 is configured to input the image to be recognized into a recognition model, and obtain the street sign corner category output by the recognition model corresponding to the image to be recognized.

其中，所述识别模型是基于样本路牌图像，以及与所述样本路牌图像对应标注的类别标签得到的。所述识别模型包括特征提取层、通道分离层和类别识别层。Wherein, the recognition model is obtained based on a sample street sign image and a category label corresponding to the sample street sign image. The recognition model includes a feature extraction layer, a channel separation layer and a category recognition layer.

所述角点识别模块320，具体包括特征提取单元321、通道分离单元322和类别识别单元323，其中：The corner point identification module 320 specifically includes a feature extraction unit 321, a channel separation unit 322 and a category identification unit 323, wherein:

所述特征提取单元321，用于将所述待识别图像输入至所述特征提取层，获取所述特征提取层输出的融合特征图像。The feature extraction unit 321 is configured to input the image to be recognized to the feature extraction layer, and obtain a fused feature image output by the feature extraction layer.

所述通道分离单元322，用于将所述融合特征图像输入至所述通道分离层，获取所述通道分离层输出的通道特征图像。The channel separation unit 322 is configured to input the fused feature image to the channel separation layer, and obtain a channel feature image output by the channel separation layer.

所述类别识别单元323，用于将所述通道特征图像输入至所述类别识别层，获取所述类别识别层输出的所述路牌角点类别。The category identification unit 323 is configured to input the channel feature image to the category identification layer, and obtain the street sign corner category output by the category identification layer.

具体地，图像获取模块310和角点识别模块320顺次电连接。Specifically, the image acquisition module 310 and the corner recognition module 320 are electrically connected in sequence.

图像获取模块310实时接收由安装在车辆的摄像头所拍摄的图片，以作为待识别图片，并且，待识别图片携带有被遮挡住的路牌元素。The image acquisition module 310 receives in real time the picture taken by the camera installed in the vehicle as the picture to be recognized, and the picture to be recognized carries the blocked street sign element.

角点识别模块320对构建好的识别模型各层间的权值系数初始化，再将训练集中的一组样本问题数据和样本答案数据的标注内容输入到当前权值系数下的神经网络，依次计算输入层、隐藏层和输出层的各节点的输出。输出层最后的输出结果与其实际连接位置状态类型之间的累积误差，根据梯度下降法，修正输入层与隐藏层各节点间的权值系数。依照上述过程，直至遍历训练集中的所有样本，可以得到输入层与隐藏层的权值系数。The corner point recognition module 320 initializes the weight coefficients between the layers of the constructed recognition model, and then inputs a set of sample question data and sample answer data in the training set into the neural network under the current weight coefficient, and calculates The output of each node of the input layer, hidden layer, and output layer. According to the cumulative error between the final output result of the output layer and the actual connection position state type, the weight coefficient between each node of the input layer and the hidden layer is corrected according to the gradient descent method. According to the above process, until all the samples in the training set are traversed, the weight coefficients of the input layer and the hidden layer can be obtained.

路牌的识别装置根据神经网络输入层与隐藏层的权值系数，还原识别模型，并将测试集中的每一待识别图像输入到训练好的识别模型，可以得到该图像对应的识别结果。The street sign recognition device restores the recognition model according to the weight coefficients of the input layer and the hidden layer of the neural network, and inputs each image to be recognized in the test set into the trained recognition model to obtain the recognition result corresponding to the image.

可选地，特征提取单元321包括多尺度卷积子单元和融合子单元，其中：Optionally, the feature extraction unit 321 includes a multi-scale convolution subunit and a fusion subunit, wherein:

多尺度卷积子单元，用于将所述待识别图像，进行不同尺度的下采样操作和卷积操作，获取不同尺度的特征图像。The multi-scale convolution subunit is used to perform down-sampling operations and convolution operations of different scales on the image to be recognized to obtain feature images of different scales.

融合子单元，用于基于各尺度的特征图像，进行特征融合，获取所述融合特征图像。The fusion subunit is configured to perform feature fusion based on feature images of various scales, and obtain the fused feature images.

可选地，通道分离单元322，具体用于对所述融合特征图像进行卷积计算，获取各通道对应的通道特征图像。Optionally, the channel separation unit 322 is specifically configured to perform convolution calculation on the fused feature image to obtain channel feature images corresponding to each channel.

可选地，所述识别模型是基于样本路牌图像，以及与所述样本路牌图像对应标注的类别标签和角点坐标信息得到的。Optionally, the recognition model is obtained based on a sample street sign image, and category labels and corner coordinate information marked corresponding to the sample street sign image.

相应地，类别识别单元323包括概率集合获取子单元和识别子单元，其中：Correspondingly, the category identification unit 323 includes a probability set acquisition subunit and an identification subunit, wherein:

概率集合获取子单元，用于对各通道对应的通道特征图像，进行类别的识别抽取，获取各通道对应的类别概率集合。The probability set acquisition subunit is used to identify and extract the category of the channel feature image corresponding to each channel, and acquire the category probability set corresponding to each channel.

识别子单元，用于利用所述通道对应的类别概率集合，确定所述通道特征图像所对应的路牌角点类别，以及路牌角点坐标。The identification subunit is configured to use the category probability set corresponding to the channel to determine the category of the corner point of the street sign corresponding to the feature image of the channel and the coordinates of the corner point of the street sign.

可选地，图像获取模块310包括截取单元和覆盖单元，其中：Optionally, the image acquisition module 310 includes an intercepting unit and a covering unit, wherein:

截取单元，用于在确定所述待识别图像未携带有被遮挡住的路牌元素的情况下，基于所述待识别图像中的目标角点，截取非角点区域图像。An intercepting unit, configured to intercept a non-corner region image based on a target corner point in the image to be recognized when it is determined that the image to be recognized does not carry an occluded street sign element.

覆盖单元，用于将所述非角点区域图像覆盖在所述目标角点对应的区域，生成新的待识别图像。A covering unit, configured to cover the non-corner region image on the region corresponding to the target corner to generate a new image to be recognized.

本发明实施例提供的路牌的识别装置，用于执行本发明上述路牌的识别方法，其实施方式与本发明提供的路牌的识别方法的实施方式一致，且可以达到相同的有益效果，此处不再赘述。The street sign recognition device provided by the embodiment of the present invention is used to implement the above-mentioned street sign recognition method of the present invention, and its implementation mode is consistent with the implementation mode of the street sign recognition method provided by the present invention, and can achieve the same beneficial effect. Let me repeat.

本发明实施例基于携带有被遮挡住的路牌元素的图像作为识别模型的输入，通过特征提取层对将待识别图像进行特征提取，通过通道分离层对特征提取层输出的融合特征图像进行通道分离后，再经由类别识别层将各通道的通道特征图像进行全连接处理，输出对应图像中被遮挡部分和遮挡部分中所包含的角点及其角点类别，通过多层的神经网络对不完整的图像信息自动识别计算，充分挖掘包含不完整信息的图片中的逻辑性和关联性，实现被遮挡路牌的准确识别，能提高路牌识别的精细性和准确性，进而提高高精度地图的生成效率。图4是本发明提供的车辆的结构示意图。在上述任一实施例的基础上，如图4所示，包括车辆本体410，还包括设置在所述车辆本体410的识别装置420，所述识别装置用于执行上述的路牌的识别方法。In the embodiment of the present invention, based on the image carrying the occluded street sign elements as the input of the recognition model, feature extraction is performed on the image to be recognized through the feature extraction layer, and channel separation is performed on the fused feature image output by the feature extraction layer through the channel separation layer After that, the channel feature images of each channel are fully connected through the category recognition layer, and the corner points and their corner point categories contained in the occluded part and occluded part of the corresponding image are output. The automatic recognition and calculation of image information fully excavates the logic and relevance of pictures containing incomplete information, realizes the accurate recognition of blocked street signs, improves the fineness and accuracy of street sign recognition, and then improves the generation efficiency of high-precision maps. Fig. 4 is a structural schematic diagram of the vehicle provided by the present invention. On the basis of any of the above embodiments, as shown in FIG. 4 , it includes a vehicle body 410 and a recognition device 420 disposed on the vehicle body 410 , the recognition device is used to implement the above-mentioned street sign recognition method.

具体地，车辆由至少车辆本体410，以及内嵌在车辆本体410的开发板中的识别装置420组成。Specifically, the vehicle is composed of at least a vehicle body 410 and an identification device 420 embedded in a development board of the vehicle body 410 .

车辆本体410的开发板与识别装置420连接，用于通过无线通信技术(Wi-Fi)、蓝牙或串口等通信方式进行的远距离传输通讯，本发明实施例对此不作具体限定。The development board of the vehicle body 410 is connected to the identification device 420 for long-distance transmission and communication through wireless communication technology (Wi-Fi), bluetooth or serial port and other communication methods, which is not specifically limited in the embodiment of the present invention.

本发明实施例基于携带有被遮挡住的路牌元素的图像作为识别模型的输入，通过特征提取层对将待识别图像进行特征提取，通过通道分离层对特征提取层输出的融合特征图像进行通道分离后，再经由类别识别层将各通道的通道特征图像进行全连接处理，输出对应图像中被遮挡部分和遮挡部分中所包含的角点及其角点类别，通过多层的神经网络对不完整的图像信息自动识别计算，充分挖掘包含不完整信息的图片中的逻辑性和关联性，实现被遮挡路牌的准确识别，能提高路牌识别的精细性和准确性，进而提高高精度地图的生成效率。图5示例了一种电子设备的实体结构示意图，如图5所示，该电子设备可以包括：处理器(processor)510、通信接口(Communications Interface)520、存储器(memory)530和通信总线540，其中，处理器510，通信接口520，存储器530通过通信总线540完成相互间的通信。处理器510可以调用存储器530中的逻辑指令，以执行路牌的识别方法，该方法包括：获取待识别图像，所述待识别图像携带有被遮挡住的路牌元素；将所述待识别图像输入至识别模型，获得所述识别模型输出的与所述待识别图像对应的路牌角点类别；其中，所述识别模型是基于样本路牌图像，以及与所述样本路牌图像对应标注的类别标签得到的；所述识别模型包括特征提取层、通道分离层和类别识别层；所述将所述待识别图像输入至识别模型，获得所述识别模型输出的与所述待识别图像对应的路牌角点类别，具体包括：将所述待识别图像输入至所述特征提取层，获取所述特征提取层输出的融合特征图像；将所述融合特征图像输入至所述通道分离层，获取所述通道分离层输出的通道特征图像；将所述通道特征图像输入至所述类别识别层，获取所述类别识别层输出的所述路牌角点类别。In the embodiment of the present invention, based on the image carrying the occluded street sign elements as the input of the recognition model, feature extraction is performed on the image to be recognized through the feature extraction layer, and channel separation is performed on the fused feature image output by the feature extraction layer through the channel separation layer After that, the channel feature images of each channel are fully connected through the category recognition layer, and the corner points and their corner point categories contained in the occluded part and occluded part of the corresponding image are output. The automatic recognition and calculation of image information fully excavates the logic and relevance of pictures containing incomplete information, realizes the accurate recognition of blocked street signs, improves the fineness and accuracy of street sign recognition, and then improves the generation efficiency of high-precision maps. FIG. 5 illustrates a schematic diagram of the physical structure of an electronic device. As shown in FIG. 5, the electronic device may include: a processor (processor) 510, a communication interface (Communications Interface) 520, a memory (memory) 530 and a communication bus 540, Wherein, the processor 510 , the communication interface 520 , and the memory 530 communicate with each other through the communication bus 540 . The processor 510 can call the logic instruction in the memory 530 to execute the identification method of the road sign, the method includes: acquiring an image to be recognized, the image to be recognized carries a blocked street sign element; inputting the image to be recognized to A recognition model, obtaining the street sign corner category output by the recognition model and corresponding to the image to be recognized; wherein, the recognition model is obtained based on a sample street sign image and a category label corresponding to the sample street sign image; The recognition model includes a feature extraction layer, a channel separation layer, and a category recognition layer; the input of the image to be recognized to the recognition model is to obtain the street sign corner category output by the recognition model corresponding to the image to be recognized, It specifically includes: inputting the image to be recognized into the feature extraction layer, obtaining a fusion feature image output by the feature extraction layer; inputting the fusion feature image into the channel separation layer, and obtaining the output of the channel separation layer The channel feature image; input the channel feature image to the category identification layer, and obtain the street sign corner category output by the category identification layer.

此外，上述的存储器530中的逻辑指令可以通过软件功能单元的形式实现并作为独立的产品销售或使用时，可以存储在一个计算机可读取存储介质中。基于这样的理解，本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来，该计算机软件产品存储在一个存储介质中，包括若干指令用以使得一台计算机设备(可以是个人计算机，服务器，或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。而前述的存储介质包括：U盘、移动硬盘、只读存储器(ROM，Read-Only Memory)、随机存取存储器(RAM，Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质。In addition, the above logic instructions in the memory 530 may be implemented in the form of software function units and be stored in a computer-readable storage medium when sold or used as an independent product. Based on this understanding, the essence of the technical solution of the present invention or the part that contributes to the prior art or the part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium, including Several instructions are used to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the methods described in various embodiments of the present invention. The aforementioned storage medium includes: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disk or optical disk and other media that can store program codes. .

另一方面，本发明还提供一种计算机程序产品，所述计算机程序产品包括计算机程序，计算机程序可存储在非暂态计算机可读存储介质上，所述计算机程序被处理器执行时，计算机能够执行上述各方法所提供的路牌的识别方法，该方法包括：获取待识别图像，所述待识别图像携带有被遮挡住的路牌元素；将所述待识别图像输入至识别模型，获得所述识别模型输出的与所述待识别图像对应的路牌角点类别；其中，所述识别模型是基于样本路牌图像，以及与所述样本路牌图像对应标注的类别标签得到的；所述识别模型包括特征提取层、通道分离层和类别识别层；所述将所述待识别图像输入至识别模型，获得所述识别模型输出的与所述待识别图像对应的路牌角点类别，具体包括：将所述待识别图像输入至所述特征提取层，获取所述特征提取层输出的融合特征图像；将所述融合特征图像输入至所述通道分离层，获取所述通道分离层输出的通道特征图像；将所述通道特征图像输入至所述类别识别层，获取所述类别识别层输出的所述路牌角点类别。On the other hand, the present invention also provides a computer program product. The computer program product includes a computer program that can be stored on a non-transitory computer-readable storage medium. When the computer program is executed by a processor, the computer can Executing the street sign recognition method provided by each of the above methods, the method includes: acquiring an image to be recognized, the image to be recognized carries a blocked street sign element; inputting the image to be recognized to a recognition model to obtain the recognition The street sign corner category corresponding to the image to be recognized output by the model; wherein, the recognition model is obtained based on the sample street sign image and the category label corresponding to the sample street sign image; the recognition model includes feature extraction layer, channel separation layer, and category recognition layer; the input of the image to be recognized into the recognition model, and obtaining the street sign corner category output by the recognition model corresponding to the image to be recognized specifically includes: The recognition image is input to the feature extraction layer, and the fusion feature image output by the feature extraction layer is obtained; the fusion feature image is input to the channel separation layer, and the channel feature image output by the channel separation layer is obtained; The channel feature image is input to the category recognition layer, and the street sign corner category output by the category recognition layer is obtained.

又一方面，本发明还提供一种非暂态计算机可读存储介质，其上存储有计算机程序，该计算机程序被处理器执行时实现以执行上述各方法提供的路牌的识别方法，该方法包括：获取待识别图像，所述待识别图像携带有被遮挡住的路牌元素；将所述待识别图像输入至识别模型，获得所述识别模型输出的与所述待识别图像对应的路牌角点类别；其中，所述识别模型是基于样本路牌图像，以及与所述样本路牌图像对应标注的类别标签得到的；所述识别模型包括特征提取层、通道分离层和类别识别层；所述将所述待识别图像输入至识别模型，获得所述识别模型输出的与所述待识别图像对应的路牌角点类别，具体包括：将所述待识别图像输入至所述特征提取层，获取所述特征提取层输出的融合特征图像；将所述融合特征图像输入至所述通道分离层，获取所述通道分离层输出的通道特征图像；将所述通道特征图像输入至所述类别识别层，获取所述类别识别层输出的所述路牌角点类别。In yet another aspect, the present invention also provides a non-transitory computer-readable storage medium, on which a computer program is stored, and when the computer program is executed by a processor, it is implemented to perform the street sign recognition method provided by the above-mentioned methods, the method includes : Obtain the image to be recognized, the image to be recognized carries the blocked street sign element; input the image to be recognized to the recognition model, and obtain the street sign corner category output by the recognition model corresponding to the image to be recognized ; Wherein, the recognition model is obtained based on the sample street sign image and the category label corresponding to the sample street sign image; the recognition model includes a feature extraction layer, a channel separation layer and a category recognition layer; the described The image to be recognized is input to the recognition model, and the street sign corner category corresponding to the image to be recognized outputted by the recognition model is obtained, which specifically includes: inputting the image to be recognized to the feature extraction layer, and obtaining the feature extraction The fusion feature image output by the layer; the fusion feature image is input to the channel separation layer, and the channel feature image output by the channel separation layer is obtained; the channel feature image is input to the category identification layer, and the The street sign corner category output by the category recognition layer.

以上所描述的装置实施例仅仅是示意性的，其中所述作为分离部件说明的单元可以是或者也可以不是物理上分开的，作为单元显示的部件可以是或者也可以不是物理单元，即可以位于一个地方，或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案的目的。本领域普通技术人员在不付出创造性的劳动的情况下，即可以理解并实施。The device embodiments described above are only illustrative, and the units described as separate components may or may not be physically separated, and the components shown as units may or may not be physical units, that is, they may be located in One place, or it can be distributed to multiple network elements. Part or all of the modules can be selected according to actual needs to achieve the purpose of the solution of this embodiment. It can be understood and implemented by those skilled in the art without any creative effort.

通过以上的实施方式的描述，本领域的技术人员可以清楚地了解到各实施方式可借助软件加必需的通用硬件平台的方式来实现，当然也可以通过硬件。基于这样的理解，上述技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来，该计算机软件产品可以存储在计算机可读存储介质中，如ROM/RAM、磁碟、光盘等，包括若干指令用以使得一台计算机设备(可以是个人计算机，服务器，或者网络设备等)执行各个实施例或者实施例的某些部分所述的方法。Through the above description of the implementations, those skilled in the art can clearly understand that each implementation can be implemented by means of software plus a necessary general hardware platform, and of course also by hardware. Based on this understanding, the essence of the above technical solution or the part that contributes to the prior art can be embodied in the form of software products, and the computer software products can be stored in computer-readable storage media, such as ROM/RAM, magnetic discs, optical discs, etc., including several instructions to make a computer device (which may be a personal computer, server, or network device, etc.) execute the methods described in various embodiments or some parts of the embodiments.

最后应说明的是：以上实施例仅用以说明本发明的技术方案，而非对其限制；尽管参照前述实施例对本发明进行了详细的说明，本领域的普通技术人员应当理解：其依然可以对前述各实施例所记载的技术方案进行修改，或者对其中部分技术特征进行等同替换；而这些修改或者替换，并不使相应技术方案的本质脱离本发明各实施例技术方案的精神和范围。Finally, it should be noted that: the above embodiments are only used to illustrate the technical solutions of the present invention, rather than to limit them; although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that: it can still be Modifications are made to the technical solutions described in the foregoing embodiments, or equivalent replacements are made to some of the technical features; and these modifications or replacements do not make the essence of the corresponding technical solutions deviate from the spirit and scope of the technical solutions of the various embodiments of the present invention.

Claims

1. A recognition method for street signs, comprising:

Acquiring the image to be recognized, the image to be recognized carries the blocked street sign element;

inputting the image to be recognized into a recognition model, and obtaining the street sign corner category output by the recognition model corresponding to the image to be recognized;

Wherein, the recognition model is obtained based on a sample street sign image and a category label corresponding to the sample street sign image; the recognition model includes a feature extraction layer, a channel separation layer, and a category recognition layer;

The step of inputting the image to be recognized into the recognition model and obtaining the street sign corner category output by the recognition model corresponding to the image to be recognized specifically includes:

Inputting the image to be identified into the feature extraction layer, and obtaining a fusion feature image output by the feature extraction layer;

input the fusion feature image to the channel separation layer, and obtain the channel feature image output by the channel separation layer;

The channel feature image is input to the category recognition layer, and the street sign corner category output by the category recognition layer is obtained.

2. the recognition method of street sign according to claim 1, is characterized in that, described described image to be recognized is input to described feature extraction layer, obtains the fusion characteristic image of described feature extraction layer output, comprises:

Performing downsampling operations and convolution operations of different scales on the image to be identified to obtain feature images of different scales;

performing feature fusion based on the feature images of each scale, and obtaining the fused feature image;

Among them, after each feature fusion, a multi-scale convolution is performed before the next feature fusion.

3. the identification method of street sign according to claim 1, is characterized in that, described fusion feature image is input to described channel separation layer, obtains the channel feature image of described channel separation layer output, comprising:

Carrying out convolution calculation on the fusion feature image to obtain the channel feature image corresponding to each channel;

Wherein, the number of passages is determined according to the number of street sign corners contained in a complete street sign element.

4. the identification method of street sign according to claim 1, is characterized in that, described recognition model is based on sample street sign image, and the category label and corner point coordinate information of label corresponding to described sample street sign image obtain;

The step of inputting the channel feature image into the category identification layer, and obtaining the street sign corner category output by the category identification layer specifically includes:

For the channel feature image corresponding to each channel, perform category identification and extraction, and obtain the category probability set corresponding to each channel;

Using the class probability set corresponding to the channel, the street sign corner category corresponding to the channel feature image and the street sign corner point coordinates are determined.

5. The method for identifying street signs according to any one of claims 1-4, wherein said acquiring an image to be identified comprises:

In the case where it is determined that the image to be recognized does not carry a blocked street sign element, based on the target corner point in the image to be recognized, intercept the non-corner area image;

Overlay the non-corner region image on the region corresponding to the target corner to generate a new image to be recognized.

6. A street sign recognition device, characterized in that it comprises:

An image acquisition module, configured to acquire an image to be recognized, the image to be recognized carries a blocked street sign element;

A corner point recognition module, configured to input the image to be recognized into a recognition model, and obtain the street sign corner category output by the recognition model corresponding to the image to be recognized;

The corner recognition module specifically includes a feature extraction unit, a channel separation unit and a category recognition unit, wherein:

The feature extraction unit is configured to input the image to be recognized to the feature extraction layer, and obtain a fusion feature image output by the feature extraction layer;

The channel separation unit is configured to input the fusion feature image to the channel separation layer, and obtain the channel feature image output by the channel separation layer;

The category identification unit is configured to input the channel feature image to the category identification layer, and obtain the street sign corner category output by the category identification layer.

7. A vehicle, comprising a vehicle body, characterized in that it further comprises a recognition device disposed on the vehicle body, the recognition device being used to execute the street sign recognition method according to any one of claims 1 to 5.

8. An electronic device comprising a memory, a processor, and a computer program stored on the memory and operable on the processor, wherein the processor according to claim 1 is implemented when executing the program. The identification method of the street sign described in any one of to 5.

9. A non-transitory computer-readable storage medium, on which a computer program is stored, characterized in that, when the computer program is executed by a processor, the street sign recognition method according to any one of claims 1 to 5 is implemented.

10. A computer program product, comprising a computer program, characterized in that, when the computer program is executed by a processor, the street sign recognition method according to any one of claims 1 to 5 is implemented.