WO2021068376A1 - Convolution processing method and system applied to convolutional neural network, and related components - Google Patents

Convolution processing method and system applied to convolutional neural network, and related components Download PDF

Info

Publication number
WO2021068376A1
WO2021068376A1 PCT/CN2019/121105 CN2019121105W WO2021068376A1 WO 2021068376 A1 WO2021068376 A1 WO 2021068376A1 CN 2019121105 W CN2019121105 W CN 2019121105W WO 2021068376 A1 WO2021068376 A1 WO 2021068376A1
Authority
WO
WIPO (PCT)
Prior art keywords
convolution
calculation
target operation
side window
operation object
Prior art date
Application number
PCT/CN2019/121105
Other languages
French (fr)
Chinese (zh)
Inventor
金良
范宝余
郭振华
Original Assignee
浪潮电子信息产业股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 浪潮电子信息产业股份有限公司 filed Critical 浪潮电子信息产业股份有限公司
Publication of WO2021068376A1 publication Critical patent/WO2021068376A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/082Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks

Definitions

  • This application relates to the field of deep learning, and in particular to a convolution processing method, system and related components applied to a convolutional neural network.
  • convolutional neural network is a relatively important type of neural network. Its biggest feature is convolution operation. It is often used in the training process to extract different features through convolution, and then combine all these features organically. Make corresponding decisions.
  • the convolutional neural network usually has many layers, and each layer has multiple convolution kernel filters. The layers are connected to form a directed acyclic graph. Such a central convolution will increase the When the resolution is reduced, the performance of the convolutional neural network is reduced.
  • the purpose of this application is to provide a data storage method, system, device, and readable storage medium during abnormal shutdown, so as to store important data during abnormal shutdown and ensure that data is not lost.
  • the specific plan is as follows:
  • the specific plan is as follows:
  • a convolution processing method applied to a convolutional neural network including:
  • the target operation object is specifically an input feature
  • the process of performing side window convolution calculation on the target operation object to obtain calculation results in multiple directions specifically includes:
  • the four calculation side windows are used to perform side window convolution calculations on the target operation object to obtain calculation results in four directions.
  • the process of performing side-window convolution calculations on the target operation objects by using the calculation side windows to obtain calculation results in four directions specifically includes:
  • a unified side window convolution calculation is performed on a plurality of the target operation objects by using the calculation side window in the lower right direction to obtain a plurality of calculation results in the lower right direction.
  • the process of performing cross-entropy optimization processing on the calculation results in the multiple directions to obtain the convolution result of the target operation object specifically includes:
  • the target operation object is specifically a boundary input feature and/or a texture input feature.
  • the convolution processing method further includes: performing a weighted average on the convolution result corresponding to each convolution kernel to obtain the final convolution result of the target operation object.
  • this application also discloses a convolution processing system applied to a convolutional neural network, including:
  • the acquisition module is used to acquire a target operation object;
  • the target operation object is specifically an input feature;
  • a calculation module configured to perform side window convolution calculation on the target operation object to obtain calculation results in multiple directions
  • the result determination module is configured to perform cross-entropy optimization processing on the calculation results of the multiple directions to obtain the convolution result of the target operation object.
  • the calculation module is specifically used for:
  • the four calculation side windows are used to perform side window convolution calculations on the target operation object to obtain calculation results in four directions.
  • this application also discloses a convolution processing device applied to a convolutional neural network, including:
  • Memory used to store computer programs
  • the processor is configured to implement the steps of the convolution processing method applied to the convolutional neural network as described above when the computer program is executed.
  • the present application also discloses a readable storage medium having a computer program stored on the readable storage medium, and when the computer program is executed by a processor, the convolution applied to the convolutional neural network as described above is realized. Processing method steps.
  • the present application discloses a convolution processing method applied to a convolutional neural network, including: obtaining a target operation object; the target operation object is specifically an input feature; performing side window convolution calculation on the target operation object to obtain multiple Calculation results in three directions; performing cross-entropy optimization processing on the calculation results in the multiple directions to obtain the convolution result of the target operation object.
  • This application solves the performance loss caused by the convolution operation with the convolution kernel as the center point in the original convolutional neural network. By comprehensively analyzing the side window convolution operation in multiple directions, the data obtained by the convolution operation is improved The ability to generalize features improves the performance of convolutional neural networks.
  • FIG. 1 is a flowchart of steps of a convolution processing method applied to a convolutional neural network in an embodiment of the application;
  • Fig. 2a is a schematic diagram of an image of a side window in an embodiment of the application.
  • 2b, 2c, and 2d are respectively schematic diagrams of images of side windows in different directions in an embodiment of the application.
  • FIG. 3 is a flowchart of steps of a specific convolution processing method applied to a convolutional neural network in an embodiment of the application;
  • FIG. 4 is a structural distribution diagram of a convolution processing system applied to a convolutional neural network in an embodiment of the application;
  • FIG. 5 is a structural distribution diagram of a convolution processing device applied to a convolutional neural network in an embodiment of the application.
  • the center point of the convolution kernel is selected for the corresponding multiplication and addition operation in the traditional convolution operation, when a pixel is on the boundary, placing the center of the window on the pixel for the convolution operation will blur the edge, which will reduce the feature’s reliability.
  • Resolvability coupled with the convolutional neural network usually has many layers, each layer has multiple filters, and the layers are connected to form a directed acyclic graph. Such a convolution at the center position will aggravate the decrease in resolvability Circumstance, thereby reducing the performance of the convolutional neural network.
  • This application solves the performance loss caused by the convolution operation with the convolution kernel as the center point in the original convolutional neural network. By comprehensively analyzing the side window convolution operation in multiple directions, it improves the convolution operation acquisition The ability of data to generalize features, thereby improving the performance of convolutional neural networks.
  • the embodiment of the application discloses a convolution processing method applied to a convolutional neural network, as shown in FIG. 1, including:
  • S11 Obtain a target operation object; the target operation object is specifically an input feature;
  • the convolution processing method is applicable to all input features in the convolutional neural network, where the input features include both the first layer input features of the initial input layer, such as image pixels, and also the hiding in the neural network.
  • the input features involved in the layer and output layer such as the fine-grained features of the bottom layer, and the semantic features of the high layer.
  • is the angle between the window and the horizontal line
  • r is the radius of the window
  • ⁇ 0,r ⁇ , ⁇ x,y ⁇ is the target pixel
  • the position of i, r is a user-defined parameter used to control all side windows.
  • This step specifically includes: performing cross-entropy optimization processing on the calculation results in the multiple directions, and determining the calculation result with the smallest cross-entropy value as the convolution result of the target operation object.
  • the cross-entropy optimization processing is performed on the calculation results in multiple directions, which can be replaced with L2 norm or other clustering methods to measure the final output to determine the final convolution result. Since the final convolution effect is obtained by comprehensively analyzing the convolution results in multiple directions, it can better reflect the target characteristics. In the continuous processing of the subsequent convolutional neural network, more generalized features can be obtained, thereby improving The learning ability of convolutional neural networks.
  • the side window convolution calculation algorithm based on the L2 norm may cause the algorithm to converge slowly, for example, based on sigmoid Inactive function, when the input data value is too large or too small, the first-order partial derivative tends to zero. Therefore, in this application, the optimization process of cross entropy with obvious advantages in learning speed is selected, and based on the optimization of cross entropy, the calculation result corresponding to the smallest cross entropy value among the current processing points is selected.
  • the target operation object is specifically a boundary input feature and/or a texture input feature.
  • the target operation object in this embodiment may also be other types of input features that can perform convolution processing operations.
  • the above description is based on the convolution processing method when the number of convolution kernel filters is 1.
  • the convolution results corresponding to each convolution kernel are weighted and averaged to obtain the target The final convolution result of the operation object.
  • nchw of caffe Convolutional Architecture for Fast Feature Embedding
  • the convolution kernel size kernel_size of the convolution kernel is 3
  • the padding is 1
  • the convolution step stride is 1
  • the number of convolution kernels num_out is 256. Since pad is 1, stride is 1, the shape of the output data obtained by the convolution operation is [1, 256, 13, 13].
  • This embodiment discloses a convolution processing method applied to a convolutional neural network, including: obtaining a target operation object; the target operation object is specifically an input feature; performing side window convolution calculation on the target operation object to obtain Calculation results in multiple directions; performing cross-entropy optimization processing on the calculation results in the multiple directions to obtain the convolution result of the target operation object.
  • This embodiment solves the performance loss caused by the convolution operation with the convolution kernel as the center point in the original convolutional neural network. By comprehensively analyzing the side window convolution operation in multiple directions, the acquisition of the convolution operation is improved. The ability of data to generalize features, thereby improving the performance of convolutional neural networks.
  • the embodiment of the present application discloses a specific convolution processing method applied to a convolutional neural network. Compared with the previous embodiment, this embodiment further explains and optimizes the technical solution. Specifically, see Figure 3:
  • S21 Acquire a target operation object; the target operation object is specifically an input feature;
  • S22 Determine the directions of the four calculation side windows as upper left, upper right, lower left, and lower right;
  • this step can be performed as follows:
  • a unified side window convolution calculation is performed on a plurality of the target operation objects by using the calculation side window in the lower right direction to obtain a plurality of calculation results in the lower right direction.
  • this embodiment does not limit the calculation order to upper left, upper right, lower left, and lower right.
  • the calculation order is only an example. Calculations are performed in other calculation orders, as long as all side window convolution calculations are performed in the same direction. This embodiment has the effect of improving the calculation speed.
  • S24 Perform cross-entropy optimization processing on the calculation results in the multiple directions to obtain the convolution result of the target operation object.
  • this embodiment replaces the previous convolution calculation centered on the convolution kernel with the side window convolution calculation in multiple directions, and optimizes the cross-entropy.
  • the final convolution result is processed, and the local receptive field characteristics of the convolutional neural network are fully utilized, so that the extracted features can better reflect the characteristics of the target, and the generalization performance of the features is stronger.
  • the convolution in this embodiment More generalized features can be extracted, so a relatively shallow and narrow network can be designed, which can improve the performance of the convolutional neural network and reduce the amount of model parameters.
  • this application also discloses a convolution processing system applied to a convolutional neural network, as shown in FIG. 4, including:
  • the obtaining module 01 is used to obtain a target operation object; the target operation object is specifically an input feature;
  • the calculation module 02 is configured to perform side window convolution calculation on the target operation object to obtain calculation results in multiple directions;
  • the result determination module 03 is configured to perform cross-entropy optimization processing on the calculation results in the multiple directions to obtain the convolution result of the target operation object.
  • the calculation module 02 is specifically configured to:
  • the four calculation side windows are used to perform side window convolution calculations on the target operation object to obtain calculation results in four directions.
  • the calculation module 02 is specifically used for:
  • the process of performing side-window convolution calculations on the target operation objects by using the calculation side windows to obtain calculation results in four directions specifically includes:
  • a unified side window convolution calculation is performed on a plurality of the target operation objects by using the calculation side window in the lower right direction to obtain multiple calculation results in the lower right direction.
  • the result determining module 03 is specifically configured to:
  • the target operation object is specifically a boundary input feature and/or a texture input feature.
  • the convolution processing system further includes: a weighted average module, configured to perform a weighted average on the convolution result corresponding to each convolution kernel when there are multiple convolution kernels to obtain The final convolution result of the target operation object.
  • a weighted average module configured to perform a weighted average on the convolution result corresponding to each convolution kernel when there are multiple convolution kernels to obtain The final convolution result of the target operation object.
  • the embodiment of the application solves the performance loss caused by the convolution operation with the convolution kernel as the center point in the original convolutional neural network.
  • the convolution operation is improved.
  • the ability to obtain more generalized features of the data improves the performance of the convolutional neural network.
  • this application also discloses a convolution processing device applied to a convolutional neural network. As shown in FIG. 5, it includes a processor 11 and a memory 12; wherein, the processing 11 executes the data stored in the memory 12
  • the computer program implements the following steps:
  • the target operation object is specifically an input feature
  • the embodiment of the application solves the performance loss caused by the convolution operation with the convolution kernel as the center point in the original convolutional neural network.
  • the convolution operation is improved.
  • the ability to obtain more generalized features of the data improves the performance of the convolutional neural network.
  • the four calculation side windows are used to perform side window convolution calculations on the target operation object to obtain calculation results in four directions.
  • a unified side window convolution calculation is performed on a plurality of the target operation objects by using the calculation side window in the lower right direction to obtain a plurality of calculation results in the lower right direction.
  • the target operation object is specifically a boundary input feature and/or a texture input feature.
  • the convolution result corresponding to each convolution kernel is weighted and averaged to obtain the final convolution result of the target operation object.
  • the convolution processing device in this embodiment may further include:
  • the input interface 13 is used to obtain a computer program imported from the outside world, and save the obtained computer program in the memory 12, and can also be used to obtain various instructions and parameters transmitted by external terminal devices, and transmit them to the processor 11 , So that the processor 11 uses the above-mentioned various instructions and parameters to carry out corresponding processing.
  • the input interface 13 may specifically include, but is not limited to, a USB interface, a serial interface, a voice input interface, a fingerprint input interface, a hard disk reading interface, and the like.
  • the output interface 14 is used to output various data generated by the processor 11 to a terminal device connected to it, so that other terminal devices connected to the output interface 14 can obtain various data generated by the processor 11.
  • the output interface 14 may specifically include, but is not limited to, a USB interface, a serial interface, and the like.
  • the communication unit 15 is used to establish a remote communication connection between the convolution processing device and the external server, so that the convolution processing device can mount the image file to the external server.
  • the communication unit 15 may specifically include, but is not limited to, a remote communication unit based on wireless communication technology or wired communication technology.
  • the keyboard 16 is used to obtain various parameter data or instructions input by the user by hitting the keycap in real time.
  • the display 17 is used for real-time display of relevant information of the convolution processing process, so that the user can understand the current processing situation of the convolution neural network in time.
  • the mouse 18 can be used to assist the user in inputting data and simplify the user's operation.
  • the embodiment of the present application also discloses a computer-readable storage medium.
  • the computer-readable storage medium mentioned here includes random access memory (RAM), internal memory, read-only memory (ROM), electrically programmable ROM, and Erase programmable ROM, register, hard disk, removable hard disk, CD-ROM or any other form of storage medium known in the technical field.
  • RAM random access memory
  • ROM read-only memory
  • Erase programmable ROM register, hard disk, removable hard disk, CD-ROM or any other form of storage medium known in the technical field.
  • a computer program is stored in the computer-readable storage medium, and when the computer program is executed by a processor, the following steps are implemented:
  • the target operation object is specifically an input feature
  • the embodiment of the application solves the performance loss caused by the convolution operation with the convolution kernel as the center point in the original convolutional neural network.
  • the convolution operation is improved.
  • the ability to obtain more generalized features of the data improves the performance of the convolutional neural network.
  • the four calculation side windows are used to perform side window convolution calculations on the target operation object to obtain calculation results in four directions.
  • a unified side window convolution calculation is performed on a plurality of the target operation objects by using the calculation side window in the lower right direction to obtain a plurality of calculation results in the lower right direction.
  • the target operation object is specifically a boundary input feature and/or a texture input feature.
  • the convolution result corresponding to each convolution kernel is weighted and averaged to obtain the final convolution result of the target operation object.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Molecular Biology (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Image Analysis (AREA)
  • Complex Calculations (AREA)

Abstract

A convolution processing method, system and device applied to a convolutional neural network, and a readable storage medium, comprising: obtaining a target operation object, the target operation object being specifically an input feature (S11); performing side window convolution calculation on the target operation object to obtain calculation results in multiple directions (S12); and performing cross entropy optimization processing on the calculation results in the multiple directions to obtain a convolution result of the target operation object (S13). According to the method, performance loss caused by a convolution operation with a convolution kernel as a central point in an original convolutional neural network is avoided, the ability of the convolution operation to obtain more generalized features of data is improved by comprehensively analyzing a side window convolution operation in the multiple directions, and thus the performance of the convolutional neural network is improved.

Description

应用于卷积神经网络的卷积处理方法、系统及相关组件Convolution processing method, system and related components applied to convolution neural network
本申请要求于2019年10月11日提交中国专利局、申请号为201910963711.1、申请名称为“应用于卷积神经网络的卷积处理方法、系统及相关组件”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of a Chinese patent application filed with the Chinese Patent Office on October 11, 2019, the application number is 201910963711.1, and the application name is "Convolutional processing methods, systems and related components applied to convolutional neural networks." The entire content is incorporated into this application by reference.
技术领域Technical field
本申请涉及深度学习领域,特别涉及一种应用于卷积神经网络的卷积处理方法、系统及相关组件。This application relates to the field of deep learning, and in particular to a convolution processing method, system and related components applied to a convolutional neural network.
背景技术Background technique
在深度学习中,卷积神经网络是一类比较重要的神经网络,其最大特点是卷积运算,常用于在训练过程中,通过卷积提取不同的特征,然后将所有这些特征有机组合起来做相应决策。In deep learning, convolutional neural network is a relatively important type of neural network. Its biggest feature is convolution operation. It is often used in the training process to extract different features through convolution, and then combine all these features organically. Make corresponding decisions.
但是由于传统的卷积操作时选择了卷积核中心点作相应的乘加运算,当某个像素在边界上,将窗口中心放在像素上做卷积操作会模糊边缘,这样会降低特征的可分辨性,再加上卷积神经网络通常有很多层,每层有多个卷积核filter,层与层之间连接成有向无环图,这样的以中心位置的卷积会加剧可分辨性下降的情况,从而降低卷积神经网络的性能。However, because the center point of the convolution kernel is selected for the corresponding multiplication and addition operation in the traditional convolution operation, when a pixel is on the boundary, placing the center of the window on the pixel for the convolution operation will blur the edge, which will reduce the characteristic In addition, the convolutional neural network usually has many layers, and each layer has multiple convolution kernel filters. The layers are connected to form a directed acyclic graph. Such a central convolution will increase the When the resolution is reduced, the performance of the convolutional neural network is reduced.
申请内容Application content
有鉴于此,本申请的目的在于提供一种异常关机时的数据存储方法、系统、装置及可读存储介质,以便在异常关机的时候对重要的数据进行存储,保证数据不丢失。其具体方案如下:In view of this, the purpose of this application is to provide a data storage method, system, device, and readable storage medium during abnormal shutdown, so as to store important data during abnormal shutdown and ensure that data is not lost. The specific plan is as follows:
一种应用于卷积神经网络的卷积处理方法、系统及相关组件,以便解决模糊边缘的技术问题。其具体方案如下:A convolution processing method, system and related components applied to a convolutional neural network to solve the technical problem of fuzzy edges. The specific plan is as follows:
一种应用于卷积神经网络的卷积处理方法,包括:A convolution processing method applied to a convolutional neural network, including:
获取目标操作对象;所述目标操作对象具体为输入特征;Acquiring a target operation object; the target operation object is specifically an input feature;
对所述目标操作对象进行边窗卷积计算,得到多个方向的计算结果;Perform side window convolution calculation on the target operation object to obtain calculation results in multiple directions;
对所述多个方向的计算结果进行交叉熵的最优化处理,得到所述目标操作对象的卷积结果。Perform cross-entropy optimization processing on the calculation results in the multiple directions to obtain the convolution result of the target operation object.
优选的,所述对所述目标操作对象进行边窗卷积计算,得到多个方向的计算结果的过程,具体包括:Preferably, the process of performing side window convolution calculation on the target operation object to obtain calculation results in multiple directions specifically includes:
确定四个计算边窗的方向分别为左上、右上、左下、右下;Determine the directions of the four computing side windows as upper left, upper right, lower left, and lower right;
分别利用四个所述计算边窗对所述目标操作对象进行边窗卷积计算,得到四个方向的计算结果。The four calculation side windows are used to perform side window convolution calculations on the target operation object to obtain calculation results in four directions.
优选的,当所述目标操作对象为多个,所述分别利用所述计算边窗对所述目标操作对象进行边窗卷积计算,得到四个方向的计算结果的过程,具体包括:Preferably, when there are multiple target operation objects, the process of performing side-window convolution calculations on the target operation objects by using the calculation side windows to obtain calculation results in four directions specifically includes:
利用左上方向的所述计算边窗对多个所述目标操作对象进行统一的边窗卷积计算,得到左上方向的多个计算结果;Performing a unified side window convolution calculation on multiple target operation objects by using the calculation side window in the upper left direction to obtain multiple calculation results in the upper left direction;
利用右上方向的所述计算边窗对多个所述目标操作对象进行统一的边窗卷积计算,得到右上方向的多个计算结果;Performing a unified side window convolution calculation on a plurality of the target operation objects by using the calculation side window in the upper right direction to obtain multiple calculation results in the upper right direction;
利用左下方向的所述计算边窗对多个所述目标操作对象进行统一的边窗卷积计算,得到左下方向的多个计算结果;Performing a unified side window convolution calculation on multiple target operation objects by using the calculation side window in the lower left direction to obtain multiple calculation results in the lower left direction;
利用右下方向的所述计算边窗对多个所述目标操作对象进行统一的边窗卷积计算,得到右下方向的多个计算结果。A unified side window convolution calculation is performed on a plurality of the target operation objects by using the calculation side window in the lower right direction to obtain a plurality of calculation results in the lower right direction.
优选的,所述对所述多个方向的计算结果进行交叉熵的最优化处理,得到所述目标操作对象的卷积结果的过程,具体包括:Preferably, the process of performing cross-entropy optimization processing on the calculation results in the multiple directions to obtain the convolution result of the target operation object specifically includes:
对所述多个方向的计算结果进行交叉熵的最优化处理,将交叉熵值最小的计算结果确定为所述目标操作对象的卷积结果。Perform cross-entropy optimization processing on the calculation results in the multiple directions, and determine the calculation result with the smallest cross-entropy value as the convolution result of the target operation object.
优选的,所述目标操作对象具体为边界输入特征和/或纹理输入特征。Preferably, the target operation object is specifically a boundary input feature and/or a texture input feature.
优选的,当卷积核为多个时,所述卷积处理方法还包括:对每个所述卷积核对应的卷积结果进行加权平均,得到所述目标操作对象的最终卷积结果。Preferably, when there are multiple convolution kernels, the convolution processing method further includes: performing a weighted average on the convolution result corresponding to each convolution kernel to obtain the final convolution result of the target operation object.
相应的,本申请还公开了一种应用于卷积神经网络的卷积处理系统,包括:Correspondingly, this application also discloses a convolution processing system applied to a convolutional neural network, including:
获取模块,用于获取目标操作对象;所述目标操作对象具体为输入特 征;The acquisition module is used to acquire a target operation object; the target operation object is specifically an input feature;
计算模块,用于对所述目标操作对象进行边窗卷积计算,得到多个方向的计算结果;A calculation module, configured to perform side window convolution calculation on the target operation object to obtain calculation results in multiple directions;
结果确定模块,用于对所述多个方向的计算结果进行交叉熵的最优化处理,得到所述目标操作对象的卷积结果。The result determination module is configured to perform cross-entropy optimization processing on the calculation results of the multiple directions to obtain the convolution result of the target operation object.
优选的,所述计算模块具体用于:Preferably, the calculation module is specifically used for:
确定四个计算边窗的方向分别为左上、右上、左下、右下;Determine the directions of the four computing side windows as upper left, upper right, lower left, and lower right;
分别利用四个所述计算边窗对所述目标操作对象进行边窗卷积计算,得到四个方向的计算结果。The four calculation side windows are used to perform side window convolution calculations on the target operation object to obtain calculation results in four directions.
相应的,本申请还公开了一种应用于卷积神经网络的卷积处理装置,包括:Correspondingly, this application also discloses a convolution processing device applied to a convolutional neural network, including:
存储器,用于存储计算机程序;Memory, used to store computer programs;
处理器,用于执行所述计算机程序时实现如上文所述应用于卷积神经网络的卷积处理方法的步骤。The processor is configured to implement the steps of the convolution processing method applied to the convolutional neural network as described above when the computer program is executed.
相应的,本申请还公开了一种可读存储介质,所述可读存储介质上存储有计算机程序,所述计算机程序被处理器执行时实现如上文所述应用于卷积神经网络的卷积处理方法的步骤。Correspondingly, the present application also discloses a readable storage medium having a computer program stored on the readable storage medium, and when the computer program is executed by a processor, the convolution applied to the convolutional neural network as described above is realized. Processing method steps.
本申请公开了一种应用于卷积神经网络的卷积处理方法,包括:获取目标操作对象;所述目标操作对象具体为输入特征;对所述目标操作对象进行边窗卷积计算,得到多个方向的计算结果;对所述多个方向的计算结果进行交叉熵的最优化处理,得到所述目标操作对象的卷积结果。本申请解决了原有卷积神经网络中以卷积核为中心点做卷积操作时带来的性能损失,通过综合分析多个方向上的边窗卷积操作,提高了卷积操作获取数据更泛化特征的能力,以此提高了卷积神经网络的性能。The present application discloses a convolution processing method applied to a convolutional neural network, including: obtaining a target operation object; the target operation object is specifically an input feature; performing side window convolution calculation on the target operation object to obtain multiple Calculation results in three directions; performing cross-entropy optimization processing on the calculation results in the multiple directions to obtain the convolution result of the target operation object. This application solves the performance loss caused by the convolution operation with the convolution kernel as the center point in the original convolutional neural network. By comprehensively analyzing the side window convolution operation in multiple directions, the data obtained by the convolution operation is improved The ability to generalize features improves the performance of convolutional neural networks.
附图说明Description of the drawings
图1为本申请实施例中一种应用于卷积神经网络的卷积处理方法的步骤流程图;FIG. 1 is a flowchart of steps of a convolution processing method applied to a convolutional neural network in an embodiment of the application;
图2a为本申请实施例中边窗的图像示意图;Fig. 2a is a schematic diagram of an image of a side window in an embodiment of the application;
图2b、图2c、图2d分别为本申请实施例中不同方向的边窗的图像示意图;2b, 2c, and 2d are respectively schematic diagrams of images of side windows in different directions in an embodiment of the application;
图3为本申请实施例中一种具体的应用于卷积神经网络的卷积处理方法的步骤流程图;FIG. 3 is a flowchart of steps of a specific convolution processing method applied to a convolutional neural network in an embodiment of the application;
图4为本申请实施例中一种应用于卷积神经网络的卷积处理系统的结构分布图;4 is a structural distribution diagram of a convolution processing system applied to a convolutional neural network in an embodiment of the application;
图5为本申请实施例中一种应用于卷积神经网络的卷积处理装置的结构分布图。FIG. 5 is a structural distribution diagram of a convolution processing device applied to a convolutional neural network in an embodiment of the application.
具体实施方式Detailed ways
下面将结合本申请中的说明书附图,对申请中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The technical solutions in the application will be clearly and completely described below in conjunction with the drawings in the specification of the application. Obviously, the described embodiments are only a part of the embodiments of the application, rather than all the embodiments. Based on the embodiments in this application, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of this application.
实施例一Example one
由于传统的卷积操作时选择了卷积核中心点作相应的乘加运算,当某个像素在边界上,将窗口中心放在像素上做卷积操作会模糊边缘,这样会降低特征的可分辨性,再加上卷积神经网络通常有很多层,每层有多个filter,层与层之间连接成有向无环图,这样的以中心位置的卷积会加剧可分辨性下降的情况,从而降低卷积神经网络的性能。而本申请解决了原有卷积神经网络中以卷积核为中心点做卷积操作时带来的性能损失,通过综合分析多个方向上的边窗卷积操作,提高了卷积操作获取数据更泛化特征的能力,以此提高了卷积神经网络的性能。Because the center point of the convolution kernel is selected for the corresponding multiplication and addition operation in the traditional convolution operation, when a pixel is on the boundary, placing the center of the window on the pixel for the convolution operation will blur the edge, which will reduce the feature’s reliability. Resolvability, coupled with the convolutional neural network usually has many layers, each layer has multiple filters, and the layers are connected to form a directed acyclic graph. Such a convolution at the center position will aggravate the decrease in resolvability Circumstance, thereby reducing the performance of the convolutional neural network. This application solves the performance loss caused by the convolution operation with the convolution kernel as the center point in the original convolutional neural network. By comprehensively analyzing the side window convolution operation in multiple directions, it improves the convolution operation acquisition The ability of data to generalize features, thereby improving the performance of convolutional neural networks.
本申请实施例公开了一种应用于卷积神经网络的卷积处理方法,参见图1所示,包括:The embodiment of the application discloses a convolution processing method applied to a convolutional neural network, as shown in FIG. 1, including:
S11:获取目标操作对象;所述目标操作对象具体为输入特征;S11: Obtain a target operation object; the target operation object is specifically an input feature;
可以理解的是,该卷积处理方法适用于所有卷积神经网络内的输入特征上,这里的输入特征既包括初始输入层的第一层输入特征,例如图像像素,也包括神经网络中的隐藏层、输出层涉及的输入特征,如底层的细粒 度特征、高层的语义特征等。It can be understood that the convolution processing method is applicable to all input features in the convolutional neural network, where the input features include both the first layer input features of the initial input layer, such as image pixels, and also the hiding in the neural network. The input features involved in the layer and output layer, such as the fine-grained features of the bottom layer, and the semantic features of the high layer.
S12:对所述目标操作对象进行边窗卷积计算,得到多个方向的计算结果;S12: Perform side window convolution calculation on the target operation object to obtain calculation results in multiple directions;
可以理解的是,边窗(side window)定义如图2a所示,其中θ为window与水平线之间角度,r为window的半径,ρ∈{0,r},{x,y}为目标像素i的位置,r是用户定义的参数,用来控制所有的边窗,通过改变θ和{x,y}值,window的方向以及与之对应目标像素i可控制。为了简化在连续空间的计算量,通常仅计算在离散空间8个方向边窗,令θ=k×π/2,k∈[0,3],当ρ=r时,可以得到上下左右四个方向上的边窗,分别用大写字母U(up)、D(down)、L(left)、R(right)表示,如图2b、图2c所示;当ρ=0时,可以得到左上、右上、左下、右下四个方向的边窗,分别用字母NW(northwest)、NE(northeast)、SE(southeast)、SW(southwest)表示,如图2d所示。在每一个边窗计算卷积操作,可以获取到8个方向的输出,作为该方向的计算结果。It is understandable that the definition of a side window is shown in Figure 2a, where θ is the angle between the window and the horizontal line, r is the radius of the window, ρ∈{0,r}, {x,y} is the target pixel The position of i, r is a user-defined parameter used to control all side windows. By changing the values of θ and {x, y}, the direction of the window and the corresponding target pixel i can be controlled. In order to simplify the calculation in continuous space, usually only 8 directional side windows in discrete space are calculated, let θ=k×π/2, k∈[0,3], when ρ=r, four up, down, left, and right can be obtained The side windows in the direction are represented by capital letters U (up), D (down), L (left), and R (right), as shown in Figure 2b and Figure 2c; when ρ = 0, you can get the upper left, The side windows in the upper right, lower left, and lower right directions are represented by the letters NW (northwest), NE (northeast), SE (southeast), and SW (southwest) respectively, as shown in Figure 2d. The convolution operation is calculated in each side window, and the output of 8 directions can be obtained as the calculation result of that direction.
S13:对所述多个方向的计算结果进行交叉熵的最优化处理,得到所述目标操作对象的卷积结果。S13: Perform cross-entropy optimization processing on the calculation results in the multiple directions to obtain the convolution result of the target operation object.
该步骤具体包括:对所述多个方向的计算结果进行交叉熵的最优化处理,将交叉熵值最小的计算结果确定为所述目标操作对象的卷积结果。This step specifically includes: performing cross-entropy optimization processing on the calculation results in the multiple directions, and determining the calculation result with the smallest cross-entropy value as the convolution result of the target operation object.
可以理解的是,该步骤中对多个方向的计算结果进行交叉熵的最优化处理,可以替换为L2范数或其他聚类方法来度量最后的输出,以确定最后的卷积结果。由于最终的卷积效果是综合分析多个方向的卷积结果得出的,因此更能反映出目标特性,在后续卷积神经网络的不断处理中,可以获取到更加泛化的特征,从而提高卷积神经网络的学习能力。It is understandable that, in this step, the cross-entropy optimization processing is performed on the calculation results in multiple directions, which can be replaced with L2 norm or other clustering methods to measure the final output to determine the final convolution result. Since the final convolution effect is obtained by comprehensively analyzing the convolution results in multiple directions, it can better reflect the target characteristics. In the continuous processing of the subsequent convolutional neural network, more generalized features can be obtained, thereby improving The learning ability of convolutional neural networks.
但是在训练过程中,由于卷积神经网络中需要基于目标函数或损失函数求取一阶偏导数,而基于L2范数的边窗卷积计算算法可能会导致算法收敛速度变慢,例如基于sigmoid非激活函数,当输入数据数值过大或过小时,一阶偏导数趋于零。因此本申请中选择了学习速度有明显优势的交叉熵的最优化处理,基于交叉熵的最优化,选择当前处理点中最小的交叉熵值对应的计算结果。However, in the training process, because the convolutional neural network needs to obtain the first-order partial derivative based on the objective function or the loss function, the side window convolution calculation algorithm based on the L2 norm may cause the algorithm to converge slowly, for example, based on sigmoid Inactive function, when the input data value is too large or too small, the first-order partial derivative tends to zero. Therefore, in this application, the optimization process of cross entropy with obvious advantages in learning speed is selected, and based on the optimization of cross entropy, the calculation result corresponding to the smallest cross entropy value among the current processing points is selected.
进一步的,所述目标操作对象具体为边界输入特征和/或纹理输入特征。此外,本实施例中目标操作对象还可以是可进行卷积处理操作的其他类型的输入特征。Further, the target operation object is specifically a boundary input feature and/or a texture input feature. In addition, the target operation object in this embodiment may also be other types of input features that can perform convolution processing operations.
可以理解的是,上述说明是以卷积核filter数目为1时的卷积处理方法,当卷积核filter为多个时,对每个卷积核对应的卷积结果进行加权平均,得到目标操作对象的最终卷积结果。具体的,按照caffe(Convolutional Architecture for Fast Feature Embedding)的数据格式nchw,其中n为batch批量数据,c为channel输入数据的通道数,h为hight高度,w为width宽度。即假设输入数据的形状shape为[1,384,13,13],卷积核的卷积核大小kernel_size为3,填充pad为1,卷积步长stride为1,卷积核的数目num_out为256,由于pad为1,stride为1,因此通过卷积操作得到输出数据的shape为[1,256,13,13]。在应用本实施例的卷积处理方法时,将输入对象作为目标操作对象,对输入数据做8个方向的边窗卷积计算时,得到8个方向的计算结果,再进行最优化处理,可获取到当前通道当前点上的边窗卷积结果(shape为[1,1,1,1]);由于输入数据的通道数为384,在当前点的其他通道上,按照上述步骤,计算其他通道边窗卷积结果(shape为[1,384,1,1]);将当前所有通道上的边窗卷积结果加权平均,得到当前点的输出结果(shape为[1,1,1,1]);紧接着,按照上述3个步骤,计算输入数据其他点在当前filer中的边窗卷积结果(shape为[1,1,13,13]);最后,由于filter数目为256,计算在其他filter中边窗卷积结果(shape为[1,256,13,13]),从而得到整个输入对象的最终卷积结果。It is understandable that the above description is based on the convolution processing method when the number of convolution kernel filters is 1. When there are multiple convolution kernel filters, the convolution results corresponding to each convolution kernel are weighted and averaged to obtain the target The final convolution result of the operation object. Specifically, according to the data format nchw of caffe (Convolutional Architecture for Fast Feature Embedding), where n is batch data, c is the number of channels of channel input data, h is hight height, and w is width width. That is, assuming that the shape of the input data is [1,384,13,13], the convolution kernel size kernel_size of the convolution kernel is 3, the padding is 1, the convolution step stride is 1, and the number of convolution kernels num_out is 256. Since pad is 1, stride is 1, the shape of the output data obtained by the convolution operation is [1, 256, 13, 13]. When applying the convolution processing method of this embodiment, the input object is taken as the target operation object, and when the side window convolution calculation in 8 directions is performed on the input data, the calculation results in 8 directions are obtained, and then the optimization processing can be performed. Obtain the convolution result of the side window on the current point of the current channel (shape is [1,1,1,1]); since the number of channels of the input data is 384, on other channels of the current point, follow the above steps to calculate other Channel side window convolution result (shape is [1,384,1,1]); the current side window convolution results on all channels are weighted and averaged to obtain the output result of the current point (shape is [1,1,1, 1]); Next, according to the above 3 steps, calculate the side window convolution results of other points of the input data in the current filer (shape is [1,1,13,13]); finally, because the number of filters is 256, Calculate the convolution result of the side window in other filters (shape is [1,256,13,13]) to obtain the final convolution result of the entire input object.
本实施例公开了一种应用于卷积神经网络的卷积处理方法,包括:获取目标操作对象;所述目标操作对象具体为输入特征;对所述目标操作对象进行边窗卷积计算,得到多个方向的计算结果;对所述多个方向的计算结果进行交叉熵的最优化处理,得到所述目标操作对象的卷积结果。本实施例解决了原有卷积神经网络中以卷积核为中心点做卷积操作时带来的性能损失,通过综合分析多个方向上的边窗卷积操作,提高了卷积操作获取数据更泛化特征的能力,以此提高了卷积神经网络的性能。This embodiment discloses a convolution processing method applied to a convolutional neural network, including: obtaining a target operation object; the target operation object is specifically an input feature; performing side window convolution calculation on the target operation object to obtain Calculation results in multiple directions; performing cross-entropy optimization processing on the calculation results in the multiple directions to obtain the convolution result of the target operation object. This embodiment solves the performance loss caused by the convolution operation with the convolution kernel as the center point in the original convolutional neural network. By comprehensively analyzing the side window convolution operation in multiple directions, the acquisition of the convolution operation is improved. The ability of data to generalize features, thereby improving the performance of convolutional neural networks.
实施例二Example two
本申请实施例公开了一种具体的应用于卷积神经网络的卷积处理方法,相对于上一实施例,本实施例对技术方案作了进一步的说明和优化。具体的,参见图3所示:The embodiment of the present application discloses a specific convolution processing method applied to a convolutional neural network. Compared with the previous embodiment, this embodiment further explains and optimizes the technical solution. Specifically, see Figure 3:
S21:获取目标操作对象;所述目标操作对象具体为输入特征;S21: Acquire a target operation object; the target operation object is specifically an input feature;
S22:确定四个计算边窗的方向分别为左上、右上、左下、右下;S22: Determine the directions of the four calculation side windows as upper left, upper right, lower left, and lower right;
可以理解的是,边窗的选择需要权衡目标特性和计算量,上一实施例中边窗卷积计算存在大量的重复性计算,速度上可能会大打折扣,基于此点进行改进,本实施例中只选择了图2d中的四个方向,在保证边窗卷积计算效果的同时降低了计算量。It is understandable that the selection of the side window needs to weigh the target characteristics and the amount of calculation. In the previous embodiment, the side window convolution calculation has a large number of repetitive calculations, and the speed may be greatly reduced. Based on this point, improvements are made. This embodiment Only the four directions in Figure 2d are selected in Figure 2d, which reduces the amount of calculation while ensuring the calculation effect of the side window convolution.
S23:分别利用四个所述计算边窗对所述目标操作对象进行边窗卷积计算,得到四个方向的计算结果。S23: Perform a side window convolution calculation on the target operation object by using the four calculation side windows to obtain calculation results in four directions.
具体的,该步骤可以按照以下方法执行:Specifically, this step can be performed as follows:
利用左上方向的所述计算边窗对多个所述目标操作对象进行统一的边窗卷积计算,得到左上方向的多个计算结果;Performing a unified side window convolution calculation on multiple target operation objects by using the calculation side window in the upper left direction to obtain multiple calculation results in the upper left direction;
利用右上方向的所述计算边窗对多个所述目标操作对象进行统一的边窗卷积计算,得到右上方向的多个计算结果;Performing a unified side window convolution calculation on a plurality of the target operation objects by using the calculation side window in the upper right direction to obtain multiple calculation results in the upper right direction;
利用左下方向的所述计算边窗对多个所述目标操作对象进行统一的边窗卷积计算,得到左下方向的多个计算结果;Performing a unified side window convolution calculation on multiple target operation objects by using the calculation side window in the lower left direction to obtain multiple calculation results in the lower left direction;
利用右下方向的所述计算边窗对多个所述目标操作对象进行统一的边窗卷积计算,得到右下方向的多个计算结果。A unified side window convolution calculation is performed on a plurality of the target operation objects by using the calculation side window in the lower right direction to obtain a plurality of calculation results in the lower right direction.
可以理解的是,为了充分利用显卡或CPU的性能,调整多个目标操作对象的操作顺序,先统一计算同一个方向所有的边窗卷积,然后计算另一个方向的边窗卷积,这样可以充分利用显卡性能,提高计算速度。因此本实施例并非限定了计算顺序为左上、右上、左下、右下,该计算顺序仅为举例,以其他计算顺序进行计算,只要在同一方向进行了所有的边窗卷积计算,均可实现本实施例提高计算速度的效果。It is understandable that, in order to make full use of the performance of the graphics card or CPU, adjust the operation sequence of multiple target operation objects, first calculate all the side window convolutions in the same direction, and then calculate the side window convolution in the other direction, so that you can Make full use of graphics card performance to improve calculation speed. Therefore, this embodiment does not limit the calculation order to upper left, upper right, lower left, and lower right. The calculation order is only an example. Calculations are performed in other calculation orders, as long as all side window convolution calculations are performed in the same direction. This embodiment has the effect of improving the calculation speed.
S24:对所述多个方向的计算结果进行交叉熵的最优化处理,得到所述目标操作对象的卷积结果。S24: Perform cross-entropy optimization processing on the calculation results in the multiple directions to obtain the convolution result of the target operation object.
相对于现有技术中卷积神经网络的卷积操作,本实施例将之前已卷积 核为中心的一次卷积计算,替换为多个方向的边窗卷积计算,通过交叉熵的最优化处理得到最终卷积结果,充分利用卷积神经网络局部感受野特性,使得提取到的特征更能反映目标的本身特性,特征的泛化性能更强,进一步来讲,本实施例中的卷积可以提取到更为泛化的特征,因此可以设计出相对比较浅且比较窄的网络,这样能够提高卷积神经网络的性能,降低模型的参数量。Compared with the convolution operation of the convolutional neural network in the prior art, this embodiment replaces the previous convolution calculation centered on the convolution kernel with the side window convolution calculation in multiple directions, and optimizes the cross-entropy. The final convolution result is processed, and the local receptive field characteristics of the convolutional neural network are fully utilized, so that the extracted features can better reflect the characteristics of the target, and the generalization performance of the features is stronger. Furthermore, the convolution in this embodiment More generalized features can be extracted, so a relatively shallow and narrow network can be designed, which can improve the performance of the convolutional neural network and reduce the amount of model parameters.
实施例三Example three
相应的,本申请还公开了一种应用于卷积神经网络的卷积处理系统,参见图4所示,包括:Correspondingly, this application also discloses a convolution processing system applied to a convolutional neural network, as shown in FIG. 4, including:
获取模块01,用于获取目标操作对象;所述目标操作对象具体为输入特征;The obtaining module 01 is used to obtain a target operation object; the target operation object is specifically an input feature;
计算模块02,用于对所述目标操作对象进行边窗卷积计算,得到多个方向的计算结果;The calculation module 02 is configured to perform side window convolution calculation on the target operation object to obtain calculation results in multiple directions;
结果确定模块03,用于对所述多个方向的计算结果进行交叉熵的最优化处理,得到所述目标操作对象的卷积结果。The result determination module 03 is configured to perform cross-entropy optimization processing on the calculation results in the multiple directions to obtain the convolution result of the target operation object.
在一些具体的实施例中,所述计算模块02具体用于:In some specific embodiments, the calculation module 02 is specifically configured to:
确定四个计算边窗的方向分别为左上、右上、左下、右下;Determine the directions of the four computing side windows as upper left, upper right, lower left, and lower right;
分别利用四个所述计算边窗对所述目标操作对象进行边窗卷积计算,得到四个方向的计算结果。The four calculation side windows are used to perform side window convolution calculations on the target operation object to obtain calculation results in four directions.
在一些具体的实施例中,计算模块02具体用于:In some specific embodiments, the calculation module 02 is specifically used for:
优选的,当所述目标操作对象为多个,所述分别利用所述计算边窗对所述目标操作对象进行边窗卷积计算,得到四个方向的计算结果的过程,具体包括:Preferably, when there are multiple target operation objects, the process of performing side-window convolution calculations on the target operation objects by using the calculation side windows to obtain calculation results in four directions specifically includes:
利用左上方向的所述计算边窗对多个所述目标操作对象进行统一的边窗卷积计算,得到左上方向的多个计算结果;Performing a unified side window convolution calculation on multiple target operation objects by using the calculation side window in the upper left direction to obtain multiple calculation results in the upper left direction;
利用右上方向的所述计算边窗对多个所述目标操作对象进行统一的边窗卷积计算,得到右上方向的多个计算结果;Performing a unified side window convolution calculation on a plurality of the target operation objects by using the calculation side window in the upper right direction to obtain multiple calculation results in the upper right direction;
利用左下方向的所述计算边窗对多个所述目标操作对象进行统一的边窗卷积计算,得到左下方向的多个计算结果;Performing a unified side window convolution calculation on multiple target operation objects by using the calculation side window in the lower left direction to obtain multiple calculation results in the lower left direction;
利用右下方向的所述计算边窗对多个所述目标操作对象进行统一的边窗卷积计算,得到右下方向的多个计算结果。A unified side window convolution calculation is performed on a plurality of the target operation objects by using the calculation side window in the lower right direction to obtain multiple calculation results in the lower right direction.
在一些具体的实施例中,所述结果确定模块03具体用于:In some specific embodiments, the result determining module 03 is specifically configured to:
对所述多个方向的计算结果进行交叉熵的最优化处理,将交叉熵值最小的计算结果确定为所述目标操作对象的卷积结果。Perform cross-entropy optimization processing on the calculation results in the multiple directions, and determine the calculation result with the smallest cross-entropy value as the convolution result of the target operation object.
在一些具体的实施例中,所述目标操作对象具体为边界输入特征和/或纹理输入特征。In some specific embodiments, the target operation object is specifically a boundary input feature and/or a texture input feature.
在一些具体的实施例中,所述卷积处理系统还包括:加权平均模块,用于当卷积核为多个时,对每个所述卷积核对应的卷积结果进行加权平均,得到所述目标操作对象的最终卷积结果。In some specific embodiments, the convolution processing system further includes: a weighted average module, configured to perform a weighted average on the convolution result corresponding to each convolution kernel when there are multiple convolution kernels to obtain The final convolution result of the target operation object.
本申请实施例解决了原有卷积神经网络中以卷积核为中心点做卷积操作时带来的性能损失,通过综合分析多个方向上的边窗卷积操作,提高了卷积操作获取数据更泛化特征的能力,以此提高了卷积神经网络的性能。The embodiment of the application solves the performance loss caused by the convolution operation with the convolution kernel as the center point in the original convolutional neural network. By comprehensively analyzing the side window convolution operation in multiple directions, the convolution operation is improved. The ability to obtain more generalized features of the data improves the performance of the convolutional neural network.
实施例四Example four
相应的,本申请还公开了一种应用于卷积神经网络的卷积处理装置,参见图5所示,包括处理器11和存储器12;其中,所述处理11执行所述存储器12中保存的计算机程序时实现以下步骤:Correspondingly, this application also discloses a convolution processing device applied to a convolutional neural network. As shown in FIG. 5, it includes a processor 11 and a memory 12; wherein, the processing 11 executes the data stored in the memory 12 The computer program implements the following steps:
获取目标操作对象;所述目标操作对象具体为输入特征;Acquiring a target operation object; the target operation object is specifically an input feature;
对所述目标操作对象进行边窗卷积计算,得到多个方向的计算结果;Perform side window convolution calculation on the target operation object to obtain calculation results in multiple directions;
对所述多个方向的计算结果进行交叉熵的最优化处理,得到所述目标操作对象的卷积结果。Perform cross-entropy optimization processing on the calculation results in the multiple directions to obtain the convolution result of the target operation object.
本申请实施例解决了原有卷积神经网络中以卷积核为中心点做卷积操作时带来的性能损失,通过综合分析多个方向上的边窗卷积操作,提高了卷积操作获取数据更泛化特征的能力,以此提高了卷积神经网络的性能。The embodiment of the application solves the performance loss caused by the convolution operation with the convolution kernel as the center point in the original convolutional neural network. By comprehensively analyzing the side window convolution operation in multiple directions, the convolution operation is improved. The ability to obtain more generalized features of the data improves the performance of the convolutional neural network.
在一些具体的实施例中,所述处理器11执行所述存储器12中保存的计算机子程序时,具体可以实现以下步骤:In some specific embodiments, when the processor 11 executes the computer subprogram stored in the memory 12, the following steps may be specifically implemented:
确定四个计算边窗的方向分别为左上、右上、左下、右下;Determine the directions of the four computing side windows as upper left, upper right, lower left, and lower right;
分别利用四个所述计算边窗对所述目标操作对象进行边窗卷积计算,得到四个方向的计算结果。The four calculation side windows are used to perform side window convolution calculations on the target operation object to obtain calculation results in four directions.
在一些具体的实施例中,所述处理器11执行所述存储器12中保存的计算机子程序时,具体可以实现以下步骤:In some specific embodiments, when the processor 11 executes the computer subprogram stored in the memory 12, the following steps may be specifically implemented:
利用左上方向的所述计算边窗对多个所述目标操作对象进行统一的边窗卷积计算,得到左上方向的多个计算结果;Performing a unified side window convolution calculation on multiple target operation objects by using the calculation side window in the upper left direction to obtain multiple calculation results in the upper left direction;
利用右上方向的所述计算边窗对多个所述目标操作对象进行统一的边窗卷积计算,得到右上方向的多个计算结果;Performing a unified side window convolution calculation on a plurality of the target operation objects by using the calculation side window in the upper right direction to obtain multiple calculation results in the upper right direction;
利用左下方向的所述计算边窗对多个所述目标操作对象进行统一的边窗卷积计算,得到左下方向的多个计算结果;Performing a unified side window convolution calculation on multiple target operation objects by using the calculation side window in the lower left direction to obtain multiple calculation results in the lower left direction;
利用右下方向的所述计算边窗对多个所述目标操作对象进行统一的边窗卷积计算,得到右下方向的多个计算结果。A unified side window convolution calculation is performed on a plurality of the target operation objects by using the calculation side window in the lower right direction to obtain a plurality of calculation results in the lower right direction.
在一些具体的实施例中,所述处理器11执行所述存储器12中保存的计算机子程序时,具体可以实现以下步骤:In some specific embodiments, when the processor 11 executes the computer subprogram stored in the memory 12, the following steps may be specifically implemented:
对所述多个方向的计算结果进行交叉熵的最优化处理,将交叉熵值最小的计算结果确定为所述目标操作对象的卷积结果。Perform cross-entropy optimization processing on the calculation results in the multiple directions, and determine the calculation result with the smallest cross-entropy value as the convolution result of the target operation object.
在一些具体的实施例中,所述目标操作对象具体为边界输入特征和/或纹理输入特征。In some specific embodiments, the target operation object is specifically a boundary input feature and/or a texture input feature.
在一些具体的实施例中,所述处理器11执行所述存储器12中保存的计算机子程序时,具体可以实现以下步骤:In some specific embodiments, when the processor 11 executes the computer subprogram stored in the memory 12, the following steps may be specifically implemented:
当卷积核为多个时,对每个所述卷积核对应的卷积结果进行加权平均,得到所述目标操作对象的最终卷积结果。When there are multiple convolution kernels, the convolution result corresponding to each convolution kernel is weighted and averaged to obtain the final convolution result of the target operation object.
进一步的,本实施例中的卷积处理装置,还可以包括:Further, the convolution processing device in this embodiment may further include:
输入接口13,用于获取外界导入的计算机程序,并将获取到的计算机程序保存至所述存储器12中,还可以用于获取外界终端设备传输的各种指令和参数,并传输至处理器11中,以便处理器11利用上述各种指令和参数展开相应的处理。本实施例中,所述输入接口13具体可以包括但不限于USB接口、串行接口、语音输入接口、指纹输入接口、硬盘读取接口等。The input interface 13 is used to obtain a computer program imported from the outside world, and save the obtained computer program in the memory 12, and can also be used to obtain various instructions and parameters transmitted by external terminal devices, and transmit them to the processor 11 , So that the processor 11 uses the above-mentioned various instructions and parameters to carry out corresponding processing. In this embodiment, the input interface 13 may specifically include, but is not limited to, a USB interface, a serial interface, a voice input interface, a fingerprint input interface, a hard disk reading interface, and the like.
输出接口14,用于将处理器11产生的各种数据输出至与其相连的终端设备,以便于与输出接口14相连的其他终端设备能够获取到处理器11产生的各种数据。本实施例中,所述输出接口14具体可以包括但不限于USB接 口、串行接口等。The output interface 14 is used to output various data generated by the processor 11 to a terminal device connected to it, so that other terminal devices connected to the output interface 14 can obtain various data generated by the processor 11. In this embodiment, the output interface 14 may specifically include, but is not limited to, a USB interface, a serial interface, and the like.
通讯单元15,用于在卷积处理装置和外部服务器之间建立远程通讯连接,以便于卷积处理装置能够将镜像文件挂载到外部服务器中。本实施例中,通讯单元15具体可以包括但不限于基于无线通讯技术或有线通讯技术的远程通讯单元。The communication unit 15 is used to establish a remote communication connection between the convolution processing device and the external server, so that the convolution processing device can mount the image file to the external server. In this embodiment, the communication unit 15 may specifically include, but is not limited to, a remote communication unit based on wireless communication technology or wired communication technology.
键盘16,用于获取用户通过实时敲击键帽而输入的各种参数数据或指令。The keyboard 16 is used to obtain various parameter data or instructions input by the user by hitting the keycap in real time.
显示器17,用于对卷积处理过程的相关信息进行实时显示,以便于用户及时地了解当前卷积神经网络的处理情况。The display 17 is used for real-time display of relevant information of the convolution processing process, so that the user can understand the current processing situation of the convolution neural network in time.
鼠标18,可以用于协助用户输入数据并简化用户的操作。The mouse 18 can be used to assist the user in inputting data and simplify the user's operation.
实施例五Example five
进一步的,本申请实施例还公开了一种计算机可读存储介质,这里所说的计算机可读存储介质包括随机存储器(RAM)、内存、只读存储器(ROM)、电可编程ROM、电可擦除可编程ROM、寄存器、硬盘、可移动硬盘、CD-ROM或技术领域内所公知的任意其他形式的存储介质。计算机可读存储介质中存储有计算机程序,所述计算机程序被处理器执行时实现以下步骤:Further, the embodiment of the present application also discloses a computer-readable storage medium. The computer-readable storage medium mentioned here includes random access memory (RAM), internal memory, read-only memory (ROM), electrically programmable ROM, and Erase programmable ROM, register, hard disk, removable hard disk, CD-ROM or any other form of storage medium known in the technical field. A computer program is stored in the computer-readable storage medium, and when the computer program is executed by a processor, the following steps are implemented:
获取目标操作对象;所述目标操作对象具体为输入特征;Acquiring a target operation object; the target operation object is specifically an input feature;
对所述目标操作对象进行边窗卷积计算,得到多个方向的计算结果;Perform side window convolution calculation on the target operation object to obtain calculation results in multiple directions;
对所述多个方向的计算结果进行交叉熵的最优化处理,得到所述目标操作对象的卷积结果。Perform cross-entropy optimization processing on the calculation results in the multiple directions to obtain the convolution result of the target operation object.
本申请实施例解决了原有卷积神经网络中以卷积核为中心点做卷积操作时带来的性能损失,通过综合分析多个方向上的边窗卷积操作,提高了卷积操作获取数据更泛化特征的能力,以此提高了卷积神经网络的性能。The embodiment of the application solves the performance loss caused by the convolution operation with the convolution kernel as the center point in the original convolutional neural network. By comprehensively analyzing the side window convolution operation in multiple directions, the convolution operation is improved. The ability to obtain more generalized features of the data improves the performance of the convolutional neural network.
在一些具体的实施例中,所述计算机可读存储介质中存储的计算机子程序被处理器执行时,具体可以实现以下步骤:In some specific embodiments, when the computer subprogram stored in the computer-readable storage medium is executed by the processor, the following steps may be specifically implemented:
确定四个计算边窗的方向分别为左上、右上、左下、右下;Determine the directions of the four computing side windows as upper left, upper right, lower left, and lower right;
分别利用四个所述计算边窗对所述目标操作对象进行边窗卷积计算,得到四个方向的计算结果。The four calculation side windows are used to perform side window convolution calculations on the target operation object to obtain calculation results in four directions.
在一些具体的实施例中,所述计算机可读存储介质中存储的计算机子程序被处理器执行时,具体可以实现以下步骤:In some specific embodiments, when the computer subprogram stored in the computer-readable storage medium is executed by the processor, the following steps may be specifically implemented:
利用左上方向的所述计算边窗对多个所述目标操作对象进行统一的边窗卷积计算,得到左上方向的多个计算结果;Performing a unified side window convolution calculation on multiple target operation objects by using the calculation side window in the upper left direction to obtain multiple calculation results in the upper left direction;
利用右上方向的所述计算边窗对多个所述目标操作对象进行统一的边窗卷积计算,得到右上方向的多个计算结果;Performing a unified side window convolution calculation on a plurality of the target operation objects by using the calculation side window in the upper right direction to obtain multiple calculation results in the upper right direction;
利用左下方向的所述计算边窗对多个所述目标操作对象进行统一的边窗卷积计算,得到左下方向的多个计算结果;Performing a unified side window convolution calculation on multiple target operation objects by using the calculation side window in the lower left direction to obtain multiple calculation results in the lower left direction;
利用右下方向的所述计算边窗对多个所述目标操作对象进行统一的边窗卷积计算,得到右下方向的多个计算结果。A unified side window convolution calculation is performed on a plurality of the target operation objects by using the calculation side window in the lower right direction to obtain a plurality of calculation results in the lower right direction.
在一些具体的实施例中,所述计算机可读存储介质中存储的计算机子程序被处理器执行时,具体可以实现以下步骤:In some specific embodiments, when the computer subprogram stored in the computer-readable storage medium is executed by the processor, the following steps may be specifically implemented:
对所述多个方向的计算结果进行交叉熵的最优化处理,将交叉熵值最小的计算结果确定为所述目标操作对象的卷积结果。Perform cross-entropy optimization processing on the calculation results in the multiple directions, and determine the calculation result with the smallest cross-entropy value as the convolution result of the target operation object.
在一些具体的实施例中,所述目标操作对象具体为边界输入特征和/或纹理输入特征。In some specific embodiments, the target operation object is specifically a boundary input feature and/or a texture input feature.
在一些具体的实施例中,所述计算机可读存储介质中存储的计算机子程序被处理器执行时,具体可以实现以下步骤:In some specific embodiments, when the computer subprogram stored in the computer-readable storage medium is executed by the processor, the following steps may be specifically implemented:
当卷积核为多个时,对每个所述卷积核对应的卷积结果进行加权平均,得到所述目标操作对象的最终卷积结果。When there are multiple convolution kernels, the convolution result corresponding to each convolution kernel is weighted and averaged to obtain the final convolution result of the target operation object.
最后,还需要说明的是,本领域技术人员可以理解上述实施例的各种方法中的全部或者部分步骤是可以通过程序来指令相关的硬件来完成的,该程序可以存储于一计算机可读存储单元中。本申请所述的所有实施例中所述的存储单元包括:只读存储器、随机存储器、磁盘或等等。Finally, it should be noted that those skilled in the art can understand that all or part of the steps in the various methods of the above-mentioned embodiments can be completed by instructing relevant hardware through a program, and the program can be stored in a computer-readable storage. Unit. The storage unit described in all the embodiments described in this application includes: read-only memory, random access memory, magnetic disk, or the like.
在本文中,诸如术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者设备所固有的要素。在没有更多限制的情况下, 由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者设备中还存在另外的相同要素。In this article, terms such as "include", "include" or any other variants thereof are intended to cover non-exclusive inclusion, so that a process, method, article or device including a series of elements not only includes those elements, but also includes no Other elements clearly listed, or they also include elements inherent to the process, method, article, or equipment. If there are no more restrictions, the element defined by the sentence "including a..." does not exclude the existence of other identical elements in the process, method, article, or equipment that includes the element.
本说明书中各个实施例采用递进的方式描述,每个实施例重点说明的都是与其他实施例的不同之处,各个实施例之间相同相似部分互相参见即可。The various embodiments in this specification are described in a progressive manner, and each embodiment focuses on the differences from other embodiments, and the same or similar parts between the various embodiments can be referred to each other.
对所公开的实施例的上述说明,使本领域专业技术人员能够实现或使用本申请。对这些实施例的多种修改对本领域的专业技术人员来说将是显而易见的,本文中所定义的一般原理可以在不脱离本申请的精神或范围的情况下,在其它实施例中实现。因此,本申请将不会被限制于本文所示的这些实施例,而是要符合与本文所公开的原理和新颖特点相一致的最宽的范围。The foregoing description of the disclosed embodiments enables those skilled in the art to implement or use this application. Various modifications to these embodiments will be obvious to those skilled in the art, and the general principles defined herein can be implemented in other embodiments without departing from the spirit or scope of the present application. Therefore, this application will not be limited to the embodiments shown in this document, but should conform to the widest scope consistent with the principles and novel features disclosed in this document.

Claims (10)

  1. 一种应用于卷积神经网络的卷积处理方法,其特征在于,包括:A convolution processing method applied to a convolutional neural network, which is characterized in that it includes:
    获取目标操作对象;所述目标操作对象具体为输入特征;Acquiring a target operation object; the target operation object is specifically an input feature;
    对所述目标操作对象进行边窗卷积计算,得到多个方向的计算结果;Perform side window convolution calculation on the target operation object to obtain calculation results in multiple directions;
    对所述多个方向的计算结果进行交叉熵的最优化处理,得到所述目标操作对象的卷积结果。Perform cross-entropy optimization processing on the calculation results in the multiple directions to obtain the convolution result of the target operation object.
  2. 根据权利要求1所述卷积处理方法,其特征在于,所述对所述目标操作对象进行边窗卷积计算,得到多个方向的计算结果的过程,具体包括:The convolution processing method according to claim 1, wherein the process of performing side window convolution calculation on the target operation object to obtain calculation results in multiple directions specifically includes:
    确定四个计算边窗的方向分别为左上、右上、左下、右下;Determine the directions of the four computing side windows as upper left, upper right, lower left, and lower right;
    分别利用四个所述计算边窗对所述目标操作对象进行边窗卷积计算,得到四个方向的计算结果。The four calculation side windows are used to perform side window convolution calculations on the target operation object to obtain calculation results in four directions.
  3. 根据权利要求2所述卷积处理方法,其特征在于,当所述目标操作对象为多个,所述分别利用所述计算边窗对所述目标操作对象进行边窗卷积计算,得到四个方向的计算结果的过程,具体包括:The convolution processing method according to claim 2, wherein when there are multiple target operation objects, the side window convolution calculation is performed on the target operation object by using the calculation side windows respectively, to obtain four The process of calculating the result of the direction includes:
    利用左上方向的所述计算边窗对多个所述目标操作对象进行统一的边窗卷积计算,得到左上方向的多个计算结果;Performing a unified side window convolution calculation on multiple target operation objects by using the calculation side window in the upper left direction to obtain multiple calculation results in the upper left direction;
    利用右上方向的所述计算边窗对多个所述目标操作对象进行统一的边窗卷积计算,得到右上方向的多个计算结果;Performing a unified side window convolution calculation on a plurality of the target operation objects by using the calculation side window in the upper right direction to obtain multiple calculation results in the upper right direction;
    利用左下方向的所述计算边窗对多个所述目标操作对象进行统一的边窗卷积计算,得到左下方向的多个计算结果;Performing a unified side window convolution calculation on multiple target operation objects by using the calculation side window in the lower left direction to obtain multiple calculation results in the lower left direction;
    利用右下方向的所述计算边窗对多个所述目标操作对象进行统一的边窗卷积计算,得到右下方向的多个计算结果。A unified side window convolution calculation is performed on a plurality of the target operation objects by using the calculation side window in the lower right direction to obtain a plurality of calculation results in the lower right direction.
  4. 根据权利要求1至3任一项所述卷积处理方法,其特征在于,所述对所述多个方向的计算结果进行交叉熵的最优化处理,得到所述目标操作对象的卷积结果的过程,具体包括:The convolution processing method according to any one of claims 1 to 3, wherein the cross-entropy optimization processing is performed on the calculation results of the multiple directions to obtain the convolution result of the target operation object The process includes:
    对所述多个方向的计算结果进行交叉熵的最优化处理,将交叉熵值最小的计算结果确定为所述目标操作对象的卷积结果。Perform cross-entropy optimization processing on the calculation results in the multiple directions, and determine the calculation result with the smallest cross-entropy value as the convolution result of the target operation object.
  5. 根据权利要求4所述卷积处理方法,其特征在于,所述目标操作对象具体为边界输入特征和/或纹理输入特征。The convolution processing method according to claim 4, wherein the target operation object is specifically a boundary input feature and/or a texture input feature.
  6. 根据权利要求5所述卷积处理方法,其特征在于,当卷积核为多个时,所述卷积处理方法还包括:The convolution processing method according to claim 5, wherein when there are multiple convolution kernels, the convolution processing method further comprises:
    对每个所述卷积核对应的卷积结果进行加权平均,得到所述目标操作对象的最终卷积结果。A weighted average is performed on the convolution result corresponding to each convolution kernel to obtain the final convolution result of the target operation object.
  7. 一种应用于卷积神经网络的卷积处理系统,其特征在于,包括:A convolution processing system applied to a convolutional neural network, which is characterized in that it comprises:
    获取模块,用于获取目标操作对象;所述目标操作对象具体为输入特征;The acquisition module is used to acquire a target operation object; the target operation object is specifically an input feature;
    计算模块,用于对所述目标操作对象进行边窗卷积计算,得到多个方向的计算结果;A calculation module, configured to perform side window convolution calculation on the target operation object to obtain calculation results in multiple directions;
    结果确定模块,用于对所述多个方向的计算结果进行交叉熵的最优化处理,得到所述目标操作对象的卷积结果。The result determination module is configured to perform cross-entropy optimization processing on the calculation results of the multiple directions to obtain the convolution result of the target operation object.
  8. 根据权利要求7所述卷积处理系统,其特征在于,所述计算模块具体用于:The convolution processing system according to claim 7, wherein the calculation module is specifically configured to:
    确定四个计算边窗的方向分别为左上、右上、左下、右下;Determine the directions of the four computing side windows as upper left, upper right, lower left, and lower right;
    分别利用四个所述计算边窗对所述目标操作对象进行边窗卷积计算,得到四个方向的计算结果。The four calculation side windows are used to perform side window convolution calculations on the target operation object to obtain calculation results in four directions.
  9. 一种应用于卷积神经网络的卷积处理装置,其特征在于,包括:A convolution processing device applied to a convolutional neural network, which is characterized in that it comprises:
    存储器,用于存储计算机程序;Memory, used to store computer programs;
    处理器,用于执行所述计算机程序时实现如权利要求1至6任一项所述应用于卷积神经网络的卷积处理方法的步骤。The processor is configured to implement the steps of the convolution processing method applied to a convolutional neural network according to any one of claims 1 to 6 when the computer program is executed.
  10. 一种可读存储介质,其特征在于,所述可读存储介质上存储有计算机程序,所述计算机程序被处理器执行时实现如权利要求1至6任一项所述应用于卷积神经网络的卷积处理方法的步骤。A readable storage medium, characterized in that a computer program is stored on the readable storage medium, and when the computer program is executed by a processor, it is applied to a convolutional neural network as claimed in any one of claims 1 to 6 The steps of the convolution processing method.
PCT/CN2019/121105 2019-10-11 2019-11-27 Convolution processing method and system applied to convolutional neural network, and related components WO2021068376A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910963711.1 2019-10-11
CN201910963711.1A CN110796250A (en) 2019-10-11 2019-10-11 Convolution processing method and system applied to convolutional neural network and related components

Publications (1)

Publication Number Publication Date
WO2021068376A1 true WO2021068376A1 (en) 2021-04-15

Family

ID=69439139

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/121105 WO2021068376A1 (en) 2019-10-11 2019-11-27 Convolution processing method and system applied to convolutional neural network, and related components

Country Status (2)

Country Link
CN (1) CN110796250A (en)
WO (1) WO2021068376A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111784610A (en) * 2020-07-02 2020-10-16 重庆邮电大学 Clustering-based side window filter optimization method
CN112288829A (en) * 2020-11-03 2021-01-29 中山大学 Compression method and device for image restoration convolutional neural network
CN112750142A (en) * 2021-01-26 2021-05-04 上海工程技术大学 Ultrasonic image segmentation system and method based on side window attention mechanism
CN116310543B (en) * 2023-03-14 2023-09-22 自然资源部第一海洋研究所 GF-1WFV satellite red tide deep learning detection model, construction method and equipment

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107437110A (en) * 2017-07-11 2017-12-05 中国科学院自动化研究所 The piecemeal convolution optimization method and device of convolutional neural networks
US20180373981A1 (en) * 2017-06-21 2018-12-27 TuSimple Method and device for optimizing neural network
CN109858618A (en) * 2019-03-07 2019-06-07 电子科技大学 The neural network and image classification method of a kind of convolutional Neural cell block, composition
CN109978137A (en) * 2019-03-20 2019-07-05 厦门美图之家科技有限公司 A kind of processing method of convolutional neural networks
CN110188869A (en) * 2019-05-05 2019-08-30 北京中科汇成科技有限公司 A kind of integrated circuit based on convolutional neural networks algorithm accelerates the method and system of calculating

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110263920B (en) * 2019-06-21 2021-08-10 北京石油化工学院 Convolutional neural network model, training method and device thereof, and routing inspection method and device thereof

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180373981A1 (en) * 2017-06-21 2018-12-27 TuSimple Method and device for optimizing neural network
CN107437110A (en) * 2017-07-11 2017-12-05 中国科学院自动化研究所 The piecemeal convolution optimization method and device of convolutional neural networks
CN109858618A (en) * 2019-03-07 2019-06-07 电子科技大学 The neural network and image classification method of a kind of convolutional Neural cell block, composition
CN109978137A (en) * 2019-03-20 2019-07-05 厦门美图之家科技有限公司 A kind of processing method of convolutional neural networks
CN110188869A (en) * 2019-05-05 2019-08-30 北京中科汇成科技有限公司 A kind of integrated circuit based on convolutional neural networks algorithm accelerates the method and system of calculating

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
YIN HUI; GONG YUANHAO; QIU GUOPING: "Side Window Filtering", 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), IEEE, 15 June 2019 (2019-06-15), pages 8750 - 8758, XP033686989, DOI: 10.1109/CVPR.2019.00896 *

Also Published As

Publication number Publication date
CN110796250A (en) 2020-02-14

Similar Documents

Publication Publication Date Title
WO2021068376A1 (en) Convolution processing method and system applied to convolutional neural network, and related components
CN109949255B (en) Image reconstruction method and device
US11798132B2 (en) Image inpainting method and apparatus, computer device, and storage medium
US11734851B2 (en) Face key point detection method and apparatus, storage medium, and electronic device
CN106780512B (en) Method, application and computing device for segmenting image
US10885660B2 (en) Object detection method, device, system and storage medium
WO2022199583A1 (en) Image processing method and apparatus, computer device, and storage medium
CN108520247A (en) To the recognition methods of the Object node in image, device, terminal and readable medium
CN109272016B (en) Target detection method, device, terminal equipment and computer readable storage medium
CN112651438A (en) Multi-class image classification method and device, terminal equipment and storage medium
CN110059750A (en) House type shape recognition process, device and equipment
WO2021164550A1 (en) Image classification method and apparatus
CN110941999B (en) Method for adaptively calculating size of Gaussian kernel in crowd counting system
CN110349082B (en) Image area clipping method and device, storage medium and electronic device
WO2017041488A1 (en) Fingerprint ridge point recognition method and apparatus
CN110490203A (en) Image partition method and device, electronic equipment and computer readable storage medium
CN111597845A (en) Two-dimensional code detection method, device and equipment and readable storage medium
JP2022168167A (en) Image processing method, device, electronic apparatus, and storage medium
CN111160372B (en) Large target identification method based on high-speed convolutional neural network
CN115601283B (en) Image enhancement method and device, computer equipment and computer readable storage medium
WO2018053710A1 (en) Morphological processing method of digital image and digital image processing device
CN106469437B (en) Image processing method and image processing apparatus
CN103077396B (en) The vector space Feature Points Extraction of a kind of coloured image and device
CN111191675A (en) Pedestrian attribute recognition model implementation method and related device
CN112560709B (en) Pupil detection method and system based on auxiliary learning

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19948265

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19948265

Country of ref document: EP

Kind code of ref document: A1

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205 DATED 07/10/2022)

122 Ep: pct application non-entry in european phase

Ref document number: 19948265

Country of ref document: EP

Kind code of ref document: A1