CN110766703A

CN110766703A - GPU-based intelligent image recognition method of computer Shader

Info

Publication number: CN110766703A
Application number: CN201910999880.0A
Authority: CN
Inventors: 夏磊; 尤海宁
Original assignee: Hefei Chengfang Information Technology Co Ltd
Current assignee: Hefei Chengfang Information Technology Co Ltd
Priority date: 2019-10-21
Filing date: 2019-10-21
Publication date: 2020-02-07

Abstract

The invention discloses an intelligent image recognition method of computer Shader based on a GPU, and relates to the technical field of image recognition. The invention comprises the following steps: extracting the features of the picture in a computer shader; carrying out normalization pretreatment on the picture; and training the extracted features in a GPU by using a BP neural network algorithm to obtain interconnection weights, so that one input sample corresponds to one expected output. According to the invention, the edge segmentation is carried out on the image in the GPU through the computer Shader technology, and then the image is identified by using the identification algorithm based on the wavelet distance characteristic and the neural network, so that the conversion times of data are reduced, any type of data can be directly calculated without being limited by the image rendering process, the data can be written in and written out at any time, the general calculation efficiency of the GPU is improved, and the image processing efficiency is improved.

Description

GPU-based intelligent image recognition method of computer Shader

Technical Field

The invention belongs to the technical field of image recognition, and particularly relates to an intelligent image recognition method of computer Shader based on a GPU.

Background

The prior intelligent image recognition method mainly comprises an image recognition method based on a neural network, an image recognition method based on wavelet moments, an image recognition method based on fractal characteristics and the like. The pattern recognition is realized based on a neural network method, some contents with complex environmental information can be processed, and a general neural network image recognition system consists of preprocessing, feature extraction and a neural network classifier. But the disadvantage is that the pattern classes which can be identified at present are not enough, and the relevant models are still being perfected.

The image identification method based on the wavelet moment comprises the following steps: the wavelet moment features have good resolution capability on samples with translation, scaling and rotation, the wavelet moment features can correctly resolve test samples under the condition of no noise, the recognition rate is superior to the geometric moment, and the difference reaches 30 percentage points. Not only the global features of the image but also the local features of the image can be obtained.

Before the invention of computer Shader, the GPU can not perform the complex algorithm. The GPU is a graphic processor, the conventional general computation of the GPU requires a programmer to firstly disguise data into an image which can be recognized by the GPU and then convert the image output by the GPU into a desired result, the writing method is complex and low in efficiency, and the computation consumes resources.

The CPU has a strong arithmetic logic unit, so that the operation delay is reduced; a huge Cache for reducing the delay of memory access; complex controllers use branch prediction to reduce branch latency and data forwarding to reduce data latency. CPUs are good at handling computational tasks with complex computational steps and complex data dependencies, such as distributed computing, data compression, artificial intelligence, physical simulation, and many other computational tasks.

GPUs have been generated for video games for historical reasons, and one type of operation that often occurs in three-dimensional games is to perform the same operation on a large amount of data, such as: and performing the same coordinate transformation on each vertex, and calculating color values of each vertex according to the same illumination model. The many-core architecture of the GPU is very suitable for sending the same instruction stream to many cores in parallel and executing by adopting different input data.

In contrast, the invention provides an intelligent image recognition method of the computer Shader based on the GPU, which is characterized in that the image is subjected to edge segmentation in the GPU through the computer Shader technology, and then the image is recognized by using a recognition algorithm based on wavelet distance features and a neural network, so that the conversion times of data are reduced, any type of data can be directly calculated, the data is not limited by a graph rendering process, the data can be written in and written out at any time, the general calculation efficiency of the GPU is improved, and the image processing efficiency is improved.

Disclosure of Invention

The invention aims to provide an intelligent image recognition method of computer Shader based on a GPU, which is characterized in that the edge segmentation is carried out on an image in the GPU through the computer Shader technology, and then the image is recognized by using a recognition algorithm based on wavelet distance characteristics and a neural network, so that the conversion times of data are reduced, any type of data can be directly calculated, and the data is not limited by a graph rendering process and can be written in and written out at any time.

In order to solve the technical problems, the invention is realized by the following technical scheme:

the invention relates to an intelligent image recognition method of computer Shader based on a GPU, which comprises the following steps:

s00: extracting the characteristics of the picture in a computer shader, and carrying out normalization pretreatment on the picture;

s01: performing polar coordinate processing on the picture in a computer shader;

s02: extraction of small distance features in the GPU, i.e.

The angle interval delta theta is 2 pi/N, and N is the number of image pixel points;

the method specifically comprises the following steps:

a00: the calculation angle integral algorithm is as follows:

a01: extracting features in a radial region (gamma is more than or equal to 0 and less than or equal to 1) by utilizing a wavelet function for the obtained angle integral; the method comprises the following specific steps:

therein, Ψ_m.n(r) represents a wavelet function, m is a scaling factor and n is a translation factor; q represents the FFT harmonic order taken (typically 0,1,2, 3);

is S_q(r) wavelet transform of r; s_q(r) r represents the characteristic distribution of f (gamma, theta) in a radial region (0. ltoreq. r.ltoreq.1);

s03: and training the extracted features in a GPU by using a BP neural network algorithm to obtain interconnection weights, so that one input sample corresponds to one expected output.

Preferably, the step of performing normalization preprocessing on the picture in S00 specifically includes the following steps:

determining a centroid coordinate of the picture and a scaling factor α, wherein the scaling factor α is a ratio of an image size and a standard size.

Preferably, the BP neural network algorithm in S03 includes the following steps:

b00: initializing all W by adopting random numbers; w is the interconnection weight from the current neuron to the next layer neuron;

b01: selecting an input pattern P from a training data set m and materializing an expected output; wherein, the output of the m output node is 1, and the outputs of all other output nodes are 0;

b02: calculating all node outputs using the current W; wherein, the total input value of the node j is net; the output of the node j is a nonlinear function of the total input value net; the nonlinear function is a sigmoid function, and is specifically as follows:

o_j＝f(net_j)＝1/(1+exp((-net_j+θ_j)/θ₀))；

wherein the parameter theta_jRepresents a threshold value; theta₀The number is a normal number and can be selected randomly, and the shape of the sigmoid function is adjusted;

b03: comparing the activation generated by the output layer with the expected value to obtain the input mode error:

wherein, t_pjIs the jth component of the target pattern P; o_pjIs the jth component of the output pattern resulting from the current input pattern p;

b04: calculating corresponding input mode errors of all input modes in the training data set m by adopting B01-B03, and calculating system errors

B05: and obtaining an image identification result through a gradient descent iteration process in the weight space, and outputting the corresponding Type to the console.

The invention has the following beneficial effects:

1. according to the invention, the edge segmentation is carried out on the image in the GPU through the computer Shader technology, and then the image is identified by using an identification algorithm based on the wavelet distance characteristic and the neural network, so that the conversion times of data are reduced, any type of data can be directly calculated without being constrained by a graph rendering process, the data can be written in and written out at any time, the general calculation efficiency of the GPU is improved, and the image processing efficiency is improved;

2. the invention can directly use the GPU as a parallel processor to be utilized after using the computer Shader, the GPU not only has 3D rendering capability, but also has other operational capability, and the multithread processing technology can better utilize a plurality of cores of the system, thereby improving the processing efficiency and the identification accuracy.

Of course, it is not necessary for any product in which the invention is practiced to achieve all of the above-described advantages at the same time.

Drawings

In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings used in the description of the embodiments will be briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art that other drawings can be obtained according to the drawings without creative efforts.

FIG. 1 is a flow chart of the intelligent image recognition method of GPU-based computer Shader of the present invention;

FIG. 2 is a flow chart of the BP neural network algorithm of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

Referring to fig. 1, the present invention is an intelligent image recognition method based on GPU computer Shader, including the following processes:

s00, extracting the features of the picture in a computer shader, carrying out normalization preprocessing on the picture, and preparing for later translation and scaling, specifically, determining the centroid coordinate of the picture and a scaling factor α, wherein the centroid of the picture has the invariance of translation, scaling and rotation, and the scaling factor α is the ratio of the image size to the standard size;

s01: performing polar coordinate processing on the picture in a computer shader; for example, f (x, y) is a two-dimensional binary image on a rectangular coordinate system, and its corresponding polar coordinate form is f (γ, θ), where x is γ sin (θ); because the image is discrete, a proper angle interval delta theta is selected, and the polar coordinate conversion error is controlled to be minimum;

s02: extraction of small distance features in the GPU, i.e.

Namely discrete processing is carried out; the angle interval delta theta is 2 pi/N, and N is the number of image pixel points;

the method specifically comprises the following steps:

a00: the calculation angle integral algorithm is as follows:

is S_q(r) wavelet transform of r; s_q(r) r represents the characteristic distribution of f (gamma, theta) in a radial region (0. ltoreq. r.ltoreq.1); the wavelet invariant moment can be found to have rotation invariance, and rough to fine processing of the object can be realized by using different scaling factors, and the features are gradually extracted until the required resolution is realized. For GPU multithreading processing, each data block is processed by one thread, and in addition, the global data block can better support multi-resolution operation, so that the algorithm has the capability of realizing multi-resolution parallel observation;

s03: training the extracted features in a GPU by using a BP neural network algorithm to obtain interconnection weights, so that one input sample corresponds to one expected output; the multi-layer feedforward neural network classifier realized by the BP neural network algorithm comprises three layers of feedforward networks; the bottommost layer is an input layer, the second layer is a hidden layer, the third layer is an output layer, and all units take continuous values in an interval [0,1 ];

referring to fig. 2, the BP neural network algorithm includes the following steps:

o_j＝f(net_j)＝1/(1+exp((-net_j+θ_j)/θ₀))；

It should be noted that, in the above system embodiment, each included unit is only divided according to functional logic, but is not limited to the above division as long as the corresponding function can be implemented; in addition, specific names of the functional units are only for convenience of distinguishing from each other, and are not used for limiting the protection scope of the present invention.

In addition, it is understood by those skilled in the art that all or part of the steps in the method for implementing the embodiments described above may be implemented by a program instructing associated hardware, and the corresponding program may be stored in a computer-readable storage medium.

The preferred embodiments of the invention disclosed above are intended to be illustrative only. The preferred embodiments are not intended to be exhaustive or to limit the invention to the precise embodiments disclosed. Obviously, many modifications and variations are possible in light of the above teaching. The embodiments were chosen and described in order to best explain the principles of the invention and the practical application, to thereby enable others skilled in the art to best utilize the invention. The invention is limited only by the claims and their full scope and equivalents.

Claims

1. The intelligent image recognition method of the computer Shader based on the GPU is characterized by comprising the following steps:

s02: extraction of small distance features in the GPU, i.e.

the method specifically comprises the following steps:

a00: the calculation angle integral algorithm is as follows:

therein, Ψ_m.n(r) represents a wavelet function, m is a scalingFactor, n is a translation factor; q represents the FFT harmonic order taken (typically 0,1,2, 3);

2. The intelligent image recognition method of GPU-based computer Shader, as recited in claim 1, wherein the step of normalizing preprocessing the picture in S00 specifically includes the following steps:

3. The intelligent image recognition method of GPU-based computer Shader, as claimed in claim 1, wherein the BP neural network algorithm in S03 comprises the following steps:

o_j＝f(net_j)＝1/(1+exp((-net_j+θ_j)/θ₀))；