CN110458903B

CN110458903B - Image processing method of coding pulse sequence

Info

Publication number: CN110458903B
Application number: CN201910691729.0A
Authority: CN
Inventors: 任全胜; 赵君伟
Original assignee: Peking University
Current assignee: Peking University
Priority date: 2019-07-29
Filing date: 2019-07-29
Publication date: 2021-03-02
Anticipated expiration: 2039-07-29
Also published as: CN110458903A

Abstract

The invention discloses an image processing method of a coded pulse sequence, which is used for coding information of multiple layers of images and coding the information of different layers of the images into the pulse sequence according to a time sequence relation; the method comprises the following steps: converting the image into a gray image and standardizing the gray value of the image; encoding the shape, characteristic points, color and texture of the gray level image into pulses; and then arranging the coded pulses according to a set sequence to form a series of pulse sequences. The invention effectively utilizes the information of each layer carried by the image and reduces the non-key information at the same time. For the pulse neural network, the information completeness of the pulse sequence input by the neural network is improved, the redundancy of information is reduced, and the information processing efficiency of the pulse neural network can be improved. For signal processing, the information capacity of the source and the coding efficiency of the information are improved.

Description

Image processing method of coding pulse sequence

Technical Field

The invention belongs to the technical field of brain-like computation, impulse neural network and image coding, in particular relates to a method for coding an image into an impulse sequence, and particularly relates to an image processing method applied to the field of impulse neural network information processing.

Background

In recent years, the field of brain-like computing has gradually attracted the attention of numerous scholars, researchers have developed researches in the fields of biological neurons, connection modes, impulse neural networks and the like, and try to model the brain from a microscopic level to a macroscopic level, further promote the current artificial intelligence process based on a deep learning technology, and finally achieve the aim of general artificial intelligence.

Compared with the widely popular numerical deep learning neural network, the Spiking neural network has more brain-like characteristics in connection mode, information processing mechanism and synaptic weight learning method, and is an important research direction of brain-like calculation. The impulse neural network processes the impulse sequence, and the acquisition mode of the impulse sequence generally has two approaches: one is to convert the image into a pulse sequence according to a certain coding mode (such as a threshold coding method, a Gaussian difference method, a frequency coding method and the like), and the other is to directly come from a neuromorphic camera. However, the neuromorphic camera is not widely used and many researchers do not have a practical device because the neuromorphic camera is not mature enough in production process, inconvenient in purchase channel, expensive in sale price and the like. Therefore, the pulse sequence widely used for information processing of the impulse neural network at present mainly originates from the first mode.

The current method for coding the image into the pulse sequence has the problems of serious information loss, low coding efficiency and the like, and further becomes one of the reasons of low recognition rate of the pulse neural network. For example, the threshold encoding method mentioned above is implemented by setting a threshold value, the pixels above the threshold value excite the pulse, and the pixels below the threshold value do not excite the pulse, and the principle of the method is too simple, so that the encoded pulse sequence loses a large amount of information (such as color, texture, etc.) compared with the original image; the Gaussian difference method is used for detecting the local contrast of an image and determining the issuing sequence of pulses according to the strength of the contrast, and a large amount of Gaussian noise is introduced by the method so as to seriously influence the identification effect of the pulse neural network; the frequency coding method is to set the frequency of each pixel excitation pulse to be proportional to the intensity value of the pixel, and restore the light intensity of the pixel by counting the number of pulses excited by each pixel in the simulation time interval.

Disclosure of Invention

Aiming at the problems of serious information loss, low coding efficiency and the like in the process of coding an image into pulses in the prior art, the invention provides a novel image pulse coding method, which extracts the information of an original image on a plurality of layers, greatly reduces the information loss caused after the image is coded into pulses, and effectively improves the coding efficiency because the length of a pulse sequence segment generated by coding is shorter (generally within 10 pulse frames).

The core innovation points of the invention are as follows:

the invention provides an image processing method for encoding an image into a group of pulse sequences, which encodes the shape, characteristic points, color and texture of a gray level image; and then the codes are arranged according to a set sequence to form a series of pulse sequences. According to the method, the process of observing the object by the human eyes is not kicking but progressive: looking first at the outline of the image, then noting the colors, then it will be observed that texture etc. … … is essentially a process from global to local to detail, with each step focusing on a different emphasis. The method provided by the invention carries out information mining on a plurality of layers of images and encodes information of different layers into pulse sequences according to a time sequence relation. Therefore, each pulse frame of the pulse sequence describes image characteristics of different layers, when the pulse neural network processes the pulse sequence, the processing process is similar to the process of observing objects by human eyes and is a progressive process, and the naturalness and the reality of an image processing result can be improved.

The technical scheme provided by the invention is as follows:

a image pulse coding method, carry on the information mining and coding of the multi-layer to the picture, and encode the information of different layers into the pulse sequence according to the time sequence relation; the method for encoding the image into the pulse comprises the following steps (as shown in the attached figure 1):

1) the input image is subjected to scale conversion and converted into images of uniform size (length and width are denoted as H and W, respectively).

2) And converting the image with the same size into a gray image and standardizing the gray value of the image.

3) The gray image is subjected to shape coding, a Gaussian filter is used for filtering the image, then an edge detection algorithm is used for extracting an edge contour of the gray image, contour information is subjected to binarization processing (the position value with the contour is 1, and the position value without the contour is 0) and then recorded in a two-dimensional matrix with the same length (H) and width (W) as the gray image, and the matrix is counted as S1.

4) Performing feature point coding on the gray level image, firstly determining key features in the image and the number of feature points required by the representation of the key features, then extracting position coordinates of the feature points by using a feature point detection algorithm, then defining a two-dimensional matrix with the same length (H) and width (W) as the gray level image and initializing the two-dimensional matrix to be 0, setting the position of the two-dimensional matrix with the same coordinate as the feature points to be 1, and counting the matrix as S2;

5) the gray image is color coded, the color of the gray image (the gray image is 256 steps) is divided into a plurality of segments according to actual needs, a three-dimensional matrix S3 with the length (H) and the width (W) being the same as the size of the gray image and the height (M) being the same as the number of the divided segments is defined, and the matrix is initialized to be 0. When the number of the divided sections is 5, the value of M is 5.

In specific implementation, each color segment corresponds to a two-dimensional matrix with the scale of M dimension of the three-dimensional matrix S3 from 0 to M-1; for example, if the number of segments is 5, the colors are divided into five segments, i.e., 0 to 51, 52 to 102, 103 to 153, 154 to 204, and 205 to 255. Each color segment is respectively corresponding to a two-dimensional matrix with the scale of the mth dimension of 0, 1, 2, 3 and 4 in the three-dimensional matrix S3, namely 0 to 51 correspond to S3(M is 0), 52 to 102 correspond to S3(M is 1), 103 to 153 correspond to S3(M is 2), 154 to 204 correspond to S3(M is 3), and 205 to 255 correspond to S3(M is 4). Then, the gray-scale value of each pixel of the gray-scale image is traversed, the color segment i (i, an integer with a value range of [0, M-1 ]) where the pixel is located is determined, and the value of the same position of the S3(M ═ i) matrix and the pixel is set to be 1.

6) Performing texture coding on a gray level image, firstly observing the integral texture type of the image, then extracting one or more textures of the image, such as roughness, direction degree, linearity and the like, then defining a two-dimensional matrix with the length (H) and width (W) same as the gray level image and initializing the two-dimensional matrix to 0, performing binarization processing on the extracted texture characteristics and recording the texture characteristics in the matrix, wherein the matrix is counted as S4;

7) the matrices S1 to S4 are arranged in a predetermined order and combined into a series of pulse trains S.

The pulse sequence of the image is obtained by the method, and the original image is represented from a plurality of layers such as the contour, the texture, the feature points and the like, so that the information utilization rate of the original image is greatly improved. The pulse sequence coded by the method has time sequence characteristics, and provides a very matched image pulse sequence data type for some pulse neural network algorithms which are good at mining, learning and identifying image pulse sequence time sequence information (for example, a Spike cube SNN-based image pulse data space-time information learning and identifying method, patent application number: 2019104814209).

It should be noted that, for different input images and different pulse information processing scenarios, the method corresponding to the matrixes S1-S4 can be adjusted, and the sequence of S1-S4 can also be adjusted according to the actual application scenario. For some simpler application scenes, the information processing links from S1 to S4 can be reduced appropriately, and for some more complex application scenes, other types of image processing methods can be added appropriately according to needs.

Compared with the prior art, the invention has the beneficial effects that:

the image pulse coding method provided by the invention not only extracts the outline of the image or codes the light intensity of the image, but also carries out information mining on the image from different layers, effectively utilizes the information of each layer carried by the image, and simultaneously reduces some information (such as the light intensity of the image) which is not particularly critical to recognition. In terms of the pulse neural network, the invention improves the information completeness of the pulse sequence input by the neural network, reduces the redundancy of information and can improve the information processing efficiency of the pulse neural network. From the aspect of signal processing, the invention improves the information capacity of the information source and improves the coding efficiency of the information.

Drawings

Fig. 1 is a flow chart illustrating an implementation procedure of a method for encoding image pulses according to the present invention.

Fig. 2 is an example of an original image encoded by a pulse encoding method according to an embodiment of the present invention.

FIG. 3 is a diagram of each pulse matrix of the pulse sequence S according to an embodiment of the present invention;

the shape coding matrix S1 (1), the feature point coding matrix S2 (2), the color coding matrix S3 (3) to (7), and the texture coding matrix S4 (8) are listed below.

Detailed Description

The invention will be further described by way of examples, without in any way limiting the scope of the invention, with reference to the accompanying drawings.

The invention provides an image processing method of a coding pulse sequence, which is a new image pulse coding method, and the method extracts the information of an original image from a plurality of layers and codes the information of different layers into the pulse sequence according to the time sequence relation, thereby greatly reducing the information loss caused after the image is coded into the pulse, and the length of the pulse sequence segment generated by coding is shorter, thereby effectively improving the coding efficiency, and solving the problems of serious information loss, low coding efficiency and the like in the process of coding the image into the pulse in the prior art.

Fig. 1 is a flow chart illustrating an implementation procedure of a method for encoding image pulses according to the present invention. Fig. 2 is an example of an original image encoded by a pulse encoding method according to an embodiment of the present invention. In specific implementation, encoding an image into a pulse sequence comprises the following steps:

1) transforming the example image into an image with a length and a width of H and W respectively by size transformation;

2) converting the size-converted image into a gray image, and standardizing the gray value of the image;

3) the gray image is subjected to shape coding, a Gaussian filter is used for filtering the image, then an edge detection algorithm is used for extracting an edge contour of the gray image, contour information is subjected to binarization processing (the position value with the contour is 1, and the position value without the contour is 0) and then recorded in a two-dimensional matrix with the same length (H) and width (W) as the gray image, and the matrix is counted as S1. Fig. 3 shows (1) that is the shape coding matrix S1 obtained in this embodiment;

4) performing feature point coding on the gray level image, firstly determining key features in the image and the number of feature points required by the representation of the key features, then extracting position coordinates of the feature points by using a feature point detection algorithm, then defining a two-dimensional matrix with the same length (H) and width (W) as the gray level image and initializing the two-dimensional matrix to be 0, setting the position of the two-dimensional matrix with the same coordinate as the feature points to be 1, and counting the matrix as S2; fig. 3 (2) is the characteristic point coding matrix S2 obtained in this embodiment;

5) the gray image is color coded, the image color (the gray image is 256 steps) is divided into a plurality of segments according to the actual requirement, a three-dimensional matrix S3 with the length (H) and the width (W) being the same as the size of the gray image and the height (M) being the same as the number of the divided segments is defined, and the matrix is initialized to be 0.

In the specific implementation, for example, the number of the divided segments is 5, the colors are divided into five segments of 0 to 51, 52 to 102, 103 to 153, 154 to 204, and 205 to 255. Each color segment is respectively corresponding to a two-dimensional matrix with the M-th dimension scale of 0, 1, 2, 3 and 4 in the three-dimensional matrix S3, namely 0-51 corresponds to S3(M is 0), 52-102 corresponds to S3(M is 1), 103-153 corresponds to S3(M is 2), 154-204 corresponds to S3(M is 3), and 205-255 corresponds to S3(M is 4). Then, the gray-scale value of each pixel of the gray-scale image is traversed, the color segment i (i, an integer with a value range of [0, M-1 ]) where the pixel is located is determined, and the value of the same position of the S3(M ═ i) matrix and the pixel is set to be 1. The color coding matrix S3 obtained in the present embodiment is (3) to (7) in fig. 3.

6) Performing texture coding on a gray level image, firstly observing the integral texture type of the image, then extracting one or more textures of the image, such as roughness, direction degree, linearity and the like, then defining a two-dimensional matrix with the length (H) and width (W) same as the gray level image and initializing the two-dimensional matrix to 0, performing binarization processing on the extracted texture characteristics and recording the texture characteristics in the matrix, wherein the matrix is counted as S4; fig. 3 (8) shows the texture coding matrix S4 obtained in this embodiment.

The pulse sequence of the image is obtained by the method, and the original image is represented from a plurality of layers such as the contour, the texture, the feature points and the like, so that the information utilization rate of the original image is greatly improved. The pulse sequence coded by the method has time sequence characteristics, and a very matched image pulse sequence data type is provided for some pulse neural network algorithms which are good at mining the time sequence information of the pulse sequence (for example, a Spike cube SNN-based image pulse data space-time information learning and identifying method, patent application number: 2019104814209).

It is noted that the disclosed embodiments are intended to aid in further understanding of the invention, but those skilled in the art will appreciate that: various substitutions and modifications are possible without departing from the spirit and scope of the invention and appended claims. Therefore, the invention should not be limited to the embodiments disclosed, but the scope of the invention is defined by the appended claims.

Claims

1. A image pulse coding method, carry on the information coding of the multi-layer to the picture, and encode the information of different layers of picture as the pulse sequence according to the time sequence relation; the method comprises the following steps:

1) carrying out scale transformation on an input image to obtain images with the length and width of H and W respectively;

2) converting the image into a gray level image, and carrying out standardization processing on the gray level value of the image;

3) carrying out shape coding on the gray level image; the method comprises the following steps:

31) filtering an image;

32) then extracting the edge contour of the gray level image, and then carrying out binarization processing on contour information, wherein the position value with the contour is 1, and the position value without the contour is 0;

33) recording the binarized values in a two-dimensional matrix of H x W, which is designated S1;

4) carrying out feature point coding on the gray level image, comprising the following steps:

41) firstly, determining key features in an image and the number of feature points required for representing the key features;

42) then extracting the position coordinates of the feature points by using a feature point detection algorithm;

43) defining a two-dimensional matrix S2 of H x W, initializing the two-dimensional matrix to 0, and setting the position in the two-dimensional matrix, which is the same as the coordinate of the characteristic point, as 1;

5) color coding a grayscale image, comprising:

51) the gray level image color is 256 steps, and the gray level image color is divided into a plurality of segments;

52) defining a three-dimensional matrix S3 of H W M, where M is the number of segments of the color division;

53) initializing the three-dimensional matrix S3 to 0;

54) each color segment is respectively corresponding to the dimension M of the three-dimensional matrix S3; namely, each color segment corresponds to a two-dimensional matrix with the scale of M dimension of the three-dimensional matrix S3 from 0 to M-1;

55) traversing the gray level value of each pixel of the gray level image, and determining the color segment i where the pixel is located, wherein the value of i is 0-M-1; setting the value of the same position of the pixel in an S3 matrix of dimension M-i corresponding to the color segment as 1;

6) texture coding a grayscale image, comprising:

61) extracting one or more textures of the image according to the texture type of the image;

62) defining a two-dimensional matrix S4 of H W and initializing to 0;

63) performing binarization processing on the extracted texture features and recording the processed texture features in a two-dimensional matrix S4;

7) arranging the matrixes S1-S4 according to a set sequence to form a series of pulse sequences S;

through the steps, the multi-layer information of the image is coded into the pulse sequence.

2. The method for pulse encoding of an image as claimed in claim 1, wherein in step 3), the image is filtered, in particular using a gaussian filter; specifically, an edge detection algorithm is used for extracting the edge contour of the gray level image.

3. The method for encoding an image pulse according to claim 1, wherein in the step 5), the number of the color division is 5, and the gray scale values are 0 to 51, 52 to 102, 103 to 153, 154 to 204, and 205 to 255; the color segment i is an integer with the value range of [0,4 ].

4. The image pulse encoding method of claim 1, wherein step 6) texture-encodes the gray scale image, said texture including one or more of roughness, orientation, and linearity.

5. The image pulse encoding method of claim 1, wherein the acquisition methods corresponding to the matrices S1 to S4 are adjustable according to different input images and different pulse information processing scenes.

6. A method as claimed in any one of claims 1 to 5, wherein the pulse sequence is applied to image pulse data spatio-temporal information learning and identification processes.