CN111950565A - Abstract picture image direction identification method based on feature fusion and naive Bayes - Google Patents

Abstract picture image direction identification method based on feature fusion and naive Bayes Download PDF

Info

Publication number
CN111950565A
Authority
CN
China
Prior art keywords
image, sub-block, len, abstract
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010737934.9A
Other languages
Chinese (zh)
Other versions
CN111950565B (en)
Inventor
白茹意
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huizhou Weimili Technology Co ltd
Original Assignee
Shanxi University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanxi University filed Critical Shanxi University
Priority to CN202010737934.9A priority Critical patent/CN111950565B/en
Publication of CN111950565A publication Critical patent/CN111950565A/en
Application granted granted Critical
Publication of CN111950565B publication Critical patent/CN111950565B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/50Extraction of image or video features by performing operations within image blocks; by using histograms, e.g. histogram of oriented gradients [HoG]; by summing image-intensity values; Projection analysis
    • G06V10/507Summing image-intensity values; Histogram projection analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2415Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/047Probabilistic or stochastic networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/048Activation functions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00Computing arrangements based on specific mathematical models
    • G06N7/01Probabilistic graphical models, e.g. probabilistic networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/70Determining position or orientation of objects or cameras
    • G06T7/73Determining position or orientation of objects or cameras using feature-based methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/90Determination of colour characteristics
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/56Extraction of image or video features relating to colour
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10004Still image; Photographic image
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10024Color image
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20076Probabilistic image processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Software Systems (AREA)
  • Mathematical Physics (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Probability & Statistics with Applications (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Multimedia (AREA)
  • Algebra (AREA)
  • Computational Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Pure & Applied Mathematics (AREA)
  • Image Analysis (AREA)

Abstract

The invention belongs to the technical field of image processing and computer vision, and particularly relates to an abstract picture image direction identification method based on feature fusion and naive Bayes, which comprises the following steps: S1, rotating the abstract picture image to obtain four abstract picture images with different directions, and simultaneously dividing each abstract picture image to obtain four sub-blocks; S2, extracting low-level features of the abstract picture image; S3, extracting high-level features of the abstract picture image with a convolutional neural network (CNN); S4, linearly combining the image low-level feature values and the image high-level feature values to obtain the final feature value of the abstract picture image; and S5, inputting the final feature value of the abstract picture image into a naive Bayes classifier for training and prediction. The method obtains the feature value of an image by fusing low-level and high-level features and then feeds that feature value into a naive Bayes (NB) classifier for training and prediction, thereby realizing automatic prediction of the direction of abstract picture images and improving the prediction accuracy.

Description

Abstract picture image direction identification method based on feature fusion and naive Bayes
Technical Field
The invention belongs to the technical field of image processing and computer vision, and particularly relates to an abstract picture image direction identification method based on feature fusion and naive Bayes.
Background
Abstract art is a visual language built from shapes, colors and lines, and it is to some extent independent of visual references in the world. Paintings created for emotional expression are called "hot abstraction", while paintings that describe the world in an abstract manner are called "cold abstraction". When creating abstract paintings, artists usually determine the correct hanging direction of a work according to their own aesthetic concepts. Although the correct direction is typically specified on the back of the canvas, it is not apparent to non-professional viewers. Moreover, several studies in psychology in recent years have addressed the problem of the direction of abstract paintings, and most of them conclude that a correctly oriented painting receives a higher aesthetic appreciation. Experiments with participants showed that in about half of the cases the preferred orientation was consistent with the artist's intended orientation, which is much higher than chance but far from perfect performance. These results provide evidence for a relationship between painting direction and aesthetic quality, and research on orientation recognition can reveal objective rules of visual aesthetic evaluation.
With the trend of information digitization, digital images of paintings can easily be found on the Internet, which makes computer-aided painting analysis possible. Researchers have studied various aesthetic evaluation methods by directly exploring the relationship between human aesthetic perception and computational visual features, but none of these methods addresses aesthetic evaluation through computer-aided orientation judgment. The state of research on image orientation in recent years is as follows: (1) research on image direction identification has mainly targeted photographic pictures such as natural or scene images, for which the recognition rate is satisfactory; however, because the content and semantics of abstract picture images are far less conspicuous than those of photographs, recognizing the direction of an abstract painting is difficult, and related work in recent years is comparatively scarce. (2) Humans generally recognize direction by understanding image content, so some methods adopt high-level semantic features to recognize image direction and obtain markedly higher accuracy; this accuracy, however, depends to a large extent on whether the semantic gap between high-level cues and low-level features can be closed.
At present, the extensive research on natural image orientation prompts us to explore the direction judgment problem of abstract painting. The aim of the invention is to better understand the sense of orientation of abstract paintings and, in particular, to establish within a machine learning framework the relationship between the visual content of images that lack substantial (recognizable) content and their correct orientation.
Disclosure of Invention
The invention overcomes the defects of the prior art, provides the method for identifying the direction of the abstract picture image based on feature fusion and naive Bayes, and can realize the automatic prediction of the direction of the abstract picture image through computer operation.
In order to solve the technical problems, the invention adopts the following technical scheme: an abstract picture image direction identification method based on feature fusion and naive Bayes, comprising the following steps:
S1, rotating the abstract picture image by 0°, 90°, 180° and 270° to obtain four abstract picture images with different directions, and performing an upper-lower average segmentation and a left-right average segmentation on each abstract picture image, so that each abstract picture image is divided into an upper sub-block, a lower sub-block, a left sub-block and a right sub-block;
S2, extracting low-level features of the abstract picture image: the low-level feature description of each sub-block is calculated separately, and the comparison results of the low-level feature descriptions of the sub-blocks are taken as the image low-level feature values, where a comparison result of true is represented as 1 and false as 0;
S3, extracting high-level features of the abstract picture image with a convolutional neural network (CNN), which comprises the following concrete steps:
S301, resizing the four sub-blocks of the abstract picture into 128 × 128 RGB color images;
S302, respectively inputting the four sub-blocks into the convolutional neural network CNN, wherein the CNN comprises 3 convolutional layers with a step length of 1, 3 max-pooling layers of 2 × 2 and 2 fully-connected layers, ReLU is adopted as the activation function in each convolutional layer, and the dimensionalities of the two fully-connected layers are 1024 and 512 respectively, so that a 512-dimensional vector is finally obtained for each sub-block as its neural network feature vector;
S303, taking the comparison results of the feature vectors of the upper and lower sub-blocks and of the left and right sub-blocks as the image high-level feature values f14 and f15, calculated as follows:
f14 = f_cnn_A ≥ f_cnn_B; f15 = f_cnn_L ≥ f_cnn_R;
where f_cnn_A, f_cnn_B, f_cnn_L and f_cnn_R respectively represent the neural network feature vectors of the upper, lower, left and right sub-blocks;
S4, linearly combining the image low-level feature values obtained in step S2 and the image high-level feature values obtained in step S3 to obtain the final feature value of the abstract picture image;
and S5, performing the operations of steps S1-S4 on all abstract pictures in the image library to obtain their final feature values, inputting them into a naive Bayes classifier for training and prediction, and finally classifying each abstract picture as upward, downward, leftward or rightward, thereby realizing automatic prediction of the direction of abstract picture images.
The step S2 specifically includes the following steps:
s201, converting the four subblocks in the step S1 from an RGB color space into HSV models, dividing the H-S space into 16 hues and 8 saturations, and counting the number of pixels of 128 colors to be used as a color histogram vector of an abstract picture; judging the comparison result of the histogram vectors of the upper sub-block, the lower sub-block and the left sub-block and the right sub-block as image characteristic values f1 and f2, wherein the specific formula is as follows:
f1 = Hist_A ≥ Hist_B; f2 = Hist_L ≥ Hist_R;
where Hist_A, Hist_B, Hist_L and Hist_R are respectively the histogram vectors of the upper, lower, left and right sub-blocks;
s202, representing the maximum gradient of the image as the complexity of the image, and calculating the complexity of four sub-blocks; judging the comparison result of the complexity of the upper sub-block, the lower sub-block and the left sub-block and the right sub-block as image characteristic values f3 and f4, wherein the following formula is adopted:
f3 = Comp_A ≥ Comp_B; f4 = Comp_L ≥ Comp_R;
where Comp_A, Comp_B, Comp_L and Comp_R respectively represent the complexities of the upper, lower, left and right sub-blocks;
s203, calculating the similarity between every two sub-blocks in the four sub-blocks; and the comparison result of the similarity between the sub-blocks is taken as the image feature values f5, and f6 and f 7; the formula is as follows:
f5=Sim(A,L)≥Sim(A,R);f6=Sim(B,L)≥Sim(B,R);f7=Sim(A,B)≥Sim(L,R);
s204, detecting the significant straight lines of the four sub-blocks by using Hough transformation, judging whether the straight lines are static lines or dynamic lines according to the inclination angles alpha of the straight lines, calculating the number of the static lines and the dynamic lines and the average length of all the lines as image characteristics, and respectively taking the comparison results of the straight line attribute values between the two sub-blocks as image characteristic values f8, f9, f10, f11, f12 and f13, wherein the formula is as follows:
f8 = Len_S_A ≥ Len_S_B; f9 = Len_D_A ≥ Len_D_B; f10 = Ave_Len_A ≥ Ave_Len_B;
f11 = Len_S_L ≥ Len_S_R; f12 = Len_D_L ≥ Len_D_R; f13 = Ave_Len_L ≥ Ave_Len_R;
where Len_S_A, Len_S_B, Len_S_L and Len_S_R respectively represent the numbers of static lines in the upper, lower, left and right sub-blocks, Len_D_A, Len_D_B, Len_D_L and Len_D_R respectively represent the numbers of dynamic lines in the upper, lower, left and right sub-blocks, and Ave_Len_A, Ave_Len_B, Ave_Len_L and Ave_Len_R respectively represent the average lengths of all lines in the upper, lower, left and right sub-blocks.
In step S202, the calculation formula of the image complexity is as follows:
G_max(x, y) = max(||∇G_R(x, y)||, ||∇G_G(x, y)||, ||∇G_B(x, y)||);
Comp_G = ( Σ_{(x, y)∈G} G_max(x, y) ) / Pixelnum(G);
where G_max(x, y) represents the maximum gradient of pixel point (x, y) of the image in the RGB color space, ∇G_R(x, y), ∇G_G(x, y) and ∇G_B(x, y) respectively represent the gradients of the R, G and B channels at point (x, y), Pixelnum(G) represents the total number of pixels of image G, and Comp_G represents the complexity of image G.
In step S201, the formula for converting the image from the RGB color space to the HSV model is as follows:
r′ = r/255; g′ = g/255; b′ = b/255;
kmax = max(r′, g′, b′); kmin = min(r′, g′, b′); Δ = kmax − kmin;
h = 60° × ((g′ − b′)/Δ mod 6) if kmax = r′; h = 60° × ((b′ − r′)/Δ + 2) if kmax = g′; h = 60° × ((r′ − g′)/Δ + 4) if kmax = b′; h = 0 if Δ = 0;
s = Δ/kmax if kmax ≠ 0, otherwise s = 0;
v = kmax;
where r, g and b respectively represent the RGB values of an image pixel in the RGB color space, r′, g′ and b′ are intermediate variables, kmax represents the maximum value and kmin the minimum value of r′, g′ and b′, and h, s and v represent the hue, saturation and value (brightness) of the image pixel in the HSV model.
The network structure of the convolutional neural network CNN is as follows: the first convolutional layer consists of 16 convolution kernels of 3 × 3; the second convolutional layer consists of 8 convolution kernels of 3 × 3; the third convolutional layer consists of 4 convolution kernels of 3 × 3; the feature map obtained after each convolution is zero-padded at the edges so that its size is kept unchanged; after each convolutional layer, the feature resolution is reduced with 2 × 2 max pooling; finally, the four 16 × 16 two-dimensional matrices are converted into a 1024-dimensional feature vector using the fully-connected layers, and the 1024 dimensions are reduced to 512 dimensions.
In step S4, the vector dimension of the final feature value of the linearly combined abstract picture image is 1291.
In step S5, the specific method of performing four classifications, i.e., "up", "down", "left", and "right", when the naive bayes classifier predicts the direction of the abstract drawing image is:
these four cases are divided into four groups: one direction is selected as one type in each group, the other three directions are used as the other types, the ratio of the posterior probabilities of the two types in each group is calculated, and the calculation formula is as follows:
R_θ(F) = P(C_θ | F) / P(C_¬θ | F);
where R_θ(F) is the posterior probability ratio of the two classes in each group, P(C_θ | F) represents the posterior probability of the selected direction θ, and P(C_¬θ | F) represents the posterior probability of the class formed by the remaining three directions; the posterior probability ratios R_θ(F) of the four groups are compared, and the direction with the largest R_θ(F) is selected as the correct direction of the abstract picture image.
Compared with the prior art, the invention has the following beneficial effects:
the invention provides an abstract picture image direction identification method based on feature fusion and naive Bayes, which comprises the following steps of (1) carrying out upper and lower average segmentation and left and right average segmentation on an abstract picture image. Thus, each abstract drawing is divided into four sub-blocks (up, down, left, and right). The image features are based on the comparison result of the four direction feature descriptions, so that the direction structure of the image can be embodied more specifically. (2) According to the basic principle of the abstract drawing theory, low-level features including color, complexity, similarity and linear attributes of all abstract drawing images are extracted. From the perspective of the drawing principle, the characteristics can better express the basic characteristics of abstract drawing and reflect the directionality of the image. (3) And extracting high-level features of the abstract picture image by adopting a Convolutional Neural Network (CNN). (4) And linearly combining the low-level features and the high-level features, wherein the combined vector is the final characteristic value of the abstract picture image. Therefore, local and global characteristics of the image can be better fused, and the image direction can be more accurately detected.
Drawings
FIG. 1 is a diagram illustrating abstract drawing rotation according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of abstract picture segmentation according to an embodiment of the present invention;
fig. 3 is a schematic structural diagram of a CNN model adopted in the embodiment of the present invention;
FIG. 4 is a frame for identifying an abstract image direction according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below, and it is obvious that the described embodiments are some embodiments of the present invention, but not all embodiments; all other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The invention provides an abstract drawing image direction recognition method based on feature fusion and naive Bayes; paintings from a public website are selected for the experiments, and the concrete implementation steps are as follows:
S1: 500 abstract paintings from the WikiArt (http://www.wikiart.org) dataset were chosen. All abstract drawing images are rotated clockwise into four directions (0°, 90°, 180°, 270°), with reference to FIG. 1, finally yielding 2000 abstract picture images with different directions. Each abstract picture image is then segmented by an upper-lower average segmentation and a left-right average segmentation, so that each abstract picture is divided into four sub-blocks (upper (A), lower (B), left (L) and right (R)), referring to FIG. 2.
S2: and extracting low-level features of all abstract drawing images according to the basic principle of the abstract drawing theory. And respectively calculating the low-level feature description of each sub-block, taking the comparison result of the feature descriptions as the final feature value of the image, and if the comparison result is true, representing as 1, otherwise, representing as 0. The method comprises the following specific steps:
s201: the four sub-blocks in S1 are converted from the RGB color space into HSV models (hue (h), saturation (S), value (v)). The calculation formula is as follows:
h = 60° × ((g′ − b′)/Δ mod 6) if kmax = r′; h = 60° × ((b′ − r′)/Δ + 2) if kmax = g′; h = 60° × ((r′ − g′)/Δ + 4) if kmax = b′; h = 0 if Δ = 0; (1)
s = Δ/kmax if kmax ≠ 0, otherwise s = 0; (2)
v = kmax; (3)
where r, g and b respectively represent the RGB values of an image pixel in the RGB color space, r′, g′ and b′ are intermediate variables, kmax represents the maximum value and kmin the minimum value of r′, g′ and b′, and h, s and v represent the hue, saturation and value (brightness) of the image pixel in the HSV model;
r′ = r/255; g′ = g/255; b′ = b/255; (4)
kmax = max(r′, g′, b′); kmin = min(r′, g′, b′); Δ = kmax − kmin; (5)
In direction recognition, the influence of brightness is small, so the H-S space is divided into 16 hues and 8 saturations, and the number of pixels falling into each of the 128 colors is counted as the color histogram vector of the painting. The image feature values f1 and f2 are the comparison results of the histogram vectors of two sub-blocks, with the following formulas:
f1 = Hist_A ≥ Hist_B; f2 = Hist_L ≥ Hist_R; (6)
where Hist_A, Hist_B, Hist_L and Hist_R are respectively the histogram vectors of the four sub-blocks. The dimensions of f1 and f2 are both 128.
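A minimal sketch of this H-S histogram feature, assuming cv2 for the RGB-to-HSV conversion; the exact bin edges are an assumption.

```python
# Sketch of step S201: 16 hue bins x 8 saturation bins = 128-dim color histogram per
# sub-block, and element-wise comparison features f1/f2 (1 for true, 0 for false).
# The random sub-block arrays are placeholders standing in for A, B, L, R from step S1.
import cv2
import numpy as np

def hs_histogram(rgb_block):
    """128-dim pixel-count histogram over the H-S plane (16 hue x 8 saturation bins)."""
    hsv = cv2.cvtColor(rgb_block, cv2.COLOR_RGB2HSV)   # 8-bit: H in [0, 180), S in [0, 256)
    hist = cv2.calcHist([hsv], [0, 1], None, [16, 8], [0, 180, 0, 256])
    return hist.flatten()

A, B, L, R = (np.random.randint(0, 256, (120, 160, 3), dtype=np.uint8) for _ in range(4))

f1 = (hs_histogram(A) >= hs_histogram(B)).astype(np.uint8)   # 128 binary values
f2 = (hs_histogram(L) >= hs_histogram(R)).astype(np.uint8)   # 128 binary values
```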
S202: The maximum gradient of the image is used to represent its complexity, and the complexities of the four sub-block images of step S1 are then calculated. Let the image be G; in the RGB color space, the maximum gradient of pixel point (x, y) in the image is computed as G_max(x, y), and G_max(x, y) averaged over all pixel points of the image is taken as its complexity. The calculation formulas are as follows:
G_max(x, y) = max(||∇G_R(x, y)||, ||∇G_G(x, y)||, ||∇G_B(x, y)||); (7)
Comp_G = ( Σ_{(x, y)∈G} G_max(x, y) ) / Pixelnum(G); (8)
where (x, y) represents the coordinates of a pixel point in the image, ∇G_R(x, y), ∇G_G(x, y) and ∇G_B(x, y) are the gradients of the R, G and B channels at point (x, y), Pixelnum(G) is the total number of pixels in image G, and Comp_G is the complexity of image G. The image feature values f3 and f4 are the comparison results of the complexities of two sub-blocks, with the following formulas:
f3 = Comp_A ≥ Comp_B; f4 = Comp_L ≥ Comp_R; (9)
where Comp_A, Comp_B, Comp_L and Comp_R respectively represent the complexities of the upper, lower, left and right sub-blocks.
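A sketch of this complexity measure follows; the patent does not fix a particular gradient operator, so the use of np.gradient is an assumption.

```python
# Sketch of step S202: per-pixel maximum of the R, G and B gradient magnitudes, averaged
# over all pixels. np.gradient stands in for the (unspecified) gradient operator.
import numpy as np

def complexity(rgb_block):
    """Comp_G: mean over all pixels of the maximum RGB gradient magnitude G_max(x, y)."""
    block = rgb_block.astype(np.float64)
    mags = []
    for c in range(3):                                  # R, G, B channels
        gy, gx = np.gradient(block[:, :, c])
        mags.append(np.hypot(gx, gy))                   # gradient magnitude of one channel
    g_max = np.max(np.stack(mags), axis=0)              # G_max(x, y)
    return float(g_max.sum() / g_max.size)              # divide by Pixelnum(G)

# A, B, L, R stand in for the four sub-blocks of step S1
A, B, L, R = (np.random.rand(120, 160, 3) for _ in range(4))
f3 = int(complexity(A) >= complexity(B))
f4 = int(complexity(L) >= complexity(R))
```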
S203: and calculating the similarity between every two sub-blocks in the four sub-blocks.
Suppose two images G_1 and G_2; the histogram of oriented gradients (HOG) is used to compute the similarity between the two images. The HOG features of the 3 channels are calculated in RGB mode, with 8 orientations per cell. The similarity Sim(G_1, G_2) between the two images is calculated as follows:
[Equation (10), shown only as an image in the original: Sim(G_1, G_2) is computed from the HOG feature vectors H_1 and H_2 over the m cells.]
where G_1, G_2 ∈ RGB, H_1 and H_2 are respectively the HOG feature vectors of images G_1 and G_2, and m is the number of cells in the HOG feature. The image feature values f5, f6 and f7 are the comparison results of the similarities between sub-blocks, with the following formulas:
f5 = Sim(A, L) ≥ Sim(A, R); f6 = Sim(B, L) ≥ Sim(B, R); f7 = Sim(A, B) ≥ Sim(L, R); (11)
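Because equation (10) is only reproduced as an image, the sketch below substitutes a histogram-intersection similarity over per-channel HOG features; it illustrates how the comparison features f5-f7 are formed but is not the patented Sim formula.

```python
# Illustrative stand-in for step S203: HOG features with 8 orientations per RGB channel,
# compared with a histogram-intersection similarity. The exact Sim formula of equation (10)
# is not reproduced in the text, so this similarity and the resize step are assumptions.
import cv2
import numpy as np
from skimage.feature import hog

def hog_features(rgb_block, size=(128, 128)):
    """Concatenated 8-orientation HOG descriptors of the R, G and B channels."""
    block = cv2.resize(rgb_block, size)     # fixed size so descriptors are comparable (assumption)
    return np.concatenate([
        hog(block[:, :, c], orientations=8, pixels_per_cell=(16, 16),
            cells_per_block=(1, 1), feature_vector=True)
        for c in range(3)
    ])

def sim(block1, block2):
    """Histogram-intersection style similarity between two HOG descriptors (assumed form)."""
    h1, h2 = hog_features(block1), hog_features(block2)
    return float(np.minimum(h1, h2).sum() / (h1.sum() + 1e-12))

A, B, L, R = (np.random.randint(0, 256, (120, 160, 3), dtype=np.uint8) for _ in range(4))
f5 = int(sim(A, L) >= sim(A, R))
f6 = int(sim(B, L) >= sim(B, R))
f7 = int(sim(A, B) >= sim(L, R))
```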
s204: the Hough transform was used to detect the significant straight lines of the four sub-blocks. According to the inclination angle α of a straight line, the line is a static line if the inclination angle (-15 ° < α <15 °) or (75 ° < α <105 °), and a dynamic line otherwise. And calculating the number of the static lines and the dynamic lines and the average length of all the lines as image characteristics. The image feature values f8, f9, f10, f11, f12 and f13 are the comparison results of the straight line attribute values between two sub-blocks, and the formula is as follows:
f8 = Len_S_A ≥ Len_S_B; f9 = Len_D_A ≥ Len_D_B; f10 = Ave_Len_A ≥ Ave_Len_B; (12)
f11 = Len_S_L ≥ Len_S_R; f12 = Len_D_L ≥ Len_D_R; f13 = Ave_Len_L ≥ Ave_Len_R; (13)
where Len_S_A, Len_S_B, Len_S_L and Len_S_R respectively represent the numbers of static lines in the upper, lower, left and right sub-blocks, Len_D_A, Len_D_B, Len_D_L and Len_D_R respectively represent the numbers of dynamic lines in the upper, lower, left and right sub-blocks, and Ave_Len_A, Ave_Len_B, Ave_Len_L and Ave_Len_R respectively represent the average lengths of all lines in the upper, lower, left and right sub-blocks.
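A sketch of the line attributes of step S204, assuming OpenCV's probabilistic Hough transform; the Canny and Hough parameters are assumptions, while the static/dynamic rule follows the angle ranges given above.

```python
# Sketch of step S204: detect salient straight lines with the Hough transform, classify
# them as static (inclination in (-15°, 15°) or (75°, 105°)) or dynamic, and compute the
# average line length. Canny/Hough parameters are illustrative, not values from the patent.
import cv2
import numpy as np

def line_attributes(rgb_block):
    """Return (number of static lines, number of dynamic lines, average line length)."""
    gray = cv2.cvtColor(rgb_block, cv2.COLOR_RGB2GRAY)
    edges = cv2.Canny(gray, 50, 150)
    lines = cv2.HoughLinesP(edges, 1, np.pi / 180, threshold=40,
                            minLineLength=20, maxLineGap=5)
    if lines is None:
        return 0, 0, 0.0
    n_static, n_dynamic, lengths = 0, 0, []
    for x1, y1, x2, y2 in lines[:, 0]:
        alpha = abs(np.degrees(np.arctan2(y2 - y1, x2 - x1)))   # inclination in [0, 180]
        if alpha > 90:
            alpha = 180 - alpha                                  # fold into [0, 90]
        if alpha < 15 or alpha > 75:                             # near-horizontal / near-vertical
            n_static += 1
        else:
            n_dynamic += 1
        lengths.append(np.hypot(x2 - x1, y2 - y1))
    return n_static, n_dynamic, float(np.mean(lengths))

A, B = (np.random.randint(0, 256, (120, 160, 3), dtype=np.uint8) for _ in range(2))
len_S_A, len_D_A, ave_len_A = line_attributes(A)
len_S_B, len_D_B, ave_len_B = line_attributes(B)
f8, f9, f10 = int(len_S_A >= len_S_B), int(len_D_A >= len_D_B), int(ave_len_A >= ave_len_B)
```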
S3: the convolutional Neural network CNN (convolutional Neural networks) is adopted to extract the high-level features of all abstract picture images, and the model refers to FIG. 3. The method comprises the following specific steps:
s301, adjusting four sub-blocks of the abstract picture into a 128 multiplied by 128 RGB color image;
s302, respectively inputting the four subblocks into a convolutional neural network CNN, wherein the convolutional neural network CNN comprises 3 convolutional layers with the step length of 1, 3 maximum pooling layers of 2 multiplied by 2 and 2 full-connection layers, a ReLU is adopted as an activation function in each convolutional layer, the dimensionalities of the two full-connection layers are respectively 1024 and 521, and finally, 512-dimensional vectors are respectively obtained and used as neural network characteristic vectors;
s303, judging the comparison result of the feature vectors of the upper sub-block, the lower sub-block and the left sub-block and the right sub-block, if true, representing as 1, otherwise, representing as 0, and taking the comparison result as image high-level feature values f14 and f15, wherein the calculation formula is as follows:
f14 = f_cnn_A ≥ f_cnn_B; f15 = f_cnn_L ≥ f_cnn_R; (14)
where f_cnn_A, f_cnn_B, f_cnn_L and f_cnn_R respectively represent the neural network feature vectors of the upper, lower, left and right sub-blocks. The dimensions of f14 and f15 are both 512.
In this embodiment, the CNN comprises 3 convolutional layers with a step length of 1; the activation function is ReLU, and each convolutional layer convolves the input with a set of filters to obtain feature maps. The first convolutional layer consists of 16 convolution kernels of 3 × 3; the second convolutional layer consists of 8 convolution kernels of 3 × 3; the third convolutional layer consists of 4 convolution kernels of 3 × 3, and the feature map obtained after each convolution is zero-padded at the edges so that its size is kept unchanged. The CNN contains 3 max-pooling layers of 2 × 2 to reduce resolution; the pooling layers down-sample the input data to reduce the number of parameters and avoid overfitting. The CNN contains 2 fully-connected layers that connect all neurons; their dimensions are 1024 and 512 respectively, and the final 512-dimensional vector is used as the neural network feature value, denoted f_cnn. The other parameter settings of the CNN are: batch_size is 8, epochs is 10, the learning rate is 1e-4, the cost function is the cross-entropy loss, and the optimizer is Adam.
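The network described above can be sketched in Keras as follows; the 512-dimensional f_cnn layer and the listed hyper-parameters follow the text, while the softmax head used for training is an assumption, since the text does not state how the feature extractor is supervised.

```python
# Keras sketch of the CNN of step S3: three 3x3 convolution stages with 16/8/4 kernels,
# "same" padding, ReLU, 2x2 max pooling, then fully-connected layers of 1024 and 512 units.
# The softmax classification head is an assumption, not a detail from the patent.
import tensorflow as tf
from tensorflow.keras import layers, models

def build_cnn(num_classes=4):
    inputs = layers.Input(shape=(128, 128, 3))
    x = inputs
    for n_kernels in (16, 8, 4):                     # three convolution stages
        x = layers.Conv2D(n_kernels, 3, strides=1, padding="same", activation="relu")(x)
        x = layers.MaxPooling2D(pool_size=2)(x)      # 128 -> 64 -> 32 -> 16
    x = layers.Flatten()(x)                          # 16 x 16 x 4 = 1024 values
    x = layers.Dense(1024, activation="relu")(x)
    f_cnn = layers.Dense(512, activation="relu", name="f_cnn")(x)     # 512-dim feature
    outputs = layers.Dense(num_classes, activation="softmax")(f_cnn)  # assumed training head
    return models.Model(inputs, outputs)

model = build_cnn()
model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=1e-4),
              loss="categorical_crossentropy", metrics=["accuracy"])
# model.fit(x_train, y_train, batch_size=8, epochs=10)
# feature_extractor = models.Model(model.input, model.get_layer("f_cnn").output)
```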
S4: and linearly combining the image features f1-f15 of S2 and S3, wherein the combined vector is the final feature value of the abstract picture image. The combined feature vector dimension is 1291.
S5: 400 paintings are randomly selected as the original images of the training set and the remaining 100 paintings as the test set, so that after rotation 1600 training samples and 400 test samples are finally obtained. To obtain more reliable classification results, the classification model is evaluated with 10-fold cross-validation. The feature values of the abstract paintings obtained in step S4 are put into a naive Bayes (NB) classifier for training and prediction, and each abstract painting is finally classified as upward, downward, leftward or rightward, thereby realizing automatic prediction of the direction of abstract picture images. The abstract picture image orientation recognition framework is shown in FIG. 4.
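The training and 10-fold cross-validation of step S5 can be sketched with scikit-learn as follows; the random matrices are placeholders standing in for the real 1291-dimensional 0/1 feature vectors and orientation labels.

```python
# Sketch of step S5: the 1291-dim binary feature vectors are fed to a Bernoulli naive
# Bayes classifier and evaluated with 10-fold cross-validation. X and y are random
# placeholders for the 1600 training samples and their orientation labels.
import numpy as np
from sklearn.naive_bayes import BernoulliNB
from sklearn.model_selection import cross_val_score

X = np.random.randint(0, 2, size=(1600, 1291))   # 0/1 feature vectors (placeholder)
y = np.random.randint(0, 4, size=1600)           # 0=up, 1=down, 2=left, 3=right (assumed coding)

clf = BernoulliNB()                              # Bernoulli NB suits the binary feature values
scores = cross_val_score(clf, X, y, cv=10)       # 10-fold cross-validation
print("mean cross-validation accuracy: %.3f" % scores.mean())
```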
When a naive Bayes classifier is used for two classifications ("upward" and "non-upward"), the ratio of the posterior probabilities is:
P(C_1 | F) / P(C_2 | F) = [P(C_1) · P(F | C_1)] / [P(C_2) · P(F | C_2)] = [P(C_1) · ∏_i P(f_i | C_1)] / [P(C_2) · ∏_i P(f_i | C_2)];
where F = [f1, f2, …, f15] represents the direction features of an abstract drawing image G, C_1 denotes the "upward" class and C_2 the "non-upward" class, P(C_1) and P(C_2) are the prior probabilities of the two classes, P(C_1 | F) and P(C_2 | F) respectively represent the posterior probabilities of the two classes, P(F | C_1) and P(F | C_2) respectively represent the conditional probabilities of all the features, and P(f_i | C_1) and P(f_i | C_2) are respectively the conditional probabilities of the i-th feature state.
All the features are discrete, and P(f_i | C_j) (i = 1, 2, …, 1291; j = 1, 2) follows a 0-1 distribution, so the conditional probability P(f_i | C_j) of each feature state can be calculated in the training phase. In the prediction stage, the class into which the abstract drawing G should be classified is determined according to the posterior probability ratio, with the following rule:
G is classified as C_1 ("upward") if P(C_1 | F) / P(C_2 | F) > T, and as C_2 ("non-upward") otherwise;
wherein T is a threshold, and the value of the threshold T in the embodiment of the present invention is 0.5.
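The two-class posterior-probability ratio above can be written out explicitly for 0/1 features; the Laplace smoothing, log-space computation and variable names below are implementation choices, not details taken from the patent.

```python
# Sketch of the "upward" vs "non-upward" posterior-probability ratio for 0/1 features.
# X_c1 / X_c2 hold the training feature vectors of the two classes; the smoothing and the
# log-space computation (to avoid underflow over 1291 features) are assumptions.
import numpy as np

def posterior_ratio(F, X_c1, X_c2, alpha=1.0):
    """P(C1|F) / P(C2|F) for a binary feature vector F under a naive Bayes model."""
    n1, n2 = len(X_c1), len(X_c2)
    prior1, prior2 = n1 / (n1 + n2), n2 / (n1 + n2)
    p1 = (X_c1.sum(axis=0) + alpha) / (n1 + 2 * alpha)    # P(f_i = 1 | C1), smoothed
    p2 = (X_c2.sum(axis=0) + alpha) / (n2 + 2 * alpha)    # P(f_i = 1 | C2), smoothed
    log_ratio = np.log(prior1) - np.log(prior2)
    log_ratio += np.where(F == 1, np.log(p1), np.log1p(-p1)).sum()
    log_ratio -= np.where(F == 1, np.log(p2), np.log1p(-p2)).sum()
    return float(np.exp(log_ratio))

T = 0.5                                                   # threshold from this embodiment
# is_upward = posterior_ratio(F, X_up, X_not_up) > T
```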
In this embodiment, the abstract pictures may further be classified into four categories by the naive Bayes classifier, i.e., the abstract picture image is identified as one of the four directions "up", "down", "left" and "right". The specific method is as follows: the four cases are divided into four groups; in each group, one of the directions θ is selected as one class C_θ, and the other three directions form the other class C_¬θ. Then the ratio of the posterior probabilities of the two classes in each group is calculated, with the following formula:
R_θ(F) = P(C_θ | F) / P(C_¬θ | F);
where R_θ(F) is the posterior probability ratio of the two classes in each group. The R_θ(F) values of the four groups are compared, and the direction whose R_θ(F) is largest is selected as the correct direction of the abstract picture image.
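For illustration, the four-direction decision can be sketched with scikit-learn's BernoulliNB by forming the four one-vs-rest groups and keeping the direction with the largest posterior ratio; the label coding and placeholder data are assumptions.

```python
# Sketch of the four-direction decision of this embodiment: for each direction θ, compare
# the class of θ against the class formed by the other three directions and keep the
# direction with the largest posterior-probability ratio. BernoulliNB estimates the
# posteriors for each one-vs-rest grouping; data and label coding are placeholders.
import numpy as np
from sklearn.naive_bayes import BernoulliNB

DIRECTIONS = {0: "up", 1: "down", 2: "left", 3: "right"}

def predict_direction(F, X_train, y_train):
    """Return the direction whose one-vs-rest posterior-probability ratio is largest."""
    ratios = {}
    for label, name in DIRECTIONS.items():
        grouped = (y_train == label).astype(int)        # 1 = selected direction, 0 = the rest
        clf = BernoulliNB().fit(X_train, grouped)
        p_rest, p_theta = clf.predict_proba(F.reshape(1, -1))[0]   # columns follow classes_ = [0, 1]
        ratios[name] = p_theta / max(p_rest, 1e-12)
    return max(ratios, key=ratios.get), ratios

# placeholder data standing in for the real features and labels
X_train = np.random.randint(0, 2, size=(1600, 1291))
y_train = np.random.randint(0, 4, size=1600)
F = np.random.randint(0, 2, size=1291)
direction, ratios = predict_direction(F, X_train, y_train)
print(direction)
```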
In order to fully verify the effectiveness and applicability of the method, the classification model was tested with the low-level features alone, the high-level features alone, and the fused features; the classification accuracies are shown in Table 1. The experimental results show that, whether the abstract picture images are divided into two classes or four classes, the highest classification accuracy is obtained with the fused low-level and high-level features.
Table 1: comparison of classification accuracy under different characteristics
[Table 1 is presented only as an image in the original; the accuracy values are not reproduced here.]
In addition, the fused features were also tested with other commonly used classifiers; the results are shown in Table 2. The results show that, since the feature values in the embodiment of the invention are all 1 or 0, the naive Bayes multi-classification model of the invention achieves higher classification accuracy.
Table 2: comparison of classification accuracy under different classifiers
[Table 2 is presented only as an image in the original; the accuracy values are not reproduced here.]
In summary, the invention provides a method for identifying the direction of an abstract drawing image based on feature fusion and naive Bayes, which obtains the feature value of the image by means of fusion of low-level and high-level features, and then puts the feature value of the image into a naive Bayes classifier (NB) for training and prediction, thereby realizing automatic prediction of the direction of the abstract drawing image, effectively identifying the direction of the image, namely, establishing the relationship between the visual content of the image and the correct direction in the framework of machine learning, and improving the prediction precision.
Finally, it should be noted that: the above embodiments are only used to illustrate the technical solution of the present invention, and not to limit the same; while the invention has been described in detail and with reference to the foregoing embodiments, it will be understood by those skilled in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some or all of the technical features may be equivalently replaced; and the modifications or the substitutions do not make the essence of the corresponding technical solutions depart from the scope of the technical solutions of the embodiments of the present invention.

Claims (7)

1. An abstract picture image direction identification method based on feature fusion and naive Bayes is characterized by comprising the following steps:
S1, rotating the abstract picture image by 0°, 90°, 180° and 270° to obtain four abstract picture images with different directions, and performing an upper-lower average segmentation and a left-right average segmentation on each abstract picture image, so that each abstract picture image is divided into an upper sub-block, a lower sub-block, a left sub-block and a right sub-block;
S2, extracting low-level features of the abstract picture image: the low-level feature description of each sub-block is calculated separately, and the comparison results of the low-level feature descriptions of the sub-blocks are taken as the image low-level feature values, where a comparison result of true is represented as 1 and false as 0;
S3, extracting high-level features of the abstract picture image with a convolutional neural network (CNN), which comprises the following concrete steps:
S301, resizing the four sub-blocks of the abstract picture into 128 × 128 RGB color images;
S302, respectively inputting the four sub-blocks into the convolutional neural network CNN, wherein the CNN comprises 3 convolutional layers with a step length of 1, 3 max-pooling layers of 2 × 2 and 2 fully-connected layers, ReLU is adopted as the activation function in each convolutional layer, and the dimensionalities of the two fully-connected layers are 1024 and 512 respectively, so that a 512-dimensional vector is finally obtained for each sub-block as its neural network feature vector;
S303, taking the comparison results of the feature vectors of the upper and lower sub-blocks and of the left and right sub-blocks as the image high-level feature values f14 and f15, calculated as follows:
f14 = f_cnn_A ≥ f_cnn_B; f15 = f_cnn_L ≥ f_cnn_R;
where f_cnn_A, f_cnn_B, f_cnn_L and f_cnn_R respectively represent the neural network feature vectors of the upper, lower, left and right sub-blocks;
S4, linearly combining the image low-level feature values obtained in step S2 and the image high-level feature values obtained in step S3 to obtain the final feature value of the abstract picture image;
and S5, performing the operations of S1-S4 on all abstract pictures in the image library to obtain the final feature values of the abstract picture images, inputting them into a naive Bayes classifier for training and prediction, and finally classifying each abstract picture as upward, downward, leftward or rightward, thereby realizing automatic prediction of the direction of the abstract picture images.
2. The method for identifying the direction of an abstract drawing image based on feature fusion and naive Bayes as claimed in claim 1, wherein said step S2 specifically comprises the following steps:
s201, converting the four subblocks in the step S1 from an RGB color space into HSV models, dividing the H-S space into 16 hues and 8 saturations, and counting the number of pixels of 128 colors to be used as a color histogram vector of an abstract picture; judging the comparison result of the histogram vectors of the upper sub-block, the lower sub-block and the left sub-block and the right sub-block as image characteristic values f1 and f2, wherein the specific formula is as follows:
f1 = Hist_A ≥ Hist_B; f2 = Hist_L ≥ Hist_R;
where Hist_A, Hist_B, Hist_L and Hist_R are respectively the histogram vectors of the upper, lower, left and right sub-blocks;
s202, representing the maximum gradient of the image as the complexity of the image, and calculating the complexity of four sub-blocks; judging the comparison result of the complexity of the upper sub-block, the lower sub-block and the left sub-block and the right sub-block as image characteristic values f3 and f4, wherein the following formula is adopted:
f3 = Comp_A ≥ Comp_B; f4 = Comp_L ≥ Comp_R;
where Comp_A, Comp_B, Comp_L and Comp_R respectively represent the complexities of the upper, lower, left and right sub-blocks;
s203, calculating the similarity between every two sub-blocks in the four sub-blocks; and the comparison result of the similarity between the sub-blocks is taken as the image feature values f5, and f6 and f 7; the formula is as follows:
f5=Sim(A,L)≥Sim(A,R);f6=Sim(B,L)≥Sim(B,R);f7=Sim(A,B)≥Sim(L,R);
s204, detecting the significant straight lines of the four sub-blocks by using Hough transformation, judging whether the straight lines are static lines or dynamic lines according to the inclination angles alpha of the straight lines, calculating the number of the static lines and the dynamic lines and the average length of all the lines as image characteristics, and respectively taking the comparison results of the straight line attribute values between the two sub-blocks as image characteristic values f8, f9, f10, f11, f12 and f13, wherein the formula is as follows:
f8 = Len_S_A ≥ Len_S_B; f9 = Len_D_A ≥ Len_D_B; f10 = Ave_Len_A ≥ Ave_Len_B;
f11 = Len_S_L ≥ Len_S_R; f12 = Len_D_L ≥ Len_D_R; f13 = Ave_Len_L ≥ Ave_Len_R;
where Len_S_A, Len_S_B, Len_S_L and Len_S_R respectively represent the numbers of static lines in the upper, lower, left and right sub-blocks, Len_D_A, Len_D_B, Len_D_L and Len_D_R respectively represent the numbers of dynamic lines in the upper, lower, left and right sub-blocks, and Ave_Len_A, Ave_Len_B, Ave_Len_L and Ave_Len_R respectively represent the average lengths of all lines in the upper, lower, left and right sub-blocks.
3. The method for identifying the direction of an abstract picture image based on feature fusion and naive Bayes as claimed in claim 2, wherein in said step S202, the calculation formula of the complexity is as follows:
G_max(x, y) = max(||∇G_R(x, y)||, ||∇G_G(x, y)||, ||∇G_B(x, y)||);
Comp_G = ( Σ_{(x, y)∈G} G_max(x, y) ) / Pixelnum(G);
where G_max(x, y) represents the maximum gradient of a pixel point (x, y) of the image in the RGB color space, ∇G_R(x, y), ∇G_G(x, y) and ∇G_B(x, y) respectively represent the gradients of the R, G and B channels of the image at point (x, y), Pixelnum(G) represents the total number of pixels of image G, and Comp_G represents the complexity of the image G.
4. The method for identifying the direction of an abstract picture image based on feature fusion and naive Bayes as claimed in claim 2, wherein in said step S201, the formula for converting the image from RGB color space to HSV model is as follows:
r′ = r/255; g′ = g/255; b′ = b/255;
kmax = max(r′, g′, b′); kmin = min(r′, g′, b′); Δ = kmax − kmin;
h = 60° × ((g′ − b′)/Δ mod 6) if kmax = r′; h = 60° × ((b′ − r′)/Δ + 2) if kmax = g′; h = 60° × ((r′ − g′)/Δ + 4) if kmax = b′; h = 0 if Δ = 0;
s = Δ/kmax if kmax ≠ 0, otherwise s = 0;
v = kmax;
where r, g and b respectively represent the RGB values of an image pixel in the RGB color space, r′, g′ and b′ are intermediate variables, kmax represents the maximum value and kmin the minimum value of r′, g′ and b′, and h, s and v represent the hue, saturation and value (brightness) of the image pixel in the HSV model.
5. The method for identifying the direction of the abstract picture image based on feature fusion and naive Bayes as claimed in claim 1, wherein the network structure of the convolutional neural network CNN is as follows: the first convolutional layer consists of 16 convolution kernels of 3 × 3; the second convolutional layer consists of 8 convolution kernels of 3 × 3; the third convolutional layer consists of 4 convolution kernels of 3 × 3; the feature map obtained after each convolution is zero-padded at the edges so that its size is kept unchanged; after each convolutional layer, the feature resolution is reduced with 2 × 2 max pooling; finally, the four 16 × 16 two-dimensional matrices are converted into a 1024-dimensional feature vector using the fully-connected layers, and the 1024 dimensions are reduced to 512 dimensions.
6. The method for identifying the direction of an abstract drawing image based on feature fusion and naive Bayes as claimed in claim 1, wherein in said step S4, the vector dimension of the final feature value of the linearly combined abstract drawing image is 1291.
7. The method for identifying the direction of the abstract drawing image based on the feature fusion and naive Bayes as claimed in claim 1, wherein in said step S5, the concrete method for performing four classifications of "upward", "downward", "leftward" and "rightward" when the naive Bayes classifier predicts the direction of the abstract drawing image is:
these four cases are divided into four groups: one direction is selected as one type in each group, the other three directions are used as the other types, the ratio of the posterior probabilities of the two types in each group is calculated, and the calculation formula is as follows:
R_θ(F) = P(C_θ | F) / P(C_¬θ | F);
where R_θ(F) is the posterior probability ratio of the two classes in each group, P(C_θ | F) represents the posterior probability of the selected direction θ, and P(C_¬θ | F) represents the posterior probability of the class formed by the remaining three directions; the posterior probability ratios R_θ(F) of the four groups are compared, and the direction with the largest R_θ(F) is taken as the correct direction of the abstract picture image.
CN202010737934.9A 2020-07-28 2020-07-28 Abstract picture image direction identification method based on feature fusion and naive Bayes Active CN111950565B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010737934.9A CN111950565B (en) 2020-07-28 2020-07-28 Abstract picture image direction identification method based on feature fusion and naive Bayes

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010737934.9A CN111950565B (en) 2020-07-28 2020-07-28 Abstract picture image direction identification method based on feature fusion and naive Bayes

Publications (2)

Publication Number Publication Date
CN111950565A true CN111950565A (en) 2020-11-17
CN111950565B CN111950565B (en) 2022-05-20

Family

ID=73338368

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010737934.9A Active CN111950565B (en) 2020-07-28 2020-07-28 Abstract picture image direction identification method based on feature fusion and naive Bayes

Country Status (1)

Country Link
CN (1) CN111950565B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106557771A (en) * 2016-11-17 2017-04-05 电子科技大学 Skin disease color of image feature extracting method based on Naive Bayes Classifier
US20170220879A1 (en) * 2014-07-28 2017-08-03 Clarion Co., Ltd. Object detection apparatus
CN110276278A (en) * 2019-06-04 2019-09-24 刘嘉津 Insect image identification entirety and the recognition methods of multiple clips comprehensive automation
CN110956184A (en) * 2019-11-18 2020-04-03 山西大学 Abstract diagram direction determination method based on HSI-LBP characteristics

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170220879A1 (en) * 2014-07-28 2017-08-03 Clarion Co., Ltd. Object detection apparatus
CN106557771A (en) * 2016-11-17 2017-04-05 电子科技大学 Skin disease color of image feature extracting method based on Naive Bayes Classifier
CN110276278A (en) * 2019-06-04 2019-09-24 刘嘉津 Insect image identification entirety and the recognition methods of multiple clips comprehensive automation
CN110956184A (en) * 2019-11-18 2020-04-03 山西大学 Abstract diagram direction determination method based on HSI-LBP characteristics

Non-Patent Citations (7)

* Cited by examiner, † Cited by third party
Title
JIA LIU等: "Orientation judgment for abstract paintings", 《MULTIMEDIA TOOLS AND APPLICATIONS》 *
KUNAL SWAMI等: "Why my photos look sideways or upside down? Detecting canonical orientation of images using convolutional neural networks", 《2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW)》 *
YANG XU等: "Nonlocal Patch Tensor Sparse Representation for Hyperspectral Image Super-Resolution", 《IEEE TRANSACTIONS ON IMAGE PROCESSING》 *
王晓慧 et al.: "Emotional design based on deep learning", Packaging Engineering *
白茹意 et al.: "A survey of aesthetic research methods for painting images", Journal of Image and Graphics *
贾春花 et al.: "A survey of painting feature extraction methods and emotion analysis research", Journal of Image and Graphics *
郭小英 et al.: "A survey of computable image complexity evaluation methods", Acta Electronica Sinica *

Also Published As

Publication number Publication date
CN111950565B (en) 2022-05-20

Similar Documents

Publication Publication Date Title
US9633282B2 (en) Cross-trained convolutional neural networks using multimodal images
Narihira et al. Learning lightness from human judgement on relative reflectance
KR102449841B1 (en) Method and apparatus for detecting target
JP4335476B2 (en) Method for changing the number, size, and magnification of photographic prints based on image saliency and appeal
Karayev et al. Recognizing image style
US6738494B1 (en) Method for varying an image processing path based on image emphasis and appeal
CN101630363B (en) Rapid detection method of face in color image under complex background
CN109151501A (en) A kind of video key frame extracting method, device, terminal device and storage medium
CN109948566B (en) Double-flow face anti-fraud detection method based on weight fusion and feature selection
CN112070044B (en) Video object classification method and device
Almogdady et al. A flower recognition system based on image processing and neural networks
EP1700269A2 (en) Detection of sky in digital color images
CN110929593A (en) Real-time significance pedestrian detection method based on detail distinguishing and distinguishing
CN107169417B (en) RGBD image collaborative saliency detection method based on multi-core enhancement and saliency fusion
CN1975759A (en) Human face identifying method based on structural principal element analysis
CN106529494A (en) Human face recognition method based on multi-camera model
CN109740539B (en) 3D object identification method based on ultralimit learning machine and fusion convolution network
CN109190456B (en) Multi-feature fusion overlook pedestrian detection method based on aggregated channel features and gray level co-occurrence matrix
CN109325434A (en) A kind of image scene classification method of the probability topic model of multiple features
CN110956184A (en) Abstract diagram direction determination method based on HSI-LBP characteristics
KR20180092453A (en) Face recognition method Using convolutional neural network and stereo image
CN111950565B (en) Abstract picture image direction identification method based on feature fusion and naive Bayes
CN115661618A (en) Training method of image quality evaluation model, image quality evaluation method and device
Bhandari et al. Image aesthetic assessment using deep learning for automated classification of images into appealing or not-appealing
Sharma et al. Image Fusion with Deep Leaning using Wavelet Transformation

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20230629

Address after: No. 304-314, No. 16 (Plant B), Huifeng East Second Road, Zhongkai High tech Zone, Huizhou, Guangdong Province, 516000

Patentee after: HUIZHOU WEIMILI TECHNOLOGY Co.,Ltd.

Address before: 030006 No. 92, Hollywood Road, Taiyuan, Shanxi

Patentee before: SHANXI University