Summary of the invention
In view of the problems described above, the object of this invention is to provide a method, based on the notion of the image subject, for recognizing traditional Chinese painting images and calligraphy images by extracting the features of the subject of each image.
To achieve the above object, the present invention adopts the following technical scheme: a subject-based method for recognizing traditional Chinese painting images and calligraphy images, comprising the steps of:

(1) scanning historical (pre-modern) Chinese traditional painting works and calligraphy works with a scanner to obtain sample images of the painting works and calligraphy works;

(2) performing top-down image preprocessing on the sample images, comprising the steps of: 1. converting the sample image from the RGB color space to the HSV color space; 2. applying Canny edge detection to the HSV sample image obtained in step 1; 3. applying edge dilation to the edge-detected sample image obtained in step 2; 4. applying region filling to the dilated sample image obtained in step 3; 5. gathering color statistics over the background region outside the filled area of the sample image obtained in step 4, i.e. accumulating each component in the HSV color space and computing the component means Ave_H, Ave_S and Ave_V; 6. traversing the scanned sample image pixel by pixel, computing the differences between the H, S and V components of each pixel and the means Ave_H, Ave_S and Ave_V, and comparing the differences against a threshold; pixels within the threshold range are regarded as belonging to the left-white (blank) region and are set to a single uniform color;

(3) randomly selecting training sample images and test sample images from the scanned sample images;

(4) extracting the subject feature vector of each training sample image and training a classifier, comprising the steps of: 1. obtaining the grey-level histogram (256 grey levels) of the training sample image preprocessed in step (2); 2. counting, for each bin of the histogram, the number of occurrences in the training sample image, giving the total Total, and finally generating a 256-dimensional subject feature vector for each training sample image, which completes the extraction of the subject feature vectors; to account for the differing sizes of the training sample images, each bin count is normalized by the image area:

F(i) = bin(i) / (Wide × High),  i = 0, 1, ..., 255

wherein Wide and High denote the width and height of the training sample image, respectively;
(5) training the sample image classifier, extracting the subject feature vector from each test sample image, and performing recognition with the trained classifier, comprising the steps of: 1. training a sample image classifier on the extracted subject feature vectors of the training sample images using a machine learning model; 2. extracting the subject feature vector of each test sample image; 3. feeding the extracted subject feature vector of the test sample image to the trained sample image classifier and obtaining the recognition result.
In step (3), randomly selecting training sample images and test sample images from the scanned sample images comprises the steps of: 1. defining the sample image classes, numbered 1 or 0, where 1 denotes a traditional Chinese painting sample image and 0 denotes a calligraphy sample image; 2. denoting the set of sample images to be recognized as I = {I1, I2}, where I1 denotes the calligraphy sample images, I1 = {C1, C2, ..., Cn}, with Ci (i = 1, 2, ..., n) a scanned calligraphy sample image, and I2 denotes the traditional Chinese painting sample images, I2 = {P1, P2, ..., Pn}, with Pi (i = 1, 2, ..., n) a scanned traditional Chinese painting sample image; 3. randomly selecting a set number of sample images from I1 and I2 respectively as the training sample image set T = {I1', I2'}, where I1' denotes the calligraphy training sample images and I2' the traditional Chinese painting training sample images, and taking the remaining sample images in I1 and I2 as the test sample image set {e1, e2, ..., em}, where ei (i = 1, 2, ..., m) is a test sample image.
In step (5) 1., the algorithm used to train the machine learning model is one of: a decision tree algorithm, an artificial neural network, a support vector machine algorithm, or a Bayesian learning algorithm.
By adopting the above technical scheme, the present invention has the following advantages: 1. Based on the notion of the image subject, the invention extracts the features of the subjects of traditional Chinese painting images and calligraphy images and thereby realizes their recognition. 2. Through the preprocessing of the painting and calligraphy images, the left-white (blank) regions of the image backgrounds are normalized, which makes the features of each image's subject stand out and facilitates feature extraction from the subjects. 3. When extracting subject features, the invention eliminates the influence of image size on the subject feature vector. The invention can therefore be widely applied to the recognition of traditional Chinese painting images and calligraphy images.
Embodiment
The present invention is described in detail below with reference to the accompanying drawings and embodiments.
As shown in Figure 1, the method of the present invention for recognizing traditional Chinese painting images and calligraphy images comprises the following steps:
1. Scan historical (pre-modern) Chinese traditional painting works and calligraphy works to obtain sample images of the painting works and calligraphy works.
As shown in Fig. 2 and Fig. 3, the sample images of the present invention were obtained by scanning with an Epson Expression 10000XL scanner.
2. Preprocess the sample images and extract the features of the sample image subjects.
A sample image mainly comprises a left-white (blank) region and a main scene region. Given the characteristics of such sample images, once the left-white region has been set to a single uniform color, the remaining main scene region better highlights the features of the sample image; this main scene region is therefore defined as the subject of the sample image.
Owing to their age, the left-white regions of the sample images have mostly discolored, and this discoloration interferes with the extraction of the subject features. The purpose of the image preprocessing is to set the left-white region of the sample image to a single uniform color, reducing the interference with subject feature extraction.
As shown in Figure 4, the present invention performs top-down image preprocessing on the sample images; the processing comprises the following steps:
(1) First, convert the color space of the sample image from RGB (the red, green, blue color space) to HSV (the hue, saturation, value color space). The RGB color space is not perceptually uniform: distances in RGB do not represent the color similarity perceived by the human eye, so although the representation is simple, it differs considerably from human perception. The HSV color space is better suited to handling color features. It consists of three components, hue H, saturation S and value (brightness) V, and is closer to human visual perception. Hue H distinguishes colors such as red, orange and green, with component values in the range 0~360; saturation S represents the depth of a color, with component values in the range 0~1; value V represents the brightness of a color, is influenced by the strength of the light source, and is usually measured as a percentage from 0% (black) to 100% (white). The HSV color model corresponds to the Munsell three-dimensional color system: each perceptual color component can vary independently, the space is linearly scalable, and the perceptible color difference is proportional to the Euclidean distance between points in HSV space. The conversion from the RGB color space to the HSV color space is computed as follows (with R, G, B normalized to [0, 1]):

V = max(R, G, B)
S = (V − min(R, G, B)) / V   (S = 0 when V = 0)
H = 60 × (G − B) / (V − min(R, G, B)),         if V = R
H = 120 + 60 × (B − R) / (V − min(R, G, B)),   if V = G
H = 240 + 60 × (R − G) / (V − min(R, G, B)),   if V = B
(H is increased by 360 when negative)

In the formulas, R, G and B denote the red, green and blue intensity components of each pixel of the sample image.
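As an illustrative sketch (not part of the claimed method), the conversion above can be coded as follows; the function name `rgb_to_hsv` and the [0, 1] input range are assumptions of the example.

```python
def rgb_to_hsv(r, g, b):
    """Convert one pixel from RGB to HSV.
    r, g, b are assumed normalized to [0, 1]; returns (H, S, V) with
    H in [0, 360), S and V in [0, 1]."""
    mx, mn = max(r, g, b), min(r, g, b)
    v = mx                                    # value = max component
    s = 0.0 if mx == 0 else (mx - mn) / mx    # saturation = chroma / value
    if mx == mn:                              # grey pixel: hue undefined, use 0
        h = 0.0
    elif mx == r:
        h = (60.0 * (g - b) / (mx - mn)) % 360.0
    elif mx == g:
        h = 60.0 * (b - r) / (mx - mn) + 120.0
    else:
        h = 60.0 * (r - g) / (mx - mn) + 240.0
    return h, s, v
```

For example, pure red (1, 0, 0) maps to H = 0, S = 1, V = 1.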
(2) As shown in Fig. 5 and Fig. 6, apply Canny edge detection to the V component of the sample image in the HSV color space, comprising the following steps:
1. Smooth the sample image with a Gaussian filter. The two-dimensional Gaussian function is:

G(x, y) = (1 / (2πδ²)) · exp(−(x² + y²) / (2δ²))

In the formula, δ is the parameter (standard deviation) of the Gaussian filter, which controls the degree of smoothing, and x, y are the coordinates used to generate the Gaussian mask. A suitable mask is computed from this formula, and Gaussian smoothing is realized by standard convolution. The computed Gaussian mask is shown below:
2   4   5   4   2
4   9  12   9   4
5  12  15  12   5
4   9  12   9   4
2   4   5   4   2
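A minimal sketch of how such a mask can be generated from the two-dimensional Gaussian function (the function name and the choice δ = 1.4 are assumptions of the example; the integer mask above corresponds to such a kernel after scaling and rounding):

```python
import math

def gaussian_mask(size=5, delta=1.4):
    """Sample G(x, y) = 1/(2*pi*delta^2) * exp(-(x^2 + y^2)/(2*delta^2))
    on a size x size grid centred at the origin, then normalize the
    weights to sum to 1 so convolution preserves overall brightness.
    The constant factor 1/(2*pi*delta^2) cancels in the normalization."""
    half = size // 2
    mask = [[math.exp(-(x * x + y * y) / (2.0 * delta * delta))
             for x in range(-half, half + 1)]
            for y in range(-half, half + 1)]
    total = sum(sum(row) for row in mask)
    return [[w / total for w in row] for row in mask]
```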
2. Use the Sobel gradient operator to compute the gradient estimate at each pixel.
The Sobel gradient operator has two 3 × 3 convolution kernels: Gx, the gradient component in the horizontal direction, and Gy, the gradient component in the vertical direction, computed with the kernels:

Gx = | −1  0  +1 |        Gy = | +1  +2  +1 |
     | −2  0  +2 |             |  0   0   0 |
     | −1  0  +1 |             | −1  −2  −1 |

The gradient magnitude (edge strength) is computed as:

|G| = |Gx| + |Gy|.

3. If the horizontal gradient component Gx and the vertical gradient component Gy are known, the direction angle is computed as:

θ = arctan(Gy / Gx).

If the horizontal gradient component Gx is 0, the direction angle depends on the vertical gradient component Gy: it is 90° when Gy is nonzero.

4. Each pixel of the sample image connects to its neighboring pixels along only 4 possible directions: 0° (horizontal), 45° (positive diagonal), 90° (vertical) and 135° (negative diagonal). The direction angle is therefore quantized to these 4 angles:

0°: 0°~22.5°, 157.5°~180°;  45°: 22.5°~67.5°;
90°: 67.5°~112.5°;  135°: 112.5°~157.5°.
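As an illustrative sketch with assumed names (not the claimed implementation), the Sobel kernels and the quantization of the direction angle into the 4 directions above can be expressed as:

```python
import math

# Standard Sobel convolution kernels (one common sign convention;
# orientation conventions vary between references)
SOBEL_X = [[-1, 0, 1],
           [-2, 0, 2],
           [-1, 0, 1]]    # horizontal gradient component Gx
SOBEL_Y = [[ 1,  2,  1],
           [ 0,  0,  0],
           [-1, -2, -1]]  # vertical gradient component Gy

def quantize_direction(gx, gy):
    """Quantize the gradient direction angle to 0, 45, 90 or 135 degrees,
    following the angle ranges given above."""
    if gx == 0:
        return 90 if gy != 0 else 0   # vertical edge when Gx vanishes
    theta = math.degrees(math.atan(gy / gx)) % 180.0
    if theta < 22.5 or theta >= 157.5:
        return 0
    if theta < 67.5:
        return 45
    if theta < 112.5:
        return 90
    return 135
```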
5. If the gradient value of a pixel of the sample image is the maximum along its direction angle, the pixel is kept; otherwise it is removed. The set of all pixels whose gradient value is maximal along the direction angle constitutes the set of possible edge points.
6. Set two gradient thresholds, a high threshold TH and a low threshold TL, where TH is generally taken as 2~3 times TL. First remove from the set of possible edge points all pixels whose gradient value is less than the high threshold TH, obtaining the edge point set F. Then process the set M of pixels whose gradient values lie between the two thresholds: if a point in M is adjacent to a point in the edge point set F, that point is added to F. The final edge point set F is the edge point set of the sample image.
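The double-threshold step can be sketched as follows (an illustrative re-implementation; the dictionary representation of candidate edge points and 8-neighbour adjacency are assumptions of the example):

```python
def hysteresis(candidates, th, tl):
    """candidates: {(x, y): gradient magnitude} for the possible edge
    points that survived non-maximum suppression. Points >= th seed the
    edge set F; points in [tl, th) join F only if adjacent to a point
    already in F. Returns the final edge point set F."""
    edges = {p for p, g in candidates.items() if g >= th}
    weak = {p for p, g in candidates.items() if tl <= g < th}
    grown = True
    while grown:                       # grow F until no weak point can join
        grown = False
        for (x, y) in list(weak):
            if any((x + dx, y + dy) in edges
                   for dx in (-1, 0, 1) for dy in (-1, 0, 1)):
                edges.add((x, y))
                weak.discard((x, y))
                grown = True
    return edges
```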
(3) As shown in Fig. 7 and Fig. 8, apply dilation to the edge-detected sample image obtained above, as follows. Let A be the edge of the sample image obtained above and B a 4 × 4 structuring element (this structuring element was selected because experiments showed that it yields the best recognition results). The edge dilation of A by B follows the formula:

A ⊕ B = { z | (B̂)z ∩ A ≠ ∅ }

In the formula, A and B are sets in Z² (the two-dimensional integer space), z is an element of that space, ⊕ denotes the dilation operation, and (B̂)z denotes the reflection of B translated to the point z = (z1, z2).
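For binary point sets, the dilation A ⊕ B reduces to translating A by every offset in B (a minimal sketch; the set-of-coordinates representation is an assumption of the example):

```python
def dilate(a, b):
    """Dilation of edge-point set a by structuring element b, both given
    as sets of (x, y) integer coordinates; returns a (+) b, the set of
    all sums of a point of a and an offset of b."""
    return {(ax + bx, ay + by) for (ax, ay) in a for (bx, by) in b}
```

Dilating a single point by a 2 × 2 structuring element, for instance, thickens it into a 2 × 2 block.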
(4) As shown in Fig. 9 and Fig. 10, apply region filling to the dilated sample image obtained above. Region filling is based on dilation, complementation and set intersection; its formula is:

Xk = (Xk−1 ⊕ B) ∩ Ac,  k = 1, 2, 3, ...

In the formula, Ac denotes the complement of A, Xk−1 is the point set within the region being filled at the previous step, and k is the iteration step of the algorithm.
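The iteration above can be sketched as follows (illustrative; the grid bound, the 4-connected cross structuring element and the function name are assumptions of the example):

```python
def region_fill(edge, seed, width, height):
    """Iterate X_k = (X_{k-1} (+) B) ∩ A^c from a seed point inside the
    edge set A until X_k stops changing; returns the filled interior
    united with its boundary. B is the 4-connected cross element."""
    b = {(0, 0), (1, 0), (-1, 0), (0, 1), (0, -1)}
    # complement of the edge set, restricted to the image grid
    complement = {(x, y) for x in range(width) for y in range(height)} - edge
    x_k = {seed}
    while True:
        dilated = {(px + dx, py + dy) for (px, py) in x_k for (dx, dy) in b}
        x_next = dilated & complement
        if x_next == x_k:          # converged: region fully filled
            return x_k | edge
        x_k = x_next
```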
(5) As shown in Fig. 11 and Fig. 12, process the background region of the region-filled sample image obtained above. The background region of the sample image, i.e. everything outside the filled area, is regarded as the left-white region and is denoted I_B. The color information of the left-white region is gathered, chiefly the mean of each component in the HSV color space, Ave_H, Ave_S and Ave_V:

Ave_H = (1/N) Σ hk,  Ave_S = (1/N) Σ sk,  Ave_V = (1/N) Σ vk,  k = 1, ..., N

In the formulas, hk, sk and vk denote the hue, saturation and value intensity components of the k-th pixel of the left-white region, and N is the number of pixels in that region.
The scanned sample image is then traversed pixel by pixel over the whole image. For each pixel, the differences between the H, S and V components of the pixel in the HSV color space and the means Ave_H, Ave_S and Ave_V are computed and compared against a threshold T_P (obtained experimentally; the threshold lies between 0.15 and 0.2). Pixels whose differences all fall within the threshold T_P are regarded as belonging to the left-white region, and the left-white region is set to a single uniform color, here white (as an example, without limitation). The computation is:

i_pex = white,      if |i_pex_h − Ave_H| ≤ T_P and |i_pex_s − Ave_S| ≤ T_P and |i_pex_v − Ave_V| ≤ T_P
i_pex = unchanged,  otherwise

wherein white means the pixel is set to white, unchanged means the original sample image pixel is kept, i_pex denotes a pixel of the sample image, and i_pex_h, i_pex_s, i_pex_v denote the values of the H, S and V components of that pixel in the HSV color space.
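The background whitening rule can be sketched as follows (illustrative; component values are assumed normalized to [0, 1] so that a single threshold T_P applies to all three components, and the names are assumptions of the example):

```python
def whiten_left_white(pixels, ave_h, ave_s, ave_v, t_p=0.18):
    """pixels: list of (h, s, v) tuples for the whole image; ave_*: the
    component means measured on the background (left-white) region.
    A pixel whose H, S and V all lie within t_p of the means is set to
    white (H=0, S=0, V=1); every other pixel is kept unchanged."""
    out = []
    for h, s, v in pixels:
        if (abs(h - ave_h) <= t_p and abs(s - ave_s) <= t_p
                and abs(v - ave_v) <= t_p):
            out.append((0.0, 0.0, 1.0))   # left-white pixel -> white
        else:
            out.append((h, s, v))         # subject pixel unchanged
    return out
```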
As shown in Fig. 13 to Fig. 16, the present invention starts from the creative characteristics of traditional Chinese painting works and calligraphy works: in calligraphy works the ink is applied fairly evenly, whereas traditional Chinese painting works require layered shades of ink. After the above data preprocessing, the subject of the sample image stands out more clearly. In the grey-level histograms, the horizontal axis represents the grey levels 0~255 (256 levels in total), and the vertical axis is the number of pixels of the sample image occurring at each grey level.
3. Randomly select training sample images and test sample images from the sample images.
The sample images are divided into training sample images and test sample images; the labeling method comprises the steps of: (1) defining the sample image classes, numbered 1 or 0, where 1 denotes a traditional Chinese painting sample image and 0 denotes a calligraphy sample image; (2) denoting the set of sample images to be recognized as I = {I1, I2}, where I1 denotes the calligraphy sample images, I1 = {C1, C2, ..., Cn}, with Ci (i = 1, 2, ..., n) a scanned calligraphy sample image, and I2 denotes the traditional Chinese painting sample images, I2 = {P1, P2, ..., Pn}, with Pi (i = 1, 2, ..., n) a scanned traditional Chinese painting sample image; (3) randomly selecting a set number of sample images from I1 and I2 respectively as the training sample image set T = {I1', I2'}, where I1' denotes the calligraphy training sample images and I2' the traditional Chinese painting training sample images, and taking the remaining sample images in I1 and I2 as the test sample image set {e1, e2, ..., em}, where ei (i = 1, 2, ..., m) is a test sample image.
4. On the basis of the preprocessed sample images, extract the subject features of the training sample images and train the classifier. The subject feature extraction proceeds as follows: the grey-level histogram (256 grey levels) of each training sample image is obtained through the image preprocessing described above; for each bin of the histogram, the number of occurrences in the sample image is counted, giving the total Total; finally a 256-dimensional subject feature vector is generated for each training sample image. To account for the differing sizes of the training sample images, the present invention normalizes each bin count by the image area:

F(i) = bin(i) / (Wide × High),  i = 0, 1, ..., 255

wherein Wide and High denote the width and height of the training sample image, respectively.
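The area-normalized histogram feature can be sketched as follows (illustrative; the function and parameter names are assumptions of the example):

```python
def subject_feature(gray_pixels, wide, high):
    """Build the 256-dimensional subject feature vector: a 256-bin
    grey-level histogram with each bin count divided by the image area
    wide * high, so differently sized images yield comparable vectors."""
    feature = [0.0] * 256
    for g in gray_pixels:        # g is an integer grey level in 0..255
        feature[g] += 1.0
    area = float(wide * high)
    return [count / area for count in feature]
```

Since every pixel falls into exactly one bin, the components of the resulting vector sum to 1 regardless of image size.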
The present invention uses a support vector machine (as an example, without limitation) to train on the training sample images; after training, a sample image classifier model `model` is obtained. The experiments use the toolkit provided by LIBSVM, and training can be expressed with the following function:

model = svmtrain(T_F, label, options)

In this function call, svmtrain is the support vector machine training routine; T_F denotes the extracted subject feature vectors of the training sample images; label denotes the class labels of the corresponding training sample images, where the values 0 and 1 denote calligraphy sample images and traditional Chinese painting sample images respectively; and options selects the parameters. For example, the parameter string options = '-t 2 -s 0 -b 1 -c 1' means that the kernel function is selected with -t 2, the SVM type is C-SVC (-s 0), the C-SVC penalty coefficient is 1 (-c 1), and probability estimates are required (-b 1).
5. Extract the subject feature vectors of the test sample images and recognize them with the trained sample image classifier, completing the recognition of the test sample images, comprising the steps of: (1) preprocessing the test sample images; (2) performing subject feature extraction on the preprocessed test sample images to generate their subject feature vectors; (3) feeding the subject feature vectors of the test sample images to the trained sample image classifier to obtain the recognition results.
The recognition results of the present invention were obtained with a support vector machine (as an example, without limitation) on the MATLAB R2008a software platform, yielding the predictions pre and the accuracy acc for the test sample images. The support vector machine prediction uses the following function:

[pre, acc] = svmpredict(label_1, H_F, model, '-b 1')

In this function call, svmpredict is the prediction function, label_1 is the class labels of the test sample images, H_F is the subject feature vectors generated from the test sample images, and model is the trained sample image classifier.
The recognition result can be evaluated with the following formula:

Accuracy = n_R / N_Total

In the formula, n_R is the number of test sample images identified correctly, and N_Total is the total number of test sample images.
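Equivalently (a trivial sketch with assumed names), given the predicted labels and the ground-truth labels of the test set:

```python
def recognition_rate(pre, truth):
    """Recognition accuracy n_R / N_Total, where n_R counts the test
    sample images whose predicted label matches the true label."""
    n_r = sum(1 for p, t in zip(pre, truth) if p == t)
    return n_r / len(truth)
```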
The present invention is illustrated by verification on the following tests of traditional Chinese painting images and calligraphy images. The sample images used in the tests were obtained by scanning the "Complete Works of Chinese Painting" and the "Complete Works of Chinese Calligraphy" to build the sample image library, from which training sample images and test sample images were then chosen at random, as shown in the table:
The results obtained through the tests are as follows:
The above results show that the image recognition method of the present invention achieves very satisfactory recognition results and facilitates the annotation and retrieval of traditional Chinese painting images and calligraphy images.
The above embodiments serve only to illustrate the present invention; each step may be varied. On the basis of the technical scheme of the present invention, any improvement to or equivalent replacement of an individual step or proportion carried out according to the principle of the invention shall not be excluded from the protection scope of the present invention.