CN105404682B

CN105404682B - A kind of book retrieval method based on digital image content

Info

Publication number: CN105404682B
Application number: CN201510849994.9A
Authority: CN
Inventors: 公绪超; 吴柯维; 郭长全
Original assignee: Beijing Zhuo Is Looked Logical Science And Technology Ltd Co Of Intelligence
Current assignee: Beijing Zhuo Is Looked Logical Science And Technology Ltd Co Of Intelligence
Priority date: 2015-06-12
Filing date: 2015-11-29
Publication date: 2019-06-18
Anticipated expiration: 2035-11-29
Also published as: CN105404682A

Abstract

The book retrieval method based on digital image content that the invention discloses a kind of, the following steps are included: the sample image selected is carried out resolution ratio normalization operation, and carry out the iamge description of multiple features level, then learning training is carried out to every kind of books, different training identification models is obtained with this；The book image to be retrieved correction that will acquire is rectangle, then carries out resolution ratio normalization operation to it, carries out the iamge description of multiple features level to it after obtaining out corresponding image, and characteristic is inputted in training identification model to the prediction for carrying out initial results；After obtaining initial predicted result, judge that two class results of maximum possible differentiate that the Gaussian Profile at center, that highest one kind of probability value are determined as final search result with it respectively.The present invention makes full use of the low-level image feature characteristic of image and by study automatically and model probability identification tactic, improves retrieval rate, can carry out the book retrieval work based on digital picture on a large scale.

Description

A kind of book retrieval method based on digital image content

Technical field

It is an important application in field of image search, more particularly, to one the invention belongs to computer vision field Book retrieval method of the kind based on digital image content.

Background technique

Fuzzy C-Means Cluster Algorithm is a kind of self-adaption cluster learning algorithm, it can be instructed according to different classes of input Practice sample characteristics and calculate corresponding eigencenter automatically, to provide reliable priori for subsequent identification and detection work Knowledge.The advantages of this method is that and sample size comprehensive in sample characteristics characterization is able to carry out under conditions of sufficient divides well Class work, it is stronger simultaneously for the tolerance power of bad data, it is adapted to a variety of different data environments.Therefore the algorithm is extensive Applied to multiple images process fields such as data mining, image classification, image retrieval, image object identification, Video object trackings. Being continuously increased with data volume in recent years, the application of the technology have pushed the hair of computer vision technique to a certain extent Exhibition.

Book retrieval based on image is an important application of image retrieval.On the one hand it can be quick according to image Corresponding book information is retrieved, to eliminate the link for manually inputting text query book information；On the other hand logical The difficulty that image retrieval enormously simplifies information searching is crossed, the search pattern of What You See Is What You Get is completed.

Have at present it is some in relation to the book retrieval based on image in terms of research, also there are some corresponding methods, In considerable method all used characteristics of image and priori study.From the book retrieval technical standpoint based on image come point, greatly It causes that two major classes can be divided into: the book image retrieval based on characteristics of image description and the book image retrieval based on information labeling, That pays close attention to herein focuses on the former.On the whole, the book image retrieval based on information labeling, due to will receive markup information Inaccurate, markup information cannot not meet the influence such as all kinds of identification situations effectively comprehensively so that the identification of such method be easy by To influence, and the environment adapted to also limits to relatively, without one-to-one retrieval is effectively performed, additionally due to information labeling work Work amount is difficult in maintenance greatly, and institute receives to a certain extent in this way using limitation.Comparatively based on the side of characteristics of image Method accuracy rate is higher, and the multiple features description of image can effectively analyze the information such as edge, color, the angle point of books, then pass through Clustering can find the line of demarcation between different books in the case where training sample is sufficiently large, furthermore such method for The interference of extraneous data has stronger tolerance, can adapt to more identification scene.But the method based on training study needs The feature of description image comprehensively is chosen adequately to obtain classifying face, in addition, the output for different cluster learning algorithms needs Find a kind of acquisition of suitable result acquisition rule progress final result.

With the continuous development of graph image major field, the function of image retrieval can be more and more perfect, for books figure It is higher and higher as retrieving the meeting of requirement, but the environment of retrieval but becomes increasingly complex.Therefore it makes a kind of reliable, robust Book retrieval recognizer be very it is necessary to.

Summary of the invention

It is an object of the invention in view of the deficienciess of the prior art, proposing a kind of figure based on digital image content Book search method.This method has incorporated clustering learning and probability distribution is established based on the various features for describing book image Thought, substantially increase the efficiency of book retrieval, the book image retrieval under different scenes can be effectively performed, it is right There is positive effect in image indexing system.

To achieve the above object, the invention discloses a kind of the book retrieval method based on digital image content, this method The following steps are included:

The sample image selected is subjected to resolution ratio normalization operation, and the image for carrying out multiple features level is retouched It states, learning training then is carried out to every kind of books, different training identification models is obtained with this；

The book image to be retrieved correction that will acquire is rectangle, then carries out resolution ratio normalization to it, is obtaining out phase Characteristic is inputted in the trained identification model and is carried out just by the iamge description for carrying out multiple features level after the image answered to it The prediction of beginning result；First two result comprising current books maximum possible to be retrieved in the prediction result；

After obtaining initial predicted result, judge that two class results of maximum possible differentiate the Gauss point at center with it respectively Cloth, that highest one kind of probability value, is determined as final search result.

Preferably, the iamge description of multiple features level include: describe book image integral color distribution color moment characteristics, One of the edge histogram feature at entire image edge and the corner feature for describing book image angle point or are described more Kind；

Preferably, sample image all carries out resolution ratio normalization operation when carrying out book image feature extraction.

Preferably, sample image is when carrying out color Moment Feature Extraction, after resolution ratio normalization operation, first Gamma filtering is carried out, then by sample picture breakdown at there is overlapping image block, wherein every two adjacent image district Block Duplication is 0.3 to 0.5 times of its area, then calculates the color feature value of respective block in image, finally statistics output The colouring information of entire image.

Preferably, sample image carries out piecemeal to sample image first when carrying out color Moment Feature Extraction Processing.

Preferably, sample image is when carrying out corner feature extraction, to coloured silk after resolution ratio normalization operation Chromatic graph picture carries out gray processing processing.

Preferably, sample image terminates when carrying out edge histogram feature extraction in resolution ratio normalization operation Gaussian filtering is carried out to each color channel image in color image afterwards, by picture breakdown at there is overlapping image block, Wherein every two adjacent image block Duplication is 0.3 to 0.5 times of its area, then the statistical chart in each image block As the gradient distribution situation within the scope of 360 degree, the marginal information description value of finally statistics output entire image.

Preferably, after each category feature for obtaining description image, sample characteristics are gathered with Fuzzy C-Means Cluster Algorithm Class study will merge selected sample characteristics data before training study to obtain whole training learning parameter.

Preferably, the prediction steps for carrying out initial results include:

Before calculating acquisition initial results, center traversal is carried out according to the breadth-first strategy of tree and calculates similarity, most Maximum two class of similarity is by as initial candidate result afterwards.

Preferably, training identification model is to carry out study instruction to every kind of books by the learning algorithm based on Clustering It is obtained after white silk.

The present invention can comprehensively describe color, angle point and the marginal information of image, and characteristic clustering technique can be right The interference of extraneous data has stronger tolerance, and can adapt to a variety of identification scenes.It is after obtaining initial predicted result Accurate fusion results use gaussian probability and differentiate theory, effectively raise the accuracy rate of book retrieval.

Detailed description of the invention

Fig. 1 is a kind of book retrieval method flow schematic diagram based on digital image content provided in an embodiment of the present invention；

Fig. 2 is color moment feature extracting method schematic process flow diagram provided in an embodiment of the present invention；

Fig. 3 is edge histogram feature extracting method schematic process flow diagram provided in an embodiment of the present invention；

Fig. 4 is corner feature extracting method schematic process flow diagram provided in an embodiment of the present invention.

Specific embodiment

In conjunction with attached drawing, the basic thought of the embodiment of the present invention will be entirely known for the actual conditions of book image retrieval It does not work and is divided into three parts.Before carrying out book image retrieval, rectangle correction, normalization are carried out again to book image first Series of features extraction and study and training based on Clustering are carried out, to obtain in the feature clustering in book image library The heart；Then cognitive phase is corrected into rectangle again according to obtained book image to be retrieved, row normalizes and feature extraction, by institute Feature is put into the model of book image library and obtains initial results；Finally further according to gaussian probability distribution theory to initial results into Row fusion is to obtain search result to the end.Above method is adapted to a variety of identification scenes, and improves in comparable degree Accuracy of identification.

Fig. 1 is a kind of book retrieval method flow schematic block based on digital image content provided in an embodiment of the present invention Figure.As shown in Figure 1, the method comprising the steps of 101-103:

Resolution ratio normalization operation is carried out in step 101, by the sample image selected, and carries out multiple features level Iamge description, learning training then is carried out to every kind of books, different training identification models is obtained with this.

Specifically, in the learning training stage of book image retrieval, rectangle is carried out to corresponding sample image first Correction, resolution ratio normalization operation, then extract edge histogram feature (the extracting method process of description image integral edge information Figure is as shown in Figure 2), color moment characteristics (extracting method flow chart is as shown in Figure 3) of description image color information and for describing The corner feature (extracting method flow chart is as shown in Figure 4) of image angle point, then by the learning algorithm based on Clustering into Row Category learning, it is intended to provide good differentiation identification model for subsequent specific identification process.

Preferably, sample image is when carrying out multiclass feature extraction, original scale of the resolution ratio according to image, normalizing Turn to suitable size；Sample picture will first carry out gal after resolution ratio normalization when carrying out color feature extracted Horse filtering, then by picture breakdown at there is overlapping image block, wherein every two adjacent image block Duplication is its face Long-pending 0.3 to 0.5 times is influenced caused by retrieval as the rotation translation of image with resistance, then calculates respective block in image Color feature value, finally statistics output entire image colouring information；Sample image when carrying out Edge Gradient Feature, Gaussian filtering is carried out to each color channel image in color image after resolution ratio normalization, the mesh done so Be reduce by Gaussian noise bring edge-description error.In this link equally by picture breakdown at there is overlapping image district Then block is united wherein every two adjacent image block Duplication is 0.3 to 0.5 times of its area in each image block Gradient distribution situation of the image within the scope of 360 degree is counted, the marginal information description value of finally statistics output entire image；Sample graph Book image will carry out at gray processing it to improve arithmetic speed after image normalization when carrying out corner feature extraction Reason；When carrying out corner feature extraction, gray processing processing is carried out to color image after resolution ratio normalization operation.

As a kind of improvement of the embodiment of the present invention, the embodiment of the present invention is used after each category feature for obtaining description image Fuzzy C-Means Cluster Algorithm to sample characteristics carry out clustering learning, training study before will to selected sample characteristics data into Row merges to obtain whole training learning parameter.By the learning algorithm of fuzzy C-means clustering thought, every kind of books are carried out Learning training obtains different training identification models.

It is rectangle in step 102, the book image to be retrieved that will acquire correction, resolution ratio normalization behaviour is then carried out to it Make, carry out the iamge description of multiple features level to it after obtaining out corresponding image, characteristic is inputted into the training and is known The prediction of initial results is carried out in other model；First two knot comprising current books maximum possible to be retrieved in the prediction result Fruit.

Specifically, in the retrieval stage of book image retrieval, firstly, will be shot according to corresponding rectangle correction algorithm Obtained irregular books carry out rectangle correction, then carry out resolution ratio normalization to it, then extract corresponding color moment characteristics, These characteristics are finally inputted the prediction in identification model into initial results by edge histogram feature and corner feature；In advance Two classes are as a result, be used to reduce the identification model of corresponding book image before surveying the highest comprising current book image to be detected in result It encloses.

Preferably, the embodiment of the present invention carries out center according to the breadth-first strategy of tree before calculating acquisition initial results Similarity is traversed and calculates, last maximum two class of similarity is by as initial candidate result.

In step 103, after obtaining initial predicted result, judge that two class results of maximum possible differentiate center with it respectively Gaussian Profile, probability value is highest, and that is a kind of, is determined as final search result.

Specifically, during final search result differentiates, first according to the characteristic attribute of input sample, pass through Gauss Probability distribution method fusion differentiates the probability value of current sample in all kinds of, finally obtains the accurate output of prediction result, wherein Corresponding book information be probability results predicted value it is maximum that.

Require book image relatively clear in this course and books shared by area should be larger, this is to effective Carrying out book retrieval is very important.In view of the operational performance of algorithm is using based on center tree when with feature decision Breadth first search method.The probability output method of result should be used in last result differentiates.

Preferably, the embodiment of the present invention needs to carry out phase according to the method for breadth first search when obtaining initial results It is measured like degree；After obtaining initial results gaussian probability be distributed under conditions of, it is carried out probability differentiate it is final to obtain As a result.

The embodiment of the present invention is on the basis of image characteristics extraction, image very widely used in field of image processing Low-level image feature, Clustering and probability forecasting method have been effectively combined together.The training study of first part can basis Selected books sample image obtains identification model.In cognitive phase, it is based on according to the initial predicted result specifically obtained The result of Probabilistic merges, until obtaining final result.

The present invention can be under the premise of ensureing completion basic identification function, and structure is simple, and complexity is low, and efficiency of algorithm is high, It is suitble to apply in Books Retrieve System.

It is clear that under the premise of without departing from true spirit and scope of the present invention, invention described herein can be with There are many variations.Therefore, all it will be apparent to those skilled in the art that change, be intended to be included in present claims Within the scope of book is covered.Scope of the present invention is only defined by described claims.

Claims

1. a kind of book retrieval method based on digital image content, which comprises the following steps:

The sample image selected is subjected to resolution ratio normalization operation, and carries out the iamge description of multiple features level, so Learning training is carried out to every kind of books afterwards, different training identification models is obtained with this；

The book image to be retrieved correction that will acquire is rectangle, then carries out resolution ratio normalization operation to it, is obtaining out phase Characteristic is inputted in the trained identification model and is carried out just by the iamge description for carrying out multiple features level after the image answered to it The prediction of beginning result；First two result comprising current books maximum possible to be retrieved in prediction result；

After obtaining initial predicted result, judge that two class results of maximum possible differentiate the Gaussian Profile at center with it respectively, generally That highest one kind of rate value, is determined as final search result.

2. according to the method for claim 1, it is characterised in that, the iamge description of the multiple features level includes: description figure The color moment characteristics of book image integral color distribution describe the edge histogram feature at entire image edge and for describing books One of corner feature of image angle point is a variety of.

3. according to the method for claim 1, it is characterised in that, the sample image is mentioned in progress book image feature Resolution ratio normalization operation is all carried out when taking.

4. according to the method for claim 2, it is characterised in that, the sample image is carrying out color Moment Feature Extraction When, after resolution ratio normalization operation, gamma filtering is first carried out, then by sample picture breakdown at there is overlapping figure As block, wherein every two adjacent image block Duplication is 0.3 to 0.5 times of its area, then calculate corresponding in image The color feature value of block, the colouring information of finally statistics output entire image.

5. according to the method for claim 2, it is characterised in that, the sample image is carrying out color Moment Feature Extraction When, piecemeal processing is carried out to sample image first.

6. according to the method for claim 2, it is characterised in that, the sample image is carrying out corner feature extraction When, gray processing processing is carried out to color image after resolution ratio normalization operation.

7. according to the method for claim 2, it is characterised in that, the sample image is carrying out edge histogram feature When extraction, gaussian filtering is carried out to each color channel image in color image after resolution ratio normalization operation, By picture breakdown at there is overlapping image block, wherein every two adjacent image block Duplication is 0.3 to the 0.5 of its area Times, then gradient distribution situation of the statistical picture within the scope of 360 degree in each image block, finally counts output whole picture figure The marginal information description value of picture.

8. method according to claim 1 or 2, which is characterized in that after each category feature for obtaining description image, with fuzzy C means clustering algorithm carries out clustering learning to sample characteristics, to close to selected sample characteristics data before training study And to obtain whole training learning parameter.

9. according to the method for claim 1, it is characterised in that, the prediction steps for carrying out initial results include:

Before calculating acquisition initial results, center traversal is carried out according to the breadth-first strategy of tree and calculates similarity, last phase Like maximum two class of degree by as initial candidate result.

10. being obtained the method according to claim 1, wherein described carry out learning training to every kind of books with this Different training identification model steps include:

Using the learning algorithm based on Clustering, learning training is carried out to every kind of books, obtains different training identification models.