CN106469437A

CN106469437A - Image processing method and image processing apparatus

Info

Publication number: CN106469437A
Application number: CN201510508531.6A
Authority: CN
Inventors: 蒋树强; 宋新航; 贺志强
Original assignee: Lenovo Beijing Ltd; Institute of Computing Technology of CAS
Current assignee: Lenovo Beijing Ltd; Institute of Computing Technology of CAS
Priority date: 2015-08-18
Filing date: 2015-08-18
Publication date: 2017-03-01
Anticipated expiration: 2035-08-18
Also published as: CN106469437B

Abstract

An image processing method and an image processing apparatus are provided. The image processing method includes: evenly dividing an image to be processed into a plurality of image blocks; obtaining semantic description information of the image blocks; and determining semantic description information of the image based on the spatial positions of the image blocks in the image and the semantic description information of the image blocks. In the technical solution of the embodiments of the present disclosure, related image blocks across the whole image can be organically linked to determine the semantic description information of the image, which better matches the way people habitually understand images.

Description

Image processing method and image processing apparatus

Technical field

The present invention relates to the field of information technology, and in particular to an image processing method and an image processing apparatus.

Background art

Image understanding (IU) is the semantic understanding of images. Taking images as its object and knowledge as its core, image understanding studies the targets in an image, the relationships between those targets, the scene of the image, and its applications.

Semantic description information serves as a basic carrier for describing knowledge: it can convert complete image content into a text-like language expression that can be understood intuitively, and it plays a vital role in image understanding. Rich semantic description information about images can support more accurate image search engines, the generation of intelligent digital photo albums, and visual scene description in virtual worlds.

A typical way of generating semantic description information for an image is to segment the image based on its visual features to obtain a plurality of image regions, where the regions obtained by segmentation are independent of one another. The semantic description information of each image region is then analyzed, and the semantic description information of the whole image is obtained from the semantic description information of the individual regions.

Summary of the invention

Embodiments of the present disclosure provide an image processing method and an image processing apparatus that offer a new way of determining the semantic description information of an image. This approach organically links related image blocks and better matches the way people habitually understand images.

In a first aspect, an image processing method is provided. The image processing method may include: evenly dividing an image to be processed into a plurality of image blocks; obtaining semantic description information of the image blocks; and determining semantic description information of the image based on the spatial positions of the image blocks in the image and the semantic description information of the image blocks.

With reference to the first aspect, in one implementation of the first aspect, obtaining the semantic description information of the image blocks may include: obtaining a Gaussian mixture model of the image content corresponding to each piece of semantic description information; and determining the semantic description information of the image block according to the image block and the Gaussian mixture models.

With reference to the first aspect and the above implementation, in another implementation of the first aspect, determining the semantic description information of the image block according to the image block and the Gaussian mixture models may include: determining, according to the similarity between the image block and each Gaussian mixture model, a probability estimate that the image block belongs to each image content; and determining the semantic description information of the image block according to the probability estimates that the image block belongs to each image content.

With reference to the first aspect and the above implementations, in another implementation of the first aspect, determining the semantic description information of the image based on the spatial positions of the image blocks in the image and the semantic description information of the image blocks may include: determining a weight of each image block according to the semantic description information of the image block and the spatial position of the image block in the image; and determining the semantic description information of the image according to the semantic description information of the image blocks and the weights of the image blocks.

With reference to the first aspect and the above implementations, in another implementation of the first aspect, determining the weight of the image block according to the semantic description information of the image block and the spatial position of the image block in the image may include: calculating, according to the semantic description information of the image block, the similarity between the image block and its adjacent image blocks; and setting the weight of the image block based on the similarity between the image block and its adjacent image blocks.

With reference to the first aspect and the above implementations, in another implementation of the first aspect, determining the semantic description information of the image based on the spatial positions of the image blocks in the image and the semantic description information of the image blocks may include: determining, based on the spatial positions of the image blocks in the image and the semantic description information of the image blocks, adjacent image blocks having similar semantic description information as a content aggregation region, and taking the similar semantic description information as the semantic description information of the content aggregation region; determining a weight of the content aggregation region according to the semantic description information of the content aggregation region and the spatial position of the content aggregation region in the image; and determining the semantic description information of the image according to the weight of the content aggregation region and the semantic description information of the content aggregation region.

In a second aspect, an image processing apparatus is provided. The image processing apparatus may include: a memory; and a processor configured to perform the following operations: evenly dividing an image to be processed into a plurality of image blocks; obtaining semantic description information of the image blocks; and determining semantic description information of the image based on the spatial positions of the image blocks in the image and the semantic description information of the image blocks.

In a third aspect, an image processing apparatus is provided. The image processing apparatus may include a dividing unit 910, an image block semantics determining unit 920, and an image semantics determining unit 930. The dividing unit 910 evenly divides an image to be processed into a plurality of image blocks. The image block semantics determining unit 920 obtains the semantic description information of the image blocks. The image semantics determining unit 930 determines the semantic description information of the image based on the spatial positions of the image blocks in the image and the semantic description information of the image blocks.

In the technical solution of the image processing method and the image processing apparatus according to the embodiments of the present disclosure, the semantic description information of the whole image is determined based on the spatial position of each image block in the whole image and the semantic description information of each image block. Related image blocks across the whole image are thereby organically linked, which better matches the way people habitually understand images.

Brief description of the drawings

To describe the technical solutions of the embodiments of the present disclosure more clearly, the accompanying drawings required for describing the embodiments or the prior art are briefly introduced below. The drawings described below show only some embodiments of the present disclosure, and a person of ordinary skill in the art may derive other drawings from them.

Fig. 1 is a flowchart schematically illustrating an image processing method according to an embodiment of the present disclosure.

Fig. 2 schematically illustrates an example of an image divided into a plurality of image blocks.

Fig. 3 is a flowchart schematically illustrating how the semantic description information of each image block is obtained in the image processing method of Fig. 1.

Fig. 4 schematically illustrates the semantic description information obtained for an image block.

Fig. 5 is a flowchart illustrating a first example of determining the semantic description information of the image in the image processing method of Fig. 1.

Fig. 6 schematically illustrates the semantic description information determined for the image.

Fig. 7 is a flowchart illustrating a second example of determining the semantic description information of the image in the image processing method of Fig. 1.

Fig. 8 is a block diagram schematically illustrating a first image processing apparatus according to an embodiment of the present disclosure.

Fig. 9 is a block diagram schematically illustrating a second image processing apparatus according to an embodiment of the present disclosure.

Detailed description of the embodiments

The technical solutions in the embodiments of the present disclosure are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some, rather than all, of the embodiments of the present disclosure.

A machine generally cannot understand the content of an image, which makes image search difficult. In the embodiments of the present disclosure, the image to be processed is processed so that it can be converted into semantic description information that can be understood intuitively. Based on this semantic description information, more accurate image search engines can be provided, intelligent digital photo albums can be generated, and visual scenes in virtual worlds can be described. The semantic description information of an image can also be used for image annotation, image recognition, and the like. The image to be processed may be an image found through a web search, or an image captured with a camera or a similar device. The manner in which the image to be processed is acquired does not limit the embodiments of the present disclosure.

Fig. 1 is a flowchart schematically illustrating an image processing method 100 according to an embodiment of the present disclosure. As shown in Fig. 1, the image processing method 100 includes: evenly dividing an image to be processed into a plurality of image blocks (S110); obtaining semantic description information of the image blocks (S120); and determining semantic description information of the image based on the spatial positions of the image blocks in the image and the semantic description information of the image blocks (S130).

In S110, the image to be processed is evenly divided into a plurality of image blocks. The image to be processed is usually represented by the numerical values of its pixels. Each pixel may be represented by a gray value, by three primary color components, and so on. An image typically consists of a pixel matrix arranged in rows and columns. As an example of dividing into a plurality of image blocks, the image may be divided successively so that each image block includes 16 × 16 pixels.

Fig. 2 schematically illustrates an example of an image to be processed that has been divided into a plurality of image blocks. The image is a seashore scene: a light blue sky at the top, an azure sea in the middle, and a golden sandy beach at the bottom. Assume the seashore image has 128 × 112 pixels and each image block includes 16 × 16 pixels. Starting from one end of the pixel matrix of the image and dividing along the row and column directions in units of 16 × 16 pixels, the seashore image is divided into 128/16 = 8 image blocks per row and 112/16 = 7 image blocks per column, for a total of 56 (8 × 7) image blocks, as shown by the grid lines in Fig. 2. Each image block is denoted B(i, j), where i is the row index of the image block in the whole image and j is its column index. The image block in the upper right corner, for example, is B(1,8).

Fig. 2 illustrates the case where the pixel dimensions of the image to be processed divide evenly. When the pixel dimensions of the image are not an integer multiple of the image block size, the division likewise starts from one end of the pixel matrix and proceeds along the row and column directions in units of 16 × 16 pixels, and the remaining image areas smaller than 16 × 16 pixels are also treated as image blocks.
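The division in S110 can be sketched as follows. This is a minimal illustration rather than the patented implementation: the function name `split_into_blocks`, the nested-list grayscale image representation, and the choice to start from the top-left corner are assumptions made for the example.

```python
from typing import Dict, List, Tuple

Image = List[List[int]]  # rows of grayscale pixel values (assumed representation)


def split_into_blocks(image: Image, block_size: int = 16) -> Dict[Tuple[int, int], Image]:
    """Divide an image into block_size x block_size tiles keyed by 1-based (row, col).

    Tiles on the bottom and right edges are smaller when the image size is
    not an integer multiple of block_size, so no pixels are discarded.
    """
    height = len(image)
    width = len(image[0]) if height else 0
    blocks: Dict[Tuple[int, int], Image] = {}
    for i, top in enumerate(range(0, height, block_size), start=1):
        for j, left in enumerate(range(0, width, block_size), start=1):
            blocks[(i, j)] = [row[left:left + block_size]
                              for row in image[top:top + block_size]]
    return blocks


# The 128 x 112 seashore example: 8 blocks per row, 7 per column, 56 in total.
seashore = [[0] * 128 for _ in range(112)]
blocks = split_into_blocks(seashore)
print(len(blocks), len(blocks[(1, 8)]), len(blocks[(1, 8)][0]))
```

Keying the result by `(i, j)` mirrors the B(i, j) notation, so later steps can look up a block's neighbors directly from its position.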

In the embodiments of the present disclosure, the image to be processed is evenly divided into a plurality of image blocks, which is a simple way of dividing. By contrast, other image processing approaches for generating the semantic description information of an image may need to segment the image based on its visual features to obtain a plurality of image regions, which is very complex.

In S120, the semantic description information is obtained for each of the image blocks divided in S110. The meaning of data is its semantics, and semantic description information is information used to describe semantics. Image data in itself is merely symbols; only data that has been given meaning can be used, at which point the data has been converted into semantic description information. The semantic description information is, for example, a text-like language expression that a machine can understand intuitively. Taking Fig. 2 as an example, the semantic description information of each image block of the seashore image may be the probability distribution of that image block being at least one of sky, ocean, and beach.

Fig. 3 is a flowchart schematically illustrating how the semantic description information of each image block is obtained (S120) in the image processing method of Fig. 1. As shown in Fig. 3, obtaining the semantic description information of each of the image blocks includes: obtaining a Gaussian mixture model of the image content corresponding to each piece of semantic description information (S121); determining, according to the similarity between the image block and each Gaussian mixture model, a probability estimate that the image block belongs to each image content (S122); and determining the semantic description information of the image block according to the probability estimates that the image block belongs to each image content (S123).

In S121, training images of each image content corresponding to each piece of semantic description information may be obtained in advance from a database, and the Gaussian mixture model of each image content may be obtained based on the training images of that image content. For example, training images of each image content are stored in the database in advance, where the image contents include, for example, sky, ocean, beach, meadow, tree, and mountain. The image contents are the contents that may be involved in the semantic description information of various images.

Each image content may have different appearances; for example, the color and brightness of the sky usually differ under different weather conditions. Accordingly, there may be multiple training images for each image content, for example 512. For each image content, Gaussian models may be built based on the multiple training images to characterize that image content. A Gaussian model uses normal distribution curves to quantify the image content accurately, decomposing the gray levels, colors, and so on of the image content into several models formed from normal distribution curves. For each image content, taking a weighted average of the corresponding Gaussian models yields the Gaussian mixture model of that image content. The manner in which the Gaussian mixture models are obtained does not limit the embodiments of the present disclosure. Generally, the training images and the Gaussian mixture model of each image content are obtained in advance, and a database is built from them.

In S122, the probability estimate that each image block belongs to each image content is determined according to the similarity between the image block and the Gaussian mixture model of each image content. Take the image block in the upper right corner of Fig. 2 as an example: the image data of the 16 × 16 pixels of image block B(1,8) is matched against the Gaussian mixture model of each image content obtained in S121. For example, the image data of image block B(1,8) is matched against the Gaussian mixture models of sky, ocean, beach, meadow, tree, and mountain, respectively, to determine the probability estimates that image block B(1,8) belongs to sky, ocean, beach, meadow, tree, and mountain. As shown in Fig. 2, the upper-right image block is sky, so the operation of S122 can determine that image block B(1,8) has the highest probability of belonging to sky, while the probability of it belonging to any of ocean, beach, meadow, tree, or mountain may be very small or even zero. Here it is assumed that the probabilities that image block B(1,8) belongs to sky, ocean, beach, meadow, tree, and mountain are 90%, 6%, 4%, 0, 0, and 0, respectively.
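One possible reading of S122 is sketched below. The text does not say which feature is matched against the models or how similarity becomes a probability, so everything specific here is an assumption: the feature is a block's mean RGB color, each mixture has diagonal covariance, the mixture parameters in `MODELS` are invented for illustration, and the probability estimate is the mixture density normalized across contents with equal priors.

```python
import math
from typing import Dict, List, Sequence, Tuple

# One mixture component: (mixing weight, per-channel mean, per-channel variance).
Component = Tuple[float, Sequence[float], Sequence[float]]


def gmm_density(x: Sequence[float], components: List[Component]) -> float:
    """Density of feature vector x under a diagonal-covariance Gaussian mixture."""
    total = 0.0
    for weight, mean, var in components:
        log_p = 0.0
        for xi, mu, v in zip(x, mean, var):
            log_p += -0.5 * math.log(2.0 * math.pi * v) - (xi - mu) ** 2 / (2.0 * v)
        total += weight * math.exp(log_p)
    return total


def content_probabilities(x: Sequence[float],
                          models: Dict[str, List[Component]]) -> Dict[str, float]:
    """Normalize per-content densities into a probability estimate (equal priors assumed)."""
    densities = {name: gmm_density(x, comps) for name, comps in models.items()}
    norm = sum(densities.values())
    if norm == 0.0:  # every density underflowed: fall back to a uniform estimate
        return {name: 1.0 / len(models) for name in models}
    return {name: d / norm for name, d in densities.items()}


# Hypothetical mixtures over mean RGB color; the numbers are illustrative only.
MODELS: Dict[str, List[Component]] = {
    "sky": [(0.6, (140.0, 200.0, 245.0), (400.0, 400.0, 400.0)),
            (0.4, (190.0, 200.0, 210.0), (400.0, 400.0, 400.0))],
    "ocean": [(1.0, (20.0, 80.0, 160.0), (400.0, 400.0, 400.0))],
    "beach": [(1.0, (220.0, 190.0, 120.0), (400.0, 400.0, 400.0))],
}

block_feature = (150.0, 200.0, 240.0)  # mean color of a light blue block
probs = content_probabilities(block_feature, MODELS)
print(max(probs, key=probs.get))
```

The returned dictionary plays the role of the per-block probability estimate; a real system would fit the mixtures to the training images instead of hard-coding them.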

In S123, the semantic description information of the image block is determined according to the probability estimates that the image block belongs to each image content. For example, the probabilities that each image block belongs to each image content may be represented with a histogram, or with a proportion chart. Depending on the specific requirements on the semantic description information, the probability estimates may be processed as appropriate to generate the required semantic description information.

For each image block in the image to be processed, the similarity between the image block and the Gaussian mixture model of each image content is evaluated to determine the probability estimate that the image block belongs to each image content, and the semantic description information of the image block is determined based on the probability estimates. The semantic description information of every image block in the image to be processed is thereby obtained.

Determining the semantic description information of the image block according to the image block and the Gaussian mixture models, as in S122 and S123 of Fig. 3, is only an example. Other ways of determining the semantic description information of each image block based on the Gaussian mixture models may also be used.

Fig. 4 schematically illustrates the semantic description information obtained for an image block. As shown in Fig. 4, the operations described with reference to Fig. 3 are performed on the upper-right image block among the image blocks shown in Fig. 2, and the semantic description information of the upper-right image block is obtained and represented as a histogram. It can be seen from Fig. 4 that the probability that image block B(1,8) belongs to sky is far greater than the probability that it belongs to ocean or beach, so it can essentially be determined that image block B(1,8) belongs to sky. For convenience, Fig. 4 shows the semantic description information of only one image block.

In fact, the processing of S120 yields the semantic description information of every image block in Fig. 2, that is, semantic description information in one-to-one correspondence with the image blocks. Continuing with Fig. 4 as an example, Fig. 4 mainly shows a scene of sky, ocean, and beach. For image blocks showing sky, semantic description information with a high probability of belonging to sky is obtained, for example for the image blocks in the first and second rows of Fig. 4. For image blocks showing ocean, semantic description information with a high probability of belonging to ocean is obtained, for example for the image blocks in the third and fourth rows of Fig. 4. For image blocks showing beach, semantic description information with a high probability of belonging to beach is obtained, for example for the image blocks in the sixth and seventh rows of Fig. 4.

In S130, the semantic description information of the image is determined based on the spatial positions of the image blocks in the image and the semantic description information of the image blocks. As an example, the weight of each image block may be determined according to the semantic description information of the image block and the spatial position of the image block in the image, and the semantic description information of the image may then be determined according to the semantic description information and the weights of the image blocks. Setting a weight for each image block according to its spatial position in the image makes it possible to distinguish the importance of the individual image blocks and thus to express the semantic description information of the whole image more accurately. In other words, the image blocks divided in S110 are no longer independent of one another; instead, the semantic description information of the image is determined based on the spatial position of each image block in the image.

Fig. 5 is a flowchart illustrating a first example of determining the semantic description information of the image (S130) in the image processing method of Fig. 1. As shown in Fig. 5, the similarity between each image block and its adjacent image blocks is calculated according to the semantic description information of the image blocks (S131); the weight of the image block is set based on the similarity between the image block and its adjacent image blocks (S132); and the semantic description information of the image is determined according to the semantic description information and the weights of the image blocks (S133).

In S131, the closer the semantic description information of two adjacent image blocks, the greater the similarity between the two adjacent image blocks. Taking the image blocks in Fig. 4 as an example, assume that each image block involves only three image contents, namely sky, ocean, and beach.

For image block B(1,4) in Fig. 4, the adjacent image blocks are B(1,3), B(1,5), and B(2,4). The image content of B(1,4) and of its adjacent image blocks is dominated by sky, with ocean and beach occupying only a small part, so the similarity between image block B(1,4) and its adjacent image blocks is high. When calculating the similarity, both the image contents of each image block and the proportion occupied by each image content are considered. As an example, the similarity between two image blocks may be calculated from the Euclidean distance between their semantic description information. In this embodiment, the four-neighborhood shown in Fig. 4 is used as the adjacent image blocks; an eight-neighborhood or an even larger neighborhood may also be used, which is not limited here.

For image block B(2,4) in Fig. 4, the adjacent image blocks are B(2,3), B(2,5), B(1,4), and B(3,4). In image blocks B(2,4), B(2,3), B(2,5), and B(1,4), the image content is dominated by sky, with ocean and beach occupying only a small part. In image block B(3,4), however, the image content is dominated by ocean, with sky and beach occupying only a small part. Image block B(2,4) therefore has a high similarity to its adjacent image blocks B(2,3), B(2,5), and B(1,4) but a low similarity to its adjacent image block B(3,4), which lowers the overall similarity between image block B(2,4) and its adjacent image blocks. As a result, the similarity between image block B(2,4) and its adjacent image blocks is lower than the similarity between image block B(1,4) and its adjacent image blocks.
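The neighborhood lookup and the distance computation of S131 can be sketched as follows. The helper names, the 7 × 8 grid size taken from the Fig. 2 example, and the probability triples are assumptions for illustration; a smaller distance corresponds to a higher similarity.

```python
import math
from typing import List, Sequence, Tuple

Pos = Tuple[int, int]  # 1-based (row, column), as in B(i, j)


def description_distance(p: Sequence[float], q: Sequence[float]) -> float:
    """Euclidean distance between two semantic descriptions (probability vectors)."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def neighbors(pos: Pos, n_rows: int, n_cols: int, connectivity: int = 4) -> List[Pos]:
    """Positions adjacent to pos inside the grid, for a 4- or 8-neighborhood."""
    i, j = pos
    offsets = [(-1, 0), (1, 0), (0, -1), (0, 1)]
    if connectivity == 8:
        offsets += [(-1, -1), (-1, 1), (1, -1), (1, 1)]
    return [(i + di, j + dj) for di, dj in offsets
            if 1 <= i + di <= n_rows and 1 <= j + dj <= n_cols]


# Illustrative descriptions over (sky, ocean, beach).
sky_a = (0.90, 0.06, 0.04)
sky_b = (0.88, 0.08, 0.04)
ocean = (0.05, 0.90, 0.05)

print(neighbors((1, 4), 7, 8))   # B(1,4) has three neighbors on the top edge
print(neighbors((2, 4), 7, 8))   # B(2,4) has all four
print(description_distance(sky_a, sky_b) < description_distance(sky_a, ocean))
```

Switching `connectivity` to 8 gives the eight-neighborhood variant mentioned in the text without changing the distance computation.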

In S132, the weight of each image block is set based on the similarity between the image block and its adjacent image blocks. For example, when the similarity between an image block and its adjacent image blocks is high, a high weight is set for the image block; when the similarity between an image block and its adjacent image blocks is low, a low weight is set for the image block. For the image blocks B(1,4) and B(2,4) above, for example, the weight W(1,4) of image block B(1,4) is greater than the weight W(2,4) of image block B(2,4). Proceeding in the same way yields the weight W(i, j) of every image block B(i, j) in Fig. 4.

In the image shown in Fig. 4, the first and second rows are dominated by sky, the third and fourth rows are dominated by ocean, the first four columns of the fifth row are dominated by ocean and its last four columns by beach, and the sixth and seventh rows are dominated by beach. The higher the similarity between the image content of an image block and the content of the surrounding image blocks, the greater the weight of that image block, and the more accurately it represents the semantics of that region of the image.
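A minimal sketch of S132 follows, using the row layout just described for Fig. 4. The text only requires that higher neighbor similarity give a higher weight; the specific choices here, a similarity of 1 / (1 + Euclidean distance) and a weight equal to the mean similarity over the four-neighborhood, are assumptions, as are the probability triples.

```python
import math
from typing import Dict, Tuple

Pos = Tuple[int, int]
Desc = Tuple[float, float, float]  # probabilities of (sky, ocean, beach)

SKY: Desc = (0.90, 0.06, 0.04)
OCEAN: Desc = (0.05, 0.90, 0.05)
BEACH: Desc = (0.04, 0.06, 0.90)


def example_grid() -> Dict[Pos, Desc]:
    """7 x 8 grid of block descriptions following the layout described for Fig. 4."""
    grid: Dict[Pos, Desc] = {}
    for i in range(1, 8):
        for j in range(1, 9):
            if i <= 2:
                grid[(i, j)] = SKY
            elif i <= 4 or (i == 5 and j <= 4):
                grid[(i, j)] = OCEAN
            else:
                grid[(i, j)] = BEACH
    return grid


def block_weights(grid: Dict[Pos, Desc]) -> Dict[Pos, float]:
    """Weight of each block = mean similarity to its existing four-neighbors."""
    weights: Dict[Pos, float] = {}
    for (i, j), desc in grid.items():
        sims = [1.0 / (1.0 + math.dist(desc, grid[nb]))  # assumed similarity mapping
                for nb in ((i - 1, j), (i + 1, j), (i, j - 1), (i, j + 1))
                if nb in grid]
        weights[(i, j)] = sum(sims) / len(sims) if sims else 1.0
    return weights


W = block_weights(example_grid())
print(W[(1, 4)] > W[(2, 4)])  # B(1,4) outweighs B(2,4), which borders an ocean block
```

Under these assumptions a block surrounded by identical descriptions receives weight 1.0, and each dissimilar neighbor pulls the weight down.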

Determining the weight of each image block according to its semantic description information and its spatial position in the image, as in S131 and S132, is only an example. The weight of each image block may also be determined in other ways, for example according to the pixel information of the image block.

In S133, the semantic description information of the image is determined according to the semantic description information and the weights of the image blocks. The whole image may include multiple image contents, each located at a different position in the image. Accordingly, the semantic description information of the image is the distribution of semantic description information over the regions at different positions in the image.

Continuing with the image in Fig. 4 as an example, the semantic description information of every image block in the first and second rows is dominated by sky, with ocean and beach occupying very small proportions. Each image block in the first row is highly similar to its surrounding image blocks and therefore has a very high weight. Each image block in the second row has a high similarity to three of its adjacent image blocks and a low similarity to only one, so although its weight is lower than that of the image blocks in the first row, it is still fairly high. Based on the semantic description information and weights of the image blocks in the first and second rows, it can therefore be determined that the semantic description information of the image in this region is sky, with ocean and beach occupying very small proportions. The semantic description information of the other regions of the image can be determined in the same way using S133.
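The combination step of S133 can be illustrated with a weighted average over the blocks of one region. The text does not fix the combination rule, so the weighted average, the function name `region_description`, and all numbers below are assumptions chosen to echo the first-and-second-row example.

```python
from typing import Dict, Iterable, List, Sequence, Tuple

Pos = Tuple[int, int]


def region_description(descs: Dict[Pos, Sequence[float]],
                       weights: Dict[Pos, float],
                       region: Iterable[Pos]) -> List[float]:
    """Weighted average of the block descriptions inside one image region."""
    cells = list(region)
    if not cells:
        raise ValueError("region must contain at least one block")
    total = sum(weights[p] for p in cells)
    if total == 0.0:
        raise ValueError("weights in the region must not all be zero")
    n = len(descs[cells[0]])
    return [sum(weights[p] * descs[p][k] for p in cells) / total for k in range(n)]


# Rows 1 and 2 of an 8-column grid; descriptions are (sky, ocean, beach).
descs: Dict[Pos, Sequence[float]] = {}
weights: Dict[Pos, float] = {}
for j in range(1, 9):
    descs[(1, j)], weights[(1, j)] = (0.90, 0.06, 0.04), 1.00   # very high weight
    descs[(2, j)], weights[(2, j)] = (0.80, 0.15, 0.05), 0.86   # slightly lower weight

top = region_description(descs, weights, descs.keys())
print([round(v, 3) for v in top])
```

Because the weights enter only as a normalized average, blocks that agree with their surroundings contribute more to the region's description while the result remains a valid distribution.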

Fig. 6 schematically illustrate determined by image semantic description information.As shown in the rightmost side of Fig. 6, The method of the semantic description information of the determination image according to Fig. 5 has it can be determined that going out whole image Three picture materials, i.e. sky, ocean and sandy beach.The first row of image and the second row can use the first language Describing, the right side four row in the third line of image, fourth line and fifth line can be with for adopted description information Describing, the left side four in the fifth line of image arranges two semantic description information, the 6th row, the 7th row can be used 3rd semantic description information is describing.First semantic description information is the ratio diagram of each picture material, Wherein, sky occupies maximum ratio, and ocean and sandy beach occupy small percentage respectively.Second semantic description letter Breath is the ratio diagram of each picture material, and wherein, ocean occupies maximum ratio, and sky and sandy beach are respectively Occupy small percentage.3rd semantic description information is the ratio diagram of each picture material, wherein, sandy beach Occupy maximum ratio, sky and ocean occupy small percentage respectively.

Mode above in conjunction with the semantic description information of the determination image of Fig. 5 and Fig. 6 description is only example. In practice, may also take on the semantic description information that other modes determine image, such as figure 7 below Shown.

Fig. 7 be a diagram that the second of the semantic description information of the determination image in the image processing method of Fig. 1 The flow chart of example.As shown in fig. 7, the described locus based on described image block in described image The semantic description information (S130) determining described image with the semantic description information of described image block may include： Semantic description information based on locus in described image for the described image block and described image block determines The adjacent image block with similar semantic description information, as Content aggregation region, described similar semantic is retouched Information of stating is as the semantic description information (S131A) in described Content aggregation region；According to described Content aggregation The semantic description information in the region and described Content aggregation region locus in described image determine described The weight (S132A) in Content aggregation region；Weight according to described Content aggregation region and described content are gathered The semantic description information in collection region determines the semantic description information (S133A) of described image.

In S131A, adjacent image blocks in the image that have similar semantic description information are taken as a content aggregation region. A content aggregation region generally includes multiple image blocks and therefore has a larger area. However, when the content of the image is relatively rich, a content aggregation region may include only one image block. The image blocks within a content aggregation region have similar semantic description information. For example, the image blocks B(1,1), B(1,2), B(2,1), and B(2,2) in the image of Fig. 2 have similar semantic description information and can serve as a content aggregation region. Accordingly, the image is made up of multiple content aggregation regions.
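As a non-limiting illustration, the grouping in S131A may be sketched as a flood fill over the grid of image blocks. The sketch assumes, purely for illustration, that two image blocks have "similar" semantic description information when they share the same dominant image content; the function name `aggregate_regions` and the label dictionary are hypothetical and do not appear in the disclosure.

```python
from typing import Dict, List, Set, Tuple

Position = Tuple[int, int]  # (row i, column j) of an image block B(i, j)


def aggregate_regions(labels: Dict[Position, str]) -> List[Set[Position]]:
    """Group 4-connected blocks that share a dominant content label (assumed criterion)."""
    unvisited = set(labels)
    regions = []
    while unvisited:
        seed = unvisited.pop()
        region, stack = {seed}, [seed]
        while stack:
            i, j = stack.pop()
            # Visit the upper, lower, left and right neighbours of the current block.
            for p in ((i - 1, j), (i + 1, j), (i, j - 1), (i, j + 1)):
                if p in unvisited and labels[p] == labels[seed]:
                    unvisited.remove(p)
                    region.add(p)
                    stack.append(p)
        regions.append(region)
    return regions


# Toy 2 x 3 grid of dominant contents (illustrative values only).
labels = {
    (1, 1): "sky", (1, 2): "sky", (1, 3): "sky",
    (2, 1): "ocean", (2, 2): "ocean", (2, 3): "beach",
}
regions = aggregate_regions(labels)
```

In this toy grid, the single "beach" block forms a content aggregation region of its own, which corresponds to the one-block case mentioned above.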

In S132A, the weight of each content aggregation region in the image is calculated. The weight of a content aggregation region can be calculated in a manner similar to that used for calculating the weight of an image block. For example, the similarity between a content aggregation region and its adjacent content aggregation regions can be calculated according to the semantic description information of the content aggregation regions, and the weight of the content aggregation region can then be set based on that similarity.

In S133A, similarly to determining the semantic description information of the image based on the semantic description information and weights of the image blocks, the semantic description information of the image is determined according to the weights and semantic description information of the content aggregation regions. For details, see the description given above in connection with S133.

As can be seen from the above description with reference to Fig. 7, the content aggregation regions are divided out from the image blocks of the image, and the semantic description information of the image is obtained based on the semantic description information of the content aggregation regions, in a manner similar to obtaining it based on the semantic description information of the image blocks. When the image is large and its content is relatively rich, this can reduce the amount of information processing, so that the semantic description information of the image is determined quickly.

In the technical solution of the image processing method according to the embodiments of the present disclosure, the semantic description information of the whole image is determined based on the spatial position of each image block in the whole image and the semantic description information of each image block. Related image blocks in the whole image are thereby organically linked, which better matches how images are habitually understood.

Fig. 8 is a block diagram schematically illustrating a first image processing apparatus 800 according to an embodiment of the present disclosure. As shown in Fig. 8, the first image processing apparatus 800 includes one or more processors 810, a storage device 820, an input device 830, an output device 840, a communication device 850, and a camera 860. These components are interconnected through a bus system 870 and/or another form of connection mechanism (not shown). It should be noted that the components and structure of the first image processing apparatus 800 shown in Fig. 8 are illustrative rather than restrictive; the first image processing apparatus 800 may have other components and structures as needed.

The processor 810 may be a central processing unit (CPU) or another form of processing unit having data processing capability and/or instruction execution capability, and may control the other components of the first image processing apparatus 800 to perform desired functions.

The storage device 820 may include one or more computer program products, which may include various forms of computer-readable storage media, such as volatile memory and/or non-volatile memory. The volatile memory may include, for example, random access memory (RAM) and/or cache memory. The non-volatile memory may include, for example, read-only memory (ROM), a hard disk, or flash memory. One or more computer program instructions may be stored on the computer-readable storage medium, and the processor 810 may run the program instructions to implement the image processing method described above with reference to Figs. 1 to 7. The computer-readable storage medium may also store various application programs and various data, such as image data and the data used and/or generated by the application programs.

The input device 830 may be a device through which a user inputs instructions, and may include one or more of a keyboard, a mouse, a microphone, a touch screen, and the like. The instruction is, for example, an instruction to capture an image using the camera 860 described below. The output device 840 may output various information (such as images or sound) to the outside (for example, to a user), and may include one or more of a display, a speaker, and the like. The communication device 850 may communicate with other devices (such as personal computers, servers, mobile stations, and base stations) through a network or other technologies. The network may be the Internet, a wireless local area network, a mobile communication network, or the like, and the other technologies may include, for example, Bluetooth communication and infrared communication. The camera 860 may capture the image to be processed (such as a photo or a video) and store the captured image in the storage device 820 for use by other components.

Fig. 9 is a block diagram schematically illustrating a second image processing apparatus 900 according to an embodiment of the present disclosure. As shown in Fig. 9, the second image processing apparatus 900 may include a division unit 910, an image block semantic determination unit 920, and an image semantic determination unit 930. The division unit 910 evenly divides the image to be processed into multiple image blocks. The image block semantic determination unit 920 obtains the semantic description information of the image blocks. The image semantic determination unit 930 determines the semantic description information of the image based on the spatial positions of the image blocks in the image and the semantic description information of the image blocks.

The division unit 910 evenly divides the image to be processed into multiple image blocks. The image to be processed is usually represented by the numerical values of its pixels; each pixel can be represented by a gray value, three primary color components, or the like. An image typically comprises a pixel matrix arranged in rows and columns. As an example of dividing the image into multiple image blocks, the image can be divided successively so that each image block includes 16 × 16 pixels. For a specific division example, see Fig. 2 and the description given with reference to Fig. 2. In Fig. 2, each image block is denoted B(i, j), where i is the row number of the image block in the whole image with 1≤i≤7, and j is the column number of the image block in the whole image with 1≤j≤8. In practice, the division unit 910 can be implemented using a processor and a memory.
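As a non-limiting illustration, the even division performed by the division unit 910 may be sketched as follows. The sketch assumes a gray-scale image stored as a list of pixel rows whose height and width are multiples of the block size; the disclosure does not specify how other sizes are handled. The function name `divide_into_blocks` is hypothetical.

```python
from typing import Dict, List, Tuple

Image = List[List[int]]  # rows of gray values


def divide_into_blocks(image: Image, block_size: int = 16) -> Dict[Tuple[int, int], Image]:
    """Split an image into equal square blocks keyed by 1-based (row i, column j)."""
    height, width = len(image), len(image[0])
    if height % block_size or width % block_size:
        # Assumption of this sketch: only exact multiples are supported.
        raise ValueError("image dimensions must be multiples of block_size")
    blocks = {}
    for top in range(0, height, block_size):
        for left in range(0, width, block_size):
            i, j = top // block_size + 1, left // block_size + 1
            blocks[(i, j)] = [row[left:left + block_size]
                              for row in image[top:top + block_size]]
    return blocks


# A 112 x 128 pixel image yields a 7 x 8 grid of blocks B(i, j), as in Fig. 2.
image = [[0] * 128 for _ in range(112)]
blocks = divide_into_blocks(image)
```

Because the division depends only on pixel coordinates, no analysis of visual features is needed at this stage.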

The division unit 910 evenly divides the image to be processed into multiple image blocks, and this manner of division is simple. By contrast, other image processing approaches that generate the semantic description information of an image may need to segment the image based on its visual features to obtain multiple image regions, which is very complex.

For each of the image blocks divided by the division unit 910, the image block semantic determination unit 920 obtains the semantic description information of that image block. The meaning of data is its semantics, and semantic description information is information used to describe semantics. Image data in itself is merely symbols; only data that has been given meaning can be used, at which point the data has been converted into semantic description information. The semantic description information is, for example, a text-like language expression that a machine can understand intuitively. For example, the semantic description information of each image block of the seashore landscape photo of Fig. 2 can be the probability distribution of that image block being at least one of sky, ocean, and beach.

The image block semantic determination unit 920 can obtain the semantic description information of an image block as follows: obtaining a Gaussian mixture model of each image content corresponding to the semantic description information; and determining the semantic description information of the image block according to the image block and the Gaussian mixture models.

The image block semantic determination unit 920 can obtain in advance, from a database, training images of each image content corresponding to the semantic description information, and obtain the Gaussian mixture model of each image content based on its training images. Alternatively, the image block semantic determination unit 920 can obtain the Gaussian mixture model of each image content directly from the database.

The database stores in advance the training images of each image content; the image contents include, for example, sky, ocean, beach, grassland, trees, and mountains. These image contents are the contents referred to in the semantic description information of various images. Each image content may look different from image to image; for example, the color and brightness of the sky usually differ under different weather conditions. Each image content may therefore have multiple corresponding training images, for example 512. For each image content, Gaussian models can be established from these training images to characterize the image content. A Gaussian model uses normal distribution curves to quantify image content precisely, decomposing the gray scale, color, and so on of the image content into several models formed on the basis of normal distribution curves. For each image content, combining the corresponding Gaussian models with equal (average) weights yields the Gaussian mixture model of that image content. The manner adopted for obtaining the Gaussian mixture models does not limit the embodiments of the present disclosure. Usually, the training images and the Gaussian mixture model of each image content are obtained in advance and the database is established from them.
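As a non-limiting illustration, building a Gaussian mixture model for one image content by equal weighting may be sketched as follows. The sketch assumes, purely for illustration, that each training image contributes one one-dimensional Gaussian over its gray values and that a small variance floor is applied; the disclosure does not fix the features or the fitting procedure. The names `fit_gaussian`, `build_mixture`, and `mixture_density` are hypothetical.

```python
import math
from statistics import fmean, pvariance
from typing import List, Tuple

Gaussian = Tuple[float, float]            # (mean, variance)
Mixture = List[Tuple[float, Gaussian]]    # (component weight, Gaussian)


def fit_gaussian(pixels: List[float], min_variance: float = 1.0) -> Gaussian:
    """Fit one Gaussian to the gray values of a single training image."""
    return fmean(pixels), max(pvariance(pixels), min_variance)


def build_mixture(training_images: List[List[float]]) -> Mixture:
    """Combine the per-image Gaussians with equal weights."""
    weight = 1.0 / len(training_images)
    return [(weight, fit_gaussian(pixels)) for pixels in training_images]


def mixture_density(x: float, mixture: Mixture) -> float:
    """Evaluate the mixture probability density at gray value x."""
    return sum(
        w * math.exp(-(x - mean) ** 2 / (2 * var)) / math.sqrt(2 * math.pi * var)
        for w, (mean, var) in mixture
    )


# Two toy "sky" training images: a bright sky and an overcast sky (illustrative values).
sky_mixture = build_mixture([[200.0, 210.0, 220.0], [120.0, 130.0, 140.0]])
```

With these toy values, the mixture assigns a much higher density to a gray value of 210 than to a gray value of 20, reflecting that both training images are comparatively bright.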

As an example, the image block semantic determination unit 920 can determine the semantic description information of each image block from the Gaussian mixture models as follows: determining, according to the similarity between the image block and the Gaussian mixture models, probability estimates that the image block belongs to each image content; and determining the semantic description information of the image block according to those probability estimates.

Taking the image block in the upper right corner of Fig. 2 as an example, the image data of the 16 × 16 pixels of this image block B(1,8) is matched against the obtained Gaussian mixture model of each image content. For example, the image data of image block B(1,8) is matched against the Gaussian mixture models of sky, ocean, beach, grassland, trees, and mountains respectively, to determine the probability estimates that image block B(1,8) belongs to sky, ocean, beach, grassland, trees, and mountains. According to Fig. 2, the image block in the upper right corner is sky, so it can be determined that image block B(1,8) belongs to sky with the highest probability, while the probability of it belonging to any of ocean, beach, grassland, trees, and mountains may be very small or even zero.

Thereafter, the image block semantic determination unit 920 determines the semantic description information of the image block according to the probability estimates that the image block belongs to each image content. For example, the image block semantic determination unit 920 can use a histogram to represent the probability that each image block belongs to each image content, or can use a proportion chart to do so. Depending on the specific requirements on the semantic description information, the probability estimates can be processed appropriately to generate the required semantic description information.
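As a non-limiting illustration, turning the match between an image block and the Gaussian mixture models into probability estimates may be sketched as follows. The sketch assumes, purely for illustration, that the similarity is the mean log-density of the block's gray values under each mixture and that the scores are normalized with a softmax; the disclosure does not prescribe either choice. The names `log_density` and `block_probabilities` and the model parameters are hypothetical.

```python
import math
from typing import Dict, List, Tuple

Mixture = List[Tuple[float, float, float]]  # (weight, mean, variance) per component


def log_density(x: float, mixture: Mixture) -> float:
    """Log of the mixture density at gray value x, floored to avoid log(0)."""
    density = sum(
        w * math.exp(-(x - m) ** 2 / (2 * v)) / math.sqrt(2 * math.pi * v)
        for w, m, v in mixture
    )
    return math.log(max(density, 1e-300))


def block_probabilities(pixels: List[float], models: Dict[str, Mixture]) -> Dict[str, float]:
    """Estimate the probability that a block belongs to each image content."""
    # Similarity score: mean log-density of the block's pixels under each model.
    scores = {name: sum(log_density(x, mix) for x in pixels) / len(pixels)
              for name, mix in models.items()}
    peak = max(scores.values())
    # Softmax normalization (an assumption of this sketch).
    unnormalized = {name: math.exp(s - peak) for name, s in scores.items()}
    total = sum(unnormalized.values())
    return {name: u / total for name, u in unnormalized.items()}


# Illustrative single-component models; the parameter values are made up.
models = {
    "sky": [(1.0, 200.0, 400.0)],
    "ocean": [(1.0, 90.0, 400.0)],
    "beach": [(1.0, 150.0, 400.0)],
}
probs = block_probabilities([195.0, 205.0, 210.0, 198.0], models)
```

The resulting dictionary plays the role of the proportion chart or histogram mentioned above: its values sum to one, and for this bright toy block the "sky" entry is the largest.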

For each image block in the image to be processed, the image block semantic determination unit 920 judges the similarity between the image block and the Gaussian mixture model of each image content to determine the probability estimates that the image block belongs to each image content, and determines the semantic description information of the image block based on those probability estimates, thereby obtaining the semantic description information of every image block in the image to be processed. The semantic description information of the image blocks obtained by the image block semantic determination unit 920 is illustrated in Fig. 4 and the related description given with reference to Fig. 4.

The image block semantic determination unit 920 can be implemented using a memory and a processor. When the processor runs the program stored in the memory, each operation of the image block semantic determination unit 920 can be completed.

The image semantic determination unit 930 determines the semantic description information of the image based on the spatial positions of the image blocks in the image and the semantic description information of the image blocks determined by the image block semantic determination unit 920. As an example, the image semantic determination unit 930 can determine the weight of each image block according to the semantic description information of the image block and the spatial position of the image block in the image, and then determine the semantic description information of the image according to the semantic description information and the weights of the image blocks. By setting a weight for each image block according to its spatial position in the image, the importance of each image block can be set differentially, so that the semantic description information of the whole image is expressed more accurately. The image blocks divided by the division unit 910 are therefore no longer independent of one another; instead, the semantic description information of the image is determined based on the spatial position of each image block in the image.

The image semantic determination unit 930 can determine the weight of an image block as follows: calculating the similarity between the image block and its adjacent image blocks according to the semantic description information of the image blocks; and setting the weight of the image block based on the similarity between the image block and its adjacent image blocks.

The closer the semantic description information of two adjacent image blocks, the greater the similarity between them. For example, the image block B(1,4) in Fig. 4 and its adjacent image blocks B(1,3), B(1,5), and B(2,4) all consist mainly of sky, with ocean and beach occupying a small part, so the similarity between image block B(1,4) and its adjacent image blocks is high. When the similarity is calculated, both the image contents of the image blocks and the proportion occupied by each image content are considered. The image semantic determination unit 930 can, for example, calculate the similarity between two image blocks by calculating the Euclidean distance between their semantic description information. The image block B(2,4) in Fig. 4 has a high similarity with its adjacent image blocks B(2,3), B(2,5), and B(1,4), which all consist mainly of sky, with ocean and beach occupying a small part. However, image block B(2,4) has a low similarity with its adjacent image block B(3,4), because image block B(3,4) consists mainly of ocean, with sky and beach occupying a small part. Therefore, the similarity between image block B(2,4) and its adjacent image blocks is lower than the similarity between image block B(1,4) and its adjacent image blocks.

When the similarity between an image block and its adjacent image blocks is high, the image semantic determination unit 930 can set a high weight for the image block; when the similarity between an image block and its adjacent image blocks is low, it sets a low weight for the image block. For example, for the image blocks B(1,4) and B(2,4) above, the weight W(1,4) of image block B(1,4) determined by the image semantic determination unit 930 is greater than the weight W(2,4) of image block B(2,4). By analogy, the weight W(i,j) of each image block B(i,j) is obtained. In practice, the image semantic determination unit 930 may also determine the weight of each image block in other manners, for example according to the pixel information of each image block.
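As a non-limiting illustration, the similarity and weight calculation described above may be sketched as follows. The Euclidean distance between semantic description information comes from the description; mapping a distance d to a similarity 1/(1+d) and taking the weight as the mean similarity to the upper, lower, left, and right neighbours are assumptions of this sketch. The names `similarity` and `block_weights` are hypothetical.

```python
import math
from typing import Dict, List, Tuple

Position = Tuple[int, int]
Description = List[float]  # proportions of (sky, ocean, beach)


def similarity(a: Description, b: Description) -> float:
    """Map the Euclidean distance between two descriptions into (0, 1]."""
    return 1.0 / (1.0 + math.dist(a, b))


def block_weights(descriptions: Dict[Position, Description]) -> Dict[Position, float]:
    """Weight W(i, j) = mean similarity between B(i, j) and its 4-neighbours."""
    weights = {}
    for (i, j), desc in descriptions.items():
        neighbours = [descriptions[p]
                      for p in ((i - 1, j), (i + 1, j), (i, j - 1), (i, j + 1))
                      if p in descriptions]
        weights[(i, j)] = (sum(similarity(desc, n) for n in neighbours) / len(neighbours)
                           if neighbours else 1.0)
    return weights


# Toy excerpt of the grid: rows 1 and 2 are mostly sky, row 3 is mostly ocean.
sky, ocean = [0.8, 0.1, 0.1], [0.1, 0.8, 0.1]
descriptions = {(i, j): (sky if i < 3 else ocean)
                for i in range(1, 4) for j in range(3, 6)}
weights = block_weights(descriptions)
```

In this toy excerpt, B(2,4) borders the ocean block B(3,4), so its weight W(2,4) comes out lower than W(1,4), in line with the example above.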

The image semantic determination unit 930 then determines the semantic description information of the image according to the semantic description information and the weights of the image blocks. The whole image may include multiple image contents, each located at a different position in the image. Accordingly, the semantic description information of the image is the distribution of semantic description information over the different position regions of the image.

Continuing with the image in Fig. 2 as an example, the image semantic determination unit 930 can find, from the semantic description information and the weights of the image blocks, that the semantic description information of each image block in the first and second rows consists mainly of sky, with very small proportions of ocean and beach. Each image block in the first row has a very large weight, and each image block in the second row, although its weight is lower than that of the image blocks in the first row, still has a relatively high weight. Based on the semantic description information and the weights of the image blocks in the first and second rows, it can therefore be determined that the semantic description information of the image in this region is sky, with ocean and beach occupying minimal proportions. Similarly, the image semantic determination unit 930 can determine the semantic description information in the other position regions of the image.
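As a non-limiting illustration, combining the semantic description information and the weights of the image blocks in one region may be sketched as a weighted average. The weighted average is an assumption of this sketch; the disclosure only states that the region's semantic description information is determined from the block descriptions and weights. The name `region_description` and the numeric values are hypothetical.

```python
from typing import Dict, Iterable, List, Tuple

Position = Tuple[int, int]


def region_description(positions: Iterable[Position],
                       descriptions: Dict[Position, List[float]],
                       weights: Dict[Position, float]) -> List[float]:
    """Weighted average of the block descriptions within one region."""
    positions = list(positions)
    total = sum(weights[p] for p in positions)
    size = len(descriptions[positions[0]])
    return [sum(weights[p] * descriptions[p][k] for p in positions) / total
            for k in range(size)]


# Proportions of (sky, ocean, beach) for one first-row and one second-row block.
descriptions = {(1, 1): [0.9, 0.05, 0.05], (2, 1): [0.7, 0.2, 0.1]}
weights = {(1, 1): 1.0, (2, 1): 0.5}
result = region_description([(1, 1), (2, 1)], descriptions, weights)
```

Because the first-row block carries the larger weight, the combined description stays dominated by sky while still reflecting the small ocean and beach proportions.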

The semantic description information of the image determined by the image semantic determination unit 930 is illustrated in Fig. 6 and the related description. In short, the first and second rows of the image can be described by first semantic description information; the third row, the fourth row, and the right four columns of the fifth row can be described by second semantic description information; and the left four columns of the fifth row, the sixth row, and the seventh row can be described by third semantic description information. The first semantic description information indicates that sky occupies the largest proportion, while ocean and beach each occupy a small proportion. The second semantic description information indicates that ocean occupies the largest proportion, while sky and beach each occupy a small proportion. The third semantic description information indicates that beach occupies the largest proportion, while sky and ocean each occupy a small proportion.

Alternatively, the image semantic determination unit 930 may determine the semantic description information of the image in other manners. For example, the image semantic determination unit 930 can determine, based on the spatial positions of the image blocks in the image and the semantic description information of the image blocks, adjacent image blocks having similar semantic description information as a content aggregation region, with the similar semantic description information serving as the semantic description information of the content aggregation region; determine the weight of the content aggregation region according to the semantic description information of the content aggregation region and the spatial position of the content aggregation region in the image; and determine the semantic description information of the image according to the weight and the semantic description information of the content aggregation region.

Here, the image semantic determination unit 930 takes adjacent image blocks in the image that have similar semantic description information as a content aggregation region. A content aggregation region generally includes multiple image blocks and therefore has a larger area. However, when the content of the image is relatively rich, a content aggregation region may include only one image block. The image semantic determination unit 930 then calculates the weight of each content aggregation region in a manner similar to that used for calculating the weight of an image block, and determines the semantic description information of the image according to the weights and semantic description information of the content aggregation regions. For example, the image semantic determination unit 930 can calculate the similarity between a content aggregation region and its adjacent content aggregation regions according to the semantic description information of the content aggregation regions, set the weight of the content aggregation region based on that similarity, and determine the semantic description information of the image according to the weights and semantic description information of the content aggregation regions. That is, the image semantic determination unit 930 divides the whole image into content aggregation regions based on the image blocks, and obtains the semantic description information of the image based on the semantic description information of the content aggregation regions, in a manner similar to obtaining it based on the semantic description information of the image blocks. When the image is large and its content is relatively rich, this can reduce the amount of information processing, so that the semantic description information of the image is determined quickly.

The image semantic determination unit 930 can be implemented using a memory and a processor. When the processor runs the program stored in the memory, each operation of the image semantic determination unit 930 can be completed.

In the technical scheme according to the image processing apparatus of the embodiment of the present disclosure, by based on each image The semantic description information of locus in whole image for the block and each image block determines the language of whole image Adopted description information, thus organically contacted by the associated picture block in whole image, more meets to image Understand custom.

Those skilled in the art will clearly understand that, for convenience and brevity of description, the specific working processes of the apparatuses and units described above may refer to the corresponding processes in the foregoing method embodiments and are not repeated here.

Those of ordinary skill in the art will appreciate that the units and steps of the examples described in connection with the embodiments disclosed herein can be implemented in electronic hardware or in a combination of computer software and electronic hardware. Whether these functions are executed in hardware or in software depends on the specific application and the design constraints of the technical solution. Skilled persons may use different methods to implement the described functions for each specific application, but such implementations should not be considered to go beyond the scope of the present disclosure.

It should be understood that the methods and apparatuses disclosed in the several embodiments provided herein can be implemented in other manners. For example, the apparatus embodiments described above are merely schematic: the division into units is only a division by logical function, and other manners of division are possible in an actual implementation. For example, multiple units may be combined or integrated into another system, or some features may be omitted or not executed.

The above are only specific embodiments of the present disclosure, but the protection scope of the present disclosure is not limited thereto. Any change or replacement that a person skilled in the art can readily conceive within the technical scope disclosed herein shall be covered by the protection scope of the present disclosure. Therefore, the protection scope of the present disclosure shall be subject to the protection scope of the claims.

Claims

1. An image processing method, comprising:

evenly dividing an image to be processed into multiple image blocks;

obtaining semantic description information of the image blocks; and

determining semantic description information of the image based on the spatial positions of the image blocks in the image and the semantic description information of the image blocks.

2. The image processing method according to claim 1, wherein obtaining the semantic description information of the image blocks comprises:

obtaining a Gaussian mixture model of each image content corresponding to the semantic description information; and

determining the semantic description information of the image block according to the image block and the Gaussian mixture models.

3. The image processing method according to claim 2, wherein determining the semantic description information of the image block according to the image block and the Gaussian mixture models comprises:

determining, according to the similarity between the image block and the Gaussian mixture models, probability estimates that the image block belongs to each image content; and

determining the semantic description information of the image block according to the probability estimates that the image block belongs to each image content.

4. The image processing method according to claim 1, wherein determining the semantic description information of the image based on the spatial positions of the image blocks in the image and the semantic description information of the image blocks comprises:

determining a weight of the image block according to the semantic description information of the image block and the spatial position of the image block in the image; and

determining the semantic description information of the image according to the semantic description information of the image block and the weight of the image block.

5. The image processing method according to claim 4, wherein determining the weight of the image block according to the semantic description information of the image block and the spatial position of the image block in the image comprises:

calculating the similarity between the image block and its adjacent image blocks according to the semantic description information of the image blocks; and

setting the weight of the image block based on the similarity between the image block and its adjacent image blocks.

6. The image processing method according to claim 1, wherein determining the semantic description information of the image based on the spatial positions of the image blocks in the image and the semantic description information of the image blocks comprises:

determining, based on the spatial positions of the image blocks in the image and the semantic description information of the image blocks, adjacent image blocks having similar semantic description information as a content aggregation region, and taking the similar semantic description information as the semantic description information of the content aggregation region;

determining a weight of the content aggregation region according to the semantic description information of the content aggregation region and the spatial position of the content aggregation region in the image; and

determining the semantic description information of the image according to the weight of the content aggregation region and the semantic description information of the content aggregation region.

7. An image processing apparatus, comprising:

a memory; and

a processor configured to perform the following operations:

evenly dividing an image to be processed into multiple image blocks;

obtaining semantic description information of the image blocks; and

determining semantic description information of the image based on the spatial positions of the image blocks in the image and the semantic description information of the image blocks.

8. The image processing apparatus according to claim 7, wherein obtaining the semantic description information of the image blocks comprises:

9. The image processing apparatus according to claim 8, wherein determining the semantic description information of the image block according to the image block and the Gaussian mixture models comprises:

10. The image processing apparatus according to claim 7, wherein determining the semantic description information of the image based on the spatial positions of the image blocks in the image and the semantic description information of the image blocks comprises:

11. The image processing apparatus according to claim 10, wherein determining the weight of the image block according to the semantic description information of the image block and the spatial position of the image block in the image comprises:

12. The image processing apparatus according to claim 7, wherein determining the semantic description information of the image based on the spatial positions of the image blocks in the image and the semantic description information of the image blocks comprises: