CN102033933B - Distance metric optimization method for maximizing mean average precision (MAP) - Google Patents

Distance metric optimization method for maximizing mean average precision (MAP) Download PDF

Info

Publication number
CN102033933B
CN102033933B CN201010594547.0A CN201010594547A CN102033933B CN 102033933 B CN102033933 B CN 102033933B CN 201010594547 A CN201010594547 A CN 201010594547A CN 102033933 B CN102033933 B CN 102033933B
Authority
CN
China
Prior art keywords
image
average
distance measure
map
precision ratio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201010594547.0A
Other languages
Chinese (zh)
Other versions
CN102033933A (en
Inventor
冯前进
阳维
卢振泰
陈武凡
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Southern Medical University
Original Assignee
Southern Medical University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Southern Medical University filed Critical Southern Medical University
Priority to CN201010594547.0A priority Critical patent/CN102033933B/en
Publication of CN102033933A publication Critical patent/CN102033933A/en
Application granted granted Critical
Publication of CN102033933B publication Critical patent/CN102033933B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a distance metric optimization method for maximizing a mean average precision (MAP). The method comprises the following steps of: (1) acquiring an image database which comprises an inquired image-image to be inquired pair set; (2) approximating the MAP by using a smooth function, converting the approximated MAP into a target function and adding a regularization item, so that the target function is continuously differentiable; (3) optimizing the target function by using an inquired image-image to be inquired pair in the image database and gradient descent-based optimization technology and repeating the optimization process, wherein a result which maximizes the MAP is taken as the optimal linear transformation matrix; and (4) acquiring a distance metric which is defined by the optimal linear transformation matrix. The method has the characteristics of direct optimization process and target. The distance metric obtained by the method can reach higher image retrieval performance.

Description

Maximize the distance measure optimization method of average precision ratio average
Technical field
The present invention relates to a kind of distance measure optimization method that is used to improve the image retrieval performance, relate in particular to a kind of distance measure optimization method that maximizes average precision ratio average.
Background technology
(Content-Based Image Retrieval, basic thought CBIR) is that visual feature of image is extracted to CBIR, realizes the image retrieval of higher level as index with visual signature.The CBIR technology can provide support and helps for managing image data, clinical diagnosis, medical teaching etc.Especially, the retrieval of similar focus in the medical image can improve the integrality of reliability of clinical diagnosis and relevant information.
Usually; The CBIR system is in order to let Query Result return according to similarity order from big to small; Need in image feature space distance measure of definition or similarity measurement to come the difference between image to be checked and query image in the measure database; During return results, image to be checked is according to the ordering of difference size.Distance measure commonly used has: Euclidean distance, mahalanobis distance, Cauchy's distance etc.The retrieval performance of CBIR system depends on two aspects to a great extent: 1. the visual signature that is used to express picture material and semantic concept; 2. distance measure that in feature space, defines or similarity measurement.Usually adopt the low layer visual signature in the CBIR system, describe picture material like color, texture, shape, edge etc.Yet the descriptive power of low layer visual signature is limited, and information has loss when expressing image, thereby the high-level semantic of image generally can not directly expressed or reflect to the low layer visual signature.Like this; In the low-level feature space; We are difficult to promptly have so-called " semantic wide gap " with one of the above-mentioned general distance measure definition similarity measurement with the semantic concept tight association, make the retrieval performance of CBIR system and actual demand that certain distance arranged.
Related feedback (Relevance Feedback) and distance measure study (Distance Metric Learning) method can be used to reduce semantic wide gap, improve the retrieval performance of CBIR.The data that are used to learn in the related feedback are difficult to obtain, and system needs repeatedly to adjust just can reach promising result, and it is not very convenient that the user uses.The distance measure learning method can be set up the mapping between low layer characteristics of image and the high-level semantic according to the image pattern of prior mark, thereby the distance measure of definition reflection semantic difference improves retrieval performance.Existing distance measure learning method, spininess is to cluster or the design of k nearest neighbour classification problem, and its learning objective is in order to improve the accuracy rate of cluster or k nearest neighbour classification.Yet; The retrieval precision index and the classification accuracy of CBIR system are very different; As average precision ratio average (Mean Average Precision, MAP), NDCG (Normalized DiscountedCumulative Gain), all with database in the ordering of image to be checked relation is arranged; Use the objective function of optimizing to cluster or classification problem design to carry out distance measure study, differing obtains the best distance measure of retrieval performance surely.And, because the retrieval precision index is not continuously differentiable, it is optimized the certain difficulty of existence.
Summary of the invention
The object of the present invention is to provide a kind of distance measure optimization method that maximizes average precision ratio average; This method has optimizing process and target is direct; Optimize the semantic difference that the distance measure that obtains can reflect image, the distance measure that obtains with this method can reach the better pictures retrieval performance.
The object of the invention can be realized through following technical measures:
A kind of distance measure optimization method that maximizes average precision ratio average may further comprise the steps:
(1) obtains the image data base that comprises query image-image pair set to be checked;
(2) adopt smooth function that MAP is approached, be converted into objective function and add regularization term, make that objective function is continuously differentiable;
(3) use the query image-image to be checked in the image data base right, adopt the method that descends based on gradient to be optimized to objective function, and repeat optimizing process, get and make maximum the separating of MAP as the optimum matrix of a linear transformation;
(4) image pitch is obtained from what estimate, comprises dual mode: to the advanced line translation of the characteristics of image in the image data base, the Euclidean distance with transform characteristics defines the distance measure between image again with the optimum linearity transformation matrix in the step (3); Or with the optimum linearity transformation matrix in the step (3) definition mahalanobis distance as the distance measure between image.
Image data base in the said step (1) is made up of image or image-region characteristic, image category.
In the said step (1) according to whether comprising definite images such as same anatomical, same type focus to whether relevant, to inquiry-image to be checked to marking.
The process of in the said step (2) MAP being approached is: to given query image, the ordering sequence number of arbitrary image to be checked uses the Sigmoid type function smoothly to approach, and then obtains smoothly approaching of MAP.
The ordering sequence number of said image to be checked compares definite by the distance between this image to be checked and other all images to be checked and the query image.
The transformation matrix that the initial solution of transformation matrix can be a stochastic matrix or obtained by additive method in the said step (3).
Adopting at random in the said step (3), gradient descent algorithm comes the optimization aim function.
The inventive method has following beneficial effect with respect to prior art:
1, this method relates to technology such as image retrieval, statistical learning and numerical optimization, and (Mean Average Precision MAP), can be used for image retrieval, cluster analysis, data and subtracts approximately etc. directly to optimize average precision ratio average through statistical learning.This method is mainly carried out statistical learning to the view data of mark; Obtain a matrix of a linear transformation; With primitive character spatial mappings to a new feature space that has more resolving ability, in new feature space, retrieve as image similarity tolerance with Euclidean distance, reach higher MAP.
2, the inventive method is directly separated the matrix of a linear transformation; The positive semidefinite planning algorithm that need not working costs is found the solution; And can carry out dimension to primitive character through the dimension of direct adjustment transformation matrix and subtract approximately, rather than the order of positive semidefinite matrix is retrained or regulate indirectly regularization parameter realize that dimension subtracts approximately.This method is an objective function with smoothly approaching of retrieval precision index MAP, and with respect to some other distance measure learning method, optimizing process and target are more direct, and low layer characteristics of image after the mapping and high-level semantic characteristic are got in touch more tight, and retrieval performance is more excellent.In content-based medical image retrieval was used, the present invention can reach the better retrieval performance than the distance measure that some other distance measure learning method obtains.
3, because very big to the calculated amount of calculating target function and gradient to all query image-image to be checked, therefore, adopt in this method at random that gradient descent algorithm is optimized, can reduce to calculator memory demand, accelerate algorithm the convergence speed; In addition; Because objective function is non-protruding, there are a lot of Local Extremum in the solution space, be absorbed in Local Extremum for avoiding separating; Estimate one near the initial solution of optimum solution through other distance measure learning methods faster, repeatedly the repeated optimization process is got optimum solution then.
Description of drawings
Fig. 1 is the process flow diagram of the distance measure optimization method of the average precision ratio average of maximization of the present invention;
Fig. 2 is that the present invention optimizes in the retrieval precision index MAP process MAP with the convergence situation synoptic diagram of optimized Algorithm iterations;
Fig. 3 be distance measure that the present invention and other distance measure optimization methods obtain be used to retrieve look into complete-precision ratio curve;
The precision ratio average curve map that Fig. 4 is four kinds of distance measures when returning image and be 50 width of cloth;
The distance measure that Fig. 5 is to use optimization of the present invention to obtain is used for the example of the sick retrieval of brain tumor T1 weighting Contrast-enhanced MRI image
Embodiment
Fig. 1 shows the idiographic flow of the distance measure optimization method of the average precision ratio average of maximization of the present invention, may further comprise the steps:
(1) obtains the image data base that comprises the right set of the artificial query image-image to be checked that marks; Image data base is made up of image or image-region characteristic, image category.
(2) adopt smooth function that MAP is approached, be converted into objective function and add regularization term, make that objective function is continuously differentiable; Desirable stochastic matrix of transformation matrix initial solution or the transformation matrix of estimating by other distance measure learning methods faster.The process that MAP is approached is: to given query image, the ordering sequence number of arbitrary image to be checked uses the Sigmoid type function smoothly to approach, and then obtains smoothly approaching of MAP.
The ordering sequence number of image to be checked compares definite by the distance between this image to be checked and other all images to be checked and the query image.
(3) use the query image-image to be checked in the image data base and adopt at random that gradient descent algorithm comes the optimization aim function, the repeated optimization process is got and is made maximum the separating as the optimum matrix of a linear transformation of MAP;
(4) distance measure between image is used for image is retrieved; The use-pattern of the optimum linearity transformation matrix in the step (3) comprises two kinds: a kind of is to the advanced line translation of the characteristics of image in the image data base with the matrix of a linear transformation; Euclidean distance with transform characteristics defines the distance measure between image again, thereby carries out image querying; Another kind is to carry out image querying with matrix of a linear transformation definition mahalanobis distance as the distance measure of image.
Below in conjunction with brain T1 weighting Contrast-enhanced MRI image retrieval problem, set forth principle of work of the present invention and step.
Step 1 reads the histological type of tumour MRI image, the figure and the focus profile that the doctor delineates by hand from image data base, extracts characteristics such as focus area grayscale, texture, shape and edge, and proper vector is expressed as x iThe image that definition comprises the same type tumour is an associated picture, otherwise is uncorrelated image.The task of image retrieval be with in the database with query image in the focus zone identical image of histological type return to the user.
Step 2, because MAP is relevant with the ordering of image to be checked, the ordering sequence number of at first confirming and approaching each image to be checked according to the distance between query image and the image to be checked.If L is one d * D (real number matrix of d≤D), positive semidefinite matrix W=L TL, then x iWith x jBetween mahalanobis distance (square) do
d ij=||Lx i-Lx j|| 2=(x i-x j) TL TL(x i-x j)=(x i-x j) TW(x i-x j),
Here TThe matrix transpose operation of representing matrix.If query feature vector is x q, inquiry-to be checked to being (x q, { x i, i=1,2 ..., N}), N is a total number of images to be checked here, proper vector x to be checked iThe ordering sequence number can be expressed as
π ( x i ) = 1 + Σ k , k ≠ i 1 { d qi > d qk } = 1 + Σ k , k ≠ i 1 { Δ d ik > 0 } ,
Δ d wherein Ik=d Qi-d Qk, 1{} is an indicator function.Since discontinuous, the non-differentiability of indicator function, ordering sequence number π (x i) as apart from d QiOr the function of L also is discontinuous, non-differentiability.Indicator function can be used continuously differentiable Sigmoid type function, is designated as S (Z).Ordering sequence number π (x i) smoothly approach into
Figure BDA0000039021690000052
(Average Precision AP) is defined as the average precision ratio of retrieval precision index
Figure BDA0000039021690000053
Rel (x wherein i) { 0,1} representes the label that two-stage is relevant, N to ∈ +Be the total number of images relevant with query image,
Figure BDA0000039021690000054
MAP be average precision ratio all inquiries on average.AP can be write as following form
AP = 1 N + Σ i ( rel ( x i ) π ( x i ) + Σ i , i ≠ j rel ( x i ) rel ( x j ) 1 { Δ d ij > 0 } π ( x i ) ) .
Use
Figure BDA0000039021690000056
Substitute π (x i), and use Sigmoid approximation of function indicator function once more, AP smoothly approach into
A ^ P = 1 N + Σ i = 1 N ( rel ( x i ) π ^ ( x i ) + Σ k , k ≠ i rel ( x i ) rel ( x k ) S ( Δ d ik ) π ^ ( x i ) ) .
The regularization term of transformation matrix L is expressed as Reg (L).Desirable trace (the L of Reg (L) TL), the mark of trace (M) representing matrix M here; Reg (L) is other norm forms of desirable L also.
Maximization MAP is equivalent to and minimizes 1-MAP.Define average precision ratio loss function:
Loss AP ( L ) = 1 - A ^ P ,
The objective function of the present invention's definition has following form:
R(L)=Loss AP(L)+γReg(L),
Wherein γ is a weight coefficient, is used to control the balance between loss function and the regularization term.R (L) is continuously differentiable.
Distance measure optimization problem of the present invention is expressed as:
L * = arg min L ∈ R d × D 1 M Σ m = 1 M R m ( L ) ,
Wherein M for inquiry-image to be checked to sum, R m(L) be m inquiry-image to be checked on objective function.
Step 3 is at first estimated initial solution L 0, L 0Can be one d * D stochastic matrix or get separating that other distance measure learning methods obtain.During the t time iteration, generation inquiry from image data base at random-to be checked to (x q, { x i, i=1,2 ..., N t) t, calculate R (L t) this inquiry on about L tGradient g t, upgrade L t=L T-1tg t, here
Figure BDA0000039021690000062
T=1,2 ..., T is an iterations, the maximum iteration time of T for setting, η tIt is the step-length of the t time iteration; Iteration is until reaching maximum iteration time or separating the condition of convergence.Repeatedly repeat said process, choose and make maximum the separating of retrieval precision index as L *As get trace (L TL) as regularization term, gradient g then tFor:
g t = ∂ R ( L t ) ∂ L t = - ∂ Loss AP ( L t ) ∂ L t + 2 γ L t
Loss function is about L tLocal derviation
Figure BDA0000039021690000064
Obtain through chain rule.
Algorithm parameter α, γ can adjust through the crosscheck method, and experience ground α gets 1~10, and γ gets 0.0001~0.01 can reach promising result.The desirable η of step-length that gradient descends t0(2t/T), t is the current iteration number of times to exp, also can adopt other modes to adjust step-length.Fig. 2 shows the present invention and optimizes in the retrieval precision index MAP process MAP with the convergence situation of optimized Algorithm iterations.
Step 4, the linear transformation L that obtains for step 3 *, have the mode of two kinds of equivalences to be applied to image retrieval.The one, use L *Original feature vector is mapped to new feature space, use the similarity measurement of the Euclidean distance of standard then as image; The one, directly use L *The definition mahalanobis distance is retrieved as the similarity measurement of image, and the former can reduce the required storage space of characteristic, accelerate image retrieval speed.
What the distance measure that Fig. 3 obtains for the present invention and other distance measure learning methods was used to retrieve looks into entirely-the precision ratio curve; Experimental data wherein is a brain T1 weighting Contrast-enhanced MRI image; The image that comprises the same type tumour is an associated picture, otherwise is uncorrelated image.Relatively the distance measure of usefulness comprise standard Euclidean distance, by large-spacing nearest neighbour classification (LMNN) and the acquistion of local Fisher discriminatory analysis (LFDA) methodology to mahalanobis distance.It is thus clear that the inventive method can reach higher MAP, be superior to Euclidean distance and other two kinds of distance measure learning methods.
The precision ratio average curve that Fig. 4 is four kinds of distance measures when returning image and be 50 width of cloth, used view data is identical with Fig. 3.Distance measure optimization method among the present invention, during the 5-50 image, the precision ratio average is higher than other three kinds of distance measures before returning.
Fig. 5 is used for the example of the sick retrieval of brain tumor T1 weighting Contrast-enhanced MRI image for the distance measure that uses the present invention to obtain; The image in the upper left corner is a query image among the figure; Add the associated picture (the expression tumor type is identical) that the expression of solid box (white wire frame) retrieves; Add the uncorrelated image (the expression tumor type is inequality) that the expression of frame of broken lines (dashed white wire frame) retrieves, among the figure tumor region profile manual work tick.Above-mentioned experimental data shows that the present invention can reach the better retrieval performance than the distance measure that some other distance measure learning method obtains.
Be not limited thereto at embodiment of the present invention; Other similar MAP like Preck and NDCG etc., also can use similar above-mentioned formula and method to approach by the retrieval precision index of image ordering sequence number to be checked definition; Find the solution with optimization method of the present invention then, do not enumerate one by one at this; Some other optimisation technique and step-length adjustment mode that descends based on gradient also can be used for realizing the object of the invention like the conjugate gradient descent algorithm; According to foregoing of the present invention; Ordinary skill knowledge and customary means according to this area; Do not breaking away under the above-mentioned basic fundamental thought of the present invention prerequisite, the present invention can also make equivalent modifications, replacement or the change of other various ways, all drops within the rights protection scope of the present invention.

Claims (4)

  1. One kind the maximization average precision ratio average the distance measure optimization method, may further comprise the steps:
    (1) obtains the image data base that comprises query image-image pair set to be checked;
    (2) adopt smooth function that average precision ratio average is approached, be converted into objective function and add regularization term, make that objective function is continuously differentiable; The process that average precision ratio average is approached is: to given query image, the ordering sequence number of arbitrary image to be checked uses the Sigmoid type function smoothly to approach, and then obtains smoothly approaching of average precision ratio average;
    (3) use the query image-image to be checked in the image data base right, adopt the method that descends based on gradient to be optimized to objective function, and repeat optimizing process, get and make maximum the separating of average precision ratio average as the optimum matrix of a linear transformation;
    (4) image pitch is obtained from what estimate, comprises dual mode: to the advanced line translation of the characteristics of image in the image data base, the Euclidean distance with transform characteristics defines the distance measure between image again with the optimum linearity transformation matrix in the step (3); Perhaps use the optimum linearity transformation matrix in the step (3) to define mahalanobis distance as the distance measure between image.
  2. 2. the distance measure optimization method of the average precision ratio average of maximization according to claim 1 is characterized in that: the image data base in the said step (1) is made up of image or image-region characteristic, image category.
  3. 3. the distance measure optimization method of the average precision ratio average of maximization according to claim 1; It is characterized in that: the query image in the said step (1)-right set of image to be checked is that whether foundation comprises same anatomical, the same type focus confirms that image to whether similar, marks.
  4. 4. the distance measure optimization method of the average precision ratio average of maximization according to claim 1 is characterized in that: adopting at random in the said step (3), gradient descent algorithm comes the optimization aim function.
CN201010594547.0A 2010-12-17 2010-12-17 Distance metric optimization method for maximizing mean average precision (MAP) Expired - Fee Related CN102033933B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201010594547.0A CN102033933B (en) 2010-12-17 2010-12-17 Distance metric optimization method for maximizing mean average precision (MAP)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201010594547.0A CN102033933B (en) 2010-12-17 2010-12-17 Distance metric optimization method for maximizing mean average precision (MAP)

Publications (2)

Publication Number Publication Date
CN102033933A CN102033933A (en) 2011-04-27
CN102033933B true CN102033933B (en) 2012-02-01

Family

ID=43886826

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201010594547.0A Expired - Fee Related CN102033933B (en) 2010-12-17 2010-12-17 Distance metric optimization method for maximizing mean average precision (MAP)

Country Status (1)

Country Link
CN (1) CN102033933B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104036109A (en) * 2014-03-14 2014-09-10 上海大图医疗科技有限公司 Image based system and method for case retrieving, sketching and treatment planning
EP3613060A1 (en) 2017-04-20 2020-02-26 Koninklijke Philips N.V. Learning and applying contextual similarities between entities

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050180639A1 (en) * 2004-02-17 2005-08-18 Trifonov Mikhail I. Iterative fisher linear discriminant analysis
CN101620638A (en) * 2009-08-06 2010-01-06 华中科技大学 Image retrieval method based on gauss mixture models
CN101853304A (en) * 2010-06-08 2010-10-06 河海大学 Remote sensing image retrieval method based on feature selection and semi-supervised learning

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050180639A1 (en) * 2004-02-17 2005-08-18 Trifonov Mikhail I. Iterative fisher linear discriminant analysis
CN101620638A (en) * 2009-08-06 2010-01-06 华中科技大学 Image retrieval method based on gauss mixture models
CN101853304A (en) * 2010-06-08 2010-10-06 河海大学 Remote sensing image retrieval method based on feature selection and semi-supervised learning

Also Published As

Publication number Publication date
CN102033933A (en) 2011-04-27

Similar Documents

Publication Publication Date Title
Mai et al. A review of location encoding for GeoAI: methods and applications
Akgül et al. 3D model retrieval using probability density-based shape descriptors
CN105118068B (en) Medical image automatic marking method under a kind of condition of small sample
CN112287839B (en) SSD infrared image pedestrian detection method based on transfer learning
DE112016004535T5 (en) Universal Compliance Network
CN109583483A (en) A kind of object detection method and system based on convolutional neural networks
CN107451609A (en) Lung neoplasm image identification system based on depth convolutional neural networks
CN106530321B (en) A kind of multichannel chromatogram image partition method based on direction and scale description
CN108629367A (en) A method of clothes Attribute Recognition precision is enhanced based on depth network
CN104021547A (en) Three dimensional matching method for lung CT
CN109493417A (en) Three-dimension object method for reconstructing, device, equipment and storage medium
CN104077303B (en) Method and apparatus for data to be presented
CN110378423A (en) Feature extracting method, device, computer equipment and storage medium
JP7373624B2 (en) Method and apparatus for fine-grained image classification based on scores of image blocks
Li et al. A robot vision navigation method using deep learning in edge computing environment
CN103226582B (en) A kind of medical image search method based on uncertain fixed point figure
US11514660B1 (en) Scene recognition method, training method and device based on pyramid attention
CN102033933B (en) Distance metric optimization method for maximizing mean average precision (MAP)
CN105426836A (en) Single-sample face recognition method based on segmented model and sparse component analysis
Lan et al. Integrating general principles into mixed-integer programming to optimize schematic network maps
CN113469091B (en) Face recognition method, training method, electronic device and storage medium
CN113255767B (en) Bill classification method, device, equipment and storage medium
CN104537694B (en) Online learning offline video tracking method based on key frames
CN117078956A (en) Point cloud classification segmentation network based on point cloud multi-scale parallel feature extraction and attention mechanism
CN110533118A (en) Remote sensing images sparse expression classification method based on Multiple Kernel Learning

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20120201

Termination date: 20141217

EXPY Termination of patent right or utility model