CN102663447A - Cross-media searching method based on discrimination correlation analysis - Google Patents

Cross-media searching method based on discrimination correlation analysis Download PDF

Info

Publication number
CN102663447A
CN102663447A CN2012101334886A CN201210133488A CN102663447A CN 102663447 A CN102663447 A CN 102663447A CN 2012101334886 A CN2012101334886 A CN 2012101334886A CN 201210133488 A CN201210133488 A CN 201210133488A CN 102663447 A CN102663447 A CN 102663447A
Authority
CN
China
Prior art keywords
data
image
sigma
text
point set
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2012101334886A
Other languages
Chinese (zh)
Other versions
CN102663447B (en
Inventor
谭铁牛
王亮
王威
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute of Automation of Chinese Academy of Science
Original Assignee
Institute of Automation of Chinese Academy of Science
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Automation of Chinese Academy of Science filed Critical Institute of Automation of Chinese Academy of Science
Priority to CN201210133488.6A priority Critical patent/CN102663447B/en
Publication of CN102663447A publication Critical patent/CN102663447A/en
Application granted granted Critical
Publication of CN102663447B publication Critical patent/CN102663447B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a cross-media searching method based on discrimination correlation analysis. The method comprises the following steps of establishing a cross-media training database, carrying out feature extraction, mean-value pretreatment and linear projection transformation sequentially for different modal samples, and setting a target function according to a projection space; solving the target function to acquire a linear projection vector; establishing a cross-media test database; sequentially performing the feature extraction and mean-value pretreatment for a target to be searched; utilizing the linear projection vector to perform the linear projection transformation for the feature data after the mean-value pretreatment; and calculating an Euclidean distance between two modal data projection vectors, listing the Euclidean distance in ascending order, and acquiring a cross-media searching result. Due to the adoption of the method, dimensional reduction can be effectively performed for the feature data, so that the feature data can be widely applied to other multi-modal work, for example the multi-modal biological feature recognition.

Description

Stride the medium search method based on what differentiate correlation analysis
Technical field
The present invention relates to pattern-recognition and machine learning field, especially a kind ofly stride the medium search method based on what differentiate correlation analysis.
Background technology
In recent years, a large amount of multi-medium datas that occur present two tangible characteristics: high-dimensional property and polyphyly, for example same semantic concept can be represented by plurality of kinds of contents such as the literal on the network, picture, videos.In addition, the Internet user also mainly searches for needed information through text keyword, and this mainly is because search engine can't be understood the mutual relationship between the different modalities medium, thereby has limited the development of search engine.The characteristic dimensionality reduction has disclosed manifold structure and correlativity different modalities data between of high dimensional data in lower dimensional space, and in information retrieval, pattern classification, great function has been brought into play in fields such as information visualization.
The feature dimension reduction method of single mode data has a lot, and principal component analysis (Principal Component Analysis) projects to raw data on the principal direction with maximum variance; (Linear Discriminant Analysis LDA) is a kind of supervision dimension reduction method that has, and finds a projection subspace under the condition of classification information making full use of, and makes different classes of characteristic have optimum identification in linear discriminant analysis; Local linear embedding (Locally Linear Embedding) is a non-linear local reservation method the earliest, and the linear relationship of each data point and its arest neighbors data point is able to keep in projector space; LE (Laplacian Eigenmaps) has kept the distance of local two data points in projector space, LPP (Locality Preserving Projection) is its linear-apporximation algorithm; Multilayer own coding network (Multilayer Autoencoder Network) is the nonlinear stretch of principal component analysis method.Have research work to point out, though nonlinear method treatment of simulated data performance is fine, but not necessarily the principal component analysis method than traditional is good for real data, and more than these methods of mentioning all can not directly apply to the multi-modal medium retrieval of striding.
The feature dimension reduction method research of multi-modal data is not a lot; Canonical correlation analysis (Canonical Correlation Analysis; CCA) be wherein the most famous multivariate data analysis method; It to same subspace, makes multi-modal data difference linear projection multi-modal variable have maximum correlation; Relevant with typical linear different, PLS (Partial Least Square) makes multi-modal variable have maximum covariance in projector space; Under the inspiration of multilayer own coding network, multi-modal degree of depth learning network is suggested and is the common expression of different modalities data study.In a word; Above method more is to be that target removes to seek projector space with the correlativity that maximizes multi-modal variable; And ignored the identification that maximizes different classes of data in the multi-modal data, and identification is often extremely important in multi-modal data retrieval and classification task.
Summary of the invention
Existing multi-modal data analysing method is not generally considered the identification of data; The invention provides a kind of based on differentiating correlation analysis (Discriminant Correlation Analysis; DCA) method; It has merged the thought of canonical correlation analysis and linear discriminant analysis, optimizes the identification of multiple modalities correlation of data and different classes of data simultaneously.
Proposed by the invention a kind ofly stride the medium search method, it is characterized in that this method may further comprise the steps based on what differentiate correlation analysis:
Step 1 is set up and to be comprised right the striding the medium tranining database and extract the proper vector of different modalities sample in this database of image and text one to one, obtains corresponding characteristic point set;
Step 2, the characteristic point set to image and two mode of text carries out the average pre-service respectively, makes that the average of characteristic point set of each mode is 0;
Step 3 will be passed through the pretreated characteristic point set of average and carried out the linear projection conversion, and set an objective function about the linear projection variable according to the projector space that obtains;
Step 4, use characteristic value solving method is found the solution said objective function, obtains linear projection vector a and b;
Step 5, set up comprise image and text one to one right stride the medium test database;
Step 6 is imported object to be retrieved, and extracts the proper vector of object to be retrieved respectively and stride in the medium test database characteristic point set that belongs to the object set of different modalities with object to be retrieved;
Step 7, proper vector and characteristic point set that step 6 is obtained carry out said average pre-service respectively;
Step 8, the linear projection vector a and the b that use said step 4 to obtain carry out the linear projection conversion respectively to process pretreated proper vector of average and characteristic point set;
Step 9; Calculate the Euclidean distance between the projection variable of projection variable and object set of object to be retrieved; And all Euclidean distances are carried out ascending sort, preceding n the corresponding object data of Euclidean distance promptly is the object of striding another mode relevant with image to be retrieved that retrieval obtains in the medium test database said.
The inventive method can be carried out dimensionality reduction effectively to characteristic, thereby is widely used in other a lot of multi-modal work, discerns such as multi-modal biological characteristic.Experiment showed, the inventive method in striding medium retrievals than canonical correlation analysis, and the simple combination performance of canonical correlation analysis and linear discriminant analysis all will be got well.
Description of drawings
Fig. 1 is the realization flow figure of the inventive method;
Fig. 2 be the inventive method on a simulated data collection with the comparing result of other correlation techniques.
Embodiment
For making the object of the invention, technical scheme and advantage clearer, below in conjunction with specific embodiment, and with reference to accompanying drawing, to further explain of the present invention.
Fig. 1 is the realization flow figure of the inventive method; As shown in Figure 1, proposed by the invention a kind ofly comprise training process (Fig. 1 (a)) and test process (Fig. 1 (b) and (c)), particularly based on the medium search method of striding of differentiating correlation analysis; Fig. 1 (a) is for utilizing among the present invention image text in the tranining database to study projection vector a; The process flow diagram of b, shown in Fig. 1 (a), training process of the present invention may further comprise the steps:
Step 1 is set up and to be comprised right the striding the medium tranining database and extract the proper vector of different modalities sample in this database of image and text one to one, obtains corresponding characteristic point set.
The present invention at first sets up image and text is striden the medium tranining database one to one; Use yardstick invariant features conversion (Scale-Invariant Feature Transform then respectively; SIFT) algorithm and latent Di Lei Cray distribute, and (Latent Dirichlet Allocation, LDA) algorithm carries out feature extraction to image and text.
Step 2, the characteristic point set to image and two mode of text carries out the average pre-service respectively, makes that the average of characteristic point set of each mode is 0:
x←x-E(x) (1)
y←y-E(y)
Wherein, x and y are two given mode characteristic point sets, and such as image and text characteristic of correspondence data acquisition, its corresponding respectively data point set is { x 1... x nAnd { y 1... y n, the data in each data point set belong to k common classification respectively
Figure BDA0000158812600000041
E (x), E (y) is the average of original set of data points.
Step 3; To pass through pretreated image of average and text feature data point set carries out the linear projection conversion and obtains projector space; Set an objective function according to said projector space, this objective function is the objective function about the linear projection variable that is used to carry out the linear projection conversion.
Given projection vector a and b, variable set x and y that two mode characteristics of image and text point set is corresponding carry out the linear projection conversion, obtain respective projection variable u and v:
u=a Tx (2)
v=b Ty
The step of the said projector space target setting function that conversion obtains according to linear projection further may further comprise the steps:
Step 3.1, the covariance cov of projection variable u and v in the calculating projector space (u, v):
cov ( u , v ) = a T E ( xy T ) b
= 1 2 a T E ( xy T ) b + 1 2 b T E ( yx T ) a
= a T b T 0 1 2 E ( xy T ) 1 2 E ( yx T ) 0 a b - - - ( 3 )
= a T b T Σ a b
Wherein, ∑ defines the eigenmatrix of covariance for this reason.
Step 3.2, computed image and the inter-class variance of two mode characteristics of text point set in projector space and a type internal variance σ BAnd σ W:
σ B = Σ m = 1 k n m n ω m ω m T - - - ( 4 )
σ w = 1 2 n Σ m = 1 k Σ i ∈ C m ( ( u i - ω m ) ( u i - ω m ) T + ( v i - ω m ) ( v i - ω m ) T ) - - - ( 5 )
Wherein, n representes the number of each data point intensive data, n mThe number of representing the data of m class in each data point set, k are the number of classification, ω mThe average of representing the concentrated m class data of two data points:
ω m = 1 2 ( 1 n m Σ i ∈ C m u i + 1 n m Σ i ∈ C m v i ) - - - ( 6 )
Be brought into formula (4) and (5), then σ to projection formula (2) BAnd σ WCan be rewritten as:
σ B = a T b T S B a b - - - ( 7 )
σ W = a T b T S W a b - - - ( 8 )
Wherein, S BAnd S WBe called " the hash matrix between type " and " hash matrix in type " of multi-modal data, be respectively:
S B = 1 2 n Σ m = 1 k n m E m xx T E m xy T E m yx T E m yy T - - - ( 9 )
S W = 1 2 n Σ m = 1 k n m E m ( xx T ) - 1 2 E m xx T - 1 2 E m xy T - 1 2 E m yx T E m ( yy T ) - 1 2 E m yy T - - - ( 10 )
Wherein, E m(x) and E m(y) be the average that raw data points is concentrated m class data respectively, C mRepresent m class data set:
E m ( xx T ) = 1 n m Σ i ∈ C m ( x i x i T ) E m ( yy T ) = 1 n m Σ i ∈ C m ( y i y i T )
E m xx T = E m ( x ) E m T ( x ) E m xy T = E m ( x ) E m T ( y ) - - - ( 11 )
E m yy T = E m ( y ) E m T ( y ) E m yx T = E m ( y ) E m T ( x )
Step 3.3, according to the covariance cov that calculates (u, v), inter-class variance σ BWith class internal variance σ WThe target setting function.
The objective function that the present invention differentiates correlation analysis is defined as:
a * , b * = arg max a , b μσ B + ( 1 - μ ) cov ( u , v ) σ W - - - ( 12 )
Wherein, σ BAnd σ WBe respectively " inter-class variance " and " type internal variance " of two data points collection in projector space, (u v) is the covariance of variable u and v in the projector space to cov, and μ is the adjusting parameter, and it is controlling σ BAnd cov (u, relative weighting v).
Step 4, use characteristic value solving method is found the solution said objective function, is finally learnt the linear projection vector a and the b that obtain.
In order to find the solution said objective function, need convert said objective function into a generalized eigenvalue problem:
At first define f=(a, b), then objective function (12) can be rewritten as:
f * = arg max f f T ( μ S B + ( 1 - μ ) Σ ) f f T S W f - - - ( 13 )
Can see that objective function (13) is very similar with the objective function of linear discriminant analysis, adopt lagrange's method of multipliers promptly can convert a generalized eigenvalue problem to (13) into, be shown below:
(μS B+(1-μ)∑)f=λS Wf (14)
Find the solution the eigenwert and the proper vector of (14); And arrange proper vector again according to the order that eigenwert is successively decreased; Get linear projection vector a and b that big eigenwert characteristic of correspondence vector obtains as final study; The linear projection vector a and the b that promptly utilize said study to obtain carry out the linear projection conversion respectively to multi-modal characteristic point set, can realize the dimensionality reduction to said multi-modal characteristic point set.
Step 5, set up comprise image and text one to one right stride the medium test database.
Fig. 1 (b) is for concentrating the process flow diagram of retrieving the text relevant with image at text data among the present invention; Fig. 1 (c) is for concentrating the process flow diagram of retrieving the image relevant with text in view data among the present invention; Shown in Fig. 1 (b) and Fig. 1 (c), test process of the present invention may further comprise the steps:
Step 6 is imported object to be retrieved, and extracts the proper vector of object to be retrieved respectively and stride in the medium test database characteristic point set that belongs to the object set of different modalities with object to be retrieved.
In this step; Similar with step 1; (Scale-Invariant Feature Transform, SIFT) algorithm and latent Di Lei Cray distribute, and (Latent Dirichlet Allocation, LDA) algorithm carries out feature extraction to image and text to use the conversion of yardstick invariant features respectively.
For instance, when needs were retrieved a series of text object relevant with certain image, object to be retrieved was an image, extracts the SIFT proper vector x of image respectively iLDA characteristic point set with test database Chinese version data set
Figure BDA0000158812600000062
Wherein, N is the number of test database Chinese version data.
Step 7, similar with said step 2, proper vector and characteristic point set that step 6 is obtained carry out the average pre-service respectively.
Step 8, the linear projection vector a and the b that use said step 4 to obtain carry out the linear projection conversion respectively to process pretreated proper vector of average and characteristic point set, so that the pretreated characteristic of process average is carried out dimensionality reduction.
The linear projection vector a and the b that use said step 4 to obtain are with the SIFT proper vector x of image iLDA characteristic set with test database Chinese version data set Carry out the linear projection conversion respectively, obtain respective projection variable u iWith
Figure BDA0000158812600000072
u i=a Tx i
(15)
{ v i } i = 1 N = b T { y i } i = 1 N
Step 9; Calculate the Euclidean distance between the projection variable of projection variable and object set of object to be retrieved; And all Euclidean distances are carried out ascending sort, preceding n the corresponding object data of Euclidean distance promptly is the object of striding another mode relevant with image to be retrieved that retrieval obtains in the medium test database said.
If object to be retrieved is an image; In this step; The Euclidean distance between the projection variable of each text data in the projection variable of computed image and the test database at first; And all Euclidean distances are carried out ascending sort, preceding n the corresponding text data of Euclidean distance promptly is the text object relevant with image to be retrieved that retrieval obtains.Here, result for retrieval quantity n can be set up on their own by the user as required.
What need to specify is, except the retrieval of cross-module attitude, the inventive method also may be used on other anyly need carry out dimension-reduction treatment to carry out the field of feature identification to multi-modal data, discerns such as multi-modal biological characteristic.
Prove that with the test result on simulated data collection and the True Data the inventive method is superior to the combination of canonical correlation analysis, linear discriminant analysis and canonical correlation analysis and linear discriminant analysis respectively below.
Simulated data collection instance is as shown in Figure 1, has generated two two-dimentional point sets among Fig. 1 (a), and asterism (the 1st type) is a point set with crunode (the 2nd type), and square frame (the 1st type) is the another one point set with rhombus (the 2nd type), and these two point sets belong to 2 types respectively; (b) provided the projection result of canonical correlation analysis (CCA) on simulated data; Though these two point sets are very relevant; But they but have a large amount of overlapping regions on low dimension projector space (here data projection to transverse axis), and the projecting direction that obtains of canonical correlation analysis does not have identification thus; (c) provided the projection result of linear discriminant analysis (LDA) on simulated data, though two types after the projection have good identification, the correlativity of two point sets after the projection is very poor; (d) provided the result of linear discriminant analysis (LDA) and a kind of combination of canonical correlation analysis (CCA), promptly earlier each point set has been done linear discriminant analysis, and then do canonical correlation analysis, the result who obtains is with directly to do canonical correlation analysis (b) closely similar; (e) provided the result of linear discriminant analysis (LDA) with other a kind of combination of canonical correlation analysis (CCA); Promptly earlier two point sets are done canonical correlation analysis; And then doing linear discriminant analysis, its result seems more similar with the result (g) of the inventive method (DCA), yet carries out obtaining after the log-transformation (f) and (h) to two results (e) and transverse axis coordinate (g); The result that can see canonical correlation analysis and linear discriminant analysis combination is linear inseparable on horizontal axis; Like P data point in (f) and Q data point, and the result of the inventive method is a linear separability, has explained that the inventive method has more identification.
On an image text data set, tested the performance of differentiating correlation analysis below the True Data collection instance; It is right that this data set comprises 2866 image texts; It is right that wherein training set has 2173 image texts; Test set has 693 image texts right, and each image text belongs to a certain type in following 10 types: art, biology, geography, history, literature, medium, music, royal family, physical culture, military affairs to a class label is arranged.Wherein, image adopts the SIFT characteristic of 128 dimensions, and text adopts the LDA text semantic characteristic of 10 dimensions.Project to the characteristic of these two types of data with two kinds of combinations differentiating correlation analysis, canonical correlation analysis and canonical correlation analysis and linear discriminant analysis the lower dimensional space of 9 dimensions then respectively; In this 9 dimension space, carry out cross-module attitude retrieval tasks; Promptly concentrate the retrieval image relevant, perhaps concentrate the retrieval text relevant with certain image at text data with certain text in view data.The result of cross-module attitude retrieval measures with mean accuracy (MAP, mean average precision), and mean accuracy is the bigger the better, and the mean accuracy here is meant the mean value of each query and search precision.Table 1 has provided the classification results of four kinds of algorithms, can see, differentiates correlation analysis and is superior to additive method.
Table 1
Method Image is as test data Text is as test data
DCA 0.2108 0.2482
CCA 0.2032 0.2032
CCA+LDA 0.2020 0.2011
LDA+CCA 0.2031 0.2034
Above-described specific embodiment; The object of the invention, technical scheme and beneficial effect have been carried out further explain, and institute it should be understood that the above is merely specific embodiment of the present invention; Be not limited to the present invention; All within spirit of the present invention and principle, any modification of being made, be equal to replacement, improvement etc., all should be included within protection scope of the present invention.

Claims (10)

1. stride the medium search method based on what differentiate correlation analysis for one kind, it is characterized in that this method may further comprise the steps:
Step 1 is set up and to be comprised right the striding the medium tranining database and extract the proper vector of different modalities sample in this database of image and text one to one, obtains corresponding characteristic point set;
Step 2, the characteristic point set to image and two mode of text carries out the average pre-service respectively, makes that the average of characteristic point set of each mode is 0;
Step 3 will be passed through the pretreated characteristic point set of average and carried out the linear projection conversion, and set an objective function about the linear projection variable according to the projector space that obtains;
Step 4, use characteristic value solving method is found the solution said objective function, obtains linear projection vector a and b;
Step 5, set up comprise image and text one to one right stride the medium test database;
Step 6 is imported object to be retrieved, and extracts the proper vector of object to be retrieved respectively and stride in the medium test database characteristic point set that belongs to the object set of different modalities with object to be retrieved;
Step 7, proper vector and characteristic point set that step 6 is obtained carry out said average pre-service respectively;
Step 8, the linear projection vector a and the b that use said step 4 to obtain carry out the linear projection conversion respectively to process pretreated proper vector of average and characteristic point set;
Step 9; Calculate the Euclidean distance between the projection variable of projection variable and object set of object to be retrieved; And all Euclidean distances are carried out ascending sort, preceding n the corresponding object data of Euclidean distance promptly is the object of striding another mode relevant with image to be retrieved that retrieval obtains in the medium test database said.
2. method according to claim 1 is characterized in that, in the said step 1 and 6, uses yardstick invariant features mapping algorithm and latent Di Lei Cray Distribution Algorithm that image and text are carried out feature extraction respectively.
3. method according to claim 1 is characterized in that, the linear projection map table in the said step 3 is shown:
u=a Tx
V=b TY wherein, x and y are respectively the set of the corresponding variable of two mode characteristics of image and text point set, a and b are respectively corresponding projection vector, u and v pass through the projection variable that the linear projection conversion obtains.
4. method according to claim 3 is characterized in that, further may further comprise the steps according to the step of the projector space target setting function that obtains:
Step 3.1, and the covariance cov of projection variable u and v in the calculating projector space (u, v);
Step 3.2, computed image and the inter-class variance of two mode characteristics of text point set in projector space and a type internal variance σ BAnd σ W
Step 3.3, according to the covariance cov that calculates (u, v), inter-class variance σ BWith class internal variance σ WThe target setting function.
5. method according to claim 4 is characterized in that, in the said step 3.1, the covariance cov of projection variable u and v (u v) is expressed as:
cov ( u , v ) = a T b T Σ a b ,
Wherein, ∑ defines the eigenmatrix of covariance for this reason.
6. method according to claim 4 is characterized in that, in the said step 3.2, and a said inter-class variance and a type internal variance σ BAnd σ WBe expressed as:
σ B = a T b T S B a b ,
σ W = a T b T S W a b ,
Wherein, S BAnd S W" the hash matrix between type " and " hash matrix in type " that is called multi-modal data:
S B = 1 2 n Σ m = 1 k n m E m xx T E m xy T E m yx T E m yy T ,
S W = 1 2 n Σ m = 1 k n m E m ( xx T ) - 1 2 E m xx T - 1 2 E m xy T - 1 2 E m yx T E m ( yy T ) - 1 2 E m yy T ,
Wherein, n representes the number of each data point intensive data, n mThe number of representing the data of m class in each data point set, k are the number of classification, E m ( Xx T ) = 1 n m Σ i ∈ C m ( x i x i T ) , E m ( Yy T ) = 1 n m Σ i ∈ C m ( y i y i T ) ,
E m xx T = E m ( x ) E m T ( x ) , E m xy T = E m ( x ) E m T ( y ) , E m yy T = E m ( y ) E m T ( y ) , E m yx T = E m ( y ) E m T ( x ) ,
C mRepresent m class data set, E m(x) and E m(y) be the average that raw data points is concentrated m class data respectively.
7. method according to claim 4 is characterized in that, said objective function is defined as:
a * , b * = arg max a , b μσ B + ( 1 - μ ) cov ( u , v ) σ W ,
Wherein, μ is for regulating parameter, and it is controlling σ BAnd cov (u, relative weighting v).
8. method according to claim 1 is characterized in that, in the said step 4, the step that use characteristic value solving method is found the solution said objective function further may further comprise the steps:
At first, (a b), rewrites said objective function to definition f=;
Then, the objective function after adopting lagrange's method of multipliers to rewrite convert into one can try to achieve generalized eigenvalue equality;
At last, find the solution the eigenwert and the proper vector of this equality, and arrange proper vector again, get linear projection vector a and b that big eigenwert characteristic of correspondence vector obtains as final study according to the order that eigenwert is successively decreased.
9. method according to claim 1 is characterized in that, the object to be retrieved in the said step 6 is image or text.
10. method according to claim 1 is characterized in that, result for retrieval quantity n is set up on their own by the user as required in the said step 9.
CN201210133488.6A 2012-04-28 2012-04-28 Cross-media searching method based on discrimination correlation analysis Active CN102663447B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210133488.6A CN102663447B (en) 2012-04-28 2012-04-28 Cross-media searching method based on discrimination correlation analysis

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210133488.6A CN102663447B (en) 2012-04-28 2012-04-28 Cross-media searching method based on discrimination correlation analysis

Publications (2)

Publication Number Publication Date
CN102663447A true CN102663447A (en) 2012-09-12
CN102663447B CN102663447B (en) 2014-04-23

Family

ID=46772930

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210133488.6A Active CN102663447B (en) 2012-04-28 2012-04-28 Cross-media searching method based on discrimination correlation analysis

Country Status (1)

Country Link
CN (1) CN102663447B (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103793507A (en) * 2014-01-26 2014-05-14 北京邮电大学 Method for obtaining bimodal similarity measure with deep structure
CN103838836A (en) * 2014-02-25 2014-06-04 中国科学院自动化研究所 Multi-modal data fusion method and system based on discriminant multi-modal deep confidence network
CN103995903A (en) * 2014-06-12 2014-08-20 武汉科技大学 Cross-media search method based on isomorphic subspace mapping and optimization
CN104077319A (en) * 2013-03-29 2014-10-01 南京邮电大学 Method for annotating images on basis of measurement of difference among non-Euclidean spaces
CN103870500B (en) * 2012-12-14 2017-05-24 联想(北京)有限公司 Searching method and searching device
CN107220475A (en) * 2016-11-01 2017-09-29 重庆交通大学 A kind of bearing features data analysing method based on linear discriminant analysis
CN107657008A (en) * 2017-09-25 2018-02-02 中国科学院计算技术研究所 Across media training and search method based on depth discrimination sequence study
CN108399414A (en) * 2017-02-08 2018-08-14 南京航空航天大学 Method of Sample Selection and device
CN109213876A (en) * 2018-08-02 2019-01-15 宁夏大学 Based on the cross-module state search method for generating confrontation network

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5647058A (en) * 1993-05-24 1997-07-08 International Business Machines Corporation Method for high-dimensionality indexing in a multi-media database
CN101021849A (en) * 2006-09-14 2007-08-22 浙江大学 Transmedia searching method based on content correlation
CN101334796A (en) * 2008-02-29 2008-12-31 浙江师范大学 Personalized and synergistic integration network multimedia search and enquiry method
CN101996191A (en) * 2009-08-14 2011-03-30 北京大学 Method and system for searching for two-dimensional cross-media element

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5647058A (en) * 1993-05-24 1997-07-08 International Business Machines Corporation Method for high-dimensionality indexing in a multi-media database
CN101021849A (en) * 2006-09-14 2007-08-22 浙江大学 Transmedia searching method based on content correlation
CN101334796A (en) * 2008-02-29 2008-12-31 浙江师范大学 Personalized and synergistic integration network multimedia search and enquiry method
CN101996191A (en) * 2009-08-14 2011-03-30 北京大学 Method and system for searching for two-dimensional cross-media element

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103870500B (en) * 2012-12-14 2017-05-24 联想(北京)有限公司 Searching method and searching device
CN104077319B (en) * 2013-03-29 2018-03-06 南京邮电大学 Image labeling method based on non-Euclidean space Diversity measure
CN104077319A (en) * 2013-03-29 2014-10-01 南京邮电大学 Method for annotating images on basis of measurement of difference among non-Euclidean spaces
CN103793507B (en) * 2014-01-26 2016-10-05 北京邮电大学 A kind of method using deep structure to obtain bimodal similarity measure
CN103793507A (en) * 2014-01-26 2014-05-14 北京邮电大学 Method for obtaining bimodal similarity measure with deep structure
CN103838836A (en) * 2014-02-25 2014-06-04 中国科学院自动化研究所 Multi-modal data fusion method and system based on discriminant multi-modal deep confidence network
CN103838836B (en) * 2014-02-25 2016-09-28 中国科学院自动化研究所 Based on discriminant multi-modal degree of depth confidence net multi-modal data fusion method and system
CN103995903B (en) * 2014-06-12 2017-04-12 武汉科技大学 Cross-media search method based on isomorphic subspace mapping and optimization
CN103995903A (en) * 2014-06-12 2014-08-20 武汉科技大学 Cross-media search method based on isomorphic subspace mapping and optimization
CN107220475A (en) * 2016-11-01 2017-09-29 重庆交通大学 A kind of bearing features data analysing method based on linear discriminant analysis
CN108399414A (en) * 2017-02-08 2018-08-14 南京航空航天大学 Method of Sample Selection and device
CN108399414B (en) * 2017-02-08 2021-06-01 南京航空航天大学 Sample selection method and device applied to cross-modal data retrieval field
CN107657008A (en) * 2017-09-25 2018-02-02 中国科学院计算技术研究所 Across media training and search method based on depth discrimination sequence study
CN109213876A (en) * 2018-08-02 2019-01-15 宁夏大学 Based on the cross-module state search method for generating confrontation network
CN109213876B (en) * 2018-08-02 2022-12-02 宁夏大学 Cross-modal retrieval method based on generation of countermeasure network

Also Published As

Publication number Publication date
CN102663447B (en) 2014-04-23

Similar Documents

Publication Publication Date Title
CN102663447B (en) Cross-media searching method based on discrimination correlation analysis
CN107515895B (en) Visual target retrieval method and system based on target detection
Chen et al. Model Metric Co-Learning for Time Series Classification.
Rodríguez-Serrano et al. A model-based sequence similarity with application to handwritten word spotting
Zhao et al. Spectral feature selection for data mining
CN101187927B (en) Criminal case joint investigation intelligent analysis method
CN103049526B (en) Based on the cross-media retrieval method of double space study
CN103473327A (en) Image retrieval method and image retrieval system
CN105930873B (en) A kind of walking across mode matching method certainly based on subspace
CN106250925B (en) A kind of zero Sample video classification method based on improved canonical correlation analysis
CN103559191A (en) Cross-media sorting method based on hidden space learning and two-way sorting learning
CN105740378B (en) Digital pathology full-section image retrieval method
CN106844481B (en) Font similarity and font replacement method
WO2013159356A1 (en) Cross-media searching method based on discrimination correlation analysis
CN103177121A (en) Locality preserving projection method for adding pearson relevant coefficient
CN108319959A (en) A kind of corps diseases image-recognizing method compressed based on characteristics of image with retrieval
CN102831161A (en) Semi-supervision sequencing study method for image searching based on manifold regularization
CN104143088A (en) Face identification method based on image retrieval and feature weight learning
Wang et al. Action recognition using linear dynamic systems
Eravci et al. Diversity based relevance feedback for time series search
Cao et al. Research on dynamic time warping multivariate time series similarity matching based on shape feature and inclination angle
CN103049570B (en) Based on the image/video search ordering method of relevant Preserving map and a sorter
Bafna et al. Identification of significant challenges in the sports domain using clustering and feature selection techniques
Arslan et al. Comparison of feature-based and image registration-based retrieval of image data using multidimensional data access methods
Ji et al. Efficient semi-supervised multiple feature fusion with out-of-sample extension for 3D model retrieval

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant