CN105631469A - Bird image recognition method by multilayer sparse coding features - Google Patents

Bird image recognition method by multilayer sparse coding features Download PDF

Info

Publication number
CN105631469A
CN105631469A CN201510964442.2A CN201510964442A CN105631469A CN 105631469 A CN105631469 A CN 105631469A CN 201510964442 A CN201510964442 A CN 201510964442A CN 105631469 A CN105631469 A CN 105631469A
Authority
CN
China
Prior art keywords
sparse coding
feature
dictionary
birds
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510964442.2A
Other languages
Chinese (zh)
Inventor
郭礼华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
South China University of Technology SCUT
Original Assignee
South China University of Technology SCUT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by South China University of Technology SCUT filed Critical South China University of Technology SCUT
Priority to CN201510964442.2A priority Critical patent/CN105631469A/en
Publication of CN105631469A publication Critical patent/CN105631469A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2411Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines

Abstract

The invention discloses a bird image recognition method by multilayer sparse coding features. Firstly, a cascaded multilayer sparse coding structure is used on R, G and B channels of a local image block for extracting sparse coding features, wherein each layer of sparse coding structure comprises a feature coding part and a feature maximum value extraction part; and then, in the aspect of output features, the multilayer sparse coding features use a linear kernel for fusion, and an SVM is used as a classifier for classified judgment. In the multilayer sparse coding structure, a local constraint term is added to an optimization objective function, a solved approximate solution for the objective function is iteratively used for sparse coding in the feature coding part, the coding values are then used for minimally reconstructing errors, and a dictionary is updated. According to the bird image recognition method by multilayer sparse coding features, the bird image recognition precision by the system can be greatly improved.

Description

A kind of birds image-recognizing method of multilamellar sparse coding feature
Technical field
The present invention relates to field of image recognition, particularly to the birds image-recognizing method of a kind of multilamellar sparse coding feature.
Background technology
The patent of more existing birds identifications at present, but the patent being identified for birds image does not find. In birds identification, as patent 2009102108999 provides the remote wireless monitoring system based on Acoustic image integrated wild birds identification technology, it is used for long-range ecological monitoring, and described system includes song and records module, video record module, audio, video data processing module, audio/video flow transport module and audiovisual digital file storage module. Patent 2013105810072 provides a kind of portable birds recognition methods based on chirm, the chirm signal gathered is carried out pretreatment by LabVIEW software by this patent, and process the chirm signal after pretreatment by the AOK Time-Frequency Analysis Method of LabVIEW and MATLAB software hybrid programming, finally by MATLAB software processes AOK time-frequency spectrum, it is achieved the extraction of eigenvalue. The eigenvalue of the bird of known bird kind is generated training template through model training and carries out data storage, the eigenvalue of the bird of bird kind to be identified is generated test template through model training, carries out mating the identification realizing birds with training template by test template in conjunction with DTW algorithm. In real life, user's mode such as more often with mobile phone, birds are taken pictures is recorded and identifies. A kind of recognition methods based on birds image it is badly in need of for this.
Summary of the invention
In order to overcome disadvantages mentioned above and the deficiency of prior art, it is an object of the invention to the birds image-recognizing method of a kind of multilamellar sparse coding feature, the accuracy of identification for birds image is high.
The purpose of the present invention is achieved through the following technical solutions:
The birds image-recognizing method of a kind of multilamellar sparse coding feature, comprises the following steps:
S1 trains process
S1.1 collects various birds training images, forms birds training dataset;
S1.2 dictionary learning
S1.2.1 set the object function of dictionary learning as:
min D , X | | Y - D X | | F 2 + λ Σ i = 1 N Σ j = 1 , j ≠ i N | d i T d j | + β Σ i = 1 N | | e i · x i | | 2 - - - ( 1 )
Wherein Y is the picture element matrix collection of image block, yiIt it is the picture element matrix of i-th image block in picture element matrix collection Y; D is the dictionary that need to learn, and dictionary number is N, and dictionary element is diAnd dj; X is the weight coefficient of dictionary, and its element is xi;It is F norm, ei=exp ([dist (yi,d1),...,dist(yi,dN)]T/ ��), and dist (yi,dj) it is yiWith djEuclidean distance, representing matrix dot product, �� is weight, ��, �� be balance two kinds constraint weight coefficients;
S1.2.2 calculates the code coefficient X of input pixel signal matrix stack Y, shown in the fresh target function such as formula (2) obtained, shown in its analytic solutions such as formula (3);
min D , X | | Y - D X | | F 2 + β Σ i = 1 N | | e i · x i | | 2 - - - ( 2 )
s . t . ∀ i , 1 T x i = 1
C i = ( D T - 1 y i T ) ( D T - 1 y i T ) T x i = ( C i + βdiag 2 ( e ) ) \ 1 x i = x i / 1 T x i - - - ( 3 )
S1.2.3 is after trying to achieve code coefficient X, and the word in dictionary optimizes renewal in order by KSVD algorithm, and object function (1) is updated to:
min d m { x ‾ m T x ‾ m d m T d m - 2 R m x ‾ m + λ Σ j = 1 , j ≠ m N | d j T d m | } - - - ( 4 )
s.t.||dm||2=1
WhereinIt is the vector of X m row,It it is the picture element matrix collection Y residual error about m-th word; N is the total number of dictionary;
S1.2.3 constantly iteration realizes step S1.2.1��S1.2.2; Until the dictionary D that each iteration is run no longer changes;
S1.3 utilizes the S1.1 dictionary obtained, and adopts multilamellar sparse coding that birds training image is carried out sparse coding sparse calculation, obtains the output of sparse coding feature;
S1.4 classifier training
The sparse coding feature of step S1.3 gained is exported and sends into linear support vector machine grader, obtain the maximum classification areal model between different classes of birds;
S2 test process
To test image, adopt the method for step S1.3 to obtain the output of sparse coding feature, send into the birds of grader gained of S1.4 different classes of between maximum classification areal model, it is judged that the birds classification output that current test image is corresponding.
Described multilamellar sparse coding is specially two-layer sparse coding, and step is as follows:
In ground floor sparse coding, training image is uniformly divided into two kinds of sizes of 16x16 or 32x32, respectively in the image block of both sizes, utilize the dictionary that S1.1 obtains, carry out sparse coding sparse calculation, obtain the sparse coding coefficient of ground floor, obtain the feature output of ground floor sparse coding;
The feature of ground floor sparse coding is exported the input as second layer sparse coding, in second layer sparse coding, the feature of the sparse coding of ground floor is exported generated two-dimensional matrix, and after it is uniformly divided into 8*8 size, utilize the dictionary that S1.1 obtains, carry out sparse coding sparse calculation, obtain the sparse coding coefficient of the second layer; Obtain the feature output of second layer sparse coding;
The feature of ground floor is exported and the feature output of the second layer carries out cascade, obtain final sparse coding feature output.
The feature output of described acquisition ground floor sparse coding, particularly as follows:
The extraction mode that maximizes utilizing 2*2 size obtains the feature output of ground floor sparse coding.
The feature output of described acquisition second layer sparse coding, particularly as follows:
The extraction mode that maximizes utilizing 2*2 size obtains the feature output of second layer sparse coding.
The described feature output feature of ground floor exported with the second layer carries out cascade, obtains final sparse coding feature output, particularly as follows:
The feature of ground floor is exported and the feature output of the second layer all carries out space delamination sampling, obtain final sparse coding feature output; The processing procedure of space delamination sampling is exactly that image is divided into the subgraph region of 2*2 and 4*4, then again in whole figure, 1/2 subgraph region and 1/4 subgraph region, the rectangular histogram of the feature output of statistics regional, then the feature of regional is exported rectangular histogram and carry out cascade, obtain final sparse coding feature output.
Compared with prior art, the present invention has the following advantages and beneficial effect:
(1) the birds image-recognizing method of the present invention, first at the R of topography's block, utilizing cascade multilamellar sparse coding structure extraction sparse coding feature in G, B triple channel, every layer of sparse coding structure all comprises feature coding part and profile maxima extracts two parts; Then on output characteristic, multilamellar sparse coding feature uses linear kernel to merge, and uses SVM to carry out classification judgement as grader. The present invention adopts multilamellar sparse coding structure, obtained by spatial decimation and multiple dimensioned merge with spatial distribution rank, to extract the diagnostic characteristics of enough discriminations as much as possible, maximize the class inherited of epicritic vision target, improve the system accuracy of identification for birds image.
(2) present invention is in multilamellar sparse coding structure, optimization object function adds a local restriction item, the approximate solution using tried to achieve object function in feature coding part iteratively carries out sparse coding, then these encoded radios are used to minimize reconstructed error, update dictionary, emphasize local smoothing method character in an encoding process.
(3) the birds image-recognizing method of the present invention can rely on merely the information of image that wherein birds classification is judged, method is convenient easily to be realized.
Accompanying drawing explanation
Fig. 1 is the schematic flow sheet of the birds image-recognizing method of the multilamellar sparse coding feature of embodiments of the invention.
Fig. 2 is the multilamellar sparse coding process schematic of embodiments of the invention.
Detailed description of the invention
Below in conjunction with embodiment, the present invention is described in further detail, but embodiments of the present invention are not limited to this.
Embodiment
As it is shown in figure 1, the birds image-recognizing method of the multilamellar sparse coding feature of the present embodiment, including following;
S1 trains process
S1.1 collects various birds training images, forms birds training dataset;
The present invention selects California Institute of Technology 200 class birds data set, and this data set comprises the view data of 200 class difference birds, and each classification picture number has more than 300.
S1.2 dictionary learning
S1.2.1
Before carrying out classifier training, it is necessary to set up sparse coding dictionary. In traditional sparse coding dictionary learning process, owing to some image block can substantial amounts of repeat, therefore, when allusion quotation is practised handwriting in randomly drawing sample training, these significantly high image blocks of repeatability will be selected with significantly high probability, thus cause final study to dictionary Expired Drugs occurs. Simultaneously in the process of study dictionary, some have the visually bigger image block of texture difference can use same atoms in dictionary, and therefore dictionary learning process is not sufficiently stable. In order to make dictionary learning process have the character of local smoothing method, simultaneously also can fast coding, the present invention at object function plus a local smoothing method bound term, and iteration go optimize new object function.
The present embodiment set the object function of dictionary learning as:
min D , X | | Y - D X | | F 2 + λ Σ i = 1 N Σ j = 1 , j ≠ i N | d i T d j | + β Σ i = 1 N | | e i · x i | | 2 - - - ( 1 )
Wherein Y is that (the present embodiment is that training data is concentrated image have employed the image array of 20000 parts of 16*16 sizes, y at random for the picture element matrix collection of image blockiIt is the picture element matrix of wherein i-th image block); D is the dictionary that need to learn, and dictionary number is N, and dictionary element is diAnd dj; X is the weight coefficient of corresponding dictionary, and its element is xi;It is F norm, ei=exp ([dist (yi,d1),...,dist(yi,dN)]T/ ��), and dist (yi,dj) it is yiWith djEuclidean distance, representing matrix dot product, �� is weight (the present embodiment sets it as 0.1), ��, �� be balance two kinds constraint weight coefficients (it is 0.003 that the present embodiment sets �� as 0.01, ��). Owing to relating to solving of two variablees of X and D, the method for iteration optimization is adopted to implement for this.
S1.2.2 calculates the code coefficient X of input pixel signal matrix stack Y, and therefore the Section 2 in formula (1) can be omitted, shown in the fresh target function such as formula (2) obtained, shown in its analytic solutions such as formula (3);
min D , X | | Y - D X | | F 2 + β Σ i = 1 N | | e i · x i | | 2 - - - ( 2 )
s . t . ∀ i , 1 T x i = 1
{ C i = ( D T - 1 y i T ) ( D T - 1 y i T ) T x i = ( C i + βdiag 2 ( e ) ) \ 1 x i = x i / 1 T x i - - - ( 3 )
S1.2.3 is after trying to achieve code coefficient X, and the word in dictionary optimizes renewal in order by KSVD algorithm, and by the constant part in removable (1), object function (1) is updated to:
min d m { x ‾ m T x ‾ m d m T d m - 2 R m x ‾ m + λ Σ j = 1 , j ≠ m N | d j T d m | + β Σ i = 1 N exp ( 2 d i s t ( y i , d m ) / σ ) * x ‾ m i 2 * δ m i } - - - ( 4 )
s.t.||dm||2=1
WhereinIt is the vector of the m row of X,Being the matrix residual error about m-th word, the total number of dictionary is N,Be in X (m, i) element, ifThen ��mi=1, otherwise ��mi=0. In order to solve formula (4), it is possible to find 0 < exp (2dist (y of wherein the 3rd bound termi,dm)/��) < 1,Both products make Section 3 be almost 0, it is possible to this omitted; From another perspective, this actual meaning is introduced into the local restriction in cataloged procedure, and with new middle maintenance, this is only the d making to solve out at dictionarymCloser to training sample, this it is not necessarily the case.
S1.2.3 constantly iteration realizes step S1.2.1��S1.2.2; Until the dictionary D that each iteration is run no longer changes;
S1.3 utilizes the S1.1 dictionary obtained, and adopts multilamellar sparse coding that birds training image is carried out sparse coding sparse calculation, obtains the output of sparse coding feature, as shown in Figure 2:
In ground floor sparse coding, training image is uniformly divided into two kinds of sizes of 16x16 or 32x32, respectively in the block of both image sizes, utilize the dictionary that S1.1 obtains, carry out sparse coding sparse calculation, obtain the sparse coding coefficient of ground floor, utilize the extraction mode that maximizes of 2*2 size to obtain the feature output of ground floor sparse coding;
The feature of ground floor sparse coding is exported the input as second layer sparse coding, in second layer sparse coding, the feature of the sparse coding of ground floor is exported generated two-dimensional matrix, and after it is uniformly divided into 8*8 size, utilize the dictionary that S1.1 obtains, carry out sparse coding sparse calculation, obtain the sparse coding coefficient of the second layer; The extraction mode that maximizes utilizing 2*2 size obtains the feature output of second layer sparse coding;
The feature of ground floor is exported and the feature output of the second layer all carries out space delamination sampling, obtain final sparse coding feature output; The processing procedure of space delamination sampling is exactly that image is divided into the subgraph region of 2*2 and 4*4, then in whole figure, 1/2 subgraph region and 1/4 subgraph region, the rectangular histogram of the feature output of statistics regional, then the feature of regional is exported rectangular histogram and carry out cascade, obtain final sparse coding feature output.
S1.4 classifier training
The sparse coding feature of step S1.3 gained exports the linear support vector machine grader of feeding, and (support vector machine grader principle is referred to document Chang, Chih-Chung; Lin, Chih-Jen, 2011, LIBSVM:Alibraryforsupportvectormachines, ACMTransactionsonIntelligentSystemsandTechnology), obtain the maximum classification areal model between different classes of birds;
S2 test process
To test image, adopt the method for step S1.3 to obtain the output of sparse coding feature, send into the birds of grader gained of S1.4 different classes of between maximum classification areal model, it is judged that the birds classification output that current test image is corresponding.
Above-described embodiment is the present invention preferably embodiment; but embodiments of the present invention are also not restricted by the embodiments; the change made under other any spirit without departing from the present invention and principle, modification, replacement, combination, simplification; all should be the substitute mode of equivalence, be included within protection scope of the present invention.

Claims (5)

1. the birds image-recognizing method of a multilamellar sparse coding feature, it is characterised in that comprise the following steps:
S1 trains process
S1.1 collects various birds training images, forms birds training dataset;
S1.2 dictionary learning
S1.2.1 set the object function of dictionary learning as:
m i n D , X | | Y - D X | | F 2 + &lambda; &Sigma; i = 1 N &Sigma; j = 1 , j &NotEqual; i N | d i T d j | + &beta; &Sigma; i = 1 N | | e i &CenterDot; x i | | 2 - - - ( 1 )
Wherein Y is the picture element matrix collection of image block, yiIt it is the picture element matrix of i-th image block in picture element matrix collection Y; D is the dictionary that need to learn, and dictionary number is N, and dictionary element is diAnd dj; X is the weight coefficient of dictionary, and its element is xi;It is F norm, e i = exp ( &lsqb; d i s t ( y i , d 1 ) , ... , d i s t ( y i , d N ) &rsqb; T / &sigma; ) , And dist (yi,dj) it is yiWith djEuclidean distance, representing matrix dot product, �� is weight, ��, �� be balance two kinds constraint weight coefficients;
S1.2.2 calculates the code coefficient X of input pixel signal matrix stack Y, shown in the fresh target function such as formula (2) obtained, shown in its analytic solutions such as formula (3);
m i n D , X | | Y - D X | | F 2 + &beta; &Sigma; i = 1 N | | e i &CenterDot; x i | | 2 - - - ( 2 )
s . t . &ForAll; i , 1 T x i = 1
C i = ( D T - 1 y i T ) ( D T - 1 y i T ) T x i = ( C i + &beta;diag 2 ( e ) ) \ 1 x i = x i / 1 T x i - - - ( 3 )
S1.2.3 is after trying to achieve code coefficient X, and the word in dictionary optimizes renewal in order by KSVD algorithm, and object function (1) is updated to:
m i n d m { x &OverBar; m T x &OverBar; m d m T d m - 2 R m x &OverBar; m + &lambda; &Sigma; j = 1 , j &NotEqual; m N | d j T d m | } - - - ( 4 )
s.t.||dm||2=1
WhereinIt is the vector of X m row,It it is the picture element matrix collection Y residual error about m-th word; N is the total number of dictionary;
S1.2.3 constantly iteration realizes step S1.2.1��S1.2.2; Until the dictionary D that each iteration is run no longer changes;
S1.3 utilizes the S1.1 dictionary obtained, and adopts multilamellar sparse coding that birds training image is carried out sparse coding sparse calculation, obtains the output of sparse coding feature;
S1.4 classifier training
The sparse coding feature of step S1.3 gained is exported and sends into linear support vector machine grader, obtain the maximum classification areal model between different classes of birds;
S2 test process
To test image, adopt the method for step S1.3 to obtain the output of sparse coding feature, send into the birds of grader gained of S1.4 different classes of between maximum classification areal model, it is judged that the birds classification output that current test image is corresponding.
2. the birds image-recognizing method of multilamellar sparse coding feature according to claim 1, it is characterised in that described multilamellar sparse coding is specially two-layer sparse coding, and step is as follows:
In ground floor sparse coding, training image is uniformly divided into two kinds of sizes of 16x16 or 32x32, respectively in the image block of both sizes, utilize the dictionary that S1.1 obtains, carry out sparse coding sparse calculation, obtain the sparse coding coefficient of ground floor, obtain the feature output of ground floor sparse coding;
The feature of ground floor sparse coding is exported the input as second layer sparse coding, in second layer sparse coding, the feature of the sparse coding of ground floor is exported generated two-dimensional matrix, and after it is uniformly divided into 8*8 size, utilize the dictionary that S1.1 obtains, carry out sparse coding sparse calculation, obtain the sparse coding coefficient of the second layer; Obtain the feature output of second layer sparse coding;
The feature of ground floor is exported and the feature output of the second layer carries out cascade, obtain final sparse coding feature output.
3. the birds image-recognizing method of multilamellar sparse coding feature according to claim 2, it is characterised in that the feature output of described acquisition ground floor sparse coding, particularly as follows:
The extraction mode that maximizes utilizing 2*2 size obtains the feature output of ground floor sparse coding.
4. the birds image-recognizing method of multilamellar sparse coding feature according to claim 3, it is characterised in that the feature output of described acquisition second layer sparse coding, particularly as follows:
The extraction mode that maximizes utilizing 2*2 size obtains the feature output of second layer sparse coding.
5. the birds image-recognizing method of multilamellar sparse coding feature according to claim 2, it is characterised in that the described feature output feature of ground floor exported with the second layer carries out cascade, obtains final sparse coding feature output, particularly as follows:
The feature of ground floor is exported and the feature output of the second layer all carries out space delamination sampling, obtain final sparse coding feature output; The processing procedure of space delamination sampling particularly as follows: be divided into the subgraph region of 2*2 and 4*4 by image, then in whole figure, 1/2 subgraph region and 1/4 subgraph region, the rectangular histogram of the feature output of statistics regional, then the feature of regional is exported rectangular histogram and carry out cascade, obtain final sparse coding feature output.
CN201510964442.2A 2015-12-18 2015-12-18 Bird image recognition method by multilayer sparse coding features Pending CN105631469A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510964442.2A CN105631469A (en) 2015-12-18 2015-12-18 Bird image recognition method by multilayer sparse coding features

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510964442.2A CN105631469A (en) 2015-12-18 2015-12-18 Bird image recognition method by multilayer sparse coding features

Publications (1)

Publication Number Publication Date
CN105631469A true CN105631469A (en) 2016-06-01

Family

ID=56046381

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510964442.2A Pending CN105631469A (en) 2015-12-18 2015-12-18 Bird image recognition method by multilayer sparse coding features

Country Status (1)

Country Link
CN (1) CN105631469A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106203414A (en) * 2016-07-01 2016-12-07 昆明理工大学 A kind of based on the method differentiating dictionary learning and the scene image character detection of rarefaction representation
CN108197591A (en) * 2018-01-22 2018-06-22 北京林业大学 A kind of birds individual discrimination method based on multiple features fusion transfer learning
CN109117732A (en) * 2018-07-16 2019-01-01 国网江西省电力有限公司电力科学研究院 A kind of transmission line of electricity relates to the identification of bird failure bird kind figure sound and control method
CN109165636A (en) * 2018-09-28 2019-01-08 南京邮电大学 A kind of sparse recognition methods of Rare Birds based on component-level multiple features fusion
CN109344878A (en) * 2018-09-06 2019-02-15 北京航空航天大学 A kind of imitative hawk brain feature integration Small object recognition methods based on ResNet
CN110392893A (en) * 2017-02-17 2019-10-29 考吉森公司 Image processing method for content detection
CN111914599A (en) * 2019-05-09 2020-11-10 四川大学 Fine-grained bird recognition method based on semantic information multi-layer feature fusion

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103942550A (en) * 2014-05-04 2014-07-23 厦门大学 Scene text recognition method based on sparse coding characteristics

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103942550A (en) * 2014-05-04 2014-07-23 厦门大学 Scene text recognition method based on sparse coding characteristics

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
GUO LIHUA等: "a two-layer local constrained sparse coding method for fine-grained visual categorization", 《ARXIV》 *

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106203414A (en) * 2016-07-01 2016-12-07 昆明理工大学 A kind of based on the method differentiating dictionary learning and the scene image character detection of rarefaction representation
CN106203414B (en) * 2016-07-01 2019-07-05 昆明理工大学 A method of based on the scene picture text detection for differentiating dictionary learning and rarefaction representation
CN110392893A (en) * 2017-02-17 2019-10-29 考吉森公司 Image processing method for content detection
CN108197591A (en) * 2018-01-22 2018-06-22 北京林业大学 A kind of birds individual discrimination method based on multiple features fusion transfer learning
CN109117732A (en) * 2018-07-16 2019-01-01 国网江西省电力有限公司电力科学研究院 A kind of transmission line of electricity relates to the identification of bird failure bird kind figure sound and control method
CN109344878A (en) * 2018-09-06 2019-02-15 北京航空航天大学 A kind of imitative hawk brain feature integration Small object recognition methods based on ResNet
CN109344878B (en) * 2018-09-06 2021-03-30 北京航空航天大学 Eagle brain-like feature integration small target recognition method based on ResNet
CN109165636A (en) * 2018-09-28 2019-01-08 南京邮电大学 A kind of sparse recognition methods of Rare Birds based on component-level multiple features fusion
CN111914599A (en) * 2019-05-09 2020-11-10 四川大学 Fine-grained bird recognition method based on semantic information multi-layer feature fusion
CN111914599B (en) * 2019-05-09 2022-09-02 四川大学 Fine-grained bird recognition method based on semantic information multi-layer feature fusion

Similar Documents

Publication Publication Date Title
CN109344736B (en) Static image crowd counting method based on joint learning
CN105631469A (en) Bird image recognition method by multilayer sparse coding features
CN103258214B (en) Based on the Classifying Method in Remote Sensing Image of image block Active Learning
CN111126488B (en) Dual-attention-based image recognition method
CN109063649B (en) Pedestrian re-identification method based on twin pedestrian alignment residual error network
CN103984959A (en) Data-driven and task-driven image classification method
CN104268593A (en) Multiple-sparse-representation face recognition method for solving small sample size problem
US20170262478A1 (en) Method and apparatus for image retrieval with feature learning
CN110942091A (en) Semi-supervised few-sample image classification method for searching reliable abnormal data center
CN106991355A (en) The face identification method of the analytical type dictionary learning model kept based on topology
CN104809469A (en) Indoor scene image classification method facing service robot
CN103745205A (en) Gait recognition method based on multi-linear mean component analysis
CN102122353A (en) Method for segmenting images by using increment dictionary learning and sparse representation
CN110751027B (en) Pedestrian re-identification method based on deep multi-instance learning
CN104077742B (en) Human face sketch synthetic method and system based on Gabor characteristic
CN113269224A (en) Scene image classification method, system and storage medium
CN103593674A (en) Cervical lymph node ultrasonoscopy feature selection method
CN105654122A (en) Spatial pyramid object identification method based on kernel function matching
CN104751175A (en) Multi-label scene classification method of SAR (Synthetic Aperture Radar) image based on incremental support vector machine
CN105868711B (en) Sparse low-rank-based human behavior identification method
CN114692732A (en) Method, system, device and storage medium for updating online label
CN110111365B (en) Training method and device based on deep learning and target tracking method and device
CN113657414B (en) Object identification method
CN103440651A (en) Multi-label image annotation result fusion method based on rank minimization
CN106033546A (en) Behavior classification method based on top-down learning

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20160601