CN105304087B

CN105304087B - Voiceprint recognition method based on zero-crossing separating points

Info

Publication number: CN105304087B
Application number: CN201510586504.0A
Authority: CN
Inventors: 邓方; 关胜盘; 陈杰; 窦丽华; 吕建耀; 代凤驰; 陈文颉; 白永强; 李佳洪; 樊欣宇; 顾晓丹; 张乐乐
Original assignee: Beijing Institute of Technology BIT
Current assignee: Beijing Institute of Technology BIT
Priority date: 2015-09-15
Filing date: 2015-09-15
Publication date: 2017-03-22
Anticipated expiration: 2035-09-15
Also published as: CN105304087A

Abstract

The invention discloses a voiceprint recognition method. The method has the advantages of simple process, small calculation quantity and high precision of the voiceprint recognition, and has the concrete processes of collecting a sound signal and determining zero-crossing points; counting the number of sampling points between all adjacent zero-crossing points to build a one-dimension feature vector; counting the number of the sampling points between all of the zero-crossing points separated by one zero-crossing point to build a two-dimensional feature vector; deducing the rest points by analogy to obtain a multi-dimension feature vector of the sound signal; building a template base; and realizing the voiceprint recognition through the matching of the multi-dimensional feature vector of the detected sound signal and the template base.

Description

It is a kind of to be based on zero passage spaced points method for recognizing sound-groove

Technical field

The invention belongs to computer and information services field, and in particular to a kind of vocal print based on zero passage spaced points is known Other method.

Background technology

Sound groove recognition technology in e be 20th century mid-term the U.S. propose, carry out technique research earliest is U.S. shellfish The Lao Lunsikesite of your laboratory, he has carried out Analysis and Identification by the up to ten thousand vocal print figures to more than 100 Healthy Peoples, accurately Rate reaches 99.65%.The sound groove recognition technology in e of China is started late, and just proceeds by formal research in the nineties in 20th century, Carry out correlational study at present has Peking University, Tsing-Hua University, Acoustical Inst., Chinese Academy of Sciences and some political-legal departments.With The continuous development of each research field (such as material, communication, computer, life sciences etc.), sound groove recognition technology in e is also at full speed Development, its reliability and accuracy will be improved constantly.Mainly include with regard to the algorithm of Application on Voiceprint Recognition：Mel cepstrum coefficients (MFCC) Method, based on the method for HMM (HMM), method based on zero passage spaced points etc..

MFCC and HMM methods accuracy of identification is high, but their amount of calculation is too big, high to hardware requirement.Based between zero passage The method amount of calculation of dot interlace is little, it is only necessary to which relatively low sample rate and less sampled point can just complete the identification function of vocal print, But accuracy of identification is low.

The content of the invention

In view of this, the invention provides a kind of method for recognizing sound-groove based on zero passage spaced points, the method will be original One-dimensional zero passage spaced points recognition methodss expand to multidimensional identification, improve accuracy of identification.

Realize that technical scheme is as follows：

A kind of method for recognizing sound-groove based on zero passage spaced points, detailed process is：

S00, collected sound signal：

Sample rate is set as n, one section of acoustical signal is sampled, sampled point number is k；

S01, determine zero crossing：

If the sampled value when not having acoustical signal is X, when having acoustical signal, the value of each sampled point is X (1), X (2) ... X (k), when formula (1) is met：

(X(i)-X)(X(i+1)-X)≤0(i≤k-1) (1)

Then remember that X (i) is zero crossing, by that analogy, records all zero crossings；

The sampled value of all zero crossings is designated as into y (1), y (2) ... y (ε), wherein ε is the sum of all zero crossings；

S02, statistics zero passage spaced points：

First, the number of the sampled point between all adjacent zero crossings is counted, that is, is counted y (i+1) and is sampled and y (i) between The number of point, and be stored in matrix z1, wherein i=1,2 ... ε -1；The number of times that each element occurs in statistical matrix z1, And store the result of statistics in matrix w1, using w1 as the first dimensional feature vector；

Secondly, the number of the sampled point between the zero crossing of one zero crossing in all intervals is counted, that is, counts y (i+2) and y The number of sampled point between (i), and be stored in matrix z2, wherein i=1,2 ... ε -2；Each element in statistical matrix z2 The number of times of appearance, and the result of statistics is stored in matrix w2, using w2 as the second dimensional feature vector；

By that analogy, obtain successively be separated by two, three ..., sampled point between the zero crossing of N-1 zero crossing Number, obtain w3, w4 ..., wN；

S03, set up multidimensional characteristic matrix：

By w1, w2 ..., the short characteristic vector subsequent Zero of length in wN, obtain the characteristic vector of a N-dimensional；

S04, set up template base：

According to the mode of step S00 to S03, its corresponding N-dimensional feature is obtained respectively for various different acoustical signals Vector, builds template base；

S05, obtain matching result：

According to the mode of step S00 to S03, the N-dimensional characteristic vector for being detected acoustical signal is extracted, and by itself and template base The characteristic vector of middle acoustical signal is matched, and realizes the identification of vocal print.

Further, the process that matched of the present invention is：By the N-dimensional characteristic vector of detected acoustical signal and mould In plate storehouse, characteristic vector asks for Euclidean distance respectively, if the Euclidean distance of minimum is less than the threshold value of setting, by minimum euclidean distance The signal being detected otherwise, is considered as unknown signaling as the acoustical signal for matching by corresponding acoustical signal.

Beneficial effect：

Method provided by the present invention obtain successively be separated by one, two, three ..., the zero crossing of N-1 zero crossing it Between sampled point number, set up multidimensional characteristic matrix to be matched, with Mel cepstrum coefficients (MFCC) method, based on hidden Ma Er The method of section's husband's model (HMM) compares that amount of calculation is little, and compared with traditional voiceprint recognition algorithm, real-time is good, high precision.

Description of the drawings

Fig. 1 is the flow chart of method provided by the present invention.

Specific embodiment

Below in conjunction with the accompanying drawings, describe the present invention.

The invention provides a kind of method of Application on Voiceprint Recognition, as shown in figure 1, the method is concretely comprised the following steps：

S00, collected sound signal：

Sample rate is set as 3000HZ, one section of acoustical signal is sampled, sampled point number is 200.

S01, determine zero crossing：

If the sampled value when not having acoustical signal is X, when having acoustical signal, the value of each sampled point is X (1), X (2) ... X(k).When formula below is met：

(X(i)-X)(X(i+1)-X)≤0(i≤k-1) (1)

Then remember that X (i) is zero crossing, by that analogy, records all zero crossings.

The sampled value of all zero crossings is designated as into y (1), y (2) ... y (ε), wherein ε is the sum of all zero crossings.

S02, statistics zero passage spaced points：

For example, the element for occurring in matrix z1 is respectively α₁、α₂……α_k；α is counted respectively₁、α₂……α_kOccur in z1 Number of times, be designated as w1 (α₁)、w1(α₂)……w1(α_k), then using w1 as the first dimensional feature vector.

By that analogy, obtain successively the sampled point that is separated by between the zero crossing of two, three, four ... zero crossings Number, can obtain w3, w4 ...

S03, set up multidimensional characteristic matrix：

According to w1 obtained above, w2 ..., by zero padding behind wherein length short characteristic vector, make their length consistent, The vector of a multidimensional, feature of the multidimensional characteristic vectors comprising acoustical signal may finally be obtained.The dimension of characteristic vector It is not fixed, dimension is more, matching result can be more accurate, but amount of calculation can increases, typically takes 4.

S04, set up template base：

According to the mode of step S00 to S03,4 dimensional feature vectors are solved respectively for various different acoustical signals, build Template base.

S05, obtain matching result：

According to the mode of step S00 to S03, the multidimensional characteristic matrix for being detected acoustical signal is extracted, and by itself and template In storehouse, the multidimensional characteristic matrix of acoustical signal asks for Euclidean distance respectively, using Euclidean distance minimum corresponding to acoustical signal as The acoustical signal for matching.If the signal is considered as unknown signaling more than the thresholding of setting by minima.

Cite an actual example below to illustrate said method.

By taking tank, aircraft, train and the sound whistled as an example, every kind of sound collection is twice respectively as template and survey Sample sheet, calculates the Euclidean distance between each test sample and test template eigenmatrix, experimental result such as 1 institute of table respectively Show, it is seen that this method can be effectively differentiated to common all kinds of acoustical signals.

1 sample of table and template matching results

In sum, presently preferred embodiments of the present invention is these are only, is not intended to limit protection scope of the present invention. All any modification, equivalent substitution and improvements within the spirit and principles in the present invention, made etc., should be included in the present invention's Within protection domain.

Claims

1. a kind of method for recognizing sound-groove based on zero passage spaced points, it is characterised in that detailed process is：

S00, collected sound signal：

S01, determine zero crossing：

S02, statistics zero passage spaced points：

First, the number of statistics adjacent zero crossing y (i+1) sampled point and y (i) between, and be stored in matrix z1, wherein I=1,2 ... ε -1；The number of times that each element occurs in statistical matrix z1, and the result of statistics is stored in matrix w1, by w1 As the first dimensional feature vector；

Secondly, statistics is separated by the number of zero crossing y (i+2) sampled points and y (i) between of a zero crossing, and is stored to In matrix z2, wherein i=1,2 ... ε -2；The number of times that each element occurs in statistical matrix z2, and the result storage of statistics is arrived In matrix w2, using w2 as the second dimensional feature vector；

By that analogy, obtain successively be separated by two, three ..., the number of sampled point between the zero crossing of N-1 zero crossing, Obtain w3, w4 ..., wN；

S03, set up multidimensional characteristic matrix：

By w1, w2 ..., the short characteristic vector subsequent Zero of length in wN, make their length consistent, obtain the characteristic vector of N-dimensional；

S04, set up template base：

According to the mode of step S00 to S03, its corresponding N-dimensional characteristic vector is obtained respectively for various different acoustical signals, Build template base；

S05, obtain matching result：

According to the mode of step S00 to S03, the N-dimensional characteristic vector for being detected acoustical signal is extracted, and by itself and sound in template base The characteristic vector of message number is matched, and realizes the identification of vocal print.

2. method for recognizing sound-groove according to claim 1 based on zero passage spaced points, it is characterised in that the N=4.

3. method for recognizing sound-groove according to claim 1 based on zero passage spaced points, it is characterised in that described to be matched Process is：The N-dimensional characteristic vector of detected acoustical signal is asked for into Euclidean distance respectively with characteristic vector in template base, if minimum Euclidean distance less than setting threshold value, using the acoustical signal corresponding to minimum euclidean distance as the acoustical signal for matching, Otherwise, the signal being detected is considered as into unknown signaling.