CN107039051B

CN107039051B - Fundamental frequency detection method based on ant group optimization

Info

Publication number: CN107039051B
Application number: CN201610077857.2A
Authority: CN
Inventors: 张小恒
Original assignee: Chongqing Technology and Business Institute
Current assignee: Chongqing Technology and Business Institute
Priority date: 2016-02-03
Filing date: 2016-02-03
Publication date: 2019-11-26
Anticipated expiration: 2036-02-03
Also published as: CN107039051A

Abstract

The present invention provides the fundamental frequency detection method under a kind of Arctic ice area environment.It is characterized in that extracting the signature waveform of voice frame signal using PEFAC algorithm, then it optimizes to construct new signature waveform using optimum optimization factor pair signature waveform, finally finds out estimated value of the frequency values corresponding to signature waveform peak-peak as fundamental frequency.Wherein the optimum optimization factor is obtained by ant group optimization (ACO) algorithm search.

Description

Fundamental frequency detection method based on ant group optimization

Technical field

Fundamental frequency detection side the present invention relates to fundamental frequency detection method, under especially a kind of Arctic ice area environment Method.

Background technique

Fundamental frequency detects the basic parameter as voice, in the speech processes neck such as speech analysis synthesis and speech Separation There is extensive purposes in domain.Accurately and reliably estimate and to extract fundamental frequency most important to Speech processing.High s/n ratio Fundamental frequency detection it is very mature, but these methods are difficult to better effects under low signal-to-noise ratio environment, especially extremely low Detection effect under signal-to-noise ratio environment is very poor.In consideration of it, the present invention provides the fundamental frequency inspection under a kind of Arctic ice area environment Survey method.

Summary of the invention

There is obvious deficiency for carry out fundamental frequency detection of the prior art under Arctic ice area environment, the present invention mentions The fundamental frequency detection method under a kind of Arctic ice area environment is supplied.Method includes the following steps:

1. training process:

(1) speech database is made voice framing { frm (1), frm (2) ..., frm (N) } in chronological order, and utilizes mark Quasi- algorithm extracts the fundamental frequency F of speech frame₀As fundamental frequency true value, and constitute sequence { F₀(1),F₀(2),…,F₀ (N) }, wherein N be speech frame sum；

(2) superimposed noise makes new voice framing sequence { frm on the basis of clean speech frame_noise(1),frm_noise (2),…,frm_noise(N) }, and using PEFAC algorithm by voice frame signal it is converted into corresponding signature waveform sequence

(3) fitness function in ant colony path is constructed together with signature waveform using Optimization Factor, and carry out global search, Until obtaining the optimum optimization factor.Wherein Optimization Factor is unknown M dimensional vector α=[α₁,α₂,…,α_M], Optimization Factor optimization Signature waveform laterConstitute new signature waveform sequenceExtract characteristic wave ShapePeak-peak peak and its corresponding to frequency values f_peakAs fundamental frequency estimated value, and formation sequence {(peak_max(1),f_peak(1)),(peak_max(2),f_peak(2)),…,(peak_max(N),f_peak(N)) }, ant colony path is direct Determine α value, then the fitness function in ant colony path is I.e. fundamental frequency estimated value and true value error are no more than 5% probability.Then be arranged ant colony (ACO) algorithm relevant parameter into Row search, finally finds out optimal Optimization Factor α_optimal。

The training process specific steps of ant colony are as follows:

Step 1: α=[α is enabled₁,α₂,…,α_i,…,α_M], one-dimensional value range [x_down x_up], search precision prec, then α_i ∈{x_down+prec,x_down+2*prec,…,x_down+ L*prec },

Wherein, floor () is bracket function.α is divided into M × L node, node alpha_ijWith pheromones τ_ijAnd it inspires Formula information η_ijIt is associated, it is α_i=x_downThe expectation of+j*preci, heuristic information η_ij=1/ Δ d_ij, whereinFundamental tone signature waveform as under clean speech environment and it is optimized after fundamental tone signature waveform it Between deviation；

Step 2: the building in path, kth ant go to the probability of node (i, j) are as follows:

Step 3: Pheromone update: when all ants build path, the pheromones on each node are as follows more It is new:

The pheromones that kth ant discharges on the node of place are

WhereinFor path T_k Fitness value.

Step 4: termination condition is to meet maximum number of iterations, at this time the corresponding α value, that is, α of optimal path_optimal。

2. test process:

(1) sub-frame processing is carried out to tested speech signal, and extracts its fundamental tone signature waveform

(2) optimum optimization factor-alpha is utilized_optimalOptimization, i.e., the fundamental tone signature waveform after constitution optimization

(3) it identifies and finds outPeak-peak corresponding to estimated value of the frequency values as fundamental frequency.

Above-mentioned technical proposal of the invention has the advantage that compared with prior art

A, signature waveform is extracted using PEFAC algorithm, inherits the advantages of inhibiting noise under its low signal-to-noise ratio environment；

B, the optimum optimization factor is searched for using ant colony optimization algorithm, so that the obtained fundamental frequency estimation after optimization Value is under low signal-to-noise ratio environment closer to true fundamental frequency value；

Detailed description of the invention

Fig. 1 is the system block diagram constituted according to one embodiment of present invention；

Specific embodiment

Fundamental frequency detection method combination accompanying drawings and embodiments under Arctic ice area environment proposed by the present invention are further It is described as follows:

Method flow of the invention is as shown in Figure 1, the following steps are included:

1. training process:

(1) to speech database framing in chronological order；

(2) and using canonical algorithm the fundamental frequency of speech frame is extracted as fundamental frequency true value；(3) it makes an uproar to being superimposed with The sound bank signal of sound framing and converts corresponding fundamental tone feature for voice frame signal using PEFAC algorithm in chronological order Waveform；

(4) ant colony fitness function is constructed together with fundamental tone signature waveform using Optimization Factor as unknown parameter, go forward side by side Row global search, until obtaining the optimum optimization factor.

2. test process:

(1) to the voice signal framing of spy's test；

(2) its corresponding fundamental tone signature waveform is converted by voice frame signal；

(3) it is optimized using trained optimum optimization factor pair fundamental tone signature waveform, thus after generating optimization Fundamental tone signature waveform, and frequency corresponding to the peak-peak of the fundamental tone signature waveform after optimization is calculated as fundamental frequency Estimated value.

Detailed description are as follows for the specific embodiment of each step of the above method of the present invention:

Sound bank embodiment in above-mentioned training process step (1) is TIMIT international standard database, 30 males and 30 The voice of name women, everyone voice duration 20 minutes, total duration are 20 hours.Temporally the sample rate of framing is 16KHZ, every frame Data are 160 sampled points；Above-mentioned training process step (2) extracts the standard method of sound bank fundamental frequency as praat calculation Method tool；

The noise signal type of above-mentioned training process step (3) superposition is white Gaussian noise, and voice framing method and step Suddenly the algorithm that frame format voice signal is converted into fundamental tone signature waveform is PEFAC algorithm, algorithm flow is as follows unanimously by (1):

(a) voice frame signal is mapped to frequency domain by Short Time Fourier Transform, and makees standardization as X_t' (q), Wherein q is logarithmetics frequency, i.e. q=log (f).

(b) to X_t' (q) convolution algorithm generates fundamental tone signature waveformWherein filter defines Are as follows:

Wherein β is chosen for meeting ∫ h (q) dq=0, and γ is set as 1.8；

The embodiment of Optimization Factor in above-mentioned training process step (4) is 10 dimensional vectors, and per one-dimensional value range It is 0.5~1.5.Fundamental tone signature waveform is the frequency domain vector signal that dimension is 250 dimensions, and frequency domain span is the fundamental tone of 60~400Hz Frequency domain maximum magnitude；

Optimization Factor α and fundamental tone signature waveformDimension it is inconsistent, therefore making optimization operationThat is point When multiplication, α must be extended to 250 vectors, and 250 dimensions are divided into 10 sections to extended method and every 25 dimension is identical.

Ant colony training parameter setting, such as Optimization Factor one-dimensional value range [0.51.5], search precision 0.01, α=2.5, β =2.5, ρ=0.5, ant colony sum are 100, maximum number of iterations 60

Pheromone release function embodiment

It is consistent in framing method and training process step (1) in above-mentioned test process step (1)；It is above-mentioned to test Fundamental tone signature waveform method for transformation in journey step (2) is consistent with training process step (3)；

The optimization operation of fundamental tone signature waveform and above-mentioned training process step (4) in above-mentioned test process step (3) are kept Unanimously, and Optimization Factor is using the optimum optimization factor-alpha for training generation_optimal。

Claims

1. the fundamental frequency detection method under a kind of Arctic ice area environment, it is characterised in that method includes the following steps:

A. training process:

(1) speech database is made voice framing { frm (1), frm (2) ..., frm (N) } in chronological order, and is calculated using standard The fundamental frequency F of method extraction speech frame₀As fundamental frequency true value, and constitute sequence { F₀(1),F₀(2),…,F₀(N) }, Middle N is the sum of speech frame；

(2) superimposed noise makes new voice framing sequence { frm on the basis of clean speech frame_noise(1),frm_noise(2),…, frm_noise(N) }, and using PEFAC algorithm by voice frame signal it is converted into corresponding signature waveform sequence

(3) ant colony fitness function is constructed together with signature waveform using Optimization Factor, and carry out global search, until obtaining most Good Optimization Factor, wherein Optimization Factor is unknown M dimensional vector α=[α₁,α₂,…,α_M], the feature after Optimization Factor optimization WaveformConstitute new signature waveform sequenceExtract signature waveformMaximum The peak value peak and frequency values f corresponding to it_peakAs fundamental frequency estimated value, and formation sequence { (peak_max(1),f_peak (1)),(peak_max(2),f_peak(2)),…,(peak_max(N),f_peak(N)) },

Ant colony pheromone release functionThat is fundamental frequency estimated value and true value error Probability no more than 5%, the relevant parameter that ant group algorithm is then arranged scan for, and finally find out optimal Optimization Factor α_optimal；

B. test process:

2. fundamental frequency detection method according to claim 1, it is characterised in that the ant colony optimization algorithm packet in this method Include following steps:

Step 1: α=[α is enabled₁,α₂,…,α_i,…,α_M], one-dimensional value range [x_down x_up], search precision prec, then α_i∈ {x_down+prec,x_down+2*prec,…,x_down+ L*prec },

Wherein, floor () is bracket function, and α is divided into M × L node, node alpha_ijWith pheromones τ_ijAnd heuristic information η_ijIt is associated, it is α_i=x_downThe expectation of+j*prec, heuristic information η_ij=1/ Δ d_ij, wherein Fundamental tone signature waveform as under clean speech environment and it is optimized after fundamental tone signature waveform between deviation；

Step 3: Pheromone update: when all ants build path, the pheromones on each node update as follows:

The pheromones that kth ant discharges on the node of place are

WhereinFor path T_kAdaptation Angle value；

3. fundamental frequency detection method according to claim 1, it is characterised in that the parameter of this method be set as optimization because Son is 10 dimensional vectors, and one-dimensional value range is 0.5~1.5, and fundamental tone signature waveform is the frequency domain vector signal that dimension is 250 dimensions, Frequency domain span is the fundamental tone frequency domain maximum magnitude of 60~400Hz；Ant colony training parameter is provided that search precision 0.01, α= 2.5, β=2.5, ρ=0.5, ant colony sum are 100, maximum number of iterations 60.