CN103903624B

CN103903624B - Periodical pitch detection method under a kind of gauss heat source model environment

Info

Publication number: CN103903624B
Application number: CN201410124483.6A
Authority: CN
Inventors: 张小恒
Original assignee: Chongqing Technology and Business Institute
Current assignee: Chongqing Technology and Business Institute
Priority date: 2014-03-31
Filing date: 2014-03-31
Publication date: 2016-06-01
Anticipated expiration: 2034-03-31
Also published as: CN103903624A

Abstract

The present invention provides the Periodical pitch detection method under a kind of gauss heat source model environment. It is characterized in that utilizing the voice storehouse (voice storehouse B) containing gauss heat source model to construct 4 rank semi-invariant diagonal slices vectors, the voice storehouse under quiet environment (voice storehouse A) is utilized to extract fundamental tone cycle parameter, 4 rank semi-invariant diagonal slices vectors are trained with the input and output of fundamental tone cycle parameter as generalized regression nerve networks learning sample, again using the 4 vectorial inputs as the GRNNs trained of rank semi-invariant diagonal slices of input voice frame, namely the output obtaining neural network GRNNs input the fundamental tone cycle of voice frame.

Description

Periodical pitch detection method under a kind of gauss heat source model environment

Technical field

The present invention relates to Periodical pitch detection method, particularly the Periodical pitch detection method under a kind of gauss heat source model environment.

Background technology

The fundamental tone cycle, in compress speech, there was purposes widely in the speech processes fields such as voice analysis synthesis and speech recognition as the basic parameter of voice. Accurately and reliably estimating and to extract the fundamental tone cycle most important to Speech processing, it directly can affect the quality of synthetic speech, reduces the verity of voice and naturalness in phonetic recognization rate and speech coding and decoding system. Pitch determination mainly contains auto-relativity function method, average magnitude difference function method and the method for falling spectrum etc., but these methods are difficult to better effects under low signal-to-noise ratio environment, many innovatory algorithm are had in recent years for the pitch determination in noise environment, mostly make use of autocorrelative function, and autocorrelative function can only suppress white Gaussian noise, it is invalid to gauss heat source model. Given this, the present invention provides a kind of special in the Periodical pitch detection method under gauss heat source model.

Summary of the invention

Have obvious deficiency for the pitch determination that carries out of prior art under gauss heat source model, the present invention provide a kind of utilize the fourth-order cumulant diagonal slice vector of voice frame to carry out gauss heat source model under Periodical pitch detection method.

The method comprises the following steps:

(1) being reset under gauss heat source model environment the voice storehouse B made under gauss heat source model environment by the voice storehouse A under quiet environment, voice storehouse A is the some set of voice digital sample, voice storehouse B is the some set of voice digital sample, wherein L is total number of sample points;

(2) respectively to the speech signal sampling point temporally order framing in voice storehouse 1 and voice storehouse 2, paired voice frame is obtained

;

WhereinFor the voice frame of voice storehouse A,

WhereinFor the voice frame of voice storehouse B,

Wherein N is voice frame length, and i is voice frame ordinal number;

(3) voice frame is calculated4 rank semi-invariant diagonal slicesWhereinFor the voice frame sampling periodIntegral multiple, and construct 4 rank semi-invariant diagonal slices vectors of the i-th frame voice frame, and do stdn and can obtain

;

(4) voice frame is estimatedFundamental tone cycle parameter, and be designated as;

(5) will,Input and output as generalized regression nerve networks GRNNs learning sample are trained, n be input and output sample to sum, its core widthForSquare root of the variance;

(6) to the temporally order framing of input speech signal sampling point, and stdn 4 rank semi-invariant diagonal slices corresponding with it is calculated

;

(7) willIt is input in the GRNNs trained, can fundamental tone cycle of this voice frame��

The technique scheme of the present invention, compared with prior art, has the following advantages:

A, utilize voice frame fourth-order cumulant diagonal slice vector suppress gauss heat source model can have good effect;

The generalized regression nerve networks that B, utilization train estimates the fundamental tone cycle, it is possible to possess the performance advantage of precision and speed simultaneously;

Accompanying drawing explanation

Fig. 1 is the method flow diagram of the present invention

Embodiment

The present invention propose gauss heat source model environment under Periodical pitch detection method by reference to the accompanying drawings and embodiment be described as follows further:

The method flow of the present invention as shown in Figure 1, comprises the following steps:

(1) reset under gauss heat source model environment the voice storehouse B made under gauss heat source model environment by the voice storehouse A under quiet environment;

(2) respectively to the speech signal sampling point temporally order framing of voice storehouse A and voice storehouse B, paired voice frame is obtained

;

WhereinFor the voice frame of voice storehouse A,The voice frame of voice storehouse B, i is voice frame ordinal number;

;

(6) to the temporally order framing of input speech signal sampling point, and stdn 4 rank semi-invariant diagonal slices corresponding with it is calculated;

The specific embodiment of each step of aforesaid method of the present invention is described in detail as follows:

The embodiment of the voice storehouse A in aforesaid method step (1) records the voice of China 30, main province ,city and area male sex and 30 women, and length 20 minutes during everyone voice, time total, length is 20 hours. The embodiment of voice storehouse B is superposition gauss heat source model on the basis of voice storehouse A.

The embodiment of voice storehouse A and voice storehouse B signal sampling point temporally order framing is by 8KHz frequency sampling by aforesaid method step (2), removes the voice sampling point of Hz noise through high pass. Every 25ms is also exactly that 200 voice sampling points form a frame.

In aforesaid method step (3), the embodiment of 4 rank semi-invariant diagonal slices vectors of stdn is 10 rank vectors

��

The embodiment of aforesaid method step (4) is: the fundamental tone cycle parameter asking for present frame by the method described by linear prediction (MELP) the speech coding algorithm standard of United States Government's 2400b/s mixed excitation.

The embodiment of aforesaid method step (5) is: by 4 rank semi-invariant diagonal slices vectors of stdnWith the fundamental tone cycleThe input and output of generalized regression nerve networks learning sample are trained, the core width of neural networkForSquare root of the variance.

Temporally sequentially the embodiment of framing is consistent with the embodiment of method steps (2) to input speech signal sampling point for aforesaid method step (6).

Aforesaid method step (7) embodiment is: by the stdn 4 rank semi-invariant diagonal slices of input speech signal sampling point voice frameAs the input of the generalized regression nerve networks trained in method steps (5), the fundamental tone cycle of this voice frame can be obtained��

Claims

1. the Periodical pitch detection method under a gauss heat source model environment, it is characterised in that the method comprises the following steps:

;

(6) to the temporally order framing of input speech signal sampling point, and 4 corresponding with it rank semi-invariant diagonal slices are calculated;

2. the Periodical pitch detection method under gauss heat source model environment according to claim 1, it is characterised in that, in described step (2), each frame comprises 200 voice sampling points.