CN103903624B - Periodical pitch detection method under a kind of gauss heat source model environment - Google Patents

Periodical pitch detection method under a kind of gauss heat source model environment Download PDF

Info

Publication number
CN103903624B
CN103903624B CN201410124483.6A CN201410124483A CN103903624B CN 103903624 B CN103903624 B CN 103903624B CN 201410124483 A CN201410124483 A CN 201410124483A CN 103903624 B CN103903624 B CN 103903624B
Authority
CN
China
Prior art keywords
voice
heat source
frame
storehouse
source model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201410124483.6A
Other languages
Chinese (zh)
Other versions
CN103903624A (en
Inventor
张小恒
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chongqing Technology and Business Institute
Original Assignee
Chongqing Technology and Business Institute
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chongqing Technology and Business Institute filed Critical Chongqing Technology and Business Institute
Priority to CN201410124483.6A priority Critical patent/CN103903624B/en
Publication of CN103903624A publication Critical patent/CN103903624A/en
Application granted granted Critical
Publication of CN103903624B publication Critical patent/CN103903624B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

The present invention provides the Periodical pitch detection method under a kind of gauss heat source model environment. It is characterized in that utilizing the voice storehouse (voice storehouse B) containing gauss heat source model to construct 4 rank semi-invariant diagonal slices vectors, the voice storehouse under quiet environment (voice storehouse A) is utilized to extract fundamental tone cycle parameter, 4 rank semi-invariant diagonal slices vectors are trained with the input and output of fundamental tone cycle parameter as generalized regression nerve networks learning sample, again using the 4 vectorial inputs as the GRNNs trained of rank semi-invariant diagonal slices of input voice frame, namely the output obtaining neural network GRNNs input the fundamental tone cycle of voice frame.

Description

Periodical pitch detection method under a kind of gauss heat source model environment
Technical field
The present invention relates to Periodical pitch detection method, particularly the Periodical pitch detection method under a kind of gauss heat source model environment.
Background technology
The fundamental tone cycle, in compress speech, there was purposes widely in the speech processes fields such as voice analysis synthesis and speech recognition as the basic parameter of voice. Accurately and reliably estimating and to extract the fundamental tone cycle most important to Speech processing, it directly can affect the quality of synthetic speech, reduces the verity of voice and naturalness in phonetic recognization rate and speech coding and decoding system. Pitch determination mainly contains auto-relativity function method, average magnitude difference function method and the method for falling spectrum etc., but these methods are difficult to better effects under low signal-to-noise ratio environment, many innovatory algorithm are had in recent years for the pitch determination in noise environment, mostly make use of autocorrelative function, and autocorrelative function can only suppress white Gaussian noise, it is invalid to gauss heat source model. Given this, the present invention provides a kind of special in the Periodical pitch detection method under gauss heat source model.
Summary of the invention
Have obvious deficiency for the pitch determination that carries out of prior art under gauss heat source model, the present invention provide a kind of utilize the fourth-order cumulant diagonal slice vector of voice frame to carry out gauss heat source model under Periodical pitch detection method.
The method comprises the following steps:
(1) being reset under gauss heat source model environment the voice storehouse B made under gauss heat source model environment by the voice storehouse A under quiet environment, voice storehouse A is the some set of voice digital sample, voice storehouse B is the some set of voice digital sample, wherein L is total number of sample points;
(2) respectively to the speech signal sampling point temporally order framing in voice storehouse 1 and voice storehouse 2, paired voice frame is obtained
;
WhereinFor the voice frame of voice storehouse A,
WhereinFor the voice frame of voice storehouse B,
Wherein N is voice frame length, and i is voice frame ordinal number;
(3) voice frame is calculated4 rank semi-invariant diagonal slicesWhereinFor the voice frame sampling periodIntegral multiple, and construct 4 rank semi-invariant diagonal slices vectors of the i-th frame voice frame, and do stdn and can obtain
;
(4) voice frame is estimatedFundamental tone cycle parameter, and be designated as;
(5) will,Input and output as generalized regression nerve networks GRNNs learning sample are trained, n be input and output sample to sum, its core widthForSquare root of the variance;
(6) to the temporally order framing of input speech signal sampling point, and stdn 4 rank semi-invariant diagonal slices corresponding with it is calculated
;
(7) willIt is input in the GRNNs trained, can fundamental tone cycle of this voice frame��
The technique scheme of the present invention, compared with prior art, has the following advantages:
A, utilize voice frame fourth-order cumulant diagonal slice vector suppress gauss heat source model can have good effect;
The generalized regression nerve networks that B, utilization train estimates the fundamental tone cycle, it is possible to possess the performance advantage of precision and speed simultaneously;
Accompanying drawing explanation
Fig. 1 is the method flow diagram of the present invention
Embodiment
The present invention propose gauss heat source model environment under Periodical pitch detection method by reference to the accompanying drawings and embodiment be described as follows further:
The method flow of the present invention as shown in Figure 1, comprises the following steps:
(1) reset under gauss heat source model environment the voice storehouse B made under gauss heat source model environment by the voice storehouse A under quiet environment;
(2) respectively to the speech signal sampling point temporally order framing of voice storehouse A and voice storehouse B, paired voice frame is obtained
;
WhereinFor the voice frame of voice storehouse A,The voice frame of voice storehouse B, i is voice frame ordinal number;
(3) voice frame is calculated4 rank semi-invariant diagonal slicesWhereinFor the voice frame sampling periodIntegral multiple, and construct 4 rank semi-invariant diagonal slices vectors of the i-th frame voice frame, and do stdn and can obtain
;
(4) voice frame is estimatedFundamental tone cycle parameter, and be designated as;
(5) will,Input and output as generalized regression nerve networks GRNNs learning sample are trained, n be input and output sample to sum, its core widthForSquare root of the variance;
(6) to the temporally order framing of input speech signal sampling point, and stdn 4 rank semi-invariant diagonal slices corresponding with it is calculated;
(7) willIt is input in the GRNNs trained, can fundamental tone cycle of this voice frame��
The specific embodiment of each step of aforesaid method of the present invention is described in detail as follows:
The embodiment of the voice storehouse A in aforesaid method step (1) records the voice of China 30, main province ,city and area male sex and 30 women, and length 20 minutes during everyone voice, time total, length is 20 hours. The embodiment of voice storehouse B is superposition gauss heat source model on the basis of voice storehouse A.
The embodiment of voice storehouse A and voice storehouse B signal sampling point temporally order framing is by 8KHz frequency sampling by aforesaid method step (2), removes the voice sampling point of Hz noise through high pass. Every 25ms is also exactly that 200 voice sampling points form a frame.
In aforesaid method step (3), the embodiment of 4 rank semi-invariant diagonal slices vectors of stdn is 10 rank vectors
��
The embodiment of aforesaid method step (4) is: the fundamental tone cycle parameter asking for present frame by the method described by linear prediction (MELP) the speech coding algorithm standard of United States Government's 2400b/s mixed excitation.
The embodiment of aforesaid method step (5) is: by 4 rank semi-invariant diagonal slices vectors of stdnWith the fundamental tone cycleThe input and output of generalized regression nerve networks learning sample are trained, the core width of neural networkForSquare root of the variance.
Temporally sequentially the embodiment of framing is consistent with the embodiment of method steps (2) to input speech signal sampling point for aforesaid method step (6).
Aforesaid method step (7) embodiment is: by the stdn 4 rank semi-invariant diagonal slices of input speech signal sampling point voice frameAs the input of the generalized regression nerve networks trained in method steps (5), the fundamental tone cycle of this voice frame can be obtained��

Claims (2)

1. the Periodical pitch detection method under a gauss heat source model environment, it is characterised in that the method comprises the following steps:
(1) reset under gauss heat source model environment the voice storehouse B made under gauss heat source model environment by the voice storehouse A under quiet environment;
(2) respectively to the speech signal sampling point temporally order framing of voice storehouse A and voice storehouse B, paired voice frame is obtained
;
WhereinFor the voice frame of voice storehouse A,The voice frame of voice storehouse B, i is voice frame ordinal number;
(3) voice frame is calculated4 rank semi-invariant diagonal slicesWhereinFor the voice frame sampling periodIntegral multiple, and construct 4 rank semi-invariant diagonal slices vectors of the i-th frame voice frame, and do stdn and can obtain
;
(4) voice frame is estimatedFundamental tone cycle parameter, and be designated as;
(5) will,Input and output as generalized regression nerve networks GRNNs learning sample are trained, n be input and output sample to sum, its core widthForSquare root of the variance;
(6) to the temporally order framing of input speech signal sampling point, and 4 corresponding with it rank semi-invariant diagonal slices are calculated;
(7) willIt is input in the GRNNs trained, can fundamental tone cycle of this voice frame��
2. the Periodical pitch detection method under gauss heat source model environment according to claim 1, it is characterised in that, in described step (2), each frame comprises 200 voice sampling points.
CN201410124483.6A 2014-03-31 2014-03-31 Periodical pitch detection method under a kind of gauss heat source model environment Expired - Fee Related CN103903624B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410124483.6A CN103903624B (en) 2014-03-31 2014-03-31 Periodical pitch detection method under a kind of gauss heat source model environment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410124483.6A CN103903624B (en) 2014-03-31 2014-03-31 Periodical pitch detection method under a kind of gauss heat source model environment

Publications (2)

Publication Number Publication Date
CN103903624A CN103903624A (en) 2014-07-02
CN103903624B true CN103903624B (en) 2016-06-01

Family

ID=50994906

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410124483.6A Expired - Fee Related CN103903624B (en) 2014-03-31 2014-03-31 Periodical pitch detection method under a kind of gauss heat source model environment

Country Status (1)

Country Link
CN (1) CN103903624B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107025911B (en) * 2016-01-29 2019-03-12 重庆工商职业学院 Fundamental frequency detection method based on particle group optimizing
CN107045875B (en) * 2016-02-03 2019-12-06 重庆工商职业学院 fundamental tone frequency detection method based on genetic algorithm
CN107039051B (en) * 2016-02-03 2019-11-26 重庆工商职业学院 Fundamental frequency detection method based on ant group optimization
CN108507782B (en) * 2018-01-29 2020-02-21 江苏大学 Method for detecting period signal crypto period under strong background noise

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
PITCH DETERMINATION OF NOISY SPEECH USING HIGHER ORDER;Asuncion Moreno et al;《Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International》;19920326;第1卷;全文 *
基于三阶累积量对角切片的信号特征检测;范虹等;《计算机工程与应用》;20061231(第36期);全文 *
基于切片谱和神经网络的旋转机械故障诊断方法;周鹏,秦树人;《计量技术》;20071231(第9期);全文 *
基于四阶累积量对角切片的短波自适应通信信号检测;柯宏发等;《2006军事电子信息学术会议论文集》;20061231;全文 *
基于对角切片谱的小波神经网络水下目标识别;顾江建等;《计算机仿真》;20120229;第29卷(第2期);全文 *

Also Published As

Publication number Publication date
CN103903624A (en) 2014-07-02

Similar Documents

Publication Publication Date Title
CN103903624B (en) Periodical pitch detection method under a kind of gauss heat source model environment
CN103559888B (en) Based on non-negative low-rank and the sound enhancement method of sparse matrix decomposition principle
CN103325381B (en) A kind of speech separating method based on fuzzy membership functions
CN111081268A (en) Phase-correlated shared deep convolutional neural network speech enhancement method
CN109378010A (en) Training method, the speech de-noising method and device of neural network model
CN108597496A (en) A kind of speech production method and device for fighting network based on production
CN106504763A (en) Based on blind source separating and the microphone array multiple target sound enhancement method of spectrum-subtraction
CN104021373A (en) Semi-supervised speech feature variable factor decomposition method
CN110164472A (en) Noise classification method based on convolutional neural networks
CN106653056A (en) Fundamental frequency extraction model based on LSTM recurrent neural network and training method thereof
Li et al. Sams-net: A sliced attention-based neural network for music source separation
CN103021405A (en) Voice signal dynamic feature extraction method based on MUSIC and modulation spectrum filter
CN107247962A (en) A kind of real-time electrical appliance recognition and system based on sliding window
CN116434759B (en) Speaker identification method based on SRS-CL network
CN106340304B (en) A kind of online sound enhancement method under the environment suitable for nonstationary noise
CN102820037B (en) Chinese initial and final visualization method based on combination feature
Xu et al. The extraction and simulation of Mel frequency cepstrum speech parameters
WO2018001125A1 (en) Method and device for audio recognition
CN105741853B (en) A kind of digital speech perceptual hash method based on formant frequency
CN102637438A (en) Voice filtering method
Hongyan et al. Blind separation of noisy mixed speech signals based Independent Component Analysis
CN105206259A (en) Voice conversion method
CN111785262B (en) Speaker age and gender classification method based on residual error network and fusion characteristics
CN103559886A (en) Speech signal enhancing method based on group sparse low-rank expression
CN104636313B (en) A kind of redundancy extends the Blind Signal Separation method of single source observation signal

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20160601

Termination date: 20190331

CF01 Termination of patent right due to non-payment of annual fee