CN101556795A - Method and device for computing voice fundamental frequency - Google Patents

Method and device for computing voice fundamental frequency Download PDF

Info

Publication number
CN101556795A
CN101556795A CNA2008100432334A CN200810043233A CN101556795A CN 101556795 A CN101556795 A CN 101556795A CN A2008100432334 A CNA2008100432334 A CN A2008100432334A CN 200810043233 A CN200810043233 A CN 200810043233A CN 101556795 A CN101556795 A CN 101556795A
Authority
CN
China
Prior art keywords
frequency
fundamental frequency
omega
definition
field
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CNA2008100432334A
Other languages
Chinese (zh)
Other versions
CN101556795B (en
Inventor
黄鹤云
林福辉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Spreadtrum Communications Shanghai Co Ltd
Original Assignee
Spreadtrum Communications Shanghai Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Spreadtrum Communications Shanghai Co Ltd filed Critical Spreadtrum Communications Shanghai Co Ltd
Priority to CN2008100432334A priority Critical patent/CN101556795B/en
Publication of CN101556795A publication Critical patent/CN101556795A/en
Application granted granted Critical
Publication of CN101556795B publication Critical patent/CN101556795B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention belongs to the field of signal processing and discloses a method and a device for computing voice fundamental frequency. The method and the device can more accurately estimate the fundamental frequency. In the invention, before the fundamental frequency computation, frequency domain signals in use are reconstructed to generate a reconstruction function with a continuous definitional domain. The function undergoes curve fitting in a definitional domain near each frequency domain peak value according to the corresponding frequency domain signals and effectively inhibits the corresponding frequency domain signals in other definitional domains. When the fundamental frequency is searched, candidate fundamental frequencies and a plurality of frequency multiplications thereof are comprehensively taken into consideration.

Description

The method and apparatus of computing voice fundamental frequency
Technical field
The present invention relates to field of voice signal, particularly the technology of computing voice fundamental frequency.
Background technology
Fast development along with network and multimedia technology, speech processing system has spread in each field such as broadcasting, TV, communication, from broadcasting, TV programme making apparatus all be unable to do without speech processing system to hand-held phone, portable audio/video playback apparatus.
Handle and the encoding and decoding speech field at voice signal, estimate that correctly fundamental frequency is extremely important.
From the angle of voice generation principle, voice derive from the vibration of the operatic tunes, produce sound wave, and the modulation through the sound channel organ obtains voice signal again.The vibration of the operatic tunes can determine the type of the voice signal that produces usually, vowel for example, consonant, fricative or the like.In the voice that reality occurs, vowel is seized of significant proportion.An English word is comprising the vowel of major part usually.On the angle of signal analysis, vowel is mainly by humorous wave component, and promptly its frequency component is made up of a fundamental frequency (also can abbreviate fundamental frequency as) and several its integer multiple frequencies.
Be in 4,161,625 the United States Patent (USP), to disclose a kind of method that from voice signal, obtains fundamental frequency in the patent No..In this patent, by initial voice signal is handled, obtain difference signal, adopt auto-correlation algorithm to obtain fundamental frequency again according to difference signal.
Because in the algorithm of actual speech encoding and decoding and voice signal processing (for example G.729 the encoding and decoding speech standard waits), traditional calculating fundamental frequency algorithm mainly is an auto-correlation algorithm, and promptly the maximum coefficient of autocorrelation by the computing voice signal finds specific value.Because comprise a large amount of noises in the voice signal usually, so use computing method may have certain deviation based on the fundamental frequency of auto-correlation algorithm.
Summary of the invention
The object of the present invention is to provide a kind of method and apparatus of computing voice fundamental frequency, can estimate fundamental frequency more exactly.
The invention discloses a kind of method of computing voice fundamental frequency, may further comprise the steps:
The voice signal of time domain is transformed to discrete frequency domain signal X i, i=1 wherein, 2 ..., N;
| X i| in find out each peak value M as local maximum j, j=1 wherein, 2 ..., L, L are the number of peak value, || expression takes absolute value;
In the related field of definition of described discrete frequency-region signal, L nonoverlapping regional Z of structure j, each Z jSize be scheduled to each Z jCover a D j, D wherein jBe M jIn the pairing value of field of definition;
With each Z jFor field of definition is constructed continuous function S respectively j(ω), ω ∈ Z j, satisfy | S ji)-| X i||<C1, wherein ω iBe X iIn the pairing value of field of definition, C1 is a positive constant;
At each Z jIn the field of definition that does not have to cover, constructed fuction S 0(ω), ω ∈ 0 F s 2 And ω ∉ Z j , F wherein sBe sampling rate, satisfy S 0(ω)<| X i|;
Will be by each S j(ω) and S 0The S that (ω) is combined into (ω) calculates fundamental frequency as frequency spectrum.
The invention also discloses a kind of equipment of computing voice fundamental frequency, comprising:
Converter unit is used for the voice signal of time domain is transformed to discrete frequency domain signal X i, i=1 wherein, 2 ..., N;
The peak value computing unit is used for | X i| in find out each peak value M as local maximum j, j=1 wherein, 2 ..., L, L are the number of peak value, || expression takes absolute value;
Reconfiguration unit is used in the related field of definition of described discrete frequency-region signal, L nonoverlapping regional Z of structure j, each Z jSize be scheduled to each Z jCover a D j, D wherein jBe M jIn the pairing value of field of definition; With each Z jFor field of definition is constructed continuous function S respectively j(ω), ω ∈ Z j, satisfy | S ji)-| X i||<C1, wherein ω iBe X iIn the pairing value of field of definition, C1 is a positive constant; At each Z jIn the field of definition that does not have to cover, constructed fuction S 0(ω), ω ∈ 0 F s 2 And ω∉ Z j , F wherein sBe sampling rate, satisfy S 0(ω)<| X i|;
The fundamental tone computing unit is used for by each S j(ω) and S 0The S that (ω) is combined into (ω) calculates fundamental frequency as frequency spectrum.
Embodiment of the present invention compared with prior art, the key distinction and effect thereof are:
Before calculating fundamental frequency, earlier used frequency-region signal is reconstructed, generate the reconstruction of function that field of definition is continuous, carry out curve fitting by corresponding frequency-region signal near the field of definition of this function each frequency domain peak value, in other field of definition, corresponding frequency-region signal is effectively suppressed.Because candidate's fundamental frequency and multiple frequency thereof are usually expressed as peak value, so, can improve accuracy and antijamming capability that fundamental frequency calculates by keeping near the frequency-region signal in the field of definition each peak value, significantly weakening frequency-region signal in other field of definition.The frequency-region signal that obtains by conversion disperses, by can more accurately advancing in the frequency spectrum of reconstruction of function representative to search for fundamental frequency to the serialization of field of definition.
Further, when pitch search, take all factors into consideration candidate's fundamental frequency and a plurality of frequency multiplication thereof, can make Search Results more accurate.
Further, can be by the reconstruction of function value in other field of definition outside near the field of definition peak value be made as 0, thus weaken irrelevant frequency component to greatest extent, further improve accuracy and antijamming capability that fundamental frequency calculates.
Description of drawings
Fig. 1 is the method flow diagram according to a kind of computing voice fundamental frequency of first embodiment of the invention;
Fig. 2 is the equipment structure chart according to a kind of computing voice fundamental frequency of third embodiment of the invention.
Embodiment
For making the purpose, technical solutions and advantages of the present invention clearer, embodiments of the present invention are described in further detail below in conjunction with accompanying drawing.
First embodiment of the present invention relates to a kind of method of computing voice fundamental frequency, as shown in Figure 1.
In step 110, the voice signal of importing is transformed into frequency domain from time domain.Specifically, the time domain voice signal of supposing input is x i, i=1,2 ..., N then can be converted into discrete frequency domain signal X by Fast Fourier Transform (FFT) (FastFourier Transform is called for short " FFT ") i, i=1 wherein, 2 ..., N.
Need to prove that in this step, the conversion from the time-domain signal to the frequency-region signal is embodied as example with FFT and describes, but in actual applications, also can realize by other modes.Such as, can be by modes such as discrete cosine transform (Discrete Cosine Transform, be called for short " DCT ") or modified discrete cosine transforms, the voice signal of time domain is transformed to discrete frequency-region signal.
Then, in step 120, from each frequency domain signal X iAbsolute value in, find out each peak value M as local maximum j, j=1 wherein, 2 ..., L, L are the number of peak value.Such as, earlier according to X 1, X 2...., X N, obtain the absolute value Y of each frequency-region signal 1, Y 2...., Y N, Y wherein i=| X i|, i=1,2 ..., N.Then, search for local maximum Y again i, as search for all and satisfy Y i>max (Y I+1, Y I-1) Y iAs peak value M j, the choosing method of this local maximum is actually among 3 o'clock and chooses maximal value, certainly, in actual applications, also can (among as 5 or 6) choose maximal value in more point.
Then, in step 130, reconstruct the continuous frequency spectrum of voice signal according to the frequency domain peak value of electing.Because each frequency-region signal that obtains after the FFT conversion disperses, and the field of definition continuous functions of frequency spectrum promptly can't be provided, this calculating to fundamental tone has caused difficulty, therefore needs reconstruct and serialization frequency spectrum.Specific as follows:
At first, entire spectrum is divided into two types.One type frequency spectrum is corresponding with the frequency component of fundamental frequency or its certain multiple, and the frequency spectrum of another kind of type then is and the irrelevant pairing frequency spectrum of frequency component of fundamental frequency.Because the frequency component of fundamental frequency and its certain multiple all shows as a local maximum usually on frequency spectrum, can think that therefore in step 120 selected peak value represented the frequency component of fundamental frequency and its certain multiple.Then be considered to the frequency component that has nothing to do in other parts of whole frequency axis.
Secondly, respectively this frequency spectrum of two types is carried out function reconstruct.Specifically, in the related field of definition of discrete frequency-region signal, L of structure respectively with L the corresponding nonoverlapping regional Z of peak value j, each Z jSize be scheduled to each Z jCover a D j, D wherein jBe M jIn the pairing value of field of definition.With each Z jFor field of definition is constructed continuous function S respectively j(ω), ω ∈ Z j, satisfy | S ji)-| X i||<C1, wherein ω iBe X iIn the pairing value of field of definition, C1 is a positive constant.At each Z jIn the field of definition that does not have to cover, constructed fuction S 0(ω), ω ∈ 0 F s 2 And ω ∉ Z j , F wherein sBe sampling rate, satisfy S 0i)<| X i|.Below to continuous function S jMake (ω) is described further.
In the present embodiment, by with the frequency-region signal absolute value of peak value correspondence with and former and later two frequency-region signal absolute values carry out the binomial interpolation, realize continuous function S jStructure (ω).Such as, first peak value M 1Corresponding frequency-region signal absolute value Yi is ω in the pairing value of field of definition i, its former and later two frequency-region signal absolute value (Y then I-1, Y I+1) be (ω in the pairing value of field of definition I-1, ω I+1).Suppose that this interpolation polynomial represented by following second degree trinomial expression:
f(x)=ax 2+bx+c
Then by Substitution method can solve corresponding coefficient a, b, c}:
[ a , b , c ] = [ Y i - 1 , Y i , Y i + 1 ] ω i - 1 2 ω i 2 ω i + 1 2 ω i - 1 ω i ω i + 1 1 1 1 - 1
Therefore, can obtain: S 1(ω)=a 1ω 2+ b 1ω+c 1
In like manner, can construct each S by the binomial interpolation method j(ω) function, i.e. S j(ω)=a jω 2+ b jω+c j
Need to prove, because in the present embodiment, peak value M jBe actually the maximal value of among 3 o'clock, choosing, if so first peak value M 1Corresponding frequency-region signal absolute value is Y i, then regional Z 1Reference position be Y I-1In the pairing value of field of definition, end position is Y I+1In the pairing value of field of definition, promptly Z 1 = [ F s N ω i - 1 , F s N ω i + 1 ] , In like manner, can obtain each Z jThe zone.The field of definition of the matched curve of each peak value correspondence also can be taked other arbitrarily rational length
For each Z jThe field of definition that does not have covering, because Pitch Information is not contained in these zones, therefore can be simply with these parts arbitrary function S 0(ω) replace, ω ∈ 0 F s 2 And ω ∉ Z j , Function S 0(ω) only need satisfy S 0i)<| X i| this condition gets final product.Such as, adopt null function, that is: S 0(ω)=0.
Because in this step, respectively this frequency spectrum of two types has been carried out function reconstruct, so entire spectrum all to be reconstructed into be a field of definition continuous functions, that is:
S ( ω ) = S 1 ( ω ) = a 1 ω 2 + b 1 ω + c 1 , ω ∈ Z 1 S 2 ( ω ) = a 2 ω 2 + b 2 ω + c 2 , ω ∈ Z 2 · · · S L ( ω ) = a L ω 2 + b L ω + c L , ω ∈ Z L S 0 ( ω ) = 0 , ω ∈ [ 0 , F s 2 ] andω ∉ Z j
Then, enter step 140, calculate fundamental frequency.Specifically, because in step 130, drawn a field of definition continuous functions S (ω), can directly derive fundamental frequency according to the function characteristics of this function S (ω).Such as, to search in the scope (as from 50 hertz to 500 hertz) that may exist of fundamental tone, the criterion of search is to find the frequency that satisfies following formula:
ω p = arg max ω Σ k = 1 N ( ω ) | S ( kω ) | 2
Wherein, N (ω) is to be the harmonic wave number of fundamental frequency with ω, ω pBe fundamental frequency.Need to prove that above-mentioned formula is an object lesson as search criteria, in actual applications, also can adopt other formula, as will square changing to 4 powers or 1 power etc. in the above-mentioned formula.Above-mentioned ω pThe essence of correlation formula is to take all factors into consideration candidate's fundamental frequency and a plurality of frequency multiplication thereof when pitch search, and concrete formula form can have other variation, can make Search Results more accurate like this
Because in the present embodiment, before calculating fundamental frequency, earlier used frequency-region signal is reconstructed, generate the reconstruction of function that field of definition is continuous, carry out curve fitting by corresponding frequency-region signal near the field of definition of this function each frequency domain peak value, in other field of definition, corresponding frequency-region signal is effectively suppressed.Because candidate's fundamental frequency and multiple frequency thereof are usually expressed as peak value, so, can improve accuracy and antijamming capability that fundamental frequency calculates by keeping near the frequency-region signal in the field of definition each peak value, significantly weakening frequency-region signal in other field of definition.The frequency-region signal that obtains by conversion disperses, by can more accurately advancing in the frequency spectrum of reconstruction of function representative to search for fundamental frequency to the serialization of field of definition.
What deserves to be mentioned is, in the present embodiment, to each Z jThe function S that the field of definition that does not have to cover is constructed 0(ω) be: S 0(ω)=0, thereby weaken irrelevant frequency component to greatest extent, further improve accuracy and antijamming capability that fundamental frequency calculates.And in actual applications, also can be with function S 0(ω) be changed to a very little value, can search fundamental frequency comparatively exactly equally.
Second embodiment of the present invention relates to a kind of method of computing voice fundamental frequency, and the present embodiment and first embodiment are roughly the same, and its difference is, in the first embodiment, is constructing continuous function S jIn the time of (ω), be by with the frequency-region signal absolute value of peak value correspondence with and former and later two frequency-region signal absolute values carry out the binomial interpolation and realize; And in the present embodiment, can be by fitting to the segmentation straight line, or come match with cubic polynomial, realize continuous function S jStructure (ω).
Method embodiment of the present invention can be realized in software, hardware, firmware or the like mode.No matter the present invention be with software, hardware, or the firmware mode realize, instruction code can be stored in the storer of computer-accessible of any kind (for example permanent or revisable, volatibility or non-volatile, solid-state or non-solid-state, medium fixing or that change or the like).Equally, storer can for example be programmable logic array (Programmable Array Logic, be called for short " PAL "), random access memory (Random Access Memory, be called for short " RAM "), programmable read only memory (Programmable Read Only Memory, be called for short " PROM "), ROM (read-only memory) (Read-Only Memory, be called for short " ROM "), Electrically Erasable Read Only Memory (Electrically Erasable Programmable ROM, be called for short " EEPROM "), disk, CD, digital versatile disc (Digital Versatile Disc is called for short " DVD ") or the like.
The 3rd embodiment of the present invention relates to a kind of equipment of computing voice fundamental frequency, as shown in Figure 2, comprising: converter unit is used for the voice signal of time domain is transformed to discrete frequency domain signal X i, i=1 wherein, 2 ..., N; The peak value computing unit is used for | X i| in find out each peak value M as local maximum j, j=1 wherein, 2 ..., L, L are the number of peak value, || expression takes absolute value; Reconfiguration unit is used in the related field of definition of discrete frequency-region signal, L nonoverlapping regional Z of structure j, each Z jSize be scheduled to each Z jCover a D j, D wherein jBe M jIn the pairing value of field of definition; With each Z jFor field of definition is constructed continuous function S respectively j(ω), ω ∈ Z j, satisfy | S ji)-| X i||<C1, wherein ω iBe X iIn the pairing value of field of definition, C1 is a positive constant; At each Z jIn the field of definition that does not have to cover, constructed fuction S 0(ω), ω ∈ 0 F s 2 And ω ∉ Z j , F wherein sBe sampling rate, satisfy S 0(ω i)<| X i|; The fundamental tone computing unit is used for by each S j(ω) and S 0The S that (ω) is combined into (ω) calculates fundamental frequency as frequency spectrum.
The fundamental tone computing unit calculates fundamental frequency in the following manner: fundamental tone may have scope search, the criterion of search is to find the frequency that satisfies following formula:
ω p = arg max ω Σ k = 1 N ( ω ) | S ( kω ) | 2
Wherein, N (ω) is to be the harmonic wave number of fundamental frequency with ω, ω pBe fundamental frequency.
Converter unit can adopt modes such as FFT, discrete cosine transform, modified discrete cosine transform, and the voice signal of time domain is transformed to discrete frequency-region signal.
Reconfiguration unit one of can be in the following ways realized S jStructure (ω): with the frequency-region signal absolute value of peak value correspondence with and former and later two frequency-region signal absolute values carry out the binomial interpolation or fit to the segmentation straight line, or come match with cubic polynomial.
Need to prove, each unit of mentioning in the present embodiment all is a logical block, physically, a logical block can be a physical location, it also can be the part of a physical location, can also realize that the physics realization mode of these logical blocks itself is not most important with the combination of a plurality of physical locations, the combination of the function that these logical blocks realized is the key that just solves technical matters proposed by the invention.
In addition, for outstanding innovation part of the present invention, present embodiment will not introduced not too close unit with solving technical matters relation proposed by the invention, and this does not show that there is not other unit in this equipment embodiment.
Though by with reference to some preferred embodiment of the present invention, the present invention is illustrated and describes, those of ordinary skill in the art should be understood that and can do various changes to it in the form and details, and without departing from the spirit and scope of the present invention.

Claims (10)

1. the method for a computing voice fundamental frequency is characterized in that, may further comprise the steps:
The voice signal of time domain is transformed to discrete frequency domain signal X i, i=1 wherein, 2 ..., N;
| X i| in find out each peak value M as local maximum j, j=1 wherein, 2 ..., L, L are the number of peak value, || expression takes absolute value;
In the related field of definition of described discrete frequency-region signal, L nonoverlapping regional Z of structure j, each Z jSize be scheduled to each Z jCover a D j, D wherein jBe M jIn the pairing value of field of definition;
With each Z jFor field of definition is constructed continuous function S respectively j(ω), ω ∈ Z j, satisfy | S ji)-| X i||<C1, wherein ω iBe X iIn the pairing value of field of definition, C1 is a positive constant;
At each Z jIn the field of definition that does not have to cover, constructed fuction S 0(ω), ω ∈ 0 F s 2 And ω ∉ Z j , F wherein sBe sampling rate, satisfy S 0i)<| X i|;
Will be by each S j(ω) and S 0The S that (ω) is combined into (ω) calculates fundamental frequency as frequency spectrum.
2. the method for computing voice fundamental frequency according to claim 1 is characterized in that, and is described with the step of S (ω) as frequency spectrum calculating fundamental frequency, realizes by following substep:
Fundamental tone may have scope search, the criterion of search is to find the frequency that satisfies following formula:
ω p = arg max ω Σ k = 1 N ( ω ) | S ( kω ) | 2
Wherein, N (ω) is to be the harmonic wave number of fundamental frequency with ω, ω pBe the result of calculation of fundamental frequency.
3. the method for computing voice fundamental frequency according to claim 2 is characterized in that, is transformed in the step of discrete frequency-region signal at the voice signal with time domain, adopts one of following mapping mode:
Fast fourier transform, discrete cosine transform, modified discrete cosine transform.
4. the method for computing voice fundamental frequency according to claim 3 is characterized in that,
Described at | X i| in find out each peak value M as local maximum jStep comprise following substep:
Calculate Y i=| X i|;
Search for all and satisfy Y i>max (Y I+1, Y I-1) Y iAs peak value M j
5. the method for computing voice fundamental frequency according to claim 4 is characterized in that,
At described structure continuous function S jIn the step (ω), realize one of in the following ways S jStructure (ω):
With the frequency-region signal absolute value of peak value correspondence with and former and later two frequency-region signal absolute values carry out the binomial interpolation or fit to the segmentation straight line, or come match with cubic polynomial.
6. the method for computing voice fundamental frequency according to claim 5 is characterized in that,
Described S 0(ω)=0.
7. the equipment of a computing voice fundamental frequency is characterized in that, comprising:
Converter unit is used for the voice signal of time domain is transformed to discrete frequency domain signal X i, i=1 wherein, 2 ..., N;
The peak value computing unit is used for | X i| in find out each peak value M as local maximum j, j=1 wherein, 2 ..., L, L are the number of peak value, || expression takes absolute value;
Reconfiguration unit is used in the related field of definition of described discrete frequency-region signal, L nonoverlapping regional Z of structure j, each Z jSize be scheduled to each Z jCover a D j, D wherein jBe M jIn the pairing value of field of definition; With each Z jFor field of definition is constructed continuous function S respectively j(ω), ω ∈ Z j, satisfy | S ji)-| X i||<C1, wherein ω iBe X iIn the pairing value of field of definition, C1 is a positive constant; At each Z jIn the field of definition that does not have to cover, constructed fuction S 0(ω), ω ∈ 0 F s 2 And ω ∉ Z j , F wherein sBe sampling rate, satisfy S 0i)<| X i|;
The fundamental tone computing unit is used for by each S j(ω) and S 0The S that (ω) is combined into (ω) calculates fundamental frequency as frequency spectrum.
8. the equipment of computing voice fundamental frequency according to claim 7 is characterized in that,
Described fundamental tone computing unit calculates fundamental frequency in the following manner:
Fundamental tone may have scope search, the criterion of search is to find the frequency that satisfies following formula:
ω p = arg max ω Σ k = 1 N ( ω ) | S ( kω ) | 2
Wherein, N (ω) is to be the harmonic wave number of fundamental frequency with ω, ω pBe the result of calculation of fundamental frequency.
9. the equipment of computing voice fundamental frequency according to claim 8 is characterized in that,
One of the following mapping mode that adopts described converter unit realizes the voice signal of time domain is transformed to discrete frequency-region signal:
Fast fourier transform, discrete cosine transform, modified discrete cosine transform.
10. the equipment of computing voice fundamental frequency according to claim 9 is characterized in that,
Described reconfiguration unit is realized one of in the following ways S jStructure (ω):
With the frequency-region signal absolute value of peak value correspondence with and former and later two frequency-region signal absolute values carry out the binomial interpolation or fit to the segmentation straight line, or come match with cubic polynomial.
CN2008100432334A 2008-04-09 2008-04-09 Method and device for computing voice fundamental frequency Active CN101556795B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2008100432334A CN101556795B (en) 2008-04-09 2008-04-09 Method and device for computing voice fundamental frequency

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2008100432334A CN101556795B (en) 2008-04-09 2008-04-09 Method and device for computing voice fundamental frequency

Publications (2)

Publication Number Publication Date
CN101556795A true CN101556795A (en) 2009-10-14
CN101556795B CN101556795B (en) 2012-07-18

Family

ID=41174885

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2008100432334A Active CN101556795B (en) 2008-04-09 2008-04-09 Method and device for computing voice fundamental frequency

Country Status (1)

Country Link
CN (1) CN101556795B (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102779526A (en) * 2012-08-07 2012-11-14 无锡成电科大科技发展有限公司 Pitch extraction and correcting method in speech signal
CN103258543A (en) * 2013-04-12 2013-08-21 大连理工大学 Method for expanding artificial voice bandwidth
CN103426441A (en) * 2012-05-18 2013-12-04 华为技术有限公司 Method and device for detecting correctness of pitch period
CN103794222A (en) * 2012-10-31 2014-05-14 展讯通信(上海)有限公司 Method and apparatus for detecting voice fundamental tone frequency
CN107045875A (en) * 2016-02-03 2017-08-15 重庆工商职业学院 Fundamental frequency detection method based on genetic algorithm
CN107833581A (en) * 2017-10-20 2018-03-23 广州酷狗计算机科技有限公司 A kind of method, apparatus and readable storage medium storing program for executing of the fundamental frequency for extracting sound

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6754630B2 (en) * 1998-11-13 2004-06-22 Qualcomm, Inc. Synthesis of speech from pitch prototype waveforms by time-synchronous waveform interpolation
JP2000209097A (en) * 1999-01-14 2000-07-28 Sony Corp Signal processor, signal processing method, signal recorder, signal reproducing device and recording medium
US7272551B2 (en) * 2003-02-24 2007-09-18 International Business Machines Corporation Computational effectiveness enhancement of frequency domain pitch estimators
CN1246825C (en) * 2003-08-04 2006-03-22 扬智科技股份有限公司 Method for predicationg intonation estimated value of voice signal

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10249315B2 (en) 2012-05-18 2019-04-02 Huawei Technologies Co., Ltd. Method and apparatus for detecting correctness of pitch period
US11741980B2 (en) 2012-05-18 2023-08-29 Huawei Technologies Co., Ltd. Method and apparatus for detecting correctness of pitch period
CN103426441A (en) * 2012-05-18 2013-12-04 华为技术有限公司 Method and device for detecting correctness of pitch period
US10984813B2 (en) 2012-05-18 2021-04-20 Huawei Technologies Co., Ltd. Method and apparatus for detecting correctness of pitch period
CN103426441B (en) * 2012-05-18 2016-03-02 华为技术有限公司 Detect the method and apparatus of the correctness of pitch period
US9633666B2 (en) 2012-05-18 2017-04-25 Huawei Technologies, Co., Ltd. Method and apparatus for detecting correctness of pitch period
CN102779526B (en) * 2012-08-07 2014-04-16 无锡成电科大科技发展有限公司 Pitch extraction and correcting method in speech signal
CN102779526A (en) * 2012-08-07 2012-11-14 无锡成电科大科技发展有限公司 Pitch extraction and correcting method in speech signal
CN103794222B (en) * 2012-10-31 2017-02-22 展讯通信(上海)有限公司 Method and apparatus for detecting voice fundamental tone frequency
CN103794222A (en) * 2012-10-31 2014-05-14 展讯通信(上海)有限公司 Method and apparatus for detecting voice fundamental tone frequency
CN103258543A (en) * 2013-04-12 2013-08-21 大连理工大学 Method for expanding artificial voice bandwidth
CN107045875A (en) * 2016-02-03 2017-08-15 重庆工商职业学院 Fundamental frequency detection method based on genetic algorithm
CN107045875B (en) * 2016-02-03 2019-12-06 重庆工商职业学院 fundamental tone frequency detection method based on genetic algorithm
CN107833581A (en) * 2017-10-20 2018-03-23 广州酷狗计算机科技有限公司 A kind of method, apparatus and readable storage medium storing program for executing of the fundamental frequency for extracting sound
CN107833581B (en) * 2017-10-20 2021-04-13 广州酷狗计算机科技有限公司 Method, device and readable storage medium for extracting fundamental tone frequency of sound

Also Published As

Publication number Publication date
CN101556795B (en) 2012-07-18

Similar Documents

Publication Publication Date Title
JP7053545B2 (en) Model-based predictions in critically sampled filter banks
CN101556795B (en) Method and device for computing voice fundamental frequency
EP3096317B1 (en) Signal processor and method for processing a signal
US20110090993A1 (en) Signal processing method and data processing method and apparatus
CN102257567A (en) Sound signal processing apparatus, sound encoding apparatus and sound decoding apparatus
DK2954517T3 (en) HIDE OF LOST AUDIO FRAMES
RU2677385C2 (en) Processing device, method and computer program for processing of sound signal using truncated part of overlapping window analysis or synthesis
CN104603873A (en) Device, method and computer program for freely selectable frequency shifts in the sub-band domain
US20090062945A1 (en) Method and System for Estimating Frequency and Amplitude Change of Spectral Peaks
CN102132342B (en) Method for updating an encoder by filter interpolation
CN102568484A (en) Warped spectral and fine estimate audio encoding
Bartkowiak et al. Harmonic Sinusoidal+ Noise Modeling of Audio based on Multiple F0 Estimation
Cohen The Propagation of Noise Fields in a Dispersive Medium
Miranda et al. Considerering Pure GPU Model for an Audio Fingerprinting System
Al Sallab et al. Hardware implementation of distributed speech recognition system front end

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20180410

Address after: The 300456 Tianjin FTA test area (Dongjiang Bonded Port) No. 6865 North Road, 1-1-1802-7 financial and trade center of Asia

Patentee after: Xinji Lease (Tianjin) Co.,Ltd.

Address before: 201203 Shanghai city Zuchongzhi road Pudong Zhangjiang hi tech park, Spreadtrum Center Building 1, Lane 2288

Patentee before: SPREADTRUM COMMUNICATIONS (SHANGHAI) Co.,Ltd.

TR01 Transfer of patent right
EE01 Entry into force of recordation of patent licensing contract

Application publication date: 20091014

Assignee: SPREADTRUM COMMUNICATIONS (SHANGHAI) Co.,Ltd.

Assignor: Xinji Lease (Tianjin) Co.,Ltd.

Contract record no.: 2018990000196

Denomination of invention: Method and device for computing voice fundamental frequency

Granted publication date: 20120718

License type: Exclusive License

Record date: 20180801

EE01 Entry into force of recordation of patent licensing contract
TR01 Transfer of patent right

Effective date of registration: 20221020

Address after: 201203 Shanghai city Zuchongzhi road Pudong New Area Zhangjiang hi tech park, Spreadtrum Center Building 1, Lane 2288

Patentee after: SPREADTRUM COMMUNICATIONS (SHANGHAI) Co.,Ltd.

Address before: 300456 1-1-1802-7, north area of financial and Trade Center, No. 6865, Asia Road, Tianjin pilot free trade zone (Dongjiang Bonded Port Area)

Patentee before: Xinji Lease (Tianjin) Co.,Ltd.

TR01 Transfer of patent right