TW200412570A

TW200412570A - Method for changing voice tone

Info

Publication number: TW200412570A
Application number: TW92100730A
Authority: TW
Inventors: Wen-Yuan Chen; Qi-Ren Zhong; Zhi-Yong Hong
Original assignee: Sounding Technology Inc
Priority date: 2003-01-14
Filing date: 2003-01-14
Publication date: 2004-07-16
Also published as: TW594672B

Abstract

The present invention provides a method for changing voice tone, which includes the following steps: providing a time frame; sequentially storing digital data into the frame with a constant sampling frequency; and using the sampling frequency to convert a synthesized value into an analog signal output, wherein the synthesized value is obtained by performing a weighting operation on the data taken from the frame by a first pointer and a second pointer, and the first pointer and second pointer are changed corresponding to an offset.

Description

200412570 五、發明說明（1) 〔發明所屬之技術領域〕本案為一種改變聲音聲調的方法，其特徵在於能即時 (real time)改變聲音聲調，且毋須經過複雜運算及大量記憶體即可完成。〔先前技術〕變聲器係一種應用相當廣泛的裝置，其可改變說話者聲調之特性，若應用於卡拉0K或兒童玩具等消費性產品中，能夠達到極佳之娛樂效果；而在一些通訊設備，如手機或電話，亦可使用變聲器達到隱藏發話者身份之目的。變聲器要能夠改變聲調，主要係透過聲調改變之運算法則來完成，由於聲調改變後之語音必須立即輸出，因此聲調改變之運算係即時的。習知用以改變聲調之習用技術，乃係先將語音信號轉換至頻域（f r e q u e n c y d 〇 m a i η )進行處理後，再轉換回時域（t i m e d o m a i η )輸出，然而此法不僅運算複雜度高，也需要大量記憶體配合，如要達到即時運算之需求，硬體成本相對提高，因而無法在中低價位之消費性產品採取此項技術。另外，利用唱盤轉速之增減可提高或降低音調的原理，吾人可在數位音訊處理過程，將放音之頻率加快（音調升高）或變慢（音調降低），以達到改變聲調高低之目的。然而，此種方法會造成聲音資料處理後，放音時間和原來說話者的時間長度不一樣，所以也無法直接應用在需要即時改變聲調之場合。200412570 V. Description of the invention (1) [Technical field to which the invention belongs] This case is a method for changing the tone of a sound, which is characterized by the ability to change the tone of the sound in real time without the need for complicated calculations and a large amount of memory. [Previous Technology] A voice changer is a device that is widely used. It can change the characteristics of the speaker's tone. If it is used in consumer products such as karaoke or children's toys, it can achieve excellent entertainment effects; and in some communication equipment , Such as mobile phones or telephones, you can also use a voice changer to hide the identity of the caller. To change the tone of a voice changer, it is mainly accomplished through the algorithm of the tone change. Since the voice after the tone change must be output immediately, the calculation of the tone change is immediate. The conventional technique used to change the tone is to first convert the speech signal to the frequency domain (frequencyd 0mai η) for processing, and then convert it back to the time domain (timedomai η) output. However, this method not only has high computational complexity, It also requires a large amount of memory. If the real-time computing needs are to be achieved, the hardware cost is relatively high, so this technology cannot be adopted in low- and medium-priced consumer products. In addition, by using the principle that the rotation speed of the turntable can increase or decrease the pitch, we can increase the frequency of the playback (the pitch increases) or slow down (the pitch decreases) during the digital audio processing to achieve the purpose of changing the pitch. . However, this method will cause the playback time to be different from the original speaker's time after processing the sound data, so it cannot be directly applied to situations where the tone needs to be changed immediately.

200412570 I五、發明說明（2〕〔本案目的〕為因應上述需求，本案乃構思一士法，其處理後之語音資料，不僅可在唯=聲音聲調的方儲轉硬度下達到聲調升降之效果，而且只需少】相同，訊時間長存量即可完成；另外，本案所構思之運算及記憶體換（ADC)及數位類比轉換（DAC)之取樣頻率其類比數位體實現上更加方便，進而達到即時（real、I .相同，使得音聲調之目的。 U me)改變聲〔發明内容〕為達上述目的，本案提出一種改變聲音聲調的方法，係包含下列步驟：提供一時框；以一取樣頻率，將一數’ 資料依序存入該時框；以及以該取樣頻率，將一合成值$ 為:類比信號輸出，其中該合成值係由一第一指標和一第二指標自該時框取出之資料，經一權值（weighting#w 後獲得，而該第一指標和該第二指標係因應一偏移量# 變。人一如所述之改變聲音聲調的方法，其中該時樞係為一聲音資料時框（frame)。 '如所述之改變聲音聲調的方法，其中該取樣頻率係為類比轉為數位之取樣頻率。如所述之改變聲音聲調的方法，其中該取樣頻率係為數位轉為類比之取樣頻率。200412570 I. Description of the invention (2) [Objective of the case] In order to meet the above requirements, this case is to conceive a magic law. The processed voice data can not only achieve the effect of tone rise and fall under the hardness of the square-storage rotation of the sound only. And, it only needs to be the same, and the time can be completed with a long inventory. In addition, the calculation and sampling frequency of the ADC and digital analog conversion (DAC) conceived in this case are more convenient to implement analog digital. To achieve the real-time (real, I., the same, to make the tone of the tone. U me) change the sound [Abstract] In order to achieve the above purpose, this case proposes a method of changing the tone of the sound, which includes the following steps: provide a time frame; take a sample Frequency, sequentially storing a number of data into the time frame; and using the sampling frequency, a composite value $ is: an analog signal output, where the composite value is determined by a first indicator and a second indicator since then The information extracted from the frame is obtained after a weighting (weighting # w), and the first index and the second index are changed according to an offset #. The person changes the sound as described Method, wherein the time axis is a frame of sound data. 'The method of changing the tone of a sound as described, wherein the sampling frequency is a frequency of sampling from analog to digital. Changing the tone of a sound as described The method, wherein the sampling frequency is a digital to analog sampling frequency.

第5頁 200412570 五、發明說明（3) 如所述之改變聲音聲調的方法，其中該數位資料係依序並週而復始地存入該時框。如所述之改變聲音聲調的方法，其中透過該偏移量5 之改變可以調整聲音聲調之高低，其中5 >0表示輸出聲音的聲調提高，5 < 0表示聲調下降，5 = 0則聲調不變。如所述之改變聲音聲調的方法，其中因應該偏移量5 之改變，於每次自該時框取出資料時，令該第一指標p 1 = ，再令該第二指標p2 = pl + (N/2)，N為該時框之長度。Page 5 200412570 V. Description of the invention (3) The method of changing the tone of a sound as described, wherein the digital data is sequentially and repeatedly stored in the time frame. As described in the method for changing the tone of a sound, the pitch of the sound can be adjusted by changing the offset 5, where 5 > 0 means that the tone of the output sound is increased, 5 < 0 means that the tone is decreased, and 5 = 0. The tone does not change. As described in the method for changing the tone of a sound, in response to a change of the offset 5, each time data is fetched from the time frame, the first index p 1 =, and then the second index p 2 = pl + (N / 2), where N is the length of the time frame.

如所述之改變聲音聲調的方法，其中因應該第一指標 p 1 > ( N - 1 )時，令pl=pl-N，N為該時框之長度。如所述之改變聲音聲調的方法，其中因應該第一指標 p 1 <0時，令pl=pl+N，N為該時框之長度。如所述之改變聲音聲調的方法，其中因應該第二指標 p2〉（N-1)時，令N為該時框之長度。如所述之改變聲音聲調的方法，其中因應該第二指標 ρ2 < 0時，令ρ2 = ρ2 + Ν，Ν為該時框之長度。如所述之改變聲音聲調的方法，其中該權值運算為一三角窗（Triangular Window)轉換。As described in the method for changing the tone of a sound, in response to the first index p 1 > (N-1), let pl = pl-N, where N is the length of the time frame. As described in the method for changing the tone of a sound, in response to the first index p 1 < 0, let pl = pl + N, where N is the length of the time frame. As described in the method for changing the tone of a sound, in response to the second index p2> (N-1), let N be the length of the time frame. As described in the method for changing the tone of a sound, in response to the second index ρ2 < 0, let ρ2 = ρ2 + Ν, where N is the length of the time frame. The method of changing the tone of a sound as described, wherein the weight operation is a Triangular Window conversion.

〔實施方式〕請參見第一圖，為本案改變聲音聲調的方法之較佳實施例之系統方塊圖。如圖所示，吾人設訂類比數位轉換器 11(ADC，Analog to Digital Converter)之取樣頻率為[Embodiment] Please refer to the first figure, which is a system block diagram of a preferred embodiment of the method for changing the tone of a sound. As shown in the figure, I set the sampling frequency of the analog to digital converter 11 (ADC, Analog to Digital Converter) as

第6頁 200412570 五、發明說明（4) SR，則取樣所得之資料，經由聲調改變運算法則1 2處理後，再透過數位類比轉換器13(DAC，Digital to Analog Converter)以相同取樣頻率SR送出。第二圖為聲音資料時框（frame)，其長度為N，時框内之資料以 x[0]，…，x[N-1 ]表示。第三圖為相對應於第二圖時框資料之權值（w e i g h t i n g )，為說明方便，本案係以三角窗（T r i a n g u 1 a r W i n d o w )轉換為例，實際應用上當然可以採用其他函數如漢明窗（H a m m i n g W i n d 〇 w )或漢尼窗 (Hanning Window)等。以下為聲調改變運算法則之詳細步驟：步驟一、將第二圖時框内之資料 X [ 0 ]，…，X [ N- 1 ]預設為 0 〇步驟二、將第三圖第一指標p 1預設為0。步驟三、決定偏移量5 ，5 >0表示輸出聲音的聲調提高， 5 < 0表示聲調下降，5 = 0則聲調不變。步驟四、由第一圖中之ADC取得一新的取樣值，並將該取樣值送入第二圖之時框内，更新時框内之資料，即：；Φ]=Φ + 1] for ι = 0Α (N-2) 对γ-1]=新的取樣値步驟五、計算第一指標p 1及第二指標p 2 :Page 6 200412570 V. Description of the invention (4) SR, then the sampled data is processed by the tone change algorithm 1 2 and then sent through the digital to analog converter 13 (DAC, Digital to Analog Converter) at the same sampling frequency SR . The second picture is the frame of sound data, whose length is N. The data in the frame is represented by x [0], ..., x [N-1]. The third picture is the weighting corresponding to the frame data in the second picture. For the convenience of explanation, this case uses the Triangu 1 ar W indow conversion as an example. Of course, other functions such as Hamming window or Hanning window. The following are the detailed steps of the tone changing algorithm: Step 1. Preset the data in the frame X [0],…, X [N-1] in the second picture to 0. Step 2. Set the first indicator in the third picture p 1 is preset to 0. Step 3: Determine the offset 5, 5 > 0 means the tone of the output sound is increased, 5 < 0 means the tone is decreased, and 5 = 0 the tone is not changed. Step 4. Obtain a new sampling value from the ADC in the first figure, and send the sampling value to the box at the time of the second picture, and update the data in the box at the time of updating, that is: Φ] = Φ + 1] for ι = 0Α (N-2) for γ-1] = new sampling. Step 5. Calculate the first index p 1 and the second index p 2:

200412570 五、發明說明（5) ρί = pi-l· δ if (pl>N) ρ\ = ρ\-Ν if (pi < 0) = + ^ ρ2 = ρ\Λ-^) if (ρ2 > Ν) ρ2 = ρ2 - Ν if {ρ2 < 0) ρ2 = p2-l· Ν 步驟六、進行權值運算，利用權值w [ ρ 1 ]和w [ ρ2 ]計算新的合成值：200412570 V. Description of the invention (5) ρί = pi-l · δ if (pl > N) ρ \ = ρ \ -N if (pi < 0) = + ^ ρ2 = ρ \ Λ- ^) if (ρ2 & gt Ν) ρ2 = ρ2-Ν if {ρ2 < 0) ρ2 = p2-l · Ν Step 6. Perform a weight calculation, and use the weights w [ρ 1] and w [ρ2] to calculate a new composite value:

新的合成値=4户1] X对户1] + ^[;?2]χ对户2] 上式可簡化成新的合成値=W|>1] X 4 户 1] + (1 - Η>1]) X ζ|>2] =4^2] + (pipl] - κ[ρ2]) χ ^ip\]The new compound 値 = 4 households 1] X pair of households 1] + ^ [;? 2] χ pair of households 2] The above formula can be simplified into a new compound 値 = W | > 1] X 4 households 1] + (1 -Η > 1]) X ζ | > 2] = 4 ^ 2] + (pipl)-κ [ρ2]) χ ^ ip \]

步驟七、將新的合成值送至DAC，以和ADC相同之取樣頻率 SR撥放。步驟八、重複步驟四，如此即可達到改變聲調的功能。上述步驟中，亦可以偏移量5 < 0時表示輸出聲音之聲調提高，（5 > 0表示聲調下降，而步驟五之第一指標ρ 1計算則變Step 7. Send the new synthesized value to the DAC, and put it at the same sampling frequency SR as the ADC. Step 8. Repeat step 4 to achieve the function of changing the tone. In the above steps, an offset of 5 < 0 indicates that the tone of the output sound is increased, (5 > 0 indicates that the tone is decreased, and the calculation of the first index ρ 1 in step 5 is changed.

第8頁 200412570Page 8 200412570

本案就技術層面而言性： θ 至少具有下面特徵以及進步 1 ·只使用一個時框暫存語音資料，可節省使用記情 UAM);而第三圖中之權值w[n]’由於對稱關係 ;時只要w[n], n = 0，.·.，（N/2) —i，此（N/2)個權值關可係先计异儲存於唯讀記憶體（R〇M)中，再以杳 …十 =…成本⑽之成—本遠Μ， •计#新5成值吩，只需計算第一指標Μ和第二乂如步驟A )，㈣對整個時框重新 ::r:rr時.(real ―)改變聲音聲調之目的… (·; 一.固（TriangUlar Window)轉換等權值函數 fUnCtl〇n)之特性，合成之語音資料車^；骨順，不會因資料不連續而產生、上異 ^ L . 、竿月丨頁理（smoothing)。爆3 ，因此無須再進行平滑化處本案所揭露之技術，々日而其前所未有mi;;热習*技術川康以實施，請，申請專利範圍如附專利十生，麦依法提出專利之申This case is technical in terms of: θ has at least the following characteristics and improvements1. Only one time frame is used to temporarily store voice data, which can save the use of memorization); and the weight w [n] 'in the third figure is due to symmetry Relationship; as long as w [n], n = 0, ..., (N / 2) —i, these (N / 2) weights can be stored in read-only memory (ROM) ), And then take 杳… ten =… cost ⑽—benyuan M, • Calculate #new 50% value phen, just calculate the first index M and the second (as in step A), and then re-evaluate the entire time frame :: r: rr Hours. (real ―) The purpose of changing the tone of the sound ... (·; I. The characteristics of the weight function fUnCtl〇n) such as the solid (TriangUlar Window) transformation, the synthesized voice data car ^; It will be generated due to discontinuities in data, differences ^ L., Smoothing. Explosion 3, so no further smoothing is required. The technology disclosed in this case was unprecedented the next day; hot learning * technology Chuan Kang implemented, please, apply for the scope of patents such as ten years with patents, and Mai filed a patent application according to law

200412570 圖式簡單說明本案得藉由下列圖示及詳細說明，俾得一更深入之瞭解：第一圖：本案改變聲音聲調的方法之較佳實施例之系統方塊圖第二圖：聲音資料時框（frame) 第三圖：相對應於第二圖時框資料之權值 (weighting) 圖不主要兀件之圖號如下·200412570 Schematic illustration of the case The following figure and detailed description can be used to gain a deeper understanding: Figure 1: System block diagram of the preferred embodiment of the method of changing the tone of the sound in the case. Figure 2: When the sound data Frame (third): Corresponding to the weighting of the frame data in the second image, the figure numbers of the main components are as follows:

1 1 :類比/數位轉換器（ADC ) 1 2 :聲調改變運算法則 13 :數位/類比轉換器（DAC) SR:取樣頻率 N :時框之長度 X [ 〇 ]，…，X [ N - 1 ]:時框内之資料 p 1 :第一指標 p2 :第二指標 δ :偏移量 w[pl]、 w[p2]:權值（weighting)1 1: Analog / digital converter (ADC) 1 2: Tone change algorithm 13: Digital / analog converter (DAC) SR: Sampling frequency N: Length of time frame X [〇], ..., X [N-1 ]: Data in the time frame p 1: First index p2: Second index δ: Offsets w [pl], w [p2]: Weighting

第10頁Page 10

Claims

200412570 6. Scope of patent application 1. A method for changing the tone of a sound, including the following steps: providing a time frame; sequentially storing a digital data into the time frame with a sampling frequency; and synthesizing a synthesis with the sampling frequency The value is converted into an analog signal output, where the composite value is obtained from a first indicator and a second indicator from the time frame, and is obtained after a weighting operation, and the first indicator and the first indicator The two indicators change in response to an offset. 2. The method for changing the tone of a sound as described in item 1 of the scope of the patent application, wherein the time frame is a sound data time frame (f r a m e). 3. The method for changing the tone of a sound as described in item 1 of the scope of the patent application, wherein the sampling frequency is an analog to digital sampling frequency. 4. The method for changing the tone of a sound as described in item 1 of the scope of the patent application, wherein the sampling frequency is a digital frequency converted to an analog sampling frequency. 5. The method of changing the tone of a sound as described in item 1 of the scope of the patent application, wherein the digital data is sequentially and repeatedly stored in the time frame. 6. The method for changing the tone of a sound as described in item 1 of the scope of the patent application, wherein the height of the tone of the sound can be adjusted through the change of the offset 5, wherein 5 > 0 means that the tone of the output sound is increased, 5 < 0 Indicates that the tone is decreasing, and 5 = 0 does not change the tone. 7. The method for changing the tone of a sound as described in item 6 of the scope of the patent application, wherein the first index pl = pl + 5 is set every time data is taken from the time frame in response to the change of the offset 5. Let the second index p2 = pl + (N / 2), where N is the length of the time frame. 8. The method for changing the tone of a sound as described in item 7 of the scope of patent application, which

200412570 6. In the scope of patent application, in response to the first index p 1 > (N-1), let p 1 = p BU N, where N is the length of the time frame. 9. The method for changing the tone of a sound as described in item 7 of the scope of the patent application, where in response to the first index p 1 < 0, let p 1 = p 1 + N, where N is the length of the time frame. 10. The method for changing the tone of a sound as described in item 7 of the scope of the patent application, wherein in response to the second index p2 > (N-1), let p2 = p2-N, where N is the length of the time frame.

11 1. The method for changing the tone of a sound as described in item 7 of the scope of the patent application, wherein in response to the second index p 2 < 0, let p 2 = p 2 + N, where N is the length of the time frame. 1 2. The method for changing the tone of a sound as described in item 1 of the scope of the patent application, wherein the weight calculation is a Triangular Window conversion.

Page 12