TW200412570A - Method for changing voice tone - Google Patents
Method for changing voice tone Download PDFInfo
- Publication number
- TW200412570A TW200412570A TW92100730A TW92100730A TW200412570A TW 200412570 A TW200412570 A TW 200412570A TW 92100730 A TW92100730 A TW 92100730A TW 92100730 A TW92100730 A TW 92100730A TW 200412570 A TW200412570 A TW 200412570A
- Authority
- TW
- Taiwan
- Prior art keywords
- tone
- sound
- changing
- time frame
- patent application
- Prior art date
Links
Landscapes
- Telephone Function (AREA)
- Telephonic Communication Services (AREA)
Abstract
Description
200412570 五、發明說明(1) 〔發明所屬之技術領域〕 本案為一種改變聲音聲調的方法,其特徵在於能即時 (real time)改變聲音聲調,且毋須經過複雜運算及大量 記憶體即可完成。 〔先前技術〕 變聲器係一種應用相當廣泛的裝置,其可改變說話者 聲調之特性,若應用於卡拉0K或兒童玩具等消費性產品 中,能夠達到極佳之娛樂效果;而在一些通訊設備,如手 機或電話,亦可使用變聲器達到隱藏發話者身份之目的。 變聲器要能夠改變聲調,主要係透過聲調改變之運算 法則來完成,由於聲調改變後之語音必須立即輸出,因此 聲調改變之運算係即時的。習知用以改變聲調之習用技 術,乃係先將語音信號轉換至頻域(f r e q u e n c y d 〇 m a i η )進 行處理後,再轉換回時域(t i m e d o m a i η )輸出,然而此法 不僅運算複雜度高,也需要大量記憶體配合,如要達到即 時運算之需求,硬體成本相對提高,因而無法在中低價位 之消費性產品採取此項技術。 另外,利用唱盤轉速之增減可提高或降低音調的原 理,吾人可在數位音訊處理過程,將放音之頻率加快(音 調升高)或變慢(音調降低),以達到改變聲調高低之目 的。然而,此種方法會造成聲音資料處理後,放音時間和 原來說話者的時間長度不一樣,所以也無法直接應用在需 要即時改變聲調之場合。200412570 V. Description of the invention (1) [Technical field to which the invention belongs] This case is a method for changing the tone of a sound, which is characterized by the ability to change the tone of the sound in real time without the need for complicated calculations and a large amount of memory. [Previous Technology] A voice changer is a device that is widely used. It can change the characteristics of the speaker's tone. If it is used in consumer products such as karaoke or children's toys, it can achieve excellent entertainment effects; and in some communication equipment , Such as mobile phones or telephones, you can also use a voice changer to hide the identity of the caller. To change the tone of a voice changer, it is mainly accomplished through the algorithm of the tone change. Since the voice after the tone change must be output immediately, the calculation of the tone change is immediate. The conventional technique used to change the tone is to first convert the speech signal to the frequency domain (frequencyd 0mai η) for processing, and then convert it back to the time domain (timedomai η) output. However, this method not only has high computational complexity, It also requires a large amount of memory. If the real-time computing needs are to be achieved, the hardware cost is relatively high, so this technology cannot be adopted in low- and medium-priced consumer products. In addition, by using the principle that the rotation speed of the turntable can increase or decrease the pitch, we can increase the frequency of the playback (the pitch increases) or slow down (the pitch decreases) during the digital audio processing to achieve the purpose of changing the pitch. . However, this method will cause the playback time to be different from the original speaker's time after processing the sound data, so it cannot be directly applied to situations where the tone needs to be changed immediately.
200412570 I五、發明說明(2〕 〔本案目的〕 為因應上述需求,本案乃構思一士 法,其處理後之語音資料,不僅可在唯=聲音聲調的方 儲 轉 硬 度下達到聲調升降之效果,而且只需少】相同,訊時間長 存量即可完成;另外,本案所構思之運算及記憶體 換(ADC)及數位類比轉換(DAC)之取樣頻率其類比數位 體實現上更加方便,進而達到即時(real、I .相同,使得 音聲調之目的。 U me)改變聲 〔發明内容〕 為達上述目的,本案提出一種改變聲音聲調的方法, 係包含下列步驟:提供一時框;以一取樣頻率,將一數’ 資料依序存入該時框;以及以該取樣頻率,將一合成值$ 為:類比信號輸出,其中該合成值係由一第一指標和一第 二指標自該時框取出之資料,經一權值(weighting#w 後獲得,而該第一指標和該第二指標係因應一偏移量# 變。 人 一如所述之改變聲音聲調的方法,其中該時樞係為一聲 音資料時框(frame)。 '如所述之改變聲音聲調的方法,其中該取樣頻率係為 類比轉為數位之取樣頻率。 如所述之改變聲音聲調的方法,其中該取樣頻率係為 數位轉為類比之取樣頻率。200412570 I. Description of the invention (2) [Objective of the case] In order to meet the above requirements, this case is to conceive a magic law. The processed voice data can not only achieve the effect of tone rise and fall under the hardness of the square-storage rotation of the sound only. And, it only needs to be the same, and the time can be completed with a long inventory. In addition, the calculation and sampling frequency of the ADC and digital analog conversion (DAC) conceived in this case are more convenient to implement analog digital. To achieve the real-time (real, I., the same, to make the tone of the tone. U me) change the sound [Abstract] In order to achieve the above purpose, this case proposes a method of changing the tone of the sound, which includes the following steps: provide a time frame; take a sample Frequency, sequentially storing a number of data into the time frame; and using the sampling frequency, a composite value $ is: an analog signal output, where the composite value is determined by a first indicator and a second indicator since then The information extracted from the frame is obtained after a weighting (weighting # w), and the first index and the second index are changed according to an offset #. The person changes the sound as described Method, wherein the time axis is a frame of sound data. 'The method of changing the tone of a sound as described, wherein the sampling frequency is a frequency of sampling from analog to digital. Changing the tone of a sound as described The method, wherein the sampling frequency is a digital to analog sampling frequency.
第5頁 200412570 五、發明說明(3) 如所述之改變聲音聲調的方法,其中該數位資料係依 序並週而復始地存入該時框。 如所述之改變聲音聲調的方法,其中透過該偏移量5 之改變可以調整聲音聲調之高低,其中5 >0表示輸出聲 音的聲調提高,5 < 0表示聲調下降,5 = 0則聲調不變。 如所述之改變聲音聲調的方法,其中因應該偏移量5 之改變,於每次自該時框取出資料時,令該第一指標p 1 = ,再令該第二指標p2 = pl + (N/2),N為該時框之長 度。Page 5 200412570 V. Description of the invention (3) The method of changing the tone of a sound as described, wherein the digital data is sequentially and repeatedly stored in the time frame. As described in the method for changing the tone of a sound, the pitch of the sound can be adjusted by changing the offset 5, where 5 > 0 means that the tone of the output sound is increased, 5 < 0 means that the tone is decreased, and 5 = 0. The tone does not change. As described in the method for changing the tone of a sound, in response to a change of the offset 5, each time data is fetched from the time frame, the first index p 1 =, and then the second index p 2 = pl + (N / 2), where N is the length of the time frame.
如所述之改變聲音聲調的方法,其中因應該第一指標 p 1 > ( N - 1 )時,令pl=pl-N,N為該時框之長度。 如所述之改變聲音聲調的方法,其中因應該第一指標 p 1 <0時,令pl=pl+N,N為該時框之長度。 如所述之改變聲音聲調的方法,其中因應該第二指標 p2〉(N-1)時,令N為該時框之長度。 如所述之改變聲音聲調的方法,其中因應該第二指標 ρ2 < 0時,令ρ2 = ρ2 + Ν,Ν為該時框之長度。 如所述之改變聲音聲調的方法,其中該權值運算為一 三角窗(Triangular Window)轉換。As described in the method for changing the tone of a sound, in response to the first index p 1 > (N-1), let pl = pl-N, where N is the length of the time frame. As described in the method for changing the tone of a sound, in response to the first index p 1 < 0, let pl = pl + N, where N is the length of the time frame. As described in the method for changing the tone of a sound, in response to the second index p2> (N-1), let N be the length of the time frame. As described in the method for changing the tone of a sound, in response to the second index ρ2 < 0, let ρ2 = ρ2 + Ν, where N is the length of the time frame. The method of changing the tone of a sound as described, wherein the weight operation is a Triangular Window conversion.
〔實施方式〕 請參見第一圖,為本案改變聲音聲調的方法之較佳實 施例之系統方塊圖。如圖所示,吾人設訂類比數位轉換器 11(ADC,Analog to Digital Converter)之取樣頻率為[Embodiment] Please refer to the first figure, which is a system block diagram of a preferred embodiment of the method for changing the tone of a sound. As shown in the figure, I set the sampling frequency of the analog to digital converter 11 (ADC, Analog to Digital Converter) as
第6頁 200412570 五、發明說明(4) SR,則取樣所得之資料,經由聲調改變運算法則1 2處理 後,再透過數位類比轉換器13(DAC,Digital to Analog Converter)以相同取樣頻率SR送出。 第二圖為聲音資料時框(frame),其長度為N,時框内 之資料以 x[0],…,x[N-1 ]表示。第三圖為相對應於第二 圖時框資料之權值(w e i g h t i n g ),為說明方便,本案係以 三角窗(T r i a n g u 1 a r W i n d o w )轉換為例,實際應用上當然 可以採用其他函數如漢明窗(H a m m i n g W i n d 〇 w )或漢尼窗 (Hanning Window)等 。 以下為聲調改變運算法則之詳細步驟: 步驟一、將第二圖時框内之資料 X [ 0 ],…,X [ N- 1 ]預設為 0 〇 步驟二、將第三圖第一指標p 1預設為0。 步驟三、決定偏移量5 ,5 >0表示輸出聲音的聲調提高, 5 < 0表示聲調下降,5 = 0則聲調不變。 步驟四、由第一圖中之ADC取得一新的取樣值,並將該取 樣值送入第二圖之時框内,更新時框内之資料,即: ;Φ]=Φ + 1] for ι = 0Α (N-2) 对γ-1]=新的取樣値 步驟五、計算第一指標p 1及第二指標p 2 :Page 6 200412570 V. Description of the invention (4) SR, then the sampled data is processed by the tone change algorithm 1 2 and then sent through the digital to analog converter 13 (DAC, Digital to Analog Converter) at the same sampling frequency SR . The second picture is the frame of sound data, whose length is N. The data in the frame is represented by x [0], ..., x [N-1]. The third picture is the weighting corresponding to the frame data in the second picture. For the convenience of explanation, this case uses the Triangu 1 ar W indow conversion as an example. Of course, other functions such as Hamming window or Hanning window. The following are the detailed steps of the tone changing algorithm: Step 1. Preset the data in the frame X [0],…, X [N-1] in the second picture to 0. Step 2. Set the first indicator in the third picture p 1 is preset to 0. Step 3: Determine the offset 5, 5 > 0 means the tone of the output sound is increased, 5 < 0 means the tone is decreased, and 5 = 0 the tone is not changed. Step 4. Obtain a new sampling value from the ADC in the first figure, and send the sampling value to the box at the time of the second picture, and update the data in the box at the time of updating, that is: Φ] = Φ + 1] for ι = 0Α (N-2) for γ-1] = new sampling. Step 5. Calculate the first index p 1 and the second index p 2:
200412570 五、發明說明(5) ρί = pi-l· δ if (pl>N) ρ\ = ρ\-Ν if (pi < 0) = + ^ ρ2 = ρ\Λ-^) if (ρ2 > Ν) ρ2 = ρ2 - Ν if {ρ2 < 0) ρ2 = p2-l· Ν 步驟六、進行權值運算,利用權值w [ ρ 1 ]和w [ ρ2 ]計算新的 合成值:200412570 V. Description of the invention (5) ρί = pi-l · δ if (pl > N) ρ \ = ρ \ -N if (pi < 0) = + ^ ρ2 = ρ \ Λ- ^) if (ρ2 & gt Ν) ρ2 = ρ2-Ν if {ρ2 < 0) ρ2 = p2-l · Ν Step 6. Perform a weight calculation, and use the weights w [ρ 1] and w [ρ2] to calculate a new composite value:
新的合成値=4户1] X对户1] + ^[;?2]χ对户2] 上式可簡化成 新的合成値=W|>1] X 4 户 1] + (1 - Η>1]) X ζ|>2] =4^2] + (pipl] - κ[ρ2]) χ ^ip\]The new compound 値 = 4 households 1] X pair of households 1] + ^ [;? 2] χ pair of households 2] The above formula can be simplified into a new compound 値 = W | > 1] X 4 households 1] + (1 -Η > 1]) X ζ | > 2] = 4 ^ 2] + (pipl)-κ [ρ2]) χ ^ ip \]
步驟七、將新的合成值送至DAC,以和ADC相同之取樣頻率 SR撥放。 步驟八、重複步驟四,如此即可達到改變聲調的功能。 上述步驟中,亦可以偏移量5 < 0時表示輸出聲音之聲調提 高,(5 > 0表示聲調下降,而步驟五之第一指標ρ 1計算則變Step 7. Send the new synthesized value to the DAC, and put it at the same sampling frequency SR as the ADC. Step 8. Repeat step 4 to achieve the function of changing the tone. In the above steps, an offset of 5 < 0 indicates that the tone of the output sound is increased, (5 > 0 indicates that the tone is decreased, and the calculation of the first index ρ 1 in step 5 is changed.
第8頁 200412570Page 8 200412570
本案就技術層面而言 性: θ 至少具有下面特徵以及進步 1 ·只使用一個時框暫存語音資料,可節省使用記情 UAM);而第三圖中之權值w[n]’由於對稱 關係 ;時只要w[n], n = 0,.·.,(N/2) —i,此(N/2)個權值關可係先计 异儲存於唯讀記憶體(R〇M)中,再以杳 …十 =…成本⑽之成—本遠Μ, •计#新5成值吩,只需計算第一指標Μ和第二 乂如步驟A ),㈣對整個時框重新 ::r:rr時.(real ―)改變聲音聲調之目的… (·; 一.固(TriangUlar Window)轉換等權值函數 fUnCtl〇n)之特性,合成之語音資料車^;骨順, 不會因資料不連續而產生、上異 ^ L . 、竿月丨頁 理(smoothing)。 爆3 ,因此無須再進行平滑化處 本案所揭露之技術,々日 而其前所未有mi;;热習*技術川康以實施, 請,申請專利範圍如附專利十生,麦依法提出專利之申This case is technical in terms of: θ has at least the following characteristics and improvements1. Only one time frame is used to temporarily store voice data, which can save the use of memorization); and the weight w [n] 'in the third figure is due to symmetry Relationship; as long as w [n], n = 0, ..., (N / 2) —i, these (N / 2) weights can be stored in read-only memory (ROM) ), And then take 杳… ten =… cost ⑽—benyuan M, • Calculate #new 50% value phen, just calculate the first index M and the second (as in step A), and then re-evaluate the entire time frame :: r: rr Hours. (real ―) The purpose of changing the tone of the sound ... (·; I. The characteristics of the weight function fUnCtl〇n) such as the solid (TriangUlar Window) transformation, the synthesized voice data car ^; It will be generated due to discontinuities in data, differences ^ L., Smoothing. Explosion 3, so no further smoothing is required. The technology disclosed in this case was unprecedented the next day; hot learning * technology Chuan Kang implemented, please, apply for the scope of patents such as ten years with patents, and Mai filed a patent application according to law
200412570 圖式簡單說明 本案得藉由下列圖示及詳細說明,俾得一更深入之瞭 解: 第一圖:本案改變聲音聲調的方法之較佳實施例之系 統方塊圖 第二圖:聲音資料時框(frame) 第三圖:相對應於第二圖時框資料之權值 (weighting) 圖不主要兀件之圖號如下·200412570 Schematic illustration of the case The following figure and detailed description can be used to gain a deeper understanding: Figure 1: System block diagram of the preferred embodiment of the method of changing the tone of the sound in the case. Figure 2: When the sound data Frame (third): Corresponding to the weighting of the frame data in the second image, the figure numbers of the main components are as follows:
1 1 :類比/數位轉換器(ADC ) 1 2 :聲調改變運算法則 13 :數位/類比轉換器(DAC) SR:取樣頻率 N :時框之長度 X [ 〇 ],…,X [ N - 1 ]:時框内之資料 p 1 :第一指標 p2 :第二指標 δ :偏移量 w[pl]、 w[p2]:權值(weighting)1 1: Analog / digital converter (ADC) 1 2: Tone change algorithm 13: Digital / analog converter (DAC) SR: Sampling frequency N: Length of time frame X [〇], ..., X [N-1 ]: Data in the time frame p 1: First index p2: Second index δ: Offsets w [pl], w [p2]: Weighting
第10頁Page 10
Claims (1)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW92100730A TW594672B (en) | 2003-01-14 | 2003-01-14 | Method for changing voice tone |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW92100730A TW594672B (en) | 2003-01-14 | 2003-01-14 | Method for changing voice tone |
Publications (2)
Publication Number | Publication Date |
---|---|
TW594672B TW594672B (en) | 2004-06-21 |
TW200412570A true TW200412570A (en) | 2004-07-16 |
Family
ID=34075935
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
TW92100730A TW594672B (en) | 2003-01-14 | 2003-01-14 | Method for changing voice tone |
Country Status (1)
Country | Link |
---|---|
TW (1) | TW594672B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111739544B (en) * | 2019-03-25 | 2023-10-20 | Oppo广东移动通信有限公司 | Voice processing method, device, electronic equipment and storage medium |
-
2003
- 2003-01-14 TW TW92100730A patent/TW594672B/en not_active IP Right Cessation
Also Published As
Publication number | Publication date |
---|---|
TW594672B (en) | 2004-06-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR101370365B1 (en) | A method of and a device for generating 3D sound | |
CN108701465A (en) | Audio signal decoding | |
JP2009522895A (en) | Decoding binaural audio signals | |
KR20130108391A (en) | Method, apparatus and machine-readable storage medium for decomposing a multichannel audio signal | |
WO2011121782A1 (en) | Bandwidth extension device and bandwidth extension method | |
CN111916093B (en) | Audio processing method and device | |
CN105612578B (en) | Method and apparatus for signal processing | |
RU2411595C2 (en) | Improved intelligibility of speech in mobile communication device by control of vibrator operation depending on background noise | |
WO2019107379A1 (en) | Audio synthesizing method, audio synthesizing device, and program | |
CN109671422B (en) | Recording method for obtaining pure voice | |
CN108962277A (en) | Speech signal separation method, apparatus, computer equipment and storage medium | |
CN107112027A (en) | The bi-directional scaling of gain shape circuit | |
JP6821970B2 (en) | Speech synthesizer and speech synthesizer | |
CN113674723A (en) | Audio processing method, computer equipment and readable storage medium | |
WO2004072951A1 (en) | Multiple speech synthesizer using pitch alteration method | |
CN117079623A (en) | Audio noise reduction model training method, singing work processing equipment and medium | |
WO2002058053A1 (en) | Encoding method and decoding method for digital voice data | |
TW200412570A (en) | Method for changing voice tone | |
US7230176B2 (en) | Method and apparatus to modify pitch estimation function in acoustic signal musical note pitch extraction | |
JP5051782B2 (en) | How to combine speech synthesis and spatialization | |
CN104424971B (en) | A kind of audio file play method and device | |
CN112086085B (en) | Audio signal sound processing method, device, electronic equipment and storage medium | |
JPH11338500A (en) | Formant shift compensating sound synthesizer, and operation thereof | |
CN112309410A (en) | Song sound repairing method and device, electronic equipment and storage medium | |
CN112435680A (en) | Audio processing method and device, electronic equipment and computer readable storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
MM4A | Annulment or lapse of patent due to non-payment of fees |