TW559782B

TW559782B - Real-time music composition method

Info

Publication number: TW559782B
Application number: TW90133568A
Authority: TW
Inventors: Jeng-Yuan Lin; Jr-Shing Jang
Original assignee: Cweb Technology Inc
Priority date: 2001-12-31
Filing date: 2001-12-31
Publication date: 2003-11-01

Abstract

The present invention uses the computer to real-time compose the music, which first records some basic word sound, which is the sound database of the system; after entering the score and lyrics for a song by the user, the music synthesizer produced according to the present invention can real-time synthesize the music. The major functions of the synthesizer include: arbitrarily rising or falling the tone, adjusting the volume, and scaling the tone lengths.

Description

559782 A7559782 A7

五、發明說明（2 ) N 耳日日伏變化 L赞明概述j 本發明的歌聲即時合成方法，使用者指定任意-首i #使_ ”4⑽可以言詞，系统人+ * 曲之歌譜鋼广:先曰即日才合成為歌聲(此合成單元為，乾淨的男音或女音）；⑵使用者村藉己: 语音資料檔作為合成單元(此指4n 二自己自 :)’系統便能自動依據使用者所錄製的語;；= 合前述輸人歌曲之歌譜與歌詞，完成即時之歌聲 ::製::基一本的語音資_ ’電腦便可相立個者對线以母—段歌詞的語 :二八百歌輸入完畢，系統即時將此轉成歌聲，此功 U '使用者錄製完整的4I i個國語單音節的狂立。本發明之主要目的’健供—種電腦自動合°成日歌聲的 :利用系統資料庫中的語音資料檔，並配合使 ^歌曲之歌譜與歌詞’進而令電腦將使用者所輸入之^ 與歌詞即時合成為歌聲。 °曰發明之次要目的，係、令使用者能將自己的語音資料才虽或是某一歌星的語音資料檔建立於系統的資料庫令，當，用^輸人歌曲之歌譜與歌詞，就可以透過電腦的運算: 即時°曰出一首擁有個人風味的歌曲，無論是民歌或流行歌曲；亦或藉由本發明令使用者的聲音與某一位歌星的歌聲本紙張尺度顧中^^i^CNSMoiiTilO X 297公fl 559782 A7 B7 五、發明說明（V. Description of the invention (2) N Day-to-day fluctuations L Zanming overview j The instant singing voice synthesis method of the present invention is specified by the user as arbitrary- 首 i # 使 _ ”4⑽ can be verbal, systematic person + * Qu Zhi Ge Pu Gang Guang : The first day is synthesized into a singing voice (this synthesizing unit is a clean male or female voice); ⑵ user village borrows it: the voice data file is used as the synthesizing unit (this refers to 4n): 'The system can automatically According to the words recorded by the user;; = Compose the song sheet and lyrics of the input song, and complete the real-time singing voice :: system :: based on the original voice data _ 'Computer can stand alone to match the line with the mother-paragraph The words of the lyrics: After the input of two or eight hundred songs, the system instantly converts this into a singing voice. This function allows the user to record a complete 4I i mandarin monosyllable. The main purpose of the present invention is' healthy supply — a computer automatic Combined into a Japanese singing voice: Use the voice data file in the system database and cooperate with the ^ song's song score and lyrics' to make the computer synthesize the ^ and lyrics entered by the user into a singing voice in real time. Purpose, to enable users to translate their own voice Although the data or the voice data file of a certain singer is built in the database database of the system, when you use ^ to enter the score and lyrics of the song, you can use the computer to calculate: Really say a song with personal flavor , Whether it is a folk song or a popular song; or the user's voice and the singing voice of a certain singer by the present invention can be adjusted to the paper size ^^ i ^ CNSMoiiTilO X 297 public fl 559782 A7 B7 V. Description of the invention (

I 員工消費社印製共同合唱一首動聽的歌。本發明另_目沾語音與合成歌聲同時用：可藉由本發明使-合成中，進，佳之音之歌詞部份盘庫中’透過電腦的運算將語声史$座田一σ °曰”才s成完成一首由電腦合成的歌二又用於以咖播案為主之卡拉。κ軟體的「歌聲包含為達 =::的，本發明提供一歌聲即時合成之方法二⑽序’係令使用者藉由輸入褒置式·端二=郎，並將輸入的語音儲存成電子資料格立期（.S 係利用錄製聲音樓前後段之無聲或靜曰』（Sllence)來彳貞測出聲音的起迄點的位置，· 二標位程序’藉由前述之端點偵測程序所：聲音基本週期的尋找，並對之後每個聲技起始點標示所在位置；歌聲合成器程序，係二康=述之聲音基本週期粹取與標位程序，得到每一筆語二之，字音的正確聲音基本週期位置與大小，再依照使用ϋ 所逆定的歌譜與歌詞，對歌詞中的每一個，調整音長、音調、音量、抖音、回音、連接音；;：； ^及歌譜輸入程序’當前述之歌聲合成器程序完成後，用者將所輸入歌詞及歌譜，經由電腦的演算後能即時合成出-首歌曲；結果輸出程序’係根據前述之程序所產生的結果立即播放’並將之存成電子資料格式樓，進而令使項頁本紙張尺度適用中國國家標準（CNSM4規格（210 X 297公爱 559782 A7 B7 五、發明說明（部智慧財員工消費社印製用者能透過此歌聲即時合成系統獲得更 [主要元件符號對照說明] 、1作及樂趣 11 語音資料檔的錄製程序 ]2 —_端點彳貞測程序 13-—聲音基本週期粹取與標位程序 14 ---歌聲合成器程序 15 歌詞及歌譜輸入 16 結果輸出程序 3 1 —-音源訊號 32— -聲音基本週期的範圍值 33— -搜尋聲音基本週期的範圍值 34 —-取樣點的位置 3 5 ---局部區段的最大值 36 音高 [發明之詳細說明] 雖然本發明將參閱含有本發明較佳實施例之㈣式予以充份描述，但在此描述之前應瞭解熟悉本行之人士可修改在本文中所描述之發明，同時獲致本發明之功效。因此’須瞭解以下之描述對熟悉本行技藝之人士而言為一廣泛之揭示，且其内容不在於限制本發明。請參閱如圖一所不，係顯示本發明歌聲即時合成系統之架構圖。本發明係-能令使用者藉由輸入歌曲之歌詞及歌譜以獲得即歌聲合成之方法，此方法包含；語音資料播之錄製程序u:係令使用者藉由輸入裝一輸入-語言之單音節，並將輸人的語音儲存成電子資料格請頁訂圖時置 559782I Staff Consumer Printing Co., Ltd. printed a beautiful chorus together. Another aspect of the present invention is the simultaneous use of the voice of the eye and the synthesis of the singing voice: the present invention can be used to synthesize, synthesize, advance, and sing the sound of the song. Only scheng completed a song synthesized by a computer and used it for the karaoke mainly for coffee broadcasts. Κ software "Song contains Wei Da = ::, the present invention provides a method for the instant synthesis of a song." It is to make the user input the setting type · Tuan Er = Lang and store the input voice as electronic data. (.S is to use the silent or silent sound of the front and rear sections of the recording sound to test the truth.) The position of the starting point and ending point of the sound. · The two-position program uses the aforementioned endpoint detection program: the search for the basic cycle of the sound, and the location of each starting point of the sound technique; the singing synthesizer program , Department of Erkang = The basic cycle of sound extraction and labeling procedures, to get the correct basic cycle position and size of the sound of each word, and then according to the score and lyrics reversed by using ϋ, the Each, adjust the length, pitch, Volume, vibrato, echo, connection sound ;;:; ^ and song score input program 'After the aforementioned song synthesizer program is completed, the user can synthesize the entered lyrics and song score through computer calculations-a song ; The result output program 'plays immediately based on the results produced by the aforementioned program' and saves it as an electronic data format building, so that the paper size of the item sheet is adapted to the Chinese national standard (CNSM4 specification (210 X 297 public love 559782 A7) B7 V. Description of the invention (Printed by the Ministry of Intellectual Property Employees' Consumer Corporation, users can obtain more [main component symbol comparison instructions], 1 works and fun 11 voice data file recording procedures through this singing voice real-time synthesis system] 2 —_ endpoint彳 Chast test program 13 --- sound basic period extraction and labeling program 14 --- singing voice synthesizer program 15 lyrics and score input 16 result output program 3 1 --- sound source signal 32 --- range value of sound basic period 33-- -The range value of the basic period of the search sound 34 —- the position of the sampling point 3 5 —- the maximum value of the local section 36 the pitch [detailed description of the invention] Please read the formula containing the preferred embodiment of the present invention for a full description, but before this description, it should be understood that those familiar with the bank can modify the invention described in this article and obtain the effect of the invention at the same time. The description is a broad disclosure for those who are familiar with the skills of the bank, and its content is not intended to limit the present invention. Please refer to Fig. 1, which shows the architecture diagram of the instant singing voice synthesis system of the present invention. The present invention is- A method for enabling users to obtain instant singing voice by inputting lyrics and scores of songs. This method includes: a recording program for voice data broadcasting u: enables users to load a single syllable of input-language by input, and The input voice is stored as an electronic data box.

經濟部智慧財產局員工消費合作社印製 :(點她=利用錄製聲音樓前後段之静曰期（Sllence)來伯測出聲音的起迄點的位置。聲音基本週期粹取與標位程序13:藉由前述之洌程序I2所獲得的聲音檔資料進行聲音美 ’’、、找，並對之後每個聲音基本週期之起始點標；戶二::尋 =合成器程序14••根據前述之聲音基斑 “位私序13，得到每—筆語音之單字音的 ^ =與大:、’再依照使用者所選定的歌譜與歌;基： ::中的個字進行合成，包括調整音長抖音、回音、連接音等特性；里序：及:=入程序15:當前述之歌聲合成器14程後能即時合成出一首歌曲；以及丄由電^的演算結果輸出程序16:根據前述之程序所播放更儲存成電子資料格π 更進一步的成明，請繼續參閱圖一並請來係顯示本發明之某語音端點偵測圖與本發明二；二=二自：關演算法流程圖。本發明能令使用擇某百崎歌曲，而由電腦合成歌聲播放。二取:製程序11、端點侦測程序12、基件耳ϋ立私序13、歌聲合成器程序14 t程序15以及結果輸出程序16等六個不同的;驟= 語音資料㈣_程序u :在使”合成歌聲之前 .ΦΜ-------丨-訂--------- (請先閱讀背面之注音？事項再填寫本頁) . ^/82 ^/82 A7 B7 五、發明說明（6 ^私序’需要建立—聲音檔的資料庫，此資料可利用來、’所=叹的聲音檔或者由使用者錄製自己聲音的程序 A、、f =日杈，錄製系統預設的聲音檔以外的聲音檔時， ::國ί吾早字音的的發音而言，該發音的各種組合約有411 ΡΓΜ二效可為單音或立體音效，儲存電子資料格式可以是、恥”袼式檔或是其他的音效電子檔；及過二二ΐ程ί 此程序所採用的方法是能量值與 ' 。凊繼續參閱圖二並配合參閱圖一所示，此主：方法係利用兩個能量值的臨界值(τ η:::: 值，來斷定一個語音單音節的起迄點； , ％序也迠夠分離出聲母（Consonant)和韻母聲^ 般而言’聲母可以區Μ有聲㈤㈣與無與無韻母則是當轉聲母音，判斷有聲 1·過零率（zero-crossingrate)低的ergy)低時視為··有聲；刀M (Log 視：無】零!：。—一高'聲音分貝低時都視3為聲有;㈣高於*輪值(―)時大邹份 :::聲與無聲的部份的主要原因在於，最後歌聲合、聲日基本仙調整的部份只針對在有聲部份，無故子曰部份是獨立於聲音基本週期調整。 "、、耳基本週期粹取與標位程序13:請參閱如圖不’係_示本發明之某語音聲音基本週期標位圖，並^ 本紙張尺度適i中國國家標準（CNS)A4規格（210 xlW公i"Printed by the Consumer Cooperative of the Intellectual Property Bureau of the Ministry of Economic Affairs: (point her = use the silent period (Sllence) at the front and back of the recorded sound to measure the position of the origin and end of the sound. The basic cycle of sound extraction and labeling procedures 13 : Use the sound file data obtained by the aforementioned 洌 program I2 to perform sound beauty, and find, and mark the starting point of each basic sound cycle thereafter; Huer 2 :: Find = Synthesizer Program 14 •• According The above-mentioned sound base spot "Private Sequence 13", get the ^ = and big :, 'of each single-voice sound of each stroke of speech, and then synthesize according to the song score and song selected by the user; Base: :: individual characters in the include, including Adjust the characteristics of pitch, vibrato, echo, connection sound, etc .; in the sequence: and: = into the program 15: when the aforementioned song synthesizer 14 passes, it can synthesize a song in real time; and 电 the calculation result output program 16: Played according to the aforementioned program and stored as an electronic data cell π. For further brilliance, please continue to refer to FIG. 1. Please come to show a voice endpoint detection diagram of the present invention and the present invention; two = two since : Closed algorithm flow chart. The present invention can make use of A song from Baizaki, which is synthesized by a computer and played back. Two: six: system program 11, endpoint detection program 12, base earphone private sequence 13, song synthesizer program 14 t program 15 and result output program 16 Different; step = voice data ㈣_program u: before synthesizing the singing voice. ΦΜ --------- 丨 -order --------- (Please read the note on the back? Fill in the matter before filling in (This page). ^ / 82 ^ / 82 A7 B7 V. Description of the invention (6 ^ Private sequence 'needs to be established-a database of sound files, this data can be used, the sound file of the sigh = or the user records himself Sound program A,, f = Japanese branch, when recording sound files other than the sound file preset by the system, in terms of the pronunciation of :: 国 ί 吾早字音, the various combinations of the pronunciation are about 411 ΡΓΜ The second effect can be Monophonic or stereo sound effects, the format of the stored electronic data can be, "shame" files or other electronic files of sound effects; and the method used in this procedure is the energy value and '. 凊 Continue to Figure 2 As shown in Figure 1, the main method is to use two critical values (τ η ::::: The beginning and end of a single phonetic syllable;,% sequence is also enough to separate the consonant and final vowel ^ Generally speaking, the initial consonant can distinguish between voiced and non-voweled vowels. · Low zero-crossing rate (low ergy) is considered as sound. · Knife M (Log View: None) Zero !:-When a high 'sound decibel is low, 3 is considered as sound; The main reason for the Zoufen ::: voice and unvoiced part in the * rotation (―) is that the final singing part and the basic adjustment of the sound day are only for the sound part, and the part without reason is independent. Adjust for the basic sound cycle. ", Ear Basic Period Extraction and Marking Procedure 13: Please refer to the figure below for a description of a basic periodical map of speech and sound of the present invention, and this paper is suitable for China National Standard (CNS) A4 Specifications (210 xlW Male i "

---------------------訂----- (請先閱讀背面之注意事項再填寫本頁) -I I I - 559782 A7 五、發明說明（7 ) 合參閱圖三所示。聲音基本週期粹取是採用自相關 (Autocorrelation)演算法；自相關演算法，首先當音源訊號31輸入後，其係呈現如圖四中的弦波訊號，由該弦波訊號可以獲知聲音基本週期的最大值與最小值的範圍 32，故可以利用聲音基本週期的最大值與最小值當作一個搜尋範圍值33來作處理，例如：聲音基本週期的最大值為2〇〇，最小值為100，假如利用最大值法找到某一取樣點的位置34為125〇，則往左右兩邊各別尋找下一個聲音基本週期，其位置應該是介於1350〜U50之間，在這區段找到局部區段的最大值35 (Local maximum)的位置，即，一個聲音基本週期位置；利用前述之自相關演算法一直疊代下去，即能求出如圖四中所示，圓圈的位置即代表聲音基本週期的位置。歌聲合成器程序14·•請參閱如圖五所示，係顯示本發明之語音合成器中聲音基本週期同步重疊累加方法之不意圖。經過了聲音基本週期粹取與標位程序程序的前置處理，电月匈便可以藉由這些資訊來任意調整歌譜中所需要的旋律；在調整音調參數是使用聲音基本週期同步重疊累加（Pitch Synchronous 〇verLap and Add，ps〇LA)技術，這個技術係對於聲音基本週期可以做彈性的調整而且叶异也，較有效率，如圖五中所示，聲音基本週期為原先的 γ倍鬲’則原先聲音基本週期標位之間的距離則縮減為原來的1/2，反之，若為原先的1/2音高大小，則聲音基本週期標位間的距離則是放大2倍，而且每兩個聲音基本週功之間白以一個漢明窗（Hamming window)來重新塑造頁部智慧財產局員工消費 Μ 559782--------------------- Order ----- (Please read the notes on the back before filling out this page) -III-559782 A7 V. Description of the invention ( 7) Refer to Figure 3 together. The basic period of sound is extracted using an autocorrelation algorithm. The autocorrelation algorithm, first, when the audio source signal 31 is input, it presents a sine wave signal as shown in Figure 4. From this sine wave signal, the basic period of the sound can be obtained. The range of the maximum and minimum values of 32 is 32. Therefore, the maximum and minimum values of the basic period of the sound can be used as a search range value 33 for processing. For example, the maximum value of the basic period of the sound is 200, and the minimum value is 100. If the position 34 of a certain sampling point is found to be 125 through the maximum value method, then the left and right sides should be searched for the next basic period of sound. The position should be between 1350 and U50. Find a local area in this section. The position of the segment maximum 35 (Local maximum), that is, the position of a basic period of the sound; using the aforementioned autocorrelation algorithm, iteratively repeated, that is, as shown in Figure 4, the position of the circle represents the basic sound The position of the cycle. Singing synthesizer program 14 · Please refer to Fig. 5, which shows the intention of the method of synchronizing and accumulating the basic period of sound in the speech synthesizer of the present invention. After pre-processing of the basic sound cycle extraction and labeling program, Dian Yue Hung can use this information to arbitrarily adjust the melody required in the song score; in adjusting the pitch parameters, the basic sound cycle is used to synchronize and accumulate (Pitch Synchronous 〇verLap and Add (ps〇LA) technology, this technology can make flexible adjustments to the basic period of sound and leaves different, more efficient, as shown in Figure 5, the basic period of sound is the original γ times 鬲 ' Then the distance between the original sound basic period marks is reduced to 1/2, otherwise, if it is the original 1/2 pitch, the distance between the sound basic period marks is doubled, and every time The basic sound of the two voices is to use a Hamming window to reshape the consumption of the employees of the Bureau of Intellectual Property Bureau. M 559782

經濟部智慧財產局員工消費合作社印製五、發明說明（8 ) 一新音源波形，然後再將此經過漢明窗加成的波形以重疊方式累加起來，形成一個新的波形。漢明窗公式為：Printed by the Consumer Cooperatives of the Intellectual Property Bureau of the Ministry of Economic Affairs. 5. Description of the invention (8) A new sound source waveform, and then the waveform added by the Hamming window is added up in an overlapping manner to form a new waveform. The Hamming window formula is:

W㈣= m〇s[j^，Q<m<N 巧參閱如圖六所示，係顯示本發明之調整聲音長度示思圖。對於聲音長度的調整則是採用線性映射 mapping)法；每個合成單元都有其各自的聲音長度，而欲。f的樂谱對於每一個聲音都會有節拍的資訊，根據節拍多养可以計算出秒數，其中一個節拍的長度計算公式為：一個節拍的秒數（t) x取樣頻率（f)= 一個節拍的長度以一般速度而言，一拍（ΐ)大約是0.66秒，若是取樣頻率（Sample rate)設定成2〇ΚΗζ⑴，那麼一拍的長2大約是0·66 X 20〇〇〇= 132〇點數，當歌譜輸入後再與貝料庫中的合成單讀比較，便可得知聲音長度需要伸長或收縮’當前述之聲音長度決錢，再套用線性映射的原則去調整即能得到最後的聲音長度。 “ 了 θ而一曰長的改變外，有時候也需適時地改變曰里大小，一般而言，流行歌曲的前奏通常是音量較小的，副歌部份的音量會較大，且須控制整個音量的變化程度’不能有忽大忽小的情形出現。其測量音量大小的單位為分貝（dB)，其公式為：、 l〇*l〇gl〇(l^^(/)) 為了使合成歌聲的品質與真又歌聲相近，本發明採用 -------------·11111.11 ^---I I--ΙΛ (請先閱讀背面之注咅心事項再填寫本頁)W㈣ = m〇s [j ^, Q < m < N Refer to FIG. 6 for a schematic diagram of adjusting the sound length of the present invention. For the adjustment of the sound length, a linear mapping method is used; each synthesis unit has its own sound length, which is desirable. The score of f will have beat information for each sound. The number of seconds can be calculated according to the multi-beat support. The length of one beat is calculated as follows: the number of seconds of a beat (t) x the sampling frequency (f) = one beat In terms of general speed, one beat (ΐ) is about 0.66 seconds. If the sampling rate is set to 2〇ΚΗζ⑴, then the length 2 of a beat is about 0.66 X 200. 00 = 132. The number of points, when the song score is input, and compared with the synthetic single reading in the shell database, you can know that the sound length needs to be expanded or contracted. When the aforementioned sound length determines the money, and then apply the principle of linear mapping to adjust to get the final Sound length. "In addition to the long change of θ, sometimes it is necessary to change the size in time. Generally speaking, the intro of popular songs is usually low, and the volume of the chorus part is higher, and it must be controlled. The degree of change in the overall volume cannot be changed suddenly. The unit for measuring the volume is decibel (dB), and its formula is:, l〇 * l〇gl〇 (l ^^ (/)) The quality of the synthesized singing voice is similar to that of the real singing voice. The present invention uses ------------- 11111.11 ^ --- I I--ΙΛ (Please read the note on the back before filling in this note page)

559782 A7 B7559782 A7 B7

經濟部智慧財產局員工消費合作社印製Printed by the Consumer Cooperative of the Intellectual Property Bureau of the Ministry of Economic Affairs

…、枓晋以及回音等技巧釋。連接音的好處在於避免合成更-真的s 象，在本發明的實作系統上，所採自， (Cross一Fading)，而需要重；：乂又宜加法應該以每個音的特質而定;但是大小，原則上可以當做連接音的，有4b單字立的子日部伤都八# 曰旧子音部份為重要特徽部伤，右以子音部份做連接音，則會破壞它所如大部份的無聲子音，如以勹，女，:、、曰貝，例丄斤 Θ，虫，p，六寻’以此開頭的單字音是不f要連接音的。凊參考如圖七所示，係顯示本發明之結果輸出輸出圖。實力派歌手在詮釋—首歌時，通常會在唱高音或低音部份會有聲音顫抖的現象，這就是抖音的 =系統資料庫也設計了此項功能，當歌譜輸人系統㈣庫後’請參閱如圖七中制指標所指示的部份，該能將歌譜中的高低音部份描繪出來，當歌曲進行到高低音時，线資料庫即能主動按照此時該音節的旋律^ 一抖音的現象，使電腦所合成的歌聲能更為人性化；至於回音（Ech。）部份，則是採用有限脈衝響應遽波器（―仏 I_Se Response Filter ’ _ ’ 其脈衝響應（Impuise Response )為有限的區間，也就是只在有限的時域有非灾解，經由有限脈衝響應濾波器將波形整形後，使其合成二果如同KTV唱歌一般。一。> 歌詞及歌譜輸入程序15:當前述之程序完成後，使用者輸入歌詞及歌譜，經由電腦的演算後即能合成出一首歌曲，而此輸入歌詞的程序，可以是文字檔，之後再由預…, Explanations of Jin and echo. The advantage of connecting sounds is to avoid synthesizing more real s-images. In the implementation system of the present invention, it is taken from (Cross-Fading), and needs to be emphasized; 乂 and the addition should be based on the characteristics of each sound But the size, in principle, can be used as a connecting sound, there is a 4b single character Ziribe injury wound eight # said the old consonant part is an important special emblem injury, the right consonant part as the connection sound, it will destroy it For example, most of the silent consonants, such as 勹, female,: ,,, and shell, such as 丄, Θ, worm, p, Liu Xun, are not consonant.凊 Refer to Fig. 7, which shows the result output diagram of the present invention. The interpretation of the talented singers—the song usually trembles in the treble or bass, this is the vibrato = system database also designed this function, when the song score is entered into the system ㈣ library 'Please refer to the part indicated by the system index in Figure 7. This can depict the high and low parts of the song score. When the song reaches the high and low, the line database can actively follow the melody of the syllable at this time ^ The phenomenon of a vibrato makes the computer-synthesized singing sound more humane. As for the echo (Ech.) Part, a finite impulse response chirper (― 仏 I_Se Response Filter '_' Response) is a limited interval, that is, there is a non-disaster solution only in a limited time domain. After the waveform is shaped by a finite impulse response filter, it is synthesized into two results like KTV singing. 1.> Lyrics and song score input program 15: After the foregoing procedure is completed, the user enters lyrics and song scores, and a song can be synthesized through computer calculations. The lyrics input procedure can be a text file, and then Advance

本紙張尺度適用中國國家標準（CNS)A4規格（210 X 297公爱）This paper size applies to China National Standard (CNS) A4 (210 X 297 public love)

請先間讀背面之注意事項再填寫本頁) ：--訂---------Please read the precautions on the back before filling out this page): --Order ---------

5597¾ 五 A7 發明說明（1。）元另合:也歌聲:也可以是使用者自行錄製的⑴ 二另外也可讓使用者針對不同的歌曲，錚製每，利用前述之程序即時合成歌聲。職母 .、秀、輪出程序16:由前述之程序處理後，電腦及-透過其輸出裳置將結果送至^ “及月匕 _播或其他的音效擋。]#放出來，同時也儲存成本备月另一實施例，利用前述之声男女使用者的聲音錄製於電腦資料庫中' 方法將唱内容，將殳依照使用者所設定的演曰円令扣男女的耸音合成即能完成曲’若歌曲有大合唱的部，對曰的歌方法完成。本發明除了上；：：用“之歌聲即時合成 x物、了上迷之實施例外，有時候m 早中難免會有語音及歌聲的發聲方、在1文字使電腦利用語音的方式發聲，或i在文字部=力輸入文譜，此時電腦及能合成出歌聲；當使歌二實施本發明時’電腦就能利用其系統資料庫：：篇= 表現的更為人性化。扁又早人士較佳實施例之後，熟悉該項技術疋的瞭解，結不脫離下述冑請下:進行各種變化與改變，而且本發” = ί =施例的實施方式，其中聲音基本週_取方法可利; 均大小差異函數（八州响Μ咖tUde Difference5597¾ Five A7 invention description (1.) Yuan Yuanhe: also singing voice: it can also be recorded by the user. In addition, it can also allow users to control each song for different songs and use the aforementioned program to synthesize singing voices in real time. Worker, mother, show, turn-out program 16: After being processed by the aforementioned program, the computer sends the result to ^ "and moon dagger_broadcast or other sound effects block." # The storage cost is prepared in another embodiment. Using the aforementioned voices of male and female users to record in the computer database, the method will sing the content and synthesize the deduction of male and female shouts in accordance with the user-defined performance command. Completion song 'If the song has a chorus part, the song method is completed. In addition to the above, the present invention uses the "song voice to synthesize x objects in real time, except for the implementation of the fan, sometimes m will inevitably have voice and Singing the sound, make the computer use the voice in 1 text, or i input the score in the text part, and then the computer can synthesize the singing voice; when making the second song, the computer can use it System database :: articles = more humane performance. After Bian and early people are familiar with the preferred embodiment, they will be familiar with the technology and understand the following. Please make the following changes: make various changes and changes, and the present "" = ί = implementation of the embodiment, in which the sound is basically _Take method can benefit; mean size difference function (八州响Ｍ Coffee tUde Difference

Function，麵）’或是利用非時域的方法，例如：諸來求取基週大小。前述之歌聲合成器程序，、整聲音基本週期的方法可以為：再取樣法等在時域Function) or using non-time domain methods, such as: to find the size of the base cycle. The aforementioned song synthesizer program, and the method of adjusting the basic period of the sound may be: the resampling method in the time domain

本紙張尺度適用中國國家標準（CNS)A4規格（210x 297公爱T ------.丨丨訂·--------- /請先閲讀背面之注意事項再填寫本頁> 經濟部智慧財產局員工消費合作社印製 559782 A7 B7 五、發明說明（ D〇main )上的調整方法；以及剩餘誤差訊號之聲音基本週期同步重豐累加（Residual signal with ps〇LA)和弦波取樣（SinUS〇idal)等屬於在頻域（Frequency D⑽ain) 的調整方法。 [發明功效] 根據本發明所實施之歌聲即時合成系統與方法，讓原本僅能發出語音的聲音合成系統進—步達到歌聲的合成，當使用者利用本發明歌聲即時合成系統*方法，口要紗道歌㈣歌譜將其輸人電腦即能利㈣腦運算ς成口曰出一首動人的歌曲，亦或是由使苴士留〜立, 忧用考錄製一自己本身的基本早子0，透過輸人㈣與歌譜即唱出-首屬於自己歌聲的歌曲。 -------—訂-------- (請先閱讀背面之注意事項再填寫本頁) 經濟部智慧財產局員工消費合作社印製This paper size applies to China National Standard (CNS) A4 specification (210x 297 public love T ------. 丨丨 Order ----------- / Please read the precautions on the back before filling this page > Printed by the Employees' Cooperative of the Intellectual Property Bureau of the Ministry of Economic Affairs 559782 A7 B7 V. Adjustment method on the invention description (Domain); and the basic period of the residual error signal is synchronized and re-accumulated (Residual signal with ps〇LA) chords Wave sampling (SinUSOidal) and the like belong to the adjustment method in the frequency domain (Frequency D⑽ain). [Invention effect] According to the instant singing voice synthesis system and method implemented according to the present invention, the original voice synthesis system can only advance To achieve the synthesis of singing voice, when the user uses the instant singing voice synthesis system * method of the present invention, it is necessary to enter the song into the computer and then enter the computer into the computer, which can be used for brain calculations, or a moving song, or By making the princes stay, Li You, you use the test to record a basic early child of 0, and sing through the input of the cricket and score-a song that belongs to your own singing. --------- Order --- ----- (Please read the notes on the back before filling This page) Ministry of Economic Affairs Intellectual Property Office employees consumer cooperatives printed

559782 A7 五、發明說明（)559782 A7 V. Description of Invention ()

[圖式之簡單說明J =-為本發明歌聲即時合成系統圖一為本發明之某語音端點偵剛圖:冓圖。、“圖三為本發明之聲音基本週期粹V之白流程圖。取之自相闕演算法 Γ為本發明之某語音聲音基本—圖。 β圖五為本發明之語音合成器中聲音基本调j 項疊累加方法之示意圖。土週功同步重圖六為本發明之調整音長示意圖。圖七為本發明之結果輸出程序之輸出圖。訂經濟部智慧財產局員工消費合作社印製本紙張尺度適用中國國家標準（CNS)A4規格（210 X 297公爱）[Simplified description of the diagram J =-This is the instant voice synthesis system of the present invention. Figure 1 is a voice endpoint detection diagram of the present invention: a diagram. "Figure 3 is the white flow chart of the basic period of the sound of the present invention. V is taken from the phase algorithm. Γ is a basic figure of a speech sound of the present invention. Β Figure 5 is the basic sound of the speech synthesizer of the present invention. Schematic diagram of the method of stacking and accumulating j-terms. Figure 6 is the schematic diagram of adjusting the sound length of the present invention. Figure 7 is the output diagram of the result output procedure of the present invention. Paper size applies to China National Standard (CNS) A4 (210 X 297 public love)

Claims

559782, patent application scope of Jibei Smart Employees Consumer Press 1. A method for real-time synthesis of singing voices, including: type; early ... and save the input voice as the end point of the electronic data grid_programs ·· 系彻录声㈣后 # Measure the position of the starting and ending points of sounds during the sound period;…, ear or basic period of sound extraction and labeling procedures: The sound m obtained by the measurement procedure is m; Where the starting point is marked = find, the AHE program: it is based on the basic cycle of sound mentioned above & bit private order 'Paid to the mother-pen voice of the single-word sound as per-word = The "song score and lyrics" of the lyrics f song score input program: when the aforementioned song synthesizer program is completed = the user synthesizes the lyrics and song scores entered over time, and puts it on the back. The result output program: The results produced according to the aforementioned procedure are immediately 2. As the instant singing voice input device described in the item of the scope of the patent application can be a microphone or a keyboard. 彳 32 二; ^: 1 method of instant singing voice synthesis History ... It can be said that the input can be a variety of Mandarin monophone characters, and the input characters, Wave format files, or other electronic data formats can be entered. 4. The singing voice as described in the item in the scope of the patent application is in real time. T-shirts are averaged by using a different function or a field method to set the line. The size of the paper rule is * Guo ^ Voting standard ⑵ Specifications 〇〇 Norwegian public hair 559782

6. The scope of patent application 5 Gan Rushen. Please sing the song mentioned in item 1 of the scope of the patent in real time. The mid-point detection procedure can be used to detect the position of the starting point. The method is to determine whether the detection is wrong. Manual inspection to 6 · If the scope of patent application! The method of real-time singing voice described in the above item is a method in which the endpoint detection program can separate the initials or finals. 7. In the song real-time device program described in item i of the patent application scope, the composition of the lyrics and text is called tone, volume, Features such as vibrato, echo, or connection. i said long, 8 · such as the scope of patent application! The real-time and second-song of the song mentioned in the item: Synthesizer program, which mainly adjusts the square of the basic period of the sound. The second is: the resampling method, the sound of the residual error signal is basically heavy :: addition or sine wave sampling and other methods. Less heavy and rich 9. The singing voice as described in the scope of the patent application, which is the sound output file of the result output program, can be -or: he :: 10 · The singing voice as described in the scope of the patent application, i. The output program can be an electronic data file. Method, 11 · The singing voice described in item 1 of the scope of Chinese Patent No. 4 captures the same day. In the lyrics and score input program, the lyrics input part can be input by voice.疋 Youyu or I2 · — A method for real-time synthesis of voice and singing voice, including ... A voice depends on _ Procedure: The person who borrows the costumes = a single syllable of clean speech, and stores the input voice as electronic Information: Endpoint system program ... Using the silence of the front and rear sections of the sound recording floor (Please read the precautions on the back before filling this page) --------------

Printed by the Consumer Cooperative of the Intellectual Property Bureau of the Ministry of Economic Affairs

Paper size_CNS Standard A4 (297)