TW559782B - Real-time music composition method - Google Patents

Real-time music composition method Download PDF

Info

Publication number
TW559782B
TW559782B TW90133568A TW90133568A TW559782B TW 559782 B TW559782 B TW 559782B TW 90133568 A TW90133568 A TW 90133568A TW 90133568 A TW90133568 A TW 90133568A TW 559782 B TW559782 B TW 559782B
Authority
TW
Taiwan
Prior art keywords
sound
song
program
voice
scope
Prior art date
Application number
TW90133568A
Other languages
Chinese (zh)
Inventor
Jeng-Yuan Lin
Jr-Shing Jang
Original Assignee
Cweb Technology Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Cweb Technology Inc filed Critical Cweb Technology Inc
Priority to TW90133568A priority Critical patent/TW559782B/en
Application granted granted Critical
Publication of TW559782B publication Critical patent/TW559782B/en

Links

Landscapes

  • Auxiliary Devices For Music (AREA)

Abstract

The present invention uses the computer to real-time compose the music, which first records some basic word sound, which is the sound database of the system; after entering the score and lyrics for a song by the user, the music synthesizer produced according to the present invention can real-time synthesize the music. The major functions of the synthesizer include: arbitrarily rising or falling the tone, adjusting the volume, and scaling the tone lengths.

Description

559782 A7559782 A7

五 、發明說明(2 ) N 耳日日伏變化 L赞明概述j 本發明的歌聲即時合成方法, 使用者指定任意-首i #使_ ”4⑽可以言 詞,系统人+ * 曲之歌譜鋼 广:先曰即日才合成為歌聲(此合成單元為 ,乾淨的男音或女音);⑵使用者村藉 己: 语音資料檔作為合成單元(此指4n 二自己自 :)’系統便能自動依據使用者所錄製的語;;= 合前述輸人歌曲之歌譜與歌詞,完成即時之歌聲 ::製::基一本的語音資_ ’電腦便可相 立 個者對线以母—段歌詞的語 :二八百歌輸入完畢,系統即時將此轉成歌聲,此功 U '使用者錄製完整的4I i個國語單音節的狂立。 本發明之主要目的’健供—種電腦自動合°成日歌聲的 :利用系統資料庫中的語音資料檔,並配合使 ^歌曲之歌譜與歌詞’進而令電腦將使用者所輸入之^ 與歌詞即時合成為歌聲。 °曰 發明之次要目的,係、令使用者能將自己的語音資料 才虽或是某一歌星的語音資料檔建立於系統的資料庫令,當 ,用^輸人歌曲之歌譜與歌詞,就可以透過電腦的運算: 即時°曰出一首擁有個人風味的歌曲,無論是民歌或流行歌 曲;亦或藉由本發明令使用者的聲音與某一位歌星的歌聲 本紙張尺度顧中^^i^CNSMoiiTilO X 297公fl 559782 A7 B7 五、發明說明(V. Description of the invention (2) N Day-to-day fluctuations L Zanming overview j The instant singing voice synthesis method of the present invention is specified by the user as arbitrary- 首 i # 使 _ ”4⑽ can be verbal, systematic person + * Qu Zhi Ge Pu Gang Guang : The first day is synthesized into a singing voice (this synthesizing unit is a clean male or female voice); ⑵ user village borrows it: the voice data file is used as the synthesizing unit (this refers to 4n): 'The system can automatically According to the words recorded by the user;; = Compose the song sheet and lyrics of the input song, and complete the real-time singing voice :: system :: based on the original voice data _ 'Computer can stand alone to match the line with the mother-paragraph The words of the lyrics: After the input of two or eight hundred songs, the system instantly converts this into a singing voice. This function allows the user to record a complete 4I i mandarin monosyllable. The main purpose of the present invention is' healthy supply — a computer automatic Combined into a Japanese singing voice: Use the voice data file in the system database and cooperate with the ^ song's song score and lyrics' to make the computer synthesize the ^ and lyrics entered by the user into a singing voice in real time. Purpose, to enable users to translate their own voice Although the data or the voice data file of a certain singer is built in the database database of the system, when you use ^ to enter the score and lyrics of the song, you can use the computer to calculate: Really say a song with personal flavor , Whether it is a folk song or a popular song; or the user's voice and the singing voice of a certain singer by the present invention can be adjusted to the paper size ^^ i ^ CNSMoiiTilO X 297 public fl 559782 A7 B7 V. Description of the invention (

I 員 工 消 費 社 印 製 共同合唱一首動聽的歌。 本發明另_目沾 語音與合成歌聲同時用:可藉由本發明使-合成中,進,佳之 音之歌詞部份盘庫中’透過電腦的運算將語 声史$座田一σ °曰”才s成完成一首由電腦合成的歌 二又用於以咖播案為主之卡拉。κ軟體的「歌聲 包含為達 =::的,本發明提供一歌聲即時合成之方法 二⑽序’係令使用者藉由輸入褒置 式·端二=郎,並將輸入的語音儲存成電子資料格 立期(.S 係利用錄製聲音樓前後段之無聲或靜 曰』(Sllence)來彳貞測出聲音的起迄點的位置,· 二標位程序’藉由前述之端點偵測程序所: 聲音基本週期的尋找,並對之後每個聲 技起始點標示所在位置;歌聲合成器程序,係 二康=述之聲音基本週期粹取與標位程序,得到每一筆語 二之,字音的正確聲音基本週期位置與大小,再依照使用ϋ 所逆定的歌譜與歌詞,對歌詞中的每一個 ,調整音長、音調、音量、抖音、回音、連接音;;:; ^及歌譜輸入程序’當前述之歌聲合成器程序完成後, 用者將所輸入歌詞及歌譜,經由電腦的演算後能即時合 成出-首歌曲;結果輸出程序’係根據前述之程序所產生 的結果立即播放’並將之存成電子資料格式樓,進而令使 項 頁 本紙張尺度適用中國國家標準(CNSM4規格(210 X 297公爱 559782 A7 B7 五、發明說明( 部 智 慧 財 員 工 消 費 社 印 製 用者能透過此歌聲即時合成系統獲得更 [主要元件符號對照說明] 、1作及樂趣 11 語音資料檔的錄製程序 ]2 —_端點彳貞測程序 13-—聲音基本週期粹取與標位程序 14 ---歌聲合成器程序 15 歌詞及歌譜輸入 16 結果輸出程序 3 1 —-音源訊號 32— -聲音基本週期的範圍值 33— -搜尋聲音基本週期的範圍值 34 —-取樣點的位置 3 5 ---局部區段的最大值 36 音高 [發明之詳細說明] 雖然本發明將參閱含有本發明較佳實施例之㈣ 式予以充份描述,但在此描述之前應瞭解熟悉本行之人士 可修改在本文中所描述之發明,同時獲致本發明之功效。 因此’須瞭解以下之描述對熟悉本行技藝之人士而言為一 廣泛之揭示,且其内容不在於限制本發明。請參閱如圖一 所不,係顯示本發明歌聲即時合成系統之架構圖。本發明 係-能令使用者藉由輸入歌曲之歌詞及歌譜以獲得即 歌聲合成之方法,此方法包含; 語音資料播之錄製程序u:係令使用者藉由輸入裝一 輸入-語言之單音節,並將輸人的語音儲存成電子資料格 請 頁 訂 圖 時 置 559782I Staff Consumer Printing Co., Ltd. printed a beautiful chorus together. Another aspect of the present invention is the simultaneous use of the voice of the eye and the synthesis of the singing voice: the present invention can be used to synthesize, synthesize, advance, and sing the sound of the song. Only scheng completed a song synthesized by a computer and used it for the karaoke mainly for coffee broadcasts. Κ software "Song contains Wei Da = ::, the present invention provides a method for the instant synthesis of a song." It is to make the user input the setting type · Tuan Er = Lang and store the input voice as electronic data. (.S is to use the silent or silent sound of the front and rear sections of the recording sound to test the truth.) The position of the starting point and ending point of the sound. · The two-position program uses the aforementioned endpoint detection program: the search for the basic cycle of the sound, and the location of each starting point of the sound technique; the singing synthesizer program , Department of Erkang = The basic cycle of sound extraction and labeling procedures, to get the correct basic cycle position and size of the sound of each word, and then according to the score and lyrics reversed by using ϋ, the Each, adjust the length, pitch, Volume, vibrato, echo, connection sound ;;:; ^ and song score input program 'After the aforementioned song synthesizer program is completed, the user can synthesize the entered lyrics and song score through computer calculations-a song ; The result output program 'plays immediately based on the results produced by the aforementioned program' and saves it as an electronic data format building, so that the paper size of the item sheet is adapted to the Chinese national standard (CNSM4 specification (210 X 297 public love 559782 A7) B7 V. Description of the invention (Printed by the Ministry of Intellectual Property Employees' Consumer Corporation, users can obtain more [main component symbol comparison instructions], 1 works and fun 11 voice data file recording procedures through this singing voice real-time synthesis system] 2 —_ endpoint彳 Chast test program 13 --- sound basic period extraction and labeling program 14 --- singing voice synthesizer program 15 lyrics and score input 16 result output program 3 1 --- sound source signal 32 --- range value of sound basic period 33-- -The range value of the basic period of the search sound 34 —- the position of the sampling point 3 5 —- the maximum value of the local section 36 the pitch [detailed description of the invention] Please read the formula containing the preferred embodiment of the present invention for a full description, but before this description, it should be understood that those familiar with the bank can modify the invention described in this article and obtain the effect of the invention at the same time. The description is a broad disclosure for those who are familiar with the skills of the bank, and its content is not intended to limit the present invention. Please refer to Fig. 1, which shows the architecture diagram of the instant singing voice synthesis system of the present invention. The present invention is- A method for enabling users to obtain instant singing voice by inputting lyrics and scores of songs. This method includes: a recording program for voice data broadcasting u: enables users to load a single syllable of input-language by input, and The input voice is stored as an electronic data box.

經濟部智慧財產局員工消費合作社印製 :(點她=利用錄製聲音樓前後段之 静曰期(Sllence)來伯測出聲音的起迄點的位置。 聲音基本週期粹取與標位程序13:藉由前述之 洌程序I2所獲得的聲音檔資料進行聲音美 ’’、、 找,並對之後每個聲音基本週期之起始點標;戶二::尋 =合成器程序14••根據前述之聲音基 斑 “位私序13,得到每—筆語音之單字音的 ^ =與大:、’再依照使用者所選定的歌譜與歌;基: ::中的個字進行合成,包括調整音長 抖音、回音、連接音等特性; 里 序:及:=入程序15:當前述之歌聲合成器14程 後能即時合成出一首歌曲;以及 丄由電^的演算 結果輸出程序16:根據前述之程序所 播放更儲存成電子資料格π 更進一步的成明,請繼續參閱圖一並請來 係顯示本發明之某語音端點偵測圖與本發明二; 二=二自:關演算法流程圖。本發明能令使用 擇某百崎歌曲,而由電腦合成歌聲播放。 二取:製程序11、端點侦測程序12、基 件耳ϋ立私序13、歌聲合成器程序14 t程序15以及結果輸出程序16等六個不同的;驟= 語音資料㈣_程序u :在使”合成歌聲之前 .ΦΜ-------丨-訂--------- (請先閱讀背面之注音?事項再填寫本頁) . ^/82 ^/82 A7 B7 五、發明說明(6 ^私序’需要建立—聲音檔的資料庫,此資料可利用 來、’所=叹的聲音檔或者由使用者錄製自己聲音的程序 A、 、f =日杈,錄製系統預設的聲音檔以外的聲音檔時, ::國ί吾早字音的的發音而言,該發音的各種組合約有411 ΡΓΜ二效可為單音或立體音效,儲存電子資料格式可以是 、恥”袼式檔或是其他的音效電子檔;及 過二二ΐ程ί 此程序所採用的方法是能量值與 ' 。凊繼續參閱圖二並配合參閱圖一所示,此 主:方法係利用兩個能量值的臨界值(τ η:::: 值,來斷定一個語音單音節的起迄點; , %序也迠夠分離出聲母(Consonant)和韻母 聲^ 般而言’聲母可以區Μ有聲㈤㈣與無 與無韻母則是當轉聲母音,判斷有聲 1·過零率(zero-crossingrate)低 的ergy)低時視為··有聲; 刀M (Log 視:無】零!:。—一高'聲音分貝低時 都視3為聲有;㈣高於*輪值(―)時大邹份 :::聲與無聲的部份的主要原因在於,最後歌聲合 、聲日基本仙調整的部份只針對在有聲部份,無故 子曰部份是獨立於聲音基本週期調整。 "、、耳 基本週期粹取與標位程序13:請參閱如圖 不’係_示本發明之某語音聲音基本週期標位圖,並^ 本紙張尺度適i中國國家標準(CNS)A4規格(210 xlW公i"Printed by the Consumer Cooperative of the Intellectual Property Bureau of the Ministry of Economic Affairs: (point her = use the silent period (Sllence) at the front and back of the recorded sound to measure the position of the origin and end of the sound. The basic cycle of sound extraction and labeling procedures 13 : Use the sound file data obtained by the aforementioned 洌 program I2 to perform sound beauty, and find, and mark the starting point of each basic sound cycle thereafter; Huer 2 :: Find = Synthesizer Program 14 •• According The above-mentioned sound base spot "Private Sequence 13", get the ^ = and big :, 'of each single-voice sound of each stroke of speech, and then synthesize according to the song score and song selected by the user; Base: :: individual characters in the include, including Adjust the characteristics of pitch, vibrato, echo, connection sound, etc .; in the sequence: and: = into the program 15: when the aforementioned song synthesizer 14 passes, it can synthesize a song in real time; and 电 the calculation result output program 16: Played according to the aforementioned program and stored as an electronic data cell π. For further brilliance, please continue to refer to FIG. 1. Please come to show a voice endpoint detection diagram of the present invention and the present invention; two = two since : Closed algorithm flow chart. The present invention can make use of A song from Baizaki, which is synthesized by a computer and played back. Two: six: system program 11, endpoint detection program 12, base earphone private sequence 13, song synthesizer program 14 t program 15 and result output program 16 Different; step = voice data ㈣_program u: before synthesizing the singing voice. ΦΜ --------- 丨 -order --------- (Please read the note on the back? Fill in the matter before filling in (This page). ^ / 82 ^ / 82 A7 B7 V. Description of the invention (6 ^ Private sequence 'needs to be established-a database of sound files, this data can be used, the sound file of the sigh = or the user records himself Sound program A,, f = Japanese branch, when recording sound files other than the sound file preset by the system, in terms of the pronunciation of :: 国 ί 吾 早 字音, the various combinations of the pronunciation are about 411 ΡΓΜ The second effect can be Monophonic or stereo sound effects, the format of the stored electronic data can be, "shame" files or other electronic files of sound effects; and the method used in this procedure is the energy value and '. 凊 Continue to Figure 2 As shown in Figure 1, the main method is to use two critical values (τ η ::::: The beginning and end of a single phonetic syllable;,% sequence is also enough to separate the consonant and final vowel ^ Generally speaking, the initial consonant can distinguish between voiced and non-voweled vowels. · Low zero-crossing rate (low ergy) is considered as sound. · Knife M (Log View: None) Zero !:-When a high 'sound decibel is low, 3 is considered as sound; The main reason for the Zoufen ::: voice and unvoiced part in the * rotation (―) is that the final singing part and the basic adjustment of the sound day are only for the sound part, and the part without reason is independent. Adjust for the basic sound cycle. ", Ear Basic Period Extraction and Marking Procedure 13: Please refer to the figure below for a description of a basic periodical map of speech and sound of the present invention, and this paper is suitable for China National Standard (CNS) A4 Specifications (210 xlW Male i "

---------------------訂----- (請先閱讀背面之注意事項再填寫本頁) -I I I - 559782 A7 五、發明說明(7 ) 合參閱圖三所示。聲音基本週期粹取是採用自相關 (Autocorrelation)演算法;自相關演算法,首先當音源 訊號31輸入後,其係呈現如圖四中的弦波訊號,由該弦 波訊號可以獲知聲音基本週期的最大值與最小值的範圍 32,故可以利用聲音基本週期的最大值與最小值當作一個 搜尋範圍值33來作處理,例如:聲音基本週期的最大值 為2〇〇,最小值為100,假如利用最大值法找到某一取樣 點的位置34為125〇,則往左右兩邊各別尋找下一個聲音 基本週期,其位置應該是介於1350〜U50之間,在這區段 找到局部區段的最大值35 (Local maximum)的位置,即 ,一個聲音基本週期位置;利用前述之自相關演算法一直 疊代下去,即能求出如圖四中所示,圓圈的位置即代表聲 音基本週期的位置。 歌聲合成器程序14·•請參閱如圖五所示,係顯示本 發明之語音合成器中聲音基本週期同步重疊累加方法之 不意圖。經過了聲音基本週期粹取與標位程序程序的前置 處理,电月匈便可以藉由這些資訊來任意調整歌譜中所需要 的旋律;在調整音調參數是使用聲音基本週期同步重疊累 加(Pitch Synchronous 〇verLap and Add,ps〇LA)技術, 這個技術係對於聲音基本週期可以做彈性的調整而且叶 异也,較有效率,如圖五中所示,聲音基本週期為原先的 γ倍鬲’則原先聲音基本週期標位之間的距離則縮減為原 來的1/2,反之,若為原先的1/2音高大小,則聲音基本 週期標位間的距離則是放大2倍,而且每兩個聲音基本週 功之間白以一個漢明窗(Hamming window)來重新塑造 頁 部 智 慧 財 產 局 員 工 消 費 Μ 559782--------------------- Order ----- (Please read the notes on the back before filling out this page) -III-559782 A7 V. Description of the invention ( 7) Refer to Figure 3 together. The basic period of sound is extracted using an autocorrelation algorithm. The autocorrelation algorithm, first, when the audio source signal 31 is input, it presents a sine wave signal as shown in Figure 4. From this sine wave signal, the basic period of the sound can be obtained. The range of the maximum and minimum values of 32 is 32. Therefore, the maximum and minimum values of the basic period of the sound can be used as a search range value 33 for processing. For example, the maximum value of the basic period of the sound is 200, and the minimum value is 100. If the position 34 of a certain sampling point is found to be 125 through the maximum value method, then the left and right sides should be searched for the next basic period of sound. The position should be between 1350 and U50. Find a local area in this section. The position of the segment maximum 35 (Local maximum), that is, the position of a basic period of the sound; using the aforementioned autocorrelation algorithm, iteratively repeated, that is, as shown in Figure 4, the position of the circle represents the basic sound The position of the cycle. Singing synthesizer program 14 · Please refer to Fig. 5, which shows the intention of the method of synchronizing and accumulating the basic period of sound in the speech synthesizer of the present invention. After pre-processing of the basic sound cycle extraction and labeling program, Dian Yue Hung can use this information to arbitrarily adjust the melody required in the song score; in adjusting the pitch parameters, the basic sound cycle is used to synchronize and accumulate (Pitch Synchronous 〇verLap and Add (ps〇LA) technology, this technology can make flexible adjustments to the basic period of sound and leaves different, more efficient, as shown in Figure 5, the basic period of sound is the original γ times 鬲 ' Then the distance between the original sound basic period marks is reduced to 1/2, otherwise, if it is the original 1/2 pitch, the distance between the sound basic period marks is doubled, and every time The basic sound of the two voices is to use a Hamming window to reshape the consumption of the employees of the Bureau of Intellectual Property Bureau. M 559782

經濟部智慧財產局員工消費合作社印製 五、發明說明(8 ) 一新音源波形,然後再將此經過漢明窗加成的波形以重疊 方式累加起來,形成一個新的波形。 漢明窗公式為:Printed by the Consumer Cooperatives of the Intellectual Property Bureau of the Ministry of Economic Affairs. 5. Description of the invention (8) A new sound source waveform, and then the waveform added by the Hamming window is added up in an overlapping manner to form a new waveform. The Hamming window formula is:

W㈣= m〇s[j^,Q<m<N 巧參閱如圖六所示,係顯示本發明之調整聲音長度示 思圖。對於聲音長度的調整則是採用線性映射 mapping)法;每個合成單元都有其各自的聲音長度,而 欲。f的樂谱對於每一個聲音都會有節拍的資訊,根據節 拍多养可以計算出秒數,其中一個節拍的長度計算公式 為: 一個節拍的秒數(t) x取樣頻率(f)= 一個節拍的 長度 以一般速度而言,一拍(ΐ)大約是0.66秒,若是取 樣頻率(Sample rate)設定成2〇ΚΗζ⑴,那麼一拍的 長2大約是0·66 X 20〇〇〇= 132〇點數,當歌譜輸入後再 與貝料庫中的合成單讀比較,便可得知聲音長度需要伸 長或收縮’當前述之聲音長度決錢,再套用線性映射的 原則去調整即能得到最後的聲音長度。 “ 了 θ而一曰長的改變外,有時候也需適時地改變 曰里大小,一般而言,流行歌曲的前奏通常是音量較小 的,副歌部份的音量會較大,且須控制整個音量的變化程 度’不能有忽大忽小的情形出現。其測量音量大小的單位 為分貝(dB),其公式為: 、 l〇*l〇gl〇(l^^(/)) 為了使合成歌聲的品質與真又歌聲相近,本發明採用 -------------·11111.11 ^---I I--ΙΛ (請先閱讀背面之注咅心事項再填寫本頁)W㈣ = m〇s [j ^, Q < m < N Refer to FIG. 6 for a schematic diagram of adjusting the sound length of the present invention. For the adjustment of the sound length, a linear mapping method is used; each synthesis unit has its own sound length, which is desirable. The score of f will have beat information for each sound. The number of seconds can be calculated according to the multi-beat support. The length of one beat is calculated as follows: the number of seconds of a beat (t) x the sampling frequency (f) = one beat In terms of general speed, one beat (ΐ) is about 0.66 seconds. If the sampling rate is set to 2〇ΚΗζ⑴, then the length 2 of a beat is about 0.66 X 200. 00 = 132. The number of points, when the song score is input, and compared with the synthetic single reading in the shell database, you can know that the sound length needs to be expanded or contracted. When the aforementioned sound length determines the money, and then apply the principle of linear mapping to adjust to get the final Sound length. "In addition to the long change of θ, sometimes it is necessary to change the size in time. Generally speaking, the intro of popular songs is usually low, and the volume of the chorus part is higher, and it must be controlled. The degree of change in the overall volume cannot be changed suddenly. The unit for measuring the volume is decibel (dB), and its formula is:, l〇 * l〇gl〇 (l ^^ (/)) The quality of the synthesized singing voice is similar to that of the real singing voice. The present invention uses ------------- 11111.11 ^ --- I I--ΙΛ (Please read the note on the back before filling in this note page)

559782 A7 B7559782 A7 B7

經濟部智慧財產局員工消費合作社印製Printed by the Consumer Cooperative of the Intellectual Property Bureau of the Ministry of Economic Affairs

…、枓晋以及回音等技巧 釋。連接音的好處在於避免合成更-真的s 象,在本發明的實作系統上,所採自, (Cross一Fading),而需要重 ;:乂又宜加法 應該以每個音的特質而定;但是大小,原則上 可以當做連接音的,有4b單字立 的子日部伤都 八# 曰旧子音部份為重要特徽部 伤,右以子音部份做連接音,則會破壞它所 如大部份的無聲子音,如以勹,女,:、、曰貝,例 丄斤 Θ,虫,p, 六寻’以此開頭的單字音是不f要連接音的。 凊參考如圖七所示,係顯示本發明之結果輸出 輸出圖。實力派歌手在詮釋—首歌時,通常會在唱高音或 低音部份會有聲音顫抖的現象,這就是抖音的 =系統資料庫也設計了此項功能,當歌譜輸人系統㈣ 庫後’請參閱如圖七中制指標所指示的部份,該 能將歌譜中的高低音部份描繪出來,當歌曲進行到高 低音時,线資料庫即能主動按照此時該音節的旋律^ 一抖音的現象,使電腦所合成的歌聲能更為人性化;至於 回音(Ech。)部份,則是採用有限脈衝響應遽波器(―仏 I_Se Response Filter ’ _ ’ 其脈衝響應(Impuise Response )為有限的區間,也就是只在有限的時域有非灾 解,經由有限脈衝響應濾波器將波形整形後,使其合成二 果如同KTV唱歌一般。 一。> 歌詞及歌譜輸入程序15:當前述之程序完成後,使 用者輸入歌詞及歌譜,經由電腦的演算後即能合成出一首 歌曲,而此輸入歌詞的程序,可以是文字檔,之後再由預…, Explanations of Jin and echo. The advantage of connecting sounds is to avoid synthesizing more real s-images. In the implementation system of the present invention, it is taken from (Cross-Fading), and needs to be emphasized; 乂 and the addition should be based on the characteristics of each sound But the size, in principle, can be used as a connecting sound, there is a 4b single character Ziribe injury wound eight # said the old consonant part is an important special emblem injury, the right consonant part as the connection sound, it will destroy it For example, most of the silent consonants, such as 勹, female,: ,,, and shell, such as 丄, Θ, worm, p, Liu Xun, are not consonant.凊 Refer to Fig. 7, which shows the result output diagram of the present invention. The interpretation of the talented singers—the song usually trembles in the treble or bass, this is the vibrato = system database also designed this function, when the song score is entered into the system ㈣ library 'Please refer to the part indicated by the system index in Figure 7. This can depict the high and low parts of the song score. When the song reaches the high and low, the line database can actively follow the melody of the syllable at this time ^ The phenomenon of a vibrato makes the computer-synthesized singing sound more humane. As for the echo (Ech.) Part, a finite impulse response chirper (― 仏 I_Se Response Filter '_' Response) is a limited interval, that is, there is a non-disaster solution only in a limited time domain. After the waveform is shaped by a finite impulse response filter, it is synthesized into two results like KTV singing. 1.> Lyrics and song score input program 15: After the foregoing procedure is completed, the user enters lyrics and song scores, and a song can be synthesized through computer calculations. The lyrics input procedure can be a text file, and then Advance

本紙張尺度適用中國國家標準(CNS)A4規格(210 X 297公爱)This paper size applies to China National Standard (CNS) A4 (210 X 297 public love)

請先間讀背面之注意事項再填寫本頁) :--訂---------Please read the precautions on the back before filling out this page): --Order ---------

5597¾ 五 A7 發明說明(1。) 元另合:也歌聲:也可以是使用者自行錄製的⑴ 二另外也可讓使用者針對不同的歌曲,錚製每 ,利用前述之程序即時合成歌聲。職母 .、秀、輪出程序16:由前述之程序處理後,電腦及-透過其輸出裳置將結果送至^ “及月匕 _播或其他的音效擋。]#放出來,同時也儲存成 本备月另一實施例,利用前述之声 男女使用者的聲音錄製於電腦資料庫中' 方法將 唱内容,將殳依照使用者所設定的演 曰円令扣男女的耸音合成即能完成 曲’若歌曲有大合唱的部 ,對曰的歌 方法完成。本發明除了上;::用“之歌聲即時合成 x物、了上迷之實施例外,有時候m 早中難免會有語音及歌聲的發聲方 、在1文 字使電腦利用語音的方式發聲,或i在文字部=力輸入文 譜,此時電腦及能合成出歌聲;當使歌 二實施本發明時’電腦就能利用其系統資料庫::篇= 表現的更為人性化。 扁又早 人士 較佳實施例之後,熟悉該項技術 疋的瞭解,結不脫離下述冑請 下:進行各種變化與改變,而且本發” = ί =施例的實施方式,其中聲音基本週_取方法可利; 均大小差異函數(八州响Μ咖tUde Difference5597¾ Five A7 invention description (1.) Yuan Yuanhe: also singing voice: it can also be recorded by the user. In addition, it can also allow users to control each song for different songs and use the aforementioned program to synthesize singing voices in real time. Worker, mother, show, turn-out program 16: After being processed by the aforementioned program, the computer sends the result to ^ "and moon dagger_broadcast or other sound effects block." # The storage cost is prepared in another embodiment. Using the aforementioned voices of male and female users to record in the computer database, the method will sing the content and synthesize the deduction of male and female shouts in accordance with the user-defined performance command. Completion song 'If the song has a chorus part, the song method is completed. In addition to the above, the present invention uses the "song voice to synthesize x objects in real time, except for the implementation of the fan, sometimes m will inevitably have voice and Singing the sound, make the computer use the voice in 1 text, or i input the score in the text part, and then the computer can synthesize the singing voice; when making the second song, the computer can use it System database :: articles = more humane performance. After Bian and early people are familiar with the preferred embodiment, they will be familiar with the technology and understand the following. Please make the following changes: make various changes and changes, and the present "" = ί = implementation of the embodiment, in which the sound is basically _Take method can benefit; mean size difference function (八 州 响 M Coffee tUde Difference

Function,麵)’或是利用非時域的方法,例如: 諸來求取基週大小。前述之歌聲合成器程序,、 整聲音基本週期的方法可以為:再取樣法等在時域Function) or using non-time domain methods, such as: to find the size of the base cycle. The aforementioned song synthesizer program, and the method of adjusting the basic period of the sound may be: the resampling method in the time domain

本紙張尺度適用中國國家標準(CNS)A4規格(210x 297公爱T ------.丨丨訂·--------- /請先閲讀背面之注意事項再填寫本頁> 經濟部智慧財產局員工消費合作社印製 559782 A7 B7 五、發明說明( D〇main )上的調整方法;以及剩餘誤差訊號之聲音基本週 期同步重豐累加(Residual signal with ps〇LA)和弦波 取樣(SinUS〇idal)等屬於在頻域(Frequency D⑽ain) 的調整方法。 [發明功效] 根據本發明所實施之歌聲即時合成系統與方法,讓原 本僅能發出語音的聲音合成系統進—步達到歌聲的合 成,當使用者利用本發明歌聲即時合成系統*方法,口要 紗道歌㈣歌譜將其輸人電腦即能利㈣腦運算ς成 口曰出一首動人的歌曲,亦或是由使 苴士留〜立, 忧用考錄製一自己本身的 基本早子0,透過輸人㈣與歌譜即 唱出-首屬於自己歌聲的歌曲。 -------—訂-------- (請先閱讀背面之注意事項再填寫本頁) 經濟部智慧財產局員工消費合作社印製This paper size applies to China National Standard (CNS) A4 specification (210x 297 public love T ------. 丨 丨 Order ----------- / Please read the precautions on the back before filling this page > Printed by the Employees' Cooperative of the Intellectual Property Bureau of the Ministry of Economic Affairs 559782 A7 B7 V. Adjustment method on the invention description (Domain); and the basic period of the residual error signal is synchronized and re-accumulated (Residual signal with ps〇LA) chords Wave sampling (SinUSOidal) and the like belong to the adjustment method in the frequency domain (Frequency D⑽ain). [Invention effect] According to the instant singing voice synthesis system and method implemented according to the present invention, the original voice synthesis system can only advance To achieve the synthesis of singing voice, when the user uses the instant singing voice synthesis system * method of the present invention, it is necessary to enter the song into the computer and then enter the computer into the computer, which can be used for brain calculations, or a moving song, or By making the princes stay, Li You, you use the test to record a basic early child of 0, and sing through the input of the cricket and score-a song that belongs to your own singing. --------- Order --- ----- (Please read the notes on the back before filling This page) Ministry of Economic Affairs Intellectual Property Office employees consumer cooperatives printed

559782 A7 五、發明說明()559782 A7 V. Description of Invention ()

[圖式之簡單說明J =-為本發明歌聲即時合成系統 圖一為本發明之某語音端點偵剛圖:冓圖。 、“圖三為本發明之聲音基本週期粹V之白 流程圖。 取之自相闕演算法 Γ為本發明之某語音聲音基本—圖。 β圖五為本發明之語音合成器中聲音基本调j 項 疊累加方法之示意圖。 土週功同步重 圖六為本發明之調整音長示意圖。 圖七為本發明之結果輸出程序之輸出圖。 訂 經濟部智慧財產局員工消費合作社印製 本紙張尺度適用中國國家標準(CNS)A4規格(210 X 297公爱)[Simplified description of the diagram J =-This is the instant voice synthesis system of the present invention. Figure 1 is a voice endpoint detection diagram of the present invention: a diagram. "Figure 3 is the white flow chart of the basic period of the sound of the present invention. V is taken from the phase algorithm. Γ is a basic figure of a speech sound of the present invention. Β Figure 5 is the basic sound of the speech synthesizer of the present invention. Schematic diagram of the method of stacking and accumulating j-terms. Figure 6 is the schematic diagram of adjusting the sound length of the present invention. Figure 7 is the output diagram of the result output procedure of the present invention. Paper size applies to China National Standard (CNS) A4 (210 X 297 public love)

Claims (1)

559782 、申請專利範圍 濟 部 智 慧 員 工 消 費 社 印 1· 一種歌聲即時合成之方法,包含· 式;早…並將輸入的語音儲存成電子資料格 端點_程序··係彻錄製聲音㈣後 #音期來债測出聲音的起迄點的位置; …、耳或 聲音基本週期粹取與標位程序: 測程序所獲得的聲音m枝行聲;mi點偵 對之後每個聲音基本週期之起始點標示所在=了找,亚 合ί器程序:係根據前述之聲音基本週期粹取盘 &位私序’付到母—筆語音之單字音的如 中的每-個字進行=所“的歌譜與歌詞,對歌詞 f 歌譜輸人程序:當前述之歌聲合成器程序完成 =使用者將所輸入歌詞及歌譜,經 時合成出歌曲,·以及 後旎即 放。結果輸出程序:根據前述之程序所產生的結果立即 2.如申請專利範圍第〗項所述之歌聲即時 中辇音輸入裝置可為麥克風或鍵盤。 彳 3二;^:1項所述之歌聲即時合成之方法, 史。。曰輸人可為多種國語單音字,並將輸人的單字 、的Wave格式擋或其他的電子資料格式。 4·如申請專利範圍第〗項所述之歌聲即時 中基週粹財衫為平均大以異函數或_域的方法 訂 線 其 其 本紙張尺涵用*國^票準㈣―規格⑵〇 χ挪公髮 559782559782, patent application scope of Jibei Smart Employees Consumer Press 1. A method for real-time synthesis of singing voices, including: type; early ... and save the input voice as the end point of the electronic data grid_programs ·· 系 彻 录 声 ㈣ 后 # Measure the position of the starting and ending points of sounds during the sound period;…, ear or basic period of sound extraction and labeling procedures: The sound m obtained by the measurement procedure is m; Where the starting point is marked = find, the AHE program: it is based on the basic cycle of sound mentioned above & bit private order 'Paid to the mother-pen voice of the single-word sound as per-word = The "song score and lyrics" of the lyrics f song score input program: when the aforementioned song synthesizer program is completed = the user synthesizes the lyrics and song scores entered over time, and puts it on the back. The result output program: The results produced according to the aforementioned procedure are immediately 2. As the instant singing voice input device described in the item of the scope of the patent application can be a microphone or a keyboard. 彳 32 二; ^: 1 method of instant singing voice synthesis History ... It can be said that the input can be a variety of Mandarin monophone characters, and the input characters, Wave format files, or other electronic data formats can be entered. 4. The singing voice as described in the item in the scope of the patent application is in real time. T-shirts are averaged by using a different function or a field method to set the line. The size of the paper rule is * Guo ^ Voting standard ⑵ Specifications 〇〇 Norwegian public hair 559782 六、申請專利範圍 5甘如申.請專利範圍第1項所述之歌聲即時合出 ,、中端點偵測程序之起迄點位置偵測可利;之方法, 判斷偵測是否有誤。 人工檢查以 6·如申請專利範圍第!項所述之歌聲即時 其中端點偵測程序能夠分離出聲母或韻母之方法, 7·如申請專利範圍第i項所述之歌聲即時 器程序中,歌詞文字的合成物調 曰调、音量、抖音、回音或連接音等特性。i曰長、 8·如申請專利範圍第!項所述之歌聲即時、 二歌:合成器程序,其主要調整聲音基本週°期的方方二其 為:再取樣法、剩餘誤差訊號之聲音基本重:: 加或弦波取樣等方法。 少重豐累 9.如申請專利範圍第】項所述之歌聲即 其中結果輸出程序之音效檔可為—或:他:: 10·如申請專利範圍第i項所述之歌聲即時合 其中結果輸出程序可為電子資料袼式檔。 法, 11 ·如中4專利範圍帛1項所述之歌聲即日夺合 其中歌詞及歌譜輸人程序中,歌詞輸人部分可 = 利用語音輸入。 疋又予或 I2· —種語音與歌聲即時合成之方法,包含·· 一語音資賴之_程序:料❹者藉㈣人裝 =洁言之單音節,並將輸人的語音儲存成電子資料: 端點制程序··係利用錄t聲音樓前後段之無聲或 (請先閱讀背面之注意事項再填寫本頁) --------------6. The scope of patent application 5 Gan Rushen. Please sing the song mentioned in item 1 of the scope of the patent in real time. The mid-point detection procedure can be used to detect the position of the starting point. The method is to determine whether the detection is wrong. Manual inspection to 6 · If the scope of patent application! The method of real-time singing voice described in the above item is a method in which the endpoint detection program can separate the initials or finals. 7. In the song real-time device program described in item i of the patent application scope, the composition of the lyrics and text is called tone, volume, Features such as vibrato, echo, or connection. i said long, 8 · such as the scope of patent application! The real-time and second-song of the song mentioned in the item: Synthesizer program, which mainly adjusts the square of the basic period of the sound. The second is: the resampling method, the sound of the residual error signal is basically heavy :: addition or sine wave sampling and other methods. Less heavy and rich 9. The singing voice as described in the scope of the patent application, which is the sound output file of the result output program, can be -or: he :: 10 · The singing voice as described in the scope of the patent application, i. The output program can be an electronic data file. Method, 11 · The singing voice described in item 1 of the scope of Chinese Patent No. 4 captures the same day. In the lyrics and score input program, the lyrics input part can be input by voice.疋 Youyu or I2 · — A method for real-time synthesis of voice and singing voice, including ... A voice depends on _ Procedure: The person who borrows the costumes = a single syllable of clean speech, and stores the input voice as electronic Information: Endpoint system program ... Using the silence of the front and rear sections of the sound recording floor (Please read the precautions on the back before filling this page) -------------- 經濟部智慧財產局員工消費合作社印製Printed by the Consumer Cooperative of the Intellectual Property Bureau of the Ministry of Economic Affairs 本紙張尺度_巾關家標準(CNS)A4規袼297公愛Paper size_CNS Standard A4 (297)
TW90133568A 2001-12-31 2001-12-31 Real-time music composition method TW559782B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW90133568A TW559782B (en) 2001-12-31 2001-12-31 Real-time music composition method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW90133568A TW559782B (en) 2001-12-31 2001-12-31 Real-time music composition method

Publications (1)

Publication Number Publication Date
TW559782B true TW559782B (en) 2003-11-01

Family

ID=32322799

Family Applications (1)

Application Number Title Priority Date Filing Date
TW90133568A TW559782B (en) 2001-12-31 2001-12-31 Real-time music composition method

Country Status (1)

Country Link
TW (1) TW559782B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI497483B (en) * 2012-04-02 2015-08-21 Yamaha Corp Singing support devices, methods and programs
CN112133269A (en) * 2020-09-22 2020-12-25 腾讯音乐娱乐科技(深圳)有限公司 Audio processing method, device, equipment and medium

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI497483B (en) * 2012-04-02 2015-08-21 Yamaha Corp Singing support devices, methods and programs
CN112133269A (en) * 2020-09-22 2020-12-25 腾讯音乐娱乐科技(深圳)有限公司 Audio processing method, device, equipment and medium
CN112133269B (en) * 2020-09-22 2024-03-15 腾讯音乐娱乐科技(深圳)有限公司 Audio processing method, device, equipment and medium

Similar Documents

Publication Publication Date Title
US9595256B2 (en) System and method for singing synthesis
US10008193B1 (en) Method and system for speech-to-singing voice conversion
Cano et al. Voice Morphing System for Impersonating in Karaoke Applications.
Sharma et al. NHSS: A speech and singing parallel database
JP2011048335A (en) Singing voice synthesis system, singing voice synthesis method and singing voice synthesis device
Mesaros Singing voice identification and lyrics transcription for music information retrieval invited paper
Vijayan et al. Analysis of speech and singing signals for temporal alignment
JP5598516B2 (en) Voice synthesis system for karaoke and parameter extraction device
Gupta et al. Deep learning approaches in topics of singing information processing
JPH11184490A (en) Singing synthesizing method by rule voice synthesis
Lee et al. The musical impact of multicultural London English (MLE) speech rhythm
TWI377558B (en) Singing synthesis systems and related synthesis methods
TW559782B (en) Real-time music composition method
Dong et al. I2r speech2singing perfects everyone's singing.
Duinker Functions of expressive timing in hip-hop flow
JP2009075611A (en) Chorus synthesizer, chorus synthesizing method and program
JP2007140548A (en) Portrait output device and karaoke device
Eckenroth Once Again, on the Music of Laurie Anderson's" O Superman (for Massenet)"
Loscos Spectral processing of the singing voice.
JP2022065554A (en) Method for synthesizing voice and program
ZA et al. Investigating ornamentation in Malay traditional, Asli Music.
Blaauw Modeling timbre for neural singing synthesis: methods for data-efficient, reduced effort voice creation, and fast and stable inference
Janse Time-compressing natural and synthetic speech.
TWI269191B (en) Method of synchronizing speech waveform playback and text display
Beaudoin Dashon Burton's Song Sermon: Corporeal Liveness and the Solemnizing Breath

Legal Events

Date Code Title Description
GD4A Issue of patent certificate for granted invention patent
MM4A Annulment or lapse of patent due to non-payment of fees