TW569182B

TW569182B - Voice tone changing recognition method in Chinese voice recognition system

Info

Publication number: TW569182B
Application number: TW91123526A
Authority: TW
Inventors: Kuan-Nung Lin; Chiou-Yung Chen
Original assignee: Kuan-Nung Lin
Priority date: 2002-10-11
Filing date: 2002-10-11
Publication date: 2004-01-01

Abstract

The invention provides a voice tone changing recognition method in Chinese voice recognition system targeting at automatically recognizing voice tone changing sentence in terms of text string relation and sentence boundary to increase accuracy of system's recognition when continuously generating with level-3 tone characters or repeated characters during Chinese voice (pronunciation).

Description

569182 五、發明說明（i) <發明領域> 本發明係提俾—播中今# 識方法，尤指—種在中文語；；辨=統中語音變調之辨言變調詞句，…成所1i之語 <發明背景> 辦硪正確度之增進目的者。按’目前國内外之語音辨嗜糸部=是:對各國語言中= Ϊ中、文方法中藉以增進系統辨識之正域度；然，丄成：：：；之困擾及困難度外，更有音聲、「二聲m;::讀卜單字之讀*(注；）的「1」二：般中文語句中，若依文語句都沒有問題，此時語；；立=聲或輕聲來哈中中包含了連續「二聲 σ貝曰兀王相同，但當語句，在語音（唸法7丄：：：=’ ί使用於稱謂的相同疊字和讀音間產生變化及差里'有變音的情形發生，使語音關：巧例說明如下之幾種；情形又與詞的字數有，則第-dn::串當中聲若:個字皆為「三」聲你」等兩字字串，而「i你:原ί音：:；!：： V—想第4頁 569182 五、發明說明（2) V )音調，但語音（唸音時）則改變為（丁一尤/ 云一 V )音調。 (2 )、在三個字的字串當中，若其中有連續兩個「三」聲字，則第一個「三」聲字必須改唸「二」聲；若三個字皆「三」聲，則第一個字與第二個字必須改唸「二」聲；如：「潛水艇」之三字字串中，原注音為（< 一弓/ 尸乂 \ V 六一厶V )音調，但唸音時則改變為 (〈一巧/ 尸乂 \ / 六一厶v)音調；又如：「總統獎」之三字字串，原注音為（卩乂厶v 六乂厶v tj — 尤v )音調，但唸音時則改變為（卩乂厶/ 六乂厶/ Μ —尤v )音調。 (3)、在四個字的字串當中，若四個字皆「三」聲，則第一個字與第三個字必須改唸「二」聲；如：「寥寥可數」等四字字串，原注音為（力一幺ν 力一幺ν 万古 V 尸乂 V )音調，但唸音時則改變為（力一幺/ 力一幺V 乃i / 尸乂 V)音調。 (4 )、在五個字的詞當中，若五個字皆「三」聲，則第一個字與第三個、第四個字必須改唸「二」聲；如：「九九九九九」之五字字串（包括本例之數字型態），原注音為W — 又 v W — I) — v Μ 一又V )音調，但唸音時則改變為（Η —又/ Μ —又V Μ—又〆卩一又/ 又ν)音調。 (5)、若連續「三」聲的字數為偶數，且六個字或六個字以上時，前面每兩個字為一組，依照連續兩個「三」569182 V. Description of the invention (i) < Field of invention > The present invention is a method for identifying, broadcasting in Chinese, especially in Chinese; The language of 1i < Background of the Invention > According to the current speech recognition at home and abroad = Yes: in the languages of various countries = in Chinese and Chinese methods to improve the positive degree of system identification; however, the success of :::; There are sounds, "two sounds m ::: reading the single word reading * (Note;)" 1 "2: in the ordinary Chinese sentence, if there is no problem with the sentence according to the text, at this time; Li = sound or soft Laiha contains the continuous "two sounds of σ, beijing, the same as the king, but when the sentence, the change in the pronunciation and pronunciation of the same superposition and pronunciation used in the appellation (pronunciation 7 丄 :: == ί) and the difference between the pronunciation" There are cases of inflection that make the sound off: clever examples explain the following; the situation is also related to the number of words in the word, then the -dn :: string in the voice if: each word is "three" you "etc. Two-character string, and "i 你: 原 ί 音 ::;! :: V—think page 4 569182 V. Description of the invention (2) V) The tone, but the voice (when pronounced) is changed to (Ding Yi You / Yunyi V) Tone. (2) In the three-character string, if there are two consecutive "three" sounds, the first "three" sounds must be changed to "two" sounds; If All three words are "three" sounds, so the first and second words must be changed to "two" sounds. For example, in the three-character string of "submarine", the original note is (< Yi bow / corpse乂 \ V 六一厶 V) tone, but it changes to (<一巧 / (乂 \ / 六一厶 v) tone when you pronounce it; for example: the three-character string of "Presidential Award", the original note is (卩乂厶 v 六乂厶 v tj — You v) tone, but it changes to (改变 / 六乂厶 / Μ —You v) tone when you pronounce. (3) Among the four-character strings, if the four characters are all “three” sounds, then the first and third characters must be changed to “two” sounds; for example, “rarely countable”, etc. In the word string, the original Zhuyin is (Li Yi 幺 ν Li Yi 万万 Vango V corpse V), but when it is pronounced, it changes to (Li Yi 幺 / 力一幺 V i / corpse V). (4) Among the five-character words, if the five characters are all “three” sounds, then the first character and the third and fourth characters must be changed to “two” sounds, such as: "Nine-nine" five-character string (including the digital form of this example), the original Zhuyin is W — and v W — I) — v Μ a and V), but when it is pronounced, it changes to (Η — and / Μ — 又 V Μ—The tone is again and again. (5) If the number of consecutive "three" sounds is even and there are six or more words, the first two words are a group, according to two consecutive "three"

569182 五、發明說明（3) 聲的啥法；如：「九九九九九九」之六字字串，原注音為 (ij -一 V L) ^一 v Lj 一 V Lj '一' % V ij 一569182 V. Explanation of the invention (3) What is the sound? For example: the six-character string of "99,999,99", the original phonetic is (ij-一 VL) ^ 一 v Lj 一 V Lj '一'% V ij a

Lj 一又V )音調，但唸音時則改變為（I) 一又/ q — ^ v Μ — 又 / I] — ^ ν % y — 音調〇 (6)、若連續三聲的字數為奇數，且七個字或七個字以上時，前面每兩個字為一組，依照連續兩個「三」聲的唸法，但最後一組為三個字，依照連續三個「三」聲的唸法；如：「五五五五五五五」原注音為（乂 v 乂 v 乂 V 乂 V 乂 V 乂 V 乂 V )音調，但唸音時則改變為 (乂 / 乂 v 乂〆乂 V 乂〆乂〆乂 V)音調。由上可知，上述之音調變化法則乃係適用於一般詞句與專有名詞，而當三字字串中有「詞界」區分時，須先依「詞界」區分後再套用上述準則，也適用於數字與一般文字之組合，如用於「姓名」時亦適用之，而其音調變化法則係另依下列方式： (1 )、在「姓名」之辨識應用時，必須把「姓」與「名」分開，如：「馬友友」或「柳水扁」，雖三字皆「三」聲，原注音為（门丫 v —又V —5<v)及（力一又V 尸乂 \ V 勹一弓V )，但「馬」、「柳」為「姓」，不改音調而一樣唸「三」聲，「友友」或「水扁」兩字則依上述兩個連續「三」聲的法則，將兩字中之第一個字唸「二」聲，所以改唸為（门丫 V —又/ 一又V)及 (力一尸乂 \ / 勹一弓V)。Lj and V) tones, but change to (I) and / q — ^ v Μ — and / I] — ^ ν% y — tone 0 (6), if the number of words in three consecutive sounds is When the number is odd and there are seven or more characters, the first two characters are a group, according to two consecutive “three” sounds, but the last group is three characters, according to three consecutive “three” Pronunciation: For example: "Five Five Five Five Five Five Five" the original note is (乂 v 乂 v 乂 V 乂 V 乂 V 乂 V 乂 V 乂), but when it is pronounced, it changes to (乂 / 乂 v 乂〆乂 V 乂〆乂〆乂 V) tones. It can be seen from the above that the above-mentioned tone change rules are applicable to general words and proper nouns. When there is a "word boundary" distinction in a three-character string, it must be distinguished according to the "word boundary" before applying the above criteria. Applicable to the combination of numbers and general words, such as when used for "name", and its tone change law is based on the following methods: (1) In the application of "name" identification, the "last name" and the Separate the "name", such as "Yo You Ma" or "Liu Shuibian". Although the three words are all "three" sounds, the original Zhuyin is (门丫 v — 又 V —5 < v) and (力一又 V deadlock \ V 勹一弓 V), but "horse" and "liu" are "surnames", and they say the "three" sounds without changing the pitch. The words "youyou" or "shuibian" follow the two consecutive "three" The rule of "voice" reads the "two" sound in the first word of the two words, so it is changed to (门丫 V — 又 / 一又 V) and (力一 dead 乂 \ / 勹一弓 V).

569182 五、發明說明（4) (2 )其他三字字串中有「詞界」區分時，如：「蔣總統」或「總統獎」等，其中，「總統獎」三個字皆「三」聲，原注音為（卩乂厶V 六乂厶v I] —尤V)，但因不包括「姓」在内，故仍利用上述法則，前面兩個字唸「二」聲，所以唸做「卩乂厶/ 六乂厶/ L] 一尤V」。而「蔣總統」三字皆「三」聲，原注音為（Μ —尤V V 乂厶V 六乂厶V)，但「蔣」字為「姓」，並非詞之一部分，故一樣唸「三」聲，後兩字「總統」則依上述兩個連續三聲的法則將第一個字唸「二」聲，所以唸做「1] 一尤V 卩乂厶/ 六乂厶V 」。又另一種呈疊字型態之稱謂時，即於一中文詞句之稱謂中若有兩個相同的字而呈疊字，就算該兩字為「三」聲字，語音（唸法）也不一樣，疊字的第一個按原來的音唸，第二個字要哈為輕聲。例如：「爺爺」、「奶奶」、「爸爸」、「媽媽」、「哥哥」、「姊姊」、「弟弟」、「妹妹」、「獲狼」等等。綜上所述，中文詞句中音調變化是必然發生之現象，而音調變化所形成語音及讀音的差異性卻明顯會增加語音辨識之困難度及不正確性，造成實際使用上之困擾，如：在一中文語音辨識系統中，係先利用讀音方式輸入一中文資料庫，再而欲利用語音方式將一中文詞句（或字串）輸入該語音辨識系統中，期能藉該語音辨識系統之辨識處理，而可從中文資料庫中擷取出正確之中文詞句或訊號並輸出使用，以達成語音操控之便利性；然，實際使用時，往569182 V. Description of the invention (4) (2) When there is a "word boundary" distinction in other three-character strings, such as: "President Jiang" or "Presidential Award", among which, the three words "Presidential Award" are "three ", The original Zhuyin is (V 六乂厶 v I] — 尤 V), but because it does not include the" surname ", so still using the above rule, the first two words pronounce" two ", so Do "卩乂厶 / 六乂厶 / L] 一尤 V". And the words "President Jiang" have "three" sounds. The original note is (M—You VV 乂厶 V 六乂厶 V), but the word "Jiang" is "surname" and is not part of the word. "The second two words" President "read the first word" two "according to the above two rules of three consecutive tones, so say" 1] Yiyou V 卩乂厶 / 六乂厶 V ". In another type of title with a superimposed form, that is, if there are two identical words in the title of a Chinese phrase, the superimposed words are even if the two words are "three" sounds, and the pronunciation (pronunciation) is not Similarly, the first word of the superimposed word is pronounced according to the original sound, and the second word should be soft. For example: "Grandpa", "Grandma", "Dad", "Mom", "Brother", "Sister", "Brother", "Sister", "Wolf" and so on. In summary, tonal changes in Chinese words and phrases are inevitable, and the differences in pronunciation and pronunciation formed by the tonal changes will obviously increase the difficulty and incorrectness of speech recognition, causing practical problems such as: In a Chinese speech recognition system, a Chinese database is first input by using a pronunciation method, and then a Chinese word (or string) is to be input into the speech recognition system by using a speech method, and the recognition by the speech recognition system is expected. Processing, and can extract the correct Chinese words or signals from the Chinese database and output them for use in order to achieve the convenience of voice control; however, in actual use, go to

569182 五、發明說明（5) 往》同—中文^上不同’致無m r串),中士語音因變音關係而與讀音範例說明如ί錯誤’也影響後續作業之進行，試再舉實‘ 電話系統裝^::語音轉接分機之公司電話系㉟：在上述入之語音資名已具備辨識系、统，以將來電輸中；而公司之任立桃：猎麥克而進入該語音辨識系統 -中文資料；：：辨識糸統一般係先已利用讀音方式輸入號碼」，則使用二m某i:姓名」與其所屬「分機音資料加以辨气、’ θ辨識系統須針對來電輸入之語句或’以從中文資料庫中擷取出正確之中文詞」，“人ϊϊί用，㈤：來電輸人資料原係為「馬友友音—辨識系統之中文資料庫中「馬友友」係以讀音)(门 ^ ^ V 一又V)儲存，致易使輸入之語音資料（门一又/一又V )無法簡易辨識而造成後續系統之作、、員如·因無法辨識正確姓名致無法快速地自動轉接至正確分機，降低了語音轉接系統之使用功效。 ^ 2 )、如使用在醫院之語音掛號系統：在上述語音掛號系統裝置中皆已具備一套語音辨識系統，以將來電輸入 σ曰"k料如姓名荨’藉由麥克風而進入該語音辨識系統中’而醫院之語音辨識系統一般係先已利用讀音方式輸入一中文資料庫，如：醫院中一某人姓名與其所屬掛號單，則使用時，該語音辨識系統須針對來電輸入之語音資料加 $ 8頁 569182 以而庠 j簡作發明說明（6) 辨識，以從中文資料庫中擷取出正確之中輪出使用，如··來電輪入資料原係為「馬辨識號碼（如掛號單號碼），但輸入之語 η丫 v —又/ 一又\/)，而語音辨識系中「馬友友」係以讀音（门丫 V —又V 致易使輸入之語音資料（门丫 V —又/ 易辨識以輸出正確之中文詞句或訊號，造業麻煩，如：醫院中無法辨識正確姓名致其中，掛號單號碼之長字串也可能發生語文詞句友友」音資料統之中 —又V —Κ ν 成後續無法語音變調或訊號並連同卻為文資料 )儲存 )無法系統之音掛號之情形 :3 )、如飯店客房分機之語音撥接系布统裝置中皆已具備一套扭立 :隹上疋語音資料，如姓名或4::;識=來=來電輪入之音辨識系統中；而飯店之進入該語音大4 μ &〜< °口曰辨識系統一般係先已利用讀房登資料庫，㈤：飯店中某-客房號及已住電D 則使用時，該語音辨識系統須針對來之：t資料f以辨識，以從中文資料庫中擁取出正為「民e f ° * 5fl旎而輪出使用，如：來電輸入資料原係 =友友」及連同一辨識號碼（如客房號碼），但輸入 m :為（门丫v……），而語音辨識糸統之中文資料庫中「馬友友」係以讀音（门丫v 一又一又^)儲存丄致易使輸入之語音資料（ηγν 一又一又w無法簡易辨識以輸出正確之中文詞句或訊號，造成後縯系統之作業麻煩，如··飯店因無法辨識來電語 V /569182 V. Description of the invention (5) Going to the same—Chinese ^ is different from 'zhiwu mr string), the sergeant's voice due to the inflection relationship and the pronunciation example description is wrong also affects the follow-up operation, try to be realistic '' Telephone system installation ^ :: The company telephone system of the voice extension extension: The above-mentioned voice name already has the identification system and system to input the incoming call; and the company's Ren Litao: Mike Hunt enters the voice recognition system -Chinese data; :: Identification systems generally use the pronunciation method to enter the number ", then use two m: i: name" and its extension "extension data to discriminate," θ identification system must input the sentence or 'To extract the correct Chinese words from the Chinese database', "People use them, ㈤: The original data of the caller and input person is" Ma Youyou "—the Chinese database of the identification system is based on pronunciation) (Door ^ ^ V I and V) storage, so that the input voice data (gate I / I and V) could not be easily identified and caused the subsequent system to work, and the staff could not quickly transfer automatically because they could not recognize the correct name to Indeed extension, reducing the effectiveness of the use of the voice switching system. ^ 2) If using a voice registration system in a hospital: All of the above voice registration system devices already have a voice recognition system to input incoming calls σ quot; k material like name netting 'into the voice through a microphone In the recognition system, the speech recognition system in a hospital usually uses a pronunciation method to input a Chinese database. For example, the name of a person in the hospital and the registration form to which they belong. In use, the speech recognition system must respond to the voice input by a caller. Data plus $ 8 pages, 569182, and 简 j brief description of invention (6) identification, in order to extract the correct from the Chinese database for use in rotation, such as ·· Incoming call rotation data was originally "horse identification number (such as Registered order number), but the input language η 丫 v — / / 一又 \ /), and the “Ma Youyou” in the speech recognition system uses pronunciation (门丫 V — and V to make it easy to input the voice data (门丫 V —Also, it is easy to identify and output correct Chinese words or signals, which is troublesome in the manufacturing industry. For example, the hospital cannot recognize the correct name and the long string of registered slip numbers may also occur. In the system of data—and V—Κ ν, it is impossible to change the tone or signal in the subsequent voice, and it is stored together with the document data)) The system cannot register the voice of the system: 3), such as the voice dialing of the hotel room extension system All have a set of stand-up: 隹上隹 voice data, such as name or 4 ::; cognition = come = incoming call recognition system; and the restaurant's access to the voice is 4 μ & ~ < ° 口Generally speaking, the identification system usually uses the room reading database first. ㈤: When a hotel room-room number and live electricity D are used, the speech recognition system must be targeted at: t data f to identify, and Chinese data The data in the library is taken out as "Min ef ° * 5fl" and used in turns. For example, the original data of the incoming call = Friends and the same identification number (such as the room number), but the input m: (（门丫 v… …), And the "Ma Youyou" in the Chinese database of the speech recognition system is stored in pronunciation (门丫 v one after another ^), which makes it easy to input the voice data (ηγν one after another and w cannot be easily identified to output Correct Chinese words or signals, resulting in Industry in trouble, such as the inability to identify the caller · Hotel phrase V /

569182 五、發明說明（7) 曰中之房客正確姓名，致無法自動轉接至店電話系統中要求欲轉接客房之來電-〜而一般飯客之正禮姓名，若吻合始將來轉^ =分機與房當之推銷。 J得得至客房，以防不〈發明目的〉本發明之一目的在於提供一種中立音變古周之挪坤士^ ^ T又。口9辨識系統中語 ii:=i 其係於中文語音中，當連續有「三」 =子連接赉音而自然產生語音變調ϋ =依子串關係及詞界而改變，並藉以辨識已 …ί调之§司句，俾增加系統辨識之正確度。。有關本發明為達上述目的、其功效，兹例舉較佳實施#丨# j徵所十木用的技術手段及〈發明說明〉“施例並配合圖式說明如下：本發明係一種「中文言五音方法」，請參照第一乐統中語音變調之辨識 $ # ί ί s辨識系統之中文資料庫方面：俜利用鍅般或其他輸入裝置（、、丁寸厚万命加利用鍵盤 (2 )，而#由1 h ρ，將中文資料輸入於資料緩衝區 ])而籍由巧句判斷分解處理罝；r Q、. 緩衝區（2 )内之中文眘袓* I早疋（3 )，處理資料斷為「稱褶轟宝，、’、、’加以判斷，若該中文資料剌辦马％明疊子」則產生稱貝抖判硬碟記憶體等儲存媒# f •予貝枓（4 )，再儲存於「連續『三』聲」則產生：浐；而若該：文資料判斷為存於硬碟記憶體等儲存媒體^』聲#料（5)，再儲媒體（6 )中得建置成」j H，使硬碟記憶體等儲存中文貝料庫方面：而硬碟記憶體聊182569182 V. Description of the invention (7) The correct name of the tenant in the middle cannot be automatically transferred to the call in the store's telephone system that requires the transfer of the guest room-~ and the name of the official salute of the meal, if it matches, it will be transferred in the future ^ = Extensions and rooms should be promoted. J got the guest room just in case <Objective of the Invention> One of the objects of the present invention is to provide a neutral sound-changing ancient week of Nokushi ^ ^ T again. Oral 9 recognition system Chinese ii: = i It is in Chinese phonetics. When there are “three” in succession = sub-connected 赉 sounds will naturally produce phonetic transliteration ϋ = change according to the substring relationship and the word boundary, and identify the already ... Adjust the § sentence to increase the accuracy of the system identification. . For the purpose of the present invention to achieve the above-mentioned objectives and its effects, the technical means for better implementation # 丨 # j 征所十木 and the "Explanation of the Invention" "exemplary and illustrated with drawings are described below: The present invention is a" Chinese "Five-voice method", please refer to the identification of the tone change in the first music system. $ # Ί ί The Chinese database of the s recognition system: 俜 Using ordinary or other input devices (,, Ding inch thick, and using the keyboard (2 ) And # by 1 h ρ, input Chinese data into the data buffer]] and decompose and process it by clever sentence judgment; r Q ,. Chinese caution in buffer (2) * I 早疋 (3) The processing data is judged to be "called fold bangbao ,,,,,," to judge, if the Chinese data does not work, it will generate storage media such as hard disk memory, etc. # f • 于贝予(4), and then stored in "continuous" three "sounds" will produce: 浐; if this: the text data is judged to be stored in the storage medium such as hard disk memory ^ "sound # material (5), and then stored in the media (6 ) In the "J H", so that hard disk memory, etc. to store the Chinese shell database: and hard disk memory chat 18 2

等儲存媒體（6 )則與業系統。信號比對處理器（7 )連結成一Other storage media (6) are related to the industry system. The signal comparison processor (7) is connected into one

而使用時，經由麥克風入一中文語音資料，該中文 ’再藉由信號比對處理器（輪出（10 )，供後續系統接凊參照第二圖所示，其係中接發音或相同疊字而自然產 ’當比對資料輸入後，即先 )，將比對資料分為一般比比對程序（丨3 )及稱謂疊字一般比對程序（1 2 )，則比 )中；而若為連續「三」聲進入依連續「三」聲字數分分為2個連續「三」聲（17 、4個連續「三」聲（19 ) 6個連續「三」聲（21 )、同字數，並依不同字數進行針對2個連續「三」聲（17 」聲比對資料（23 )，針對其中第1 、2個字建立「二個連續「三」聲（19 )則對聲比對資料（2 5 )，針對5 中第1、3、4個字建立「或其它語音輸入裝置（8)輸語音資料先經信號編碼（9 ) 7 )進行辨識，再將比對結果收正確訊號而進行後續作業。文語音中連續有「三」聲字連生語音變調之變化方式示意圖進入依詞界規則分解程序（j工對程序（12)、連續「三」聲比對程序（14 )，其中，若為對資料直接建置在資料庫（i 5 比對程序（13 )，則比對資料類程序（1 6 )，將比對資料區 )、3個連續「三」聲（丨8 ) 、5個連續「三」聲（2 〇 )、 7個連續「三」聲（22 )等不不同比對資料之建置，包括： )則對其中第1個字建立「二 3個連續「三」聲（18 )則& 」聲比對資料（24 )，針對4 其中第1 、3個字建立「二個連續「二」聲（2 〇 )則對其二」聲比對資料（2 6 )，針對When in use, a Chinese voice data is input through the microphone, and the Chinese language is then compared with the signal processor (round out (10)) for subsequent systems to refer to as shown in the second figure, which is connected with pronunciation or the same stack. When the comparison data is entered, it is divided into the general comparison process (丨 3) and the titled general comparison process (1 2). For continuous "three" sounds, the number of consecutive "three" sounds is divided into two consecutive "three" sounds (17, 4 consecutive "three" sounds (19), and six consecutive "three" sounds (21). The number of words, and according to the different number of words for two consecutive "three" sounds (17) sound comparison data (23), for the first one, two words to establish "two consecutive" three "sounds (19) Acoustic comparison data (2 5). For the first, third, and fourth words in 5 or other speech input devices (8), the input speech data is identified by signal encoding (9) 7), and the comparison result is then compared. Receiving the correct signal for follow-up operations. Schematic diagram of the changing pattern of the continuous three-voice continuous tone in the mantra Enter the decomposition procedure according to the word boundary rules (j-matching procedure (12), continuous "three" sound comparison procedure (14), where if the data is directly built in the database (i 5 comparison procedure (13), The data program (16) will compare the data area), 3 consecutive "three" sounds (丨 8), 5 consecutive "three" sounds (20), and 7 consecutive "three" sounds ( 22) The construction of different comparison data, including:) For the first word, "two 3 consecutive" three "sounds (18) then &" sound comparison data (24), for 4 of which The first and third words establish "two consecutive" two "sounds (20), and then compare the two" sound comparison data (2 6).

569182 五、發明說明（9) 6個連續「三」聲（21 )則對其中第1 、3、5個字建立「二」聲比對資料（27 )，針對7個連續「三」聲（22 ) 則對其中第1 、3、5、6個字建立「二」聲比對資料 (2 8 )，並將所有經過變音之比對資料（2 3 ) 、（ 2 4 )、 (25) 、（26) 、（27) 、（28)建置在在資料庫（15) 中。又若為稱謂疊字比對程序（1 4 )，則對其中第2個字建立輕聲比對資料（2 9 )，並將變音之比對資料（2 9 )建置在在資料庫（1 5 )中。則資料庫（丨5 )中已針對一般比對程序（12 )、連續「三」聲比對程序（13 )及稱謂叠字比對程序（14)建置完成完整之比對資料，尤其是語音及讀音間最可能產生之變音部分；則當語音信號（3〇 ) ^入而欲辯識時，可藉資料庫（15 )已儲存之完整比對資^進行语音比對（3 1 )，再將比對結果輸出（3 2 )，供後續系統接收正確訊號而得進行後續作業。、尔為了讓熟知該項技術之人士能更加瞭解本發明之特徵’特舉語音轉接之電話系統的實施例，說明本發明每際使用情形：貝清參照第三A圖所示，先是使用者來話（4 〇 );進行 =自動接聽並宣告歡迎詞（41 )，如：『0〇公司您好^ 清告訴我您要找誰？』；再進行至使用者說出（語音）要找的人C 42 )，如：『陳總統（考~ /卩乂厶/六乂乙 V )』；再進行至一般比對（4 3 )，即陳總統之語音（考 T7 乂人/云乂厶v)與讀音（彳4 /卩乂厶v 六之比對；若比對失敗（因語音與讀音已有差異569182 V. Description of the invention (9) Six consecutive "three" sounds (21) For the first, third, and fifth words, "two" sound comparison data (27) is created, and for seven consecutive "three" sounds ( 22) then establish "two" sound comparison data (2 8) for the first, 3, 5, and 6 words, and compare all the converted data (2 3), (2 4), (25 ), (26), (27), (28) are built in the database (15). If it is the title overlapping program (1 4), a soft comparison data (2 9) is established for the second word, and the variable comparison data (2 9) is built in the database ( 1 5). Then the database (丨 5) has completed the complete comparison data for the general comparison program (12), the continuous "three" sound comparison program (13) and the title superposition comparison program (14), especially The most probable part of voice change between speech and pronunciation; when the voice signal (30) is entered and you want to identify, you can use the complete comparison data stored in the database (15) to perform a voice comparison (3 1 ), And then output the comparison result (3 2) for subsequent systems to receive the correct signal to perform subsequent operations. In order to let those who are familiar with the technology to better understand the features of the present invention, the embodiment of the "telephone system of special voice transfer" is used to explain the use of the present invention: Bei Qing refers to Figure 3A and uses it first. Caller (4 〇); proceed = automatically answer and announce the welcome speech (41), such as: "0〇 Hello company ^ tell me who are you looking for? ”; Then proceed until the user speaks (voice) the person C 42), such as:“ President Chen (test ~ / 卩乂厶 / 六乂乙 V) ”; then proceed to the general comparison (4 3) , That is, the comparison between President Chen ’s voice (test T7 乂人 / 云乂厶 v) and pronunciation (彳 4 / 卩乂厶 v six; if the comparison fails (due to the difference between the pronunciation and the pronunciation)

第12頁 569182 五、發明說明（ίο) ，則進行至連續「， (彳㈠〆T7 乂厶/ 卩乂厶/六乂厶v 比對43提高 )，如：『去乂厶v ) 識（46 ) 〇」聲比對（4 4 )，即陳總統之語音六乂人v)與轉換後之語音（4 比對；比對成功（正確率已比一般 )’再進行至語音宣讀比對結果並作處理（4 5 您要找的是陳總統之語音（4 4 /卩乂厶/ ’我馬上為您轉接』：轉接成功而結束語音辯至自^ ί照第三B圖所示，先是使用者來話（47 );進行，請生士聽並宣告歡迎詞（48 )，如：『這裡是寶貝家庭要找的 \ \ 门 ( b 幺 V 讀音已妹妹 ( 音 ( ( 正確結果並 ( b 功而結综語音辨更可门入\门\ \ 要找我您要找誰？』；再進行至使用者說出（語音）伐的人（49)，如：『陳小妹妹（^ / 丁一么ν门讀音已 ' ，· 、' y之比對；若比對失敗（因語音與妹妷r z差異）’則進行至稱謂疊字比對（51 )，即陳小丁一么v门入、门入·）與轉換後之語一么v门\、门\ ·)比對；比對成功般比對43提高），再進行至語音宣讀比對『您等一下，我叫陳小妹妹 \ ·)』：再進行至一般比對（5 〇 )，即陳小妹妹 / 丁一么\、门〜·）與讀音（才—一 ^7 / b y ， ^ 率已比狀〜β π妒 7 ’並作處理（5 2 )，如：功”：語丁音一:識η'·)聽電話』；轉接成語音：i可知，針對輸入之任何中文語音，本發明之中文，i;;糸統除可進行一般語音比對如⑷…50”卜比對失敗時，自動依字串關係及詞界，再利用轉Page 12 569182 V. Description of the invention (ίο), proceed to continuous ", (彳㈠〆T7 乂厶 / 卩乂厶 / 六乂厶 v compared to 43 to increase), such as:" 去乂厶 v） cognition ( 46) 〇 ”sound comparison (4 4), that is, the speech of President Chen ’s speech six people v) and the converted speech (4 comparison; the comparison is successful (the correct rate has been more than normal)” and then proceed to the speech reading comparison Results and processing (4 5 What you are looking for is the voice of President Chen (4 4 / 卩乂厶 / 'I will transfer it for you right away'): the transfer is successful and the speech is over to ^ 照 According to the third picture Instructions, the user's incoming call (47); proceed, please listen and announce the welcome speech (48), such as: "Here is what the baby family is looking for \ \ 门 (b 幺 V pronunciation has sister (音 ((correct The result is (b), and the comprehensive speech recognition can be more accessible. \ Do you want me? Whom are you looking for? ”, And then proceed to the person who spoke (voice) and fell (49), such as" Chen Xiao Younger sister (^ / 丁一么 ν door pronunciation has been compared with ', ·,' y; if the comparison fails (due to the difference between the voice and the sister's rz), the process is complete Predicate superposition comparison (51), that is, the comparison of Chen Xiaoding Yi Mo v gate entry, gate entry ·) and the converted language Yi gate v gate \, gate \ ·); the comparison is successful, compared with 43)) Proceed to the voice reading comparison "Wait a minute, my name is Chen Xiaomei \ ·)": then proceed to the general comparison (50), that is, Chen Xiaomei / Ding Yima \, door ~ ·) and the pronunciation (cause —One ^ 7 / by, ^ rate has been compared ~ β π jealousy 7 'and processed (5 2), such as: "work": tingyin one: know η' ·) listen to the phone "; transfer into voice: It can be known that, for any Chinese speech input, the Chinese language of the present invention, i ;; 糸 system can perform general speech comparison such as ⑷ ... 50 ". When the comparison fails, it will automatically use the string relationship and word boundary, and then use the conversion.

569182569182

五、發明說明（11) 換後之语音來辨識對（4 4 )或稱謂疊續「三」聲字連接加系統辨識之正確語音變調後之詞句字比對（51 )，以或相同疊字時所產度。，如連續「三」聲比有效降低語音中有連生之辯識困難，而增她砷：ΐ所述’本發明「中文語音辨識系統中語音變調之辨識方法」，的確能藉由上述所揭露之方法，達到所述之 ί ί ° ΐ本發明申請前未見於刊物亦未公開使用，誠已符 a發明專利之新穎性、進步牲等要件。V. Description of the invention (11) Recognition of the changed speech (4 4) or the term “triple” sound word connection plus the system's identification of the correct tone change of the system (51), or the same superposition When produced. For example, the continuous "three" sound ratio can effectively reduce the difficulty of recognizing continuous speech in speech, and increase her arsenic: "The invention" recognition method of tone change in Chinese speech recognition system "according to the present invention can indeed be used by the above The method of disclosure achieves the above-mentioned ί ΐ ΐ The application of the present invention has not been seen in publications or used publicly, and it has met the requirements for novelty and progress of the invention patent.

准’上述所揭之圖式及說明，僅為本發明之實施例而已，非為限定本發明之實施例；大凡熟悉該項技藝之人士其所依本發明之特徵範疇，所作之其它等效變化或修飾，皆應涵蓋在以下本案之申請專利範圍内。 <The figures and descriptions disclosed above are only examples of the present invention, and are not intended to limit the embodiments of the present invention. Anyone who is familiar with the technology may make other equivalents based on the features and scope of the present invention. Changes or modifications shall be covered by the scope of patent application in the following case. <

第14頁 569182 圖式簡單說明第一圖：係本發明之流程方塊圖。第二圖：係中文語音中連續有「三」聲字連接發音或相同疊字而自然產生語音變調之變化方式示意圖。第三A圖：係本發明一實施例之實際操控方塊示意圖。第三B圖··係本發明另一實施例之實際操控方塊示意圖。 < 元件符號說明 > ( 1 ) m 入裝置 ( 2 ) 資料缓衝 ( 3 ) 詞句判斷 ( 4 ) 產生稱謂 ( 5 ) 產生連續 ( 6 ) 儲存媒體 ( 7 ) 信號比對 ( 8 ) 語音入 ( 9 ) 信號編碼 (10 ) 比對結果 ( 11 ) 依詞界規 ( 12 ) _ _· 般比對 ( 13 ) 連續厂一一 ( 14 ) 稱謂疊字 ( 15 ) 資料庫 ( 16 ) 依連續厂 ( 17 ) 2 個連續區分解處理單元疊字資料「三」聲資料處理器裝置輸出則分解程序程序」聲比對程序比對程序三」聲字數分類程序「三」聲 569182 圖式簡單說明 (18 ) 3個連續「三」聲 (19 ) 4個連續「三」聲 (20 ) 5個連續「三」聲 (21 ) 6個連續「三」聲 (22 ) 7個連續厂三」聲 (23 )、 (24 )、 (25 ) 、（26 ) 、（27 )、 (29 ) 比對資料輕聲比對資料 (30 ) 語音信號 (31 ) 語音比對 (32 ) 比對結果輸出 (40 ) > (47 ) 使用者來話 (41 )、 (48 ) 自動接聽並宣告歡迎詞 (42 ) > (49 ) 使用者說出要找的人 (43 )、 (50 ) 一般比對 (44 ) 連續「三」聲比對 (45 )、 (52 ) 語音宣讀比對結果並作處理 (46 )、 (53 ) 結束 (51 ) 稱謂疊字比對 (28 )Page 14 569182 Schematic description of the first diagram: The block diagram of the process of the present invention. The second picture: It is a schematic diagram of the way of changing the pronunciation of the "three" sounds in the Chinese phonetic sound. FIG. 3A is a schematic diagram of an actual control block according to an embodiment of the present invention. FIG. 3B is a schematic diagram of an actual control block according to another embodiment of the present invention. < Explanation of component symbols > (1) m input device (2) data buffering (3) sentence judgment (4) generating titles (5) generating continuous (6) storage media (7) signal comparison (8) voice input (9) Signal coding (10) Comparison result (11) According to word boundary rules (12) _ _ · General comparison (13) Continuous plant one by one (14) Appellation superposition (15) Database (16) According to continuous Factory (17) 2 continuous zone decomposition processing units superimposed data "three" sound data processor device output is decomposed program program "sound comparison program comparison program three" sound word number classification program "three" sound 569182 simple diagram Explanation (18) 3 consecutive "three" sounds (19) 4 consecutive "three" sounds (20) 5 consecutive "three" sounds (21) 6 consecutive "three" sounds (22) 7 consecutive factory three " (23), (24), (25), (26), (27), (29) Compare data softly Compare data (30) Voice signal (31) Voice comparison (32) The comparison result is output ( 40) > (47) User calls (41), (48) Automatically answer And announce the welcome speech (42) > (49) users say who they are looking for (43), (50) general comparison (44) continuous "three" sound comparison (45), (52) speech reading ratio The results are processed (46), (53) End (51) Appellation overlap (28)

第16頁Page 16

Claims

569182

The method for identifying tone changes includes the following steps of inputting Chinese data into the data and a Chinese speech recognition system: using a keyboard or other input devices to punch the area; processing the Chinese sound of the data buffer or the title superposition; 'generating the title Reduplicated data, the vocalist 'produces continuous "three 乂 use sentence to determine the decomposition of processing units, materials, and distinguish the continuous according to the rules of the word field" Third, the Chinese word for the reduplication and stored in storage media; The continuous sound data in the data is stored in the storage medium so that the storage medium is connected to the signal comparison processor. The Chinese speech data of the signal-input speech input; Use k to compare the processor, according to the Chinese new data The media data is stored and the voice T is identified; the comparison result is output. 2. The method described in the first item of the patent scope, in which the word judgment second solution processing unit decomposes the input according to the word boundary rules. Chinese materials: for-general comparison, continuous "three" sound comparison and title overlap comparison 3 patent scope first The method of claim 2, wherein the processing solution words blade unit may be generally aligned in the build data to form the data storage media library. 4. ΐ :::! The method described in item 1 or 2 of the patent scope, wherein the word processing unit can classify the continuous "three" sounds into 2

569182 Six, the scope of patent application is a continuous "three" sound, three consecutive "" ears, five consecutive "three tones, two" chirps, four consecutive "three" three "tones, etc. different words =: three, 7 words to establish "two" sound comparison information [,; two "voices to it"] then to the first and second words, "to 3 consecutive" three "sounds 4 consecutive" three "sounds The sound comparison data of the “Fu order” ρ, the comparison data of f, and the five consecutive “three ::: words” “two” f, the four words of the “two” sound comparison. The "1" and "2" sounds are compared to the 2q, "quote", and for the 6 consecutive plant data, the "2nd" sounds are compared: each of the "2" sounds is compared to the 2nd, 3, and 5 ★ I :; :: Built in Jiancun Media to form the second; After the voice change declaration patent scope i or 2 item 2 decomposition processing unit can input the Chinese = 2 of the two words in the ::: The second word in the: to establish soft comparison data 2 Titles are built on storage media to form a database. For the ratio of patent application scope i 7 including hard disk memory. With COSCO storage media = the method described in item 丄 or 2 of the scope of the patent application, the H !! ί: device and the speech turn based on the information of the storage media to identify the data, you can perform general speech first By comparison, ί ί: When t loses, it is identified by the string relationship and word boundary, and the converted ^ is used for identification. 1 Watt Heart Page 18