TW569182B - Voice tone changing recognition method in Chinese voice recognition system - Google Patents

Voice tone changing recognition method in Chinese voice recognition system

Info

Publication number
TW569182B
Authority
TW
Taiwan
Prior art keywords
data
comparison
chinese
sounds
words
Prior art date
Application number
TW91123526A
Other languages
Chinese (zh)
Inventor
Kuan-Nung Lin
Chiou-Yung Chen
Original Assignee
Kuan-Nung Lin
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Kuan-Nung Lin
Priority to TW91123526A
Application granted
Publication of TW569182B

Landscapes

  • Machine Translation (AREA)

Abstract

The invention provides a method for recognizing tone changes in a Chinese speech recognition system. When consecutive third-tone characters or reduplicated characters are pronounced in succession in spoken Chinese, the system automatically recognizes the tone-changed phrase according to character-string relations and word boundaries, thereby increasing the system's recognition accuracy.

Description

Field of the Invention

The present invention provides a method for recognizing tone changes in a Chinese speech recognition system, and in particular a method by which such a system automatically recognizes phrases whose tones have changed in speech, so as to improve recognition accuracy.

Background of the Invention

Most speech recognition systems, domestic and foreign, are tailored to the characteristics of a particular language in order to improve recognition accuracy. Chinese, apart from its other difficulties, has tones: the first, second, third, and fourth tones and the neutral tone. In an ordinary Chinese sentence, reading each character with the tone of its citation reading (讀音, the Zhuyin of the single character) causes no problem, and the spoken form (語音, how the phrase is actually said) is identical to the citation reading. When a sentence contains consecutive third-tone characters, however, or a reduplicated character used in a form of address, the tones change in speech, so the spoken form and the citation reading differ. The change also depends on the number of characters in the word, as the following cases illustrate:

(1) In a two-character string, if both characters are third tone, the first character is read as second tone. For example, the two-character string 「想你」 has the citation reading (ㄒㄧㄤˇ ㄋㄧˇ) but is spoken as (ㄒㄧㄤˊ ㄋㄧˇ).

(2) In a three-character string, if two consecutive characters are third tone, the first of those two is read as second tone; if all three characters are third tone, the first and second characters are read as second tone. For example, 「潛水艇」 has the citation reading (ㄑㄧㄢˊ ㄕㄨㄟˇ ㄊㄧㄥˇ) but is spoken as (ㄑㄧㄢˊ ㄕㄨㄟˊ ㄊㄧㄥˇ); 「總統獎」 has the citation reading (ㄗㄨㄥˇ ㄊㄨㄥˇ ㄐㄧㄤˇ) but is spoken as (ㄗㄨㄥˊ ㄊㄨㄥˊ ㄐㄧㄤˇ).

(3) In a four-character string, if all four characters are third tone, the first and third characters are read as second tone. For example, 「寥寥可數」 has the citation reading (ㄌㄧㄠˇ ㄌㄧㄠˇ ㄎㄜˇ ㄕㄨˇ) but is spoken as (ㄌㄧㄠˊ ㄌㄧㄠˇ ㄎㄜˊ ㄕㄨˇ).

(4) In a five-character word, if all five characters are third tone, the first, third, and fourth characters are read as second tone. For example, the five-character string 「九九九九九」 (digit strings are included) has the citation reading (ㄐㄧㄡˇ ㄐㄧㄡˇ ㄐㄧㄡˇ ㄐㄧㄡˇ ㄐㄧㄡˇ) but is spoken as (ㄐㄧㄡˊ ㄐㄧㄡˇ ㄐㄧㄡˊ ㄐㄧㄡˊ ㄐㄧㄡˇ).

(5) If the number of consecutive third-tone characters is even and is six or more, the characters are grouped in pairs from the front and each pair follows the rule for two consecutive third tones. For example, the six-character string 「九九九九九九」 has the citation reading (ㄐㄧㄡˇ ㄐㄧㄡˇ ㄐㄧㄡˇ ㄐㄧㄡˇ ㄐㄧㄡˇ ㄐㄧㄡˇ) but is spoken as (ㄐㄧㄡˊ ㄐㄧㄡˇ ㄐㄧㄡˊ ㄐㄧㄡˇ ㄐㄧㄡˊ ㄐㄧㄡˇ).

(6) If the number of consecutive third-tone characters is odd and is seven or more, the characters are grouped in pairs from the front and each pair follows the rule for two consecutive third tones, except that the last group has three characters and follows the rule for three consecutive third tones. For example, 「五五五五五五五」 has the citation reading (ㄨˇ ㄨˇ ㄨˇ ㄨˇ ㄨˇ ㄨˇ ㄨˇ) but is spoken as (ㄨˊ ㄨˇ ㄨˊ ㄨˇ ㄨˊ ㄨˊ ㄨˇ).

These tone-change rules apply to ordinary phrases and proper nouns alike, and also to combinations of digits and ordinary characters. When a three-character string contains a word boundary, the string must first be split at that boundary and the rules then applied. The same holds for personal names, where the rules operate as follows:

(1) When recognizing a personal name, the surname and the given name must be separated. In 「馬友友」 and 「柳水扁」, all three characters are third tone, with citation readings (ㄇㄚˇ ㄧㄡˇ ㄧㄡˇ) and (ㄌㄧㄡˇ ㄕㄨㄟˇ ㄅㄧㄢˇ). Because 「馬」 and 「柳」 are surnames, they keep the third tone, while 「友友」 and 「水扁」 follow the rule for two consecutive third tones, so their first character is read as second tone. The names are therefore spoken as (ㄇㄚˇ ㄧㄡˊ ㄧㄡˇ) and (ㄌㄧㄡˇ ㄕㄨㄟˊ ㄅㄧㄢˇ).

(2) Other three-character strings may also contain a word boundary, such as 「蔣總統」 or 「總統獎」. In 「總統獎」, all three characters are third tone, with the citation reading (ㄗㄨㄥˇ ㄊㄨㄥˇ ㄐㄧㄤˇ); since no surname is involved, the rule above still applies and the first two characters are read as second tone, giving (ㄗㄨㄥˊ ㄊㄨㄥˊ ㄐㄧㄤˇ). In 「蔣總統」, all three characters are likewise third tone, with the citation reading (ㄐㄧㄤˇ ㄗㄨㄥˇ ㄊㄨㄥˇ), but 「蔣」 is a surname and not part of the word, so it keeps the third tone; the remaining two characters 「總統」 follow the rule for two consecutive third tones, so the phrase is spoken as (ㄐㄧㄤˇ ㄗㄨㄥˊ ㄊㄨㄥˇ).

A further case is the reduplicated form of address. When a form of address in a Chinese phrase consists of two identical characters, the spoken form again differs from the citation reading, even if both characters are third tone: the first character keeps its original tone and the second is read in the neutral tone. Examples include 「爺爺」, 「奶奶」, 「爸爸」, 「媽媽」, 「哥哥」, 「姊姊」, 「弟弟」, 「妹妹」, and so on.

In summary, tone change in Chinese phrases is unavoidable, and the resulting difference between the spoken form and the citation reading clearly makes speech recognition harder and less accurate, which causes problems in practice. In a typical Chinese speech recognition system, a Chinese database is first entered using citation readings. A Chinese phrase (or character string) is later spoken into the system, in the expectation that the system will recognize it, retrieve the correct phrase or signal from the database, and output it for use, giving the convenience of voice control.
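The rules in the Background can be sketched in code as follows. This is only an illustrative sketch, not the patent's implementation: tones are written as integers, the code 5 for the neutral tone is an assumption of this example, and the function names are hypothetical. Word and surname boundaries are assumed to be supplied by the caller.

```python
SECOND, THIRD, NEUTRAL = 2, 3, 5  # 5 for the neutral tone is this sketch's own convention


def second_tone_positions(n):
    """0-based positions, within a run of n consecutive third-tone
    characters, that are spoken as second tone under rules (1) to (6)."""
    if n < 2:
        return []
    if n % 2 == 0:
        # Even run: pairs from the front; the first character of each pair changes.
        return list(range(0, n, 2))
    # Odd run: pairs from the front, then a final group of three whose
    # first two characters change.
    return list(range(0, n - 3, 2)) + [n - 3, n - 2]


def spoken_tones(tones):
    """Apply the third-tone rules to one word given as a list of tone numbers."""
    out = list(tones)
    i = 0
    while i < len(tones):
        if tones[i] != THIRD:
            i += 1
            continue
        j = i
        while j < len(tones) and tones[j] == THIRD:
            j += 1  # j ends one past the run of third tones starting at i
        for p in second_tone_positions(j - i):
            out[i + p] = SECOND
        i = j
    return out


def spoken_name(surname_tones, given_tones):
    """Personal names: the surname keeps its tones; only the given name changes."""
    return list(surname_tones) + spoken_tones(given_tones)


def spoken_reduplication(tone):
    """Reduplicated form of address: first character unchanged, second neutral."""
    return [tone, NEUTRAL]
```

For instance, `spoken_name([3], [3, 3])` models 「馬友友」 and yields `[3, 2, 3]`, while `spoken_tones([3, 3, 3])` models 「總統獎」 and yields `[2, 2, 3]`.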

In actual use, the spoken form of a given Chinese phrase (or character string) often differs from its citation reading because of tone change, so the phrase cannot be recognized or is recognized wrongly, which also disrupts subsequent operations. Further examples follow:

(1) A company telephone system with voice-directed transfer to extensions. Such a system includes a speech recognition system that receives the caller's speech data, such as a name, through a microphone. The company's speech recognition system has generally been loaded in advance with a Chinese database entered using citation readings, for example each person's name and extension number. In use, the system must recognize the caller's speech data in order to retrieve the correct phrase or signal from the database and output it. Suppose the caller says 「馬友友」: the spoken input is (ㄇㄚˇ ㄧㄡˊ ㄧㄡˇ), but the database stores 「馬友友」 with the citation reading (ㄇㄚˇ ㄧㄡˇ ㄧㄡˇ). The spoken input therefore cannot be readily recognized, which causes trouble downstream: because the correct name is not recognized, the call cannot be transferred quickly and automatically to the correct extension, and the voice transfer system becomes less useful.

(2) A hospital voice registration system. Such a system likewise includes a speech recognition system that receives the caller's speech data, such as a name, through a microphone, and its Chinese database has generally been entered in advance using citation readings, for example a person's name and the associated registration slip. In use, the system must recognize the caller's speech data in order to retrieve the correct phrase or signal from the database and output it. Suppose the caller gives the name 「馬友友」 together with an identification number (such as a registration slip number): the spoken input is (ㄇㄚˇ ㄧㄡˊ ㄧㄡˇ), but the database stores the citation reading (ㄇㄚˇ ㄧㄡˇ ㄧㄡˇ), so the input cannot be readily recognized and the correct phrase or signal cannot be output. The hospital then cannot complete voice registration because the name is not recognized. The long digit string of the registration slip number may itself undergo tone change.

(3) A hotel voice-dialing system for guest room extensions. Such a system likewise includes a speech recognition system that receives the caller's speech data, such as a name or number, through a microphone, and its Chinese database has generally been entered in advance using citation readings, for example a room number and the registered guest. In use, the system must recognize the caller's speech data in order to retrieve the correct phrase or signal from the database and output it. Suppose the caller gives the name 「馬友友」 together with an identification number (such as a room number): the spoken input is (ㄇㄚˇ ㄧㄡˊ ㄧㄡˇ), but the database stores the citation reading (ㄇㄚˇ ㄧㄡˇ ㄧㄡˇ), so the input cannot be readily recognized. Because the hotel system cannot recognize the guest's correct name in the caller's speech, it cannot transfer the call automatically to the room. Hotel telephone systems generally require a caller who wants to be put through to a room to give both the extension and the guest's correct name, and transfer the call only if the two match, in order to prevent unwanted solicitation.

Object of the Invention

One object of the present invention is to provide a method for recognizing tone changes in a Chinese speech recognition system. When consecutive third-tone characters, or identical reduplicated characters, are pronounced in succession in spoken Chinese and the tones change naturally, the method accounts for the change according to character-string relations and word boundaries and thereby recognizes the tone-changed phrase, increasing the system's recognition accuracy.

The technical means adopted to achieve this object, and their effects, are described below by way of preferred embodiments with reference to the drawings.

Detailed Description of the Invention

The present invention is a method for recognizing tone changes in a Chinese speech recognition system. Referring to the first figure, the Chinese database of the speech recognition system is built as follows. Chinese data is entered into a data buffer (2) using a keyboard or other input device (1). A phrase judgment and decomposition processing unit (3) processes and judges the Chinese data in the data buffer (2). If the Chinese data is judged to be a reduplicated form of address, reduplicated form-of-address data (4) is generated and stored in a storage medium (6) such as hard disk memory. If the Chinese data is judged to contain consecutive third tones, consecutive third-tone data (5) is generated and stored in the storage medium (6). A Chinese database is thus built in the storage medium (6), and the storage medium (6) is connected to a signal comparison processor (7) to form an operating system.
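The database-building flow of the first figure can be sketched as below. The patent does not say how the processing unit (3) detects a form of address, so the small lexicon `KINSHIP_REDUPLICATIONS`, the function names, and the dictionary layout are all assumptions of this sketch; entries are assumed to arrive already split at word boundaries.

```python
# Hypothetical lexicon of reduplicated forms of address, taken from the
# examples given in the Background.
KINSHIP_REDUPLICATIONS = {"爺爺", "奶奶", "爸爸", "媽媽", "哥哥", "姊姊", "弟弟", "妹妹"}


def classify(words):
    """Judge one buffered entry.

    words: list of (text, tones) pairs, one pair per word, already split
    at word boundaries. Returns the set of data kinds to generate.
    """
    kinds = set()
    for text, tones in words:
        if text in KINSHIP_REDUPLICATIONS:
            kinds.add("reduplication")          # corresponds to data (4)
        if any(a == 3 and b == 3 for a, b in zip(tones, tones[1:])):
            kinds.add("consecutive_third")      # corresponds to data (5)
    return kinds


def build_storage(entries):
    """Store each entry under every kind of data it generates (medium (6))."""
    storage = {"reduplication": [], "consecutive_third": []}
    for words in entries:
        for kind in classify(words):
            storage[kind].append(words)
    return storage
```

Here 「馬友友」, entered as `[("馬", [3]), ("友友", [3, 3])]`, is filed under consecutive third-tone data only, because the surname is a separate word and 「友友」 is not in the form-of-address lexicon.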

In use, Chinese speech data is input through a microphone or other speech input device (8). The speech data first undergoes signal encoding (9) and is then recognized by the signal comparison processor (7), and the comparison result is output (10) so that downstream systems receive the correct signal and can carry out subsequent operations.

The second figure is a schematic diagram of how tones change naturally in spoken Chinese when consecutive third-tone characters or identical reduplicated characters are pronounced in succession. After comparison data is entered, it first passes through a decomposition procedure according to word-boundary rules (11), which divides the comparison data among a general comparison procedure (12), a consecutive third-tone comparison procedure (13), and a reduplicated form-of-address comparison procedure (14).

In the general comparison procedure (12), the comparison data is built directly into the database (15).

In the consecutive third-tone comparison procedure (13), the comparison data enters a procedure that classifies it by the number of consecutive third-tone characters (16): 2 consecutive third tones (17), 3 consecutive third tones (18), 4 consecutive third tones (19), 5 consecutive third tones (20), 6 consecutive third tones (21), 7 consecutive third tones (22), and so on. Different comparison data is built for each count:

for 2 consecutive third tones (17), second-tone comparison data (23) is built for the 1st character;
for 3 consecutive third tones (18), second-tone comparison data (24) is built for the 1st and 2nd characters;
for 4 consecutive third tones (19), second-tone comparison data (25) is built for the 1st and 3rd characters;
for 5 consecutive third tones (20), second-tone comparison data (26) is built for the 1st, 3rd, and 4th characters;
for 6 consecutive third tones (21), second-tone comparison data (27) is built for the 1st, 3rd, and 5th characters;
for 7 consecutive third tones (22), second-tone comparison data (28) is built for the 1st, 3rd, 5th, and 6th characters.

All of the tone-changed comparison data (23), (24), (25), (26), (27), (28) is built into the database (15).

In the reduplicated form-of-address comparison procedure (14), neutral-tone comparison data (29) is built for the 2nd character, and this tone-changed comparison data (29) is built into the database (15).

The database (15) then holds complete comparison data for the general comparison procedure (12), the consecutive third-tone comparison procedure (13), and the reduplicated form-of-address comparison procedure (14), covering in particular the tone changes most likely to arise between the spoken form and the citation reading. When a speech signal (30) is input for recognition, speech comparison (31) is performed against the complete comparison data stored in the database (15), and the comparison result is output (32) so that downstream systems receive the correct signal and can carry out subsequent operations.
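The comparison data of the second figure can be sketched as a lookup table keyed by run length. This is a hedged illustration only: the names `SECOND_TONE_POSITIONS`, `third_tone_data`, `reduplication_data`, and `build_database` are hypothetical, the neutral tone is coded as 5 by assumption, and each entry is assumed to be a single run already split at word boundaries by procedure (11). Keeping the citation reading next to the tone-changed reading is this sketch's interpretation of the description.

```python
SECOND, THIRD, NEUTRAL = 2, 3, 5  # 5 for the neutral tone is this sketch's own convention

# 1-based positions read as second tone for each run length, matching
# comparison data (23) through (28).
SECOND_TONE_POSITIONS = {
    2: (1,),
    3: (1, 2),
    4: (1, 3),
    5: (1, 3, 4),
    6: (1, 3, 5),
    7: (1, 3, 5, 6),
}


def third_tone_data(run_length):
    """Tone-changed reading for a run of consecutive third-tone characters."""
    changed = SECOND_TONE_POSITIONS[run_length]
    return [SECOND if pos in changed else THIRD
            for pos in range(1, run_length + 1)]


def reduplication_data(first_tone):
    """Tone-changed reading (29): the 2nd character takes the neutral tone."""
    return [first_tone, NEUTRAL]


def build_database(entries):
    """entries: iterable of (text, procedure, tones), where procedure is
    'general', 'third', or 'reduplication'. Returns text -> list of readings."""
    database = {}
    for text, procedure, tones in entries:
        readings = [list(tones)]  # citation reading, as in procedure (12)
        if procedure == "third":
            readings.append(third_tone_data(len(tones)))
        elif procedure == "reduplication":
            readings.append(reduplication_data(tones[0]))
        database[text] = readings
    return database
```

With this layout, an entry such as `("友友", "third", [3, 3])` is stored with both `[3, 3]` and `[2, 3]`, so either reading can be matched later.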
To help those skilled in the art better understand the features of the invention, embodiments of a telephone system with voice-directed transfer are given below to illustrate the invention in actual use.

Referring to Figure 3A, the user first calls in (40). The system answers automatically and announces a greeting (41), for example: "Hello, this is ○○ Company. Please tell me who you are looking for." The user then says the name of the person wanted (42), for example 「陳總統」 (ㄔㄣˊ ㄗㄨㄥˊ ㄊㄨㄥˇ). The system proceeds to general comparison (43), which compares the spoken form of 「陳總統」 (ㄔㄣˊ ㄗㄨㄥˊ ㄊㄨㄥˇ) with the citation reading (ㄔㄣˊ ㄗㄨㄥˇ ㄊㄨㄥˇ). If this comparison fails, because the spoken form and the citation reading differ, the system proceeds to consecutive third-tone comparison (44), which compares the spoken form (ㄔㄣˊ ㄗㄨㄥˊ ㄊㄨㄥˇ) with the converted reading (ㄔㄣˊ ㄗㄨㄥˊ ㄊㄨㄥˇ). This comparison succeeds, with a higher accuracy than general comparison (43). The system then reads out the comparison result and acts on it (45), for example: "You are looking for 陳總統 (ㄔㄣˊ ㄗㄨㄥˊ ㄊㄨㄥˇ). I will transfer you right away." Once the transfer succeeds, speech recognition ends (46).

Referring to Figure 3B, the user first calls in (47). The system answers automatically and announces a greeting (48), for example: "This is the 寶貝 household. Please tell me who you are looking for." The user then says the name of the person wanted (49), for example 「陳小妹妹」 (ㄔㄣˊ ㄒㄧㄠˇ ㄇㄟˋ ㄇㄟ˙). The system proceeds to general comparison (50), which compares the spoken form of 「陳小妹妹」 (ㄔㄣˊ ㄒㄧㄠˇ ㄇㄟˋ ㄇㄟ˙) with the citation reading (ㄔㄣˊ ㄒㄧㄠˇ ㄇㄟˋ ㄇㄟˋ). If this comparison fails, because the spoken form and the citation reading differ, the system proceeds to reduplicated form-of-address comparison (51), which compares the spoken form (ㄔㄣˊ ㄒㄧㄠˇ ㄇㄟˋ ㄇㄟ˙) with the converted reading (ㄔㄣˊ ㄒㄧㄠˇ ㄇㄟˋ ㄇㄟ˙). This comparison succeeds, with a higher accuracy than general comparison (50). The system then reads out the comparison result and acts on it (52), for example: "Please hold on. I will ask 陳小妹妹 (ㄔㄣˊ ㄒㄧㄠˇ ㄇㄟˋ ㄇㄟ˙) to come to the phone." Once the transfer succeeds, speech recognition ends (53).

As the above shows, for any Chinese speech input, the speech recognition system of the invention can perform general speech comparison, as in (43) and (50), and, when that comparison fails, can automatically use character-string relations and word boundaries to recognize the tone-changed phrase against the converted reading, as in consecutive third-tone comparison (44) or reduplicated form-of-address comparison (51). This effectively reduces the recognition difficulty caused by consecutive third-tone characters or identical reduplicated characters in speech and increases the system's recognition accuracy.

In conclusion, the method for recognizing tone changes in a Chinese speech recognition system of the present invention achieves the stated effects by the method disclosed above. Before this application the invention had neither appeared in any publication nor been used publicly, and it satisfies the requirements of novelty and inventive step for an invention patent.
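The two-stage flow of Figures 3A and 3B (general comparison first, then comparison against the converted reading) can be sketched as follows. Real speech comparison scores acoustic signals; exact matching of syllable strings stands in for it here purely for illustration. The pinyin-with-tone-digit notation, the `DATABASE` layout, and the function name `recognize` are assumptions of this sketch, not part of the patent.

```python
# Toy database: each entry keeps its citation reading and the converted
# (tone-changed) reading. Tone digit 5 marks the neutral tone (assumption).
DATABASE = {
    "陳總統": {
        "citation": ("chen2", "zong3", "tong3"),
        "converted": ("chen2", "zong2", "tong3"),
        "kind": "consecutive_third",
    },
    "陳小妹妹": {
        "citation": ("chen2", "xiao3", "mei4", "mei4"),
        "converted": ("chen2", "xiao3", "mei4", "mei5"),
        "kind": "reduplication",
    },
}


def recognize(spoken, database):
    """Return (entry name, stage that matched), or (None, 'failed')."""
    # Stage 1: general comparison against the citation readings, as in (43)/(50).
    for name, entry in database.items():
        if entry["citation"] == spoken:
            return name, "general"
    # Stage 2: on failure, compare against the converted readings, as in (44)/(51).
    for name, entry in database.items():
        if entry["converted"] == spoken:
            return name, entry["kind"]
    return None, "failed"
```

Calling `recognize(("chen2", "zong2", "tong3"), DATABASE)` fails the general stage and then matches 「陳總統」 in the consecutive third-tone stage, mirroring the sequence (43) to (44) in Figure 3A.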

The drawings and descriptions disclosed above are merely embodiments of the present invention and do not limit it. Other equivalent changes or modifications made by persons skilled in the art within the scope of the features of the invention shall fall within the scope of the claims below.

Brief Description of the Drawings

Figure 1: a block flow diagram of the present invention.
Figure 2: a schematic diagram of how tones change naturally in spoken Chinese when consecutive third-tone characters or identical reduplicated characters are pronounced in succession.
Figure 3A: a block diagram of actual operation in one embodiment of the present invention.
Figure 3B: a block diagram of actual operation in another embodiment of the present invention.

Description of Reference Numerals

(1) input device
(2) data buffer
(3) phrase judgment and decomposition processing unit
(4) generation of reduplicated form-of-address data
(5) generation of consecutive third-tone data
(6) storage medium
(7) signal comparison processor
(8) speech input device
(9) signal encoding
(10) comparison result output
(11) decomposition procedure according to word-boundary rules
(12) general comparison procedure
(13) consecutive third-tone comparison procedure
(14) reduplicated form-of-address comparison procedure
(15) database
(16) classification procedure by number of consecutive third-tone characters
(17) 2 consecutive third tones
(18) 3 consecutive third tones
(19) 4 consecutive third tones
(20) 5 consecutive third tones
(21) 6 consecutive third tones
(22) 7 consecutive third tones
(23), (24), (25), (26), (27), (28) comparison data
(29) neutral-tone comparison data
(30) speech signal
(31) speech comparison
(32) comparison result output
(40), (47) user calls in
(41), (48) automatic answer and greeting
(42), (49) user says the name of the person wanted
(43), (50) general comparison
(44) consecutive third-tone comparison
(45), (52) comparison result read out and acted on
(46), (53) end
(51) reduplicated form-of-address comparison


Claims (1)

1. A method for recognizing voice tone changes in a Chinese speech recognition system, comprising the following steps:
using a keyboard or other input device to enter Chinese data into a data buffer;
using a sentence judgment and decomposition processing unit to process the Chinese data in the data buffer and, according to word boundary rules, to identify consecutive third-tone ("三" tone) characters or reduplicated titles;
for reduplicated titles in the Chinese data, generating reduplicated-title data and storing it in a storage medium;
for consecutive third-tone characters in the Chinese data, generating consecutive third-tone data and storing it in the storage medium;
connecting the storage medium to a signal comparison processor, and supplying the processor with Chinese speech data from voice input that has undergone signal coding;
using the signal comparison processor to recognize the voice-input Chinese speech data according to the data in the storage medium; and
outputting the comparison result.

2. The method of claim 1, wherein the sentence judgment and decomposition processing unit decomposes the input Chinese data according to word boundary rules and divides it into general comparison, consecutive third-tone comparison, and reduplicated-title comparison.

3. The method of claim 1 or 2, wherein the sentence judgment and decomposition processing unit can build the general comparison data in the storage medium to form a database.

4. The method of claim 1 or 2, wherein the sentence judgment and decomposition processing unit can classify consecutive third tones by character count into runs of 2, 3, 4, 5, 6, and 7 consecutive third-tone characters, and wherein:
for 2 consecutive third-tone characters, second-tone ("二" tone) comparison data is established for the 1st character;
for 3 consecutive third-tone characters, second-tone comparison data is established for the 1st and 2nd characters;
for 4 consecutive third-tone characters, second-tone comparison data is established for the 1st and 3rd characters;
for 5 consecutive third-tone characters, second-tone comparison data is established for the 1st, 3rd, and 4th characters;
for 6 and for 7 consecutive third-tone characters, second-tone comparison data is likewise established for specified characters (the positions are not fully legible in the source text; only the fragment "3, 5" survives);
and the tone-changed comparison data is built in the storage medium to form a database.

5. The method of claim 1 or 2, wherein the sentence judgment and decomposition processing unit can establish soft-tone (neutral-tone) comparison data for the 2nd character of each reduplicated title in the input Chinese data, and the tone-changed comparison data is built in the storage medium to form a database.

6. The method of claim 1 [remainder of the claim reference illegible in the source text], wherein the storage medium includes hard disk memory.

7. The method of claim 1 or 2, wherein, when the signal comparison processor recognizes the voice-input Chinese speech data according to the data in the storage medium, it can first perform a general speech comparison and, if that comparison fails, then perform recognition using the tone-changed pronunciation derived from the character-string relationship and word boundaries.
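The tone-change rules recited in the claims can be illustrated with a small symbolic sketch: within a word, a run of consecutive third-tone characters is read with the second tone at set positions, the second character of a reduplicated title is read in the neutral tone, and a general comparison is tried before the tone-changed one. This is a sketch under stated assumptions, not the patented implementation. The real method compares coded speech signals, whereas the code below compares (syllable, tone) pairs. All names (`SANDHI_POSITIONS`, `third_tone_sandhi`, `spoken_form`, `recognize`), the use of toneless pinyin syllables, and the code `5` for the neutral tone are hypothetical. Runs of 6 or 7 third tones are left unchanged here because their positions are not fully legible in the claim text.

```python
NEUTRAL = 5  # assumed code for the neutral (soft) tone; tones 1-4 are the usual ones

# 1-based positions inside a run of third tones that are read as second tone,
# keyed by run length. Only the legible run lengths (2 to 5) are listed.
SANDHI_POSITIONS = {2: (1,), 3: (1, 2), 4: (1, 3), 5: (1, 3, 4)}


def third_tone_sandhi(tones):
    """Return a copy of `tones` with each run of third tones converted."""
    out = list(tones)
    i = 0
    while i < len(tones):
        if tones[i] != 3:
            i += 1
            continue
        j = i
        while j < len(tones) and tones[j] == 3:  # find the end of the run
            j += 1
        for pos in SANDHI_POSITIONS.get(j - i, ()):  # unknown lengths: no change
            out[i + pos - 1] = 2
        i = j
    return out


def spoken_form(entry, is_title=False):
    """Convert a word's citation form into its assumed spoken form.

    `entry` is a list of (syllable, tone) pairs for one word. A reduplicated
    title (two identical characters) gets the neutral tone on its 2nd character.
    """
    syllables = [s for s, _ in entry]
    tones = [t for _, t in entry]
    if is_title:
        tones[1] = NEUTRAL
    else:
        tones = third_tone_sandhi(tones)
    return list(zip(syllables, tones))


def recognize(heard, lexicon, titles=frozenset()):
    """General comparison first; on failure, compare against tone-changed forms."""
    heard = list(heard)
    for word, entry in lexicon.items():
        if heard == list(entry):
            return word, "general"
    for word, entry in lexicon.items():
        if heard == spoken_form(entry, word in titles):
            return word, "tone-changed"
    return None, "no match"


# Illustrative lexicon with citation (dictionary) tones.
LEXICON = {
    "總統獎": [("zong", 3), ("tong", 3), ("jiang", 3)],
    "爸爸": [("ba", 4), ("ba", 4)],
}

if __name__ == "__main__":
    print(third_tone_sandhi([3, 3, 3, 3]))
    print(recognize([("zong", 2), ("tong", 2), ("jiang", 3)], LEXICON))
    print(recognize([("ba", 4), ("ba", 5)], LEXICON, titles={"爸爸"}))
```

Precomputing the tone-changed forms and storing them next to the citation forms, as the claims describe for the database, would avoid recomputing `spoken_form` on every lookup; the sketch converts on the fly only to stay short.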

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW91123526A TW569182B (en) 2002-10-11 2002-10-11 Voice tone changing recognition method in Chinese voice recognition system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW91123526A TW569182B (en) 2002-10-11 2002-10-11 Voice tone changing recognition method in Chinese voice recognition system

Publications (1)

Publication Number Publication Date
TW569182B (en) 2004-01-01

Family

ID=32590488

Family Applications (1)

Application Number Title Priority Date Filing Date
TW91123526A TW569182B (en) 2002-10-11 2002-10-11 Voice tone changing recognition method in Chinese voice recognition system

Country Status (1)

Country Link
TW (1) TW569182B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI684148B (en) * 2014-02-26 2020-02-01 華為技術有限公司 Grouping processing method and device of contact person

Similar Documents

Publication Publication Date Title
CA2043667C (en) Written language parser system
TW462037B (en) Context dependent phoneme networks for encoding speech information
US11776541B2 (en) Communicating announcements
US20090144048A1 (en) Method and device for instant translation
KR102462219B1 (en) Method of Automatically Generating Meeting Minutes Using Speaker Diarization Technology
US20210350784A1 (en) Correct pronunciation of names in text-to-speech synthesis
TWI270052B (en) System for selecting audio content by using speech recognition and method therefor
US12073849B2 (en) Systems and methods for filtering unwanted sounds from a conference call
JP2018170743A (en) Conference support system, conference support method, program of conference support device, and program of terminal
US20220076686A1 (en) Systems and methods for filtering unwanted sounds from a conference call using voice synthesis
JPWO2017200079A1 (en) Dialogue method, dialogue system, dialogue apparatus, and program
TW200304638A (en) Network-accessible speaker-dependent voice models of multiple persons
US11024303B1 (en) Communicating announcements
JP2003216564A (en) Communication supporting method, communication server using therefor and communication supporting system
Meliones et al. SeeSpeech: an android application for the hearing impaired
JP2013037512A (en) Social networking service system, social networking service server, and social networking service program
CN105427856A (en) Invitation data processing method and system for intelligent robot
TW569182B (en) Voice tone changing recognition method in Chinese voice recognition system
JP2005283972A (en) Speech recognition method, and information presentation method and information presentation device using the speech recognition method
JP2002101203A (en) Speech processing system, speech processing method and storage medium storing the method
Rihi et al. The Analysis Of Grammatical Cohesion Reference In Jack Ma Speech Text “Believe In Your Dreams”
JPH04167749A (en) Audio response equipment
JP7545934B2 (en) Program, device and method for associating person's speech with user's comment
Sangeetha et al. Communication Assistance for Blind and Deaf using Natural Language Processing
KR102476497B1 (en) Apparatus and method for outputting image corresponding to language

Legal Events

Date Code Title Description
GD4A Issue of patent certificate for granted invention patent
MM4A Annulment or lapse of patent due to non-payment of fees