TWI783084B - Method and system of weight-based usage model for dynamic speech recognition channel selection - Google Patents
Method and system of weight-based usage model for dynamic speech recognition channel selection Download PDFInfo
- Publication number
- TWI783084B TWI783084B TW107142335A TW107142335A TWI783084B TW I783084 B TWI783084 B TW I783084B TW 107142335 A TW107142335 A TW 107142335A TW 107142335 A TW107142335 A TW 107142335A TW I783084 B TWI783084 B TW I783084B
- Authority
- TW
- Taiwan
- Prior art keywords
- channel
- voice recognition
- weight
- speech recognition
- usage
- Prior art date
Links
Images
Landscapes
- Meter Arrangements (AREA)
- Telephonic Communication Services (AREA)
Abstract
Description
本發明係有關於一種語音辨識方法及系統,且特別是一種權重用量模型應用於動態選擇語音辨識通道的方法與系統。The present invention relates to a speech recognition method and system, and in particular to a method and system for dynamically selecting a speech recognition channel using a weight usage model.
在人工智慧的時代,語音辨識是生活中不可或缺的一項人工智慧辨識技術。語音辨識有其特有的領域性,無法提供一體適用的解決方案,再加上其訓練成本偏高,使得企業或個人用途之使用者無法負荷其費用。In the era of artificial intelligence, speech recognition is an indispensable artificial intelligence recognition technology in life. Speech recognition has its own unique domain, and it cannot provide an all-in-one solution. In addition, its training cost is too high, making it unaffordable for corporate or personal users.
近期相關專利:使用多重語法網絡之語音辨識系統(中華民國專利編號:I394926,申請日:2000/06/21),此篇提出一種多語系語音辨識方法,步驟為:接收聲音音框;複數個不同語系的語音模型;根據聲音音框進入語音模型所產生對應於複數個語音狀態的複數個語音狀態分數;由該些語音狀態分數中選擇複數個最高的狀態分數;以及根據該些狀態分數產生修正值。多語系語音辨識方法以動態Dynamic)和個別(Frame by Frame)的方式,消除多語系偏移現象。Recent related patents: Speech recognition system using multi-grammar network (Republic of China patent number: I394926, application date: 2000/06/21), this article proposes a multilingual speech recognition method, the steps are: receive sound frame; plural Speech models of different language families; entering the speech model according to the sound frame into a plurality of speech state scores corresponding to a plurality of speech states; selecting a plurality of the highest state scores from these speech state scores; and generating according to the state scores correction value. The multilingual speech recognition method eliminates the phenomenon of multilingual deviation in a dynamic and individual (Frame by Frame) manner.
近期相關專利:多語系語音辨識裝置及其方法(中華民國專利I579829, 2017/04/21),此專利提出一種多語系語音辨識方法,其可包含下列步驟:接收聲音音框;複數個不同語系的語音模型;根據聲音音框進入語音模型所產生對應於複數個語音狀態的複數個語音狀態分數;由該些語音狀態分數中選擇複數個最高的狀態分數;以及根據該些狀態分數產生修正值。此多語系語音辨識方法以動態(Dynamic)和個別(Frame by Frame)的方式,消除多語系偏移現象。Recent related patents: multilingual speech recognition device and method (Republic of China patent I579829, 2017/04/21), this patent proposes a multilingual speech recognition method, which may include the following steps: receiving sound frame; multiple different language families The voice model of the voice; according to the sound frame entering the voice model, a plurality of voice state scores corresponding to the multiple voice states are generated; selecting a plurality of the highest state scores from the voice state scores; and generating correction values according to the state scores . This multilingual speech recognition method eliminates the phenomenon of multilingual deviation in a dynamic (Dynamic) and individual (Frame by Frame) manner.
以上與本發明相近之專利仍有以下缺點:(1)未考慮使用者語音辨識需求提供動態切換;以及(2)未考慮動態更新語音辨識通道使用量,用以提升下一次計算使用者選用語音辨識通道的準確度。The above patents that are similar to the present invention still have the following disadvantages: (1) provide dynamic switching without considering the needs of the user's voice recognition; and (2) do not consider dynamically updating the usage of the voice recognition channel to improve the voice selected by the user for the next calculation Identify the accuracy of the channel.
近期相關美國專利編號US9569174 B2,發明名稱為「Methods and systems for managing speech recognition in a multi-speech system environment 」(申請日為:2017/02/17),此篇專利提出管理多語音系統環境的語音管理方法與系統,透過使用者的語音、手勢、注視方式選擇語音辨識系統,此篇專利提出記錄使用者動作、語音、手勢模組,處理器依據使用者動作、語音、手勢模組紀錄判斷選用的語音辨識系統。The recent related U.S. patent number US9569174 B2, the invention name is " Methods and systems for managing speech recognition in a multi-speech system environment " (application date: 2017/02/17), this patent proposes to manage voice in a multi-speech system environment Management method and system, select the voice recognition system through the user's voice, gesture, and gaze. This patent proposes to record the user's actions, voice, and gesture modules, and the processor judges the selection based on the records of the user's actions, voice, and gesture modules. voice recognition system.
以上與本發明相近之專利仍有以下缺點:(1)未考慮語音辨識通道權重及使用者語音辨識通道使用量;以及(2)未考慮語音辨識通道容量,無法依據通道容量動態配置語音辨識通道資源。The above patents similar to the present invention still have the following disadvantages: (1) The voice recognition channel weight and the user's voice recognition channel usage are not considered; and (2) The voice recognition channel capacity is not considered, and the voice recognition channel cannot be dynamically configured according to the channel capacity resource.
由此可見,上述習用方式仍有諸多缺失,實非一良善之設計,而亟待加以改良。本案發明人鑑於上述習用方式所衍生的各項缺點,乃亟思加以改良創新,並經多年苦心孤詣潛心研究後,終於成功研發完成本件利用一種權重用量模型應用於動態選擇語音辨識通道的方法與系統。It can be seen that the above-mentioned customary method still has many deficiencies, which is not a good design and needs to be improved urgently. In view of the shortcomings derived from the above-mentioned conventional methods, the inventor of this case is eager to improve and innovate. After years of painstaking research, he finally successfully developed and completed the method and system for dynamically selecting speech recognition channels using a weight usage model. .
本發明提供一種可達成上述發明目的之一體適用的語音辨識通道選擇解決方案,係指一種權重用量模型應用於動態選擇語音辨識通道的方法與系統。前述系統包括:(1) 語音辨識通道設定模組,用以進行通道與通道權重設定與取用;(2) 語音辨識通道模組,用以接收使用者語音資料及通道策略資料,並進行通道切換與回傳辨識結果資料;(3)語音辨識通道權重運算模組,用以計算語音辨識通道的權重值;(4)語音辨識通道容量運算模組,用以動態計算語音辨識通道容量;(5)語音辨識通道決策模組,用以決策與排序使用者選用語音辨識通道結果;(6)語音辨識通道使用量運算模組,用以依據使用者通道使用量動態更新通道權重,由上述模組可達成於滿足使用者服務考量,提升語音辨識通道動態資源配置與配置精確度。The present invention provides an all-in-one voice recognition channel selection solution that can achieve the above-mentioned purpose of the invention, which refers to a method and system for dynamically selecting voice recognition channels using a weight usage model. The aforementioned system includes: (1) voice recognition channel setting module, used for channel and channel weight setting and access; (2) voice recognition channel module, used to receive user voice data and channel strategy data, and perform channel Switching and returning the recognition result data; (3) voice recognition channel weight calculation module, used to calculate the weight value of the voice recognition channel; (4) voice recognition channel capacity calculation module, used to dynamically calculate the voice recognition channel capacity; ( 5) Speech recognition channel decision-making module, which is used to decide and sort the results of the voice recognition channel selected by the user; (6) Speech recognition channel usage calculation module, which is used to dynamically update the channel weight according to the user's channel usage. The group can meet the user's service considerations and improve the dynamic resource allocation and allocation accuracy of the speech recognition channel.
為讓本發明的上述特徵和優點能更明顯易懂,下文特舉實施例,並配合所附圖式作詳細說明如下。In order to make the above-mentioned features and advantages of the present invention more comprehensible, the following specific embodiments are described in detail together with the accompanying drawings.
在人工智慧的時代,語音辨識是生活中不可或缺的一項人工智慧辨識技術,語音辨識有其特有的領域性,無法提供一體適用的解決方案,再加上其訓練成本偏高,使得企業或個人用途之使用者無法負荷其費用,本方法依據使用者語音辨識通道設定、語音辨識通道容量及使用者語音辨識通道用量提供動態切換語音辨識通道。In the era of artificial intelligence, speech recognition is an indispensable artificial intelligence recognition technology in life. Speech recognition has its own unique domain and cannot provide an all-in-one solution. In addition, its high training cost makes enterprises Or if the user for personal use cannot afford the cost, this method provides dynamic switching of the voice recognition channel based on the user's voice recognition channel setting, voice recognition channel capacity, and user voice recognition channel usage.
請參閱圖1所示,為本發明一種權重用量模型應用於動態選擇語音辨識通道的方法與系統之架構圖,透過本發明的方法可有效提升語音辨識通道資源調配,本系統係包括:語音辨識通道設定模組1,係為設定語音辨識通道與通道權重並取用通道設定資料;語音辨識通道資料庫2,係為儲存語音辨識通道資料;語音辨識通道模組3,係用以接收使用者語音資料及通道策略資料,並進行通道切換且回傳辨識結果資料;語音辨識通道權重運算模組4,係為計算通道權重並傳送與計算通道權重值;語音辨識通道容量運算模組5,係用以計算通道容量與傳遞通道資料;語音辨識通道決策模組6,係為計算通道決策結果,並進行通道決策排序;語音辨識通道使用量運算模組7,係為計算使用者的通道使用量並更新通道權重值。Please refer to Fig. 1, which is a structure diagram of a method and a system for dynamically selecting speech recognition channels based on a weight usage model of the present invention. The method of the present invention can effectively improve resource allocation of speech recognition channels. The system includes: speech recognition Channel setting module 1 is for setting voice recognition channels and channel weights and accessing channel setting data; voice
請參閱圖2所示,為本發明應用一種權重用量模型應用於動態選擇語音辨識通道的方法與系統建立權重用量模型的元件圖。Please refer to FIG. 2 , which is a component diagram for establishing a weight usage model for the method and system for dynamically selecting speech recognition channels by applying a weight usage model according to the present invention.
首先,需透過語音辨識通道設定模組1的通道設定元件11設定使用者(User: Ui
,其中i=1,2,…,M表示使用者代碼)的語音辨識通道策略(Method: Mj
,其中j=1,2,…,N表示語音辨識通道代碼),表一為使用者選用此語音辨識通道策略,0表示非選用此策略,1表示選用此策略,通道設定元件11會將語音辨識通道設定資料寫入語音辨識通道資料庫2,資料寫入內容如下表1:
接著,透過語音辨識通道設定模組1的通道權重設定元件12設定使用者(User : Ui
,其中i=1,2,…,M表示使用者代碼)的語音辨識通道策略權重(Weight: WMj
,其中j=1,2,…,N語音辨識通道代碼),通道權重設定元件12會將使用者的語音辨識通道權重(WM1,UM
表示第M個User設定語音辨識通道策略1的權重值)的設定資料寫入語音辨識通道資料庫2,資料寫入內容如下表2:
當語音資料傳送至系統時,由語音辨識通道模組3的通道策略元件31接收語音資料,並將傳送語音資料的使用者資訊取出,經由語音辨識通道設定模組1的通道設定取用元件13至語音辨識通道資料庫取出語音辨識通道設定資料及語音辨識通道權重設定資料,將取出的資料傳送至語音辨識通道權重運算模組4的權重資料傳送元件41,權重計算元件42負責計算使用者語音辨識通道權重,語音辨識通道權重計算公式如下:(1)(2)When the voice data is sent to the system, the voice data is received by the
式(1)為使用者1使用語音辨識通道策略1的計算公式,W’M1,U1 表示使用者1語音辨識通道策略1的權重計算結果,式(2)為使用者1使用語音辨識通道策略N的計算公式,W’MN,U1 表示使用者1語音辨識通道策略N的權重計算結果。Equation (1) is the calculation formula for user 1 using speech recognition channel strategy 1, W' M1, U1 represents the weight calculation result of user 1 speech recognition channel strategy 1, and equation (2) is the speech recognition channel strategy for user 1 The calculation formula of N, W' MN, U1 represents the weight calculation result of user 1's speech recognition channel strategy N.
權重資料傳送元件41將各語音辨識通道權重計算結果傳送至語音辨識容量運算模組5的通道資料傳送元件51,容量計算元件52負責即時運算語音辨識通道容量,計算公式如下:(3) …(4)The weight
式(3)為語音辨識通道1的通道容量,其中CCH1 為語音辨識通道1可用通道總數量,CCH1,R 為語音辨識通道1剩餘的可用通道數量,式(4)為語音辨識通道N的通道容量,其中CCHN 為語音辨識通道N可用通道總數量,CCHN,R 為語音辨識通道N剩餘的可用通道數量。Equation (3) is the channel capacity of voice recognition channel 1, where C CH1 is the total number of channels available for voice recognition channel 1, C CH1, R is the remaining number of channels available for voice recognition channel 1, and formula (4) is voice recognition channel N The channel capacity of , where C CHN is the total number of channels available for speech recognition channel N, and C CHN,R is the remaining number of available channels for speech recognition channel N.
通道資料傳送元件51至語音辨識通道使用量運算模組7的通道使用量更新元件71取得即時通道使用量數據,通道使用量更新元件71至語音辨識通道資料庫2取得通道使用量數據,如下表3:
通道使用量更新元件71將表3數據傳送至通道使用量權重計算元件72,計算通道使用量權重,計算公式如下:(5) …(6)The channel
式(5)為使用者1使用語音辨識通道策略1的使用量權重,其中PM1,U1 為使用者1使用語音辨識通道策略1的機率,式(6)為使用者1使用語音辨識通道策略N的使用量權重,其中PMN,U1 為使用者1使用語音辨識通道策略N的機率。Formula (5) is the usage weight of user 1 using speech recognition channel strategy 1, where P M1, U1 is the probability of user 1 using speech recognition channel strategy 1, and formula (6) is user 1 using speech recognition channel strategy The usage weight of N, where P MN,U1 is the probability that user 1 uses speech recognition channel strategy N.
通道資料傳送元件51傳送語音辨識通道權重計算結果、即時運算語音辨識通道容量結果及通道使用量權重計算結果傳送至語音辨識通道決策模組6的決策排序元件61,決策排序元件61先將資料傳送至決策計算元件62計算決策結果,計算公式如下:(7)(8)The channel
在式(7)中為使用者1的語音辨識通道1的決策結果,為使用者1的語音辨識通道N的決策結果,決策排序元件61將排序、…、的結果,輸出結果以陣列方式排序{, … ,},數值最高的結果排序於第一位,以D’ = {, … ,}表示決策結果數值陣列,其中CM表示決策結果陣列中元素個數,排序最後一個以1表示,排序最高以CM表示,即D’={CM,CM-1,…,1}。決策的結果數值若為0則不排列於結果陣列中;決策排序元件61將結果陣列傳送至語音辨識通道模組3的通道切換元件32,通道切換元件依據結果陣列進行切換,將辨識結果資料輸出。In formula (7) is the decision result of user 1's speech recognition channel 1, For the decision result of user 1's voice recognition channel N, the
決策排序元件61將決策結果數值陣列{, … ,}傳送至語音辨識通道使用量運算模組7的通道使用量更新元件71,通道使用量更新元件71至語音辨識通道資料庫2取得通道使用量記錄如表3,再將記錄傳送至通道使用量權重計算元件72進行使用量更新,語音辨識通道使用量更新的公式如下:(9) …(10)The
式(9)為P’M1,U1
使用者1的語音辨識通道1之使用量更新數值,式(10)P’MN,U1
為使用者1的語音辨識通道N之使用量更新數值,表3為經過使用量計算元件72更新數據如表4,通道使用量更新元件71將表五結果儲存至語音辨識通道資料庫2。
請參閱圖3所示,為本發明之一種權重用量模型應用於動態選擇語音辨識通道的方法與系統之系統流程圖。首先步驟101由通道策略元件31通知通道設定取用元件13至語音辨識通道資料庫2取出語音辨識通道設定資料。步驟102判斷是否已存在語音辨識通道設定資料。若否,則須先執行步驟103,由通道設定元件11設定語音辨識通道資料,接著步驟104通道權重設定元件12設定語音辨識通道權重資料,步驟105將語音辨識通道資料及語音辨識通道權重資料寫入語音辨識通道資料庫2;若是,通道策略元件31將語音辨識通道資料傳送至權重資料傳送元件41,再傳送至權重計算元件42進行步驟106計算語音辨識通道權重。計算的結果需透過權重資料傳送元件41傳送至通道資料傳送元件51,再傳送至容量計算元件52執行步驟107計算語音辨識通道容量。通道資料傳送元件51將語音辨識通道使用量資料傳送至通道使用量更新元件71,再傳送至通道使用量權重計算元件72。執行步驟108計算語音辨識通道使用量,最後由通道資料傳送元件51將語音辨識通道權重計算結果(式(1)、式(2)的計算結果)、語音辨識通道容量計算結果(式(3)、式(4)的計算結果)、語音辨識通道使用量計算結果(式(5)、式(6)的計算結果)透過通道資料傳送元件51傳送至決策排序元件61,再傳送至62決策計算元件執行步驟109計算語音辨識通道決策。決策計算結果(式(7)、式(8)的計算結果)將由決策排序元件61進行排序並將排序結果傳送決策結果數值至通道使用量更新元件71進行步驟110更新語音辨識通道使用量(式(9)、式(10)的計算結果)。決策排序元件61將排序結果傳送至通道切換元件32執行步驟111執行語音辨識通道切換,最後輸出語音辨識結果資料輸出。Please refer to FIG. 3 , which is a system flowchart of a method and system for dynamically selecting speech recognition channels using a weight usage model according to the present invention. Firstly, in
以3個使用者及3個語音辨識通道策略為例,其中3個語音辨識通道策略各策略總通道數為:策略1=4個、策略2=4個、策略3=2個通道;首先須進行語音辨識通道資料設定,使用者透過通道設定元件11設定語音辨識通道資料,如下表5,再將設定的資料寫入語音辨識通道資料庫2:
接著,使用者透過通道權重設定元件12設定語音辨識通道策略權重,如下表6,再將設定的資料寫入語音辨識通道資料庫2:
當使用者1傳送語音資料至本系統時,由語音辨識通道模組3的通道策略元件31接收語音資料,並將使用者1的語音辨識通道設定資料及語音辨識通道權重設定資料取出,將取出的設定資料傳送至語音辨識通道權重運算模組4的權重資料傳送元件41。權重計算元件42負責計算使用者1語音辨識通道權重,計算結果如下: When user 1 sends voice data to this system, the
權重資料傳送元件41將使用者1語音辨識通道權重計算結果傳送至語音辨識容量運算模組5的通道資料傳送元件51。容量計算元件52負責即時運算語音辨識通道容量。此範例假設策略1共有4個通道數,未使用1個通道、策略2共有4個通道數,未使用3個通道、策略3共有2個通道數。未使用任何通道時,即時運算語音辨識通道容量計算公式如下:= 0.25= 0.75= 1The weight
通道資料傳送元件51至語音辨識通道使用量運算模組7的通道使用量更新元件71取得即時通道使用量數據,通道使用量更新元件71至語音辨識通道資料庫2取得通道使用量數據,如下表7:
通道使用量更新元件71將表3數據傳送至通道使用量權重計算元件72,計算通道使用量權重,計算公式如下: The channel
通道資料傳送元件51傳送使用者1語音辨識通道權重計算結果、即時運算語音辨識通道容量結果及通道使用量權重結果至語音辨識通道決策模組6的決策排序元件61,決策排序元件61先將資料傳送至決策計算元件62計算決策結果,計算結果如下:= 0.0975 .39The channel
決策排序元件61將排序、、的結果,輸出結果以陣列方式排序{,}={0.39,0.0975},決策的結果數值為0則不列於結果陣列中;決策結果數值陣列D’ = {,}={2,1},決策排序元件61將排序結果傳送至語音辨識通道模組3的通道切換元件32,通道切換元件依據排序結果進行切換,將辨識結果資料輸出。
決策排序元件61將決策結果數值陣列傳送至語音辨識通道使用量運算模組7的通道使用量更新元件71。通道使用量更新元件71至語音辨識通道資料庫取得通道使用量記錄如表7,再將記錄傳送至通道使用量權重計算元件72進行使用量更新,依據決策結果數值結果為{,}={2,1},其中共有兩個非0的方案,故CM=2,更新的公式如下:=3.33= 2=1.67The
將上述的通道使用量更新資料透過通道使用量更新元件71寫入語音辨識通道資料庫2,更新的資料如表8:
特點及功效Features and functions
本發明所提供之一種權重用量模型應用於動態選擇語音辨識通道的方法與系統,與其他習用技術相互比較時,更具有下列之優點:A weight usage model provided by the present invention is applied to the method and system for dynamically selecting speech recognition channels. When compared with other conventional technologies, it has the following advantages:
1. 本發明之方法係依據使用者通道設定資料、語音辨識通道容量及通道使用量權重決策語音辨識通道,提升語音辨識通道切換的精確度,於使用者通道服務滿意度考量下有效提升語音辨識通道資源配置。1. The method of the present invention is based on the user channel setting data, voice recognition channel capacity and channel usage weight to determine the voice recognition channel, improve the accuracy of voice recognition channel switching, and effectively improve voice recognition under the consideration of user channel service satisfaction Channel resource configuration.
2. 本發明之語音辨識通道設定模組用以設定使用者選用的語音辨識通道策略,使用者可依據需求設定多個語音辨識通道,提供使用者多元選擇方案。2. The voice recognition channel setting module of the present invention is used to set the voice recognition channel strategy selected by the user. The user can set multiple voice recognition channels according to the needs, providing users with multiple options.
3. 本發明之語音辨識通道模組用以依據語音辨識通道決策結果動態切換通道。3. The voice recognition channel module of the present invention is used to dynamically switch channels according to the decision result of the voice recognition channel.
4. 本發明之語音辨識通道權重運算模組用以依據使用者的通道權重設定值計算語音辨識通道的權重。4. The speech recognition channel weight calculation module of the present invention is used to calculate the weight of the speech recognition channel according to the user's channel weight setting value.
5. 本發明之語音辨識通道容量運算模組用以動態計算語音辨識通道容量,提升語音辨識通道容量,可提升語音辨識通道資源配置。5. The voice recognition channel capacity calculation module of the present invention is used to dynamically calculate the voice recognition channel capacity, increase the voice recognition channel capacity, and improve the resource allocation of the voice recognition channel.
6. 本發明之語音辨識通道決策模組,依據語音辨識通道權重運算、語音辨識通道容量運算模組及語音辨識通道使用量運算模組進行語音辨識通道決策計算與排序,於考量使用者需求與通道容量下,提升語音辨識通道動態資源分配。6. The voice recognition channel decision-making module of the present invention performs voice recognition channel decision-making calculation and sorting according to the voice recognition channel weight calculation, voice recognition channel capacity calculation module and voice recognition channel usage calculation module, considering user needs and Under the channel capacity, the dynamic resource allocation of the speech recognition channel is improved.
7. 本發明之語音辨識通道使用量運算模組,係用以計算使用者通道使用量權重,提供動態更新使用者通道使用量權重,提升使用者下一次使用語音通道辨識的精確度。7. The speech recognition channel usage calculation module of the present invention is used to calculate the weight of the user channel usage, provide dynamic update of the user channel usage weight, and improve the accuracy of the user's voice channel recognition next time.
8. 本發明係有關於一種權重用量模型應用於動態選擇語音辨識通道的方法與系統,特別是應用語音辨識通道權重運算模組、語音辨識通道容量運算模組、語音辨識通道使用量運算模組及語音辨識通道決策模組,利用此方法依據使用者語音辨識通道設定與語音辨識通道容量及語音辨識通道使用量權重提供動態語音辨識通道選擇,提升切換語音辨識通道的精確度並有效提升語音辨識通道資源調配。8. The present invention relates to a method and system for applying a weight usage model to dynamically select a voice recognition channel, especially the application of a voice recognition channel weight calculation module, a voice recognition channel capacity calculation module, and a voice recognition channel usage calculation module And voice recognition channel decision-making module, using this method to provide dynamic voice recognition channel selection according to user voice recognition channel settings, voice recognition channel capacity and voice recognition channel usage weight, improve the accuracy of switching voice recognition channels and effectively improve voice recognition Channel resource allocation.
本發明可獲致的功效如下:(1)經語音辨識通道權重運算模組、語音辨識通道容量運算模組及語音辨識通道使用量運算模組,依據使用者語音辨識通道權重及語音辨識通道容量動態計算權重,於考量使用者通道服務滿意度,動態配置語音辨識通道;(2)經語音辨識通道決策模組,依據決策運算結果切換使用者的語音辨識通道,有效提升語音辨識通道資源調配(3)經語音辨識通道使用量運算模組動態更新通道使用量權重,提升切換語音辨識通道的精確度。The effects obtained by the present invention are as follows: (1) Through the voice recognition channel weight calculation module, the voice recognition channel capacity calculation module and the voice recognition channel usage calculation module, according to the user's voice recognition channel weight and voice recognition channel capacity dynamic Calculate the weight to dynamically configure the speech recognition channel in consideration of the service satisfaction of the user channel; (2) The speech recognition channel decision module switches the user's speech recognition channel according to the decision calculation result, effectively improving the resource allocation of the speech recognition channel (3 ) through the voice recognition channel usage calculation module to dynamically update the channel usage weight to improve the accuracy of switching voice recognition channels.
雖然本發明已以實施例揭露如上,然其並非用以限定本發明,任何所屬技術領域中具有通常知識者,在不脫離本發明的精神和範圍內,當可作些許的更動與潤飾,故本發明的保護範圍當視後附的申請專利範圍所界定者為準。Although the present invention has been disclosed above with the embodiments, it is not intended to limit the present invention. Anyone with ordinary knowledge in the technical field may make some changes and modifications without departing from the spirit and scope of the present invention. The scope of protection of the present invention should be defined by the scope of the appended patent application.
1:語音辨識通道設定模組11:通道設定元件12:通道權重設定元件13:通道設定取用元件2:語音辨識通道資料庫3:語音辨識通道模組31:通道策略元件32:通道切換元件4:語音辨識通道權重運算模組41:權重資料傳送元件42:權重計算元件5:語音辨識通道容量運算模組51:通道資料傳送元件52:容量計算元件6:語音辨識通道決策模組61:決策排序元件62:決策計算元件7:語音辨識通道使用量運算模組71:通道使用量更新元件72:通道使用量權重計算元件101~111:一種權重用量模型應用於動態選擇語音辨識通道的方法與系統之流程圖1: Voice recognition channel setting module 11: Channel setting component 12: Channel weight setting component 13: Channel setting access component 2: Voice recognition channel database 3: Voice recognition channel module 31: Channel strategy component 32: Channel switching component 4: voice recognition channel weight calculation module 41: weight data transmission component 42: weight calculation component 5: voice recognition channel capacity calculation module 51: channel data transmission component 52: capacity calculation component 6: voice recognition channel decision module 61: Decision sorting component 62: Decision calculation component 7: Voice recognition channel usage calculation module 71: Channel usage updating component 72: Channel usage
圖1為一種權重用量模型應用於動態選擇語音辨識通道的方法與系統之系統架構圖。 圖2為一種權重用量模型應用於動態選擇語音辨識通道的方法與系統之元件圖。 圖3為一種權重用量模型應用於動態選擇語音辨識通道的方法與系統之流程圖。FIG. 1 is a system architecture diagram of a method and system for dynamically selecting speech recognition channels using a weight usage model. FIG. 2 is a component diagram of a method and system for dynamically selecting speech recognition channels using a weight usage model. FIG. 3 is a flow chart of a method and system for dynamically selecting speech recognition channels using a weight usage model.
1:語音辨識通道設定模組 1: Speech recognition channel setting module
2:語音辨識通道資料庫 2: Speech recognition channel database
3:語音辨識通道模組 3: Speech recognition channel module
4:語音辨識通道權重運算模組 4: Speech recognition channel weight calculation module
5:語音辨識通道容量運算模組 5: Speech recognition channel capacity calculation module
6:語音辨識通道決策模組 6:Speech recognition channel decision module
7:語音辨識通道使用量運算模組 7: Speech recognition channel usage calculation module
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW107142335A TWI783084B (en) | 2018-11-27 | 2018-11-27 | Method and system of weight-based usage model for dynamic speech recognition channel selection |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW107142335A TWI783084B (en) | 2018-11-27 | 2018-11-27 | Method and system of weight-based usage model for dynamic speech recognition channel selection |
Publications (2)
Publication Number | Publication Date |
---|---|
TW202020858A TW202020858A (en) | 2020-06-01 |
TWI783084B true TWI783084B (en) | 2022-11-11 |
Family
ID=72175500
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
TW107142335A TWI783084B (en) | 2018-11-27 | 2018-11-27 | Method and system of weight-based usage model for dynamic speech recognition channel selection |
Country Status (1)
Country | Link |
---|---|
TW (1) | TWI783084B (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090048824A1 (en) * | 2007-08-16 | 2009-02-19 | Kabushiki Kaisha Toshiba | Acoustic signal processing method and apparatus |
US20120243715A1 (en) * | 2011-03-24 | 2012-09-27 | Oticon A/S | Audio processing device, system, use and method |
US20140316788A1 (en) * | 2001-12-14 | 2014-10-23 | Microsoft Corporation | Quality improvement techniques in an audio encoder |
US20180192349A1 (en) * | 2000-06-13 | 2018-07-05 | Comcast Cable Communications, Llc | Transmission of OFDM Symbols |
-
2018
- 2018-11-27 TW TW107142335A patent/TWI783084B/en active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20180192349A1 (en) * | 2000-06-13 | 2018-07-05 | Comcast Cable Communications, Llc | Transmission of OFDM Symbols |
US20140316788A1 (en) * | 2001-12-14 | 2014-10-23 | Microsoft Corporation | Quality improvement techniques in an audio encoder |
US20090048824A1 (en) * | 2007-08-16 | 2009-02-19 | Kabushiki Kaisha Toshiba | Acoustic signal processing method and apparatus |
US20120243715A1 (en) * | 2011-03-24 | 2012-09-27 | Oticon A/S | Audio processing device, system, use and method |
Also Published As
Publication number | Publication date |
---|---|
TW202020858A (en) | 2020-06-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
TWI698830B (en) | Method and device for transferring robot customer service to manual customer service, computer equipment and computer readable storage medium | |
CN107911491B (en) | Information recommendation method, device and storage medium, server and mobile terminal | |
US20200334493A1 (en) | Method , apparatus, and storage medium for annotating image | |
US10769006B2 (en) | Ensemble risk assessment method for networked devices | |
CN108319599A (en) | A kind of interactive method and apparatus | |
WO2019228232A1 (en) | Method for sharing knowledge between dialog systems, and dialog method and apparatus | |
Zanbouri et al. | A cloud service composition method using a trust‐based clustering algorithm and honeybee mating optimization algorithm | |
CN107947951A (en) | Groups of users recommends method, apparatus and storage medium and server | |
US20190065627A1 (en) | Ancillary speech generation via query answering in knowledge graphs | |
JP5819572B1 (en) | A machine learning model for hierarchically based arrays | |
CN108431832A (en) | Neural network is expanded using external memory | |
CN106453608B (en) | A kind of background request adaptive scheduling algorithm of the mobile application based on cloud | |
WO2017143773A1 (en) | Crowdsourcing learning method and device | |
CN111858034B (en) | Resource management method, system, device and medium | |
CN110533459A (en) | A kind of terminal replacement probability prediction technique and device based on gradient boosting algorithm | |
CN110175469A (en) | A kind of social media privacy of user leakage detection method, system, equipment and medium | |
WO2021197223A1 (en) | Model compression method, system, terminal, and storage medium | |
CN109918574A (en) | Item recommendation method, device, equipment and storage medium | |
CN115034836B (en) | Model training method and related device | |
JP6307822B2 (en) | Program, computer and training data creation support method | |
TWI783084B (en) | Method and system of weight-based usage model for dynamic speech recognition channel selection | |
WO2021129534A1 (en) | Method for migrating function node, and related device | |
CN111813902A (en) | Intelligent response method and system and computing device | |
CN109885668A (en) | A kind of expansible field interactive system status tracking method and apparatus | |
US20050282125A1 (en) | Individualized retention plans for students |