TW201947428A

TW201947428A - Method and system for automated learning of a virtual assistant

Info

Publication number: TW201947428A
Application number: TW107115794A
Authority: TW
Inventors: 周忠信; 吳兆麟; 許旭正
Original assignee: 鼎新電腦股份有限公司
Priority date: 2018-05-09
Filing date: 2018-05-09
Publication date: 2019-12-16
Also published as: TWI679548B

Abstract

A method for automated learning of a virtual assistant is disclosed herein. The method includes the following operations: receiving an audio and identifying the audio to generate a corpus data; utilizing a natural language processing model to analyze the corpus data, and to generate a language characteristic data corresponding to the corpus data; performing a scenario analysis for the language characteristic data according to a situation information confirm an operation corresponding to one of the intentions; if the scenario analysis could not determine the operation corresponding to one of the intentions, performing a segmentation processing for the corpus data; determining whether there is new vocabulary or new corpus data according to results of the segmentation processing; if there is new vocabulary, updating the natural language processing model according to meaning of new vocabulary; if there is new corpus data, updating the scenario analysis according to intention of new corpus data.

Description

Automatic learning method and system of virtual assistant

本案是有關於一種自動學習的方法及系統，且特別是有關於一種虛擬助理的自動學習方法及系統。 This case relates to an automatic learning method and system, and in particular to an automatic learning method and system of a virtual assistant.

企業資源規劃系統(Enterprise Resource Planning，ERP)，簡稱ERP系統，是指建立在資訊技術的基礎上為企業決策層提供決策的管理平台。其主要是將企業的人流、物流、資訊流、資金流進行統一的管理，以最大限度的利用企業的資源。而ERP系統包含有生產控制、物流管理和財務管理等三大方面的功能，因此ERP系統規模非常的龐大。 Enterprise resource planning system (Enterprise Resource Planning, ERP), referred to as ERP system, refers to a management platform that provides decision-making for the enterprise decision-making layer based on information technology. Its main purpose is to uniformly manage the flow of people, logistics, information, and funds in order to maximize the use of corporate resources. The ERP system contains three major functions, such as production control, logistics management, and financial management. Therefore, the scale of the ERP system is very large.

將虛擬助理應用於ERP系統中，更可以快速的幫助使用者與龐大的ERP系統交流，能夠節省使用者在使用ERP系統中所花的時間，但由於每個使用者使用ERP系統習慣的不同，因此會有虛擬助理無法理解使用者問題的情況，反而造成使用者在使用ERP系統上的困難。 Applying the virtual assistant to the ERP system can also help users quickly communicate with the huge ERP system, which can save users the time spent in using the ERP system, but because each user has different habits in using the ERP system, Therefore, there may be situations where the virtual assistant cannot understand the user's problem, which will cause the user's difficulty in using the ERP system.

本發明之主要目的係在提供一種虛擬助理的自動學習方法及系統，其主要係讓虛擬助理具有自動學習的功能，讓虛擬助理可以在與使用者交流的過程中，自動學習到使用者的說話習慣，或是行業中的特殊用語用詞，達到讓使用者使用ERP系統是能夠更快速便利的功效。 The main object of the present invention is to provide an automatic learning method and system for a virtual assistant, which is mainly to allow the virtual assistant to have an automatic learning function, so that the virtual assistant can automatically learn the user's speech during the process of communicating with the user. Habits, or special terminology in the industry, enable users to use the ERP system more quickly and conveniently.

為達成上述目的，本案之第一態樣是在提供一種虛擬助理的自動學習方法，此方法包含以下步驟：接收音訊輸入並辨識音訊以形成語料資料；利用自然語言處理模型分析語料資料，以產生與語料資料對應的語言特徵資訊，其中語言特徵資訊包含複數個意圖、該些意圖對應的機率以及複數個詞彙；依據職能情境資訊對語言特徵資訊進行職能情境分析，判斷該些意圖的其中之一對應的操作；如果職能情境分析無法判斷該些意圖的其中之一對應的操作，則針對語料資料進行分詞處理；根據分詞處理後的結果，判斷是否存在新詞彙或新語料資料；如果存在新詞彙，根據新詞彙的意義更新自然語言處理模型，如果存在新語料資料，根據新語料資料的意圖更新職能情境分析；其中，操作包含查詢資料操作及執行指令操作的其中之一。 In order to achieve the above purpose, the first aspect of the present case is to provide an automatic learning method for a virtual assistant. This method includes the following steps: receiving audio input and identifying the audio to form a corpus data; using a natural language processing model to analyze the corpus data, In order to generate linguistic feature information corresponding to the corpus data, the linguistic feature information includes a plurality of intents, the probability corresponding to the intents, and a plurality of vocabularies; a functional context analysis is performed on the linguistic feature information based on the functional context information to determine the intent of the intents. One of them corresponds to the operation; if the functional context analysis cannot determine the operation corresponding to one of these intentions, the word segmentation processing is performed on the corpus data; according to the result of the word segmentation processing, it is determined whether there is a new vocabulary or new corpus data; If there is a new vocabulary, the natural language processing model is updated according to the meaning of the new vocabulary. If there is a new corpus data, the functional context analysis is updated according to the intention of the new corpus data. Among them, the operation includes one of querying data and performing instruction operations.

本案之第二態樣是在提供一種虛擬助理的自動學習系統，分別與企業資料庫及企業資源系統連接，其包含：處理器、儲存裝置以及輸入/輸出裝置。儲存裝置電性連接至處理器，用以儲存總體資料庫、應用知識資料庫、領域知識資料庫以及歷史資料庫。輸入/輸出裝置電性連接至處理器，用以提供介面以供輸入音訊。其中，處理器包含：語音辨識模組、語料分析模組、情境辨識模組、未知語料判斷模組以及更新資訊模組。語音辨識模組用以辨識音訊以形成語料資料。語料分析模組與語音辨識模組電性連接，用以利用自然語言處理模型分析語料資料，以產生與語料資料對應的語言特徵資訊，其中語言特徵資訊包含複數個意圖、該些意圖對應的機率以及複數個詞彙。情境辨識模組與語料分析模組電性連接，用以依據職能情境資訊對語言特徵資訊進行職能情境分析，判斷該些意圖的其中之一對應的操作。未知語料判斷模組與情境辨識模組電性連接，用以在情境辨識模組無法辨識該些意圖的其中之一對應的操作時，針對語料資料進行分詞處理，並根據分詞處理後的結果，判斷是否存在新詞彙或新語料資料。更新資訊模組與未知語料判斷模組電性連接，用以在有新詞彙產生時，根據該新詞彙的意義更新該自然語言處理模型，以及在該新語料資料產生時，根據該新語料資料的意圖更新該職能情境分析；其中，該操作包含一查詢資料操作及一執行指令操作的其中之一。 The second aspect of the case is to provide an automatic learning system for virtual assistants, which is connected to the enterprise database and the enterprise resource system, respectively, and includes: a processor, a storage device, and an input / output device. The storage device is electrically connected to the processor, and is used for storing a general database, an application knowledge database, a domain knowledge database, and a historical database. The input / output device is electrically connected to the processor to provide an interface for inputting audio. The processor includes a speech recognition module, a corpus analysis module, a situation recognition module, an unknown corpus determination module, and an update information module. The speech recognition module is used to recognize audio to form corpus data. The corpus analysis module and the speech recognition module are electrically connected to analyze the corpus data using a natural language processing model to generate linguistic feature information corresponding to the corpus data, where the linguistic feature information includes a plurality of intents, the intents Corresponding probability and plural words. The context recognition module and the corpus analysis module are electrically connected to perform a functional context analysis on the linguistic feature information according to the functional context information to determine an operation corresponding to one of the intentions. The unknown corpus judgment module is electrically connected to the situation recognition module, and is used to perform word segmentation processing on the corpus data when the situation recognition module cannot recognize the operation corresponding to one of these intentions, and according to the word segmentation processing, As a result, it is determined whether there is a new vocabulary or a new corpus. The update information module and the unknown corpus judgment module are electrically connected to update the natural language processing model according to the meaning of the new vocabulary when a new vocabulary is generated, and according to the new corpus when the new corpus data is generated The intent of the data updates the functional context analysis; wherein the operation includes one of a data query operation and an execution instruction operation.

本發明之虛擬助理的自動學習方法及虛擬助理的自動學習系統主要係讓虛擬助理具有自動學習的功能，讓虛擬助理可以在與使用者交流的過程中，自動學習到使用者的說話習慣，或是行業中的特殊用語用詞，達到讓使用者使用ERP系統時能夠更快速便利的功效。 The automatic learning method of the virtual assistant and the automatic learning system of the virtual assistant of the present invention mainly provide the virtual assistant with an automatic learning function, so that the virtual assistant can automatically learn the user's speaking habits during the process of communicating with the user, or It is a special terminology in the industry to achieve the effect of enabling users to use the ERP system more quickly and conveniently.

100‧‧‧虛擬助理的自動學習系統 100‧‧‧ Automatic learning system for virtual assistants

110‧‧‧處理器 110‧‧‧ processor

130‧‧‧儲存裝置 130‧‧‧Storage device

150‧‧‧輸入/輸出裝置 150‧‧‧ input / output device

131‧‧‧總體資料庫 131‧‧‧Overall Database

132‧‧‧應用知識資料庫 132‧‧‧Application Knowledge Database

133‧‧‧領域知識資料庫 133‧‧‧ Domain Knowledge Database

134‧‧‧歷史資料庫 134‧‧‧History database

111‧‧‧語音辨識模組 111‧‧‧Speech recognition module

112‧‧‧語料分析模組 112‧‧‧ Corpus Analysis Module

113‧‧‧情境辨識模組 113‧‧‧Situation recognition module

114‧‧‧未知語料判斷模組 114‧‧‧Unknown corpus judgment module

115‧‧‧更新資訊模組 115‧‧‧Update Information Module

121‧‧‧訓練模組 121‧‧‧ Training Module

122‧‧‧範本建立模組 122‧‧‧Template Creation Module

123‧‧‧語意模型建立模組 123‧‧‧ Semantic model building module

124‧‧‧詞彙模型建立模組 124‧‧‧vocabulary model building module

125‧‧‧情境訓練模組 125‧‧‧Scenario Training Module

126‧‧‧詞彙訓練模組 126‧‧‧ Vocabulary Training Module

300‧‧‧虛擬助理的自動學習方法 300‧‧‧ Automatic learning method of virtual assistant

S310~S360、S410~S480、S321~S323、S331~S332、S341~S342、S361~S363‧‧‧步驟 S310 ~ S360, S410 ~ S480, S321 ~ S323, S331 ~ S332, S341 ~ S342, S361 ~ S363‧‧‧Steps

為讓本發明之上述和其他目的、特徵、優點與實施例能更明顯易懂，所附圖式之說明如下：第1圖係根據本案之一些實施例所繪示之一種虛擬助理的自動學習系統的示意圖；第2圖係根據本案之一些實施例所繪示之處理器的示意圖；第3圖係根據本案之一些實施例所繪示之一種虛擬助理的自動學習方法的流程圖；第4圖係根據本案之一些實施例所繪示之訓練資料模型的流程圖；第5圖係根據本案之一些實施例所繪示之步驟S320的流程圖；第6圖係根據本案之一些實施例所繪示之步驟S330的流程圖；第7圖係根據本案之一些實施例所繪示之步驟S340的流程圖；以及第8圖係根據本案之一些實施例所繪示之步驟S360的流程圖。 In order to make the above and other objects, features, advantages, and embodiments of the present invention more comprehensible, the description of the drawings is as follows: FIG. 1 is an automatic learning of a virtual assistant according to some embodiments of the present invention A schematic diagram of the system; FIG. 2 is a schematic diagram of a processor according to some embodiments of the present case; FIG. 3 is a flowchart of an automatic learning method of a virtual assistant according to some embodiments of the present case; FIG. 5 is a flowchart of a training data model according to some embodiments of the present case; FIG. 5 is a flowchart of step S320 according to some embodiments of the present case; and FIG. 6 is a flowchart according to some embodiments of the present case. FIG. 7 is a flowchart of step S330; FIG. 7 is a flowchart of step S340 according to some embodiments of the present case; and FIG. 8 is a flowchart of step S360 according to some embodiments of the present case.

以下揭示提供許多不同實施例或例證用以實施本發明的不同特徵。特殊例證中的元件及配置在以下討論中被用來簡化本揭示。所討論的任何例證只用來作解說的用途，並不會以任何方式限制本發明或其例證之範圍和意義。此外，本揭示在不同例證中可能重複引用數字符號且/或字母，這些重複皆為了簡化及闡述，其本身並未指定以下討論中不同實施例且/或配置之間的關係。 The following disclosure provides many different embodiments or illustrations to implement different features of the invention. The elements and configurations in the particular example are used in the following discussion to simplify the present disclosure. Any illustrations discussed are for illustrative purposes only and do not in any way limit the scope and meaning of the invention or its illustrations. In addition, the present disclosure may repeatedly refer to numerical symbols and / or letters in different examples, and these repetitions are for simplification and explanation, and do not themselves specify the relationship between different embodiments and / or configurations in the following discussion.

在全篇說明書與申請專利範圍所使用之用詞(terms)，除有特別註明外，通常具有每個用詞使用在此領域中、在此揭露之內容中與特殊內容中的平常意義。某些用以描述本揭露之用詞將於下或在此說明書的別處討論，以提供本領域技術人員在有關本揭露之描述上額外的引導。 The terms used throughout the specification and the scope of patent applications, unless otherwise specified, usually have the ordinary meaning of each term used in this field, in the content disclosed here, and in special content. Certain terms used to describe this disclosure are discussed below or elsewhere in this specification to provide additional guidance to those skilled in the art on the description of this disclosure.

關於本文中所使用之『耦接』或『連接』，均可指二或多個元件相互直接作實體或電性接觸，或是相互間接作實體或電性接觸，而『耦接』或『連接』還可指二或多個元件相互操作或動作。 As used herein, "coupling" or "connection" can mean that two or more components make direct physical or electrical contact with each other, or indirectly make physical or electrical contact with each other, and "coupling" or " "Connected" may also mean that two or more elements operate or act on each other.

在本文中，使用第一、第二與第三等等之詞彙，是用於描述各種元件、組件、區域、層與/或區塊是可以被理解的。但是這些元件、組件、區域、層與/或區塊不應該被這些術語所限制。這些詞彙只限於用來辨別單一元件、組件、區域、層與/或區塊。因此，在下文中的一第一元件、組件、區域、層與/或區塊也可被稱為第二元件、組件、區域、層與/或區塊，而不脫離本發明的本意。如本文所用，詞彙『與/或』包含了列出的關聯項目中的一個或多個的任何組合。本案文件中提到的「及/或」是指表列元件的任一者、全部或至少一者的任意組合。 In this article, the terms first, second, third, etc. are used to describe various elements, components, regions, layers, and / or blocks that are understandable. However, these elements, components, regions, layers and / or blocks should not be limited by these terms. These terms are limited to identifying single elements, components, regions, layers, and / or blocks. Therefore, a first element, component, region, layer, and / or block in the following may also be referred to as a second element, component, region, layer, and / or block without departing from the intention of the present invention. As used herein, the term "and / or" includes any combination of one or more of the associated listed items. The "and / or" mentioned in this document refers to any, all or any combination of at least one of the listed elements.

請參閱第1圖。第1圖係根據本案之一些實施例所繪示之一種虛擬助理的自動學習系統100的示意圖。如第1圖所繪示，虛擬助理的自動學習系統100包含處理器110、儲存裝置130以及輸入/輸出裝置150。儲存裝置130用以儲存總體資料庫131、應用知識資料庫132、領域知識資料庫133以及歷史資料庫134，儲存總體資料庫131、應用知識資料庫132、領域知識資料庫133以及歷史資料庫134電性連接至處理器110。輸入/輸出裝置150電性連接至處理器110，用以提供介面以供輸入音訊。於一實施例中，輸入/輸出裝置150可以是鍵盤、觸控式螢幕、麥克風、喇叭或其它合適的輸入/輸出裝置。使用者可透過輸入/輸出裝置提供的介面輸入音訊。 See Figure 1. FIG. 1 is a schematic diagram of an automatic learning system 100 for a virtual assistant according to some embodiments of the present invention. As shown in FIG. 1, the automatic learning system 100 of the virtual assistant includes a processor 110, a storage device 130, and an input / output device 150. The storage device 130 is configured to store a general database 131, an application knowledge database 132, a domain knowledge database 133, and a historical database 134, and a general database 131, an application knowledge database 132, a domain knowledge database 133, and a historical database 134. Electrically connected to the processor 110. The input / output device 150 is electrically connected to the processor 110 for providing an interface for inputting audio. In one embodiment, the input / output device 150 may be a keyboard, a touch screen, a microphone, a speaker, or other suitable input / output devices. The user can input audio through the interface provided by the input / output device.

於本發明各實施例中，處理器110可以實施為積體電路如微控制單元(microcontroller)、微處理器(microprocessor)、數位訊號處理器(digital signal processor)、特殊應用積體電路(application specific integrated circuit，ASIC)、邏輯電路或其他類似元件或上述元件的組合。儲存裝置150可以實施為記憶體、硬碟、隨身碟、記憶卡等。 In various embodiments of the present invention, the processor 110 may be implemented as an integrated circuit such as a microcontroller, a microprocessor, a digital signal processor, and an application specific integrated circuit. integrated circuit (ASIC), logic circuit, or other similar components or a combination of the above. The storage device 150 may be implemented as a memory, a hard disk, a flash drive, a memory card, and the like.

請參閱第2圖，第2圖係根據本案之一些實施例所繪示之一種處理器110的示意圖。處理器110包含語音辨識模組111、語料分析模組112、情境辨識模組113、未知語料判斷模組114、更新資訊模組115、訓練模組121、範本建立模組122、語意模型建立模組123、詞彙模型建立模組124、情境訓練模組125以及詞彙訓練模組126。語料分析模組112與語音辨識模組111電性連接，情境辨識模組113與語料分析模組112電性連接，未知語料判斷模組114與情境判斷模組113電性連接，更新資訊模組115與未知語料判斷模組114電性連接。訓練模組121與語料分析模組112電性連接，範本建立模組122與訓練模組121電性連接，語意模型建立模組123以及詞彙模型建立模組124與範本建立模組122電性連接，情境訓練模組125與情境辨識模組113電性連接，未知語料判斷模組114與詞彙訓練模組126電性連接。 Please refer to FIG. 2. FIG. 2 is a schematic diagram of a processor 110 according to some embodiments of the present invention. The processor 110 includes a speech recognition module 111, a corpus analysis module 112, a situation recognition module 113, an unknown corpus judgment module 114, an update information module 115, a training module 121, a template creation module 122, and a semantic model A building module 123, a vocabulary model building module 124, a situation training module 125, and a vocabulary training module 126. Corpus analysis module 112 is electrically connected with speech recognition module 111, context recognition module 113 is electrically connected with corpus analysis module 112, unknown corpus judgment module 114 is electrically connected with context judgment module 113, and updated The information module 115 is electrically connected to the unknown corpus judgment module 114. The training module 121 is electrically connected to the corpus analysis module 112. The template creation module 122 is electrically connected to the training module 121. The semantic model creation module 123 and the vocabulary model creation module 124 and the template creation module 122 are electrically connected. Connected, the situation training module 125 is electrically connected with the situation recognition module 113, and the unknown corpus judgment module 114 is electrically connected with the vocabulary training module 126.

請一併參閱第1圖~第3圖。第3圖係根據本案之一些實施例所繪示之一種虛擬助理的自動學習方法300的流程圖。如第3圖所示，虛擬助理的自動學習方法300包含以下步驟：步驟S310：接收音訊輸入並辨識音訊以形成語料資料；步驟S320：利用自然語言處理模型分析語料資料，以產生與語料資料對應的語言特徵資訊；步驟S330：依據職能情境資訊對語言特徵資訊進行職能情境分析，判斷該些意圖的其中之一對應的操作；步驟S340：如果職能情境分析無法判斷該些意圖的其中之一對應的操作，則針對語料資料進行分詞處理；步驟S350：根據分詞處理後的結果，判斷是否存在新詞彙或新語料資料；以及步驟S360：如果存在新詞彙，根據新詞彙的意義更新自然語言處理模型，如果存在新語料資料，根據新語料資料的意圖更新職能情境分析。 Please refer to Figures 1 to 3 together. FIG. 3 is a flowchart of an automatic learning method 300 for a virtual assistant according to some embodiments of the present invention. As shown in FIG. 3, the automatic learning method 300 of the virtual assistant includes the following steps: Step S310: Receive audio input and identify the audio to form corpus data; Step S320: Use natural language processing model to analyze the corpus data to generate and language Step S330: Perform a functional situation analysis on the linguistic feature information according to the functional situation information to determine the operation corresponding to one of the intentions; Step S340: If the functional situation analysis cannot determine which of the intentions One of the corresponding operations, word segmentation processing is performed on the corpus data; step S350: judging whether a new vocabulary or new corpus data exists according to the result of the word segmentation processing; and step S360: if a new vocabulary exists, update according to the meaning of the new vocabulary Natural language processing model, if there is new corpus data, the functional situation analysis is updated according to the intent of the new corpus data.

於步驟S310中，接收音訊輸入並辨識音訊以形成語料資料。於一實施例中，經由輸入/輸出裝置150接收到的音訊可以由處理器110的語音辨識模組111進行語音辨識，將使用者的自然語言轉換為語料資料。於另一實施例中，語音辨識也可以藉由網際網路將音訊傳送至雲端語音辨識系統，經由雲端語音辨識系統辨識音訊後，再將辨識結果作為語料資料，舉例而言，雲端語音辨識系統可以實施為google的語音辨識系統。 In step S310, audio input is received and audio is recognized to form corpus data. In one embodiment, the audio received through the input / output device 150 can be recognized by the speech recognition module 111 of the processor 110 to convert the user's natural language into corpus data. In another embodiment, speech recognition can also send audio to the cloud speech recognition system through the Internet. After the audio is recognized by the cloud speech recognition system, the recognition result is used as corpus data. For example, cloud speech recognition The system can be implemented as Google's speech recognition system.

在執行步驟S320之前，需先建立共通詞彙模型以及共通語意模型。因此請參考第4圖，第4圖係根據本案之一些實施例所繪示之訓練資料模型的流程圖。如第4圖所示，訓練資料模型階段包含以下步驟：步驟S410：根據應用知識資料庫及領域知識資料庫產生系統領域詞彙集合；步驟S420：系統領域詞彙集合及複數個服務應用參數形成為關鍵實體集合；步驟S430：將複數個訓練語料分類為查詢資料操作及執行指令操作的其中之一；步驟S440：依照企業資料庫中的類別區分對應查詢資料操作的該些訓練語料的意圖形成複數個查詢資料操作意圖，以及依照企業資源系統提供的服務行為區分對應執行指令操作的該些訓練語料的意圖形成複數個執行指令操作意圖；步驟S450：建立查詢資料操作意圖的範本，以及執行指令操作意圖的範本；步驟S460：根據關鍵實體集合、查詢資料操作意圖的範本以及執行指令操作意圖的範本建立總體資料庫；步驟S470：辨識關鍵實體集合中的系統領域詞彙在訓練語料中出現的複數個第一機率，並藉由辨識出的系統領域詞彙分析訓練語料的複數個句型結構，以及系統領域詞彙彼此之間的複數個關聯性，並根據第一機率以及關聯性建立共通詞彙模型；以及步驟S480：分析查詢資料操作意圖以及執行指令操作意圖中出現系統領域詞彙的複數個第二機率，並根據句型結構以及第二機率建立共通語意模型。 Before executing step S320, a common vocabulary model and a common semantic model need to be established. Therefore, please refer to FIG. 4, which is a flowchart of a training data model according to some embodiments of the present invention. As shown in Figure 4, the training data model stage includes the following steps: Step S410: Generate a system domain vocabulary based on the application knowledge database and domain knowledge database; Step S420: The system domain vocabulary and a plurality of service application parameters are formed as the key Entity set; step S430: classify a plurality of training corpora into one of query data operation and execute instruction operation; step S440: distinguish the training corpus corresponding to query data operation according to the category in the enterprise database The plurality of query data operation intentions and the intention to distinguish the training corpus corresponding to the execution of the instruction operation according to the service behavior provided by the enterprise resource system form a plurality of execution instruction operation intentions; step S450: establishing a template of the query data operation intention and the execution The template of the instruction operation intention; Step S460: Establish an overall database according to the key entity set, the query data operation intention template, and the template for executing the instruction operation intention; Step S470: identify the system domain words in the key entity set appear in the training corpus Plural first Rate, and analyze the plurality of sentence structure structures of the training corpus through the identified system domain vocabulary, and the plurality of correlations between the system domain vocabularies, and establish a common vocabulary model based on the first probability and correlation; and steps S480: Analyze a plurality of second chances of a system domain vocabulary appearing in the query data operation intention and the execution instruction operation intention, and establish a common semantic model according to the sentence structure and the second probability.

於步驟S410及步驟S420中，根據應用知識資料庫132及領域知識資料庫133產生系統領域詞彙集合，再利用系統領域詞彙集合及複數個服務應用參數形成為關鍵實體集合，關鍵實體集合包含複數個系統領域詞彙。舉例而言，關鍵實體集合包含企業領域詞彙以及企業系統的服務應用參數等資訊。企業領域詞彙則是指每個不同領域的企業可能會需要用到的詞彙，例如醫療業運用到的詞彙與運輸業運用到的詞彙一定不相同，因此企業領域詞彙會依照每個使用ERP系統的企業不同而有所變化。企業系統的服務應用參數則是企業系統所提供的各項服務對應的參數，舉例而言，企業系統中的請假功能可能需要請假時間、假別等資訊，關鍵實體集合中的系統領域詞彙就需要包含事假、年假、病假、出差假等資訊。 In steps S410 and S420, a system domain vocabulary set is generated according to the application knowledge database 132 and the domain knowledge database 133, and then the system domain vocabulary set and a plurality of service application parameters are used to form a key entity set. The key entity set includes a plurality of System domain vocabulary. For example, the key entity set contains information such as enterprise domain vocabulary and service application parameters of the enterprise system. The corporate vocabulary refers to the vocabulary that companies in different fields may need. For example, the vocabulary used in the medical industry and the vocabulary used in the transportation industry must be different. Enterprises vary. The service application parameters of the enterprise system are the parameters corresponding to various services provided by the enterprise system. For example, the leave function in the enterprise system may require information such as leave time and leave, and the system domain vocabulary in the key entity collection needs Contains information on leave, annual leave, sick leave, and business leave.

詳細而言，關鍵實體集合更包含存取資料時會有的資料欄位名稱、企業系統提供給使用者的服務名稱、使用者在查詢時所設定的限制條件之參數值、服務應用的參數值以及企業系統的操作函數等，企業系統的操作函數可以為請假、加班申請、出差申請、報支等操作函數。而上述的這些資訊也可能會有對應的別名，也需在訓練資料庫時一併輸入，例如：出貨單對於特定領域的廠商有可能有出貨明細表或銷貨單等不同的名稱。 In detail, the key entity collection also includes the data field names that will be available when accessing the data, the service name provided by the enterprise system to the user, the parameter values of the restriction conditions set by the user during the query, and the parameter values of the service application As well as operation functions of the enterprise system, the operation functions of the enterprise system may be operation functions such as leave of absence, overtime application, business application, and reimbursement. The above-mentioned information may also have corresponding aliases, which also need to be entered in the training database. For example, the shipping order may have different names such as shipping details or sales orders for manufacturers in specific fields.

於步驟S430中，將複數個訓練語料分類為查詢資料操作及執行指令操作的其中之一。訓練語料可以是使用者的可能會下的指令或會問的問題等自然語言的資料，在建立好關鍵實體集合後會將訓練語料按照意圖分類，於一實施例中，使用者的意圖分為查詢資料操作及執行指令操作，但也可以將使用者的意圖分類的更精細，本發明不限於此。舉例而言，使用者如果對虛擬助理說：「請幫我找XX公司的出貨單」，在本發明的意圖分類中會分類為查詢資料操作，虛擬助理就會去企業資料庫中幫使用者查詢XX公司的出貨單。如果使用者對虛擬助理說：「幫我請1月30日的出差假」，在本發明的意圖分類中會分類為執行指令操作，虛擬助理就會進入企業資源系統中幫使用者請假。 In step S430, the plurality of training corpora are classified into one of a query data operation and a command execution operation. The training corpus can be natural language data such as the user's instructions or questions that may be asked. After the key entity set is established, the training corpus will be classified according to the intention. In one embodiment, the user's intention It is divided into query data operation and execution instruction operation, but the user's intention can be classified more finely, and the present invention is not limited to this. For example, if the user says to the virtual assistant: "Please help me find the shipping order of XX company", it will be classified as query data operation in the intent classification of the present invention, and the virtual assistant will go to the enterprise database to help Check the shipping order of company XX. If the user says to the virtual assistant: "Please ask me for a business trip on January 30." In the intent classification of the present invention, it will be classified as performing an instruction operation, and the virtual assistant will enter the enterprise resource system to help the user ask for leave.

於步驟S440中，依照企業資料庫中的類別區分對應查詢資料操作的該些訓練語料的意圖形成複數個查詢資料操作意圖，以及依照企業資源系統提供的服務行為區分對應執行指令操作的該些訓練語料的意圖形成複數個執行指令操作意圖。於一實施例中，會先按照每個不同領域的企業資料庫對查詢資料操作區分意圖。舉例而言，醫療業的企業資料庫所儲存的資料欄位一定與運輸業的企業資料庫不相同，因此兩者的使用者需求也不一定相同。例如，對醫療業的使用者可能會有查詢病歷資料、查詢病房空位等都是查詢資料操作的不同意圖，對運輸業的使用者可能會有查詢出貨紀錄、查詢包裹運送狀態等都是查詢資料操作的不同意圖。當然也會按照每個不同領域的企業資源系統提供的服務行為對執行指令操作區分意圖，如上所述醫療業的企業資源系統所提供的服務也當然會和運輸業有所不同，每個不同領域的企業所提供的查詢資料操作或服務行為操作也不一定可以通用，因此也需要對每個不同領域的企業所提供的服務區分意圖，例如，對醫療業的使用者可能會有提供掛號的服務、提供住院訂健康餐的服務等都是服務行為操作的不同意圖，對運輸業的使用者可能會有提供自動分類貨物的服務、安排貨物出貨順序的服務等都是服務行為操作的不同意圖。 In step S440, the intention of distinguishing the training corpus corresponding to the query data operation according to the category in the enterprise database is to form a plurality of query data operation intentions, and to distinguish the corresponding execution instruction operations according to the service behavior provided by the enterprise resource system. The intent of the training corpus forms a plurality of intents to execute the instruction operation. In one embodiment, the operation of querying data is distinguished according to the enterprise database in each different domain. For example, the database stored in the enterprise database of the medical industry must be different from the enterprise database of the transportation industry, so the user needs of the two are not necessarily the same. For example, users in the medical industry may have different intents for querying medical records, checking room vacancies, etc., and users in the transportation industry may have querying shipping records, querying parcel delivery status, etc. Different intentions of data manipulation. Of course, according to the service behavior provided by the enterprise resource system in each different field, the intention is to distinguish the execution of the instruction operation. As mentioned above, the services provided by the enterprise resource system in the medical industry will of course be different from the transportation industry. Each different field The query data operation or service behavior operation provided by the company may not be universal, so it is also necessary to distinguish the intent of the service provided by the enterprise in each different field. For example, users in the medical industry may have registered services. The provision of services such as hospitalization and ordering healthy meals are different intentions of service behavior operations. For users in the transportation industry, there may be different intentions of service behavior operations to provide services of automatically classifying goods and arranging the order of shipment of goods. .

於步驟S450及步驟S460中，建立查詢資料操作意圖的範本以及執行指令操作意圖的範本，並根據關鍵實體集合、查詢資料操作意圖的範本以及執行指令操作意圖的範本建立總體資料庫131。舉例而言，將使用者在操作某個領域企業的虛擬助理會有的查詢資料操作意圖及執行指令操作意圖都區分好後，就可以針對每個意圖產生對應的範本，根據上方的範例，醫療業就會有對應查詢病歷資料、查詢病房空位、提供掛號的服務及提供住院訂健康餐的服務的4個範本，運輸業就會有對應查詢出貨紀錄、查詢包裹運送狀態、提供自動分類貨物的服務、安排貨物出貨順序的服務的4個範本，接著會根據上述這些範本以及關鍵實體集合建立總體資料庫131。 In steps S450 and S460, a template for querying data operation intention and a template for executing instruction operation is established, and an overall database 131 is established according to a set of key entities, a template for querying data operation intention and a template for executing instruction operation. For example, after distinguishing the query data operation intention and execution instruction operation intention that a user's virtual assistant operating in a certain field company has, the corresponding template can be generated for each intent. According to the above example, medical treatment There will be 4 templates corresponding to querying medical records, querying vacancies in the ward, providing registered services, and providing services for ordering healthy meals in hospitals. The transportation industry will have corresponding querying shipping records, querying parcel delivery status, and providing automatic sorting of goods. Services, four services for arranging the order in which goods are shipped, and then an overall database 131 will be created based on these templates and a collection of key entities.

於步驟S470中，辨識關鍵實體集合中的系統領域詞彙在訓練語料中出現的複數個第一機率，並藉由辨識出的系統領域詞彙分析訓練語料的複數個句型結構，以及系統領域詞彙彼此之間的複數個關聯性，並根據第一機率以及關聯性建立共通詞彙模型。在一實施例中，利用n元語法(n-GRAM)以及上下文無關文法(Context-free grammar,CFG)兩種演算法計算每一系統領域詞彙在訓練語料中出現的機率，並藉由系統領域詞彙分析訓練語料的句型結構以及系統領域詞彙彼此之間的關聯性以建立共通詞彙模型。舉例而言，如果訓練語料中有「我要查詢XX公司的報價單」以及「我要查詢XX公司的出貨單」，而「XX公司」、「報價單」及「出貨單」都是系統領域詞彙，但在上述的範例中，由於「XX公司」可能平均出現在每一個查詢資料操作的意圖中，因此「XX公司」的機率在每一個查詢資料操作的意圖中都幾乎相同，而「報價單」及「出貨單」則只在查詢某些特定資料之意圖的訓練語料中大量出現，而不會出現在查詢其他資料之意圖的訓練語料中，因此「報價單」及「出貨單」的機率在對應的意圖中會特別高，而在其他意圖中會較低。 In step S470, identify the first multiple occurrences of the system domain vocabulary in the key entity set in the training corpus, and analyze the plural sentence structure of the training corpus through the identified system domain vocabulary, and the system domain There are multiple correlations between words, and a common vocabulary model is established according to the first probability and the correlation. In one embodiment, two algorithms, n-GRAM and Context-free grammar (CFG), are used to calculate the probability of each system domain vocabulary appearing in the training corpus. The domain vocabulary analysis training corpus's sentence structure and system domain vocabulary are related to each other to establish a common vocabulary model. For example, if the training corpus contains "I want to query the quotation of XX company" and "I want to query the XX company's shipping order", and "XX company", "quotation" and "shipment order" are all Is a vocabulary in the system field, but in the above example, because "XX company" may appear on average for each query data operation intent, the probability of "XX company" is almost the same in each query data operation intention. The "quotations" and "shipments" only appear in large numbers in the training corpus intended to query some specific data, and will not appear in the training corpus intended to query other data, so the "quotation" And the probability of "shipment order" will be particularly high in the corresponding intent, and will be lower in other intents.

於步驟S480中，分析查詢資料操作意圖以及執行指令操作意圖中出現系統領域詞彙的複數個第二機率，並根據句型結構以及第二機率建立共通語意模型。在一實施例中，利用隱馬爾可夫模型(Hidden Markov Model，HMM)演算法計算系統領域詞彙在查詢資料操作意圖以及執行指令操作意圖中出現的機率，以建立共通語意模型，舉例而言，在訓練資料模型階段時會輸入許多訓練語料，隱馬爾可夫模型演算法必須計算系統領域詞彙在不同意圖出現的機率。結合上述的範例，如果訓練語料中有「我要查詢XX公司的出貨單」，依照n元語法以及上下文無關文法可以找出「XX公司」及「出貨單」都是系統領域詞彙，而隱馬爾可夫模型演算法可以依據所有辨識出的系統領域詞彙於查詢資料操作意圖以及執行指令操作意圖中的機率以及系統領域詞彙之間的關係，進一步判斷「出貨單」與查詢出貨資料的意圖相關聯，再結合「XX公司」的系統領域詞彙，可以自動幫使用者在企業資料庫中查詢XX公司的出貨相關資料。 In step S480, a plurality of second probabilities of vocabulary of the system domain appear in the query data operation intention and the execution instruction operation intention, and a common semantic model is established according to the sentence structure and the second probability. In an embodiment, a Hidden Markov Model (HMM) algorithm is used to calculate the probability of a system vocabulary appearing in query data operation intentions and executing instruction operation intentions to establish a common semantic model. For example, During the training data model phase, many training corpora are input. The hidden Markov model algorithm must calculate the probability of the system domain vocabulary appearing at different intents. In combination with the above example, if the training corpus contains "I want to query the shipping order of XX company", according to the n-gram syntax and context-free grammar, it can be found that "XX company" and "shipping order" are both system vocabulary. The hidden Markov model algorithm can further determine the "shipment order" and the query shipment according to the relationship between all the identified system domain vocabularies in the query data operation intention, the probability of executing the instruction operation intention, and the system domain vocabulary. The intent of the data is related, and combined with the system field vocabulary of "XX company", users can automatically query the company's shipping related data in the enterprise database.

當建立完共通詞彙模型及共通語意模型後，接著進行步驟S320，利用自然語言處理模型分析語料資料，以產生與語料資料對應的語言特徵資訊，語言特徵資訊包含複數個意圖、意圖對應的機率以及複數個詞彙。步驟S320的細部流程請參考第5圖，第5圖係根據本案之一些實施例所繪示之步驟S320的流程圖。如第5圖所示，步驟S320包含以下步驟：步驟S321：利用共通詞彙模型辨識語料資料中是否具有符合關鍵實體集合中的系統領域詞彙，將辨識結果設定為語言特徵資訊中的詞彙，並分析語言特徵資訊中的詞彙出現的機率；步驟S322：根據特徵資訊中的詞彙分析語料資料的句型結構；以及步驟S323：利用共通語意模型根據特徵資訊中的詞彙出現的機率以及語料資料的句型結構辨識語料資料的意圖以及意圖對應的機率。 After the common vocabulary model and the common semantic model are established, step S320 is performed, and the corpus data is analyzed by using a natural language processing model to generate linguistic feature information corresponding to the corpus data. The linguistic feature information includes a plurality of intents and intent correspondence Probability and plural words. For the detailed flow of step S320, please refer to FIG. 5. FIG. 5 is a flowchart of step S320 according to some embodiments of the present invention. As shown in FIG. 5, step S320 includes the following steps: Step S321: use a common vocabulary model to identify whether the corpus data has a system domain vocabulary that matches the key entity set, set the recognition result to the vocabulary in the language feature information, and Analyze the probability of occurrence of vocabulary in the linguistic feature information; step S322: analyze the sentence structure of the corpus data based on the vocabulary in the feature information; and step S323: use a common semantic model based on the probability of vocabulary occurrence in the feature information and the corpus data The structure of the sentence recognizes the intent of the corpus and the probability of the intent.

於步驟S321及步驟S322中，利用共通詞彙模型辨識語料資料中是否具有符合關鍵實體集合中的系統領域詞彙，將辨識結果設定為語言特徵資訊中的詞彙，並分析語言特徵資訊中的詞彙出現的機率，再根據特徵資訊中的詞彙分析語料資料的句型結構。舉例而言，將使用者輸入的語料資料，利用共通詞彙模型將語料資料中含有系統領域詞彙的詞彙辨識出來，再進一步判斷出語料資料的句型結構。舉例而言，如果使用者對虛擬助理說：「我想要查XX公司上個月的出貨單」，根據共通詞彙模型可以辨識出「XX公司」、「上個月」及「出貨單」等符合系統領域詞彙的詞彙。 In step S321 and step S322, the common vocabulary model is used to identify whether the corpus data has vocabulary in the system domain in the key entity set, set the recognition result to the vocabulary in the language feature information, and analyze the occurrence of the vocabulary in the language feature information. Probability, and then analyze the sentence structure of the corpus data based on the vocabulary in the feature information. For example, the corpus data input by the user is used to identify the vocabulary in the corpus data that contains the vocabulary of the system domain, and then the sentence structure of the corpus data is further determined. For example, if the user says to the virtual assistant: "I want to check the shipping order of XX company last month", the common vocabulary model can identify "XX company", "last month" and "shipping order" "And other words that fit the vocabulary of the system domain.

於步驟S323中，利用共通語意模型根據特徵資訊中的詞彙出現的機率以及語料資料的句型結構辨識語料資料的意圖以及意圖對應的機率。根據上方的範例，辨識出「XX公司」、「上個月」及「出貨單」等詞彙後，會再進一步判斷這些詞彙在所有意圖中的機率。此處指的所有意圖包含所有查詢資料操作意圖以及執行指令操作意圖的機率。 In step S323, the common semantic model is used to identify the intention of the corpus data and the probability corresponding to the intent according to the probability of vocabulary appearance in the feature information and the sentence structure of the corpus data. According to the example above, after identifying the words "XX company", "last month", and "shipment order", we will further judge the probability of these words in all intents. All intents here include all query data operation intentions and the probability of performing instruction operation intentions.

於步驟S330中，依據職能情境資訊對語言特徵資訊進行職能情境分析，判斷該些意圖的其中之一對應的操作。在進行職能情境分析之前需先建立職能情境模型及職能詞彙模型，職能情境模型在進行職能情境分析時是先將歷史資料庫134中的資料所轉換成的特徵向量，然後會利用機器學習演算法將歷史資料庫134中的資料依據各種不同的情境分類後計算特徵向量與各情境之間的強弱關係，接著產生職能情境模型。適合建立上述職能情境的機器學習演算法包括：傳統機器學習常用的支援向量機(Support Vector Machine,SVM)，以及目前深度學習(Deep Learning)相關的卷積神經網路(Convolutional Neural Networks,CNN)、遞歸神經網路(Recurrent Neural Networks,RNN)和長短期記憶模型(Long Short-Term Memory,LSTM)等演算法。 In step S330, functional context analysis is performed on the linguistic feature information according to the functional context information to determine an operation corresponding to one of the intentions. Before performing the functional situation analysis, a functional situation model and a functional vocabulary model need to be established. When performing the functional situation analysis, the functional situation model first converts the feature vectors from the data in the historical database 134, and then uses machine learning algorithms. The data in the historical database 134 is classified according to various scenarios, and the strong and weak relationship between the feature vector and each scenario is calculated, and then a functional scenario model is generated. Machine learning algorithms suitable for establishing the above functional scenarios include: Support Vector Machines (SVMs) commonly used in traditional machine learning, and Convolutional Neural Networks (CNN) related to deep learning. , Recurrent Neural Networks (RNN) and Long Short-Term Memory (LSTM) algorithms.

承上述，職能詞彙模型係根據大量輸入的訓練語料利用隱馬爾可夫模型演算法分析後再進行斷詞處理，接著會統計分詞的出現頻率以產生分詞頻率表，進而建立職能詞彙模型。步驟S330的細部流程請參考第6圖，第6圖係根據本案之一些實施例所繪示之步驟S330的流程圖。如第6圖所示，步驟S330包含以下步驟：步驟S331：利用語料資料以及職能情境資訊與職能情境模型進行比對，並產生職能情境辨識結果；以及步驟S332：根據職能情境辨識結果判斷該些意圖的其中之一對應查詢資料操作及執行指令操作的其中之一。 Continuing the above, functional vocabulary models are based on a large amount of input training corpus using hidden Markov model algorithm analysis and then word segmentation processing. Then the frequency of word segmentation is counted to generate a word segmentation frequency table, and then a functional vocabulary model is established. For the detailed flow of step S330, please refer to FIG. 6, which is a flowchart of step S330 according to some embodiments of the present invention. As shown in FIG. 6, step S330 includes the following steps: step S331: using corpus data and functional context information to compare with the functional context model and generating a functional context identification result; and step S332: judging the result based on the functional context identification result One of these intentions corresponds to one of querying data operations and executing instruction operations.

於步驟S331中，利用語料資料以及職能情境資訊與職能情境模型進行比對，並產生職能情境辨識結果。職能情境資訊包含使用者的身份、使用者的職位、使用者的部門、時間以及地點。職能情境資訊的部分資訊可以由輸入/輸出裝置150所感測，例如可以偵測使用者目前的狀態(例如，是否出差回來)。根據前面辨識使用者語料資料後所得到的所有意圖對應的機率以及詞彙，再結合職能情境資訊可以進一步估算使用者的語料資料與訓練資料模型中的資料之相似程度，作為對應之意圖的機率。 In step S331, the corpus data and the functional context information are compared with the functional context model, and a functional context identification result is generated. Functional context information includes the user's identity, the user's position, the user's department, time, and location. Part of the information of the functional context information can be sensed by the input / output device 150, for example, it can detect the current status of the user (for example, whether or not he is back on a business trip). According to the probability and vocabulary of all intents obtained after identifying the user's corpus data, combined with the functional context information, the similarity between the user's corpus data and the data in the training data model can be further estimated as the corresponding intention. Chance.

於步驟S332中，根據職能情境辨識結果判斷該些意圖的其中之一對應查詢資料操作及執行指令操作的其中之一。由於在訓練資料模型中會有多個查詢資料操作意圖以及多個執行指令操作意圖，並且在經過前述的共通語意模型的計算後會產生每個意圖對應的機率，具有較低機率值的意圖可以利用門檻值過濾，以得到最有可能的意圖並確認對應的操作。由前述的範例可知，當辨識出「XX公司」、「上個月」及「出貨單」等詞彙後，會判斷這些詞彙搭配職能情境資訊找出最符合的查詢資料操作意圖或執行指令操作意圖，在經過上述操作後即會判斷出使用者對虛擬助理說：「我想要查XX公司上個月的出貨單」，最有可能會要查XX公司的出貨單，因此即可對應出使用者想要執行的是查詢資料操作。需要有職能情境的判斷是因為會因為使用者的職位、部門、操作時間、操作地點等資訊不同，而有不同的需求，舉例而言，採購人員與財務人員都會看[廠商每月統計表]，但是可能這兩者的[廠商每月統計表]的統計目標並不相同：一個是統計廠商的進貨狀況，另一個是統計自己公司付款給廠商的狀況。但使用者在與虛擬助理對話時不一定會明確指說需要什麼[廠商每月統計表]，可能只說：「我需要上個月的廠商每月統計表」這種簡單的句型，因此才更需要搭配使用者的職能情境資訊再進行進一步精準的判斷。 In step S332, it is determined that one of the intentions corresponds to one of a query data operation and an execution instruction operation according to the functional context recognition result. Because there will be multiple query data operation intentions and multiple instruction operation operation intentions in the training data model, and after the calculation of the common semantic model described above, the probability corresponding to each intent will be generated. Intents with lower probability values can be Use threshold filtering to get the most likely intent and confirm the corresponding operation. As can be seen from the foregoing example, when the words "XX company", "last month", and "shipment order" are identified, these words will be judged with the functional context information to find the most suitable query data operation intention or execute the command operation The intention is that after the above operation, the user will be judged that the user said to the virtual assistant: "I want to check the shipping order of XX company last month". Most likely, you will want to check the shipping order of XX company, so you can Corresponding to what the user wants to perform is a query data operation. The judgement of the functional situation is needed because the user's position, department, operation time, operation place and other information have different needs. For example, purchasers and financial staff will look at [vendor monthly statistics table] However, it is possible that the statistical goals of the [vendor monthly statistics table] of the two are not the same: one is to count the purchase status of the manufacturer, and the other is to count the payment status of the company to the manufacturer. However, when talking with the virtual assistant, the user may not necessarily specify exactly what the [vendor monthly statistics table] is needed. It may only say, "I need the monthly monthly statistics table of the manufacturer last month." It is even more necessary to make a more accurate judgment with the user's functional situation information.

於步驟S340中，如果職能情境分析無法判斷該些意圖的其中之一對應的操作，則針對語料資料進行分詞處理。步驟S340的細部流程請參考第7圖，第7圖係根據本案之一些實施例所繪示之步驟S340的流程圖。如第7圖所示，步驟S340包含以下步驟：步驟S341：根據職能詞彙模型對語料資料進行斷詞，以產生複數個分詞；以及步驟S342：計算該些分詞的頻率。 In step S340, if the functional context analysis cannot determine the operation corresponding to one of the intentions, the word segmentation processing is performed on the corpus data. For the detailed flow of step S340, please refer to FIG. 7, which is a flowchart of step S340 according to some embodiments of the present invention. As shown in FIG. 7, step S340 includes the following steps: step S341: segment the corpus data according to the functional vocabulary model to generate a plurality of segmentations; and step S342: calculate the frequency of the segmentations.

於步驟S341及步驟S342中，根據職能詞彙模型對語料資料進行斷詞，以產生複數個分詞；接著計算該些分詞的頻率。如果在步驟S330中職能情境分析無法判斷輸入的語料資料對應的操作時，就需要對語料資料進行分詞處理。首先，會根據先前預先建立好的職能詞彙模型中儲存的詞彙對語料資料進行斷詞，接著計算斷詞後產生的多個分詞的頻率。 In step S341 and step S342, the corpus is segmented according to the functional vocabulary model to generate a plurality of segmentations; then the frequency of the segmentations is calculated. If the functional context analysis cannot determine the operation corresponding to the input corpus data in step S330, the corpus data needs to be segmented. First, the corpus is segmented based on the vocabulary stored in the pre-established functional vocabulary model, and then the frequency of multiple segmentations generated after segmentation is calculated.

於步驟S350及步驟S360中，根據分詞處理後的結果，判斷是否存在新詞彙或新語料資料；如果存在新詞彙，根據新詞彙的意義更新自然語言處理模型，如果存在新語料資料，根據新語料資料的意圖更新職能情境分析。步驟S360的細部流程請參考第8圖，第8圖係根據本案之一些實施例所繪示之步驟S360的流程圖。如第8圖所示，步驟S360包含以下步驟：步驟S361：判斷分詞處理計算出的該些分詞的頻率是否低於門檻值；步驟S362：如果該些分詞的其中之一低於門檻值，該些分詞的其中之一則為新詞彙，並接收新詞彙的定義，以更新共通詞彙模型及共通語意模型；以及步驟S363：如果該些分詞均高於門檻值，則語料資料則為新語料資料，並接收新語料資料的意圖，以更新職能情境模型。 In step S350 and step S360, it is determined whether there is a new vocabulary or new corpus data according to the results of the word segmentation processing; if there is a new vocabulary, the natural language processing model is updated according to the meaning of the new vocabulary; if there is a new corpus data, the new corpus is Data intent update functional situation analysis. For the detailed flow of step S360, please refer to FIG. 8. FIG. 8 is a flowchart of step S360 according to some embodiments of the present invention. As shown in FIG. 8, step S360 includes the following steps: step S361: judging whether the frequency of the participles calculated by the word segmentation process is lower than a threshold value; step S362: if one of the participles is lower than the threshold value, the One of the participles is a new vocabulary and receives a definition of the new vocabulary to update the common vocabulary model and the common semantic model; and step S363: if the participles are higher than the threshold value, the corpus data is the new corpus data , And receive the intent of the new corpus material to update the functional context model.

於步驟S361及步驟S362中，判斷分詞處理計算出的該些分詞的頻率是否低於門檻值，如果該些分詞的其中之一低於門檻值，該些分詞的其中之一則為新詞彙，並接收新詞彙的定義，以更新共通詞彙模型及共通語意模型。於一實施例中，經過分詞處理計算完該些分詞的頻率後，將低於門檻值的分詞設定為新詞彙，虛擬助理會詢問使用者新詞彙的定義，並將新詞彙以及新詞彙的定義一起存入共通詞彙模型及共通語意模型中。舉例而言，使用者輸入的語料是「我想找XX公司的聯絡人」，而如果虛擬助理無法判斷「我想找XX公司的聯絡人」的意義，會在分詞處理後分出「我」、「想找」、「XX公司」、「的」、「聯絡人」等詞彙，如果「XX公司」低於門檻值，虛擬助理會詢問使用者「XX公司」是什麼意思，接著將使用者的回答及「XX公司」一起存入共通詞彙模型及共通語意模型；而新詞彙也需要一起存入系統領域詞彙集合中，與所有人共用。 In step S361 and step S362, it is determined whether the frequency of the participles calculated by the segmentation process is lower than the threshold value. If one of the participles is lower than the threshold value, one of the participles is a new word, and Receive new vocabulary definitions to update common vocabulary models and common semantic models. In one embodiment, after calculating the frequency of the segmentation after segmentation processing, set the segmentation below the threshold as a new vocabulary, the virtual assistant will ask the user for the definition of the new vocabulary, and define the new vocabulary and the definition of the new vocabulary. Stored in the common vocabulary model and common semantic model together. For example, the corpus entered by the user is "I want to find the contact person of XX company", and if the virtual assistant cannot determine the meaning of "I want to find the contact person of XX company", it will sort out "I "," Want to find "," XX company "," of "," contact "and other words. If" XX company "is lower than the threshold, the virtual assistant will ask the user what" XX company "means, and then use The answer of the author and the "XX company" are stored in the common vocabulary model and the common semantic model; the new vocabulary also needs to be stored in the system domain vocabulary collection and shared with everyone.

於步驟S363中，如果該些分詞均高於門檻值，則語料資料則為新語料資料，並接收新語料資料的意圖，以更新職能情境模型。接續上方「我想找XX公司的聯絡人」的範例，在分詞處理後分出「我」、「想找」、「XX公司」、「的」、「聯絡人」等詞彙，如果都沒有詞彙低於門檻值，表示虛擬助理不理解的是語料的意圖，有可能在訓練智慧助理時的訓練語料都是關於「幫我查XX公司的聯絡人」的敘述，因此虛擬助理就會無法理解「我想找XX公司的聯絡人」的意圖，而虛擬助理就需要再詢問使用者「我想找XX公司的聯絡人」是什麼意思，接著將使用者的回答及「我想找XX公司的聯絡人」的新語料一起存入職能情境模型。在存入職能模型之前需要再判斷新語料是否為共通語料，如果是的話則代表其他人在使用虛擬助理時也會使用到新語料，因此需要將新語料存入系統領域詞彙集合，讓所有人共用；但如果不是的話則代表新語料只是使用者本身的說話習慣而有的不同的用語，因此只需要更新職能情境模型即可，不需要再更新系統領域詞彙集合。 In step S363, if the participles are higher than the threshold, the corpus data is the new corpus data, and the intent of the new corpus data is received to update the functional context model. Continuing the example of "I want to find the contact person of XX company" above, after the word segmentation processing, the words "I", "want to find", "XX company", "of", and "contact person" are separated. If there is no vocabulary Below the threshold value, it means that the virtual assistant does not understand the intention of the corpus. It is possible that the training corpus when training the smart assistant is all about “help me check the contacts of XX company”, so the virtual assistant will not be able to Understand the intent of "I want to find the contact person of XX company", and the virtual assistant needs to ask the user what "I want to find the contact person of XX company", and then the user's response and "I want to find the contact person of XX company" The new corpus of "contacts" is stored in the functional situation model. Before storing the functional model, you need to determine whether the new corpus is a common corpus. If it is, it means that other people will also use the new corpus when using the virtual assistant. Therefore, the new corpus needs to be stored in the system domain vocabulary collection, so that all People share it; but if it is not, it means that the new corpus is just a different term for the user ’s own speaking habits, so only the functional context model needs to be updated, and there is no need to update the system domain vocabulary set.

由上述本案之實施方式可知，主要係讓虛擬助理具有自動學習的功能，讓虛擬助理可以在與使用者交流的過程中，如果有智慧助理不懂的詞彙可以在詢問使用者過後，更新虛擬助理的資料庫，使得虛擬助理可以自動學習到使用者的說話習慣，或是行業中的特殊用語用詞，達到讓使用者使用ERP系統是能夠更快速便利的功效。 It can be known from the implementation of the above-mentioned case that the virtual assistant is mainly provided with an automatic learning function, so that the virtual assistant can communicate with the user. If there is a vocabulary that the smart assistant does not understand, you can update the virtual assistant after asking the user The database allows the virtual assistant to automatically learn the user's speaking habits or the special terms in the industry, so that users can use the ERP system more quickly and conveniently.

另外，上述例示包含依序的示範步驟，但該些步驟不必依所顯示的順序被執行。以不同順序執行該些步驟皆在本揭示內容的考量範圍內。在本揭示內容之實施例的精神與範圍內，可視情況增加、取代、變更順序及/或省略該些步驟。 In addition, the above-mentioned illustration includes sequential exemplary steps, but the steps need not be performed in the order shown. It is within the scope of this disclosure to perform these steps in different orders. Within the spirit and scope of the embodiments of the present disclosure, these steps may be added, replaced, changed, and / or omitted as appropriate.

雖然本案已以實施方式揭示如上，然其並非用以限定本案，任何熟習此技藝者，在不脫離本案之精神和範圍內，當可作各種之更動與潤飾，因此本案之保護範圍當視後附之申請專利範圍所界定者為準。 Although this case has been disclosed as above in the form of implementation, it is not intended to limit the case. Any person skilled in this art can make various modifications and retouches without departing from the spirit and scope of the case. Therefore, the scope of protection of this case should be considered after The attached application patent shall prevail.

Claims

An automatic learning method for a virtual assistant includes: receiving an audio input and identifying the audio to form a corpus data; analyzing the corpus data using a natural language processing model to generate a language feature information corresponding to the corpus data , Where the linguistic feature information includes a plurality of intentions, the probability corresponding to the intents, and a plurality of words; according to a functional context information, a functional context analysis is performed on the linguistic feature information to determine an operation corresponding to one of the intentions ; If the functional context analysis cannot determine the operation corresponding to one of the intentions, a word segmentation process is performed on the corpus data; based on the result of the word segmentation process, determine whether a new vocabulary or a new corpus data exists ; If the new vocabulary exists, update the natural language processing model according to the meaning of the new vocabulary; if the new corpus data exists, update the functional context analysis according to the intention of the new corpus data; wherein the operation includes a query data operation And one of performing an instruction operation.

The automatic learning method for a virtual assistant according to claim 1, further comprising: generating a system domain vocabulary set according to an application knowledge database and a domain knowledge database; the system domain vocabulary set and a plurality of service application parameters are formed into one A set of key entities, which includes a plurality of system domain vocabularies; classify a plurality of training corpora into one of the query data operation and the execution instruction operation; distinguish the corresponding query data according to the category in the enterprise database The intentions of the training corpora that are operated form a plurality of query data operation intentions, and the intention of the training corpora that should perform the instruction operation is formed according to the service behavior provided by the enterprise resource system to form a plurality of execution instruction operation intentions; establish The query data operation intention template and the execution instruction operation intention template; the overall database is established according to the set of key entities, the query data operation intention template and the execution instruction operation intention template; identifying the System domain words in the set of key entities A plurality of first chances appearing in the training corpora, and analyzing the sentence structure of the training corpora through the identified system domain vocabularies, and the complex numbers of the system domain vocabularies among each other Correlations, and build a common vocabulary model based on the first probabilities and the correlations; and analyze the query data operation intentions and the plurality of second chances of the system domain words appearing in the execution instruction operation intentions , And build a common semantic model based on the sentence structure and the second chances.

The automatic learning method for the virtual assistant according to claim 2, further comprising: using a classifier to classify the data in a historical database to generate a functional context model; and segmenting the training corpus And analyze, and generate a functional vocabulary model based on the data in the historical database.

The automatic learning method for a virtual assistant according to claim 3, wherein the functional context analysis further comprises: using the corpus data and the functional context information to compare with the functional context model and generating a functional context identification result; and According to the result of the functional context identification, it is determined that one of the intentions corresponds to one of the data query operation and the execution instruction operation.

The automatic learning method for a virtual assistant according to claim 4, wherein the word segmentation processing further comprises: segmenting the corpus data according to the functional vocabulary model to generate a plurality of segmentation words; and calculating the frequency of the segmentation words.

The automatic learning method for a virtual assistant according to claim 5, further comprising: determining whether the frequency of the participles calculated by the segmentation processing is lower than a threshold value; if one of the participles is lower than the threshold value, One of the participles is the new vocabulary, and the definition of the new vocabulary is received to update the common vocabulary model and the common semantic model; if the participles are above the threshold, the corpus data is The new corpus material, and receive the intent of the new corpus material to update the functional context model.

The automatic learning method for a virtual assistant according to claim 6, further comprising: judging whether the new corpus data is a common corpus, and if so, updating the vocabulary set of the system domain according to the new corpus data; and updating the vocabulary according to the new vocabulary; System domain vocabulary collection.

The automatic learning method for a virtual assistant according to claim 2, wherein the analysis of the corpus data by the natural language processing model further comprises: using the common vocabulary model to identify whether the corpus data has those that match the key entity set. Vocabulary in the system domain, set the recognition result to the vocabulary, and analyze the probability of the vocabulary appearing; analyze the sentence structure of the corpus data according to the vocabulary; and use the common semantic model according to the probability of the vocabulary appearing and The sentence structure of the corpus data identifies the intentions of the corpus data and the probability corresponding to the intents.

An automatic learning system for a virtual assistant, which is respectively connected to an enterprise database and an enterprise resource system and includes: a processor; a storage device electrically connected to the processor for storing an overall database and an application knowledge A database, a domain knowledge database, and a historical database; an input / output device electrically connected to the processor to provide an interface for inputting an audio; wherein the processor includes: a speech recognition module A group for identifying the audio to form a corpus data; a corpus analysis module electrically connected to the speech recognition module for analyzing the corpus data using a natural language processing model to generate the corpus data A linguistic feature information corresponding to the source data, wherein the linguistic feature information includes a plurality of intents, a probability corresponding to the intents, and a plurality of vocabularies; a context recognition module is electrically connected to the corpus analysis module to A functional contextual information performs a functional contextual analysis of the linguistic feature information to determine an operation corresponding to one of the intentions; an unknown corpus judgment A module electrically connected to the context recognition module for performing a word segmentation process on the corpus data when the situation recognition module cannot recognize the operation corresponding to one of the intentions, and according to the word segmentation The processed result determines whether there is a new vocabulary or a new corpus data; and an updated information module, which is electrically connected to the unknown corpus determination module, and is used to generate a new vocabulary based on the new vocabulary. Update the natural language processing model, and when the new corpus data is generated, update the functional context analysis according to the intention of the new corpus data; wherein the operation includes one of a query data operation and an execution instruction operation.

According to the automatic learning system of the virtual assistant according to claim 9, the processor further comprises: a training module, which is electrically connected to the corpus analysis module, according to the application knowledge database and the field knowledge database A system domain vocabulary set is generated, and the system domain vocabulary set and a plurality of service application parameters are formed into a key entity set, the key entity set includes a plurality of system domain vocabularies, and a plurality of training corpora are classified into the query data operation and One of the execution instruction operations, according to the category in the enterprise database, distinguishes the intention of the training corpus corresponding to the query data operation to form a plurality of query data operation intentions, and distinguishes according to the service behavior provided by the enterprise resource system. Forming a plurality of execution instruction operation intentions for the intentions of the training corpus that should execute the instruction operations; a template creation module, which is electrically connected to the training module, establishes the query data operation intention templates, and the execution The template of the instruction operation intention, according to the set of key entities, the operation intention of the query data The template and the templates for executing the instruction operation intention establish the overall database; a vocabulary model building module is electrically connected with the template building module to identify the system domain words in the key entity set in the training languages The first probabilities appearing in the data, and analyze the plural sentence structure of the training corpora through the identified system domain vocabularies, and the plurality of correlations between the system domain vocabularies with each other, and Establish a common vocabulary model according to the first chances and the correlations; and a semantic model establishment module, which is electrically connected to the template establishment module, analyzes the query data operation intentions and the execution instruction operation intentions A plurality of second probabilities of the vocabularies of the system domains appear, and a common semantic model is established according to the sentence structure and the second probabilities.

The automatic learning system for a virtual assistant according to claim 10, wherein the processor further includes: a situation training module, which is electrically connected to the situation analysis module, and is configured to use a classifier to store the historical data in the historical database. Classify the relationship strength of the data to generate a functional context model; and a vocabulary training module that is electrically connected to the unknown corpus judgment module to perform word segmentation and analysis on the training corpus and based on the historical data The data in the database produces a functional vocabulary model.

The automatic learning system for a virtual assistant according to claim 11, wherein the context analysis module is further configured to compare the functional context information and the functional context information with the functional context model and generate a functional context identification result, And it is judged according to the result of the functional context identification that one of the intentions corresponds to one of the data query operation and the execution instruction operation.

The automatic learning system for a virtual assistant according to claim 12, wherein the unknown corpus judgment module is further configured to perform word segmentation on the corpus data according to the functional vocabulary model to generate a plurality of participles to calculate the participles. Frequency of.

The automatic learning system for a virtual assistant according to claim 13, wherein the update information module is further configured to determine whether the frequency of the participles calculated by the segmentation processing is lower than a threshold value; if one of the participles is Below the threshold, one of the participles is the new vocabulary, and the definition of the new vocabulary is received to update the common vocabulary model and the common semantic model; if the participles are above the threshold, then The corpus data is the new corpus data, and receives the intention of the new corpus data to update the functional context model.

The automatic learning system for a virtual assistant according to claim 14, wherein the update information module is further configured to determine whether the new corpus data is a common corpus, and if so, update the system domain vocabulary set according to the new corpus data; and The system domain vocabulary set is updated based on the new vocabulary.

The automatic learning system for a virtual assistant according to claim 10, wherein the corpus analysis module is further configured to use the common vocabulary model to identify whether the corpus data has vocabularies corresponding to the system domain in the key entity set, Set the recognition result to the words, and analyze the occurrence probability of the words, analyze the sentence structure of the corpus data according to the words, and use the common semantic model according to the occurrence probability of the words and the corpus data The structure of the sentence pattern identifies the intentions of the corpus data and the probability of the intentions.