TWI674530B - Method and system for operating a virtual assistant - Google Patents

Method and system for operating a virtual assistant Download PDF

Info

Publication number
TWI674530B
TWI674530B TW107105340A TW107105340A TWI674530B TW I674530 B TWI674530 B TW I674530B TW 107105340 A TW107105340 A TW 107105340A TW 107105340 A TW107105340 A TW 107105340A TW I674530 B TWI674530 B TW I674530B
Authority
TW
Taiwan
Prior art keywords
corpus
data
database
vocabulary
module
Prior art date
Application number
TW107105340A
Other languages
Chinese (zh)
Other versions
TW201935229A (en
Inventor
周忠信
吳兆麟
許旭正
Original Assignee
鼎新電腦股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 鼎新電腦股份有限公司 filed Critical 鼎新電腦股份有限公司
Priority to TW107105340A priority Critical patent/TWI674530B/en
Publication of TW201935229A publication Critical patent/TW201935229A/en
Application granted granted Critical
Publication of TWI674530B publication Critical patent/TWI674530B/en

Links

Abstract

一種虛擬助理的方法,包含接收音訊輸入並辨識音訊以形成語料資料;利用自然語言處理模型分析語料資料,以產生與語料資料對應的語言特徵資訊;依據職能情境資訊對語言特徵資訊進行職能情境分析,確認該些意圖的其中之一對應的操作;從總體資料庫中找出操作的範本,並將操作對應的詞彙輸入操作的範本以形成操作對應的指令;針對企業資料庫或企業資源系統執行操作對應的指令並產生操作對應的結果;以及輸出操作對應的結果。 A virtual assistant method includes receiving audio input and identifying audio to form corpus data; using natural language processing models to analyze corpus data to generate linguistic feature information corresponding to the corpus data; and performing linguistic feature information based on functional context information Functional situation analysis to confirm the operation corresponding to one of these intents; find out the operation template from the overall database, and enter the vocabulary corresponding to the operation into the operation template to form the operation corresponding instruction; for the enterprise database or enterprise The resource system executes an instruction corresponding to the operation and generates a result corresponding to the operation; and outputs a result corresponding to the operation.

Description

操作虛擬助理的方法及系統 Method and system for operating virtual assistant

本案是有關於一種語音辨識及處理的方法及系統,且特別是有關於一種操作虛擬助理的方法及系統。 This case relates to a method and system for speech recognition and processing, and in particular to a method and system for operating a virtual assistant.

企業資源規劃系統(Enterprise Resource Planning,ERP),簡稱ERP系統,是指建立在資訊技術的基礎上為企業決策層提供決策的管理平台。其主要是將企業的人流、物流、資訊流、資金流進行統一的管理,以最大限度的利用企業的資源。而ERP系統包含有生產控制、物流管理和財務管理等三大方面的功能,因此ERP系統規模非常的龐大。 Enterprise resource planning system (Enterprise Resource Planning, ERP), referred to as ERP system, refers to a management platform that provides decision-making for the enterprise decision-making layer based on information technology. Its main purpose is to uniformly manage the flow of people, logistics, information, and funds in order to maximize the use of corporate resources. The ERP system contains three major functions, such as production control, logistics management, and financial management. Therefore, the scale of the ERP system is very large.

在現代化生活中,虛擬助理(或虛擬助理)能夠幫助使用者直接以口頭/或文字形式的自然語言與電子產品溝通,提供使用者更方便快速的溝通模式。而將虛擬助理應用於ERP系統中,更可以快速的幫助使用者與龐大的ERP系統交流,能夠節省使用者在使用ERP系統中所花的時間,以及因ERP系統功能繁瑣可能會造成使用者難以查詢的問題。 In modern life, virtual assistants (or virtual assistants) can help users communicate with electronic products directly in verbal and / or textual natural language, providing users with a more convenient and fast communication mode. The use of virtual assistants in ERP systems can help users quickly communicate with the large ERP system, which can save users the time spent in using the ERP system, and the complexity of the ERP system may make it difficult for users. Query issues.

本發明之主要目的係在提供一種操作虛擬助理的方法及系統,其主要係改進以往ERP系統龐大繁雜造成使用者使用不易的問題,利用虛擬助理更可以協助使用者以更方便的方式使用ERP系統,達到節省時間以及降低使用ERP系統困難度的功效。 The main purpose of the present invention is to provide a method and a system for operating a virtual assistant, which mainly improve the problem that the previous ERP system is huge and complicated, which makes it difficult for the user to use. Using the virtual assistant can further assist the user to use the ERP system in a more convenient way. To achieve the effect of saving time and reducing the difficulty of using ERP systems.

為達成上述目的,本案之第一態樣是在提供一種操作虛擬助理的方法,此方法包含以下步驟:接收音訊輸入並辨識音訊以形成語料資料;利用自然語言處理模型分析語料資料,以產生與語料資料對應的語言特徵資訊,其中語言特徵資訊包含複數個意圖、該些意圖對應的機率以及複數個詞彙;依據職能情境資訊對語言特徵資訊進行職能情境分析,確認該些意圖的其中之一對應的操作;從總體資料庫中找出操作的範本,並將操作對應的詞彙輸入操作的範本以形成操作對應的指令;針對企業資料庫或企業資源系統執行操作對應的指令並產生操作對應的結果;以及輸出操作對應的結果;其中,操作包含查詢資料操作及執行指令操作的其中之一。 In order to achieve the above purpose, the first aspect of the present case is to provide a method for operating a virtual assistant. The method includes the following steps: receiving audio input and identifying the audio to form a corpus data; using a natural language processing model to analyze the corpus data, and Generate linguistic feature information corresponding to the corpus data, where the linguistic feature information includes multiple intents, the probability corresponding to those intents, and multiple vocabularies; perform a functional context analysis of the linguistic feature information based on the functional context information to confirm which of these intents One of the corresponding operations; find the template of the operation from the overall database, and enter the vocabulary of the operation into the template of the operation to form the instruction corresponding to the operation; execute the instruction corresponding to the operation and generate the operation for the enterprise database or enterprise resource system A corresponding result; and a result corresponding to the output operation; wherein the operation includes one of a query data operation and a command execution operation.

本案之第二態樣是在提供一種操作虛擬助理的系統,分別與企業資料庫及企業資源系統連接,其包含:處理器、儲存裝置以及輸入/輸出裝置。儲存裝置電性連接至處理器,用以儲存總體資料庫、應用知識資料庫、領域知識資料庫以及歷史資料庫。輸入/輸出裝置電性連接至處理器,用以提供介面以供輸入音訊。其中,處理器包含:語音辨識模組、語料分析模組、情境分析模組、指令產生模組以及操作執行模組。語音辨識模組用以辨識音訊以形成語料資料。語料分析模組與語音辨識模組電性連接,用以利用自然語言處理模型分析語料資料,以產生與語料資料對應的語言特徵資訊,其中語言特徵資訊包含複數個意圖、該些意圖對應的機率以及複數個詞彙。情境分析模組與語料分析模組電性連接,用以依據職能情境資訊對語言特徵資訊進行職能情境分析,確認該些意圖的其中之一對應的操作。指令產生模組與情境分析模組電性連接,用以從總體資料庫中找出操作的範本,並將操作對應的詞彙輸入操作的範本以形成操作對應的指令。操作執行模組與指令產生模組電性連接,用以針對企業資料庫或企業資源系統執行操作對應的指令並產生操作對應的結果,並輸出操作對應的結果至輸入/輸出裝置;其中,該操作包含一查詢資料操作及一執行指令操作的其中之一。 The second aspect of the present case is to provide a system for operating a virtual assistant, which is connected to an enterprise database and an enterprise resource system, respectively, and includes a processor, a storage device, and an input / output device. The storage device is electrically connected to the processor, and is used for storing a general database, an application knowledge database, a domain knowledge database, and a historical database. The input / output device is electrically connected to the processor to provide an interface for inputting audio. The processor includes a speech recognition module, a corpus analysis module, a situation analysis module, an instruction generation module, and an operation execution module. The speech recognition module is used to recognize audio to form corpus data. The corpus analysis module and the speech recognition module are electrically connected to analyze the corpus data using a natural language processing model to generate linguistic feature information corresponding to the corpus data, where the linguistic feature information includes a plurality of intents, the intents Corresponding probability and plural words. The situation analysis module and the corpus analysis module are electrically connected to perform a function situation analysis on the linguistic feature information according to the function situation information to confirm an operation corresponding to one of the intentions. The instruction generation module is electrically connected with the situation analysis module, and is used to find the operation template from the overall database, and input the vocabulary corresponding to the operation to the operation template to form the operation corresponding instruction. The operation execution module and the instruction generation module are electrically connected to execute an operation corresponding instruction and generate an operation corresponding result for an enterprise database or an enterprise resource system, and output the operation corresponding result to an input / output device; wherein, the The operation includes one of a query data operation and an execution instruction operation.

本發明之操作虛擬助理的方法及操作虛擬助理的系統主要係改進以往ERP系統龐大繁雜造成使用者使用不易的問題,結合虛擬助理更可以協助使用者以更方便的方式使用ERP系統,達到節省時間以及降低使用ERP系統困難度的功效。 The method and system for operating a virtual assistant of the present invention mainly improve the problem that the previous ERP system is huge and complicated, which makes it difficult for the user to use. Combining the virtual assistant can help the user to use the ERP system in a more convenient way, thereby saving time. And the effect of reducing the difficulty of using the ERP system.

100‧‧‧操作虛擬助理的系統 100‧‧‧ system for operating virtual assistants

110‧‧‧處理器 110‧‧‧ processor

130‧‧‧儲存裝置 130‧‧‧Storage device

150‧‧‧輸入/輸出裝置 150‧‧‧ input / output device

131‧‧‧總體資料庫 131‧‧‧Overall Database

132‧‧‧應用知識資料庫 132‧‧‧Application Knowledge Database

133‧‧‧領域知識資料庫 133‧‧‧ Domain Knowledge Database

134‧‧‧歷史資料庫 134‧‧‧History database

111‧‧‧語音辨識模組 111‧‧‧Speech recognition module

112‧‧‧語料分析模組 112‧‧‧ Corpus Analysis Module

113‧‧‧情境分析模組 113‧‧‧Scenario Analysis Module

114‧‧‧指令產生模組 114‧‧‧Command generation module

115‧‧‧操作執行模組 115‧‧‧operation execution module

121‧‧‧訓練模組 121‧‧‧ Training Module

122‧‧‧範本建立模組 122‧‧‧Template Creation Module

123‧‧‧語意模型建立模組 123‧‧‧ Semantic model building module

124‧‧‧詞彙模型建立模組 124‧‧‧vocabulary model building module

125‧‧‧情境訓練模組 125‧‧‧Scenario Training Module

300‧‧‧操作虛擬助理的方法 300‧‧‧ Method for operating virtual assistant

710‧‧‧使用者 710‧‧‧users

720‧‧‧企業資源系統 720‧‧‧Enterprise Resource System

730‧‧‧企業資料庫 730‧‧‧Enterprise Database

S310~S350、S410~S480、S231~S322、S231~S322、S331~S332‧‧‧步驟 S310 ~ S350, S410 ~ S480, S231 ~ S322, S231 ~ S322, S331 ~ S332‧‧‧Steps

為讓本發明之上述和其他目的、特徵、優點與實施例能更明顯易懂,所附圖式之說明如下: 第1圖係根據本案之一些實施例所繪示之一種操作虛擬助理的系統的示意圖;第2圖係根據本案之一些實施例所繪示之處理器的示意圖;第3圖係根據本案之一些實施例所繪示之一種操作虛擬助理的方法的流程圖;第4圖係根據本案之一些實施例所繪示之訓練資料模型的流程圖;第5圖係根據本案之一些實施例所繪示之步驟S320的流程圖;第6圖係根據本案之一些實施例所繪示之步驟S330的流程圖;以及第7圖係根據本案之一些實施例所繪示之一種使用者與操作虛擬助理的系統互動的示意圖。 In order to make the above and other objects, features, advantages, and embodiments of the present invention more comprehensible, the description of the drawings is as follows: FIG. 1 is a schematic diagram of a system for operating a virtual assistant according to some embodiments of the present case; FIG. 2 is a schematic diagram of a processor according to some embodiments of the present case; and FIG. 3 is a view of some processors according to the present case. A flowchart of a method for operating a virtual assistant as shown in the embodiments; FIG. 4 is a flowchart of a training data model according to some embodiments of the present case; and FIG. 5 is a diagram according to some embodiments of the present case. The flowchart of step S320; FIG. 6 is a flowchart of step S330 according to some embodiments of the present case; and FIG. 7 is a diagram of a user and operating a virtual assistant according to some embodiments of the present case Schematic representation of system interaction.

以下揭示提供許多不同實施例或例證用以實施本發明的不同特徵。特殊例證中的元件及配置在以下討論中被用來簡化本揭示。所討論的任何例證只用來作解說的用途,並不會以任何方式限制本發明或其例證之範圍和意義。此外,本揭示在不同例證中可能重複引用數字符號且/或字母,這些重複皆為了簡化及闡述,其本身並未指定以下討論中不同實施例且/或配置之間的關係。 The following disclosure provides many different embodiments or illustrations to implement different features of the invention. The elements and configurations in the particular example are used in the following discussion to simplify the present disclosure. Any illustrations discussed are for illustrative purposes only and do not in any way limit the scope and meaning of the invention or its illustrations. In addition, the present disclosure may repeatedly refer to numerical symbols and / or letters in different examples, and these repetitions are for simplification and explanation, and do not themselves specify the relationship between different embodiments and / or configurations in the following discussion.

在全篇說明書與申請專利範圍所使用之用詞 (terms),除有特別註明外,通常具有每個用詞使用在此領域中、在此揭露之內容中與特殊內容中的平常意義。某些用以描述本揭露之用詞將於下或在此說明書的別處討論,以提供本領域技術人員在有關本揭露之描述上額外的引導。 Terms used throughout the specification and patent application (Terms), unless otherwise specified, usually have the ordinary meaning of each term used in this field, in the content disclosed here, and in special content. Certain terms used to describe this disclosure are discussed below or elsewhere in this specification to provide additional guidance to those skilled in the art on the description of this disclosure.

關於本文中所使用之『耦接』或『連接』,均可指二或多個元件相互直接作實體或電性接觸,或是相互間接作實體或電性接觸,而『耦接』或『連接』還可指二或多個元件相互操作或動作。 As used herein, "coupling" or "connection" can mean that two or more components make direct physical or electrical contact with each other, or indirectly make physical or electrical contact with each other, and "coupling" or " "Connected" may also mean that two or more elements operate or act on each other.

在本文中,使用第一、第二與第三等等之詞彙,是用於描述各種元件、組件、區域、層與/或區塊是可以被理解的。但是這些元件、組件、區域、層與/或區塊不應該被這些術語所限制。這些詞彙只限於用來辨別單一元件、組件、區域、層與/或區塊。因此,在下文中的一第一元件、組件、區域、層與/或區塊也可被稱為第二元件、組件、區域、層與/或區塊,而不脫離本發明的本意。如本文所用,詞彙『與/或』包含了列出的關聯項目中的一個或多個的任何組合。本案文件中提到的「及/或」是指表列元件的任一者、全部或至少一者的任意組合。 In this article, the terms first, second, third, etc. are used to describe various elements, components, regions, layers, and / or blocks that are understandable. However, these elements, components, regions, layers and / or blocks should not be limited by these terms. These terms are limited to identifying single elements, components, regions, layers, and / or blocks. Therefore, a first element, component, region, layer, and / or block in the following may also be referred to as a second element, component, region, layer, and / or block without departing from the intention of the present invention. As used herein, the term "and / or" includes any combination of one or more of the associated listed items. The "and / or" mentioned in this document refers to any, all or any combination of at least one of the listed elements.

請參閱第1圖。第1圖係根據本案之一些實施例所繪示之一種操作虛擬助理的系統100的示意圖。如第1圖所繪示,操作虛擬助理的系統100包含處理器110、儲存裝置130以及輸入/輸出裝置150。儲存裝置130用以儲存總體資料庫131、應用知識資料庫132、領域知識資料庫133以及歷史資料庫134,儲存總體資料庫131、應用知識資料庫 132、領域知識資料庫133以及歷史資料庫134電性連接至處理器110。輸入/輸出裝置150電性連接至處理器110,用以提供介面以供輸入音訊。於一實施例中,輸入/輸出裝置150可以是鍵盤、觸控式螢幕、麥克風、喇叭或其它合適的輸入/輸出裝置。使用者可透過輸入/輸出裝置提供的介面輸入音訊或是得到操作虛擬助理的系統100輸出的結果。 See Figure 1. FIG. 1 is a schematic diagram of a system 100 for operating a virtual assistant according to some embodiments of the present invention. As shown in FIG. 1, the system 100 for operating a virtual assistant includes a processor 110, a storage device 130, and an input / output device 150. The storage device 130 is configured to store a general database 131, an application knowledge database 132, a domain knowledge database 133, and a history database 134, and store the general database 131 and the application knowledge database. 132. The domain knowledge database 133 and the history database 134 are electrically connected to the processor 110. The input / output device 150 is electrically connected to the processor 110 for providing an interface for inputting audio. In one embodiment, the input / output device 150 may be a keyboard, a touch screen, a microphone, a speaker, or other suitable input / output devices. The user can input audio through the interface provided by the input / output device or obtain the result output by the system 100 operating the virtual assistant.

於本發明各實施例中,處理器110可以實施為積體電路如微控制單元(microcontroller)、微處理器(microprocessor)、數位訊號處理器(digital signal processor)、特殊應用積體電路(application specific integrated circuit,ASIC)、邏輯電路或其他類似元件或上述元件的組合。儲存裝置150可以實施為記憶體、硬碟、隨身碟、記憶卡等。 In various embodiments of the present invention, the processor 110 may be implemented as an integrated circuit such as a microcontroller, a microprocessor, a digital signal processor, and an application specific integrated circuit. integrated circuit (ASIC), logic circuit, or other similar components or a combination of the above. The storage device 150 may be implemented as a memory, a hard disk, a flash drive, a memory card, and the like.

請參閱第2圖,第2圖係根據本案之一些實施例所繪示之一種處理器110的示意圖。處理器110包含語音辨識模組111、語料分析模組112、情境分析模組113、指令產生模組114、操作執行模組115、訓練模組121、範本建立模組122、語意模型建立模組123、詞彙模型建立模組124以及情境訓練模組125。語料分析模組112與語音辨識模組111電性連接,情境分析模組113與語料分析模組112電性連接,指令產生模組114與情境分析模組113電性連接,操作執行模組115與指令產生模組114電性連接。訓練模組121與語料分析模組112電性連接,範本建立模組122與訓練模組121電性連接,語意模型建立模組123以及詞彙模型建立模組124與範本建立模組122電性連接,情境訓練模組125與情境分析模組11電性連接。 Please refer to FIG. 2. FIG. 2 is a schematic diagram of a processor 110 according to some embodiments of the present invention. The processor 110 includes a speech recognition module 111, a corpus analysis module 112, a situation analysis module 113, an instruction generation module 114, an operation execution module 115, a training module 121, a template creation module 122, and a semantic model creation module. Group 123, vocabulary model building module 124, and situation training module 125. The corpus analysis module 112 is electrically connected to the speech recognition module 111, the context analysis module 113 is electrically connected to the corpus analysis module 112, the instruction generation module 114 is electrically connected to the context analysis module 113, and the operation execution module The group 115 is electrically connected to the command generating module 114. The training module 121 is electrically connected to the corpus analysis module 112. The template creation module 122 is electrically connected to the training module 121. The semantic model creation module 123 and the vocabulary model creation module 124 and the template creation module 122 are electrically connected. Connected, the situation training module 125 and the situation analysis module 11 are electrically connected.

請一併參閱第1圖~第3圖。第3圖係根據本案之一些實施例所繪示之一種操作虛擬助理的方法300的流程圖。如第3圖所示,操作虛擬助理的方法300包含以下步驟:步驟S310:接收音訊輸入並辨識音訊以形成語料資料;步驟S320:利用自然語言處理模型分析語料資料,以產生與語料資料對應的語言特徵資訊;步驟S330:依據職能情境資訊對語言特徵資訊進行職能情境分析,確認該些意圖的其中之一對應的操作;步驟S340:從總體資料庫中找出操作的範本,並將操作對應的詞彙輸入操作的範本以形成操作對應的指令;步驟S350:針對企業資料庫或企業資源系統執行操作對應的指令並產生操作對應的結果;以及步驟S360:輸出操作對應的結果。 Please refer to Figures 1 to 3 together. FIG. 3 is a flowchart of a method 300 for operating a virtual assistant according to some embodiments of the present invention. As shown in FIG. 3, the method 300 for operating a virtual assistant includes the following steps: Step S310: receiving audio input and identifying the audio to form corpus data; step S320: analyzing the corpus data using a natural language processing model to generate and corpus Step S330: Perform a functional situation analysis on the linguistic feature information according to the functional situation information to confirm the operation corresponding to one of the intents; Step S340: find the operation template from the overall database, and Input the vocabulary corresponding to the operation into the operation template to form the operation corresponding instruction; step S350: execute the operation corresponding instruction and generate the result corresponding to the operation for the enterprise database or the enterprise resource system; and step S360: output the result corresponding to the operation.

於步驟S310中,接收音訊輸入並辨識音訊以形成語料資料。於一實施例中,經由輸入/輸出裝置150接收到的音訊可以由處理器110的語音辨識模組111進行語音辨識,將使用者的自然語言轉換為語料資料。於另一實施例中,語音辨識也可以藉由網際網路將音訊傳送至雲端語音辨識系統,經由雲端語音辨識系統辨識音訊後,再將辨識結果 作為語料資料,舉例而言,雲端語音辨識系統可以實施為google的語音辨識系統。 In step S310, audio input is received and audio is recognized to form corpus data. In one embodiment, the audio received through the input / output device 150 can be recognized by the speech recognition module 111 of the processor 110 to convert the user's natural language into corpus data. In another embodiment, the voice recognition may also send audio to the cloud voice recognition system through the Internet, and after the audio is recognized by the cloud voice recognition system, the recognition result is then transmitted. As corpus data, for example, the cloud speech recognition system can be implemented as Google's speech recognition system.

在執行步驟S320之前,需先建立共通詞彙模型以及共通語意模型。因此請參考第4圖,第4圖係根據本案之一些實施例所繪示之訓練資料模型的流程圖。如第4圖所示,訓練資料模型階段包含以下步驟:步驟S410:根據應用知識資料庫及領域知識資料庫產生系統領域詞彙集合;步驟S420:系統領域詞彙集合及複數個服務應用參數形成為關鍵實體集合;步驟S430:將複數個訓練語料分類為查詢資料操作及執行指令操作的其中之一;步驟S440:依照企業資料庫中的類別區分對應查詢資料操作的該些訓練語料的意圖形成複數個查詢資料操作意圖,以及依照企業資源系統提供的服務行為區分對應執行指令操作的該些訓練語料的意圖形成複數個執行指令操作意圖;步驟S450:建立查詢資料操作意圖的範本,以及執行指令操作意圖的範本;步驟S460:根據關鍵實體集合、查詢資料操作意圖的範本以及執行指令操作意圖的範本建立總體資料庫;步驟S470:辨識關鍵實體集合中的系統領域詞彙在訓練語料中出現的複數個第一機率,並藉由辨識出的系統領域詞彙分析訓練語料的複數個句型結構,以及系統領域 詞彙彼此之間的複數個關聯性,並根據第一機率以及關聯性建立共通詞彙模型;以及步驟S480:分析查詢資料操作意圖以及執行指令操作意圖中出現系統領域詞彙的複數個第二機率,並根據句型結構以及第二機率建立共通語意模型。 Before executing step S320, a common vocabulary model and a common semantic model need to be established. Therefore, please refer to FIG. 4, which is a flowchart of a training data model according to some embodiments of the present invention. As shown in Figure 4, the training data model stage includes the following steps: Step S410: Generate a system domain vocabulary based on the application knowledge database and domain knowledge database; Step S420: The system domain vocabulary and a plurality of service application parameters are formed as the key Entity set; step S430: classify a plurality of training corpora into one of query data operation and execute instruction operation; step S440: distinguish the training corpus corresponding to query data operation according to the category in the enterprise database The plurality of query data operation intentions, and the intention to distinguish the training corpus corresponding to the execution of the instruction operation according to the service behavior provided by the enterprise resource system to form a plurality of execution instruction operation intentions; step S450: establishing a template of the query data operation intention, and execution The template of the instruction operation intention; Step S460: Establish an overall database according to the key entity set, the query data operation intention template, and the template for executing the instruction operation intention; Step S470: identify the system domain words in the key entity set appear in the training corpus Plural first Rate, and system areas identified by the lexical analysis training corpus of a plurality of sentence structure, and system areas Plural correlations between vocabularies, and establishing a common vocabulary model according to the first probability and correlation; and step S480: analyzing the query data operation intention and the execution instruction operation intention of the plural second probability of the system domain vocabulary, and Establish a common semantic model based on the sentence structure and the second probability.

於步驟S410及步驟S420中,根據應用知識資料庫132及領域知識資料庫133產生系統領域詞彙集合,再利用系統領域詞彙集合及複數個服務應用參數形成為關鍵實體集合,關鍵實體集合包含複數個系統領域詞彙。舉例而言,關鍵實體集合包含企業領域詞彙以及企業系統的服務應用參數等資訊。企業領域詞彙則是指每個不同領域的企業可能會需要用到的詞彙,例如醫療業運用到的詞彙與運輸業運用到的詞彙一定不相同,因此企業領域詞彙會依照每個使用ERP系統的企業不同而有所變化。企業系統的服務應用參數則是企業系統所提供的各項服務對應的參數,舉例而言,企業系統中的請假功能可能需要請假時間、假別等資訊,關鍵實體集合中的系統領域詞彙就需要包含事假、年假、病假、出差假等資訊。 In steps S410 and S420, a system domain vocabulary set is generated according to the application knowledge database 132 and the domain knowledge database 133, and then the system domain vocabulary set and a plurality of service application parameters are used to form a key entity set. The key entity set includes a plurality of System domain vocabulary. For example, the key entity set contains information such as enterprise domain vocabulary and service application parameters of the enterprise system. The corporate vocabulary refers to the vocabulary that companies in different fields may need. For example, the vocabulary used in the medical industry and the vocabulary used in the transportation industry must be different. Therefore, the vocabulary in the enterprise field will be based on each use of the ERP system. Enterprises vary. The service application parameters of the enterprise system are the parameters corresponding to various services provided by the enterprise system. For example, the leave function in the enterprise system may require information such as leave time and leave, and the system domain vocabulary in the key entity collection needs Contains information on leave, annual leave, sick leave, and business leave.

詳細而言,關鍵實體集合更包含存取資料時會有的資料欄位名稱、企業系統提供給使用者的服務名稱、使用者在查詢時所設定的限制條件之參數值、服務應用的參數值以及企業系統的操作函數等,企業系統的操作函數可以為請假、加班申請、出差申請、報支等操作函數。而上述的這些資訊也可能會有對應的別名,也需在訓練資料庫時一併輸 入,例如:出貨單對於特定領域的廠商有可能有出貨明細表或銷貨單等不同的名稱。 In detail, the key entity collection also includes the data field names that will be available when accessing the data, the service name provided by the enterprise system to the user, the parameter values of the restriction conditions set by the user during the query, and the service application parameter values As well as operation functions of the enterprise system, the operation functions of the enterprise system may be operation functions such as leave of absence, overtime application, business application, and reimbursement. The above information may also have corresponding aliases, which also need to be entered in the training database. Incoming, for example: shipping orders may have different names such as shipping details or sales orders for manufacturers in specific fields.

於步驟S430中,將複數個訓練語料分類為查詢資料操作及執行指令操作的其中之一。訓練語料可以是使用者的可能會下的指令或會問的問題等自然語言的資料,在建立好關鍵實體集合後會將訓練語料按照意圖分類,於一實施例中,使用者的意圖分為查詢資料操作及執行指令操作,但也可以將使用者的意圖分類的更精細,本發明不限於此。舉例而言,使用者如果對虛擬助理說:「請幫我找XX公司的出貨單」,在本發明的意圖分類中會分類為查詢資料操作,虛擬助理就會去企業資料庫中幫使用者查詢XX公司的出貨單。如果使用者對虛擬助理說:「幫我請1月30日的出差假」,在本發明的意圖分類中會分類為執行指令操作,虛擬助理就會進入企業資源系統中幫使用者請假。 In step S430, the plurality of training corpora are classified into one of a query data operation and a command execution operation. The training corpus can be natural language data such as the user's instructions or questions that may be asked. After the key entity set is established, the training corpus will be classified according to the intention. In one embodiment, the user's intention It is divided into query data operation and execution instruction operation, but the user's intention can be classified more finely, and the present invention is not limited to this. For example, if the user says to the virtual assistant: "Please help me find the shipping order of XX company", it will be classified as query data operation in the intent classification of the present invention, and the virtual assistant will go to the enterprise database to help Check the shipping order of company XX. If the user says to the virtual assistant: "Please ask me for a business trip on January 30." In the intent classification of the present invention, it will be classified as performing an instruction operation, and the virtual assistant will enter the enterprise resource system to help the user ask for leave.

於步驟S440中,依照企業資料庫中的類別區分對應查詢資料操作的該些訓練語料的意圖形成複數個查詢資料操作意圖,以及依照企業資源系統提供的服務行為區分對應執行指令操作的該些訓練語料的意圖形成複數個執行指令操作意圖。於一實施例中,會先按照每個不同領域的企業資料庫對查詢資料操作區分意圖。舉例而言,醫療業的企業資料庫所儲存的資料欄位一定與運輸業的企業資料庫不相同,因此兩者的使用者需求也不一定相同。例如,對醫療業的使用者可能會有查詢病歷資料、查詢病房空位等都是查詢資料操作的不同意圖,對運輸業的使用者可能會有查詢出 貨紀錄、查詢包裹運送狀態等都是查詢資料操作的不同意圖。當然也會按照每個不同領域的企業資源系統提供的服務行為對執行指令操作區分意圖,如上所述醫療業的企業資源系統所提供的服務也當然會和運輸業有所不同,每個不同領域的企業所提供的查詢資料操作或服務行為操作也不一定可以通用,因此也需要對每個不同領域的企業所提供的服務區分意圖,例如,對醫療業的使用者可能會有提供掛號的服務、提供住院訂健康餐的服務等都是服務行為操作的不同意圖,對運輸業的使用者可能會有提供自動分類貨物的服務、安排貨物出貨順序的服務等都是服務行為操作的不同意圖。 In step S440, the intention of distinguishing the training corpus corresponding to the query data operation according to the category in the enterprise database is to form a plurality of query data operation intentions, and to distinguish the corresponding execution instruction operations according to the service behavior provided by the enterprise resource system. The intent of the training corpus forms a plurality of intents to execute the instruction operation. In one embodiment, the operation of querying data is distinguished according to the enterprise database in each different domain. For example, the database stored in the enterprise database of the medical industry must be different from the enterprise database of the transportation industry, so the user needs of the two are not necessarily the same. For example, users in the medical industry may have different intents for querying medical records and inquiring vacancies in the ward, and users in the transportation industry may have queries Cargo records, querying the status of parcel delivery, etc. are the different intentions of querying data operations. Of course, according to the service behavior provided by the enterprise resource system in each different field, the intention is to distinguish the execution of the instruction operation. As mentioned above, the services provided by the enterprise resource system in the medical industry will of course be different from the transportation industry. Each different field The query data operation or service behavior operation provided by the company may not be universal, so it is also necessary to distinguish the intent of the service provided by the enterprise in each different field. For example, users in the medical industry may have registered services. The provision of services such as hospitalization and ordering healthy meals are different intentions of service behavior operations. For users in the transportation industry, there may be different intentions of service behavior operations to provide services of automatically classifying goods and arranging the order of shipment of goods. .

於步驟S450及步驟S460中,建立查詢資料操作意圖的範本以及執行指令操作意圖的範本,並根據關鍵實體集合、查詢資料操作意圖的範本以及執行指令操作意圖的範本建立總體資料庫131。舉例而言,將使用者在操作某個領域企業的虛擬助理會有的查詢資料操作意圖及執行指令操作意圖都區分好後,就可以針對每個意圖產生對應的範本,根據上方的範例,醫療業就會有對應查詢病歷資料、查詢病房空位、提供掛號的服務及提供住院訂健康餐的服務的4個範本,運輸業就會有對應查詢出貨紀錄、查詢包裹運送狀態、提供自動分類貨物的服務、安排貨物出貨順序的服務的4個範本,接著會根據上述這些範本以及關鍵實體集合建立總體資料庫131。 In steps S450 and S460, a template for querying data operation intention and a template for executing instruction operation is established, and an overall database 131 is established according to a set of key entities, a template for querying data operation intention and a template for executing instruction operation. For example, after distinguishing the query data operation intention and execution instruction operation intention that a user's virtual assistant operating in a certain field company has, the corresponding template can be generated for each intent. According to the above example, medical treatment There will be 4 templates corresponding to querying medical records, querying vacancies in the ward, providing registered services, and providing services for ordering healthy meals in hospitals. The transportation industry will have corresponding querying shipping records, querying parcel delivery status, and providing automatic sorting of goods. Services, four services for arranging the order in which goods are shipped, and then an overall database 131 will be created based on these templates and a collection of key entities.

於步驟S470中,辨識關鍵實體集合中的系統領域詞彙在訓練語料中出現的複數個第一機率,並藉由辨識出 的系統領域詞彙分析訓練語料的複數個句型結構,以及系統領域詞彙彼此之間的複數個關聯性,並根據第一機率以及關聯性建立共通詞彙模型。在一實施例中,利用n元語法(n-GRAM)以及上下文無關文法(Context-free grammar,CFG)兩種演算法計算每一系統領域詞彙在訓練語料中出現的機率,並藉由系統領域詞彙分析訓練語料的句型結構以及系統領域詞彙彼此之間的關聯性以建立共通詞彙模型。舉例而言,如果訓練語料中有「我要查詢XX公司的報價單」以及「我要查詢XX公司的出貨單」,而「XX公司」、「報價單」及「出貨單」都是系統領域詞彙,但在上述的範例中,由於「XX公司」可能平均出現在每一個查詢資料操作的意圖中,因此「XX公司」的機率在每一個查詢資料操作的意圖中都幾乎相同,而「報價單」及「出貨單」則只在查詢某些特定資料之意圖的訓練語料中大量出現,而不會出現在查詢其他資料之意圖的訓練語料中,因此「報價單」及「出貨單」的機率在對應的意圖中會特別高,而在其他意圖中會較低。 In step S470, identify the first multiple occurrences of the system domain vocabulary in the key entity set in the training corpus, and identify the System sentence lexical analysis training corpus of multiple sentence structure, and the system domain vocabulary of multiple correlations between each other, and based on the first probability and relevance to establish a common vocabulary model. In one embodiment, two algorithms, n-GRAM and Context-free grammar (CFG), are used to calculate the probability of each system domain vocabulary appearing in the training corpus. The domain vocabulary analysis training corpus's sentence structure and system domain vocabulary are related to each other to establish a common vocabulary model. For example, if the training corpus contains "I want to query the quotation of XX company" and "I want to query the XX company's shipping order", and "XX company", "quotation" and "shipment order" are all Is a vocabulary in the system field, but in the above example, because "XX company" may appear on average for each query data operation intent, the probability of "XX company" is almost the same in each query data operation intention. The "quotations" and "shipments" only appear in large numbers in the training corpus intended to query some specific data, and will not appear in the training corpus intended to query other data, so the "quotation" And the probability of "shipment order" will be particularly high in the corresponding intent, and will be lower in other intents.

於步驟S480中,分析查詢資料操作意圖以及執行指令操作意圖中出現系統領域詞彙的複數個第二機率,並根據句型結構以及第二機率建立共通語意模型。在一實施例中,利用隱馬爾可夫模型(Hidden Markov Model,HMM)演算法計算系統領域詞彙在查詢資料操作意圖以及執行指令操作意圖中出現的機率,以建立共通語意模型,舉例而言,在訓練資料模型階段時會輸入許多訓練語料,隱馬爾可 夫模型演算法必須計算系統領域詞彙在不同意圖出現的機率。結合上述的範例,如果訓練語料中有「我要查詢XX公司的出貨單」,依照n元語法以及上下文無關文法可以找出「XX公司」及「出貨單」都是系統領域詞彙,而隱馬爾可夫模型演算法可以依據所有辨識出的系統領域詞彙於查詢資料操作意圖以及執行指令操作意圖中的機率以及系統領域詞彙之間的關係,進一步判斷「出貨單」與查詢出貨資料的意圖相關聯,再結合「XX公司」的系統領域詞彙,可以自動幫使用者在企業資料庫中查詢XX公司的出貨相關資料。 In step S480, a plurality of second probabilities of vocabulary of the system domain appear in the query data operation intention and the execution instruction operation intention, and a common semantic model is established according to the sentence structure and the second probability. In an embodiment, a Hidden Markov Model (HMM) algorithm is used to calculate the probability of a system vocabulary appearing in query data operation intentions and executing instruction operation intentions to establish a common semantic model. For example, During the training data model phase, many training corpora are input. Hidden Markov The husband model algorithm must calculate the probability of the system domain vocabulary appearing at different intents. In combination with the above example, if the training corpus contains "I want to query the shipping order of XX company", according to the n-gram syntax and context-free grammar, it can be found that "XX company" and "shipping order" are both system vocabulary. The hidden Markov model algorithm can further determine the "shipment order" and the query shipment according to the relationship between all the identified system domain vocabularies in the query data operation intention, the probability of executing the instruction operation intention, and the system domain vocabulary. The intent of the data is related, and combined with the system field vocabulary of "XX company", users can automatically query the company's shipping related data in the enterprise database.

當建立完共通詞彙模型及共通語意模型後,接著進行步驟S320,利用自然語言處理模型分析語料資料,以產生與語料資料對應的語言特徵資訊,語言特徵資訊包含複數個意圖、意圖對應的機率以及複數個詞彙。步驟S320的細部流程請參考第5圖,第5圖係根據本案之一些實施例所繪示之步驟S320的流程圖。如第5圖所示,步驟S320包含以下步驟:步驟S321:利用共通詞彙模型辨識語料資料中是否具有符合關鍵實體集合中的系統領域詞彙,將辨識結果設定為語言特徵資訊中的詞彙,並分析語言特徵資訊中的詞彙出現的機率;步驟S322:根據特徵資訊中的詞彙分析語料資料的句型結構;以及步驟S323:利用共通語意模型根據特徵資訊中 的詞彙出現的機率以及語料資料的句型結構辨識語料資料的意圖以及意圖對應的機率。 After the common vocabulary model and the common semantic model are established, step S320 is performed, and the corpus data is analyzed by using a natural language processing model to generate linguistic feature information corresponding to the corpus data. The linguistic feature information includes a plurality of intents and intent correspondence Probability and plural words. For the detailed flow of step S320, please refer to FIG. 5. FIG. 5 is a flowchart of step S320 according to some embodiments of the present invention. As shown in FIG. 5, step S320 includes the following steps: step S321: use a common vocabulary model to identify whether the corpus data has vocabulary in the system domain that matches the key entity set, set the recognition result to the vocabulary in the language feature information, and Analyze the occurrence probability of vocabulary in the linguistic feature information; step S322: analyze the sentence structure of the corpus data based on the vocabulary in the feature information; and step S323: use the common semantic model to The probability of the occurrence of vocabulary and the sentence structure of the corpus data identify the intention of the corpus data and the probability of the corresponding intent.

於步驟S321及步驟S322中,利用共通詞彙模型辨識語料資料中是否具有符合關鍵實體集合中的系統領域詞彙,將辨識結果設定為語言特徵資訊中的詞彙,並分析語言特徵資訊中的詞彙出現的機率,再根據特徵資訊中的詞彙分析語料資料的句型結構。舉例而言,將使用者輸入的語料資料,利用共通詞彙模型將語料資料中含有系統領域詞彙的詞彙辨識出來,再進一步判斷出語料資料的句型結構。舉例而言,如果使用者對虛擬助理說:「我想要查XX公司上個月的出貨單」,根據共通詞彙模型可以辨識出「XX公司」、「上個月」及「出貨單」等符合系統領域詞彙的詞彙。 In step S321 and step S322, the common vocabulary model is used to identify whether the corpus data has vocabulary in the system domain in the key entity set, set the recognition result to the vocabulary in the language feature information, and analyze the occurrence of the vocabulary in the language feature information Probability, and then analyze the sentence structure of the corpus data based on the vocabulary in the feature information. For example, the corpus data input by the user is used to identify the vocabulary in the corpus data that contains the vocabulary of the system domain, and then the sentence structure of the corpus data is further determined. For example, if the user says to the virtual assistant: "I want to check the shipping order of XX company last month", the common vocabulary model can identify "XX company", "last month" and "shipping order" "And other words that fit the vocabulary of the system domain.

於步驟S323中,利用共通語意模型根據特徵資訊中的詞彙出現的機率以及語料資料的句型結構辨識語料資料的意圖以及意圖對應的機率。根據上方的範例,辨識出「XX公司」、「上個月」及「出貨單」等詞彙後,會再進一步判斷這些詞彙在所有意圖中的機率。此處指的所有意圖包含所有查詢資料操作意圖以及執行指令操作意圖的機率。 In step S323, the common semantic model is used to identify the intention of the corpus data and the probability corresponding to the intent according to the probability of vocabulary appearance in the feature information and the sentence structure of the corpus data. According to the example above, after identifying the words "XX company", "last month", and "shipment order", we will further judge the probability of these words in all intents. All intents referred to here include all query data operation intentions and the probability of performing instruction operation intentions.

於步驟S330中,依據職能情境資訊對語言特徵資訊進行職能情境分析,確認該些意圖的其中之一對應的操作。在進行職能情境分析之前需先建立職能情境模型,職能情境模型在進行職能情境分析時是先將歷史資料庫134中的資料所轉換成的特徵向量,然後會利用機器學習演算法將歷史資料庫134中的資料依據各種不同的情境分類後計算 特徵向量與各情境之間的強弱關係,接著產生職能情境模型。適合建立上述職能情境的機器學習演算法包括:傳統機器學習常用的支援向量機(Support Vector Machine,SVM),以及目前深度學習(Deep Learning)相關的卷積神經網路(Convolutional Neural Networks,CNN)、遞歸神經網路(Recurrent Neural Networks,RNN)和長短期記憶模型(Long Short-Term Memory,LSTM)等演算法。步驟S330的細部流程請參考第6圖,第6圖係根據本案之一些實施例所繪示之步驟S330的流程圖。如第6圖所示,步驟S330包含以下步驟:步驟S331:利用語料資料以及職能情境資訊與職能情境模型進行比對,並產生職能情境辨識結果;以及步驟S332:根據職能情境辨識結果判斷該些意圖的其中之一對應查詢資料操作及執行指令操作的其中之一。 In step S330, a functional context analysis is performed on the linguistic feature information according to the functional context information to confirm an operation corresponding to one of the intentions. Before performing functional situation analysis, a functional situation model needs to be established first. When performing a functional situation analysis, the functional situation model first converts the features in the historical database 134 into feature vectors, and then uses machine learning algorithms to convert the historical database The data in 134 are classified and calculated according to different scenarios. The strong and weak relationship between the feature vector and each situation, then a functional situation model is generated. Machine learning algorithms suitable for establishing the above functional scenarios include: Support Vector Machines (SVMs) commonly used in traditional machine learning, and Convolutional Neural Networks (CNN) related to deep learning. , Recurrent Neural Networks (RNN) and Long Short-Term Memory (LSTM) algorithms. For the detailed flow of step S330, please refer to FIG. 6, which is a flowchart of step S330 according to some embodiments of the present invention. As shown in FIG. 6, step S330 includes the following steps: step S331: using corpus data and functional context information to compare with the functional context model and generating a functional context identification result; and step S332: judging the result based on the functional context identification result One of these intentions corresponds to one of querying data operations and executing instruction operations.

於步驟S331中,利用語料資料以及職能情境資訊與職能情境模型進行比對,並產生職能情境辨識結果。職能情境資訊包含使用者的身份、使用者的職位、使用者的部門、時間以及地點。職能情境資訊的部分資訊可以由輸入/輸出裝置150所感測,例如可以偵測使用者目前的狀態(例如,是否出差回來)。根據前面辨識使用者語料資料後所得到的所有意圖對應的機率以及詞彙,再結合職能情境資訊可以近一步確認使用者的語料資料是符合訓練資料模型中的資料。 In step S331, the corpus data and the functional context information are compared with the functional context model, and a functional context identification result is generated. Functional context information includes the user's identity, the user's position, the user's department, time, and location. Part of the information of the functional context information can be sensed by the input / output device 150, for example, it can detect the current status of the user (for example, whether or not he is back on a business trip). According to the probability and vocabulary of all the intents obtained after identifying the user's corpus data, combined with the functional context information, it can be further confirmed that the user's corpus data is consistent with the data in the training data model.

於步驟S332中,根據職能情境辨識結果判斷該些意圖的其中之一對應查詢資料操作及執行指令操作的其中之一。由於在訓練資料模型中會有多個查詢資料操作意圖以及多個執行指令操作意圖,並且在經過前述的共通語意模型的計算後會產生每個意圖對應的機率,具有較低機率值的意圖可以利用門檻值過濾,以得到最有可能的意圖並確認對應的操作。由前述的範例可知,當辨識出「XX公司」、「上個月」及「出貨單」等詞彙後,會判斷這些詞彙搭配職能情境資訊找出最符合的查詢資料操作意圖或執行指令操作意圖,在經過上述操作後即會判斷出使用者對虛擬助理說:「我想要查XX公司上個月的出貨單」,最有可能會要查XX公司的出貨單,因此即可對應出使用者想要執行的是查詢資料操作。需要有職能情境的判斷是因為會因為使用者的職位、部門、操作時間、操作地點等資訊不同,而有不同的需求,舉例而言,採購人員與財務人員都會看[廠商每月統計表],但是可能這兩者的[廠商每月統計表]的統計目標並不相同:一個是統計廠商的進貨狀況,另一個是統計自己公司付款給廠商的狀況。但使用者在與虛擬助理對話時不一定會明確指說需要什麼[廠商每月統計表],可能只說:「我需要上個月的廠商每月統計表」這種簡單的句型,因此才更需要搭配使用者的職能情境資訊再進行進一步精準的判斷。 In step S332, it is determined that one of the intentions corresponds to one of a query data operation and an execution instruction operation according to the functional context recognition result. Because there will be multiple query data operation intentions and multiple instruction operation operation intentions in the training data model, and after the calculation of the common semantic model described above, the probability corresponding to each intent will be generated. Intents with lower probability values can be Use threshold filtering to get the most likely intent and confirm the corresponding operation. As can be seen from the foregoing example, when the words "XX company", "last month", and "shipment order" are identified, these words will be judged with the functional context information to find the most suitable query data operation intention or execute the command operation The intention is that after the above operation, the user will be judged that the user said to the virtual assistant: "I want to check the shipping order of XX company last month". Most likely, you will want to check the shipping order of XX company, so you can Corresponding to what the user wants to perform is a query data operation. The judgement of the functional situation is needed because the user's position, department, operation time, operation place and other information have different needs. For example, purchasers and financial staff will look at [vendor monthly statistics table] However, it is possible that the statistical goals of the [vendor monthly statistics table] of the two are not the same: one is to count the purchase status of the manufacturer, and the other is to count the payment status of the company to the manufacturer. However, when talking with the virtual assistant, the user may not necessarily specify exactly what the [vendor monthly statistics table] is needed. It may only say, "I need the monthly monthly statistics table of the manufacturer last month." It is even more necessary to make a more accurate judgment with the user's functional situation information.

於步驟S340中,從總體資料庫131中找出操作的範本,並將操作對應的詞彙輸入操作的範本以形成操作對應的指令。接續上述範例,在找出使用者想要執行的是查詢 資料操作後,虛擬助理會從總體資料庫131中找出之前建立好的查詢資料操作的範本,然後將「XX公司」、「上個月」及「出貨單」等詞彙輸入進範本中以形成操作對應的指令,此處的指令可以為SQL程式指令,當然也可以是其他類型的程式指令。 In step S340, a template of the operation is found from the overall database 131, and a vocabulary corresponding to the operation is input into the template of the operation to form an instruction corresponding to the operation. Following the example above, find out what the user wants to run is a query After the data operation, the virtual assistant will find the previously created query data operation template from the overall database 131, and then enter the words "XX company", "last month" and "shipment order" into the template to The instruction corresponding to the operation is formed. The instruction here may be a SQL program instruction, of course, it may also be another type of program instruction.

於步驟S350及步驟S360中,針對企業資料庫或企業資源系統執行操作對應的指令並產生操作對應的結果,及輸出操作對應的結果。接續上述範例,當設定好查詢資料操作對應的程式指令後,虛擬助理就會進入企業資料庫查詢使用者想要找的資料,即為「XX公司上個月的出貨單」,接著虛擬助理會將「XX公司上個月的出貨單」的結果經由輸入/輸出裝置150回覆給使用者,可以經由顯示器顯示給使用者查看,也可以直接經由印表機列印出報表交由使用者查看。 In steps S350 and S360, an operation corresponding instruction is executed and an operation corresponding result is generated for the enterprise database or an enterprise resource system, and an operation corresponding result is output. Continuing the above example, when the corresponding program instructions for querying data operations are set, the virtual assistant will enter the enterprise database to query the data the user wants, which is "the shipping order of XX company last month", and then the virtual assistant The results of "XX company's last month's shipping order" will be returned to the user via the input / output device 150, which can be displayed to the user for viewing, or the report can be printed directly to the user via the printer and submitted to the user Check it out.

請繼續參考第7圖,第7圖係根據本案之一些實施例所繪示之一種使用者與操作虛擬助理的系統互動的示意圖。如第7圖所示,使用者710會和操作虛擬助理的系統100進行互動,使用者710可以使用自然語言與虛擬助理溝通,操作虛擬助理的系統100會將使用者的要求分為查詢資料操作以及執行指令操作,如果是查詢資料操作,操作虛擬助理的系統100會到企業資料庫730中查資料再反饋給使用者,如果是執行指令操作操作虛擬助理的系統100會到企業資源系統720執行相關的服務再反饋給使用者,而操作虛擬助理的系統100也會主動的將可能隱藏的背景事件通知給 使用者,例如:使用者如果出差回來有差旅費要核銷,操作虛擬助理的系統100也會主動通知使用者要核銷單據。 Please continue to refer to FIG. 7. FIG. 7 is a schematic diagram illustrating interaction between a user and a system for operating a virtual assistant according to some embodiments of the present invention. As shown in FIG. 7, the user 710 interacts with the system 100 operating the virtual assistant. The user 710 can communicate with the virtual assistant using natural language. The system 100 operating the virtual assistant divides the user's request into data query operations. And execute the instruction operation. If it is a data query operation, the system 100 that operates the virtual assistant will check the data in the enterprise database 730 and feed it back to the user. If the system 100 executes the instruction operation, the virtual assistant system 100 will execute the enterprise resource system 720 Relevant services are then fed back to the user, and the system 100 operating the virtual assistant will also actively notify background events that may be hidden The user, for example, if the user has to travel expenses to be written off after returning from a business trip, the system 100 operating the virtual assistant will also actively notify the user to write off the documents.

由上述本案之實施方式可知,主要係改進以往ERP系統龐大繁雜造成使用者使用不易的問題,結合虛擬助理更可以協助使用者以更方便的方式使用ERP系統,而將使用者想要進行的操作分類,搭配使用者的職能情境資訊可以更準確的判斷使用者的意圖,達到節省使用者的操作時間以及降低使用ERP系統困難度的功效。 According to the implementation of the above case, it is mainly to improve the problem that the previous ERP system is huge and complicated, which makes it difficult for the user to use. In combination with the virtual assistant, it can help the user to use the ERP system in a more convenient way, and the operation that the user wants to perform. Classification, combined with the user's functional context information can more accurately determine the user's intention, achieve the effect of saving the user's operation time and reducing the difficulty of using the ERP system.

另外,上述例示包含依序的示範步驟,但該些步驟不必依所顯示的順序被執行。以不同順序執行該些步驟皆在本揭示內容的考量範圍內。在本揭示內容之實施例的精神與範圍內,可視情況增加、取代、變更順序及/或省略該些步驟。 In addition, the above-mentioned illustration includes sequential exemplary steps, but the steps need not be performed in the order shown. It is within the scope of this disclosure to perform these steps in different orders. Within the spirit and scope of the embodiments of the present disclosure, these steps may be added, replaced, changed, and / or omitted as appropriate.

雖然本案已以實施方式揭示如上,然其並非用以限定本案,任何熟習此技藝者,在不脫離本案之精神和範圍內,當可作各種之更動與潤飾,因此本案之保護範圍當視後附之申請專利範圍所界定者為準。 Although this case has been disclosed as above in the form of implementation, it is not intended to limit the case. Any person skilled in this art can make various modifications and retouches without departing from the spirit and scope of the case. Therefore, the scope of protection of this case should be considered after The attached application patent shall prevail.

Claims (8)

一種操作虛擬助理的方法,包含:接收一音訊輸入並辨識該音訊以形成一語料資料;利用一自然語言處理模型分析該語料資料,以產生與該語料資料對應的一語言特徵資訊,其中該語言特徵資訊包含複數個意圖、該些意圖對應的機率以及複數個詞彙,更包含:利用一共通詞彙模型辨識該語料資料中是否具有符合該關鍵實體集合中的複數個系統領域詞彙,將辨識結果設定為該些詞彙,並分析該些詞彙出現的機率;以及根據該些詞彙分析該語料資料的句型結構;利用一共通語意模型根據該些詞彙出現的機率以及該語料資料的句型結構辨識該語料資料的該些意圖以及該些意圖對應的機率;依據一職能情境資訊對該語料資料的該些意圖進行一職能情境分析,確認該些意圖的其中之一對應的一操作;從一總體資料庫中找出該操作的範本,並將該操作對應的詞彙輸入該操作的範本以形成該操作對應的指令;針對一企業資料庫或一企業資源系統執行該操作對應的指令並產生該操作對應的結果;以及輸出該操作對應的結果;其中,該操作包含一查詢資料操作及一執行指令操作的其中之一。 A method for operating a virtual assistant includes: receiving an audio input and identifying the audio to form a corpus data; analyzing the corpus data using a natural language processing model to generate a language feature information corresponding to the corpus data, The linguistic feature information includes a plurality of intents, a probability corresponding to the intents, and a plurality of vocabularies, and further includes: using a common vocabulary model to identify whether the corpus data has a plurality of system domain vocabularies that conform to the key entity set. Set the recognition result to the words, and analyze the probability of the words appearing; and analyze the sentence structure of the corpus data according to the words; use a common semantic model to predict the occurrence of the words and the corpus data according to a common semantic model The sentence structure of the corpus identifies the intents of the corpus and the probability of the intents; perform a functional context analysis of the intents of the corpus according to a functional context information to confirm that one of the intents corresponds Of an operation; find a template for that operation from a general database, and compare the words for that operation Input a template of the operation to form an instruction corresponding to the operation; execute an instruction corresponding to the operation against an enterprise database or an enterprise resource system and generate a result corresponding to the operation; and output a result corresponding to the operation; wherein the operation includes One of a data query operation and an instruction execution operation. 如請求項1所述的操作虛擬助理的方法,更包含:根據一應用知識資料庫及一領域知識資料庫產生一系統領域詞彙集合;該系統領域詞彙集合及複數個服務應用參數形成為一關鍵實體集合,該關鍵實體集合包含該些系統領域詞彙;將複數個訓練語料分類為該查詢資料操作及該執行指令操作的其中之一;依照該企業資料庫中的類別區分對應該查詢資料操作的該些訓練語料的意圖形成複數個查詢資料操作意圖,以及依照該企業資源系統提供的服務行為區分對應該執行指令操作的該些訓練語料的意圖形成複數個執行指令操作意圖;建立該些查詢資料操作意圖的範本,以及該些執行指令操作意圖的範本;根據該關鍵實體集合、該些查詢資料操作意圖的範本以及該些執行指令操作意圖的範本建立該總體資料庫;辨識該關鍵實體集合中的該些系統領域詞彙在該些訓練語料中出現的複數個第一機率,並藉由辨識出的該些系統領域詞彙分析該些訓練語料的複數個句型結構,以及該些系統領域詞彙彼此之間的複數個關聯性,並根據該些第一機率以及該些關聯性建立該共通詞彙模型;以及分析該些查詢資料操作意圖以及該些執行指令操作意圖中出現該些系統領域詞彙的複數個第二機率,並根據該 些句型結構以及該些第二機率建立該共通語意模型。 The method for operating a virtual assistant according to claim 1, further comprising: generating a system domain vocabulary set based on an application knowledge database and a domain knowledge database; the system domain vocabulary set and a plurality of service application parameters form a key Entity set, the key entity set contains the vocabulary of the system domains; classifying a plurality of training corpora into one of the query data operation and the execution instruction operation; distinguishing corresponding query data operations according to categories in the enterprise database The intentions of the training corpus form a plurality of query data operation intentions, and the intention of the training corpus that should perform the instruction operation is formed according to the service behavior provided by the enterprise resource system to form a plurality of execution instruction operation intentions; establishing the Templates of query data operation intentions, and templates of execution command operation intentions; establishing the overall database based on the set of key entities, query data operation intention templates, and execution command operation intention templates; identifying the key The system domain vocabulary in the entity collection is in these The plurality of first chances appearing in the training corpus, and the plurality of sentence structures of the training corpus are analyzed based on the identified system domain vocabularies, and the plurality of correlations between the system domain vocabularies with each other , And establish the common vocabulary model according to the first probabilities and the correlations; and analyze the plurality of second probabilities that the system domain words appear in the query data operation intentions and the execution instruction operation intentions, and according to The The sentence structure and the second chances establish the common semantic model. 如請求項1所述的操作虛擬助理的方法,更包含:利用一分類器將一歷史資料庫中的資料進行關係強弱分類,產生一職能情境模型。 The method for operating a virtual assistant according to claim 1, further comprising: using a classifier to classify the data in a historical database into relationship strengths to generate a functional situation model. 如請求項3所述的操作虛擬助理的方法,其中該職能情境分析更包含:利用該語料資料以及該職能情境資訊與該職能情境模型進行比對,並產生一職能情境辨識結果;以及根據該職能情境辨識結果判斷該些意圖的其中之一對應該查詢資料操作及該執行指令操作的其中之一。 The method for operating a virtual assistant according to claim 3, wherein the functional situation analysis further comprises: using the corpus data and the functional situation information to compare with the functional situation model, and generating a functional situation identification result; and according to The functional situation identification result judges that one of the intentions should correspond to one of the query data operation and the execution instruction operation. 一種操作虛擬助理的系統,分別與一企業資料庫及一企業資源系統連接,包含:一處理器;一儲存裝置,電性連接至該處理器,用以儲存一總體資料庫、一應用知識資料庫、一領域知識資料庫以及一歷史資料庫;一輸入/輸出裝置,電性連接至該處理器,用以提供一介面以供輸入一音訊;其中,該處理器包含:一語音辨識模組,用以辨識該音訊以形成一語料資料; 一語料分析模組,與該語音辨識模組電性連接,用以利用一自然語言處理模型分析該語料資料,以產生與該語料資料對應的一語言特徵資訊,其中該語言特徵資訊包含複數個意圖、該些意圖對應的機率以及複數個詞彙;該語料分析模組更利用一共通詞彙模型辨識該語料資料中是否具有符合該關鍵實體集合中的複數個系統領域詞彙,將辨識結果設定為該些詞彙,並分析該些詞彙出現的機率,根據該些詞彙分析該語料資料的句型結構;以及利用一共通語意模型根據該些詞彙出現的機率以及該語料資料的句型結構辨識該語料資料的該些意圖以及該些意圖對應的機率;一情境分析模組,與該語料分析模組電性連接,用以依據一職能情境資訊對該語料資料的該些意圖進行一職能情境分析,確認該些意圖的其中之一對應的一操作;一指令產生模組,與該情境分析模組電性連接,用以從該總體資料庫中找出該操作的範本,並將該操作對應的詞彙輸入該操作的範本以形成該操作對應的指令;以及一操作執行模組,與該指令產生模組電性連接,用以針對該企業資料庫或該企業資源系統執行該操作對應的指令並產生該操作對應的結果,並輸出該操作對應的結果至該輸入/輸出裝置;其中,該操作包含一查詢資料操作及一執行指令操作的其中之一。 A system for operating a virtual assistant, which is respectively connected to an enterprise database and an enterprise resource system, and includes: a processor; a storage device electrically connected to the processor for storing an overall database and an application knowledge data Database, a domain knowledge database and a historical database; an input / output device electrically connected to the processor for providing an interface for inputting an audio; wherein the processor includes: a speech recognition module To identify the audio to form a corpus; A corpus analysis module electrically connected to the speech recognition module for analyzing the corpus data using a natural language processing model to generate a linguistic feature information corresponding to the corpus data, wherein the linguistic feature information The corpus analysis module uses a common vocabulary model to identify whether the corpus data has a plurality of system domain vocabularies that match the key entity set. The recognition result is set to the words, and the probability of occurrence of the words is analyzed, and the sentence structure of the corpus data is analyzed according to the words; and a common semantic model is used to estimate the occurrence of the words and the corpus data based on the common semantic model. The sentence structure identifies the intents of the corpus data and the probability corresponding to the intents; a context analysis module is electrically connected to the corpus analysis module, and is used to analyze the corpus data based on functional context information. The intents perform a functional situation analysis to confirm an operation corresponding to one of the intents; an instruction generation module, which is separated from the situation The module is electrically connected to find a template of the operation from the overall database, and input the vocabulary corresponding to the operation into the template of the operation to form an instruction corresponding to the operation; and an operation execution module, connected with the The instruction generating module is electrically connected to execute the instruction corresponding to the operation against the enterprise database or the enterprise resource system and generate a result corresponding to the operation, and output the result corresponding to the operation to the input / output device; wherein, This operation includes one of a data query operation and an instruction execution operation. 如請求項6所述的操作虛擬助理的系統,該 處理器更包含:一訓練模組,與該語料分析模組電性連接,用以根據該應用知識資料庫及該領域知識資料庫產生一系統領域詞彙集合,該系統領域詞彙集合及複數個服務應用參數形成為一關鍵實體集合,該關鍵實體集合包含該些系統領域詞彙,並將複數個訓練語料分類為該查詢資料操作及該執行指令操作的其中之一,依照該企業資料庫中的類別區分對應該查詢資料操作的該些訓練語料的意圖形成複數個查詢資料操作意圖,以及依照該企業資源系統提供的服務行為區分對應該執行指令操作的該些訓練語料的意圖形成複數個執行指令操作意圖;一範本建立模組,與該訓練模組電性連接,建立該些查詢資料操作意圖的範本,以及該些執行指令操作意圖的範本,根據該關鍵實體集合、該些查詢資料操作意圖的範本以及該些執行指令操作意圖的範本建立該總體資料庫;一詞彙模型建立模組,與該範本建立模組電性連接,辨識該關鍵實體集合中的該些系統領域詞彙在該些訓練語料中出現的複數個第一機率,並藉由辨識出的該些系統領域詞彙分析該些訓練語料的複數個句型結構,以及該些系統領域詞彙彼此之間的複數個關聯性,並根據該些第一機率以及該些關聯性建立一共通詞彙模型;以及一語意模型建立模組,與該範本建立模組電性連接,分析該些查詢資料操作意圖以及該些執行指令操作意圖中出現該些系統領域詞彙的複數個第二機率,並根據該些句型 結構以及該些第二機率建立該共通語意模型。 The system for operating a virtual assistant according to claim 6, the The processor further includes: a training module electrically connected to the corpus analysis module for generating a system domain vocabulary based on the application knowledge database and the domain knowledge database, the system domain vocabulary collection and a plurality of The service application parameters are formed as a set of key entities. The set of key entities includes the vocabulary of the system domains, and classifies a plurality of training corpora into one of the query data operation and the execution instruction operation, according to the enterprise database. The classification of the training corpus corresponding to the query data operation intention forms a plurality of query data operation intentions, and according to the service behavior provided by the enterprise resource system, the intention of the training corpus that should perform the instruction operation forms a plural number. An execution instruction operation intention; a template establishment module, which is electrically connected with the training module, establishes the query data operation intention templates, and the execution command operation intention templates, according to the key entity set, the queries The data operation intention template and the execution instruction operation intention templates establish the general Database; a vocabulary model building module, which is electrically connected to the template to establish a module, to identify the plural first chances that the system domain words in the key entity set appear in the training corpora, and The identified system domain vocabularies analyze the plurality of sentence structure structures of the training corpora, and the plurality of associations between the system domain vocabularies with each other, and establish a relationship based on the first probabilities and the associations. A common vocabulary model; and a semantic model building module, which is electrically connected to the template, analyzes the query data operation intentions and the execution instruction operation intentions, and a plurality of second chances of the system domain vocabulary appear, And according to those sentence patterns Structure and the second chances to build the common semantic model. 如請求項5所述的操作虛擬助理的系統,其中該處理器更包含:一情境訓練模組,與該情境分析模組電性連接,用以利用一分類器將該歷史資料庫中的資料進行關係強弱分類,產生一職能情境模型。 The system for operating a virtual assistant according to claim 5, wherein the processor further includes: a situation training module, which is electrically connected to the situation analysis module for using a classifier to store the data in the historical database Classification of relationship strengths produces a functional situation model. 如請求項7所述的操作虛擬助理的系統,其中該情境分析模組更用以利用該語料資料以及該職能情境資訊與該職能情境模型進行比對,並產生一職能情境辨識結果,以及根據該職能情境辨識結果判斷該些意圖的其中之一對應該查詢資料操作及該執行指令操作的其中之一。 The system for operating a virtual assistant according to claim 7, wherein the context analysis module is further configured to compare the functional context information and the functional context information with the functional context model, and generate a functional context identification result, and According to the result of the functional context identification, it is determined that one of the intentions corresponds to one of the data query operation and the execution instruction operation.
TW107105340A 2018-02-13 2018-02-13 Method and system for operating a virtual assistant TWI674530B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW107105340A TWI674530B (en) 2018-02-13 2018-02-13 Method and system for operating a virtual assistant

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW107105340A TWI674530B (en) 2018-02-13 2018-02-13 Method and system for operating a virtual assistant

Publications (2)

Publication Number Publication Date
TW201935229A TW201935229A (en) 2019-09-01
TWI674530B true TWI674530B (en) 2019-10-11

Family

ID=68618458

Family Applications (1)

Application Number Title Priority Date Filing Date
TW107105340A TWI674530B (en) 2018-02-13 2018-02-13 Method and system for operating a virtual assistant

Country Status (1)

Country Link
TW (1) TWI674530B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI725577B (en) * 2019-10-17 2021-04-21 中國信託商業銀行股份有限公司 Intelligent voice information quality inspection method and system

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP7264115B2 (en) * 2020-05-28 2023-04-25 Jfeスチール株式会社 Information retrieval system

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130325448A1 (en) * 2012-05-31 2013-12-05 Elwha LLC, a limited liability company of the State of Delaware Speech recognition adaptation systems based on adaptation data
US20140337048A1 (en) * 2012-09-07 2014-11-13 Next It Corporation Conversational Virtual Healthcare Assistant
TW201626362A (en) * 2014-09-30 2016-07-16 蘋果公司 Providing an indication of the suitability of speech recognition
TW201636996A (en) * 2014-10-06 2016-10-16 英特爾公司 System and method of automatic speech recognition using on-the-fly word lattice generation with word histories
CN107564518A (en) * 2017-08-21 2018-01-09 百度在线网络技术(北京)有限公司 Smart machine control method, device and computer equipment
CN107657952A (en) * 2017-09-14 2018-02-02 成都晓懋科技有限公司 A kind of phonetic controller of smart home lamp

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130325448A1 (en) * 2012-05-31 2013-12-05 Elwha LLC, a limited liability company of the State of Delaware Speech recognition adaptation systems based on adaptation data
US20140337048A1 (en) * 2012-09-07 2014-11-13 Next It Corporation Conversational Virtual Healthcare Assistant
TW201626362A (en) * 2014-09-30 2016-07-16 蘋果公司 Providing an indication of the suitability of speech recognition
TW201636996A (en) * 2014-10-06 2016-10-16 英特爾公司 System and method of automatic speech recognition using on-the-fly word lattice generation with word histories
CN107564518A (en) * 2017-08-21 2018-01-09 百度在线网络技术(北京)有限公司 Smart machine control method, device and computer equipment
CN107657952A (en) * 2017-09-14 2018-02-02 成都晓懋科技有限公司 A kind of phonetic controller of smart home lamp

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI725577B (en) * 2019-10-17 2021-04-21 中國信託商業銀行股份有限公司 Intelligent voice information quality inspection method and system

Also Published As

Publication number Publication date
TW201935229A (en) 2019-09-01

Similar Documents

Publication Publication Date Title
US10311377B2 (en) Categorization of user interactions into predefined hierarchical categories
AU2019261735B2 (en) System and method for recommending automation solutions for technology infrastructure issues
WO2018196684A1 (en) Method and device for generating conversational robot
US11106983B2 (en) Intelligent interaction method and intelligent interaction system
Pittke et al. Automatic detection and resolution of lexical ambiguity in process models
US9672490B2 (en) Procurement system
Van der Aa et al. Detecting inconsistencies between process models and textual descriptions
CN110297868A (en) Construct enterprise's specific knowledge figure
US11928430B2 (en) Detecting unrelated utterances in a chatbot system
US20220335223A1 (en) Automated generation of chatbot
WO2019041517A1 (en) Electronic device, question recognition and confirmation method, and computer-readable storage medium
US9910909B2 (en) Method and apparatus for extracting journey of life attributes of a user from user interactions
TWI674530B (en) Method and system for operating a virtual assistant
WO2020139865A1 (en) Systems and methods for improved automated conversations
CN110489517B (en) Automatic learning method and system of virtual assistant
US20230351121A1 (en) Method and system for generating conversation flows
TWI679548B (en) Method and system for automated learning of a virtual assistant
CN110209776B (en) Method and system for operating virtual assistant
US20220245340A1 (en) Electronic device for processing user's inquiry, and operation method of the electronic device
CN114546326A (en) Virtual human sign language generation method and system
TWI652587B (en) Method and system for automatically training virtual assistant
CN110298372B (en) Method and system for automatically training virtual assistant
TWM590719U (en) Intelligent conversation management system based on natural language processing
US11860824B2 (en) Graphical user interface for display of real-time feedback data changes
CN110442324B (en) Software requirement text expression defect detection method, system and storage medium