TWM635534U

TWM635534U - Artificial intelligence voice controlled banking transaction system

Info

Publication number: TWM635534U
Application number: TW111210256U
Authority: TW
Inventors: 白庭楷
Original assignee: 華南商業銀行股份有限公司
Priority date: 2022-09-21
Filing date: 2022-09-21
Publication date: 2022-12-11

Abstract

一種人工智能聲控銀行交易系統，包含自動櫃員機、運算裝置、記憶體及語音接收裝置。運算裝置用於進行身分驗證操作及指令確認操作，記憶體用於儲存兩語音訊息，以及語音識別模型、登記身分資料和指令資料庫，語音接收裝置用於接收兩語音訊息並傳輸至記憶體。身分驗證操作包含藉由語音識別模型擷取第一語音訊息中的待定身分資料，以及比對並確認待定身分資料符合登記身分資料，指令確認操作包含藉由語音識別模型擷取一第二語音訊息中的待定指令資料，以及比對並判斷待定指令資料與指令資料庫中的多個執行指令資料對應的一執行指令資料。An artificial intelligence voice-activated bank transaction system includes an automatic teller machine, a computing device, a memory, and a voice receiving device. The computing device is used for identity verification operation and instruction confirmation operation, the memory is used for storing two voice messages, and a voice recognition model, registered identity data and command database, and the voice receiving device is used for receiving two voice messages and transmitting them to the memory. The identity verification operation includes extracting the pending identity data in the first voice message through the voice recognition model, and comparing and confirming that the pending identity data matches the registered identity data, and the command confirmation operation includes extracting a second voice message through the voice recognition model The pending instruction data in the command database, and an execution instruction data that compares and determines that the pending instruction data corresponds to a plurality of execution instruction data in the instruction database.

Description

Artificial intelligence voice-activated banking transaction system

本創作係關於一種銀行交易系統，特別是關於一種人工智能聲控銀行交易系統。 This creation is about a bank transaction system, especially an artificial intelligence voice-activated bank transaction system.

目前的銀行交易系統，如自動櫃員機(ATM)的交易介面大多是採取按鍵或面板的形式，在某些情況或對於特定族群(如視力受損、行動不便者)而言，造成了一定的不便性。 Current banking transaction systems, such as automatic teller machine (ATM) transaction interfaces are mostly in the form of buttons or panels, which in some cases or for specific groups (such as visually impaired, mobility-impaired) has caused certain inconvenience sex.

在某些系統中，語音功能往往都是輔助性的，需要搭配螢幕顯示以及觸摸點字等，因此對於特定使用者而言，使用輔助性的自動櫃員機往往不如透過手機使用網路銀行來得方便。 In some systems, the voice function is often assisted, and needs to be matched with screen display and touch Braille. Therefore, for certain users, using assisted ATMs is often not as convenient as using online banking through mobile phones.

鑒於上述，本創作提供一種人工智能聲控銀行交易系統。 In view of the above, this creation provides an artificial intelligence voice-activated banking transaction system.

依據本創作一實施例的人工智能聲控銀行交易系統，包含自動櫃員機、連接於自動櫃員機的運算裝置、連接於運算裝置的記憶體及連接於記憶體的語音接收裝置。自動櫃員機用於讓使用者自助辦理銀行櫃檯服務，運算裝置用於對使用者進行一身分驗證操作及一指令確認操作，記憶體用於儲存一第一及第二語音訊息，並儲存有語音識別模型、使用者的一登記身分資料及一指令資料庫，語音接收裝置用於接收使用者的第一及第二語音訊息並傳輸至記憶體。所述運算裝置執行的身分驗證操作包含：藉由語音識別模型擷取第一語音訊息中含有的一待定身分資料，以及比對待定身分資料與登記身分資料並確認待定身分資料符合該登記身分資料，且運算裝置執行的該指令確認操作包含：藉由語音識別模型擷取一第二語音訊息中含有的一待定指令資料，以及比對待定指令資料與指令資料庫中的多個執行指令資料，以判斷與所述待定指令資料對應的一執行指令資料。 According to an embodiment of the invention, the artificial intelligence voice-activated bank transaction system includes an automatic teller machine, a computing device connected to the automatic teller machine, a memory connected to the computing device, and a voice receiving device connected to the memory. The automatic teller machine is used to allow the user to handle the bank counter service by himself, the computing device is used to perform an identity verification operation and an instruction confirmation operation on the user, and the memory is used to store a first and a second voice message, and store a voice recognition The model, a user's registered identity data and an instruction database, the voice receiving device is used to receive the user's first and second voice messages and transmit them to the memory. The identity verification operation performed by the computing device includes: using a voice recognition model to extract a pending message contained in the first voice message The identity data, and comparing the pending identity data with the registered identity data and confirming that the pending identity data conforms to the registered identity data, and the command confirmation operation performed by the computing device includes: extracting the information contained in a second voice message by a voice recognition model A pending order data, and comparing the pending order data with a plurality of execution order data in the order database to determine an execution order data corresponding to the pending order data.

藉由上述結構，本案所揭示的人工智能聲控銀行交易系統，可透過儲存有人工智慧語音識別模型的記憶體搭配運算裝置，對接收的語音訊息進行內容判斷，以安全有效地執行身分驗證以及指令確認的功能。讓使用者只需再自動櫃員機前講述自己的身分或帳戶資料完成驗證後，再口語表達欲執行的銀行服務項目，如此一來一般的銀行業務皆能完全透過聲音控制的方式來完成，對各種族群來說都是便利的措施。 With the above structure, the artificial intelligence voice-activated banking transaction system disclosed in this case can judge the content of the received voice message through the memory with the artificial intelligence voice recognition model and the computing device, so as to safely and effectively execute identity verification and instructions Confirmed function. Let the user only need to tell his identity or account information in front of the automatic teller machine to complete the verification, and then verbally express the banking service items he wants to perform. In this way, the general banking business can be completed completely through voice control. It is a convenient measure for the ethnic group.

以上之關於本揭露內容之說明及以下之實施方式之說明係用以示範與解釋本創作之精神與原理，並且提供本創作之專利申請範圍更進一步之解釋。 The above description about the content of this disclosure and the following description of the implementation are used to demonstrate and explain the spirit and principle of this creation, and to provide a further explanation of the patent application scope of this creation.

1:人工智能聲控銀行交易系統 1: Artificial intelligence voice-activated bank transaction system

10:自動櫃員機 10: Automatic teller machine

20:運算裝置 20: computing device

30,30’,30”,30''':記憶體 30,30',30",30''': memory

31:語音識別模型 31: Speech Recognition Model

32:登記身分資料 32: Registration of identity information

33:指令資料庫 33: Command database

34:臉部特徵偵測模型 34: Facial feature detection model

35:登錄臉部特徵 35:Register facial features

36:唇形識別模型 36: Lip recognition model

37:第一觸發訊息 37: The first trigger message

38:第二觸發訊息 38: The second trigger message

40:語音接收裝置 40: Voice receiving device

50:影像擷取裝置 50: Image capture device

C:使用者 C: user

S40:身分驗證操作 S40: Identity verification operation

S400~S402,S500~S503,S600~S604,S700~S703, S400~S402,S500~S503,S600~S604,S700~S703,

圖1a為依據本創作一實施例所繪示的人工智能聲控銀行交易系統的方塊圖。 Fig. 1a is a block diagram of an artificial intelligence voice-activated banking transaction system according to an embodiment of the invention.

圖1b為依據本創作一實施例所繪示的人工智能聲控銀行交易系統的使用情境的方塊圖。 FIG. 1 b is a block diagram of a usage scenario of an artificial intelligence voice-activated banking transaction system according to an embodiment of the present invention.

圖1c為依據本創作一實施例所繪示的人工智能聲控銀行交易系統的記憶體接收語音訊息的方塊示意圖。 Fig. 1c is a schematic block diagram of receiving voice messages by the memory of the artificial intelligence voice-activated banking transaction system according to an embodiment of the present invention.

圖2a係依據本創作一實施例所繪示的人工智能聲控銀行交易系統在執行身分驗證的操作流程圖。 Fig. 2a is an operation flowchart of the identity verification performed by the artificial intelligence voice-activated banking transaction system according to an embodiment of the present invention.

圖2b係依據本創作一實施例所繪示的人工智能聲控銀行交易系統在執行指令確認的操作流程圖。 Fig. 2b is an operation flow chart of the execution order confirmation of the artificial intelligence voice-activated banking transaction system according to an embodiment of the present invention.

圖3係依據本創作一實施例所繪示的人工智能聲控銀行交易系統在執行身分驗證的另一操作流程圖。 FIG. 3 is another flow chart of the identity verification performed by the artificial intelligence voice-activated banking transaction system according to an embodiment of the present invention.

圖4係依據本創作一實施例所繪示的人工智能聲控銀行交易系統在執行操作確認的另一操作流程圖。 Fig. 4 is another operation flow chart showing the operation confirmation of the artificial intelligence voice-activated banking transaction system according to an embodiment of the present invention.

圖5a為依據本創作另一實施例所繪示的人工智能聲控銀行交易系統的方塊圖。 Fig. 5a is a block diagram of an artificial intelligence voice-activated banking transaction system according to another embodiment of the present invention.

圖5b為依據本創作另一實施例所繪示的人工智能聲控銀行交易系統的使用情境的方塊圖。 Fig. 5b is a block diagram of a usage scenario of an artificial intelligence voice-activated banking transaction system according to another embodiment of the present invention.

圖5c為依據本創作另一實施例所繪示的人工智能聲控銀行交易系統的記憶體接收語音訊息及臉部影像資料的方塊示意圖。 Fig. 5c is a schematic block diagram of receiving voice messages and facial image data by the memory of the artificial intelligence voice-activated banking transaction system according to another embodiment of the present invention.

圖6係依據本創作另一實施例所繪示的人工智能聲控銀行交易系統在執行身分驗證的臉部識別的一操作流程圖。 FIG. 6 is an operation flowchart of face recognition for identity verification performed by the artificial intelligence voice-activated banking transaction system according to another embodiment of the present invention.

圖7為依據本創作又一實施例所繪示的人工智能聲控銀行交易系統的又一記憶體接收訊息及資料的方塊示意圖。 FIG. 7 is a schematic block diagram of another memory receiving messages and data of the artificial intelligence voice-activated banking transaction system according to another embodiment of the present invention.

圖8為依據本創作又一實施例所繪示的人工智能聲控銀行交易系統在執行指令確認的唇形識別的一操作流程圖。 FIG. 8 is a flow chart of an operation of the artificial intelligence voice-activated bank transaction system performing lip recognition for order confirmation according to another embodiment of the present invention.

圖9為依據本創作其他實施例所繪示的人工智能聲控銀行交易系統的使用情境的方塊圖。 FIG. 9 is a block diagram of a usage scenario of an artificial intelligence voice-activated banking transaction system according to other embodiments of the present invention.

圖10為依據本創作其他實施例所繪示的人工智能聲控銀行交易系統的其他記憶體接收訊息及資料的方塊示意圖。 FIG. 10 is a schematic block diagram of receiving messages and data from other memories of the artificial intelligence voice-activated banking transaction system shown in other embodiments of the present invention.

圖11為依據本創作其他實施例所繪示的人工智能聲控銀行交易系統在受觸發的情況下的方塊流程圖。 FIG. 11 is a block flow diagram of the artificial intelligence voice-activated banking transaction system when triggered according to other embodiments of the present invention.

以下在實施方式中詳細敘述本創作之詳細特徵以及優點，其內容足以使任何熟習相關技藝者了解本創作之技術內容並據以實施，且根據本說明書所揭露之內容、申請專利範圍及圖式，任何熟習相關技藝者可輕易地理解本創作相關之目的及優點。以下之實施例係進一步詳細說明本創作之觀點，但非以任何觀點限制本創作之範疇。 The detailed features and advantages of this creation are described in detail below in the implementation mode. The content is enough to enable anyone familiar with the relevant art to understand the technical content of this creation and implement it according to the content disclosed in this specification, the patent scope and drawings , anyone who is familiar with the related art can easily understand the purpose and advantages related to this creation. The following examples are to further describe the viewpoints of this creation in detail, but not to limit the scope of this creation in any way.

請參考圖1a至圖1c，圖1a為依據本創作一實施例所繪示的人工智能聲控銀行交易系統的方塊圖，圖1b為依據本創作一實施例所繪示的人工智能聲控銀行交易系統的使用情境的方塊圖，圖1c為依據本創作一實施例所繪示的人工智能聲控銀行交易系統的記憶體接收語音訊息的方塊示意圖。 Please refer to Fig. 1a to Fig. 1c, Fig. 1a is a block diagram of an artificial intelligence voice-activated bank transaction system according to an embodiment of the invention, and Fig. 1b is an artificial intelligence voice-activated bank transaction system according to an embodiment of this creation 1c is a block diagram of receiving voice messages by the memory of the artificial intelligence voice-activated banking transaction system according to an embodiment of the present invention.

如圖1a及圖1b所示，人工智能聲控銀行交易系統1包含自動櫃員機10、訊號連接於自動櫃員機10的運算裝置20、訊號連接於運算裝置20的記憶體30，及訊號連接於記憶體30的語音接收裝置40。自動櫃員機10是用於讓使用者C自助辦理銀行櫃檯服務，例如但不以此為限的，可為一般大眾使用的自動櫃員機(ATM)。運算裝置20用於對使用者C進行一身分驗證操作及一指令確認操作，也就是對使用者C進行身分確認並判斷使用者欲下達的指令，其中運算裝置20可為各種具有運算能力的處理器。記憶體30用於儲存一第一及第二語音訊息，並儲存有語音識別模型31、使用者C的一登記身分資料32及一指令資料庫33，其中記憶體30可為各種具有儲存資料之能力的儲存裝置。語音接收裝置40用於接收使用者C的第一及第二語音訊息並傳輸至記憶體30，其中語音接收裝置40可為一麥克風，且可鄰近設置於自動櫃員機10以接收來自使用者C的語音。 As shown in Figure 1a and Figure 1b, the artificial intelligence voice-activated banking transaction system 1 includes an automatic teller machine 10, a computing device 20 whose signal is connected to the automatic teller machine 10, a memory 30 whose signal is connected to the computing device 20, and a signal connected to the memory 30 The voice receiving device 40. The automatic teller machine 10 is used to allow the user C to handle bank counter services by himself, for example but not limited thereto, it can be an automatic teller machine (ATM) used by the general public. Computing device 20 is used for user C to carry out Perform an identity verification operation and an instruction confirmation operation, that is, verify the identity of the user C and determine the instruction the user intends to issue, wherein the computing device 20 can be a variety of processors with computing capabilities. The memory 30 is used to store a first and second voice message, and stores a voice recognition model 31, a registered identity data 32 of the user C and an instruction database 33, wherein the memory 30 can be various storage capacity. The voice receiving device 40 is used to receive the first and second voice messages of the user C and transmit them to the memory 30, wherein the voice receiving device 40 can be a microphone, and can be arranged adjacent to the automatic teller machine 10 to receive messages from the user C. voice.

請一同參考圖1c，語音接收裝置40將第一語音訊息及第二語音訊息以訊號傳輸的方式傳遞至記憶體30中。上述兩語音訊息都會被運算裝置20透過語音識別模型31進行分析判斷，且第一語音訊息及第二語音訊息分別對應至登記身分資料32及指令資料庫33，此部分於後續描述。需要注意的是，圖1a至圖1c的不同方塊之間的連線可為實線或虛線，具體來說，不同方塊之間的訊號連皆可透過實體訊號線連接或透過無線通訊方式連接，本案不予限制。另一方面，如具本案通常知識者能理解的，上述除了圖1b的語音接收裝置40需鄰近設置於自動櫃員機10以滿足物理上聲波傳遞的有利條件外，其餘方塊可遠端設置於其他位置，例如運算裝置20可為中央處理器，記憶體30可為雲端資料庫等。當然，在一些實施例中，運算裝置20及記憶體30可鄰近設置於自動櫃員機10附近，以達到高效率的邊緣運算及防止資料攔截等的功效，此部分將於後續進行描述且不應成為本案限制條件。 Please refer to FIG. 1c together. The voice receiving device 40 transmits the first voice message and the second voice message to the memory 30 in the form of signal transmission. The above two voice messages will be analyzed and judged by the computing device 20 through the voice recognition model 31, and the first voice message and the second voice message are respectively corresponding to the registered identity data 32 and the instruction database 33, which will be described later. It should be noted that the connection lines between different blocks in Fig. 1a to Fig. 1c can be solid lines or dotted lines. Specifically, the signal connections between different blocks can be connected through physical signal lines or through wireless communication. This case is not limited. On the other hand, as those with ordinary knowledge of this case can understand, except that the voice receiving device 40 in FIG. For example, the computing device 20 can be a central processing unit, and the memory 30 can be a cloud database, etc. Of course, in some embodiments, the computing device 20 and the memory 30 can be arranged adjacent to the automatic teller machine 10 to achieve high-efficiency edge computing and prevent data interception, etc. This part will be described later and should not be Limitations in this case.

關於本例的語音識別模型31，可為一種預先透過深度學習方法訓練的神經網路，具體來說，語音識別模型31可包含自動語音識別(Automatic Speech Recognition，ASR)軟體，使得理論上，運算裝置20可透過自動語音識別軟體進行以下步驟來完成本案的語音識別功能。一、使用者向語音接收裝置發出聲音訊息。二、將聲音訊息轉為聲波訊號。三、語音接收裝置中的濾波器濾除聲波訊號中的雜訊。四、將濾波後的聲波訊號分解為多組音素(Phonemes)(所謂音素，指組成語言的聲音的基本聲音塊，以英語來說，具有44個音素如「wh」、「th」及「t」等，而中文系統則較缺乏統一標準而沒有定數)。五、每個音素可組成一列表(list)，且可依序在統計上被分析。六、可透過自動語音識別軟體理解一段話的語意。 Regarding the speech recognition model 31 of this example, it can be a neural network trained by a deep learning method in advance. Specifically, the speech recognition model 31 can include automatic speech recognition (Automatic Speech Recognition, ASR) software, so that theoretically, the calculation The device 20 can perform the following steps through the automatic speech recognition software to complete the speech recognition function of this case. 1. The user sends a voice message to the voice receiving device. 2. Convert the voice message into a sound wave signal. 3. The filter in the voice receiving device filters out the noise in the sound wave signal. 4. Decompose the filtered sound wave signal into multiple groups of phonemes (Phonemes) (the so-called phonemes refer to the basic sound blocks that make up the sound of a language. In English, there are 44 phonemes such as "wh", "th" and "t ", etc., while the Chinese system lacks a unified standard and there is no fixed number). 5. Each phoneme can form a list (list), and can be statistically analyzed sequentially. 6. Can understand the meaning of a passage through automatic speech recognition software.

建立在上述自動語音識別的語音識別模型31可更具有兩種主要的變化形態，分別為直接對話(Directed Dialogue)模型及自然語言處理(Natural Language Processing，NLP)模型。在本文中，將主要以直接對話模型進行舉例，但是在其他實施例中，也可以使用自然語言處理模型，本案不限於此。所謂直接對話模型，係指可直接提供使用者一定範圍的特定選擇詞彙，使得機器在判斷語意上較使用自然語言處理模型容易。 The speech recognition model 31 based on the above-mentioned automatic speech recognition can have two main variations, which are Directed Dialogue (Directed Dialogue) model and Natural Language Processing (NLP) model. In this article, a direct dialogue model will be used as an example, but in other embodiments, a natural language processing model may also be used, and this case is not limited thereto. The so-called direct dialogue model refers to the ability to directly provide users with a certain range of specific choice vocabulary, making it easier for machines to judge semantics than using natural language processing models.

基於上述圖1a至圖1c的人工智能聲控銀行交易系統1，請參考圖2a及圖2b，圖2a係依據本創作一實施例所繪示的人工智能聲控銀行交易系統在執行身分驗證的操作流程圖，圖2b係依據本創作一實施例所繪示的人工智能聲控銀行交易系統在執行指令確認的操作流程圖。 Based on the above-mentioned artificial intelligence voice-activated banking transaction system 1 shown in FIG. 1a to FIG. 1c, please refer to FIG. 2a and FIG. 2b. FIG. Fig. 2b is an operation flow chart of confirming execution instructions of the artificial intelligence voice-activated banking transaction system according to an embodiment of the invention.

如圖2a所示，運算裝置執行的身分驗證操作S40包含，步驟S400：藉由語音識別模型擷取第一語音訊息中含有的一待定身分資料、步驟S401：比對待定身分資料與登記身分資料並確認待定身分資料符合登記身分資料，以及步驟S402：當判斷待定身分資料符合登記身分資料，運算裝置可對使用者進行指令確認操作。舉例來說，在步驟S400中，自動櫃員機可先向使用者發出訊息：「請講述使用者帳號及密碼。」，使用者便向語音接收裝置講述舉例由英文與數字組成的帳號及密碼(即第一語音訊息)，語音接收裝置在將聲波訊號傳輸至記憶體中，供運算裝置分析。在此情況下，運算裝置可主動採用包含英文與數字的音素的模型對聲波訊號進行分析，以得到一串輸入帳號及輸入密碼(即待定身分資料)。當然在其他實施例中，使用者未必要講述帳號及密碼，也可以是使用者姓名、身分證字號或生日等資訊，實現方式同上且本案不限於此。舉例來說，在步驟S401中，運算裝置會將該輸入帳號及輸入密碼與預先儲存的登記帳號及登記密碼(即登記身分資料)進行比對。進一步，運算裝置會先比對帳號以確定輸入帳號的對應密碼，再將輸入密碼與預先儲存的登記密碼進行比對，兩者都符合才算通過身分驗證操作。當然此步驟也可對使用者的其他資訊如姓名、生日等進行比對，在此不贅述。 As shown in FIG. 2a, the identity verification operation S40 performed by the computing device includes, step S400: extract a pending identity data contained in the first voice message through the speech recognition model, step S401: compare the pending identity data with the registered identity data And confirm that the pending identity data matches the registered identity data, and step S402: when judging that the pending identity data matches the registered identity data, the computing device can perform an instruction confirmation operation for the user. For example, in step S400, the automatic teller machine can first send a message to the user: "Please tell me the user account number and password." The user then tells the voice receiving device the account number and password composed of English and numbers (ie The first voice message), the voice receiving device transmits the sound wave signal to the memory for analysis by the computing device. In this case, the computing device can actively use the phoneme model including English and numbers to analyze the sound wave signal, so as to obtain a series of input account numbers and input passwords (ie pending identity information). Of course, in other embodiments, the user does not need to tell the account number and password, but also information such as the user's name, ID card number or birthday, and the implementation method is the same as above and this case is not limited thereto. For example, in step S401, the computing device compares the input account number and input password with the pre-stored registration account number and registration password (ie registration identity information). Further, the computing device will first compare the account number to determine the corresponding password of the input account, and then compare the input password with the pre-stored registration password, and the identity verification operation will be passed if the two match. Of course, this step can also compare other information of the user, such as name, birthday, etc., which will not be repeated here.

如圖2b所示，運算裝置執行的指令確認操作S50包含，步驟S500：運算裝置藉由語音識別模型擷取第二語音訊息中含有的一待定指令資料、步驟S501：比對待定指令資料與指令資料庫中的多個執行指令資料、步驟S502：判斷待定指令資料對應的一執行指令資料，以及步驟S503：將該執行指令資料傳送至該自動櫃員機以執行相關服務。舉例來說，在步驟S500中，自動櫃員機可先向使用者發出訊息：「請講述您想進行的業務，包含存款、提款、轉帳......」，其中業務選項可被包含在圖1c的指令資料庫33中，使用者便向語音接收裝置講述欲執行的業務內容(即第二語音訊息)，語音接收裝置在將聲波訊號傳輸至記憶體中，供運算裝置分析。在此情況下，運算裝置可主動採用包含相關業務內容的音素的模型對聲波訊號進行分析，以得到相關的業務選項(即待定指令資料)。舉例來說，在步驟S501中，運算裝置會將待定指令資料(如「轉帳」)與預先儲存在指令資料庫中的執行指令資料(如「提款」、「轉帳」、「存款」等)進行比對，使得在步驟S502中能判斷出一執行指令資料(以本例來說，為「轉帳」)，並將該執行指令資料傳送至自動櫃員機中以執行相關服務。 As shown in Figure 2b, the command confirmation operation S50 performed by the computing device includes, step S500: the computing device extracts a pending command data contained in the second voice message through the voice recognition model, and step S501: compares the pending command data with the command A plurality of execution order data in the database, step S502: determine an execution order data corresponding to the pending order data, and step S503: transmit the execution order data to the automatic teller machine to execute related services. example For example, in step S500, the automatic teller machine can first send a message to the user: "Please tell us about the business you want to perform, including deposit, withdrawal, transfer...", wherein the business options can be included in the figure In the instruction database 33 of 1c, the user tells the voice receiving device about the business content to be executed (that is, the second voice message), and the voice receiving device transmits the sound wave signal to the memory for analysis by the computing device. In this case, the computing device can actively use the phoneme model containing relevant service content to analyze the sound wave signal to obtain relevant service options (ie pending command data). For example, in step S501, the computing device will combine the pending instruction data (such as "transfer") with the executed instruction data (such as "withdrawal", "transfer", "deposit", etc.) stored in the command database in advance The comparison is performed so that an execution instruction data (in this example, "transfer") can be determined in step S502, and the execution instruction data is sent to the automatic teller machine to execute related services.

具體來說，使用者可講述「轉帳」的服務項目，而運算裝置可將執行指令資料判斷為「轉帳」並讓自動櫃員機提供服務。而在其他實施例中，記憶體可更包含自然語言處理模型，用於讓運算裝置在特定情形下斟酌使用。舉例來說，當使用者講述的業務項目為「我要領錢」時，運算裝置透過自然語言模型可先抓出關鍵字為「錢」而降低較不相關的「我要」的權重，進一步理解「領錢」同義於指令資料庫中的「提款」指令，以避免「我要領錢」無法產生對應至「提款」指令之情形。 Specifically, the user can describe the service item of "transfer", and the computing device can judge the execution instruction data as "transfer" and allow the automatic teller machine to provide the service. In other embodiments, the memory may further include a natural language processing model for the computing device to consider and use in a specific situation. For example, when the business item described by the user is "I want to receive money", the computing device can first capture the keyword "money" through the natural language model and reduce the weight of the less relevant "I want" to further understand "Receive money" is synonymous with the "withdrawal" command in the command database to avoid the situation that "I want to get money" cannot be corresponding to the "withdrawal" command.

需要注意的是，步驟S503所述的執行相關服務在本例中可理解為執行與轉帳相關的服務，包括透過自動櫃員機向使用者索取轉帳帳號及金額等，也就是說，使用者會再次向語音接收裝置發出另一第二語音訊息，讓運算裝置擷取分析出進一步的資料。此外，步驟S503為可選擇性被設置的，例如當交易結束時，運算裝置可能直接向雲端伺服器發送交易紀錄而不將執行指令資料傳送至自動櫃員機。 It should be noted that, in this example, the execution of related services described in step S503 can be understood as the execution of transfer-related services, including requesting the transfer account number and amount from the user through the automatic teller machine, that is, the user will again submit the The voice receiving device sends out another second voice message, allowing the computing device to retrieve and analyze further data. In addition, step S503 is Optionally, for example, when the transaction is completed, the computing device may directly send the transaction record to the cloud server without sending the execution instruction data to the automatic teller machine.

請參考圖3，圖3係依據本創作一實施例所繪示的人工智能聲控銀行交易系統在執行身分驗證的另一操作流程圖。如圖3所示，在身分驗證的操作(S60)中，當運算裝置藉由語音識別模型擷取第一語音訊息中含有的一待定身分資料(步驟S600)時，運算裝置透過自動櫃員機向使用者發送一身分提示訊息，且透過從語音接收裝置接收來自使用者的一身分確認訊息(步驟S601)。舉例來說，當使用者講述帳號為「abc123」時，自動櫃員機可回應「abc123」的身分提示訊息，讓使用者確認本案系統所接收的語音訊息是否正確而回應一「是」或「否」(即身分確認訊息)。接著在步驟S602：運算裝置判斷身分確認訊息為一肯定訊息或一否定訊息。舉例來說，當自動櫃員機回應「abc723」的身分提示訊息，使用者可回應「否」(即否定訊息)，使得系統必須重新執行上述身分驗證操作，以獲得另一第一語音訊息。所謂重新執行身分驗證操作，表示使用者需重新講述帶有身分資料的第一語音訊息。具體來說，當運算裝置判斷身分確認訊息為否定訊息時，會透過自動櫃員機發出請求以要求該使用者提供另一第一語音訊息以更新該待定身分資料，之後再回到步驟S600。另一方面，若身分確認訊息為一肯定訊息，則運算裝置維持待定身分資料並執行步驟S603：運算裝置比對待定身分資料與登記身分資料以判斷待定身分資料通過身分驗證操作，以及步驟S604：運算裝置對使用者進行指令確認操作。此二步驟與上述的步驟S401及步驟S402相同，在此不贅述。特別來說，當運算裝置判斷待定身分資料與登記身分資料不相符而判斷待定身分資料未通過身分驗證時，運算裝置可則等待新的第一語音訊息輸入或透過ATM要求使用者輸入新的第一語音訊息。 Please refer to FIG. 3 . FIG. 3 is another operation flowchart of the identity verification performed by the artificial intelligence voice-activated banking transaction system according to an embodiment of the present invention. As shown in Figure 3, in the operation of identity verification (S60), when the computing device extracts a pending identity data contained in the first voice message through the voice recognition model (step S600), the computing device uses the automatic teller machine to The user sends an identity prompt message, and receives an identity confirmation message from the user through the voice receiving device (step S601). For example, when the user tells that the account number is "abc123", the automatic teller machine can respond to the identity prompt message of "abc123", allowing the user to confirm whether the voice message received by the system in this case is correct and respond with a "yes" or "no" (i.e. identity verification message). Then in step S602: the computing device determines whether the identity confirmation message is a positive message or a negative message. For example, when the automatic teller machine responds to the identity prompt message of "abc723", the user can respond with "No" (that is, a negative message), so that the system must re-execute the above identity verification operation to obtain another first voice message. The so-called re-execution of the identity verification operation means that the user needs to retell the first voice message with the identity data. Specifically, when the computing device determines that the identity confirmation message is a negative message, it sends a request through the automatic teller machine to request the user to provide another first voice message to update the pending identity information, and then returns to step S600. On the other hand, if the identity confirmation message is a positive message, the computing device maintains the pending identity data and executes step S603: the computing device compares the pending identity data with the registered identity data to determine that the pending identity data has passed the identity verification operation, and step S604: The computing device performs an instruction confirmation operation for the user. These two steps are the same as the above-mentioned step S401 and step S402, and will not be repeated here. Specifically, when the computing device determines that pending identity data is When the recorded identity data do not match and it is judged that the pending identity data has not passed the identity verification, the computing device may then wait for a new first voice message to be input or require the user to input a new first voice message through the ATM.

請參考圖4，圖4係依據本創作一實施例所繪示的人工智能聲控銀行交易系統在執行操作確認的另一操作流程圖。如圖4所示，在指令確認的操作(S70)中，當運算裝置判斷待定指令資料對應的一執行指令資料(步驟S700)時，運算裝置可透過自動櫃員機向使用者發送一指令提示訊息，且透過從語音接收裝置接收來自使用者的一指令確認訊息(步驟S701)。舉例來說，當使用者說出「我要領兩千元」時，自動櫃員機可發出「請問您要提款嗎？」的指令提示訊息，使用者再回復「是」的指令確認訊息。關於如何從「領兩千元」對應到「提款」的方法，可參照上述直接對話模型或自然語言模型的描述，在此不贅述。接著於步驟S702，運算裝置判斷指令確認訊息為一肯定訊息或一否定訊息，若為肯定訊息則運算裝置可將執行指令資料傳送至自動櫃員機以執行相關服務(步驟S703)。或者，當指令確認訊息為否定訊息時，可重新執行指令確認操作，以獲得另一執行指令資料。也就是說，當運算裝置判斷指令確認訊息為非同意訊息時，運算裝置可透過自動櫃員機發出請求以要求該使用者提供另一第二語音訊息以更新該待定指令資料。例如：「請問您要提款嗎？」(指令提示訊息)、「不是」(指令確認訊息)、「請問您要進行何種業務？」(重新要求第二語音訊息)、「轉帳」(更新待定指令資料)、「請問您要轉帳嗎？」(指令提示訊息)、「是」(指令確認訊息)。 Please refer to FIG. 4 . FIG. 4 is another operation flow chart of an artificial intelligence voice-activated banking transaction system performing operation confirmation according to an embodiment of the present invention. As shown in Figure 4, in the operation of order confirmation (S70), when the computing device judges the execution command data corresponding to the pending command data (step S700), the computing device can send an instruction prompt message to the user through the automatic teller machine, And by receiving an instruction confirmation message from the user from the voice receiving device (step S701). For example, when the user says "I want two thousand yuan", the automatic teller machine can issue an instruction prompt message of "Do you want to withdraw money?", and the user can reply the instruction confirmation message of "Yes". For the method of how to correspond from "receiving two thousand yuan" to "withdrawing money", you can refer to the description of the above-mentioned direct dialogue model or natural language model, and will not go into details here. Then in step S702, the computing device judges that the instruction confirmation message is a positive message or a negative message, and if it is a positive message, the computing device can transmit the execution command data to the automatic teller machine to execute related services (step S703). Or, when the command confirmation message is a negative message, the command confirmation operation can be re-executed to obtain another execution command data. That is to say, when the computing device judges that the command confirmation message is an unapproved message, the computing device can send a request through the automatic teller machine to request the user to provide another second voice message to update the pending command data. For example: "Do you want to withdraw money?" (command prompt message), "No" (command confirmation message), "What kind of business do you want to do?" (Re-request the second voice message), "Transfer" (update Pending order information), "Do you want to transfer money?" (order prompt message), "Yes" (order confirmation message).

本案系統的變化實施例的身分驗證可進一步涵蓋影像辨識功能。請參照圖5a至圖5c，圖5a為依據本創作另一實施例所繪示的人工智能聲控銀行交易系統的方塊圖，圖5b為依據本創作另一實施例所繪示的人工智能聲控銀行交易系統的使用情境的方塊圖，圖5c為依據本創作另一實施例所繪示的人工智能聲控銀行交易系統的記憶體接收語音訊息及臉部影像資料的方塊示意圖。 The identity verification in the variant embodiment of the system of this case may further include the image recognition function. Please refer to Fig. 5a to Fig. 5c, Fig. 5a is a block diagram of an artificial intelligence voice-activated bank transaction system according to another embodiment of this creation, and Fig. 5b is a block diagram of an artificial intelligence voice-activated bank according to another embodiment of this creation A block diagram of the usage scenario of the transaction system, FIG. 5c is a schematic block diagram of receiving voice messages and facial image data by the memory of the artificial intelligence voice-activated banking transaction system according to another embodiment of the present invention.

如圖5a至圖5c所示，人工智能聲控銀行交易系統1’除了包含自動櫃員機10、運算裝置20、記憶體30’、語音接收裝置40以外，更包含影像擷取裝置50，連接至記憶體30’。如圖5b所示，影像擷取裝置50可例如為安裝在自動櫃員機10上的一攝像機，用於擷取使用者C的一臉部影像資料。如圖5c所示，記憶體30’更事先儲存有一臉部特徵偵測模型34及使用者的一登錄臉部特徵35。 As shown in Figures 5a to 5c, the artificial intelligence voice-activated banking transaction system 1' includes not only the automatic teller machine 10, the computing device 20, the memory 30', and the voice receiving device 40, but also includes an image capture device 50 connected to the memory 30'. As shown in FIG. 5 b , the image capture device 50 can be, for example, a camera installed on the automatic teller machine 10 for capturing a face image data of the user C. As shown in FIG. As shown in FIG. 5c, the memory 30' further stores a facial feature detection model 34 and a registered facial feature 35 of the user in advance.

請參照圖6，圖6係依據本創作另一實施例所繪示的人工智能聲控銀行交易系統在執行身分驗證的臉部識別的一操作流程圖。如圖6所示，運算裝置執行的身分驗證操作更包含一臉部識別操作，包含步驟S900：藉由該臉部特徵偵測模型擷取該臉部影像資料中含有的一待定臉部特徵、步驟S901：比對該待定臉部特徵與該登錄臉部特徵、步驟S902：判斷該待定臉部特徵是否符合該登錄臉部特徵，若該待定臉部特徵符合該登錄臉部特徵，則進行步驟S903：運算裝置執行其他身分驗證操作或該指令確認操作，而若待定臉部特徵不符合該登錄臉部特徵，則運算裝置重新執行該身分驗證操作。 Please refer to FIG. 6 . FIG. 6 is an operation flowchart of face recognition for identity verification in an artificial intelligence voice-activated banking transaction system according to another embodiment of the present invention. As shown in FIG. 6, the identity verification operation performed by the computing device further includes a facial recognition operation, including step S900: using the facial feature detection model to extract a pending facial feature contained in the facial image data, Step S901: Compare the pending facial feature with the registered facial feature. Step S902: Determine whether the pending facial feature matches the registered facial feature. If the pending facial feature matches the registered facial feature, proceed to step S901. S903: the computing device executes other identity verification operations or the instruction confirmation operation, and if the pending facial features do not match the registered facial features, the computing device executes the identity verification operation again.

舉例來說，影像擷取裝置在拍攝使用者的臉部影像後，可將影像資料傳輸至記憶體中，運算裝置可藉由臉部特徵偵測模型擷取該影像資料的特徵，其中，臉部特徵偵測模型可為具有深度學習能力的神經網路(如VGG-Face、Google FaceNet、OpenFace等)。以Google FaceNet為例，其並非一開始就輸出分類結果，而是先輸出量化特徵值，此運算可藉此將臉部特徵如輪廓擷取出來成為一待定臉部特徵，再於特徵向量空間中對不同臉部照片進行比較，以判別不同臉孔的相似度，換言之，在特徵向量空間中距離越近的兩點表示臉孔相似度越高。另外，記憶體儲存的登錄臉部特徵是關聯於上述登記身分資料，具體來說，當使用者先進行如圖2a步驟S401所示的比對身分資料後，該身分資料會對應至一登陸臉部特徵以用於比對。在臉部特徵的比對判斷上，可透過擷取使用者的多張影像來增加識別準確率，在此不贅述。需要注意的是，當運算裝置判斷待定臉部特徵不符合登錄臉部特徵時，表示使用者的身分資料可能有誤，需要重新執行身分驗證操作，也因此會再度進行上述臉部識別操作，然而圖6並沒有示出所有身分驗證操作，故步驟S902的否定箭頭以指向步驟S900表示，本案不限於此。 For example, after the image capture device captures the user's face image, it can transmit the image data to the memory, and the computing device can use the facial feature detection model to extract the features of the image data, wherein the face The internal feature detection model can be a neural network with deep learning capabilities (such as VGG-Face, Google FaceNet, OpenFace, etc.). Taking Google FaceNet as an example, instead of outputting classification results at the beginning, it outputs quantized feature values first. This operation can extract facial features such as contours into undetermined facial features, and then in the feature vector space Compare different facial photos to determine the similarity of different faces. In other words, the closer the two points in the feature vector space, the higher the similarity of the faces. In addition, the registered facial features stored in the memory are associated with the above-mentioned registered identity data. Specifically, after the user performs the identity data comparison as shown in step S401 of FIG. 2a, the identity data will correspond to a login face. internal features for comparison. In terms of comparing and judging facial features, the accuracy of recognition can be increased by capturing multiple images of the user, which will not be described here. It should be noted that when the computing device judges that the pending facial features do not match the registered facial features, it means that the user's identity information may be incorrect, and the identity verification operation needs to be performed again, so the above-mentioned facial recognition operation will be performed again. FIG. 6 does not show all identity verification operations, so the negative arrow in step S902 is indicated by pointing to step S900 , and this case is not limited thereto.

請參照圖7，圖7為依據本創作又一實施例所繪示的人工智能聲控銀行交易系統的又一記憶體接收訊息及資料的方塊示意圖。如圖7所示，記憶體30”更事先儲存有一唇形識別模型36(如LipNet)，而運算裝置更用於透過影像擷取裝置50擷取使用者的的多個唇部影像以獲得一動態唇形資料，且運算裝置20執行的該指令確認操作更包含一唇形識別操作。 Please refer to FIG. 7 . FIG. 7 is a schematic block diagram of another memory receiving messages and data of an artificial intelligence voice-activated banking transaction system according to another embodiment of the present invention. As shown in Figure 7, the memory 30" stores a lip shape recognition model 36 (such as LipNet) in advance, and the computing device is further used to capture multiple lip images of the user through the image capture device 50 to obtain a lip shape recognition model 36 (such as LipNet). Dynamic lip shape data, and the instruction confirmation operation performed by the computing device 20 further includes a lip shape recognition operation.

請參照圖8，圖8為依據本創作又一實施例所繪示的人工智能聲控銀行交易系統在執行指令確認的唇形識別的一操作流程圖。如圖8所示，運算裝置執行的唇形識別操作S110包含步驟S1100：藉由唇形識別模型擷取動態唇形資料所包含的一參照指令資料、步驟S1101：判斷待定指令資料與參照指令資料是否相同，以及步驟S1102：保留待定指令資料。在步驟S1101中，若判斷待定指令資料與參照指令資料相同，則進行步驟S1102，若判斷待定指令資料與參照指令資料不同，表示語音識別模型得到的待定指令資料與唇形識別模型得到的參照指令資料不同，可重新進行上述指令確認操作。需要注意的是，本圖未示出所有指令確認操作，因此將表示「重新進行指令確認操作」的否定箭頭指向步驟S1100(意味著當重新進行指令確認操作，也可重新進行唇形識別操作)，本案不限於此。具體來說，唇形識別模型的原理類似於上述語音識別模型，可透過將口語分解為對應至不同發音的口形元素，並儲存為一序列供運算裝置進行分析，在此不詳述。需要注意的是，用於前述實施例的影像擷取裝置可僅具有拍攝影像功能，但在本實施例中，影像擷取裝置需具有錄製連續影片或拍攝高速連續影像的功能，以達成唇形識別。 Please refer to FIG. 8 . FIG. 8 is a flow chart of an operation of the artificial intelligence voice-activated banking transaction system performing lip recognition for order confirmation according to another embodiment of the present invention. As shown in FIG. 8 , the lip shape recognition operation S110 performed by the computing device includes step S1100: using the lip shape recognition model to extract a reference command data included in the dynamic lip shape data, step S1101: judging the pending command data and the reference command data Whether they are the same, and step S1102: keep the pending order data. In step S1101, if it is determined that the pending instruction data is the same as the reference instruction data, proceed to step S1102, if it is judged that the pending instruction data is different from the reference instruction data, it means that the pending instruction data obtained by the speech recognition model and the reference instruction obtained by the lip shape recognition model If the data is different, you can redo the above command confirmation operation. It should be noted that this figure does not show all the command confirmation operations, so the negative arrow indicating "re-perform the command confirmation operation" points to step S1100 (meaning that when the command confirmation operation is performed again, the lip shape recognition operation can also be performed again) , this case is not limited to this. Specifically, the principle of the lip shape recognition model is similar to the above-mentioned speech recognition model, which can decompose spoken language into lip shape elements corresponding to different pronunciations, and store them as a sequence for analysis by the computing device, which will not be described in detail here. It should be noted that the image capture device used in the foregoing embodiments may only have the function of shooting images, but in this embodiment, the image capture device must have the function of recording continuous videos or shooting high-speed continuous images to achieve lip shape identify.

請參照圖9，圖9為依據本創作其他實施例所繪示的人工智能聲控銀行交易系統的使用情境的方塊圖。如圖9所示，語音接收裝置40可為使用者C的一行動裝置(如智慧型手機)，且語音接收裝置40可更連接於運算裝置20。在本例中，使用者C的手機可與運算裝置20及記憶體30產生連線，具體來說，手機可透過與自動櫃員機10連線，再連線至運算裝置20及記憶體30，本案不限於此。使用者C可向手機講述個人資料及欲執行的銀行業務，使得周遭他人不易聽見隱私資料。同理，系統可透過手機向使用者C發出提示訊息，也避免被周遭人聽見。此外，手機更可用於接收來自系統的經過加密的一提示訊息並將該提示訊息轉為一提示音後播放給該使用者，增加資訊安全性。 Please refer to FIG. 9 . FIG. 9 is a block diagram of a usage scenario of an artificial intelligence voice-activated banking transaction system according to other embodiments of the present invention. As shown in FIG. 9 , the voice receiving device 40 can be a mobile device (such as a smart phone) of the user C, and the voice receiving device 40 can be further connected to the computing device 20 . In this example, the mobile phone of user C can be connected to the computing device 20 and the memory 30. Specifically, the mobile phone can be connected to the automatic teller machine 10, and then connected to The computing device 20 and the memory 30 are not limited in this case. User C can tell the personal information and the banking business to be performed to the mobile phone, making it difficult for others around to hear the private information. In the same way, the system can send a reminder message to user C through the mobile phone without being heard by the people around. In addition, the mobile phone can also be used to receive an encrypted prompt message from the system and convert the prompt message into a prompt tone and play it to the user, thereby increasing information security.

請參照圖10，圖10為依據本創作其他實施例所繪示的人工智能聲控銀行交易系統的其他記憶體接收訊息及資料的方塊示意圖。如圖10所示，記憶體30'''更事先儲存有一第一觸發訊息37及一第二觸發訊息38，且更用於接收來自語音接收裝置的一第三語音訊息。 Please refer to FIG. 10 . FIG. 10 is a schematic block diagram of receiving messages and data from other memories of the artificial intelligence voice-activated banking transaction system shown in other embodiments of the present invention. As shown in FIG. 10 , the memory 30''' further stores a first trigger message 37 and a second trigger message 38 in advance, and is further used to receive a third voice message from the voice receiving device.

請結合圖10參照圖11，圖11為依據本創作其他實施例所繪示的人工智能聲控銀行交易系統在受觸發的情況下的方塊流程圖。如圖11所示，當運算裝置透過語音接收裝置接收來自使用者的一第三語音訊息時，可判斷第三語音訊息是否符合第一觸發訊息或第二觸發訊息，若第三語音訊息符合第一觸發訊息，則重新執行上述身分驗證操作，若第三語音訊息符合第二觸發訊息，則重新執行指令確認操作，否則，不執行額外操作。具體來說，若一個使用者完成身分驗證訊息後，發現他/她是想對另一帳戶進行操作，便可對語音接收裝置發出符合第一觸發訊息的一第三語音訊息(如「重新驗證」)。類似的，若一個使用者說出要轉帳後，發現他/她其實是想提款，便可對語音接收裝置發出符合第二觸發訊息的一第三語音訊息(如「重新交易」)。當然，只要在過程中使用者避談上述第一觸發訊息或第二觸發訊息，系統便不會受到第三語音訊息之觸發而中斷交易。 Please refer to FIG. 11 in conjunction with FIG. 10 . FIG. 11 is a block flow diagram of the artificial intelligence voice-activated banking transaction system according to other embodiments of the present invention when it is triggered. As shown in Figure 11, when the computing device receives a third voice message from the user through the voice receiving device, it can determine whether the third voice message matches the first trigger message or the second trigger message, if the third voice message matches the first trigger message Once the trigger message is received, the above identity verification operation is re-executed, and if the third voice message matches the second trigger message, the instruction confirmation operation is re-executed; otherwise, no additional operation is performed. Specifically, if a user finds that he/she wants to operate another account after completing the identity verification message, he/she can send a third voice message that matches the first trigger message to the voice receiving device (such as "re-authenticate "). Similarly, if a user says that he/she wants to transfer money and finds that he/she actually wants to withdraw money, he/she can send a third voice message (such as "re-transaction") that meets the second trigger message to the voice receiving device. Of course, as long as the user avoids talking about the first trigger message or the second trigger message during the process, the system will not be triggered by the third voice message and interrupt the transaction.

依據本文的揭露，可達成以下描述的一種使用者情境，需要注意的是，此情境僅為一種舉例，本案不限於此。客戶前往自動櫃員機，依指示插入金融卡或點選無卡交易功能後，自動櫃員機播放語音指示客戶於指定時間內靠近麥克風，並講述身分驗證問題，如生日、電話、帳號及密碼等個人資訊。麥克風收到客戶語音，透過降躁技術過濾雜音，傳輸至本案系統內以判定客戶語音內容。系統判定身分驗證通過，於自動櫃員機播放系語音指示客戶於指定時間內靠近麥克風，講述交易項目(存款/提款/轉帳)及交易內容(帳號/金額)，若系統判定身分驗證未通過或逾時，重新回到上述身分驗證步驟。麥克風接收到客戶語音，系統辨識客戶交易內容，並再次於自動櫃員機播放語音指示客戶於指定時間內靠近麥克風，回覆交易確認/修改/取消指令。系統收到客戶確認指令，於自動櫃員機執行交易，若系統收到客戶修改指令，重新回到上述對應流程，若系統收到客戶取消或系統交易逾時，重新回到初始流程。客戶完成交易後，系統以電子郵件及簡訊傳送留存單據。 According to the disclosure of this article, a user situation described below can be achieved. It should be noted that this situation is only an example, and this case is not limited thereto. After the customer goes to the ATM, inserts the financial card according to the instructions or clicks the card-not-present transaction function, the ATM will play a voice instructing the customer to approach the microphone within a specified time, and tell the identity verification questions, such as birthday, phone number, account number, password and other personal information. The microphone receives the voice of the customer, filters the noise through noise reduction technology, and transmits it to the system of this case to determine the content of the customer's voice. The system determines that the identity verification is passed, and the voice will be played on the ATM to instruct the customer to approach the microphone within the specified time, and describe the transaction items (deposit/withdrawal/transfer) and transaction content (account number/amount). , return to the identity verification steps above. The microphone receives the customer's voice, and the system recognizes the customer's transaction content, and plays the voice again on the ATM to instruct the customer to approach the microphone within a specified time and reply to the transaction confirmation/modification/cancellation order. The system receives the customer's confirmation instruction and executes the transaction at the ATM. If the system receives the customer's modification instruction, it returns to the above corresponding process. If the system receives the customer's cancellation or the system transaction times out, it returns to the initial process. After the customer completes the transaction, the system sends the retained receipt by email and SMS.

藉由上述結構，本案所揭示的人工智能聲控銀行交易系統，可透過儲存有人工智慧識別模型的記憶體搭配運算裝置，對接收的語音訊息進行內容判斷，以安全有效地執行身分驗證以及指令控制的功能。讓使用者只需再自動櫃員機前講述自己的身分資料完成驗證後，再口語表達欲執行的銀行服務項目，如此一來一般的銀行業務皆能完全透過聲音控制的方式來完成，對各種族群來說都是便利的措施。 With the above structure, the artificial intelligence voice-activated banking transaction system disclosed in this case can judge the content of the received voice message through the memory with the artificial intelligence recognition model and the computing device, so as to safely and effectively perform identity verification and command control function. Let users only need to tell their identity information in front of the ATM to complete the verification, and then verbally express the banking service items they want to perform. In this way, general banking services can be completed completely through voice control, and it is suitable for various ethnic groups. It is said that it is a convenient measure.

雖然本創作以前述之實施例揭露如上，然其並非用以限定本創作。在不脫離本創作之精神和範圍內，所為之更動與潤飾，均屬本創作之專利保護範圍。關於本創作所界定之保護範圍請參考所附之申請專利範圍。 Although the invention is disclosed as above with the aforementioned embodiments, it is not intended to limit the invention. Without departing from the spirit and scope of this creation, all changes and modifications are The scope of patent protection for this creation. For the scope of protection defined by this creation, please refer to the attached scope of patent application.

10:自動櫃員機 10: Automatic teller machine

20:運算裝置 20: computing device

30:記憶體 30: memory

40:語音接收裝置 40: Voice receiving device

Claims

An artificial intelligence voice-activated bank transaction system, comprising: an automatic teller machine for allowing a user to handle bank counter services by himself; a computing device connected to the automatic teller machine for performing an identity verification operation and an instruction on the user confirming the operation; a memory connected to the computing device for storing a first voice message and a second voice message, and storing a voice recognition model, a registered identity data of the user and an instruction database in advance; and a voice receiving device connected to the memory for receiving the first voice message and the second voice message of the user and transmitting them to the memory, wherein the identity verification operation performed by the computing device includes: by extracting a pending identity data contained in the first voice message from the voice recognition model; and comparing the pending identity data with the registered identity data and confirming that the pending identity data conforms to the registered identity data, and the computing device executes The command confirmation operation includes: using the voice recognition model to extract a pending command data contained in the second voice message; and comparing the pending command data with a plurality of execution command data in the command database to determine the same An execution order data corresponding to the pending order data.

The artificial intelligence voice-activated banking transaction system as described in claim 1, wherein the identity verification operation performed by the computing device further includes: When the computing device retrieves the pending identity data contained in the first voice message through the speech recognition model, it executes: sending an identity reminder message to the user through the automatic teller machine; The device receives an identity confirmation message from the user, and determines whether the identity confirmation message is a positive message or a negative message; if the identity confirmation message is the positive message, the computing device maintains the pending identity data; and if the identity The confirmation message is the negative message, and the computing device sends a request through the automatic teller machine to request the user to provide another first voice message to update the pending identity information.

The artificial intelligence voice-activated banking transaction system as described in claim 1, wherein the instruction confirmation operation performed by the computing device further includes: when the computing device determines that the pending instruction data corresponds to an execution instruction data, execute: through the automatic teller machine Sending an instruction prompt message to the user; receiving an instruction confirmation message from the user from the voice receiving device, and judging whether the instruction confirmation message is an approval message or a non-agreement message; if the instruction confirmation message is the If the confirmation message of the instruction is the non-consent message, the computing device sends a request through the ATM to request the user to provide another second Voice message to update the pending order data.

The artificial intelligence voice-activated banking transaction system as described in claim 1, wherein the instruction confirmation operation performed by the computing device further includes: when the computing device judges an execution instruction data corresponding to the pending instruction data, execute: the execution instruction The data is sent to the ATM to perform related services.

The artificial intelligence voice-activated banking transaction system as described in claim 1 further includes an image capture device connected to the memory for capturing a facial image data of the user, and the memory further stores a A facial feature detection model and a registered facial feature of the user, and the registered facial feature is associated with the registered identity data, wherein the identity verification operation performed by the computing device further includes a facial recognition operation, including : Use the facial feature detection model to extract a pending facial feature contained in the facial image data; compare the pending facial feature with the registered facial feature to determine whether the pending facial feature matches the Registered facial features; if the pending facial features match the registered facial features, the computing device performs other identity verification operations or the command confirmation operation; if the pending facial features do not match the registered facial features, the computing device through The automatic teller machine sends a request to require the user to provide another first voice message to update the pending identity information and the registered identity information.

The artificial intelligence voice-activated banking transaction system as described in claim 5, wherein the memory further stores a lip shape recognition model, and the computing device is further used to use the image capture device The device captures a plurality of lip images of the user to obtain a dynamic lip shape data, and the command confirmation operation performed by the computing device further includes a lip shape recognition operation, including: using the lip shape recognition model to capture the A reference command data included in the dynamic lip shape data; judging whether the pending command data is the same as the reference command data; if the pending command data is the same as the reference command data, retaining the pending command data; and if the pending command data is the same as the reference command data The reference command data is different, and the computing device sends a request through the automatic teller machine to request the user to provide another second voice message to update the pending command data.

The artificial intelligence voice-activated banking transaction system as described in Claim 1, wherein the voice receiving device is a mobile device or an earphone device of the user.

The artificial intelligence voice-activated banking transaction system as described in claim 1 further includes a mobile device of the user connected to the automatic teller machine for receiving an encrypted prompt message and converting the prompt message into a prompt tone for playback to the user.

The artificial intelligence voice-activated banking transaction system as described in claim 1, wherein the memory further stores a first trigger message and a second trigger message, so that when the computing device receives a first trigger message from the user through the voice receiving device When there are three voice messages, judge whether the third voice message matches the first trigger message or the second trigger message, if the third voice message matches the first trigger message, re-execute the identity verification operation, if the third If the voice message matches the second trigger message, re-execute the instruction confirmation operation; otherwise, no additional operation is performed.

The artificial intelligence voice-activated banking transaction system as described in Claim 1, wherein the voice receiving device has a filter for filtering noises in the first voice message and the second voice message.