TWI835304B - Ai voice control banking transaction system - Google Patents

Ai voice control banking transaction system Download PDF

Info

Publication number
TWI835304B
TWI835304B TW111135658A TW111135658A TWI835304B TW I835304 B TWI835304 B TW I835304B TW 111135658 A TW111135658 A TW 111135658A TW 111135658 A TW111135658 A TW 111135658A TW I835304 B TWI835304 B TW I835304B
Authority
TW
Taiwan
Prior art keywords
voice
message
computing device
pending
data
Prior art date
Application number
TW111135658A
Other languages
Chinese (zh)
Other versions
TW202414385A (en
Inventor
白庭楷
Original Assignee
華南商業銀行股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 華南商業銀行股份有限公司 filed Critical 華南商業銀行股份有限公司
Priority to TW111135658A priority Critical patent/TWI835304B/en
Application granted granted Critical
Publication of TWI835304B publication Critical patent/TWI835304B/en
Publication of TW202414385A publication Critical patent/TW202414385A/en

Links

Images

Landscapes

  • Financial Or Insurance-Related Operations Such As Payment And Settlement (AREA)

Abstract

An artificial intelligence voice control banking transaction system includes an automatic teller machine, a computing device, a memory and a voice receiving device. The computing device is used for authentication operation and command confirmation operation, the memory is used for storing two voice messages, as well as the voice recognition model, registered identity data and command database, and the voice receiving device is used for receiving and transmitting the two voice messages to the memory. The authentication operation includes retrieving the pending identity data in the first voice message by using the voice recognition model, and comparing and confirming that the pending identity data is consistent with the registered identity data, and the command confirmation operation includes retrieving a second voice message through the voice recognition model pending command data in the command database, and an execution command data corresponding to the plurality of execution command data in the command database by comparing and judging the pending command data

Description

人工智能聲控銀行交易系統Artificial intelligence voice-controlled banking transaction system

本發明係關於一種銀行交易系統,特別是關於一種人工智能聲控銀行交易系統。 The invention relates to a bank transaction system, and in particular to an artificial intelligence voice-activated bank transaction system.

目前的銀行交易系統,如自動櫃員機(ATM)的交易介面大多是採取按鍵或面板的形式,在某些情況或對於特定族群(如視力受損、行動不便者)而言,造成了一定的不便性。 Current bank transaction systems, such as automated teller machines (ATMs), mostly have transaction interfaces in the form of buttons or panels, which causes certain inconveniences in certain situations or for specific groups (such as those with visual impairment and mobility issues). sex.

在某些系統中,語音功能往往都是輔助性的,需要搭配螢幕顯示以及觸摸點字等,因此對於特定使用者而言,使用輔助性的自動櫃員機往往不如透過手機使用網路銀行來得方便。 In some systems, voice functions are often assistive and need to be paired with screen display and touch braille. Therefore, for certain users, using assistive ATMs is often not as convenient as using online banking through mobile phones.

鑒於上述,本發明提供一種人工智能聲控銀行交易系統。 In view of the above, the present invention provides an artificial intelligence voice-activated bank transaction system.

依據本發明一實施例的人工智能聲控銀行交易系統,包含自動櫃員機、連接於自動櫃員機的運算裝置、連接於運算裝置的記憶體及連接於記憶體的語音接收裝置。自動櫃員機用於讓使用者自助辦理銀行櫃檯服務,運算裝置用於對使用者進行一身分驗證操作及一指令確認操作,記憶體用於儲存一第一及第二語音訊息,並儲存有語音識別模型、使用者的一登記身分資料及一指令資料庫,語音接收裝置用於接收 使用者的第一及第二語音訊息並傳輸至記憶體。所述運算裝置執行的身分驗證操作包含:藉由語音識別模型擷取第一語音訊息中含有的一待定身分資料,以及比對待定身分資料與登記身分資料並確認待定身分資料符合該登記身分資料,且運算裝置執行的該指令確認操作包含:藉由語音識別模型擷取一第二語音訊息中含有的一待定指令資料,以及比對待定指令資料與指令資料庫中的多個執行指令資料,以判斷與所述待定指令資料對應的一執行指令資料。 An artificial intelligence voice-activated banking transaction system according to an embodiment of the present invention includes an automatic teller machine, a computing device connected to the automatic teller machine, a memory connected to the computing device, and a voice receiving device connected to the memory. The automatic teller machine is used to allow users to handle bank counter services by themselves. The computing device is used to perform an identity verification operation and an instruction confirmation operation on the user. The memory is used to store a first and second voice message and also stores voice recognition. model, a registered identity information of the user and a command database, and the voice receiving device is used to receive The user's first and second voice messages are transmitted to the memory. The identity verification operation performed by the computing device includes: retrieving a pending identity data contained in the first voice message through a speech recognition model, and comparing the pending identity data with the registered identity data and confirming that the pending identity data matches the registered identity data. , and the instruction confirmation operation executed by the computing device includes: retrieving a pending instruction data contained in a second voice message through a speech recognition model, and comparing the pending instruction data with multiple execution instruction data in the instruction database, To determine an execution instruction data corresponding to the pending instruction data.

藉由上述結構,本案所揭示的人工智能聲控銀行交易系統,可透過儲存有人工智慧語音識別模型的記憶體搭配運算裝置,對接收的語音訊息進行內容判斷,以安全有效地執行身分驗證以及指令確認的功能。讓使用者只需再自動櫃員機前講述自己的身分或帳戶資料完成驗證後,再口語表達欲執行的銀行服務項目,如此一來一般的銀行業務皆能完全透過聲音控制的方式來完成,對各種族群來說都是便利的措施。 With the above structure, the artificial intelligence voice-activated banking transaction system disclosed in this case can judge the content of the received voice messages through a memory and computing device that stores an artificial intelligence speech recognition model, so as to safely and effectively execute identity verification and instructions. Confirm function. Users only need to state their identity or account information in front of the ATM to complete the verification, and then verbally express the banking services they want to perform. In this way, general banking services can be completed completely through voice control, and various It is a convenient measure for all ethnic groups.

以上之關於本揭露內容之說明及以下之實施方式之說明係用以示範與解釋本發明之精神與原理,並且提供本發明之專利申請範圍更進一步之解釋。 The above description of the present disclosure and the following description of the embodiments are used to demonstrate and explain the spirit and principles of the present invention, and to provide further explanation of the patent application scope of the present invention.

1:人工智能聲控銀行交易系統 1: Artificial intelligence voice-controlled banking transaction system

10:自動櫃員機 10:ATM

20:運算裝置 20:Computing device

30,30’,30”,30''':記憶體 30,30’,30”,30''’: memory

31:語音識別模型 31: Speech recognition model

32:登記身分資料 32:Register identity information

33:指令資料庫 33: Command database

34:臉部特徵偵測模型 34: Facial feature detection model

35:登錄臉部特徵 35: Log in facial features

36:唇形識別模型 36: Lip shape recognition model

37:第一觸發訊息 37: First trigger message

38:第二觸發訊息 38: Second trigger message

40:語音接收裝置 40: Voice receiving device

50:影像擷取裝置 50:Image capture device

C:使用者 C:user

S40:身分驗證操作 S40: Identity verification operation

S400~S402,S500~S503,S600~S604,S700~S703, S400~S402,S500~S503,S600~S604,S700~S703,

圖1a為依據本發明一實施例所繪示的人工智能聲控銀行交易系統的方塊圖。 Figure 1a is a block diagram of an artificial intelligence voice-activated banking transaction system according to an embodiment of the present invention.

圖1b為依據本發明一實施例所繪示的人工智能聲控銀行交易系統的使用情境的方塊圖。 Figure 1b is a block diagram illustrating a usage scenario of the artificial intelligence voice-activated banking transaction system according to an embodiment of the present invention.

圖1c為依據本發明一實施例所繪示的人工智能聲控銀行交易系統的記憶體接收語音訊息的方塊示意圖。 FIG. 1c is a block diagram illustrating the memory of the artificial intelligence voice-activated banking transaction system receiving voice messages according to an embodiment of the present invention.

圖2a係依據本發明一實施例所繪示的人工智能聲控銀行交易系統在執行身分驗證的操作流程圖。 FIG. 2a is an operation flow chart of an artificial intelligence voice-activated banking transaction system performing identity verification according to an embodiment of the present invention.

圖2b係依據本發明一實施例所繪示的人工智能聲控銀行交易系統在執行指令確認的操作流程圖。 Figure 2b is an operation flow chart of an artificial intelligence voice-activated banking transaction system executing instruction confirmation according to an embodiment of the present invention.

圖3係依據本發明一實施例所繪示的人工智能聲控銀行交易系統在執行身分驗證的另一操作流程圖。 FIG. 3 is another operational flow chart of the artificial intelligence voice-activated banking transaction system performing identity verification according to an embodiment of the present invention.

圖4係依據本發明一實施例所繪示的人工智能聲控銀行交易系統在執行操作確認的另一操作流程圖。 FIG. 4 is another operation flow chart of the artificial intelligence voice-activated banking transaction system performing operation confirmation according to an embodiment of the present invention.

圖5a為依據本發明另一實施例所繪示的人工智能聲控銀行交易系統的方塊圖。 Figure 5a is a block diagram of an artificial intelligence voice-activated banking transaction system according to another embodiment of the present invention.

圖5b為依據本發明另一實施例所繪示的人工智能聲控銀行交易系統的使用情境的方塊圖。 Figure 5b is a block diagram illustrating a usage scenario of the artificial intelligence voice-activated banking transaction system according to another embodiment of the present invention.

圖5c為依據本發明另一實施例所繪示的人工智能聲控銀行交易系統的記憶體接收語音訊息及臉部影像資料的方塊示意圖。 FIG. 5c is a block diagram illustrating the memory of the artificial intelligence voice-activated banking transaction system receiving voice messages and facial image data according to another embodiment of the present invention.

圖6係依據本發明另一實施例所繪示的人工智能聲控銀行交易系統在執行身分驗證的臉部識別的一操作流程圖。 FIG. 6 is an operation flow chart of an artificial intelligence voice-activated banking transaction system performing facial recognition for identity verification according to another embodiment of the present invention.

圖7為依據本發明又一實施例所繪示的人工智能聲控銀行交易系統的又一記憶體接收訊息及資料的方塊示意圖。 FIG. 7 is a block diagram illustrating another memory of the artificial intelligence voice-activated banking transaction system receiving messages and data according to another embodiment of the present invention.

圖8為依據本發明又一實施例所繪示的人工智能聲控銀行交易系統在執行指令確認的唇形識別的一操作流程圖。 FIG. 8 is an operation flow chart of an artificial intelligence voice-activated banking transaction system executing lip recognition for instruction confirmation according to yet another embodiment of the present invention.

圖9為依據本發明其他實施例所繪示的人工智能聲控銀行交易系統的使用情境的方塊圖。 FIG. 9 is a block diagram illustrating the usage scenario of the artificial intelligence voice-activated banking transaction system according to other embodiments of the present invention.

圖10為依據本發明其他實施例所繪示的人工智能聲控銀行交易系統的其他記憶體接收訊息及資料的方塊示意圖。 FIG. 10 is a block diagram illustrating other memories of the artificial intelligence voice-activated banking transaction system receiving messages and data according to other embodiments of the present invention.

圖11為依據本發明其他實施例所繪示的人工智能聲控銀行交易系統在受觸發的情況下的方塊流程圖。 FIG. 11 is a block flow chart of the artificial intelligence voice-activated banking transaction system when triggered according to other embodiments of the present invention.

以下在實施方式中詳細敘述本發明之詳細特徵以及優點,其內容足以使任何熟習相關技藝者了解本發明之技術內容並據以實施,且根據本說明書所揭露之內容、申請專利範圍及圖式,任何熟習相關技藝者可輕易地理解本發明相關之目的及優點。以下之實施例係進一步詳細說明本發明之觀點,但非以任何觀點限制本發明之範疇。 The detailed features and advantages of the present invention are described in detail below in the implementation mode. The content is sufficient to enable anyone skilled in the relevant art to understand the technical content of the present invention and implement it according to the content disclosed in this specification, the patent scope and the drawings. , anyone familiar with the relevant art can easily understand the relevant objectives and advantages of the present invention. The following examples further illustrate the aspects of the present invention in detail, but do not limit the scope of the present invention in any way.

請參考圖1a至圖1c,圖1a為依據本發明一實施例所繪示的人工智能聲控銀行交易系統的方塊圖,圖1b為依據本發明一實施例所繪示的人工智能聲控銀行交易系統的使用情境的方塊圖,圖1c為依據本發明一實施例所繪示的人工智能聲控銀行交易系統的記憶體接收語音訊息的方塊示意圖。 Please refer to Figures 1a to 1c. Figure 1a is a block diagram of an artificial intelligence voice-activated banking transaction system according to an embodiment of the present invention. Figure 1b is a block diagram of an artificial intelligence voice-activated banking transaction system according to an embodiment of the present invention. A block diagram of a usage scenario, FIG. 1c is a block diagram of the memory of the artificial intelligence voice-activated banking transaction system according to an embodiment of the present invention receiving voice messages.

如圖1a及圖1b所示,人工智能聲控銀行交易系統1包含自動櫃員機10、訊號連接於自動櫃員機10的運算裝置20、訊號連接於運算裝置20的記憶體30,及訊號連接於記憶體30的語音接收裝置40。自動櫃 員機10是用於讓使用者C自助辦理銀行櫃檯服務,例如但不以此為限的,可為一般大眾使用的自動櫃員機(ATM)。運算裝置20用於對使用者C進行一身分驗證操作及一指令確認操作,也就是對使用者C進行身分確認並判斷使用者欲下達的指令,其中運算裝置20可為各種具有運算能力的處理器。記憶體30用於儲存一第一及第二語音訊息,並儲存有語音識別模型31、使用者C的一登記身分資料32及一指令資料庫33,其中記憶體30可為各種具有儲存資料之能力的儲存裝置。語音接收裝置40用於接收使用者C的第一及第二語音訊息並傳輸至記憶體30,其中語音接收裝置40可為一麥克風,且可鄰近設置於自動櫃員機10以接收來自使用者C的語音。 As shown in Figures 1a and 1b, the artificial intelligence voice-activated banking transaction system 1 includes an automatic teller machine 10, a computing device 20 connected with signals to the ATM 10, a memory 30 connected with signals to the computing device 20, and a memory 30 connected with signals Voice receiving device 40. Automatic cabinet The employee machine 10 is used to allow the user C to handle bank counter services by himself, such as, but not limited to, an automated teller machine (ATM) used by the general public. The computing device 20 is used to perform an identity verification operation and an instruction confirmation operation on the user C, that is, to verify the identity of the user C and determine the instructions that the user wants to issue. The computing device 20 can be a variety of processes with computing capabilities. device. The memory 30 is used to store a first and a second voice message, and stores a voice recognition model 31, a registered identity information 32 of the user C, and an instruction database 33. The memory 30 can be a variety of devices with the ability to store data. Capacity storage device. The voice receiving device 40 is used to receive the first and second voice messages of the user C and transmit them to the memory 30. The voice receiving device 40 can be a microphone and can be disposed adjacent to the automatic teller machine 10 to receive the voice messages from the user C. Voice.

請一同參考圖1c,語音接收裝置40將第一語音訊息及第二語音訊息以訊號傳輸的方式傳遞至記憶體30中。上述兩語音訊息都會被運算裝置20透過語音識別模型31進行分析判斷,且第一語音訊息及第二語音訊息分別對應至登記身分資料32及指令資料庫33,此部分於後續描述。需要注意的是,圖1a至圖1c的不同方塊之間的連線可為實線或虛線,具體來說,不同方塊之間的訊號連皆可透過實體訊號線連接或透過無線通訊方式連接,本案不予限制。另一方面,如具本案通常知識者能理解的,上述除了圖1b的語音接收裝置40需鄰近設置於自動櫃員機10以滿足物理上聲波傳遞的有利條件外,其餘方塊可遠端設置於其他位置,例如運算裝置20可為中央處理器,記憶體30可為雲端資料庫等。當然,在一些實施例中,運算裝置20及記憶體30可鄰近設置於自動櫃員機10附 近,以達到高效率的邊緣運算及防止資料攔截等的功效,此部分將於後續進行描述且不應成為本案限制條件。 Please refer to FIG. 1c together. The voice receiving device 40 transmits the first voice message and the second voice message to the memory 30 in the form of signal transmission. The above two voice messages will be analyzed and judged by the computing device 20 through the speech recognition model 31, and the first voice message and the second voice message respectively correspond to the registered identity data 32 and the command database 33, which will be described later. It should be noted that the connections between different blocks in Figures 1a to 1c can be solid lines or dotted lines. Specifically, the signal connections between different blocks can be connected through physical signal lines or through wireless communication. There is no restriction in this case. On the other hand, as can be understood by those with ordinary knowledge of the case, except for the voice receiving device 40 in FIG. 1 b which needs to be disposed adjacent to the ATM 10 to meet the favorable conditions for physical sound wave transmission, the other blocks can be disposed remotely in other locations. , for example, the computing device 20 can be a central processing unit, and the memory 30 can be a cloud database, etc. Of course, in some embodiments, the computing device 20 and the memory 30 may be disposed adjacent to the ATM 10 Recently, in order to achieve efficient edge computing and prevent data interception, this part will be described later and should not be a restriction in this case.

關於本例的語音識別模型31,可為一種預先透過深度學習方法訓練的神經網路,具體來說,語音識別模型31可包含自動語音識別(Automatic Speech Recognition,ASR)軟體,使得理論上,運算裝置20可透過自動語音識別軟體進行以下步驟來完成本案的語音識別功能。一、使用者向語音接收裝置發出聲音訊息。二、將聲音訊息轉為聲波訊號。三、語音接收裝置中的濾波器濾除聲波訊號中的雜訊。四、將濾波後的聲波訊號分解為多組音素(Phonemes)(所謂音素,指組成語言的聲音的基本聲音塊,以英語來說,具有44個音素如「wh」、「th」及「t」等,而中文系統則較缺乏統一標準而沒有定數)。五、每個音素可組成一列表(list),且可依序在統計上被分析。六、可透過自動語音識別軟體理解一段話的語意。 The speech recognition model 31 in this example can be a neural network pre-trained by a deep learning method. Specifically, the speech recognition model 31 can include automatic speech recognition (Automatic Speech Recognition, ASR) software, so that in theory, the calculation The device 20 can perform the following steps through automatic speech recognition software to complete the speech recognition function of this case. 1. The user sends a voice message to the voice receiving device. 2. Convert the sound message into a sound wave signal. 3. The filter in the voice receiving device filters out the noise in the acoustic signal. 4. Decompose the filtered sound wave signal into multiple groups of phonemes (phonemes refer to the basic sound blocks that make up the sounds of language. In English, there are 44 phonemes such as "wh", "th" and "t". ", etc., while the Chinese system lacks unified standards and has no fixed number). 5. Each phoneme can form a list and can be analyzed statistically in sequence. 6. The semantic meaning of a passage can be understood through automatic speech recognition software.

建立在上述自動語音識別的語音識別模型31可更具有兩種主要的變化形態,分別為直接對話(Directed Dialogue)模型及自然語言處理(Natural Language Processing,NLP)模型。在本文中,將主要以直接對話模型進行舉例,但是在其他實施例中,也可以使用自然語言處理模型,本案不限於此。所謂直接對話模型,係指可直接提供使用者一定範圍的特定選擇詞彙,使得機器在判斷語意上較使用自然語言處理模型容易。 The speech recognition model 31 based on the above-mentioned automatic speech recognition can further have two main variations, namely a direct dialogue (Directed Dialogue) model and a natural language processing (NLP) model. In this article, the direct dialogue model will be mainly used as an example, but in other embodiments, a natural language processing model can also be used, and this case is not limited thereto. The so-called direct dialogue model refers to a specific vocabulary selection that can directly provide the user with a certain range, making it easier for the machine to determine semantic meaning than using a natural language processing model.

基於上述圖1a至圖1c的人工智能聲控銀行交易系統1,請參考圖2a及圖2b,圖2a係依據本發明一實施例所繪示的人工智能聲控銀 行交易系統在執行身分驗證的操作流程圖,圖2b係依據本發明一實施例所繪示的人工智能聲控銀行交易系統在執行指令確認的操作流程圖。 Based on the above-mentioned artificial intelligence voice-activated banking transaction system 1 shown in Figures 1a to 1c, please refer to Figures 2a and 2b. Figure 2a illustrates an artificial intelligence voice-activated banking transaction system according to an embodiment of the present invention. Figure 2b is an operation flow chart of an artificial intelligence voice-activated bank transaction system executing instruction confirmation according to an embodiment of the present invention.

如圖2a所示,運算裝置執行的身分驗證操作S40包含,步驟S400:藉由語音識別模型擷取第一語音訊息中含有的一待定身分資料、步驟S401:比對待定身分資料與登記身分資料並確認待定身分資料符合登記身分資料,以及步驟S402:當判斷待定身分資料符合登記身分資料,運算裝置可對使用者進行指令確認操作。舉例來說,在步驟S400中,自動櫃員機可先向使用者發出訊息:「請講述使用者帳號及密碼。」,使用者便向語音接收裝置講述舉例由英文與數字組成的帳號及密碼(即第一語音訊息),語音接收裝置在將聲波訊號傳輸至記憶體中,供運算裝置分析。在此情況下,運算裝置可主動採用包含英文與數字的音素的模型對聲波訊號進行分析,以得到一串輸入帳號及輸入密碼(即待定身分資料)。當然在其他實施例中,使用者未必要講述帳號及密碼,也可以是使用者姓名、身分證字號或生日等資訊,實現方式同上且本案不限於此。舉例來說,在步驟S401中,運算裝置會將該輸入帳號及輸入密碼與預先儲存的登記帳號及登記密碼(即登記身分資料)進行比對。進一步,運算裝置會先比對帳號以確定輸入帳號的對應密碼,再將輸入密碼與預先儲存的登記密碼進行比對,兩者都符合才算通過身分驗證操作。當然此步驟也可對使用者的其他資訊如姓名、生日等進行比對,在此不贅述。 As shown in Figure 2a, the identity verification operation S40 performed by the computing device includes: Step S400: Retrieving a pending identity data contained in the first voice message through a speech recognition model; Step S401: Comparing the pending identity data with the registered identity data And confirm that the pending identity information matches the registered identity information, and step S402: when it is determined that the pending identity information matches the registered identity information, the computing device can perform an instruction confirmation operation on the user. For example, in step S400, the ATM may first send a message to the user: "Please tell the user account and password.", and the user then tells the voice receiving device the account number and password composed of English and numbers (i.e. First voice message), the voice receiving device transmits the acoustic signal to the memory for analysis by the computing device. In this case, the computing device can actively analyze the acoustic signal using a model including phonemes of English and numbers to obtain a string of input account numbers and input passwords (ie, pending identity information). Of course, in other embodiments, the user does not have to provide the account number and password, and it can also be the user's name, ID card number, birthday and other information. The implementation method is the same as above and this case is not limited thereto. For example, in step S401, the computing device compares the input account number and input password with a pre-stored registration account number and registration password (ie, registration identity information). Furthermore, the computing device will first compare the account to determine the corresponding password of the entered account, and then compare the entered password with the pre-stored registration password. If both are consistent, the identity verification operation is passed. Of course, this step can also compare other user information such as name, birthday, etc., which will not be described here.

如圖2b所示,運算裝置執行的指令確認操作S50包含,步驟S500:運算裝置藉由語音識別模型擷取第二語音訊息中含有的一待定指令資料、步驟S501:比對待定指令資料與指令資料庫中的多個執行指 令資料、步驟S502:判斷待定指令資料對應的一執行指令資料,以及步驟S503:將該執行指令資料傳送至該自動櫃員機以執行相關服務。舉例來說,在步驟S500中,自動櫃員機可先向使用者發出訊息:「請講述您想進行的業務,包含存款、提款、轉帳......」,其中業務選項可被包含在圖1c的指令資料庫33中,使用者便向語音接收裝置講述欲執行的業務內容(即第二語音訊息),語音接收裝置在將聲波訊號傳輸至記憶體中,供運算裝置分析。在此情況下,運算裝置可主動採用包含相關業務內容的音素的模型對聲波訊號進行分析,以得到相關的業務選項(即待定指令資料)。舉例來說,在步驟S501中,運算裝置會將待定指令資料(如「轉帳」)與預先儲存在指令資料庫中的執行指令資料(如「提款」、「轉帳」、「存款」等)進行比對,使得在步驟S502中能判斷出一執行指令資料(以本例來說,為「轉帳」),並將該執行指令資料傳送至自動櫃員機中以執行相關服務。 As shown in Figure 2b, the instruction confirmation operation S50 performed by the computing device includes: Step S500: The computing device retrieves a pending instruction data contained in the second voice message through a speech recognition model; Step S501: Compares the pending instruction data and instructions. Multiple execution instructions in the database order data, step S502: determine an execution instruction data corresponding to the pending instruction data, and step S503: transmit the execution instruction data to the ATM to execute related services. For example, in step S500, the ATM can first send a message to the user: "Please describe the business you want to perform, including deposits, withdrawals, transfers...", where the business options can be included in In the command database 33 of Figure 1c, the user tells the voice receiving device the business content to be executed (ie, the second voice message), and the voice receiving device transmits the acoustic wave signal to the memory for analysis by the computing device. In this case, the computing device can actively analyze the acoustic signal using a model containing phonemes of relevant service content to obtain relevant service options (ie, pending instruction data). For example, in step S501, the computing device combines the pending instruction data (such as "transfer") and the execution instruction data (such as "withdrawal", "transfer", "deposit", etc.) pre-stored in the instruction database. Comparison is performed so that an execution instruction data (in this example, "transfer") can be determined in step S502, and the execution instruction data is sent to the ATM to perform related services.

具體來說,使用者可講述「轉帳」的服務項目,而運算裝置可將執行指令資料判斷為「轉帳」並讓自動櫃員機提供服務。而在其他實施例中,記憶體可更包含自然語言處理模型,用於讓運算裝置在特定情形下斟酌使用。舉例來說,當使用者講述的業務項目為「我要領錢」時,運算裝置透過自然語言模型可先抓出關鍵字為「錢」而降低較不相關的「我要」的權重,進一步理解「領錢」同義於指令資料庫中的「提款」指令,以避免「我要領錢」無法產生對應至「提款」指令之情形。 Specifically, the user can describe the service item of "transfer", and the computing device can determine the execution instruction data as "transfer" and allow the ATM to provide the service. In other embodiments, the memory may further include a natural language processing model for allowing the computing device to consider usage in specific situations. For example, when the business item described by the user is "I want to get money", the computing device can first capture the keyword "money" through the natural language model and reduce the weight of the less relevant "I want" to further understand "Get money" is synonymous with the "withdraw" command in the command database to avoid the situation where "I want to get money" cannot be mapped to the "withdraw" command.

需要注意的是,步驟S503所述的執行相關服務在本例中可理解為執行與轉帳相關的服務,包括透過自動櫃員機向使用者索取轉 帳帳號及金額等,也就是說,使用者會再次向語音接收裝置發出另一第二語音訊息,讓運算裝置擷取分析出進一步的資料。此外,步驟S503為可選擇性被設置的,例如當交易結束時,運算裝置可能直接向雲端伺服器發送交易紀錄而不將執行指令資料傳送至自動櫃員機。 It should be noted that the execution of related services described in step S503 in this example can be understood as the execution of transfer-related services, including requesting the transfer from the user through the ATM. Account number and amount, etc., that is to say, the user will send another second voice message to the voice receiving device again, allowing the computing device to retrieve and analyze further information. In addition, step S503 is optionally set. For example, when the transaction is completed, the computing device may directly send the transaction record to the cloud server without sending the execution instruction data to the ATM.

請參考圖3,圖3係依據本發明一實施例所繪示的人工智能聲控銀行交易系統在執行身分驗證的另一操作流程圖。如圖3所示,在身分驗證的操作(S60)中,當運算裝置藉由語音識別模型擷取第一語音訊息中含有的一待定身分資料(步驟S600)時,運算裝置透過自動櫃員機向使用者發送一身分提示訊息,且透過從語音接收裝置接收來自使用者的一身分確認訊息(步驟S601)。舉例來說,當使用者講述帳號為「abc123」時,自動櫃員機可回應「abc123」的身分提示訊息,讓使用者確認本案系統所接收的語音訊息是否正確而回應一「是」或「否」(即身分確認訊息)。接著在步驟S602:運算裝置判斷身分確認訊息為一肯定訊息或一否定訊息。舉例來說,當自動櫃員機回應「abc723」的身分提示訊息,使用者可回應「否」(即否定訊息),使得系統必須重新執行上述身分驗證操作,以獲得另一第一語音訊息。所謂重新執行身分驗證操作,表示使用者需重新講述帶有身分資料的第一語音訊息。具體來說,當運算裝置判斷身分確認訊息為否定訊息時,會透過自動櫃員機發出請求以要求該使用者提供另一第一語音訊息以更新該待定身分資料,之後再回到步驟S600。另一方面,若身分確認訊息為一肯定訊息,則運算裝置維持待定身分資料並執行步驟S603:運算裝置比對待定身分資料與登記身分資料以判斷待定身分資料通過身分驗證操作,以及步驟S604:運 算裝置對使用者進行指令確認操作。此二步驟與上述的步驟S401及步驟S402相同,在此不贅述。特別來說,當運算裝置判斷待定身分資料與登記身分資料不相符而判斷待定身分資料未通過身分驗證時,運算裝置可則等待新的第一語音訊息輸入或透過ATM要求使用者輸入新的第一語音訊息。 Please refer to FIG. 3 , which is another operational flow chart of the artificial intelligence voice-activated banking transaction system performing identity verification according to an embodiment of the present invention. As shown in Figure 3, in the identity verification operation (S60), when the computing device retrieves a pending identity information contained in the first voice message through the speech recognition model (step S600), the computing device uses the automatic teller machine to The user sends an identity prompt message and receives an identity confirmation message from the user through the voice receiving device (step S601). For example, when the user reports that the account number is "abc123", the ATM can respond with the identity prompt message of "abc123", allowing the user to confirm whether the voice message received by the system is correct and respond with a "yes" or "no" (i.e. identity confirmation message). Then in step S602: the computing device determines whether the identity confirmation message is a positive message or a negative message. For example, when the ATM responds to the identity prompt message "abc723", the user can respond "no" (ie, a negative message), so that the system must re-perform the above identity verification operation to obtain another first voice message. The so-called re-executing the identity verification operation means that the user needs to re-narrate the first voice message containing the identity information. Specifically, when the computing device determines that the identity confirmation message is a negative message, it will send a request through the ATM to require the user to provide another first voice message to update the pending identity information, and then return to step S600. On the other hand, if the identity confirmation message is a positive message, the computing device maintains the pending identity data and executes step S603: the computing device compares the pending identity data and the registered identity data to determine that the pending identity data passes the identity verification operation, and step S604: Luck The computing device performs instruction confirmation operations on the user. These two steps are the same as the above-mentioned steps S401 and S402, and will not be described again here. Specifically, when the computing device determines that the pending identity information does not match the registered identity information and determines that the pending identity information has not passed identity verification, the computing device may wait for a new first voice message to be input or require the user to enter a new third voice message through the ATM. A voice message.

請參考圖4,圖4係依據本發明一實施例所繪示的人工智能聲控銀行交易系統在執行操作確認的另一操作流程圖。如圖4所示,在指令確認的操作(S70)中,當運算裝置判斷待定指令資料對應的一執行指令資料(步驟S700)時,運算裝置可透過自動櫃員機向使用者發送一指令提示訊息,且透過從語音接收裝置接收來自使用者的一指令確認訊息(步驟S701)。舉例來說,當使用者說出「我要領兩千元」時,自動櫃員機可發出「請問您要提款嗎?」的指令提示訊息,使用者再回復「是」的指令確認訊息。關於如何從「領兩千元」對應到「提款」的方法,可參照上述直接對話模型或自然語言模型的描述,在此不贅述。接著於步驟S702,運算裝置判斷指令確認訊息為一肯定訊息或一否定訊息,若為肯定訊息則運算裝置可將執行指令資料傳送至自動櫃員機以執行相關服務(步驟S703)。或者,當指令確認訊息為否定訊息時,可重新執行指令確認操作,以獲得另一執行指令資料。也就是說,當運算裝置判斷指令確認訊息為非同意訊息時,運算裝置可透過自動櫃員機發出請求以要求該使用者提供另一第二語音訊息以更新該待定指令資料。例如:「請問您要提款嗎?」(指令提示訊息)、「不是」(指令確認訊息)、「請問您 要進行何種業務?」(重新要求第二語音訊息)、「轉帳」(更新待定指令資料)、「請問您要轉帳嗎?」(指令提示訊息)、「是」(指令確認訊息)。 Please refer to FIG. 4 , which is another operation flow chart of the artificial intelligence voice-activated banking transaction system performing operation confirmation according to an embodiment of the present invention. As shown in Figure 4, in the instruction confirmation operation (S70), when the computing device determines an execution instruction data corresponding to the pending instruction data (step S700), the computing device can send an instruction prompt message to the user through the ATM, And by receiving a command confirmation message from the user from the voice receiving device (step S701). For example, when the user says "I want to withdraw two thousand dollars", the ATM can send out a command prompt message "Do you want to withdraw money?", and the user then replies with a "Yes" command confirmation message. Regarding how to correspond from "receiving two thousand yuan" to "withdrawing money", you can refer to the description of the direct conversation model or natural language model mentioned above, and will not be described in detail here. Next, in step S702, the computing device determines whether the instruction confirmation message is a positive message or a negative message. If it is a positive message, the computing device can send the execution instruction data to the ATM to perform related services (step S703). Alternatively, when the instruction confirmation message is a negative message, the instruction confirmation operation can be re-executed to obtain another execution instruction data. That is to say, when the computing device determines that the instruction confirmation message is a non-agreed message, the computing device can send a request through the ATM to require the user to provide another second voice message to update the pending instruction data. For example: "Do you want to withdraw money?" (instruction prompt message), "No" (instruction confirmation message), "Excuse me What kind of business is going on? "(Request the second voice message), "Transfer" (update the pending instruction data), "Do you want to transfer money?" (Instruction prompt message), "Yes" (Instruction confirmation message).

本案系統的變化實施例的身分驗證可進一步涵蓋影像辨識功能。請參照圖5a至圖5c,圖5a為依據本發明另一實施例所繪示的人工智能聲控銀行交易系統的方塊圖,圖5b為依據本發明另一實施例所繪示的人工智能聲控銀行交易系統的使用情境的方塊圖,圖5c為依據本發明另一實施例所繪示的人工智能聲控銀行交易系統的記憶體接收語音訊息及臉部影像資料的方塊示意圖。 The identity verification of the modified embodiment of the system in this case can further include the image recognition function. Please refer to Figures 5a to 5c. Figure 5a is a block diagram of an artificial intelligence voice-controlled banking transaction system according to another embodiment of the present invention. Figure 5b is a block diagram of an artificial intelligence voice-activated banking system according to another embodiment of the present invention. A block diagram of the usage scenario of the transaction system. Figure 5c is a block diagram of the memory of the artificial intelligence voice-activated bank transaction system receiving voice messages and facial image data according to another embodiment of the present invention.

如圖5a至圖5c所示,人工智能聲控銀行交易系統1’除了包含自動櫃員機10、運算裝置20、記憶體30’、語音接收裝置40以外,更包含影像擷取裝置50,連接至記憶體30’。如圖5b所示,影像擷取裝置50可例如為安裝在自動櫃員機10上的一攝像機,用於擷取使用者C的一臉部影像資料。如圖5c所示,記憶體30’更事先儲存有一臉部特徵偵測模型34及使用者的一登錄臉部特徵35。 As shown in Figures 5a to 5c, the artificial intelligence voice-activated banking transaction system 1' not only includes an automatic teller machine 10, a computing device 20, a memory 30', and a voice receiving device 40, but also includes an image capture device 50 connected to the memory. 30'. As shown in FIG. 5b , the image capture device 50 may be, for example, a camera installed on the ATM 10 for capturing facial image data of the user C. As shown in Figure 5c, the memory 30' further stores a facial feature detection model 34 and a registered facial feature 35 of the user in advance.

請參照圖6,圖6係依據本發明另一實施例所繪示的人工智能聲控銀行交易系統在執行身分驗證的臉部識別的一操作流程圖。如圖6所示,運算裝置執行的身分驗證操作更包含一臉部識別操作,包含步驟S900:藉由該臉部特徵偵測模型擷取該臉部影像資料中含有的一待定臉部特徵、步驟S901:比對該待定臉部特徵與該登錄臉部特徵、步驟S902:判斷該待定臉部特徵是否符合該登錄臉部特徵,若該待定臉部特徵符合該登錄臉部特徵,則進行步驟S903:運算裝置執行其他身分驗證 操作或該指令確認操作,而若待定臉部特徵不符合該登錄臉部特徵,則運算裝置重新執行該身分驗證操作。 Please refer to FIG. 6 , which is an operation flow chart of an artificial intelligence voice-activated banking transaction system performing facial recognition for identity verification according to another embodiment of the present invention. As shown in Figure 6, the identity verification operation performed by the computing device further includes a facial recognition operation, including step S900: capturing a pending facial feature contained in the facial image data through the facial feature detection model, Step S901: Compare the pending facial features with the logged-in facial features. Step S902: Determine whether the pending facial features match the logged-in facial features. If the pending facial features match the logged-in facial features, proceed to steps S903: The computing device performs additional authentication The operation or the instruction confirms the operation, and if the pending facial features do not match the logged-in facial features, the computing device re-executes the identity verification operation.

舉例來說,影像擷取裝置在拍攝使用者的臉部影像後,可將影像資料傳輸至記憶體中,運算裝置可藉由臉部特徵偵測模型擷取該影像資料的特徵,其中,臉部特徵偵測模型可為具有深度學習能力的神經網路(如VGG-Face、Google FaceNet、OpenFace等)。以Google FaceNet為例,其並非一開始就輸出分類結果,而是先輸出量化特徵值,此運算可藉此將臉部特徵如輪廓擷取出來成為一待定臉部特徵,再於特徵向量空間中對不同臉部照片進行比較,以判別不同臉孔的相似度,換言之,在特徵向量空間中距離越近的兩點表示臉孔相似度越高。另外,記憶體儲存的登錄臉部特徵是關聯於上述登記身分資料,具體來說,當使用者先進行如圖2a步驟S401所示的比對身分資料後,該身分資料會對應至一登陸臉部特徵以用於比對。在臉部特徵的比對判斷上,可透過擷取使用者的多張影像來增加識別準確率,在此不贅述。需要注意的是,當運算裝置判斷待定臉部特徵不符合登錄臉部特徵時,表示使用者的身分資料可能有誤,需要重新執行身分驗證操作,也因此會再度進行上述臉部識別操作,然而圖6並沒有示出所有身分驗證操作,故步驟S902的否定箭頭以指向步驟S900表示,本案不限於此。 For example, after taking an image of the user's face, the image capture device can transfer the image data to the memory, and the computing device can capture the characteristics of the image data through a facial feature detection model, where the face The partial feature detection model can be a neural network with deep learning capabilities (such as VGG-Face, Google FaceNet, OpenFace, etc.). Taking Google FaceNet as an example, it does not output classification results at the beginning, but first outputs quantified feature values. This operation can extract facial features such as contours into a to-be-determined facial feature, and then use it in the feature vector space. Different facial photos are compared to determine the similarity of different faces. In other words, the closer the two points are in the feature vector space, the higher the similarity of the faces. In addition, the login facial features stored in the memory are associated with the above-mentioned registered identity information. Specifically, when the user first performs the comparison of identity information as shown in step S401 in Figure 2a, the identity information will correspond to a login face. features for comparison. In terms of comparison and judgment of facial features, the recognition accuracy can be increased by capturing multiple images of the user, which will not be described in detail here. It should be noted that when the computing device determines that the undetermined facial features do not match the logged-in facial features, it means that the user's identity information may be incorrect and the identity verification operation needs to be performed again. Therefore, the above-mentioned facial recognition operation will be performed again. However, Figure 6 does not show all the identity verification operations, so the negative arrow in step S902 is shown pointing to step S900. This case is not limited thereto.

請參照圖7,圖7為依據本發明又一實施例所繪示的人工智能聲控銀行交易系統的又一記憶體接收訊息及資料的方塊示意圖。如圖7所示,記憶體30”更事先儲存有一唇形識別模型36(如LipNet),而運算裝置更用於透過影像擷取裝置50擷取使用者的的多個唇部影像以獲 得一動態唇形資料,且運算裝置20執行的該指令確認操作更包含一唇形識別操作。 Please refer to FIG. 7 . FIG. 7 is a block diagram illustrating another memory of the artificial intelligence voice-activated banking transaction system receiving messages and data according to another embodiment of the present invention. As shown in Figure 7, the memory 30" further stores a lip recognition model 36 (such as LipNet) in advance, and the computing device is further used to capture multiple lip images of the user through the image capture device 50 to obtain A dynamic lip shape data is obtained, and the instruction confirmation operation performed by the computing device 20 further includes a lip shape recognition operation.

請參照圖8,圖8為依據本發明又一實施例所繪示的人工智能聲控銀行交易系統在執行指令確認的唇形識別的一操作流程圖。如圖8所示,運算裝置執行的唇形識別操作S110包含步驟S1100:藉由唇形識別模型擷取動態唇形資料所包含的一參照指令資料、步驟S1101:判斷待定指令資料與參照指令資料是否相同,以及步驟S1102:保留待定指令資料。在步驟S1101中,若判斷待定指令資料與參照指令資料相同,則進行步驟S1102,若判斷待定指令資料與參照指令資料不同,表示語音識別模型得到的待定指令資料與唇形識別模型得到的參照指令資料不同,可重新進行上述指令確認操作。需要注意的是,本圖未示出所有指令確認操作,因此將表示「重新進行指令確認操作」的否定箭頭指向步驟S1100(意味著當重新進行指令確認操作,也可重新進行唇形識別操作),本案不限於此。具體來說,唇形識別模型的原理類似於上述語音識別模型,可透過將口語分解為對應至不同發音的口形元素,並儲存為一序列供運算裝置進行分析,在此不詳述。需要注意的是,用於前述實施例的影像擷取裝置可僅具有拍攝影像功能,但在本實施例中,影像擷取裝置需具有錄製連續影片或拍攝高速連續影像的功能,以達成唇形識別。 Please refer to FIG. 8 , which is an operation flow chart of an artificial intelligence voice-activated banking transaction system executing lip recognition for instruction confirmation according to yet another embodiment of the present invention. As shown in Figure 8, the lip recognition operation S110 performed by the computing device includes step S1100: acquiring a reference command data included in the dynamic lip data through the lip recognition model, step S1101: judging the pending command data and the reference command data. Whether they are the same, and step S1102: retain the pending instruction data. In step S1101, if it is determined that the pending instruction data is the same as the reference instruction data, then proceed to step S1102. If it is determined that the pending instruction data is different from the reference instruction data, it means that the pending instruction data obtained by the speech recognition model and the reference instruction obtained by the lip recognition model If the data is different, the above command confirmation operation can be performed again. It should be noted that this figure does not show all the command confirmation operations, so the negative arrow indicating "re-perform the command confirmation operation" points to step S1100 (meaning that when the command confirmation operation is re-performed, the lip recognition operation can also be re-performed) , this case is not limited to this. Specifically, the principle of the lip shape recognition model is similar to the above-mentioned speech recognition model. It can decompose spoken language into mouth shape elements corresponding to different pronunciations, and store them as a sequence for analysis by the computing device, which will not be described in detail here. It should be noted that the image capture device used in the foregoing embodiments may only have the function of capturing images, but in this embodiment, the image capture device needs to have the function of recording continuous videos or capturing high-speed continuous images to achieve lip shape. Identify.

請參照圖9,圖9為依據本發明其他實施例所繪示的人工智能聲控銀行交易系統的使用情境的方塊圖。如圖9所示,語音接收裝置40可為使用者C的一行動裝置(如智慧型手機),且語音接收裝置40可更 連接於運算裝置20。在本例中,使用者C的手機可與運算裝置20及記憶體30產生連線,具體來說,手機可透過與自動櫃員機10連線,再連線至運算裝置20及記憶體30,本案不限於此。使用者C可向手機講述個人資料及欲執行的銀行業務,使得周遭他人不易聽見隱私資料。同理,系統可透過手機向使用者C發出提示訊息,也避免被周遭人聽見。此外,手機更可用於接收來自系統的經過加密的一提示訊息並將該提示訊息轉為一提示音後播放給該使用者,增加資訊安全性。 Please refer to FIG. 9 , which is a block diagram of a usage scenario of an artificial intelligence voice-activated banking transaction system according to other embodiments of the present invention. As shown in FIG. 9 , the voice receiving device 40 can be a mobile device (such as a smart phone) of user C, and the voice receiving device 40 can be changed. Connected to the computing device 20. In this example, user C's mobile phone can be connected to the computing device 20 and the memory 30. Specifically, the mobile phone can be connected to the ATM 10 and then connected to the computing device 20 and the memory 30. In this case Not limited to this. User C can tell his personal information and the banking business he wants to perform to his mobile phone, making it difficult for others around him to hear the private information. In the same way, the system can send a prompt message to user C through the mobile phone without being heard by the people around him. In addition, the mobile phone can also be used to receive an encrypted prompt message from the system and convert the prompt message into a prompt tone and then play it to the user, thereby increasing information security.

請參照圖10,圖10為依據本發明其他實施例所繪示的人工智能聲控銀行交易系統的其他記憶體接收訊息及資料的方塊示意圖。如圖10所示,記憶體30'''更事先儲存有一第一觸發訊息37及一第二觸發訊息38,且更用於接收來自語音接收裝置的一第三語音訊息。 Please refer to FIG. 10 , which is a schematic block diagram of other memories of the artificial intelligence voice-activated banking transaction system receiving messages and data according to other embodiments of the present invention. As shown in FIG. 10 , the memory 30'' further stores a first trigger message 37 and a second trigger message 38 in advance, and is further used to receive a third voice message from the voice receiving device.

請結合圖10參照圖11,圖11為依據本發明其他實施例所繪示的人工智能聲控銀行交易系統在受觸發的情況下的方塊流程圖。如圖11所示,當運算裝置透過語音接收裝置接收來自使用者的一第三語音訊息時,可判斷第三語音訊息是否符合第一觸發訊息或第二觸發訊息,若第三語音訊息符合第一觸發訊息,則重新執行上述身分驗證操作,若第三語音訊息符合第二觸發訊息,則重新執行指令確認操作,否則,不執行額外操作。具體來說,若一個使用者完成身分驗證訊息後,發現他/她是想對另一帳戶進行操作,便可對語音接收裝置發出符合第一觸發訊息的一第三語音訊息(如「重新驗證」)。類似的,若一個使用者說出要轉帳後,發現他/她其實是想提款,便可對語音接收裝置發出符合第二觸發訊息的一第三語音訊息(如「重新交易」)。當然,只要在過程中使用 者避談上述第一觸發訊息或第二觸發訊息,系統便不會受到第三語音訊息之觸發而中斷交易。 Please refer to FIG. 11 in conjunction with FIG. 10 . FIG. 11 is a block flow chart of an artificial intelligence voice-activated banking transaction system when triggered according to other embodiments of the present invention. As shown in Figure 11, when the computing device receives a third voice message from the user through the voice receiving device, it can determine whether the third voice message matches the first trigger message or the second trigger message. If the third voice message matches the first trigger message, Once the message is triggered, the above identity verification operation is re-executed. If the third voice message matches the second trigger message, the command confirmation operation is re-executed. Otherwise, no additional operation is performed. Specifically, if a user completes the identity verification message and finds that he/she wants to operate another account, he/she can send a third voice message that matches the first trigger message to the voice receiving device (such as "re-verification"). ”). Similarly, if a user says that he/she wants to transfer money and then finds that he/she actually wants to withdraw money, he/she can send a third voice message (such as "re-transaction") that matches the second trigger message to the voice receiving device. Of course, as long as you use If the person avoids talking about the first trigger message or the second trigger message, the system will not be triggered by the third voice message to interrupt the transaction.

依據本文的揭露,可達成以下描述的一種使用者情境,需要注意的是,此情境僅為一種舉例,本案不限於此。客戶前往自動櫃員機,依指示插入金融卡或點選無卡交易功能後,自動櫃員機播放語音指示客戶於指定時間內靠近麥克風,並講述身分驗證問題,如生日、電話、帳號及密碼等個人資訊。麥克風收到客戶語音,透過降躁技術過濾雜音,傳輸至本案系統內以判定客戶語音內容。系統判定身分驗證通過,於自動櫃員機播放系語音指示客戶於指定時間內靠近麥克風,講述交易項目(存款/提款/轉帳)及交易內容(帳號/金額),若系統判定身分驗證未通過或逾時,重新回到上述身分驗證步驟。麥克風接收到客戶語音,系統辨識客戶交易內容,並再次於自動櫃員機播放語音指示客戶於指定時間內靠近麥克風,回覆交易確認/修改/取消指令。系統收到客戶確認指令,於自動櫃員機執行交易,若系統收到客戶修改指令,重新回到上述對應流程,若系統收到客戶取消或系統交易逾時,重新回到初始流程。客戶完成交易後,系統以電子郵件及簡訊傳送留存單據。 Based on the disclosure of this article, a user scenario described below can be achieved. It should be noted that this scenario is only an example, and this case is not limited to this. After the customer goes to the ATM and follows the instructions to insert a debit card or click on the card-not-present transaction function, the ATM plays a voice instructing the customer to approach the microphone within a specified time and tell them identity verification questions, such as birthday, phone number, account number, password and other personal information. The microphone receives the customer's voice, filters the noise through noise reduction technology, and transmits it to the system in this case to determine the content of the customer's voice. The system determines that the identity verification has passed, and the voice played in the ATM instructs the customer to approach the microphone within the specified time and describe the transaction items (deposit/withdrawal/transfer) and transaction content (account number/amount). If the system determines that the identity verification has failed or exceeded , go back to the identity verification steps above. The microphone receives the customer's voice, the system identifies the customer's transaction content, and plays the voice again on the ATM to instruct the customer to approach the microphone within the specified time and reply to the transaction confirmation/modification/cancellation instructions. The system receives the customer's confirmation instruction and executes the transaction at the ATM. If the system receives the customer's modification instruction, it returns to the above corresponding process. If the system receives the customer's cancellation or the system transaction times out, it returns to the initial process. After the customer completes the transaction, the system sends the retained receipt via email and SMS.

藉由上述結構,本案所揭示的人工智能聲控銀行交易系統,可透過儲存有人工智慧識別模型的記憶體搭配運算裝置,對接收的語音訊息進行內容判斷,以安全有效地執行身分驗證以及指令控制的功能。讓使用者只需再自動櫃員機前講述自己的身分資料完成驗證後,再口語表達欲執行的銀行服務項目,如此一來一般的銀行業務皆能完全透過聲音控制的方式來完成,對各種族群來說都是便利的措施。 With the above structure, the artificial intelligence voice-controlled banking transaction system disclosed in this case can perform content judgment on the received voice messages through a memory and computing device that stores an artificial intelligence recognition model, so as to safely and effectively perform identity verification and command control. function. Users only need to tell their identity information in front of the ATM to complete the verification, and then verbally express the banking services they want to perform. In this way, general banking services can be completed completely through voice control, which is suitable for various ethnic groups. They are all convenient measures.

雖然本發明以前述之實施例揭露如上,然其並非用以限定本發明。在不脫離本發明之精神和範圍內,所為之更動與潤飾,均屬本發明之專利保護範圍。關於本發明所界定之保護範圍請參考所附之申請專利範圍。 Although the present invention is disclosed in the foregoing embodiments, they are not intended to limit the present invention. All changes and modifications made without departing from the spirit and scope of the present invention shall fall within the scope of patent protection of the present invention. Regarding the protection scope defined by the present invention, please refer to the attached patent application scope.

1:人工智能聲控銀行交易系統 1: Artificial intelligence voice-controlled banking transaction system

10:自動櫃員機 10:ATM

20:運算裝置 20:Computing device

30:記憶體 30:Memory

40:語音接收裝置 40: Voice receiving device

Claims (8)

一種人工智能聲控銀行交易系統,包含:一自動櫃員機,用於讓一使用者自助辦理銀行櫃檯服務;一運算裝置,連接於該自動櫃員機,用於對該使用者進行一身分驗證操作及一指令確認操作;一記憶體,連接於該運算裝置,用於儲存一第一語音訊息及一第二語音訊息,並事先儲存有一語音識別模型、該使用者的一登記身分資料及一指令資料庫;一語音接收裝置,連接於該記憶體,用於接收該使用者的該第一語音訊息及該第二語音訊息並傳輸至該記憶體,其中該語音接收裝置為一耳機設備;以及一行動裝置,連接於該自動櫃員機及該運算裝置,用於自該運算裝置接收加密的一提示訊息並將該提示訊息轉為一提示音後播放給該使用者,其中該運算裝置執行的該身分驗證操作包含:藉由該語音識別模型擷取該第一語音訊息中含有的一待定身分資料;以及比對該待定身分資料與該登記身分資料並確認該待定身分資料符合該登記身分資料,且該運算裝置執行的該指令確認操作包含:藉由該語音識別模型擷取該第二語音訊息中含有的一待定指令資料;以及 比對該待定指令資料與該指令資料庫中的多個執行指令資料,以判斷與該待定指令資料對應的一執行指令資料。 An artificial intelligence voice-activated bank transaction system includes: an automatic teller machine for allowing a user to handle bank counter services by themselves; a computing device connected to the automatic teller machine for performing an identity verification operation and an instruction on the user Confirm operation; a memory connected to the computing device for storing a first voice message and a second voice message, and pre-stored a voice recognition model, a registered identity information of the user and a command database; A voice receiving device connected to the memory for receiving the first voice message and the second voice message of the user and transmitting them to the memory, wherein the voice receiving device is an earphone device; and a mobile device , connected to the ATM and the computing device, for receiving an encrypted prompt message from the computing device and converting the prompt message into a prompt sound and then playing it to the user, wherein the identity verification operation performed by the computing device including: capturing a pending identity data contained in the first voice message through the speech recognition model; and comparing the pending identity data with the registered identity data and confirming that the pending identity data matches the registered identity data, and the calculation The command confirmation operation performed by the device includes: retrieving a pending command data contained in the second voice message through the voice recognition model; and Compare the pending instruction data with multiple execution instruction data in the instruction database to determine an execution instruction data corresponding to the pending instruction data. 如請求項1所述的人工智能聲控銀行交易系統,其中該運算裝置執行的該身分驗證操作更包含:當該運算裝置藉由該語音識別模型擷取該第一語音訊息中含有的該待定身分資料時,執行:透過該自動櫃員機向該使用者發送一身分提示訊息;該運算裝置透過從該語音接收裝置接收來自該使用者的一身分確認訊息,並判斷該身分確認訊息為一肯定訊息或一否定訊息;若該身分確認訊息為該肯定訊息,該運算裝置維持該待定身分資料;以及若該身分確認訊息為該否定訊息,該運算裝置透過該自動櫃員機發出請求以要求該使用者提供另一第一語音訊息以更新該待定身分資料。 The artificial intelligence voice-activated banking transaction system as described in claim 1, wherein the identity verification operation performed by the computing device further includes: when the computing device retrieves the pending identity contained in the first voice message through the speech recognition model When receiving data, execute: send an identity reminder message to the user through the ATM; the computing device receives an identity confirmation message from the user from the voice receiving device, and determines that the identity confirmation message is a positive message or a negative message; if the identity confirmation message is a positive message, the computing device maintains the pending identity information; and if the identity confirmation message is a negative message, the computing device issues a request through the ATM to require the user to provide another A first voice message to update the pending identity information. 如請求項1所述的人工智能聲控銀行交易系統,其中該運算裝置執行的該指令確認操作更包含:當該運算裝置判斷該待定指令資料對應的一執行指令資料時,執行:透過該自動櫃員機向該使用者發送一指令提示訊息;透過從該語音接收裝置接收來自該使用者的一指令確認訊息,並判斷該指令確認訊息為一同意訊息或一非同意訊息; 若該指令確認訊息為該同意訊息,該運算裝置將該執行指令資料傳送至該自動櫃員機以執行相關服務;若該指令確認訊息為該非同意訊息,該運算裝置透過該自動櫃員機發出請求以要求該使用者提供另一第二語音訊息以更新該待定指令資料。 The artificial intelligence voice-activated banking transaction system as described in claim 1, wherein the instruction confirmation operation performed by the computing device further includes: when the computing device determines an execution instruction data corresponding to the pending instruction data, execute: through the automatic teller machine Send a command prompt message to the user; receive a command confirmation message from the user from the voice receiving device, and determine whether the command confirmation message is an consent message or a non-agree message; If the instruction confirmation message is the consent message, the computing device sends the execution instruction data to the ATM to perform related services; if the instruction confirmation message is the non-agree message, the computing device sends a request through the ATM to request the The user provides another second voice message to update the pending instruction data. 如請求項1所述的人工智能聲控銀行交易系統,其中該運算裝置執行的該指令確認操作更包含:當該運算裝置判斷該待定指令資料對應的一執行指令資料時,執行:將該執行指令資料傳送至該自動櫃員機以執行相關服務。 The artificial intelligence voice-activated banking transaction system as described in claim 1, wherein the instruction confirmation operation performed by the computing device further includes: when the computing device determines an execution instruction data corresponding to the pending instruction data, execute: convert the execution instruction The information is transmitted to the ATM to perform related services. 如請求項1所述的人工智能聲控銀行交易系統,更包含一影像擷取裝置,連接於該記憶體,用於擷取該使用者的一臉部影像資料,且該記憶體更事先儲存有一臉部特徵偵測模型及該使用者的一登錄臉部特徵,且該登錄臉部特徵關聯於該登記身分資料,其中,該運算裝置執行的該身分驗證操作更包含一臉部識別操作,包含:藉由該臉部特徵偵測模型擷取該臉部影像資料中含有的一待定臉部特徵;比對該待定臉部特徵與該登錄臉部特徵,以判斷該待定臉部特徵是否符合該登錄臉部特徵;若該待定臉部特徵符合該登錄臉部特徵,該運算裝置執行其他身分驗證操作或該指令確認操作; 若該待定臉部特徵不符合該登錄臉部特徵,該運算裝置透過該自動櫃員機發出請求以要求該使用者提供另一第一語音訊息以更新該待定身分資料與該登記身分資料。 The artificial intelligence voice-activated banking transaction system as described in claim 1 further includes an image capture device connected to the memory for capturing facial image data of the user, and the memory further stores a facial image data in advance. A facial feature detection model and a logged-in facial feature of the user, and the logged-in facial feature is associated with the registered identity information, wherein the identity verification operation performed by the computing device further includes a facial recognition operation, including : Acquire an undetermined facial feature contained in the facial image data through the facial feature detection model; compare the undetermined facial feature with the registered facial feature to determine whether the undetermined facial feature matches the Log in facial features; if the pending facial features match the logged in facial features, the computing device performs other identity verification operations or the command confirmation operation; If the pending facial feature does not match the registered facial feature, the computing device sends a request through the ATM to require the user to provide another first voice message to update the pending identity information and the registered identity information. 如請求項5所述的人工智能聲控銀行交易系統,其中該記憶體更儲存有一唇形識別模型,該運算裝置更用於透過該影像擷取裝置擷取該使用者的多個唇部影像以獲得一動態唇形資料,且該運算裝置執行的該指令確認操作更包含一唇形識別操作,包含:藉由該唇形識別模型擷取該動態唇形資料所包含的一參照指令資料;判斷該待定指令資料與該參照指令資料是否相同;若該待定指令資料與該參照指令資料相同,保留該待定指令資料;以及若該待定指令資料與該參照指令資料不同,該運算裝置透過該自動櫃員機發出請求以要求該使用者提供另一第二語音訊息以更新該待定指令資料。 The artificial intelligence voice-activated banking transaction system as described in claim 5, wherein the memory further stores a lip recognition model, and the computing device is further used to capture multiple lip images of the user through the image capture device. Obtaining a dynamic lip shape data, and the command confirmation operation executed by the computing device further includes a lip shape recognition operation, including: retrieving a reference command data included in the dynamic lip shape data through the lip shape recognition model; judging Whether the pending order data is the same as the reference order data; if the pending order data is the same as the reference order data, the pending order data is retained; and if the pending order data is different from the reference order data, the computing device uses the ATM A request is issued to require the user to provide another second voice message to update the pending instruction data. 如請求項1所述的人工智能聲控銀行交易系統,其中該記憶體更儲存有一第一觸發訊息及一第二觸發訊息,使得當該運算裝置透過該語音接收裝置接收來自該使用者的一第三語音訊息時,判斷該第三語音訊息是否符合該第一觸發訊息或該第二觸發訊息,若該第三語音訊息符合該第一觸發訊息,則重新執行該身分驗證操作,若該第三語音訊息符合該第二觸發訊息,則重新執行該指令確認操作,否則,不執行額外操作。 The artificial intelligence voice-activated banking transaction system as described in claim 1, wherein the memory further stores a first trigger message and a second trigger message, so that when the computing device receives a first trigger message from the user through the voice receiving device, When three voice messages are sent, it is determined whether the third voice message matches the first trigger message or the second trigger message. If the third voice message matches the first trigger message, the identity verification operation is re-executed. If the third voice message matches the first trigger message, the identity verification operation is re-executed. If the voice message matches the second trigger message, the command confirmation operation is re-executed, otherwise, no additional operation is performed. 如請求項1所述的人工智能聲控銀行交易系統,其中該語音接收裝置內具有一濾波器,用於濾除該第一語音訊息及該第二語音訊息中的雜音。The artificial intelligence voice-activated banking transaction system according to claim 1, wherein the voice receiving device has a filter for filtering out noise in the first voice message and the second voice message.
TW111135658A 2022-09-21 2022-09-21 Ai voice control banking transaction system TWI835304B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW111135658A TWI835304B (en) 2022-09-21 2022-09-21 Ai voice control banking transaction system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW111135658A TWI835304B (en) 2022-09-21 2022-09-21 Ai voice control banking transaction system

Publications (2)

Publication Number Publication Date
TWI835304B true TWI835304B (en) 2024-03-11
TW202414385A TW202414385A (en) 2024-04-01

Family

ID=91269682

Family Applications (1)

Application Number Title Priority Date Filing Date
TW111135658A TWI835304B (en) 2022-09-21 2022-09-21 Ai voice control banking transaction system

Country Status (1)

Country Link
TW (1) TWI835304B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1244984A (en) * 1996-11-22 2000-02-16 T-内提克斯公司 Voice recognition for information system access and transaction processing
US20120089410A1 (en) * 1999-11-22 2012-04-12 Accenture Global Services Limited System, method and article of manufacture for enhanced visibility during installation management in a network- based supply chain environment
TW201701605A (en) * 2015-01-26 2017-01-01 創研騰智權信託有限公司 Secure dynamic communication network and protocol
CN107633627A (en) * 2017-09-22 2018-01-26 深圳怡化电脑股份有限公司 One kind is without card withdrawal method, apparatus, equipment and storage medium
CN109472608A (en) * 2018-10-16 2019-03-15 深圳壹账通智能科技有限公司 Business confirmation method and terminal device based on Emotion identification
TWM635534U (en) * 2022-09-21 2022-12-11 華南商業銀行股份有限公司 Artificial intelligence voice controlled banking transaction system

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1244984A (en) * 1996-11-22 2000-02-16 T-内提克斯公司 Voice recognition for information system access and transaction processing
US20120089410A1 (en) * 1999-11-22 2012-04-12 Accenture Global Services Limited System, method and article of manufacture for enhanced visibility during installation management in a network- based supply chain environment
TW201701605A (en) * 2015-01-26 2017-01-01 創研騰智權信託有限公司 Secure dynamic communication network and protocol
CN107633627A (en) * 2017-09-22 2018-01-26 深圳怡化电脑股份有限公司 One kind is without card withdrawal method, apparatus, equipment and storage medium
CN109472608A (en) * 2018-10-16 2019-03-15 深圳壹账通智能科技有限公司 Business confirmation method and terminal device based on Emotion identification
TWM635534U (en) * 2022-09-21 2022-12-11 華南商業銀行股份有限公司 Artificial intelligence voice controlled banking transaction system

Also Published As

Publication number Publication date
TW202414385A (en) 2024-04-01

Similar Documents

Publication Publication Date Title
US10650824B1 (en) Computer systems and methods for securing access to content provided by virtual assistants
CN106373575B (en) User voiceprint model construction method, device and system
KR102611751B1 (en) Augmentation of key phrase user recognition
US8812319B2 (en) Dynamic pass phrase security system (DPSS)
WO2019090834A1 (en) Express cabinet pickup method and apparatus based on voiceprint
US12021857B2 (en) Voice biometric authentication in a virtual assistant
US20100219936A1 (en) Apparatus and method for identifying a name corresponding to a face or voice using a database
CN110169014A (en) Device, method and computer program product for certification
CN106415719A (en) Robust end-pointing of speech signals using speaker recognition
CN101685635A (en) Identity authentication system and method
JP6561345B2 (en) Authentication server, voiceprint authentication system, and voiceprint authentication method
CN109378006A (en) A kind of striding equipment method for recognizing sound-groove and system
CN112417412A (en) Bank account balance inquiry method, device and system
TWM635534U (en) Artificial intelligence voice controlled banking transaction system
TWI835304B (en) Ai voice control banking transaction system
CN112562690A (en) Business processing method, device, equipment and storage medium based on article distribution
CN203480601U (en) Self-service equipment based on voice interaction
TW202424959A (en) Ai voice control banking transaction system based on facial recognition
TW202424958A (en) Interactive ai voice control banking transaction system
US11227610B1 (en) Computer-based systems for administering patterned passphrases
JP7487668B2 (en) Information processing device and information processing method
US11755118B2 (en) Input commands via visual cues
CN115620713A (en) Dialog intention recognition method, device, equipment and storage medium
JP7254316B1 (en) Program, information processing device, and method
TWI234762B (en) Voiceprint identification system for e-commerce