TW201923621A - Conversational financial service device and system comprising a transceiver module, a control module and an emotion recognition module - Google Patents

Conversational financial service device and system comprising a transceiver module, a control module and an emotion recognition module Download PDF

Info

Publication number
TW201923621A
TW201923621A TW106139835A TW106139835A TW201923621A TW 201923621 A TW201923621 A TW 201923621A TW 106139835 A TW106139835 A TW 106139835A TW 106139835 A TW106139835 A TW 106139835A TW 201923621 A TW201923621 A TW 201923621A
Authority
TW
Taiwan
Prior art keywords
emotion
module
voice message
text
unit
Prior art date
Application number
TW106139835A
Other languages
Chinese (zh)
Other versions
TWI675303B (en
Inventor
郭明贊
何韋辰
陸茗庭
江孟軒
Original Assignee
中國信託金融控股股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中國信託金融控股股份有限公司 filed Critical 中國信託金融控股股份有限公司
Priority to TW106139835A priority Critical patent/TWI675303B/en
Publication of TW201923621A publication Critical patent/TW201923621A/en
Application granted granted Critical
Publication of TWI675303B publication Critical patent/TWI675303B/en

Links

Abstract

A conversational financial service device includes a transceiver module, a control module connected to the transceiver module, and an emotion recognition module connected to the control module. The transceiver module receives a voice message. The control module transmits the voice message to the emotion recognition module and generates a reply message according to the emotions contained in the voice message recognized by the emotion recognition module. The conversational financial service device further includes a natural language dialogue module connected to the control module. The natural language dialogue module is configured to identify contents of the voice message.

Description

對話式金融服務裝置及系統Conversational financial service device and system

本發明是有關於一種金融服務裝置及系統,特別是指一種用自然語言對話方式取得金融服務之對話式金融服務裝置及系統。The present invention relates to a financial service device and system, and more particularly to a conversational financial service device and system for obtaining financial services by natural language dialogue.

近年來,由於通訊及網路相關產業的高度發展,人們對各項產業電子化與行動化的要求日益升高。而對於金融業方面,除了提供網路銀行之外,也提供行動銀行的服務,使得使用者可以透過電腦、智慧型手機等各式的電子裝置來登入網路銀行或行動銀行,藉以直接在網路上進行查帳、轉帳、繳費、外匯兌換、匯款等金融交易。In recent years, due to the high development of communication and network-related industries, people are increasingly demanding the electronic and mobileization of various industries. For the financial industry, in addition to providing online banking, it also provides mobile banking services, enabling users to log in to online banking or mobile banking through various electronic devices such as computers and smart phones. Financial transactions such as auditing, transfer, payment, foreign exchange, and remittance are carried out on the road.

然而,在現有的技術中,網路銀行或行動銀行雖然能帶來許多的便利,但在操作各項金融服務時,仍然需要雙眼與雙手停留在實體鍵盤或螢幕上才能操作,不僅十分麻煩,且對於視障或身障的使用者,現有的點擊操作介面也並不友善。However, in the existing technology, although online banking or mobile banking can bring a lot of convenience, when operating various financial services, it is still necessary to keep eyes and hands on the physical keyboard or screen to operate, not only very Trouble, and for users with visual impairment or physical disabilities, the existing click-to-operate interface is also not friendly.

因此,本發明之目的,即在提供一種利用自然語言對話方式取得金融服務之對話式金融服務裝置。Accordingly, it is an object of the present invention to provide a conversational financial service device that utilizes natural language dialogue to obtain financial services.

於是,本發明對話式金融服務裝置,包含一收發模組、一連接收發模組的控制模組、一連接控制模組的自然語言對話模組及一連接控制模組的情緒識別模組,收發模組接收一語音訊息,自然語言對話模組用以識別該語音訊息之內容,情緒識別模組用以識別該語音訊息內含之情緒,控制模組根據情緒識別模組及自然語言對話模組之辨識結果而對應產生一回覆訊息,其中控制模組針對情緒識別模組之辨識結果的判斷權重會高於自然語言對話模組之辨識結果。Therefore, the conversational financial service device of the present invention comprises a transceiver module, a control module connected to the transceiver module, a natural language dialogue module connected to the control module, and an emotion recognition module connected to the control module, which are sent and received. The module receives a voice message, the natural language dialogue module is used to identify the content of the voice message, the emotion recognition module is used to identify the emotion contained in the voice message, and the control module is based on the emotion recognition module and the natural language dialogue module. The recognition result is corresponding to generate a reply message, wherein the control module determines the recognition result of the emotion recognition module to be higher than the recognition result of the natural language dialogue module.

在一實施例中,情緒識別模組包括一文字辨識單元及一連接該文字辨識單元的情緒用字資料庫,情緒用字資料庫儲存複數與情緒文字有關的詞彙或句型,文字辨識單元擷取該語音訊息中對應之語音文字,且配合情緒用字資料庫判斷該語音訊息之情緒文字,並產生一情緒機率評分值,而控制模組係根據該情緒機率評分值而判定語音訊息內含之情緒。In an embodiment, the emotion recognition module includes a text recognition unit and an emotional word database connected to the text recognition unit, and the emotional word database stores a plurality of words or sentence patterns related to the emotional text, and the character recognition unit captures Corresponding voice text in the voice message, and the emotional word database is used to determine the emotional text of the voice message, and an emotional probability score value is generated, and the control module determines the voice message according to the emotional probability score value. mood.

此外,情緒識別模組還可包括一聲紋情緒辨識單元及一連接該聲紋情緒辨識單元的情緒特徵資料庫,情緒特徵資料庫儲存複數與情緒有關之聲紋訊號與預測模型,聲紋情緒辨識單元擷取該語音訊息中對應之聲紋訊號,且配合情緒特徵資料庫判斷該語音訊息之情緒,並產生一情緒特徵值,而控制模組係根據情緒機率評分值及情緒特徵值而判定語音訊息內含之情緒。In addition, the emotion recognition module may further include a voiceprint emotion recognition unit and a mood feature database connected to the voiceprint emotion recognition unit, and the emotion feature database stores a plurality of emotion-related voiceprint signals and prediction models, and voiceprint emotions. The identification unit captures the corresponding voiceprint signal in the voice message, and determines the emotion of the voice message with the emotional feature database, and generates an emotional feature value, and the control module determines the emotional probability score and the emotional feature value. The emotions contained in the voice message.

在一實施例中,自然語言對話模組包括一自然語言理解單元及一對話生成單元,自然語言理解單元用以根據該語音訊息之內容執行對應之金融服務,對話生成單元用以根據該語音訊息產生對應之對話內容。In an embodiment, the natural language dialogue module includes a natural language understanding unit and a dialog generation unit. The natural language understanding unit is configured to execute a corresponding financial service according to the content of the voice message, and the dialog generating unit is configured to use the voice message according to the voice message. Generate corresponding dialogue content.

進一步地,自然語言對話模組還包括一語言轉文字單元及一文字轉語音單元,語言轉文字單元用以比對該語音訊息中的文字是否符合至少一金融詞彙,文字轉語音單元用以將文字轉換為對應之聲音訊號輸出。Further, the natural language dialogue module further includes a language-to-text unit and a text-to-speech unit, wherein the language-to-text unit is configured to compare the text in the voice message with at least one financial vocabulary, and the text-to-speech unit is used to convert the text. Convert to the corresponding audio signal output.

此外,本發明之另一目的,即在提供一種利用自然語言對話方式取得金融服務之對話式金融服務系統。Further, another object of the present invention is to provide a conversational financial service system for obtaining financial services using natural language dialogue.

本發明對話式金融服務系統,包含一產生一語音訊息的使用端裝置,以及一與使用端裝置通訊的對話式金融服務裝置,對話式金融服務裝置包括一接收該語音訊息的收發模組、一連接收發模組的控制模組、一連接控制模組的自然語言對話模組及一連接控制模組的情緒識別模組,自然語言對話模組用以識別該語音訊息之內容,情緒識別模組用以識別該語音訊息內含之情緒,控制模組根據情緒識別模組及自然語言對話模組之辨識結果而對應產生一回覆訊息,其中控制模組針對情緒識別模組之辨識結果的判斷權重會高於自然語言對話模組之辨識結果。The conversational financial service system of the present invention comprises a user device for generating a voice message, and a conversational financial service device for communicating with the user device, the conversational financial service device comprising a transceiver module for receiving the voice message, A control module connected to the transceiver module, a natural language dialogue module connected to the control module, and an emotion recognition module connected to the control module, the natural language dialogue module is used to identify the content of the voice message, and the emotion recognition module For identifying the emotion contained in the voice message, the control module generates a reply message according to the recognition result of the emotion recognition module and the natural language dialogue module, wherein the control module determines the weight of the recognition result of the emotion recognition module. It will be higher than the recognition result of the natural language dialogue module.

在一實施例中,情緒識別模組包括一文字辨識單元及一連接文字辨識單元的情緒用字資料庫,情緒用字資料庫儲存複數與情緒文字有關的詞彙或句型,文字辨識單元擷取該語音訊息中對應之語音文字,且配合情緒用字資料庫判斷該語音訊息之情緒文字,並產生一情緒機率評分值,而控制模組係根據該情緒機率評分值而判定語音訊息內含之情緒。In an embodiment, the emotion recognition module includes a character recognition unit and a mood word database connected to the character recognition unit, and the emotion word database stores a plurality of words or sentence patterns related to the emotion text, and the text recognition unit captures the Corresponding voice text in the voice message, and using the word database to determine the emotional text of the voice message, and generating an emotional probability score value, and the control module determines the emotion included in the voice message according to the emotional probability score value .

此外,情緒識別模組還可包括一聲紋情緒辨識單元及一連接聲紋情緒辨識單元的情緒特徵資料庫,情緒特徵資料庫儲存複數與情緒有關之聲紋訊號與預測模型,聲紋情緒辨識單元擷取該語音訊息中對應之聲紋訊號,且配合情緒特徵資料庫判斷該語音訊息之情緒,並產生一情緒特徵值,而控制模組係根據該情緒機率評分值及該情緒特徵值而判定語音訊息內含之情緒。In addition, the emotion recognition module may further comprise a voiceprint emotion recognition unit and a sentiment feature database connected with the voiceprint emotion recognition unit, the emotion feature database stores the complex sound and sound pattern and prediction model, and the voiceprint emotion recognition. The unit captures the corresponding voiceprint signal in the voice message, and determines the emotion of the voice message with the emotional feature database, and generates an emotional feature value, and the control module is based on the emotional probability score value and the emotional feature value. Determine the emotions contained in the voice message.

在一實施例中,自然語言對話模組包括一自然語言理解單元及一對話生成單元,自然語言理解單元用以根據該語音訊息之內容執行對應之金融服務,對話生成單元用以根據該語音訊息產生對應之對話內容。In an embodiment, the natural language dialogue module includes a natural language understanding unit and a dialog generation unit. The natural language understanding unit is configured to execute a corresponding financial service according to the content of the voice message, and the dialog generating unit is configured to use the voice message according to the voice message. Generate corresponding dialogue content.

進一步地,自然語言對話模組還包括一語言轉文字單元及一文字轉語音單元,語言轉文字單元用以比對該語音訊息中的文字是否符合至少一金融詞彙,文字轉語音單元用以將文字轉換為對應之聲音訊號輸出。Further, the natural language dialogue module further includes a language-to-text unit and a text-to-speech unit, wherein the language-to-text unit is configured to compare the text in the voice message with at least one financial vocabulary, and the text-to-speech unit is used to convert the text. Convert to the corresponding audio signal output.

本發明之功效在於:利用自然語言互動的方式取得金融服務,在操作上可更為便利,不僅省去使用者必須雙眼與雙手停留在實體鍵盤或螢幕上操作之麻煩,對於身障或視障的使用者,同樣也能輕鬆簡單地完成金融服務。The effect of the invention is that the use of natural language interaction to obtain financial services can be more convenient in operation, which not only saves the user from having to both eyes and hands to stay on the physical keyboard or the screen, and for the obstacle or vision. Users of the barrier can also easily complete financial services.

在本發明被詳細描述之前,應當注意在以下的說明內容中,類似的元件是以相同的編號來表示。Before the present invention is described in detail, it should be noted that in the following description, similar elements are denoted by the same reference numerals.

參閱圖1,為本發明對話式金融服務系統100之一實施例,本對話式金融服務系統100可透過自然對話的方式取得金融服務,不僅可提供服務給有特殊需求的客戶,同時也解決現行用戶必須維持雙眼與雙手停留在實體螢幕上操作之麻煩,增加操作及生活的便利性。Referring to FIG. 1 , which is an embodiment of a conversational financial service system 100 of the present invention, the conversational financial service system 100 can obtain financial services through natural dialogue, and not only provides services to customers with special needs, but also solves the current problem. The user must maintain the trouble of both eyes and hands staying on the physical screen, increasing the convenience of operation and life.

對話式金融服務系統100包含一使用端裝置10及一與使用端裝置10通訊的對話式金融服務裝置20。The conversational financial service system 100 includes a consumer device 10 and a conversational financial service device 20 that communicates with the user device 10.

使用端裝置10可為一行動裝置、個人電腦(PC)、平板電腦、筆記型電腦(NB)等位於使用者端操作的電子裝置,其可透過安裝/開啟應用程式(APP)、社交軟體、網頁等介面執行本發明之對話式金融服務,即使用者可透過上述電子裝置與對話式金融服務裝置20對話,以取得金融服務。使用端裝置10會根據使用者所訴說的言語,產生一對應的語音訊息。The user device 10 can be a mobile device, such as a mobile device, a personal computer (PC), a tablet computer, a notebook computer (NB), etc., which can be installed/opened by an application (APP), social software, The web page and the like perform the dialog financial service of the present invention, that is, the user can talk to the conversational financial service device 20 through the electronic device to obtain the financial service. The user device 10 generates a corresponding voice message according to the words spoken by the user.

對話式金融服務裝置20為一位於銀行端(或企業端)的後台主機或伺服器,其中包括一收發模組21、一連接收發模組21的控制模組22及一連接控制模組22的情緒識別模組23。The conversational financial service device 20 is a background host or server located at the bank (or enterprise), and includes a transceiver module 21, a control module 22 connected to the transceiver module 21, and a connection control module 22. Emotion recognition module 23.

收發模組21為一無線傳輸電路,例如:無線射頻電路(RFID)、或近場通訊(Near Field Communication,NFC)、WiFi、藍芽(Bluetooth)、3G/4G網路等,其用以與使用端裝置10通訊,以相互傳輸訊號。The transceiver module 21 is a wireless transmission circuit, such as: a radio frequency circuit (RFID), or a near field communication (NFC), a WiFi, a Bluetooth, a 3G/4G network, etc. The end device 10 is used to communicate to transmit signals to each other.

控制模組22可為一微處理器(MCU)或中央控制器,用以控制各模組間的運作與訊號傳輸,控制模組22將收發模組21所接收到的語音訊息傳送至情緒識別模組23,且根據情緒識別模組23所識別的語音訊息內含之情緒,對應產生一回覆訊息,並透過收發模組21將該回覆訊息回傳至使用端裝置10,以回覆使用者所述說之內容。The control module 22 can be a microprocessor (MCU) or a central controller for controlling the operation and signal transmission between the modules. The control module 22 transmits the voice message received by the transceiver module 21 to the emotion recognition. The module 23 generates a reply message according to the emotion contained in the voice message recognized by the emotion recognition module 23, and transmits the reply message to the user device 10 through the transceiver module 21 to reply to the user's office. The content of the description.

在本實施例中,情緒識別模組23包括一文字辨識單元231、一連接文字辨識單元231的情緒用字資料庫232、一聲紋情緒辨識單元233及一連接聲紋情緒辨識單元233的情緒特徵資料庫234。In this embodiment, the emotion recognition module 23 includes a character recognition unit 231, an emotion word database 232 connected to the character recognition unit 231, a voiceprint emotion recognition unit 233, and an emotional feature of the voiceprint emotion recognition unit 233. Database 234.

情緒用字資料庫232儲存有複數個與情緒文字有關的詞彙或句型,例如:憤怒情緒詞彙(如「髒話」、「在搞什麼」等)、疑惑情緒詞彙(如「確定嗎?」、「為什麼?」等)、不耐煩情緒詞彙(如「可以快點嗎?」、「很慢耶」等)、高興情緒詞彙(如「很好」、「好棒」等)等各種情緒詞彙,不以上述為限。文字辨識單元231接收該語音訊息,並擷取出其中對應之語音文字,且配合情緒用字資料庫232,依文法句型與關鍵字之組合產生一情緒機率評分值,並按照預設閥值(threshold)判斷上述各種情緒文字,例如:當句型為否定句或包含辱罵字眼,則組合之憤怒情緒機率高(即高情緒機率評分值)。The Emotional Character Database 232 stores a plurality of vocabulary or sentence patterns related to emotional words, such as: anger vocabulary words (such as "dirty words", "what is doing", etc.), doubtful vocabulary words (such as "OK?", "Why?", etc.), impatient emotional vocabulary (such as "can you hurry?", "very slow", etc.), happy emotional vocabulary (such as "very good", "good", etc.) and other emotional vocabulary, Not limited to the above. The character recognition unit 231 receives the voice message and extracts the corresponding voice text, and cooperates with the emotion word database 232 to generate an emotional probability score according to the combination of the grammar sentence type and the keyword, and according to the preset threshold ( Threshold) Judging the above various emotional words, for example, when the sentence pattern is a negative sentence or contains abusive words, the combined anger is high (ie, high emotional probability score).

情緒特徵資料庫234儲存複數與情緒有關之聲紋訊號與預測模型。聲紋情緒辨識單元233接收該語音訊息,並擷取出其中對應之聲紋訊號,且配合情緒特徵資料庫234所儲存之聲紋訊號與預測模型,判斷上述各種情緒(包括憤怒、疑惑、不耐煩、高興等),並產生一情緒特徵值。The emotional feature database 234 stores complex vocal tract signals and predictive models related to emotions. The voiceprint emotion recognition unit 233 receives the voice message, and extracts the corresponding voiceprint signal, and cooperates with the voiceprint signal and the prediction model stored in the emotion feature database 234 to determine the various emotions (including anger, doubt, and impatientness). , happy, etc.) and produce an emotional feature value.

換言之,文字辨識單元231及情緒用字資料庫232係利用如情緒字彙等文字來判斷語音訊息之情緒,聲紋情緒辨識單元233及情緒特徵資料庫234則係利用如音調起伏等聲紋來判斷語音訊息之情緒,當然,兩種方式可以配合不同應用而選擇其中一種方式判斷(即情緒識別模組23僅包括文字辨識單元231及情緒用字資料庫232,或聲紋情緒辨識單元233及情緒特徵資料庫234),因此,控制模組22係根據文字辨識單元231產生的情緒機率評分值及/或聲紋情緒辨識單元233產生的情緒特徵值而判定語音訊息內含之情緒,並對應產生回覆訊息。In other words, the character recognition unit 231 and the emotion word database 232 use words such as emotional vocabulary to judge the emotion of the voice message, and the voiceprint emotion recognition unit 233 and the emotion feature database 234 are judged by using voiceprints such as pitch fluctuations. The emotion of the voice message, of course, the two methods can be selected in accordance with different applications and the method is selected (ie, the emotion recognition module 23 includes only the character recognition unit 231 and the emotion word database 232, or the voiceprint emotion recognition unit 233 and the emotion. The feature database 234), therefore, the control module 22 determines the emotion contained in the voice message according to the emotion probability score value generated by the character recognition unit 231 and/or the emotion feature value generated by the voiceprint emotion recognition unit 233, and correspondingly generates Reply to the message.

此外,對話式金融服務裝置20還可包括一連接控制模組22的自然語言對話模組24,用以判斷收發模組21所接收到的語音訊息之內容。In addition, the conversational financial service device 20 may further include a natural language dialogue module 24 connected to the control module 22 for determining the content of the voice message received by the transceiver module 21.

自然語言對話模組24包括一語言轉文字單元241、一自然語言理解單元242、一對話生成單元243及一文字轉語音單元244。The natural language dialogue module 24 includes a language-to-text unit 241, a natural language understanding unit 242, a dialog generation unit 243, and a text-to-speech unit 244.

語言轉文字單元241內含一金融詞彙資料庫,其中儲存有複數與金融服務及產品有關的金融詞彙,例如:玉璽卡、LINE PAY卡等,語言轉文字單元241用以從該語音訊息中擷取對應之文字,並配合金融詞彙資料庫比對該些文字中是否有符合至少一相關的金融詞彙,以提高語音轉文字之辨識的正確率。此外,語言轉文字單元241還可內含一中文聲學模型,可針對同一金融字彙蒐集不同使用者之發音方式,以增加對於口音(例如:台灣國語)的適應性,可更優化語音轉文字之正確轉換率。The language-to-text unit 241 includes a financial vocabulary database in which a plurality of financial vocabularies related to financial services and products are stored, such as a 玺 Y card, a LINE PAY card, etc., and the language-to-text unit 241 is used to extract from the voice message. Take the corresponding text, and cooperate with the financial vocabulary database to compare whether there is at least one relevant financial vocabulary in the text to improve the accuracy of the recognition of the voice-to-text. In addition, the language-to-text unit 241 can also include a Chinese acoustic model, which can collect the pronunciation patterns of different users for the same financial vocabulary, so as to increase the adaptability to the accent (for example, Taiwanese Mandarin), and optimize the voice-to-text Correct conversion rate.

自然語言理解單元242可內含一意圖與指令資料庫,該意圖與指令資料庫儲存有複數與金融交易關的控制指令,自然語言理解單元242可根據該語音訊息之內容,並配合意圖與指令資料庫中的控制指令執行對應之金融服務,包含個人帳戶餘額查詢、帳單查詢、交易明細查詢、匯款轉帳、帳單繳款、收支紀錄與查詢、股匯價查詢、到價提醒、交易預定、投資交易執行、線上客服問答、保單內容查詢等,但不以上述為限。The natural language comprehension unit 242 can include an intent and instruction database, the intent and instruction database storing a plurality of control instructions for the financial transaction, and the natural language understanding unit 242 can cooperate with the intent and the instruction according to the content of the speech message. The control instructions in the database execute the corresponding financial services, including personal account balance inquiry, billing inquiry, transaction detail inquiry, remittance transfer, bill payment, income and expenditure record and inquiry, stock exchange price inquiry, arrival price reminder, transaction reservation , investment transaction execution, online customer service Q&A, policy content inquiry, etc., but not limited to the above.

對話生成單元243可內含一儲存複數情境與使用者情緒之預設對答內容與角色句型結構的品牌角色個性資料庫,以提供多種助理角色個性並產生對話內容,讓使用者可自行決定服務互動的方式。The dialog generating unit 243 can include a brand personality database for storing the complex context and the user's emotions, and providing a variety of assistant character personalities and generating conversation content, so that the user can decide the service. The way to interact.

文字轉語音單元244用以將任意文字轉換為對應之聲音訊號輸出,且在本實施例中,文字轉語音單元244可針對特定角色之合成聲音所蒐集並建立一品牌識別性聲紋模型,以輸出特定角色之聲音訊號。The text-to-speech unit 244 is configured to convert any text into a corresponding audio signal output, and in this embodiment, the text-to-speech unit 244 can collect and establish a brand-recognized voiceprint model for the synthesized sound of the specific character. Output the sound signal of a specific character.

再者,對話式金融服務裝置20還可包括一連接控制模組22的身份識別模組25,用以針對使用者的身份進行核對與驗證。身份識別模組25包括一身份辨識單元251、一連接身份辨識單元251的個人資料庫252、一聲紋辨識單元253及一連接聲紋辨識單元253的用戶聲紋資料庫254。Moreover, the conversational financial service device 20 can further include an identity recognition module 25 connected to the control module 22 for checking and verifying the identity of the user. The identity recognition module 25 includes an identity recognition unit 251, a personal data library 252 connected to the identity recognition unit 251, a voiceprint recognition unit 253, and a user voiceprint database 254 connected to the voiceprint recognition unit 253.

個人資料庫252為一使用者之身分驗認、帳號密碼、數位憑證資料庫,其中儲存複數筆使用者的金融個人資訊;身份辨識單元251根據該語音訊息之內容(例如:使用者說出自己的帳號密碼)並配合個人資料庫252之金融個人資訊而核對使用者之身分。用戶聲紋資料庫254則是儲存使用者每次講話的語音訊息之聲紋;聲紋辨識單元253可利用聲紋提取方式建立個人之可辨識模型,以核對使用者日後使用時之聲音。The personal database 252 is a user identity verification, account password, and digital certificate database, wherein the financial personal information of the plurality of users is stored; the identity recognition unit 251 is based on the content of the voice message (for example, the user speaks himself) Check the identity of the user with the personal account information of the personal database 252. The user voiceprint database 254 is a voiceprint for storing the voice message of the user each time; the voiceprint recognition unit 253 can establish a personal identifiable model by using the voiceprint extraction method to check the voice of the user in the future use.

補充說明的是,身份識別模組25也可以位於另一主機或伺服器中(例如:現有的身份核對伺服器),控制模組22可透過收發模組21而與該主機或伺服器通訊,以核對使用者之身份。另外,對話式金融服務裝置20也可包括一連接控制模組22的個人金融資料庫(圖未示),用以儲存使用者的帳戶、信用、金融交易等資訊與紀錄,或是透過收發模組21的通訊,而將該些資訊與紀錄儲存至另一電子裝置(例如:現有的個人資料伺服器)之資料庫中,故以上皆不以本實施例為限。In addition, the identity module 25 can also be located in another host or server (for example, an existing identity verification server), and the control module 22 can communicate with the host or server through the transceiver module 21. To verify the identity of the user. In addition, the conversational financial service device 20 may also include a personal financial database (not shown) connected to the control module 22 for storing information and records of the user's account, credit, financial transaction, or the like. The communication of the group 21 is stored in the database of another electronic device (for example, an existing personal data server), so the above is not limited to the embodiment.

因此,當使用者身份核對無誤後,控制模組22會將收發模組21每次接收到的語音訊息傳送至情緒識別模組23及自然語言對話模組24,情緒識別模組23會透過文字與聲紋而識別該語音訊息內含之情緒,自然語言對話模組24則會判斷語音訊息之內容,例如:是否有提到特定金融詞彙,或是否需要執行如匯款轉帳、帳單繳款等金融交易服務,而當情緒識別模組23及自然語言對話模組24會分別將其判斷結果回傳至控制模組22,以供控制模組22進行綜合評估並藉由對話生成單元243對應產生回覆訊息。Therefore, after the user identity is verified, the control module 22 transmits the voice message received by the transceiver module 21 to the emotion recognition module 23 and the natural language dialogue module 24, and the emotion recognition module 23 transmits the text. The natural language dialogue module 24 determines the content of the voice message, such as whether a specific financial vocabulary is mentioned, or whether it is necessary to perform such a transfer transfer, bill payment, etc. The financial transaction service, and the emotion recognition module 23 and the natural language dialogue module 24 respectively transmit the determination result to the control module 22 for the control module 22 to perform comprehensive evaluation and correspondingly generated by the dialog generation unit 243. Reply to the message.

特別說明的是,情緒識別模組23的判斷權重會高於自然語言對話模組24,也就是說當情緒辨識結果超過預設之閥值(Threshold),則優先進行情緒之處理,並以對話生成單元243中之品牌角色個性資料庫內針對情緒之預設對答內容加以回應,舉例來說,當使用者反應「哇!這也太貴了吧」,則對應產生的回覆訊息會為「別擔心」,當情緒回應優先處理完成後,則依照自然語言對話模組24所判斷之使用者意圖,執行對應之金融交易服務。Specifically, the judgment weight of the emotion recognition module 23 is higher than that of the natural language dialogue module 24, that is, when the emotion recognition result exceeds a preset threshold (Threshold), the emotion is processed preferentially, and the conversation is performed. In the branding personality database of the generating unit 243, the default response to the emotion is responded to. For example, when the user responds "Wow! This is too expensive," the corresponding reply message will be "Don't Worried, when the emotional response priority processing is completed, the corresponding financial transaction service is executed according to the user's intention determined by the natural language dialogue module 24.

配合參閱圖2,圖2為本發明對話式金融服務方法,本方法可應用於對話式金融服務系統100,或是任何可執行以下流程之系統或電子裝置。本對話式金融服務方法之詳細流程如下:Referring to FIG. 2, FIG. 2 is a schematic financial service method of the present invention. The method can be applied to the conversational financial service system 100, or any system or electronic device that can execute the following processes. The detailed process of this dialogue financial service method is as follows:

步驟S10,使用者透過安裝/開啟應用程式(APP)、社交軟體、網頁等介面執行對話式金融服務,並開始對話,例如:「您好,我是王小明,請幫我服務」。In step S10, the user performs a conversational financial service by installing/opening an application (APP), a social software, a webpage, etc., and starts a conversation, for example, "Hello, I am Wang Xiaoming, please help me."

步驟S20,使用端裝置10會將該對話之內容轉換成語音訊息,並傳送至對話式金融服務裝置20。In step S20, the user terminal 10 converts the content of the conversation into a voice message and transmits it to the conversational financial service device 20.

步驟S30,控制模組22先將語音訊息傳送至身份識別模組25,以進行身分之核對。身份辨識單元251可針對「王小明」之個人資訊進行判斷,並配合聲紋辨識單元253針對其對話之聲紋判定,以綜合評估核對使用者之身份,若核對身份無誤,則控制模組22產生回覆訊息,例如:「您好,王先生,很高興為您服務」。In step S30, the control module 22 first transmits the voice message to the identity recognition module 25 for checking the identity. The identity recognition unit 251 can determine the personal information of "Wang Xiaoming" and cooperate with the voiceprint recognition unit 253 to determine the identity of the user for the voiceprint determination of the dialogue. If the identity is correct, the control module 22 generates Reply to the message, for example: "Hello, Mr. Wang, I am very happy to serve you."

步驟S40,接著使用端裝置10可再將該對話之內容(例如:「我這個月信用卡帳單多少錢?」)轉換成語音訊息,並傳送至對話式金融服務裝置20。In step S40, the content of the conversation (for example, "How much is my credit card bill this month?") can be converted into a voice message by using the terminal device 10, and transmitted to the conversational financial service device 20.

步驟S50,控制模組22根據情緒識別模組23所識別的語音訊息內含之情緒,以及自然語言對話模組24所判斷的語音訊息之內容,進行綜合評估並對應產生回覆訊息,例如:「您本月信用卡帳單費用為50000元」。In step S50, the control module 22 performs a comprehensive evaluation based on the emotion contained in the voice message recognized by the emotion recognition module 23 and the content of the voice message judged by the natural language dialogue module 24, and correspondingly generates a reply message, for example: Your credit card bill for this month is 50,000 yuan."

步驟S60,透過收發模組21回傳至使用端裝置10,以輸出該回覆訊息。特別說明的是,輸出的方式不僅可以語音的方式,還可以同時透過使用端裝置10的互動介面顯示對應之文字,並不限於何種方式。Step S60 is transmitted back to the user device 10 through the transceiver module 21 to output the reply message. In particular, the output mode can display the corresponding text not only in a voice manner but also through the interactive interface of the terminal device 10, and is not limited to which manner.

綜上所述,本發明對話式金融服務系統100及方法,利用自然語言互動的形式,可更為便利輕鬆獲得金融相關服務,可以解決現行使用者必須維持雙眼與雙手停留在實體螢幕上操作之麻煩,例如在行車的同時,即可以對話方式完成日常金融服務,故確實能達成本發明之目的。In summary, the conversational financial service system 100 and method of the present invention can facilitate the easy access to financial related services by using the form of natural language interaction, and can solve the problem that the current user must maintain both eyes and hands on the physical screen. The trouble of the operation, for example, at the same time as driving, can complete the daily financial service in a dialogue manner, so that the object of the present invention can be achieved.

惟以上所述者,僅為本發明之實施例而已,當不能以此限定本發明實施之範圍,凡是依本發明申請專利範圍及專利說明書內容所作之簡單的等效變化與修飾,皆仍屬本發明專利涵蓋之範圍內。However, the above is only the embodiment of the present invention, and the scope of the invention is not limited thereto, and all the equivalent equivalent changes and modifications according to the scope of the patent application and the patent specification of the present invention are still The scope of the invention is covered.

100‧‧‧對話式金融服務系統 100‧‧‧Dialogue Financial Services System

10‧‧‧使用端裝置10‧‧‧Using end device

20‧‧‧對話式金融服務裝置20‧‧‧Dialogue financial service installation

21‧‧‧收發模組21‧‧‧ transceiver module

22‧‧‧控制模組22‧‧‧Control Module

23‧‧‧情緒識別模組23‧‧‧Emotion Recognition Module

231‧‧‧文字辨識單元231‧‧‧Text recognition unit

232‧‧‧情緒用字資料庫232‧‧‧Emotional word database

233‧‧‧聲紋情緒辨識單元233‧‧‧ voiceprint emotion recognition unit

234‧‧‧情緒特徵資料庫234‧‧‧Emotional Characteristics Database

24‧‧‧自然語言對話模組24‧‧‧Natural Language Dialogue Module

241‧‧‧語言轉文字單元241‧‧‧ language to text unit

242‧‧‧自然語言理解單元242‧‧‧Natural Language Understanding Unit

243‧‧‧對話生成單元243‧‧‧Dialog generation unit

244‧‧‧文字轉語音單元244‧‧‧Text-to-speech unit

25‧‧‧身份識別模組25‧‧‧identification module

251‧‧‧身份辨識單元251‧‧‧identity unit

252‧‧‧個人資料庫252‧‧‧Personal Database

253‧‧‧聲紋辨識單元253‧‧‧ voiceprint identification unit

254‧‧‧用戶聲紋資料庫254‧‧‧User voiceprint database

步驟S10~S60Steps S10 to S60

本發明之其他的特徵及功效,將於參照圖式的實施方式中清楚地呈現,其中: 圖1是本發明對話式金融服務系統的一實施例的電路方塊示意圖;及 圖2是本發明對話式金融服務方法的流程圖。The other features and advantages of the present invention will be apparent from the embodiments of the present invention, wherein: Figure 1 is a circuit block diagram of an embodiment of the dialog financial service system of the present invention; and Figure 2 is a dialog of the present invention. Flow chart of the financial services method.

Claims (10)

一種對話式金融服務裝置,包含: 一收發模組,接收一語音訊息; 一控制模組,連接該收發模組; 一自然語言對話模組,連接該控制模組,該自然語言對話模組用以識別該語音訊息之內容;及 一情緒識別模組,連接該控制模組,該情緒識別模組用以識別該語音訊息內含之情緒,該控制模組根據該情緒識別模組及該自然語言對話模組之辨識結果而對應產生一回覆訊息,其中該控制模組針對該情緒識別模組之辨識結果的判斷權重會高於該自然語言對話模組之辨識結果。A conversational financial service device comprising: a transceiver module for receiving a voice message; a control module for connecting the transceiver module; a natural language dialogue module for connecting the control module, the natural language dialogue module The emotion recognition module is configured to identify the emotion contained in the voice message, and the emotion recognition module is configured to identify the emotion included in the voice message, and the control module identifies the module according to the emotion and the natural The recognition result of the language dialogue module correspondingly generates a reply message, wherein the control module determines the recognition result of the emotion recognition module to be higher than the recognition result of the natural language dialogue module. 如請求項1所述的對話式金融服務裝置,其中,該情緒識別模組包括一文字辨識單元及一連接該文字辨識單元的情緒用字資料庫,該情緒用字資料庫儲存複數與情緒文字有關的詞彙或句型,該文字辨識單元擷取該語音訊息中對應之語音文字,且配合該情緒用字資料庫判斷該語音訊息之情緒文字,並產生一情緒機率評分值,該控制模組係根據該情緒機率評分值而判定語音訊息內含之情緒。The conversational financial service device of claim 1, wherein the emotion recognition module comprises a text recognition unit and an emotional word database connected to the character recognition unit, wherein the emotional word database stores a plurality of characters related to the emotional text. a vocabulary or sentence pattern, the character recognition unit captures the corresponding voice text in the voice message, and uses the emotion word database to determine the emotional text of the voice message, and generates an emotional probability score value, and the control module is The emotion contained in the voice message is determined based on the emotional probability score value. 如請求項2所述的對話式金融服務裝置,其中,該情緒識別模組還包括一聲紋情緒辨識單元及一連接該聲紋情緒辨識單元的情緒特徵資料庫,該情緒特徵資料庫儲存複數與情緒有關之聲紋訊號與預測模型,該聲紋情緒辨識單元擷取該語音訊息中對應之聲紋訊號,且配合該情緒特徵資料庫判斷該語音訊息之情緒,並產生一情緒特徵值,該控制模組係根據該情緒機率評分值及該情緒特徵值而判定該語音訊息內含之情緒。The conversational financial service device of claim 2, wherein the emotion recognition module further comprises a voiceprint emotion recognition unit and a mood feature database connected to the voiceprint emotion recognition unit, wherein the emotion feature database stores a plurality of An emotion-related voiceprint signal and prediction model, the voiceprint emotion recognition unit captures a corresponding voiceprint signal in the voice message, and cooperates with the emotion feature database to determine the emotion of the voice message, and generates an emotional feature value. The control module determines the emotion contained in the voice message according to the emotional probability score value and the emotion feature value. 如請求項1所述的對話式金融服務裝置,其中,該自然語言對話模組包括一自然語言理解單元及一對話生成單元,該自然語言理解單元用以根據該語音訊息之內容執行對應之金融服務,該對話生成單元用以根據該語音訊息產生對應之對話內容。The conversational financial service device of claim 1, wherein the natural language dialogue module comprises a natural language understanding unit and a dialog generation unit, wherein the natural language understanding unit is configured to execute the corresponding financial according to the content of the voice message. The service, the dialog generating unit is configured to generate a corresponding conversation content according to the voice message. 如請求項4所述的對話式金融服務裝置,其中,該自然語言對話模組還包括一語言轉文字單元及一文字轉語音單元,該語言轉文字單元用以比對該語音訊息中的文字是否符合至少一金融詞彙,該文字轉語音單元用以將文字轉換為對應之聲音訊號輸出。The conversational financial service device of claim 4, wherein the natural language dialogue module further comprises a language-to-text unit and a text-to-speech unit, wherein the language-to-text unit is configured to compare whether the text in the voice message is The at least one financial vocabulary is matched, and the text-to-speech unit is used to convert the text into a corresponding audio signal output. 一種對話式金融服務系統,包含: 一使用端裝置,產生一語音訊息;及 一對話式金融服務裝置,與該使用端裝置通訊,該對話式金融服務裝置包括一接收該語音訊息的收發模組、一連接該收發模組的控制模組、一連接該控制模組的自然語言對話模組及一連接該控制模組的情緒識別模組,該自然語言對話模組用以識別該語音訊息之內容,該情緒識別模組用以識別該語音訊息內含之情緒,該控制模組根據該情緒識別模組及該自然語言對話模組之辨識結果而對應產生一回覆訊息,其中該控制模組針對該情緒識別模組之辨識結果的判斷權重會高於該自然語言對話模組之辨識結果。A conversational financial service system comprising: a user device for generating a voice message; and a conversational financial service device communicating with the user device, the conversational financial service device comprising a transceiver module for receiving the voice message a control module connected to the transceiver module, a natural language dialogue module connected to the control module, and an emotion recognition module connected to the control module, the natural language dialogue module is configured to identify the voice message Contenting, the emotion recognition module is configured to identify the emotion contained in the voice message, and the control module generates a reply message according to the recognition result of the emotion recognition module and the natural language dialogue module, wherein the control module The judgment weight of the recognition result of the emotion recognition module is higher than the identification result of the natural language dialogue module. 如請求項6所述的對話式金融服務系統,其中,該情緒識別模組包括一文字辨識單元及一連接該文字辨識單元的情緒用字資料庫,該情緒用字資料庫儲存複數與情緒文字有關的詞彙或句型,該文字辨識單元擷取該語音訊息中對應之語音文字,且配合該情緒用字資料庫判斷該語音訊息之情緒文字,並產生一情緒機率評分值,該控制模組係根據該情緒機率評分值而判定語音訊息內含之情緒。The conversational financial service system of claim 6, wherein the emotion recognition module comprises a text recognition unit and an emotional word database connected to the character recognition unit, wherein the emotion word database stores the plural and the emotional text a vocabulary or sentence pattern, the character recognition unit captures the corresponding voice text in the voice message, and uses the emotion word database to determine the emotional text of the voice message, and generates an emotional probability score value, and the control module is The emotion contained in the voice message is determined based on the emotional probability score value. 如請求項7所述的對話式金融服務系統,其中,該情緒識別模組還包括一聲紋情緒辨識單元及一連接該聲紋情緒辨識單元的情緒特徵資料庫,該情緒特徵資料庫儲存複數與情緒有關之聲紋訊號與預測模型,該聲紋情緒辨識單元擷取該語音訊息中對應之聲紋訊號,且配合該情緒特徵資料庫判斷該語音訊息之情緒,並產生一情緒特徵值,該控制模組係根據該情緒機率評分值及該情緒特徵值而判定該語音訊息內含之情緒。The conversational financial service system of claim 7, wherein the emotion recognition module further comprises a voiceprint emotion recognition unit and a mood feature database connected to the voiceprint emotion recognition unit, wherein the emotion feature database stores a plurality of An emotion-related voiceprint signal and prediction model, the voiceprint emotion recognition unit captures a corresponding voiceprint signal in the voice message, and cooperates with the emotion feature database to determine the emotion of the voice message, and generates an emotional feature value. The control module determines the emotion contained in the voice message according to the emotional probability score value and the emotion feature value. 如請求項6所述的對話式金融服務系統,其中,該自然語言對話模組包括一自然語言理解單元及一對話生成單元,該自然語言理解單元用以根據該語音訊息之內容執行對應之金融服務,該對話生成單元用以根據該語音訊息產生對應之對話內容。The conversational financial service system of claim 6, wherein the natural language dialogue module comprises a natural language understanding unit and a dialog generation unit, wherein the natural language understanding unit is configured to perform a corresponding financial according to the content of the voice message. The service, the dialog generating unit is configured to generate a corresponding conversation content according to the voice message. 如請求項9所述的對話式金融服務系統,其中,該自然語言對話模組還包括一語言轉文字單元及一文字轉語音單元,該語言轉文字單元用以比對該語音訊息中的文字是否符合至少一金融詞彙,該文字轉語音單元用以將文字轉換為對應之聲音訊號輸出。The conversational financial service system of claim 9, wherein the natural language dialogue module further comprises a language-to-text unit and a text-to-speech unit, wherein the language-to-text unit is used to compare whether the text in the voice message is The at least one financial vocabulary is matched, and the text-to-speech unit is used to convert the text into a corresponding audio signal output.
TW106139835A 2017-11-17 2017-11-17 Conversational financial service device and system TWI675303B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW106139835A TWI675303B (en) 2017-11-17 2017-11-17 Conversational financial service device and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW106139835A TWI675303B (en) 2017-11-17 2017-11-17 Conversational financial service device and system

Publications (2)

Publication Number Publication Date
TW201923621A true TW201923621A (en) 2019-06-16
TWI675303B TWI675303B (en) 2019-10-21

Family

ID=67702307

Family Applications (1)

Application Number Title Priority Date Filing Date
TW106139835A TWI675303B (en) 2017-11-17 2017-11-17 Conversational financial service device and system

Country Status (1)

Country Link
TW (1) TWI675303B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI752367B (en) * 2019-10-30 2022-01-11 中國信託商業銀行股份有限公司 Intelligent dialogue management method and system based on natural language processing

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI430189B (en) * 2009-11-10 2014-03-11 Inst Information Industry System, apparatus and method for message simulation
AU2014274913B2 (en) * 2013-06-07 2017-05-11 Apple Inc. Intelligent automated assistant

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI752367B (en) * 2019-10-30 2022-01-11 中國信託商業銀行股份有限公司 Intelligent dialogue management method and system based on natural language processing

Also Published As

Publication number Publication date
TWI675303B (en) 2019-10-21

Similar Documents

Publication Publication Date Title
US11750741B2 (en) Systems for transitioning telephony-based and in-person servicing interactions to and from an artificial intelligence (AI) chat session
US9742912B2 (en) Method and apparatus for predicting intent in IVR using natural language queries
JP2017058673A (en) Dialog processing apparatus and method, and intelligent dialog processing system
JP2016529567A (en) Method, apparatus and system for verifying payment
CN110738998A (en) Voice-based personal credit evaluation method, device, terminal and storage medium
US11909917B2 (en) Methods, apparatuses, and systems for dynamically navigating interactive communication systems
KR102594838B1 (en) Electronic device for performing task including call in response to user utterance and method for operation thereof
TWI675303B (en) Conversational financial service device and system
TWM556359U (en) Dialogue type financial service device and system
KR20210109914A (en) Apparatus and method for filling electronic document using dialogue comprehension based on format of electronic document
JP4000828B2 (en) Information system, electronic equipment, program
CN203480601U (en) Self-service equipment based on voice interaction
KR102583434B1 (en) Method and system for evaluating quality of voice counseling
TWM578858U (en) Cross-channel artificial intelligence dialogue platform
CN113053409B (en) Audio evaluation method and device
CN115379054A (en) Method and device for processing call-out operation
WO2021098637A1 (en) Voice transliteration method and apparatus, and related system and device
KR100383391B1 (en) Voice Recogizing System and the Method thereos
US11924378B2 (en) Systems for transitioning telephony-based and in-person servicing interactions to and from an artificial intelligence (AI) chat session
KR102610360B1 (en) Method for providing labeling for spoken voices, and apparatus implementing the same method
US20240163372A1 (en) Systems for transitioning telephony-based and in-person servicing interactions to and from an artificial intelligence (ai) chat session
WO2021139737A1 (en) Method and system for man-machine interaction
TWM613629U (en) service system
JP2021096293A (en) Guide system, control method of guide system, and program
TW202333099A (en) Online speech service system with semantic learning and method thereof