TW202328951A

TW202328951A - Low-noise voiceprint identification device for financial transaction system and method thereof

Info

Publication number: TW202328951A
Application number: TW112103776A
Authority: TW
Inventors: 翁子崴
Original assignee: 華南商業銀行股份有限公司
Priority date: 2021-09-08
Filing date: 2021-09-08
Publication date: 2023-07-16
Also published as: TWI817897B

Abstract

A low-noise voiceprint identification device for a financial transaction system and method thereof are provided, which improve the quality of voiceprint recognition and reduce noise by adding a recognition buffer mechanism and processing sound signals through filtering. It can be used as an auxiliary authentication function for financial transaction behavior on electronic devices. The voiceprint identification device includes an audio receiving unit, a voiceprint recognition unit, a storage unit, and an arithmetic unit. The arithmetic unit compares at least one voiceprint feature extracted from audio data with the first feature data and the second feature data of the storage unit, utilizes deep learning and neural network methods to train the first feature data and the second feature data, in order to security verification.

Description

Low-noise voiceprint recognition device and method for financial transaction system

本發明係有關一種聲紋辨識裝置與其方法，尤其是有關一種用於金融交易系統之低雜訊聲紋辨識裝置與其方法。The present invention relates to a voiceprint recognition device and its method, in particular to a low-noise voiceprint recognition device and its method for a financial transaction system.

聲紋具有唯一性、獨特性和不易竄改性，加之較不涉及使用者隱私問題，故聲紋識別可用於要求從音頻信號中提取個體差異，擷取出能夠反映使用者是誰的信息，從而進行使用者識別，其基本原理是每一個使用者建立一個能夠描述這一使用者個性特徵的模組，作為此使用者個性特徵的描述。Voiceprint is unique, unique and not easy to tamper with, and it does not involve user privacy issues, so voiceprint recognition can be used to extract individual differences from audio signals and extract information that can reflect who the user is, so as to carry out The basic principle of user identification is that each user creates a module that can describe the user's personality characteristics as a description of the user's personality characteristics.

聲紋辨識目前已經應用於身份辨識，作為判斷使用者身份之工具。然而，聲紋識別的缺點在於，聲音容易受到年齡、情緒、或身體狀況等因素影響，導致識別性能降低，故目前聲紋辨識技術的準確度仍有待改善。此外設備的收音品質與環境噪音等因素，也會影響聲紋辨識的識別效能。Voiceprint recognition has been applied to identity recognition as a tool for judging the user's identity. However, the disadvantage of voiceprint recognition is that the voice is easily affected by factors such as age, emotion, or physical condition, resulting in reduced recognition performance. Therefore, the accuracy of the current voiceprint recognition technology still needs to be improved. In addition, factors such as the sound quality of the device and environmental noise will also affect the recognition performance of voiceprint recognition.

有鑑於先前技術的上述缺點，本發明之一態樣提供了一低雜訊聲紋辨識裝置，該裝置包含：一音訊接收單元，係用以擷取一音頻資料；一聲紋辨識單元，係連接於該音訊接收單元，該聲紋辨識單元由該音訊接收單元接收該音頻資料，且該聲紋辨識單元辨識出該音頻資料的至少一聲紋特徵；一儲存單元，儲存一第一特徵資料；一運算單元，係連接於該聲紋辨識單元與該儲存單元，該運算單元自該聲紋辨識單元接收該至少一聲紋特徵以及自該儲存單元接收該第一特徵資料，且該運算單元包含：一驗證計算器，該驗證計算器之計數值於一初始狀態為0單位，其中該運算單元根據該第一特徵資料判斷該至少一聲紋特徵與該第一特徵資料的是否相符；其中，若該運算單元判斷該至少一聲紋特徵符合該第一特徵資料，則該運算單元將該至少一聲紋特徵儲存於該儲存單元的一暫存空間內，根據該暫存空間內的該至少一聲紋特徵訓練該第一特徵資料，以產生一調整後第一特徵資料，並根據該調整後第一特徵資料進行安全性驗證；若該運算單元判斷該至少一聲紋特徵不符該第一特徵資料，則該運算單元傳送一驗證請求，並於該驗證請求完成確認後，該運算單元將該至少一聲紋特徵儲存於該暫存空間內，但該儲存單元不根據該至少一聲紋特徵變更該第一特徵資料，以根據該第一特徵資料進行安全性驗證，其中該驗證請求包含使用者身份特徵之識別；若該運算單元判斷該至少一聲紋特徵不符該第一特徵資料，且該計算值達該預定數值，則該運算單元根據該暫存空間內的該至少一聲紋特徵與該第一特徵資料產生一第二特徵資料；以及於該第二特徵資料產生後，該驗證計算器調整該計算值為0單位，該運算單元將該第一特徵資料、該第二特徵資料儲存於儲存單元，以根據該第一特徵資料、該第二特徵資料判斷該至少一聲紋特徵與該第一特徵資料、該第二特徵資料是否相符，以進行安全性驗證，包含：若該運算單元判斷該至少一聲紋特徵符合該第二特徵資料，則該運算單元傳送該至少一聲紋特徵儲存於該暫存空間內，且根據該暫存空間內的該至少一聲紋特徵訓練該第二特徵資料，以產生一調整後第二特徵資料，並根據該調整後第二特徵資料進行安全性驗證；若該運算單元判斷該至少一聲紋特徵不符該第二特徵資料、但符合該第一特徵資料，則該運算單元自該儲存單元移除該第二特徵資料，以根據該第一特徵資料進行安全性驗證；以及於該第二特徵資料產生後，若該運算單元判斷該至少一聲紋特徵不符該第二特徵資料且不符該第一特徵資料，則該驗證計算器對該計數值增加1單位，且當該計算值小於一預定數值時，該運算單元傳送一驗證請求，並於該驗證請求完成確認後，該運算單元將該至少一聲紋特徵儲存於該暫存空間內，但該儲存單元不根據該至少一聲紋特徵變更該第一特徵資料、該第二特徵資料，以根據該第一特徵資料、該第二特徵資料進行安全性驗證，其中辨識該音頻資料內的該至少一聲紋特徵進一步包含以下操作中至少一者：濾波、降低雜訊、抑制背景噪音、放大特定聲紋特徵、計算該音頻資料之向量參數、偵測特定關鍵詞、偵測特定聲波波段、偵測特定聲波波形以及偵測特定聲波頻率。In view of the above-mentioned shortcomings of the prior art, an aspect of the present invention provides a low-noise voiceprint recognition device, which includes: an audio receiving unit for retrieving an audio data; a voiceprint recognition unit for connected to the audio receiving unit, the voiceprint recognition unit receives the audio data from the audio receiving unit, and the voiceprint recognition unit recognizes at least the voiceprint characteristics of the audio data; a storage unit stores a first characteristic data ; a computing unit connected to the voiceprint recognition unit and the storage unit, the computing unit receives the at least voiceprint features from the voiceprint recognition unit and the first characteristic data from the storage unit, and the computing unit Including: a verification calculator, the count value of the verification calculator is 0 units in an initial state, wherein the operation unit judges whether the at least voiceprint feature matches the first feature data according to the first feature data; wherein , if the computing unit judges that the at least voiceprint features conform to the first feature data, then the computing unit stores the at least voiceprint features in a temporary storage space of the storage unit, according to the At least the voiceprint feature trains the first feature data to generate an adjusted first feature data, and performs security verification according to the adjusted first feature data; if the computing unit judges that the at least voiceprint feature does not match the first feature data feature data, the computing unit sends a verification request, and after the verification request is confirmed, the computing unit stores the at least one voiceprint feature in the temporary storage space, but the storage unit does not The fingerprint feature changes the first feature data to perform security verification based on the first feature data, wherein the verification request includes the identification of the user's identity feature; if the computing unit judges that the at least one voiceprint feature does not match the first feature data , and the calculated value reaches the predetermined value, then the computing unit generates a second feature data according to the at least voiceprint feature and the first feature data in the temporary storage space; and after the second feature data is generated, The verification calculator adjusts the calculation value to 0 units, and the computing unit stores the first characteristic data and the second characteristic data in the storage unit, so as to judge the at least one sound according to the first characteristic data and the second characteristic data. Whether the fingerprint features are consistent with the first feature data and the second feature data is used for security verification, including: if the computing unit judges that the at least one voiceprint feature matches the second feature data, then the computing unit sends the at least The voiceprint feature is stored in the temporary storage space, and the second feature data is trained according to the at least voiceprint feature in the temporary storage space to generate an adjusted second feature data, and according to the adjusted second feature data for safety verification; if the computing unit judges that the at least one voiceprint feature does not match the second feature data but matches the first feature data, the computing unit removes the second feature data from the storage unit, to Perform security verification according to the first feature data; and after the second feature data is generated, if the computing unit judges that the at least voiceprint feature does not match the second feature data and does not match the first feature data, the verification calculation The counter value is increased by 1 unit, and when the calculated value is less than a predetermined value, the computing unit sends a verification request, and after the verification request is confirmed, the computing unit stores the at least voiceprint feature in the In the temporary storage space, but the storage unit does not change the first characteristic data and the second characteristic data according to the at least voiceprint characteristics, so as to perform security verification based on the first characteristic data and the second characteristic data, wherein the identification The at least one voiceprint feature in the audio data further includes at least one of the following operations: filtering, reducing noise, suppressing background noise, amplifying specific voiceprint features, calculating vector parameters of the audio data, detecting specific keywords, Detect specific sound wave bands, detect specific sound wave waveforms, and detect specific sound wave frequencies.

本發明之另一態樣提供了一低雜訊聲紋辨識方法，應用於一聲紋辨識裝置，該聲紋辨識裝置包含一運算單元以及一驗證計算器，該驗證計算器之一計數值於一初始狀態為0單位，該聲紋辨識方法包含控制該運算單元進行以下步驟：擷取一音頻資料；辨識該音頻資料內的至少一聲紋特徵；提供一第一特徵資料、一暫存空間以及一驗證請求；根據該第一特徵資料，比對該至少一聲紋特徵與該第一特徵資料是否相符；若該至少一聲紋特徵符合該第一特徵資料，則儲存該至少一聲紋特徵於該暫存空間內，並根據該暫存空間內的該至少一聲紋特徵訓練該第一特徵資料，以產生一調整後第一特徵資料，以根據該調整後第一特徵資料進行安全性驗證；若該至少一聲紋特徵不符該第一特徵資料，則傳送該驗證請求，當該驗證請求完成確認後，儲存該至少一聲紋特徵於該暫存空間內，但不根據該至少一聲紋特徵變更該第一特徵資料，以根據該第一特徵資料進行安全性驗證，其中該驗證請求包含使用者身份特徵之識別；若該計算值達該預定數值，則根據該暫存空間內的該至少一聲紋特徵與該第一特徵資料產生一第二特徵資料；以及於該第二特徵資料產生後，調整該驗證計算器之該計算值為0單位，且根據該第一特徵資料、該第二特徵資料來比對該至少一聲紋特徵與該第一特徵資料、該第二特徵資料是否相符，以進行安全性驗證，包含：若該運算單元判斷該至少一聲紋特徵符合該第二特徵資料，則該運算單元傳送該至少一聲紋特徵儲存於該暫存空間內，且根據該暫存空間內的該至少一聲紋特徵訓練該第二特徵資料，以產生一調整後第二特徵資料，並根據該調整後第二特徵資料進行安全性驗證；若該運算單元判斷該至少一聲紋特徵不符該第二特徵資料、但符合該第一特徵資料，則該運算單元自該儲存單元移除該第二特徵資料，以根據該第一特徵資料進行安全性驗證；以及於該第二特徵資料產生後，若該運算單元判斷該至少一聲紋特徵不符該第二特徵資料且不符該第一特徵資料，則該驗證計算器對該計數值增加1單位，且當該計算值小於一預定數值時，該運算單元傳送一驗證請求，並於該驗證請求完成確認後，該運算單元將該至少一聲紋特徵儲存於該暫存空間內，但該儲存單元不根據該至少一聲紋特徵變更該第一特徵資料、該第二特徵資料，以根據該第一特徵資料、該第二特徵資料進行安全性驗證，其中辨識該音頻資料內的該至少一聲紋特徵進一步包含以下操作中至少一者：濾波、降低雜訊、抑制背景噪音、放大特定聲紋特徵、計算該音頻資料之向量參數、偵測特定關鍵詞、偵測特定聲波波段、偵測特定聲波波形以及偵測特定聲波頻率。Another aspect of the present invention provides a low-noise voiceprint recognition method, which is applied to a voiceprint recognition device. The voiceprint recognition device includes an arithmetic unit and a verification calculator. One count value of the verification calculator is An initial state is 0 units, and the voiceprint recognition method includes controlling the computing unit to perform the following steps: capture an audio data; identify at least voiceprint features in the audio data; provide a first feature data, a temporary storage space and a verification request; according to the first feature data, compare whether the at least voiceprint features match the first feature data; if the at least voiceprint features match the first feature data, store the at least voiceprint features feature in the temporary storage space, and train the first feature data according to the at least one voiceprint feature in the temporary storage space to generate adjusted first feature data for performing security based on the adjusted first feature data verification; if the at least one voiceprint feature does not match the first feature data, then send the verification request, and when the verification request is confirmed, store the at least one voiceprint feature in the temporary storage space, but not according to the at least The voiceprint feature changes the first feature data to perform security verification based on the first feature data, wherein the verification request includes the identification of the user's identity feature; if the calculated value reaches the predetermined value, then according to the temporary storage space The at least one voiceprint feature and the first feature data in generate a second feature data; and after the second feature data is generated, adjust the calculation value of the verification calculator to 0 units, and according to the first feature data, the second feature data to compare whether the at least voiceprint features are consistent with the first feature data and the second feature data, so as to perform security verification, including: if the computing unit judges that the at least voiceprint features If the second feature data is met, the computing unit transmits the at least voiceprint feature to be stored in the temporary storage space, and trains the second feature data according to the at least voiceprint feature in the temporary storage space to generate a Adjusted second feature data, and perform security verification based on the adjusted second feature data; The unit removes the second feature data from the storage unit to perform security verification based on the first feature data; and after the second feature data is generated, if the computing unit determines that the at least voiceprint features do not match the second If the characteristic data does not match the first characteristic data, the verification calculator will increase the count value by 1 unit, and when the calculated value is less than a predetermined value, the operation unit will send a verification request, and after the verification request is confirmed , the computing unit stores the at least voiceprint feature in the temporary storage space, but the storage unit does not change the first feature data and the second feature data according to the at least voiceprint feature, so as to change the first feature data according to the first feature data and the second feature data for security verification, wherein identifying the at least voiceprint features in the audio data further includes at least one of the following operations: filtering, reducing noise, suppressing background noise, amplifying specific voiceprint features, Calculate vector parameters of the audio data, detect specific keywords, detect specific sound wave bands, detect specific sound wave waveforms, and detect specific sound wave frequencies.

綜上所述，使用者於登入系統環境欲進行金融交易等動作時，藉由讀取裝置使用時所記錄的聲紋特徵，搭配深度學習、加入神經網路等訓練聲紋特徵模型，可改善認證的品質，讓模型可以更準確地辨認使用者本人，亦進一步地將此項認證應用於各種金融交易系統當中。To sum up, when the user logs into the system environment and intends to perform financial transactions and other actions, by reading the voiceprint features recorded during the use of the device, and training the voiceprint feature model with deep learning and adding neural networks, it can improve The quality of authentication allows the model to more accurately identify the user himself, and further applies this authentication to various financial transaction systems.

以下說明係為完成發明的較佳實現方式，其目的在於描述本發明的基本精神，但並不用以限定本發明。實際的發明內容必須參考之後的權利要求範圍。The following description is a preferred implementation of the invention, and its purpose is to describe the basic spirit of the invention, but not to limit the invention. For the actual content of the invention, reference must be made to the scope of the claims that follow.

必須了解的是，使用於本說明書中的“包含”、“包括”等詞，係用以表示存在特定的技術特徵、數值、方法步驟、作業處理、元件以及/或組件，但並不排除可加上更多的技術特徵、數值、方法步驟、作業處理、元件、組件，或以上的任意組合。It must be understood that words such as "comprising" and "including" used in this specification are used to indicate the existence of specific technical features, values, method steps, operations, components and/or components, but do not exclude possible Add more technical characteristics, values, method steps, operation processes, components, components, or any combination of the above.

於權利要求中使用如“第一”、“第二”等詞係用來修飾權利要求中的元件，並非用來表示之間具有優先權順序，先行關係，或者是一個元件先於另一個元件，或者是執行方法步驟時的時間先後順序，僅用來區別具有相同名字的元件。Words such as "first" and "second" used in the claims are used to modify the elements in the claims, and are not used to indicate that there is an order of priority, an antecedent relationship, or that one element is prior to another element , or the chronological order in which the steps of the method are performed, are only used to distinguish elements with the same name.

請一併參閱圖1與圖2，圖1係依據本發明一實施例繪示一種聲紋辨識裝置之方塊圖，圖2係依據本發明另一實施例繪示一種聲紋辨識裝置之方塊圖。Please refer to FIG. 1 and FIG. 2 together. FIG. 1 is a block diagram of a voiceprint recognition device according to an embodiment of the present invention, and FIG. 2 is a block diagram of a voiceprint recognition device according to another embodiment of the present invention. .

於一實施例中，如圖1所示，聲紋辨識裝置100包含一音訊接收單元10、一聲紋辨識單元20、一運算單元30以及一儲存單元40。In one embodiment, as shown in FIG. 1 , the voiceprint recognition device 100 includes an audio receiving unit 10 , a voiceprint recognition unit 20 , a computing unit 30 and a storage unit 40 .

音訊接收單元10係擷取一音頻資料。聲紋辨識單元20係連接音訊接收單元10以接收上述音頻資料，且可辨識出上述音頻資料內的至少一聲紋特徵。在一些實施例中，上述至少一聲紋特徵可以是特定關鍵詞、聲波波段、聲波波形、聲波頻率等。於一實施例中，聲紋辨識單元20用於辨識音頻資料內的上述至少一聲紋特徵，並且聲紋辨識單元20可包含以下操作中至少一者，例如為濾波、降低雜訊、抑制背景噪音、放大特定聲紋特徵、計算音頻資料之一向量參數、偵測特定關鍵詞、偵測特定聲波波段、偵測特定聲波波形、偵測特定聲波頻率等。The audio receiving unit 10 retrieves an audio data. The voiceprint recognition unit 20 is connected to the audio receiving unit 10 to receive the above-mentioned audio data, and can recognize at least voiceprint features in the above-mentioned audio data. In some embodiments, the aforementioned at least voiceprint feature may be a specific keyword, a sound wave band, a sound wave waveform, a sound wave frequency, and the like. In one embodiment, the voiceprint identification unit 20 is used to identify at least the aforementioned voiceprint features in the audio data, and the voiceprint identification unit 20 may include at least one of the following operations, such as filtering, noise reduction, and background suppression Noise, amplification of specific voiceprint features, calculation of vector parameters of audio data, detection of specific keywords, detection of specific sound wave bands, detection of specific sound wave waveforms, detection of specific sound wave frequencies, etc.

儲存單元40可儲存第一特徵資料44。在一些實施例中，第一特徵資料44可以是符合使用者之任何一相關聲紋資訊，例如為特定關鍵詞、聲波波段、聲波波形、聲波頻率等。在一些實施例中，第一特徵資料44可以是類比訊號、數位訊號、類比/數位混合訊號等資料儲存之模式。The storage unit 40 can store the first feature data 44 . In some embodiments, the first characteristic data 44 may be any relevant voiceprint information corresponding to the user, such as a specific keyword, sound wave band, sound wave waveform, sound wave frequency, and the like. In some embodiments, the first feature data 44 may be a data storage mode such as an analog signal, a digital signal, or a mixed analog/digital signal.

運算單元30係連接聲紋辨識單元20與儲存單元40，可接收自聲紋辨識單元20的上述至少一聲紋特徵，以及接收自儲存單元40的第一特徵資料44，於運算單元30內判斷上述至少一聲紋特徵與第一特徵資料44是否相符。The computing unit 30 is connected to the voiceprint recognition unit 20 and the storage unit 40, and can receive at least the above-mentioned voiceprint characteristics from the voiceprint recognition unit 20, and the first characteristic data 44 received from the storage unit 40, and judge in the computing unit 30 Whether the above-mentioned at least voiceprint feature is consistent with the first feature data 44 .

若運算單元30判斷上述至少一聲紋特徵符合第一特徵資料44，則運算單元30將上述至少一聲紋特徵儲存於儲存單元40的一暫存空間42內，且根據暫存空間42內的上述至少一聲紋特徵來訓練第一特徵資料44，以產生一調整後第一特徵資料44，並根據調整後第一特徵資料44進行安全性驗證。舉例來說，上述安全性驗證可以在調整後第一特徵資料44建立完當下立即執行，以更進一步達成一快速安全性驗證之效果。If the calculation unit 30 judges that the above-mentioned at least voiceprint features conform to the first feature data 44, the calculation unit 30 stores the above-mentioned at least voiceprint features in a temporary storage space 42 of the storage unit 40, and according to the data in the temporary storage space 42 The at least one voiceprint feature is used to train the first feature data 44 to generate an adjusted first feature data 44 , and security verification is performed according to the adjusted first feature data 44 . For example, the above security verification can be performed immediately after the adjusted first characteristic data 44 is created, so as to further achieve a fast security verification effect.

在另一範例中，上述安全性驗證可以待後續收到新的一聲紋特徵，再將此新的一聲紋特徵與調整後第一特徵資料44比對，如此同樣可達到安全性驗證之目的，且相較於前一個範例，可降低誤判斷為「不通過」之機率，提昇整體流程的順暢度，進而提昇使用者體驗。In another example, the above-mentioned security verification can wait for a new voiceprint feature to be received later, and then compare the new voiceprint feature with the adjusted first feature data 44, so that the security verification can also be achieved. Compared with the previous example, it can reduce the probability of misjudgment as "failure", improve the smoothness of the overall process, and improve user experience.

在一些實施例中，上述判斷的至少一聲紋特徵可對應於使用者之一正常聲紋，或受到年齡、情緒、或身體狀況等因素影響後之異常聲紋，第一特徵資料44則可視為使用者原始預設之聲紋特徵，若至少一聲紋特徵能與第一特徵資料44相符，則使用調整後第一特徵資料44作為聲紋識別標準來進行安全性驗證，且第一特徵資料44可被持續訓練，以持續更新第一特徵資料44可接受之聲紋範疇。In some embodiments, at least the voiceprint characteristics determined above may correspond to a normal voiceprint of the user, or an abnormal voiceprint affected by factors such as age, emotion, or physical condition, and the first characteristic data 44 may be visualized. It is the voiceprint feature originally preset by the user. If at least the voiceprint feature is consistent with the first feature data 44, the adjusted first feature data 44 is used as the voiceprint recognition standard for security verification, and the first feature The data 44 can be continuously trained, so as to continuously update the range of acceptable voiceprints of the first characteristic data 44 .

於一實施例中，如圖2所示，聲紋辨識裝置200之運算單元30可進一步包含一驗證計算器32，驗證計算器32之計數值於一初始狀態為0單位，若運算單元30判斷上述至少一聲紋特徵不符第一特徵資料44，則驗證計算器32之計數值增加1單位。In one embodiment, as shown in FIG. 2, the computing unit 30 of the voiceprint recognition device 200 may further include a verification calculator 32. The count value of the verification calculator 32 is 0 units in an initial state. If the computing unit 30 judges If at least the above-mentioned features of the voiceprint do not match the first feature data 44, the count value of the verification calculator 32 is increased by 1 unit.

若上述計算值小於一預定數值，若運算單元30判斷上述至少一聲紋特徵不符第一特徵資料44，則運算單元30將傳送一驗證請求給使用者，且等待上述驗證請求被使用者完成確認後，運算單元30將上述至少一聲紋特徵儲存於暫存空間42內，但儲存單元40不根據上述至少一聲紋特徵來變更第一特徵資料44，而是仍以第一特徵資料44進行安全性驗證。在一些實施例中，上述至少一聲紋特徵可對應於使用者受到年齡、情緒、或身體狀況等因素影響後之異常聲紋，第一特徵資料44則可視為使用者原始預設之聲紋特徵。若異常聲紋的上述至少一聲紋特徵與第一特徵資料44不符，則本發明仍保留第一特徵資料44，並且僅將異常聲紋另外儲存於暫存空間42內，以此可作為一辨識緩衝機制，上述辨識緩衝機制主要避免使用者之上述至少一聲紋特徵僅為聲紋暫時性異常、或聲紋辨識裝置100判斷失誤之可能。If the above-mentioned calculated value is less than a predetermined value, if the calculation unit 30 judges that the above-mentioned at least one voiceprint feature does not match the first feature data 44, the calculation unit 30 will send a verification request to the user, and wait for the above-mentioned verification request to be confirmed by the user Afterwards, the computing unit 30 stores the above-mentioned at least voiceprint features in the temporary storage space 42, but the storage unit 40 does not change the first feature data 44 according to the above-mentioned at least voiceprint features, but still uses the first feature data 44 for processing. Security verification. In some embodiments, at least the above-mentioned voiceprint characteristics may correspond to the abnormal voiceprint of the user affected by factors such as age, emotion, or physical condition, and the first feature data 44 may be regarded as the user's original preset voiceprint feature. If the above-mentioned at least voiceprint characteristics of the abnormal voiceprint do not match the first characteristic data 44, the present invention still retains the first characteristic data 44, and only stores the abnormal voiceprint in the temporary storage space 42, which can be used as a Recognition buffer mechanism. The above recognition buffer mechanism mainly avoids the possibility that at least the voiceprint feature of the user is only temporarily abnormal, or that the voiceprint recognition device 100 makes a wrong judgment.

若上述計算值達該預定數值，則運算單元30根據暫存空間42內的上述至少一聲紋特徵與第一特徵資料44產生一第二特徵資料46。在一些實施例中，第二特徵資料46可以是類比訊號、數位訊號、類比/數位混合訊號等資料儲存之模式。If the calculated value reaches the predetermined value, the computing unit 30 generates a second feature data 46 according to the at least voiceprint features in the temporary storage space 42 and the first feature data 44 . In some embodiments, the second feature data 46 may be a data storage mode such as an analog signal, a digital signal, or a mixed analog/digital signal.

於第二特徵資料46產生後，驗證計算器32調整計算值為0單位，運算單元30將第一特徵資料44、第二特徵資料46儲存於儲存單元40，並修改為根據第一特徵資料44、第二特徵資料46，使運算單元30判斷後續接收之至少一聲紋特徵與第一特徵資料44、第二特徵資料46是否相符，以進行安全性驗證。在一些實施例中，上述至少一聲紋特徵可對應於使用者受到年齡、情緒、或身體狀況等因素影響後之異常聲紋，第一特徵資料44則可視為使用者原始預設之聲紋特徵。若異常聲紋的上述至少一聲紋特徵已達一定預定次數（預定時間）而不能符合第一特徵資料44，則本發明仍保留第一特徵資料44，並且另根據暫存空間42內儲存的上述至少一聲紋特徵與第一特徵資料44產生一第二特徵資料46，以此可作為另一辨識緩衝機制，上述辨識緩衝機制主要增加使用者受到年齡、情緒、或身體狀況等因素一定時間影響後的聲紋特徵辨識資料。After the second characteristic data 46 is generated, the verification calculator 32 adjusts the calculation value to 0 units, and the computing unit 30 stores the first characteristic data 44 and the second characteristic data 46 in the storage unit 40, and modifies them according to the first characteristic data 44 . The second characteristic data 46, so that the computing unit 30 judges whether at least the subsequently received voiceprint characteristics are consistent with the first characteristic data 44 and the second characteristic data 46, so as to perform security verification. In some embodiments, at least the above-mentioned voiceprint characteristics may correspond to the abnormal voiceprint of the user affected by factors such as age, emotion, or physical condition, and the first feature data 44 may be regarded as the user's original preset voiceprint feature. If the above-mentioned at least voiceprint characteristics of the abnormal voiceprint have reached a certain predetermined number of times (predetermined time) and cannot meet the first characteristic data 44, then the present invention still retains the first characteristic data 44, and additionally according to the data stored in the temporary storage space 42 The above-mentioned at least voiceprint feature and the first feature data 44 generate a second feature data 46, which can be used as another identification buffer mechanism. The above-mentioned identification buffer mechanism mainly increases the user's age, emotion, or physical condition and other factors for a certain period of time. The affected voiceprint feature recognition data.

於一實施例中，運算單元30於判斷至少一聲紋特徵與第一特徵資料44、第二特徵資料46是否相符時，若上述至少一聲紋特徵符合第二特徵資料46，則運算單元30傳送上述至少一聲紋特徵儲存於暫存空間42內，且根據暫存空間42內的上述至少一聲紋特徵訓練第二特徵資料46，以產生一調整後第二特徵資料46，並根據該調整後第二特徵資料46進行安全性驗證。舉例來說，上述安全性驗證可以在調整後第二特徵資料46建立完當下立即執行，以更進一步達成一快速安全性驗證之效果。In one embodiment, when the computing unit 30 judges whether at least the voiceprint feature matches the first feature data 44 and the second feature data 46, if the above-mentioned at least voiceprint feature matches the second feature data 46, the computing unit 30 Sending the at least voiceprint features to be stored in the temporary storage space 42, and training the second feature data 46 according to the at least voiceprint features in the temporary storage space 42 to generate an adjusted second feature data 46, and according to the After adjustment, the second feature data 46 performs safety verification. For example, the above security verification can be performed immediately after the adjusted second feature data 46 is created, so as to further achieve a fast security verification effect.

在另一範例中，上述安全性驗證可以待後續收到新的一聲紋特徵，再將此新的一聲紋特徵與調整後第二特徵資料46比對，如此同樣可達到安全性驗證之目的，且相較於前一個範例，可降低誤判斷為「不通過」之機率，提昇整體流程的順暢度，進而提昇使用者體驗。In another example, the above-mentioned security verification can wait for a new voiceprint feature to be received later, and then compare the new voiceprint feature with the adjusted second feature data 46, so that the security verification can also be achieved. Compared with the previous example, it can reduce the probability of misjudgment as "failure", improve the smoothness of the overall process, and improve user experience.

在一些實施例中，上述至少一聲紋特徵可對應於使用者受到年齡、情緒、或身體狀況等因素影響後之異常聲紋，第二特徵資料46則可視為使用者受一定時間異常影響後的聲紋特徵辨識資料。若異常聲紋的上述至少一聲紋特徵已達一定預定次數（預定時間）而卻與第二特徵資料46相符，則本發明仍保留第一特徵資料44，並且僅判斷為使用者已達一聲紋異常變化後的穩定狀況，而使用調整後第二特徵資料46作為此穩定狀況的聲紋識別標準來進行安全性驗證，且第二特徵資料46可被持續訓練，以持續更新第二特徵資料46可接受之聲紋範疇。In some embodiments, at least the above-mentioned voiceprint characteristics may correspond to the abnormal voiceprint of the user affected by factors such as age, emotion, or physical condition. Voiceprint feature recognition data. If the above-mentioned at least voiceprint characteristics of the abnormal voiceprint have reached a certain predetermined number of times (predetermined time) but match the second characteristic data 46, then the present invention still retains the first characteristic data 44, and only judges that the user has reached a predetermined number of times (predetermined time). The stable situation after the abnormal change of the voiceprint, and the adjusted second characteristic data 46 is used as the voiceprint recognition standard of this stable situation for security verification, and the second characteristic data 46 can be continuously trained to continuously update the second characteristics Information 46 Acceptable Voiceprint Range.

若運算單元30判斷上述至少一聲紋特徵不符第二特徵資料46、但符合第一特徵資料44，則運算單元30自儲存單元40中移除第二特徵資料46。第二特徵資料46移除後，重新根據第一特徵資料44進行安全性驗證。在一些實施例中，上述至少一聲紋特徵可對應於使用者受到年齡、情緒、或身體狀況等因素影響後之異常聲紋特徵，第一特徵資料44可視為使用者原始預設之聲紋特徵，第二特徵資料46則可視為使用者受一定時間異常影響後的聲紋特徵辨識資料。若使用者由造成異常聲紋特徵之狀況恢復時，則可判斷為第二特徵資料46已不適用為一聲紋識別標準，故予以移除第二特徵資料46、恢復為根據第一特徵資料44作為聲紋識別之標準來進行安全性驗證。If the computing unit 30 judges that at least the voiceprint feature does not match the second feature data 46 but matches the first feature data 44 , the computing unit 30 removes the second feature data 46 from the storage unit 40 . After the second characteristic data 46 is removed, security verification is performed again according to the first characteristic data 44 . In some embodiments, the aforementioned at least voiceprint characteristics may correspond to the abnormal voiceprint characteristics of the user affected by factors such as age, emotion, or physical condition, and the first characteristic data 44 may be regarded as the user's original preset voiceprint feature, the second feature data 46 can be regarded as the voiceprint feature identification data after the user has been affected by the abnormality for a certain period of time. If the user recovers from the situation that caused the abnormal voiceprint characteristics, it can be judged that the second characteristic data 46 is no longer applicable as a voiceprint recognition standard, so the second characteristic data 46 is removed and restored to the first characteristic data. 44 as a standard for voiceprint recognition for security verification.

若運算單元30判斷上述至少一聲紋特徵不符第二特徵資料46且不符第一特徵資料44，則驗證計算器32對上述計數值增加1單位。若上述計算值小於一預定數值，則運算單元30傳送一驗證請求給使用者，且等待上述驗證請求被使用者完成確認後，運算單元30將上述至少一聲紋特徵儲存於暫存空間42內，但儲存單元40不根據上述至少一聲紋特徵變更第一特徵資料44、第二特徵資料46，以根據該第一特徵資料、該第二特徵進行安全性驗證。If the computing unit 30 judges that the above-mentioned at least one voiceprint feature does not match the second feature data 46 and does not match the first feature data 44 , then the verification calculator 32 adds 1 unit to the count value. If the calculated value is less than a predetermined value, the computing unit 30 sends a verification request to the user, and waits for the verification request to be confirmed by the user, and the computing unit 30 stores the at least voiceprint feature in the temporary storage space 42 , but the storage unit 40 does not change the first characteristic data 44 and the second characteristic data 46 according to the above-mentioned at least voiceprint characteristics, so as to perform security verification according to the first characteristic data and the second characteristic.

於一實施例中，若上述計算值達該預定數值，則運算單元30根據暫存空間42內的上述至少一聲紋特徵訓練第二特徵資料46，以產生一調整後第二特徵資料46。產生調整後第二特徵資料46後，驗證計算器32調整該計算值為0單位，且運算單元30判斷至少一聲紋特徵與第一特徵資料44、調整後第二特徵資料46是否相符，以進行安全性驗證。舉例來說，上述安全性驗證可以在調整後第二特徵資料46建立完當下立即執行，以更進一步達成一快速安全性驗證之效果。In one embodiment, if the calculated value reaches the predetermined value, the computing unit 30 trains the second feature data 46 according to the at least voiceprint features in the temporary storage space 42 to generate an adjusted second feature data 46 . After the adjusted second feature data 46 is generated, the verification calculator 32 adjusts the calculated value to 0 units, and the computing unit 30 judges whether at least the voiceprint feature is consistent with the first feature data 44 and the adjusted second feature data 46, so as to Perform security verification. For example, the above security verification can be performed immediately after the adjusted second feature data 46 is created, so as to further achieve a fast security verification effect.

在另一範例中，上述安全性驗證可以待後續收到新的一聲紋特徵，再將此新的一聲紋特徵與第一特徵資料44、調整後第二特徵資料46比對，如此同樣可達到安全性驗證之目的，且相較於前一個範例，可降低誤判斷為「不通過」之機率，提昇整體流程的順暢度，進而提昇使用者體驗。In another example, the above-mentioned security verification can wait for a new voiceprint feature to be received later, and then compare the new voiceprint feature with the first feature data 44 and the adjusted second feature data 46, and so on It can achieve the purpose of security verification, and compared with the previous example, it can reduce the probability of misjudgment as "failure", improve the smoothness of the overall process, and improve user experience.

在一些實施例中，上述至少一聲紋特徵可對應於使用者受到年齡、情緒、或身體狀況等因素影響後之異常聲紋特徵，且造成異常聲紋特徵之狀況恐已難以恢復。若異常聲紋的上述至少一聲紋特徵已達一定預定次數（預定時間）而不能符合第一特徵資料44、第二特徵資料46，則可判斷為第二特徵資料46已不適用為一聲紋異常變化達穩定狀況的聲紋識別標準，故利用暫存空間42內的聲紋特徵重新訓練第二特徵資料46，產生一調整後第二特徵資料46，使其重新作為聲紋識別之標準來進行安全性驗證。In some embodiments, at least the above-mentioned voiceprint characteristics may correspond to abnormal voiceprint characteristics of the user affected by factors such as age, emotion, or physical condition, and the conditions causing abnormal voiceprint characteristics may be difficult to recover. If the above-mentioned at least voiceprint characteristics of the abnormal voiceprint have reached a certain predetermined number of times (predetermined time) and cannot meet the first characteristic data 44 and the second characteristic data 46, then it can be judged that the second characteristic data 46 is no longer suitable for a voice. The abnormal change of the pattern reaches the standard of voiceprint recognition in a stable state, so the second feature data 46 is retrained by using the voiceprint features in the temporary storage space 42, and an adjusted second feature data 46 is generated, so that it can be used as the standard of voiceprint recognition again for security verification.

於一實施例中，聲紋辨識裝置100、200可設置於透過網路執行金融交易的個人電腦、膝上型電腦、蜂窩電話、相機電話、智慧型電話、個人數位助理、媒體播放機、導航設備、電子郵件收發設備、遊戲控制台、平板電腦、可穿戴設備或者這些設備中的任意幾種設備的組合。In one embodiment, the voiceprint recognition devices 100 and 200 can be installed in personal computers, laptop computers, cellular phones, camera phones, smart phones, personal digital assistants, media players, navigation device, email device, game console, tablet, wearable, or any combination of these.

於一實施例中，運算單元30可以是一中央處理器（central processing unit，CPU），亦可配置為其他運算能力足夠的元件。In an embodiment, the computing unit 30 may be a central processing unit (central processing unit, CPU), and may also be configured as other components with sufficient computing capability.

於一實施例中，儲存單元40可以是一伺服器（server），用於儲存一或多個公司、銀行或機構的聲紋辨識資料庫。於另一實施例中，儲存單元40可以是一任何型態的固定式或可移動式的隨機存取記憶體（random access memory，RAM）、唯讀記憶體（read-only memory，ROM）、快閃記憶體（flash memory）、硬碟（hard disk drive，HDD）、固態硬碟（solid state drive，SSD）或類似元件或上述元件的組合。In one embodiment, the storage unit 40 may be a server (server) for storing voiceprint recognition databases of one or more companies, banks or institutions. In another embodiment, the storage unit 40 may be any type of fixed or removable random access memory (random access memory, RAM), read-only memory (read-only memory, ROM), Flash memory (flash memory), hard disk drive (hard disk drive, HDD), solid state drive (solid state drive, SSD) or similar components or a combination of the above components.

於一實施例中，上述驗證請求所包含的使用者身份特徵之識別方法可以是簡訊、傳真、電子郵件、應用程式、數位簽章、或基於其他生物學特徵（例如為臉部影像、指紋、虹膜等辨識）等。In one embodiment, the identification method of the user's identity included in the verification request can be text message, fax, email, application program, digital signature, or based on other biological characteristics (such as facial image, fingerprint, iris, etc. identification), etc.

於一實施例中，儲存單元40可包含一深度學習模型，用以訓練第一特徵資料44、第二特徵資料46，上述深度學習模型可包括各種類型的神經網路模型中的至少一者，所述各種類型的神經網路模型包括卷積神經網路（convolution neural network，CNN）、區域卷積神經網路（region with convolution neural network，R-CNN）、區域建議網絡（region proposal network，RPN）、遞迴神經網路（recurrent neural network，RNN）、基於堆疊的深度神經網路（stacking-based deep neural network，S-DNN）、狀態空間動態神經網路（state-space dynamic neural network，S-SDNN）、反卷積網路（deconvolution network）、深度置信網路（deep belief network，DBN）、受限玻爾茲曼機（restricted Boltzmann machine，RBM）、全卷積網路（fully convolutional network）、長短期記憶體（long short-term memory，LSTM）網路以及分類網路（classification network）。In one embodiment, the storage unit 40 may include a deep learning model for training the first feature data 44 and the second feature data 46. The above deep learning model may include at least one of various types of neural network models, The various types of neural network models include convolutional neural network (convolution neural network, CNN), region convolutional neural network (region with convolution neural network, R-CNN), region proposal network (region proposal network, RPN) ), recurrent neural network (RNN), stacking-based deep neural network (S-DNN), state-space dynamic neural network (state-space dynamic neural network, S-DNN) -SDNN), deconvolution network (deconvolution network), deep belief network (deep belief network, DBN), restricted Boltzmann machine (restricted Boltzmann machine, RBM), fully convolutional network (fully convolutional network) ), long short-term memory (LSTM) networks, and classification networks.

請一併參閱圖3與圖4，圖3係根據本發明之一實施例繪示一種聲紋辨識方法之流程圖，圖4係根據本發明之另一實施例繪示一種聲紋辨識方法之流程圖。Please refer to FIG. 3 and FIG. 4 together. FIG. 3 is a flow chart of a voiceprint recognition method according to an embodiment of the present invention, and FIG. 4 is a flowchart of a voiceprint recognition method according to another embodiment of the present invention. flow chart.

於一實施例中，如圖3所示，於步驟S1中，藉由音訊接收單元10擷取一音頻資料。In one embodiment, as shown in FIG. 3 , in step S1 , an audio data is captured by the audio receiving unit 10 .

於步驟S2中，聲紋辨識單元20接收音訊接收單元10所傳送的上述音頻資料，且辨識出上述音頻資料內的至少一聲紋特徵。In step S2, the voiceprint recognition unit 20 receives the audio data transmitted by the audio receiving unit 10, and recognizes at least voiceprint features in the audio data.

於步驟S3中，運算單元30接收聲紋辨識單元20所傳送的上述至少一聲紋特徵、接收儲存單元40所傳送的第一特徵資料44，用以比對上述聲紋特徵與第一特徵資料44是否相符。In step S3, the computing unit 30 receives the above-mentioned at least voiceprint features sent by the voiceprint recognition unit 20, and receives the first feature data 44 sent by the storage unit 40 to compare the above-mentioned voiceprint features with the first feature data 44 is consistent.

於步驟S4中，若上述至少一聲紋特徵符合第一特徵資料44，則運算單元30將上述至少一聲紋特徵儲存於暫存空間42內，且運算單元30根據暫存空間42內的上述至少一聲紋特徵訓練第一特徵資料44，以產生一調整後第一特徵資料44，以根據調整後第一特徵資料44進行安全性驗證。舉例來說，上述安全性驗證可以在調整後第一特徵資料44建立完當下立即執行，以更進一步達成一快速安全性驗證之效果。In step S4, if the above-mentioned at least voiceprint feature matches the first feature data 44, the computing unit 30 stores the above-mentioned at least voiceprint feature in the temporary storage space 42, and the computing unit 30 according to the above-mentioned voiceprint feature in the temporary storage space 42 At least voiceprint feature training first feature data 44 to generate an adjusted first feature data 44 for safety verification according to the adjusted first feature data 44 . For example, the above security verification can be performed immediately after the adjusted first characteristic data 44 is created, so as to further achieve a fast security verification effect.

於步驟S5中，運算單元30可包含一驗證計算器32，驗證計算器32之計數值於一初始狀態為0單位，若上述聲紋特徵不符第一特徵資料44，則增加驗證計算器32之計數值1單位。若上述計算值小於一預定數值，則運算單元30將傳送一驗證請求給使用者，當上述驗證請求被使用者完成確認後，則運算單元30將上述至少一聲紋特徵儲存於暫存空間42內，但不根據上述至少一聲紋特徵變更第一特徵資料44，以根據第一特徵資料44進行安全性驗證。In step S5, the computing unit 30 may include a verification calculator 32, the count value of the verification calculator 32 is 0 units in an initial state, if the above-mentioned voiceprint features do not match the first feature data 44, then increase the count of the verification calculator 32 The count value is 1 unit. If the above-mentioned calculated value is less than a predetermined value, the computing unit 30 will send a verification request to the user. After the above-mentioned verification request is confirmed by the user, the computing unit 30 will store the above-mentioned at least one voiceprint feature in the temporary storage space 42 However, the first feature data 44 is not changed according to at least the above-mentioned voiceprint features, so as to perform security verification based on the first feature data 44 .

於步驟S6中，若上述至少一聲紋特徵不符第一特徵資料44，且上述計算值達一預定數值，則運算單元30使暫存空間42內的至少一聲紋特徵與第一特徵資料44產生一第二特徵資料46。In step S6, if the above-mentioned at least voiceprint feature does not match the first feature data 44, and the above-mentioned calculated value reaches a predetermined value, then the computing unit 30 makes at least the voiceprint feature in the temporary storage space 42 consistent with the first feature data 44 A second characteristic data 46 is generated.

於步驟S7中，第二特徵資料46產生後，驗證計算器32調整計算值為0單位，且運算單元30修正成根據第一特徵資料44、第二特徵資料46來比對至少一聲紋特徵與第一特徵資料44、第二特徵資料46是否相符，以進行安全性驗證。In step S7, after the second feature data 46 is generated, the verification calculator 32 adjusts the calculation value to 0 units, and the calculation unit 30 is modified to compare at least the voiceprint features according to the first feature data 44 and the second feature data 46 Whether it is consistent with the first characteristic data 44 and the second characteristic data 46 is used for safety verification.

於一實施例中，如圖4所示，於步驟S8中，運算單元30修正成根據第一特徵資料44、第二特徵資料46來比對至少一聲紋特徵與第一特徵資料44、第二特徵資料46是否相符，而其中運算單元30先根據第二特徵資料46來比對至少一聲紋特徵與第二特徵資料46是否相符。In one embodiment, as shown in FIG. 4 , in step S8, the computing unit 30 is modified to compare at least voiceprint features with the first feature data 44 and the second feature data 46 according to the first feature data 44 and the second feature data 46. Whether the two characteristic data 46 are consistent, and wherein the computing unit 30 first compares whether at least the voiceprint characteristics are consistent with the second characteristic data 46 according to the second characteristic data 46 .

於步驟S9中，若上述聲紋特徵符合第二特徵資料46，則運算單元30將上述至少一聲紋特徵儲存於暫存空間42內，且儲存單元40根據暫存空間42內的上述至少一聲紋特徵訓練第二特徵資料46，以產生一調整後第二特徵資料46，並根據調整後第二特徵資料46進行安全性驗證。舉例來說，上述安全性驗證可以在調整後第二特徵資料46建立完當下立即執行，以更進一步達成一快速安全性驗證之效果。In step S9, if the above-mentioned voiceprint feature matches the second feature data 46, the computing unit 30 stores the above-mentioned at least one voiceprint feature in the temporary storage space 42, and the storage unit 40 stores the above-mentioned at least one voiceprint feature in the temporary storage space 42 according to the above-mentioned at least one feature data. The voiceprint feature trains the second feature data 46 to generate an adjusted second feature data 46 , and performs security verification according to the adjusted second feature data 46 . For example, the above security verification can be performed immediately after the adjusted second feature data 46 is created, so as to further achieve a fast security verification effect.

於步驟S10中，若上述聲紋特徵不符第二特徵資料46，則運算單元30再根據第一特徵資料44來比對上述聲紋特徵與第一特徵資料44是否相符。In step S10 , if the above-mentioned voiceprint features do not match the second feature data 46 , the computing unit 30 then compares whether the above-mentioned voiceprint features match the first feature data 44 according to the first feature data 44 .

於步驟S11中，若上述聲紋特徵不符第二特徵資料46、但符合第一特徵資料44，則運算單元30移除第二特徵資料46，以根據第一特徵資料44進行安全性驗證。In step S11 , if the above-mentioned voiceprint features do not match the second feature data 46 but match the first feature data 44 , the computing unit 30 removes the second feature data 46 to perform security verification according to the first feature data 44 .

於步驟S12中，驗證計算器32之計數值於一初始狀態為0單位，若上述聲紋特徵不符第二特徵資料46且不符第一特徵資料44，則增加驗證計算器32之計數值1單位。若上述計算值小於一預定數值，則傳送一驗證請求給使用者，上述驗證請求被使用者完成確認後，運算單元30傳送上述聲紋特徵至暫存空間42內儲存，但儲存單元40不根據上述至少一聲紋特徵變更第一特徵資料44、第二特徵資料46，以根據第一特徵資料44、第二特徵資料46進行安全性驗證。In step S12, the count value of the verification calculator 32 is 0 units in an initial state, if the above-mentioned voiceprint feature does not match the second feature data 46 and does not match the first feature data 44, then increase the count value of the verification calculator 32 by 1 unit . If the calculated value is less than a predetermined value, a verification request is sent to the user. After the verification request is confirmed by the user, the computing unit 30 sends the voiceprint feature to the temporary storage space 42 for storage, but the storage unit 40 does not The above-mentioned at least voiceprint features change the first feature data 44 and the second feature data 46 to perform security verification based on the first feature data 44 and the second feature data 46 .

於步驟S13中，若上述至少一聲紋特徵不符第二特徵資料46且不符第一特徵資料44，且上述計算值達一預定數值，則運算單元30根據暫存空間42的上述至少一聲紋特徵訓練第二特徵資料46，以產生一調整後第二特徵資料46。In step S13, if the above-mentioned at least voiceprint feature does not match the second feature data 46 and does not match the first feature data 44, and the above-mentioned calculated value reaches a predetermined value, then the computing unit 30 will use the above-mentioned at least voiceprint feature in the temporary storage space 42 Feature Training the second feature data 46 to generate an adjusted second feature data 46 .

於步驟S14中，產生調整後第二特徵資料46後，該驗證計算器調整該計算值為0單位，運算單元30修正成根據第一特徵資料44、調整後第二特徵資料46來判斷至少一聲紋特徵與第一特徵資料44、調整後第二特徵資料46是否相符，以進行安全性驗證。舉例來說，上述安全性驗證可以在調整後第二特徵資料46建立完當下立即執行，以更進一步達成一快速安全性驗證之效果。In step S14, after the adjusted second feature data 46 is generated, the verification calculator adjusts the calculation value to 0 units, and the calculation unit 30 is revised to determine at least one Whether the voiceprint feature is consistent with the first feature data 44 and the adjusted second feature data 46 is used for safety verification. For example, the above security verification can be performed immediately after the adjusted second feature data 46 is created, so as to further achieve a fast security verification effect.

於一實施例中，聲紋辨識方法300、400可設置於電子金融交易系統，例如為自動櫃員機（ATM）、網路銀行、或其他金融交易系統上。In one embodiment, the voiceprint recognition methods 300 and 400 can be set in an electronic financial transaction system, such as an automatic teller machine (ATM), online banking, or other financial transaction systems.

於一實施例中，聲紋辨識方法300、400中之聲紋辨識裝置可包含一深度學習模型，用以訓練第一特徵資料44、第二特徵資料46，上述深度學習模型可包括各種類型的神經網路模型中的至少一者，所述各種類型的神經網路模型包括卷積神經網路（convolution neural network，CNN）、區域卷積神經網路（region with convolution neural network，R-CNN）、區域建議網絡（region proposal network，RPN）、遞迴神經網路（recurrent neural network，RNN）、基於堆疊的深度神經網路（stacking-based deep neural network，S-DNN）、狀態空間動態神經網路（state-space dynamic neural network，S-SDNN）、反卷積網路（deconvolution network）、深度置信網路（deep belief network，DBN）、受限玻爾茲曼機（restricted Boltzmann machine，RBM）、全卷積網路（fully convolutional network）、長短期記憶體（long short-term memory，LSTM）網路以及分類網路（classification network）。In one embodiment, the voiceprint recognition device in the voiceprint recognition methods 300 and 400 may include a deep learning model for training the first feature data 44 and the second feature data 46. The deep learning model may include various types of At least one of the neural network models, the various types of neural network models include convolutional neural network (convolution neural network, CNN), regional convolutional neural network (region with convolution neural network, R-CNN) , region proposal network (region proposal network, RPN), recurrent neural network (recurrent neural network, RNN), stacking-based deep neural network (stacking-based deep neural network, S-DNN), state space dynamic neural network Road (state-space dynamic neural network, S-SDNN), deconvolution network (deconvolution network), deep belief network (deep belief network, DBN), restricted Boltzmann machine (restricted Boltzmann machine, RBM) , fully convolutional network, long short-term memory (LSTM) network, and classification network.

本發明對於電子裝置上的金融交易系統可做為一道輔助認證的功能。使用者於登入系統環境欲進行金融交易等動作時，藉由讀取裝置使用時所記錄的聲紋圖譜，透過深度學習、加入遞歸神經網路來訓練聲紋圖譜模型，藉此改善認證的品質，讓模型可以更準確地辨認使用者本人的指令，進而將此項認證應用於各種金融交易系統中，例如為行動銀行（網路銀行）等。The invention can be used as an auxiliary authentication function for the financial transaction system on the electronic device. When the user logs into the system environment and intends to perform financial transactions and other actions, by reading the voiceprint recorded during the use of the device, the voiceprint model is trained through deep learning and adding a recurrent neural network, thereby improving the quality of authentication , so that the model can more accurately identify the user's own instructions, and then apply this authentication to various financial transaction systems, such as mobile banking (online banking), etc.

惟以上所述者，僅為本發明之實施例而已，當不能以此限定本發明實施之範圍，凡是依本發明申請專利範圍及專利說明書內容所作之簡單的等效變化與修飾，皆仍屬本發明專利涵蓋之範圍內。But what is described above is only an embodiment of the present invention, and should not limit the scope of the present invention. All simple equivalent changes and modifications made according to the patent scope of the present invention and the content of the patent specification are still within the scope of the present invention. Within the scope covered by the patent of the present invention.

100,200:聲紋辨識裝置 300,400:聲紋辨識方法 10:音訊接收單元 20:聲紋辨識單元 30:運算單元 32:驗證計算器 40:儲存單元 42:暫存空間 44:第一特徵資料 46:第二特徵資料 S1~S14:步驟 S1’,S2’:步驟 100,200: voiceprint recognition device 300,400: voiceprint recognition method 10: Audio receiving unit 20:Voiceprint recognition unit 30: Operation unit 32: Verification Calculator 40: storage unit 42: temporary storage space 44: The first feature data 46: Second characteristic data S1~S14: Steps S1', S2': step

圖1係依據本發明一實施例繪示一種聲紋辨識裝置之方塊圖。FIG. 1 is a block diagram of a voiceprint recognition device according to an embodiment of the present invention.

圖2係依據本發明另一實施例繪示一種聲紋辨識裝置之方塊圖。FIG. 2 is a block diagram of a voiceprint recognition device according to another embodiment of the present invention.

圖3係依據本發明之一實施例繪示一種聲紋辨識方法之流程圖。FIG. 3 is a flowchart illustrating a voiceprint recognition method according to an embodiment of the present invention.

圖4係依據本發明之另一實施例繪示一種聲紋辨識方法之流程圖。FIG. 4 is a flowchart illustrating a voiceprint recognition method according to another embodiment of the present invention.

無none

400:聲紋辨識方法 400:Voiceprint recognition method

S1~S14:步驟 S1~S14: Steps

S1’,S2’:步驟 S1', S2': step

Claims

A low-noise voiceprint recognition device for a financial transaction system, comprising: An audio receiving unit is used for retrieving an audio data; The voiceprint recognition unit is connected to the audio receiving unit, the voiceprint recognition unit receives the audio data from the audio receiving unit, and the voiceprint recognition unit recognizes at least the voiceprint characteristics of the audio data; a storage unit for storing a first characteristic data; A computing unit is connected to the voiceprint recognition unit and the storage unit, the computing unit receives the at least voiceprint features from the voiceprint recognition unit and the first feature data from the storage unit, and the computing unit includes : a verification calculator, the count value of the verification calculator is 0 units in an initial state, wherein the computing unit judges whether the at least voiceprint feature matches the first feature data according to the first feature data; Wherein, if the computing unit judges that the at least voiceprint features conform to the first feature data, the computing unit stores the at least voiceprint features in a temporary storage space of the storage unit, and according to the The at least one voiceprint feature trains the first feature data to generate an adjusted first feature data, and performs safety verification according to the adjusted first feature data; If the operation unit judges that the at least one voiceprint feature does not match the first feature data, the verification calculator increases the count value by 1 unit, and when the calculation value is less than a predetermined value, the operation unit sends a verification request, And after the confirmation of the verification request is completed, the computing unit stores the at least voiceprint feature in the temporary storage space, but the storage unit does not change the first feature data according to the at least voiceprint feature, so that according to the second A feature data for security verification, wherein the verification request includes the identification of the user's identity; If the computing unit judges that the at least voiceprint feature does not match the first feature data, and the calculated value reaches the predetermined value, the computing unit will generating a second characteristic data; and After the second characteristic data is generated, the verification calculator adjusts the calculated value to 0 units, and the computing unit stores the first characteristic data and the second characteristic data in the storage unit, so as to use the first characteristic data, the second characteristic data The second feature data judges whether the at least voiceprint features match the first feature data and the second feature data for safety verification, including: If the computing unit judges that the at least voiceprint feature matches the second feature data, the computing unit sends the at least voiceprint feature to be stored in the temporary storage space, and according to the at least voiceprint feature in the temporary storage space feature training the second feature data to generate an adjusted second feature data, and perform safety verification according to the adjusted second feature data; If the computing unit judges that the at least one voiceprint feature does not match the second feature data but conforms to the first feature data, the computing unit removes the second feature data from the storage unit to perform the operation according to the first feature data. security verification; and After the second feature data is generated, if the computing unit judges that the at least one voiceprint feature does not match the second feature data and does not match the first feature data, then the verification calculator increases the count value by 1 unit, and when the When the calculated value is less than a predetermined value, the computing unit sends a verification request, and after the verification request is confirmed, the computing unit stores the at least voiceprint feature in the temporary storage space, but the storage unit does not changing the first feature data and the second feature data to perform security verification based on the first feature data and the second feature data, at least voiceprint features, Wherein identifying the at least voiceprint feature in the audio data further includes at least one of the following operations: filtering, reducing noise, suppressing background noise, amplifying specific voiceprint features, calculating vector parameters of the audio data, detecting specific key words, detect specific sound wave bands, detect specific sound wave waveforms, and detect specific sound wave frequencies.

The low-noise voiceprint recognition device according to Claim 1, wherein the computing unit judging whether the at least one voiceprint feature matches the first feature data and the second feature data further includes: After the second feature data is generated, if the calculated value reaches the predetermined value, the computing unit trains the second feature data according to the at least voiceprint feature in the temporary storage space to generate an adjusted second feature data; and After generating the adjusted second feature data, the verification calculator adjusts the calculation value to 0 units, and the computing unit judges the at least voiceprint feature and the at least voiceprint feature according to the first feature data and the adjusted second feature data. Whether the first characteristic data and the adjusted second characteristic data are consistent, so as to perform safety verification.

The low-noise voiceprint recognition device as claimed in claim 1, wherein the voiceprint recognition device further includes: a deep learning algorithm for training the first feature data and the second feature data, and the deep learning algorithm includes a One of a Deep Neural Network (DNN) model, a Convolutional Neural Network (CNN) model, a Recurrent Neural Network (RNN), and combinations thereof.

A low-noise voiceprint recognition method for a financial transaction system, applied to a voiceprint recognition device, the voiceprint recognition device includes an arithmetic unit and a verification calculator, one count value of the verification calculator is in an initial state is 0 unit, the voiceprint identification method includes controlling the computing unit to perform the following steps: Retrieve an audio data; identifying at least voiceprint features in the audio data; providing a first characteristic data, a temporary storage space and a verification request; According to the first feature data, comparing whether the at least one voiceprint feature is consistent with the first feature data; If the at least voiceprint feature matches the first feature data, storing the at least voiceprint feature in the temporary storage space, and training the first feature data according to the at least voiceprint feature in the temporary storage space, to generate an adjusted first characteristic data, so as to perform safety verification according to the adjusted first characteristic data; If the at least one voiceprint feature does not match the first feature data, the count value of the verification calculator is increased by 1 unit, and when the calculated value is less than a predetermined value, the verification request is sent, and when the verification request is confirmed Afterwards, storing the at least voiceprint feature in the temporary storage space, but not changing the first feature data according to the at least voiceprint feature, so as to perform security verification based on the first feature data, wherein the verification request includes using Identification of identity characteristics; If the calculated value reaches the predetermined value, generating a second characteristic data according to the at least voiceprint characteristic in the temporary storage space and the first characteristic data; and After the second feature data is generated, adjust the calculation value of the verification calculator to 0 units, and compare the at least one voiceprint feature with the first feature data according to the first feature data and the second feature data . Whether the second feature data is consistent for safety verification, including: If the computing unit judges that the at least voiceprint feature matches the second feature data, the computing unit sends the at least voiceprint feature to be stored in the temporary storage space, and according to the at least voiceprint feature in the temporary storage space feature training the second feature data to generate an adjusted second feature data, and perform safety verification according to the adjusted second feature data; If the computing unit judges that the at least one voiceprint feature does not match the second feature data but conforms to the first feature data, the computing unit removes the second feature data from the storage unit to perform the operation according to the first feature data. security verification; and After the second feature data is generated, if the computing unit judges that the at least one voiceprint feature does not match the second feature data and does not match the first feature data, then the verification calculator increases the count value by 1 unit, and when the When the calculated value is less than a predetermined value, the computing unit sends a verification request, and after the verification request is confirmed, the computing unit stores the at least voiceprint feature in the temporary storage space, but the storage unit does not changing the first feature data and the second feature data to perform security verification based on the first feature data and the second feature data, at least voiceprint features, Wherein identifying the at least voiceprint feature in the audio data further includes at least one of the following operations: filtering, reducing noise, suppressing background noise, amplifying specific voiceprint features, calculating vector parameters of the audio data, detecting specific key words, detect specific sound wave bands, detect specific sound wave waveforms, and detect specific sound wave frequencies.

The low-noise voiceprint recognition method as claimed in Claim 4, wherein comparing whether the at least one voiceprint feature is consistent with the first feature data and the second feature data further includes the following steps: After the second feature data is generated, and the calculated value reaches the predetermined value, train the second feature data according to the at least voiceprint feature in the temporary storage space to generate an adjusted second feature data; and After generating the adjusted second feature data, adjust the calculation value of the verification calculator to 0 units, and according to whether the at least voiceprint feature is consistent with the first feature data and the adjusted second feature data, to Perform security verification.

The low-noise voiceprint recognition method of Claim 4, wherein the voiceprint recognition device further includes: a deep learning algorithm for training the first feature data and the second feature data, and the deep learning algorithm includes a One of a Deep Neural Network (DNN) model, a Convolutional Neural Network (CNN) model, a Recurrent Neural Network (RNN), and combinations thereof.