TWI817897B

TWI817897B - Low-noise voiceprint identification device for financial transaction system and method thereof

Info

Publication number: TWI817897B
Application number: TW112103776A
Authority: TW
Inventors: 翁子崴
Original assignee: 華南商業銀行股份有限公司
Priority date: 2021-09-08
Filing date: 2021-09-08
Publication date: 2023-10-01
Also published as: TW202328951A

Abstract

A low-noise voiceprint identification device for a financial transaction system and method thereof are provided, which improve the quality of voiceprint recognition and reduce noise by adding a recognition buffer mechanism and processing sound signals through filtering. It can be used as an auxiliary authentication function for financial transaction behavior on electronic devices. The voiceprint identification device includes an audio receiving unit, a voiceprint recognition unit, a storage unit, and an arithmetic unit. The arithmetic unit compares at least one voiceprint feature extracted from audio data with the first feature data and the second feature data of the storage unit, utilizes deep learning and neural network methods to train the first feature data and the second feature data, in order to security verification.

Description

Low-noise voiceprint recognition device and method used in financial trading system

本發明係有關一種聲紋辨識裝置與其方法，尤其是有關一種用於金融交易系統之低雜訊聲紋辨識裝置與其方法。 The present invention relates to a voiceprint recognition device and a method thereof, and in particular to a low-noise voiceprint recognition device and a method used in a financial transaction system.

聲紋具有唯一性、獨特性和不易竄改性，加之較不涉及使用者隱私問題，故聲紋識別可用於要求從音頻信號中提取個體差異，擷取出能夠反映使用者是誰的信息，從而進行使用者識別，其基本原理是每一個使用者建立一個能夠描述這一使用者個性特徵的模組，作為此使用者個性特徵的描述。 Voiceprints are unique, unique and not easily tampered with. In addition, they do not involve user privacy issues. Therefore, voiceprint recognition can be used to extract individual differences from audio signals and extract information that can reflect who the user is. The basic principle of user identification is that each user creates a module that can describe the user's personality characteristics as a description of the user's personality characteristics.

聲紋辨識目前已經應用於身份辨識，作為判斷使用者身份之工具。然而，聲紋識別的缺點在於，聲音容易受到年齡、情緒、或身體狀況等因素影響，導致識別性能降低，故目前聲紋辨識技術的準確度仍有待改善。此外設備的收音品質與環境噪音等因素，也會影響聲紋辨識的識別效能。 Voiceprint recognition is currently used in identity recognition as a tool to determine the user's identity. However, the disadvantage of voiceprint recognition is that the voice is easily affected by factors such as age, emotion, or physical condition, resulting in reduced recognition performance. Therefore, the accuracy of current voiceprint recognition technology still needs to be improved. In addition, factors such as the equipment’s sound quality and environmental noise will also affect the recognition performance of voiceprint recognition.

有鑑於先前技術的上述缺點，本發明之一態樣提供了一低雜訊聲紋辨識裝置，該裝置包含：一音訊接收單元，係用以擷取一音頻資料；一聲紋辨識單元，係連接於該音訊接收單元，該聲紋辨識單元由該音訊接收單元接收該音頻資料，且該聲紋辨識單元辨識出該音頻資料的至少一聲紋特徵；一儲存單元，儲存一第一特徵資料；一運算單元，係連接於該聲紋辨識單元與該儲存單元，該運算單元自該聲紋辨識單元接收該至少一聲紋特徵以及自該儲存單元接收該第一特徵資料，且該運算單元包含：一驗證計算器，該驗證計算器之計數值於一初始狀態為0單位，其中該運算單元根據該第一特徵資料判斷該至少一聲紋特徵與該第一特徵資料的是否相符；其中，若該運算單元判斷該至少一聲紋特徵符合該第一特徵資料，則該運算單元將該至少一聲紋特徵儲存於該儲存單元的一暫存空間內，根據該暫存空間內的該至少一聲紋特徵訓練該第一特徵資料，以產生一調整後第一特徵資料，並根據該調整後第一特徵資料進行安全性驗證；若該運算單元判斷該至少一聲紋特徵不符該第一特徵資料，則該運算單元傳送一驗證請求，並於該驗證請求完成確認後，該運算單元將該至少一聲紋特徵儲存於該暫存空間內，但該儲存單元不根據該至少一聲紋特徵變更該第一特徵資料，以根據該第一特徵資料進行安全性驗證，其中該驗證請求包含使用者身份特徵之識別；若該運算單元判斷該至少一聲紋特徵不符該第一特徵資料，且該計數值達該預定數值，則該運算單元根據該暫存空間內的該至少一聲紋特徵與該第一特徵資料產生一第二特徵資料；以及於該第二特徵資料產生後，該驗證計算器調整該計數值為0單位，該運算單元將該第一特徵資料、該第二特徵資料儲存於儲存單元，以根據該第一特徵資料、該第二特徵資料判斷該至少一聲紋特徵與該第一特徵資料、該第二特徵資料是否相符，以進行安全性驗證，包含：若該運算單元判斷該至少一聲紋特徵符合該第二特徵資料，則該運算單元傳送該至少一聲紋特徵儲存於該暫存空間內，且根據該暫存空間內的該至少一聲紋特徵訓練該第二特徵資料，以產生一調整後第二特徵資料，並根據該調整後第二特徵資料進行安全性驗證；若該運算單元判斷該至少一聲紋特徵不符該第二特徵資料、但符合該第一特徵資料，則該運算單元自該儲存單元移除該第二特徵資料，以根據該第一特徵資料進行安全性驗證；以及於該第二特徵資料產生後，若該運算單元判斷該至少一聲紋特徵不符該第二特徵資料且不符該第一特徵資料，則該驗證計算器對該計數值增加1單位，且當該計數值小於一預定數值時，該運算單元傳送一驗證請求，並於該驗證請求完成確認後，該運算單元將該至少一聲紋特徵儲存於該暫存空間內，但該儲存單元不根據該至少一聲紋特徵變更該第一特徵資料、該第二特徵資料，以根據該第一特徵資料、該第二特徵資料進行安全性驗證，其中辨識該音頻資料內的該至少一聲紋特徵進一步包含以下操作中至少一者：濾波、降低雜訊、抑制背景噪音、放大特定聲紋特徵、計算該音頻資料之向量參數、偵測特定關鍵詞、偵測特定聲波波段、偵測特定聲波波形以及偵測特定聲波頻率。 In view of the above shortcomings of the prior art, one aspect of the present invention provides a low-noise voiceprint recognition device. The device includes: an audio receiving unit for retrieving audio data; a voiceprint recognition unit Connected to the audio receiving unit, the voiceprint recognition unit receives the audio from the audio receiving unit audio data, and the voiceprint recognition unit identifies at least one voiceprint feature of the audio data; a storage unit stores a first characteristic data; a computing unit is connected to the voiceprint recognition unit and the storage unit, the The computing unit receives the at least voiceprint characteristics from the voiceprint recognition unit and the first characteristic data from the storage unit, and the computing unit includes: a verification calculator, the count value of the verification calculator in an initial state is 0 units, wherein the computing unit determines whether the at least one voiceprint feature matches the first feature data based on the first feature data; wherein, if the computing unit determines that the at least one voiceprint feature matches the first feature data, Then the computing unit stores the at least one voice fingerprint feature in a temporary storage space of the storage unit, and trains the first feature data based on the at least one voice fingerprint feature in the temporary storage space to generate an adjusted first feature data, and perform security verification based on the adjusted first feature data; if the computing unit determines that the at least one voiceprint feature does not match the first feature data, the computing unit sends a verification request, and completes the verification request After confirmation, the computing unit stores the at least one voice fingerprint characteristic in the temporary storage space, but the storage unit does not change the first characteristic data based on the at least one voice fingerprint characteristic to perform security operations based on the first characteristic data. Verification, wherein the verification request includes identification of user identity characteristics; if the computing unit determines that the at least voiceprint characteristics do not match the first characteristic data, and the count value reaches the predetermined value, then the computing unit determines according to the temporary storage space The at least one voiceprint feature and the first feature data in the generated second feature data; and after the second feature data is generated, the verification calculator adjusts the count value to 0 units, and the computing unit converts the first The characteristic data and the second characteristic data are stored in the storage unit, so as to determine whether the at least one voiceprint characteristic matches the first characteristic data and the second characteristic data based on the first characteristic data and the second characteristic data, so as to perform Security verification includes: if the computing unit determines that the at least one voiceprint feature matches the second feature data, the computing unit transmits the at least one voiceprint feature and stores it in the temporary storage space, and according to the temporary storage space The at least voiceprint characteristics train the second characteristic data to generate an adjusted second characteristic data, and perform security operations based on the adjusted second characteristic data. sexual verification; if the computing unit determines that the at least one voiceprint characteristic does not match the second characteristic data, but matches the first characteristic data, the computing unit removes the second characteristic data from the storage unit to obtain the result according to the first characteristic data. The characteristic data performs security verification; and after the second characteristic data is generated, if the computing unit determines that the at least one voiceprint characteristic does not match the second characteristic data and does not match the first characteristic data, the verification calculator The value increases by 1 unit, and when the count value is less than a predetermined value, the computing unit sends a verification request, and after the verification request is confirmed, the computing unit stores the at least one voice fingerprint feature in the temporary storage space , but the storage unit does not change the first characteristic data and the second characteristic data based on the at least audio fingerprint characteristics, so as to perform security verification based on the first characteristic data and the second characteristic data, wherein the content of the audio data is identified. The at least one voiceprint feature further includes at least one of the following operations: filtering, reducing noise, suppressing background noise, amplifying specific voiceprint features, calculating vector parameters of the audio data, detecting specific keywords, and detecting specific sound waves. band, detect specific sound waveforms, and detect specific sound wave frequencies.

本發明之另一態樣提供了一低雜訊聲紋辨識方法，應用於一聲紋辨識裝置，該聲紋辨識裝置包含一運算單元以及一驗證計算器，該驗證計算器之一計數值於一初始狀態為0單位，該聲紋辨識方法包含控制該運算單元進行以下步驟：擷取一音頻資料；辨識該音頻資料內的至少一聲紋特徵；提供一第一特徵資料、一暫存空間以及一驗證請求；根據該第一特徵資料，比對該至少一聲紋特徵與該第一特徵資料是否相符；若該至少一聲紋特徵符合該第一特徵資料，則儲存該至少一聲紋特徵於該暫存空間內，並根據該暫存空間內的該至少一聲紋特徵訓練該第一特徵資料，以產生一調整後第一特徵資料，以根據該調整後第一特徵資料進行安全性驗證；若該至少一聲紋特徵不符該第一特徵資料，則傳送該驗證請求，當該驗證請求完成確認後，儲存該至少一聲紋特徵於該暫存空間內，但不根據該至少一聲紋特徵變更該第一特徵資料，以根據該第一特徵資料進行安全性驗證，其中該驗證請求包含使用者身份特徵之識別；若該計數值達該預定數值，則根據該暫存空間內的該至少一聲紋特徵與該第一特徵資料產生一第二特徵資料；以及於該第二特徵資料產生後，調整該驗證計算器之該計數值為0單位，且根據該第一特徵資料、該第二特徵資料來比對該至少一聲紋特徵與該第一特徵資料、該第二特徵資料是否相符，以進行安全性驗證，包含：若該運算單元判斷該至少一聲紋特徵符合該第二特徵資料，則該運算單元傳送該至少一聲紋特徵儲存於該暫存空間內，且根據該暫存空間內的該至少一聲紋特徵訓練該第二特徵資料，以產生一調整後第二特徵資料，並根據該調整後第二特徵資料進行安全性驗證；若該運算單元判斷該至少一聲紋特徵不符該第二特徵資料、但符合該第一特徵資料，則該運算單元自該儲存單元移除該第二特徵資料，以根據該第一特徵資料進行安全性驗證；以及於該第二特徵資料產生後，若該運算單元判斷該至少一聲紋特徵不符該第二特徵資料且不符該第一特徵資料，則該驗證計算器對該計數值增加1單位，且當該計數值小於一預定數值時，該運算單元傳送一驗證請求，並於該驗證請求完成確認後，該運算單元將該至少一聲紋特徵儲存於該暫存空間內，但該儲存單元不根據該至少一聲紋特徵變更該第一特徵資料、該第二特徵資料，以根據該第一特徵資料、該第二特徵資料進行安全性驗證，其中辨識該音頻資料內的該至少一聲紋特徵進一步包含以下操作中至少一者：濾波、降低雜訊、抑制背景噪音、放大特定聲紋特徵、計算該音頻資料之向量參數、偵測特定關鍵詞、偵測特定聲波波段、偵測特定聲波波形以及偵測特定聲波頻率。 Another aspect of the present invention provides a low-noise voiceprint recognition method applied to a voiceprint recognition device. The voiceprint recognition device includes an arithmetic unit and a verification calculator, and a count value of the verification calculator is An initial state is 0 units, and the voiceprint recognition method includes controlling the computing unit to perform the following steps: acquiring an audio data; identifying at least one voiceprint feature in the audio data; providing a first feature data and a temporary storage space and a verification request; based on the first characteristic data, compare whether the at least one voiceprint feature matches the first feature data; if the at least one voiceprint feature matches the first feature data, store the at least one voiceprint feature Characterize in the temporary storage space, and train the first characteristic data based on the at least voiceprint characteristics in the temporary storage space to generate an adjusted first characteristic data to perform security operations based on the adjusted first characteristic data. Sexual verification; if the at least one voiceprint feature does not match the first feature data, the verification request is sent. After the verification request is confirmed, the at least one voiceprint feature is stored in the temporary storage space, but the at least one voiceprint feature is not stored in the temporary storage space according to the at least one voiceprint feature. The voiceprint characteristic changes the first characteristic data to perform installation according to the first characteristic data. Completeness verification, wherein the verification request includes identification of user identity characteristics; if the count value reaches the predetermined value, a second characteristic is generated based on the at least one fingerprint characteristic and the first characteristic data in the temporary storage space data; and after the second characteristic data is generated, adjust the count value of the verification calculator to 0 units, and compare the at least one voiceprint characteristic with the third characteristic data based on the first characteristic data and the second characteristic data. Whether a characteristic data and the second characteristic data are consistent for security verification includes: if the computing unit determines that the at least one voice fingerprint characteristic matches the second characteristic data, the computing unit sends the at least one voice fingerprint characteristic to store in the temporary storage space, and train the second characteristic data based on the at least one voiceprint characteristic in the temporary storage space to generate an adjusted second characteristic data, and perform security operations based on the adjusted second characteristic data Verify; if the computing unit determines that the at least one voiceprint characteristic does not match the second characteristic data but matches the first characteristic data, the computing unit removes the second characteristic data from the storage unit to determine the first characteristic data based on the first characteristic data. The data is subjected to security verification; and after the second characteristic data is generated, if the computing unit determines that the at least one voiceprint characteristic does not match the second characteristic data and does not match the first characteristic data, the verification calculator calculates the count value Increase by 1 unit, and when the count value is less than a predetermined value, the computing unit sends a verification request, and after the verification request is confirmed, the computing unit stores the at least one fingerprint feature in the temporary storage space, However, the storage unit does not change the first characteristic data and the second characteristic data based on the at least audio fingerprint characteristics to perform security verification based on the first characteristic data and the second characteristic data, wherein the audio data is identified. The at least one voiceprint feature further includes at least one of the following operations: filtering, reducing noise, suppressing background noise, amplifying specific voiceprint features, calculating vector parameters of the audio data, detecting specific keywords, and detecting specific sound wave bands. , detect specific sound wave waveforms and detect specific sound wave frequencies.

綜上所述，使用者於登入系統環境欲進行金融交易等動作時，藉由讀取裝置使用時所記錄的聲紋特徵，搭配深度學習、加入神經網路等訓練聲紋特徵模型，可改善認證的品質，讓模型可以更準確地辨認使用者本人，亦進一步地將此項認證應用於各種金融交易系統當中。 To sum up, when users log into the system environment and want to perform financial transactions and other actions, they read the voiceprint characteristics recorded when the device is used, combined with deep learning, neural network and other training The voiceprint feature model can improve the quality of authentication, allowing the model to more accurately identify the user, and further apply this authentication to various financial transaction systems.

100,200:聲紋辨識裝置 100,200: Voiceprint recognition device

300,400:聲紋辨識方法 300,400: Voiceprint recognition method

10:音訊接收單元 10: Audio receiving unit

20:聲紋辨識單元 20: Voiceprint recognition unit

30:運算單元 30:Arithmetic unit

32:驗證計算器 32: Validation Calculator

40:儲存單元 40:Storage unit

42:暫存空間 42: Temporary storage space

44:第一特徵資料 44: First characteristic data

46:第二特徵資料 46: Second characteristic data

S1~S14:步驟 S1~S14: steps

S1’,S2’:步驟 S1’, S2’: steps

圖1係依據本發明一實施例繪示一種聲紋辨識裝置之方塊圖。 FIG. 1 is a block diagram of a voiceprint recognition device according to an embodiment of the present invention.

圖2係依據本發明另一實施例繪示一種聲紋辨識裝置之方塊圖。 FIG. 2 is a block diagram of a voiceprint recognition device according to another embodiment of the present invention.

圖3係依據本發明之一實施例繪示一種聲紋辨識方法之流程圖。 FIG. 3 is a flow chart illustrating a voiceprint recognition method according to an embodiment of the present invention.

圖4係依據本發明之另一實施例繪示一種聲紋辨識方法之流程圖。 FIG. 4 is a flow chart illustrating a voiceprint recognition method according to another embodiment of the present invention.

以下說明係為完成發明的較佳實現方式，其目的在於描述本發明的基本精神，但並不用以限定本發明。實際的發明內容必須參考之後的權利要求範圍。 The following description is a preferred implementation manner for completing the invention, and its purpose is to describe the basic spirit of the invention, but is not intended to limit the invention. For the actual invention, reference must be made to the following claims.

必須了解的是，使用於本說明書中的“包含”、“包括”等詞，係用以表示存在特定的技術特徵、數值、方法步驟、作業處理、元件以及/或組件，但並不排除可加上更多的技術特徵、數值、方法步驟、作業處理、元件、組件，或以上的任意組合。 It must be understood that the words "including" and "including" used in this specification are used to indicate the existence of specific technical features, values, method steps, work processes, components and/or components, but do not exclude the possibility of Plus further technical features, values, method steps, processes, components, assemblies, or any combination of the above.

於權利要求中使用如“第一”、“第二”等詞係用來修飾權利要求中的元件，並非用來表示之間具有優先權順序，先行關係，或者是一個元件先於另一個元件，或者是執行方法步驟時的時間先後順序，僅用來區別具有相同名字的元件。 The use of words such as "first" and "second" in the claims is used to modify the elements in the claims, and is not used to indicate a priority order, precedence relationship, or that one element precedes another element. , or the time sequence when executing method steps, is only used to distinguish components with the same name.

請一併參閱圖1與圖2，圖1係依據本發明一實施例繪示一種聲紋辨識裝置之方塊圖，圖2係依據本發明另一實施例繪示一種聲紋辨識裝置之方塊圖。 Please refer to FIGS. 1 and 2 together. FIG. 1 is a block diagram of a voiceprint recognition device according to one embodiment of the present invention. FIG. 2 is a block diagram of a voiceprint recognition device according to another embodiment of the present invention. .

於一實施例中，如圖1所示，聲紋辨識裝置100包含一音訊接收單元10、一聲紋辨識單元20、一運算單元30以及一儲存單元40。 In one embodiment, as shown in FIG. 1 , the voiceprint recognition device 100 includes an audio receiving unit 10 , a voiceprint recognition unit 20 , a computing unit 30 and a storage unit 40 .

音訊接收單元10係擷取一音頻資料。聲紋辨識單元20係連接音訊接收單元10以接收上述音頻資料，且可辨識出上述音頻資料內的至少一聲紋特徵。在一些實施例中，上述至少一聲紋特徵可以是特定關鍵詞、聲波波段、聲波波形、聲波頻率等。於一實施例中，聲紋辨識單元20用於辨識音頻資料內的上述至少一聲紋特徵，並且聲紋辨識單元20可包含以下操作中至少一者，例如為濾波、降低雜訊、抑制背景噪音、放大特定聲紋特徵、計算音頻資料之一向量參數、偵測特定關鍵詞、偵測特定聲波波段、偵測特定聲波波形、偵測特定聲波頻率等。 The audio receiving unit 10 captures audio data. The voiceprint recognition unit 20 is connected to the audio receiving unit 10 to receive the audio data, and can identify at least one voiceprint feature in the audio data. In some embodiments, the above-mentioned at least one sound fingerprint feature may be a specific keyword, a sound wave band, a sound wave waveform, a sound wave frequency, etc. In one embodiment, the voiceprint recognition unit 20 is used to identify the at least one voiceprint feature in the audio data, and the voiceprint recognition unit 20 may include at least one of the following operations, such as filtering, noise reduction, and background suppression. Noise, amplify specific voiceprint features, calculate a vector parameter of audio data, detect specific keywords, detect specific sound wave bands, detect specific sound wave waveforms, detect specific sound wave frequencies, etc.

儲存單元40可儲存第一特徵資料44。在一些實施例中，第一特徵資料44可以是符合使用者之任何一相關聲紋資訊，例如為特定關鍵詞、聲波波段、聲波波形、聲波頻率等。在一些實施例中，第一特徵資料44可以是類比訊號、數位訊號、類比/數位混合訊號等資料儲存之模式。 The storage unit 40 can store the first characteristic data 44 . In some embodiments, the first characteristic data 44 may be any relevant voiceprint information that matches the user, such as specific keywords, sound wave bands, sound wave waveforms, sound wave frequencies, etc. In some embodiments, the first characteristic data 44 may be an analog signal, a digital signal, an analog/digital hybrid signal, or other data storage modes.

運算單元30係連接聲紋辨識單元20與儲存單元40，可接收自聲紋辨識單元20的上述至少一聲紋特徵，以及接收自儲存單元40的第一特徵資料44，於運算單元30內判斷上述至少一聲紋特徵與第一特徵資料44是否相符。 The computing unit 30 is connected to the voiceprint recognition unit 20 and the storage unit 40, and can receive the at least one voiceprint feature from the voiceprint recognition unit 20 and the first feature data 44 received from the storage unit 40, and determine in the computing unit 30 Whether the above-mentioned at least one voiceprint feature matches the first feature data 44.

若運算單元30判斷上述至少一聲紋特徵符合第一特徵資料44，則運算單元30將上述至少一聲紋特徵儲存於儲存單元40的一暫存空間42內，且根據暫存空間42內的上述至少一聲紋特徵來訓練第一特徵資料44，以產生一調整後第一特徵資料44，並根據調整後第一特徵資料44進行安全性驗證。舉例來說，上述安全性驗證可以在調整後第一特徵資料44建立完當下立即執行，以更進一步達成一快速安全性驗證之效果。 If the computing unit 30 determines that the at least one voiceprint feature matches the first feature data 44, the computing unit 30 stores the at least one voiceprint feature in a temporary storage space 42 of the storage unit 40, and based on the data in the temporary storage space 42 The at least one voiceprint feature is used to train the first feature data 44 to generate an adjusted The first characteristic data 44 is adjusted, and security verification is performed based on the adjusted first characteristic data 44 . For example, the above security verification can be executed immediately after the adjusted first characteristic data 44 is created to further achieve the effect of a fast security verification.

在另一範例中，上述安全性驗證可以待後續收到新的一聲紋特徵，再將此新的一聲紋特徵與調整後第一特徵資料44比對，如此同樣可達到安全性驗證之目的，且相較於前一個範例，可降低誤判斷為「不通過」之機率，提昇整體流程的順暢度，進而提昇使用者體驗。 In another example, the above security verification can wait until a new voiceprint feature is subsequently received, and then compare the new voiceprint feature with the adjusted first feature data 44, so that the security verification can also be achieved. Purpose, compared with the previous example, can reduce the probability of misjudgment as "fail", improve the smoothness of the overall process, and thereby improve the user experience.

在一些實施例中，上述判斷的至少一聲紋特徵可對應於使用者之一正常聲紋，或受到年齡、情緒、或身體狀況等因素影響後之異常聲紋，第一特徵資料44則可視為使用者原始預設之聲紋特徵，若至少一聲紋特徵能與第一特徵資料44相符，則使用調整後第一特徵資料44作為聲紋識別標準來進行安全性驗證，且第一特徵資料44可被持續訓練，以持續更新第一特徵資料44可接受之聲紋範疇。 In some embodiments, at least the voiceprint characteristics determined above may correspond to a normal voiceprint of the user, or an abnormal voiceprint affected by factors such as age, emotion, or physical condition, and the first characteristic data 44 can be viewed It is the user's original preset voiceprint feature. If at least the voiceprint feature can match the first feature data 44, then the adjusted first feature data 44 is used as the voiceprint recognition standard for security verification, and the first feature The data 44 can be continuously trained to continuously update the acceptable voiceprint range of the first characteristic data 44 .

於一實施例中，如圖2所示，聲紋辨識裝置200之運算單元30可進一步包含一驗證計算器32，驗證計算器32之計數值於一初始狀態為0單位，若運算單元30判斷上述至少一聲紋特徵不符第一特徵資料44，則驗證計算器32之計數值增加1單位。 In one embodiment, as shown in Figure 2, the computing unit 30 of the voiceprint recognition device 200 may further include a verification calculator 32. The count value of the verification calculator 32 is 0 units in an initial state. If the computing unit 30 determines If the at least one voiceprint characteristic does not match the first characteristic data 44, the count value of the verification calculator 32 is increased by 1 unit.

若上述計數值小於一預定數值，若運算單元30判斷上述至少一聲紋特徵不符第一特徵資料44，則運算單元30將傳送一驗證請求給使用者，且等待上述驗證請求被使用者完成確認後，運算單元30將上述至少一聲紋特徵儲存於暫存空間42內，但儲存單元40不根據上述至少一聲紋特徵來變更第一特徵資料44，而是仍以第一特徵資料44進行安全性驗證。在一些實施例中，上述至少一聲紋特徵可對應於使用者受到年齡、情緒、或身體狀況等因素影響後之異常聲紋，第一特徵資料44則可視為使用者原始預設之聲紋特徵。若異常聲紋的上述至少一聲紋特徵與第一特徵資料44不符，則本發明仍保留第一特徵資料44，並且僅將異常聲紋另外儲存於暫存空間42內，以此可作為一辨識緩衝機制，上述辨識緩衝機制主要避免使用者之上述至少一聲紋特徵僅為聲紋暫時性異常、或聲紋辨識裝置100判斷失誤之可能。 If the count value is less than a predetermined value, and if the computing unit 30 determines that the at least one voiceprint feature does not match the first feature data 44, the computing unit 30 will send a verification request to the user and wait for the verification request to be confirmed by the user. Afterwards, the computing unit 30 stores the at least one voice fingerprint feature in the temporary storage space 42 , but the storage unit 40 does not change the first feature data 44 based on the at least one voice fingerprint feature, but still uses the first feature data 44 for processing. Security verification. In some embodiments, the at least one voiceprint feature may correspond to the user's abnormal voiceprint that is affected by factors such as age, emotion, or physical condition, and the first feature data 44 may be regarded as the user's original preset voiceprint. Characteristics. If the abnormal voiceprint is at least the above If the voiceprint characteristics do not match the first characteristic data 44, the present invention still retains the first characteristic data 44, and only additionally stores the abnormal voiceprint in the temporary storage space 42, which can be used as a recognition buffering mechanism. The above recognition buffering The mechanism mainly avoids the possibility that at least the above-mentioned voiceprint characteristics of the user are only temporary abnormalities in the voiceprint, or that the voiceprint recognition device 100 makes an error in judgment.

若上述計數值達該預定數值，則運算單元30根據暫存空間42內的上述至少一聲紋特徵與第一特徵資料44產生一第二特徵資料46。在一些實施例中，第二特徵資料46可以是類比訊號、數位訊號、類比/數位混合訊號等資料儲存之模式。 If the count value reaches the predetermined value, the computing unit 30 generates a second feature data 46 based on the at least one voice fingerprint feature and the first feature data 44 in the temporary storage space 42 . In some embodiments, the second characteristic data 46 may be an analog signal, a digital signal, an analog/digital mixed signal, or other data storage modes.

於第二特徵資料46產生後，驗證計算器32調整計數值為0單位，運算單元30將第一特徵資料44、第二特徵資料46儲存於儲存單元40，並修改為根據第一特徵資料44、第二特徵資料46，使運算單元30判斷後續接收之至少一聲紋特徵與第一特徵資料44、第二特徵資料46是否相符，以進行安全性驗證。在一些實施例中，上述至少一聲紋特徵可對應於使用者受到年齡、情緒、或身體狀況等因素影響後之異常聲紋，第一特徵資料44則可視為使用者原始預設之聲紋特徵。若異常聲紋的上述至少一聲紋特徵已達一定預定次數(預定時間)而不能符合第一特徵資料44，則本發明仍保留第一特徵資料44，並且另根據暫存空間42內儲存的上述至少一聲紋特徵與第一特徵資料44產生一第二特徵資料46，以此可作為另一辨識緩衝機制，上述辨識緩衝機制主要增加使用者受到年齡、情緒、或身體狀況等因素一定時間影響後的聲紋特徵辨識資料。 After the second characteristic data 46 is generated, the verification calculator 32 adjusts the count value to 0 units, and the computing unit 30 stores the first characteristic data 44 and the second characteristic data 46 in the storage unit 40 and modifies them according to the first characteristic data 44 , the second characteristic data 46, so that the computing unit 30 determines whether the subsequently received at least one voiceprint characteristic matches the first characteristic data 44 and the second characteristic data 46 for security verification. In some embodiments, the at least one voiceprint feature may correspond to the user's abnormal voiceprint that is affected by factors such as age, emotion, or physical condition, and the first feature data 44 may be regarded as the user's original preset voiceprint. Characteristics. If the above-mentioned at least one voiceprint feature of the abnormal voiceprint has reached a certain predetermined number of times (predetermined time) and cannot match the first feature data 44, the present invention still retains the first feature data 44, and additionally based on the first feature data 44 stored in the temporary storage space 42 The above-mentioned at least voiceprint characteristics and the first characteristic data 44 generate a second characteristic data 46, which can be used as another recognition buffering mechanism. The above-mentioned recognition buffering mechanism mainly increases the user's age, mood, or physical condition and other factors for a certain period of time. Influenced voiceprint feature identification data.

於一實施例中，運算單元30於判斷至少一聲紋特徵與第一特徵資料44、第二特徵資料46是否相符時，若上述至少一聲紋特徵符合第二特徵資料46，則運算單元30傳送上述至少一聲紋特徵儲存於暫存空間42內，且根據暫存空間42內的上述至少一聲紋特徵訓練第二特徵資料46，以產生一調整後第二特徵資料46，並根據該調整後第二特徵資料46進行安全性驗證。舉例來說，上述安全性驗證可以在調整後第二特徵資料46建立完當下立即執行，以更進一步達成一快速安全性驗證之效果。 In one embodiment, when the computing unit 30 determines whether the at least one voiceprint feature matches the first feature data 44 and the second feature data 46, if the at least one voiceprint feature matches the second feature data 46, the calculation unit 30 The at least one voiceprint feature is transmitted and stored in the temporary storage space 42, and the second feature data 46 is trained according to the at least one voiceprint feature in the temporary storage space 42 to generate an adjusted second feature. Data 46, and security verification is performed based on the adjusted second characteristic data 46. For example, the above security verification can be executed immediately after the adjusted second characteristic data 46 is created to further achieve the effect of a fast security verification.

在另一範例中，上述安全性驗證可以待後續收到新的一聲紋特徵，再將此新的一聲紋特徵與調整後第二特徵資料46比對，如此同樣可達到安全性驗證之目的，且相較於前一個範例，可降低誤判斷為「不通過」之機率，提昇整體流程的順暢度，進而提昇使用者體驗。 In another example, the above security verification can wait until a new voiceprint feature is subsequently received, and then compare the new voiceprint feature with the adjusted second feature data 46, so that the security verification can also be achieved. Purpose, compared with the previous example, can reduce the probability of misjudgment as "fail", improve the smoothness of the overall process, and thereby improve the user experience.

在一些實施例中，上述至少一聲紋特徵可對應於使用者受到年齡、情緒、或身體狀況等因素影響後之異常聲紋，第二特徵資料46則可視為使用者受一定時間異常影響後的聲紋特徵辨識資料。若異常聲紋的上述至少一聲紋特徵已達一定預定次數(預定時間)而卻與第二特徵資料46相符，則本發明仍保留第一特徵資料44，並且僅判斷為使用者已達一聲紋異常變化後的穩定狀況，而使用調整後第二特徵資料46作為此穩定狀況的聲紋識別標準來進行安全性驗證，且第二特徵資料46可被持續訓練，以持續更新第二特徵資料46可接受之聲紋範疇。 In some embodiments, the above-mentioned at least voiceprint characteristics may correspond to the user's abnormal voiceprint after being affected by factors such as age, emotion, or physical condition, and the second characteristic data 46 may be regarded as the user's abnormal voiceprint after being affected by abnormality for a certain period of time. Voiceprint feature identification data. If the above-mentioned at least one voiceprint feature of the abnormal voiceprint has reached a certain predetermined number of times (predetermined time) but matches the second feature data 46, the present invention still retains the first feature data 44, and only determines that the user has reached a certain predetermined number of times (predetermined time). The stable condition after the abnormal change of voiceprint, and the adjusted second characteristic data 46 is used as the voiceprint recognition standard of this stable condition for security verification, and the second characteristic data 46 can be continuously trained to continuously update the second characteristic. Data 46 Acceptable range of voiceprints.

若運算單元30判斷上述至少一聲紋特徵不符第二特徵資料46、但符合第一特徵資料44，則運算單元30自儲存單元40中移除第二特徵資料46。第二特徵資料46移除後，重新根據第一特徵資料44進行安全性驗證。在一些實施例中，上述至少一聲紋特徵可對應於使用者受到年齡、情緒、或身體狀況等因素影響後之異常聲紋特徵，第一特徵資料44可視為使用者原始預設之聲紋特徵，第二特徵資料46則可視為使用者受一定時間異常影響後的聲紋特徵辨識資料。若使用者由造成異常聲紋特徵之狀況恢復時，則可判斷為第二特徵資料46已不適用為一聲紋識別標準，故予以移除第二特徵資料46、恢復為根據第一特徵資料44作為聲紋識別之標準來進行安全性驗證。 If the computing unit 30 determines that the at least one voiceprint characteristic does not match the second characteristic data 46 but matches the first characteristic data 44 , the computing unit 30 removes the second characteristic data 46 from the storage unit 40 . After the second characteristic data 46 is removed, the security verification is performed again based on the first characteristic data 44 . In some embodiments, the above-mentioned at least voiceprint characteristics may correspond to abnormal voiceprint characteristics of the user affected by factors such as age, emotion, or physical condition, and the first characteristic data 44 may be regarded as the user's original preset voiceprint. Characteristics, the second characteristic data 46 can be regarded as the user's voiceprint characteristic identification data after being affected by abnormality for a certain period of time. If the user recovers from the situation that caused the abnormal voiceprint characteristics, it can be determined that the second characteristic data 46 is no longer applicable. As the standard for voiceprint recognition, the second characteristic data 46 is removed and the first characteristic data 44 is restored as the standard for voiceprint recognition to perform security verification.

若運算單元30判斷上述至少一聲紋特徵不符第二特徵資料46且不符第一特徵資料44，則驗證計算器32對上述計數值增加1單位。若上述計數值小於一預定數值，則運算單元30傳送一驗證請求給使用者，且等待上述驗證請求被使用者完成確認後，運算單元30將上述至少一聲紋特徵儲存於暫存空間42內，但儲存單元40不根據上述至少一聲紋特徵變更第一特徵資料44、第二特徵資料46，以根據該第一特徵資料、該第二特徵進行安全性驗證。 If the computing unit 30 determines that the at least one voiceprint feature does not match the second feature data 46 and the first feature data 44 , the verification calculator 32 increases the count value by 1 unit. If the count value is less than a predetermined value, the computing unit 30 sends a verification request to the user, and after waiting for the verification request to be confirmed by the user, the computing unit 30 stores the at least one voice fingerprint feature in the temporary storage space 42 , but the storage unit 40 does not change the first characteristic data 44 and the second characteristic data 46 according to the at least one voice fingerprint characteristic, so as to perform security verification based on the first characteristic data and the second characteristic.

於一實施例中，若上述計數值達該預定數值，則運算單元30根據暫存空間42內的上述至少一聲紋特徵訓練第二特徵資料46，以產生一調整後第二特徵資料46。產生調整後第二特徵資料46後，驗證計算器32調整該計數值為0單位，且運算單元30判斷至少一聲紋特徵與第一特徵資料44、調整後第二特徵資料46是否相符，以進行安全性驗證。舉例來說，上述安全性驗證可以在調整後第二特徵資料46建立完當下立即執行，以更進一步達成一快速安全性驗證之效果。 In one embodiment, if the count value reaches the predetermined value, the computing unit 30 trains the second feature data 46 according to the at least one voice fingerprint feature in the temporary storage space 42 to generate an adjusted second feature data 46 . After the adjusted second characteristic data 46 is generated, the verification calculator 32 adjusts the count value to 0 units, and the computing unit 30 determines whether at least the voice fingerprint characteristic is consistent with the first characteristic data 44 and the adjusted second characteristic data 46, so as to Perform security verification. For example, the above security verification can be executed immediately after the adjusted second characteristic data 46 is created to further achieve the effect of a fast security verification.

在另一範例中，上述安全性驗證可以待後續收到新的一聲紋特徵，再將此新的一聲紋特徵與第一特徵資料44、調整後第二特徵資料46比對，如此同樣可達到安全性驗證之目的，且相較於前一個範例，可降低誤判斷為「不通過」之機率，提昇整體流程的順暢度，進而提昇使用者體驗。 In another example, the above security verification can wait until a new voiceprint feature is subsequently received, and then compare the new voiceprint feature with the first feature data 44 and the adjusted second feature data 46, and so on. It can achieve the purpose of security verification, and compared with the previous example, it can reduce the probability of misjudgment as "failed", improve the smoothness of the overall process, and thereby improve the user experience.

在一些實施例中，上述至少一聲紋特徵可對應於使用者受到年齡、情緒、或身體狀況等因素影響後之異常聲紋特徵，且造成異常聲紋特徵之狀況恐已難以恢復。若異常聲紋的上述至少一聲紋特徵已達一定預定次數(預定時間)而不能符合第一特徵資料44、第二特徵資料46，則可判斷為第二特徵資料46 已不適用為一聲紋異常變化達穩定狀況的聲紋識別標準，故利用暫存空間42內的聲紋特徵重新訓練第二特徵資料46，產生一調整後第二特徵資料46，使其重新作為聲紋識別之標準來進行安全性驗證。 In some embodiments, the above-mentioned at least voiceprint characteristics may correspond to abnormal voiceprint characteristics of the user affected by factors such as age, emotion, or physical condition, and the conditions causing the abnormal voiceprint characteristics may be difficult to recover. If the above-mentioned at least one voiceprint feature of the abnormal voiceprint has reached a certain predetermined number of times (predetermined time) and cannot match the first feature data 44 and the second feature data 46, it can be determined to be the second feature data 46 The voiceprint recognition standard for abnormal changes in voiceprint to reach a stable state is no longer applicable. Therefore, the voiceprint features in the temporary storage space 42 are used to retrain the second feature data 46 to generate an adjusted second feature data 46 to retrain it. Used as a standard for voiceprint recognition for security verification.

於一實施例中，聲紋辨識裝置100、200可設置於透過網路執行金融交易的個人電腦、膝上型電腦、蜂窩電話、相機電話、智慧型電話、個人數位助理、媒體播放機、導航設備、電子郵件收發設備、遊戲控制台、平板電腦、可穿戴設備或者這些設備中的任意幾種設備的組合。 In one embodiment, the voiceprint recognition devices 100 and 200 can be installed on personal computers, laptops, cellular phones, camera phones, smart phones, personal digital assistants, media players, and navigation systems that perform financial transactions over the Internet. device, email device, game console, tablet, wearable, or a combination of any of these devices.

於一實施例中，運算單元30可以是一中央處理器(central processing unit，CPU)，亦可配置為其他運算能力足夠的元件。 In one embodiment, the computing unit 30 may be a central processing unit (CPU), or may be configured as other components with sufficient computing capabilities.

於一實施例中，儲存單元40可以是一伺服器(server)，用於儲存一或多個公司、銀行或機構的聲紋辨識資料庫。於另一實施例中，儲存單元40可以是一任何型態的固定式或可移動式的隨機存取記憶體(random access memory，RAM)、唯讀記憶體(read-only memory，ROM)、快閃記憶體(flash memory)、硬碟(hard disk drive，HDD)、固態硬碟(solid state drive，SSD)或類似元件或上述元件的組合。 In one embodiment, the storage unit 40 may be a server for storing voiceprint recognition databases of one or more companies, banks or institutions. In another embodiment, the storage unit 40 may be any type of fixed or removable random access memory (RAM), read-only memory (ROM), Flash memory (flash memory), hard disk drive (HDD), solid state drive (SSD) or similar components or a combination of the above components.

於一實施例中，上述驗證請求所包含的使用者身份特徵之識別方法可以是簡訊、傳真、電子郵件、應用程式、數位簽章、或基於其他生物學特徵(例如為臉部影像、指紋、虹膜等辨識)等。 In one embodiment, the identification method of the user identity included in the above verification request may be SMS, fax, email, application, digital signature, or based on other biological characteristics (such as facial image, fingerprint, Iris recognition), etc.

於一實施例中，儲存單元40可包含一深度學習模型，用以訓練第一特徵資料44、第二特徵資料46，上述深度學習模型可包括各種類型的神經網路模型中的至少一者，所述各種類型的神經網路模型包括卷積神經網路(convolution neural network，CNN)、區域卷積神經網路(region with convolution neural network，R-CNN)、區域建議網絡(region proposal network，RPN)、遞迴神經網路(recurrent neural network，RNN)、基於堆疊的深度神經網路(stacking-based deep neural network，S-DNN)、狀態空間動態神經網路(state-space dynamic neural network，S-SDNN)、反卷積網路(deconvolution network)、深度置信網路(deep belief network，DBN)、受限玻爾茲曼機(restricted Boltzmann machine，RBM)、全卷積網路(fully convolutional network)、長短期記憶體(long short-term memory，LSTM)網路以及分類網路(classification network)。 In one embodiment, the storage unit 40 may include a deep learning model for training the first feature data 44 and the second feature data 46. The above-mentioned deep learning model may include at least one of various types of neural network models, The various types of neural network models include convolutional neural network (CNN), region with convolutional neural network (region with convolution) neural network (R-CNN), region proposal network (RPN), recurrent neural network (RNN), stacking-based deep neural network (S-DNN) ), state-space dynamic neural network (S-SDNN), deconvolution network (deconvolution network), deep belief network (DBN), restricted Boltzmann machine (restricted Boltzmann machine, RBM), fully convolutional network, long short-term memory (LSTM) network and classification network.

請一併參閱圖3與圖4，圖3係根據本發明之一實施例繪示一種聲紋辨識方法之流程圖，圖4係根據本發明之另一實施例繪示一種聲紋辨識方法之流程圖。 Please refer to FIG. 3 and FIG. 4 together. FIG. 3 is a flow chart illustrating a voiceprint identification method according to one embodiment of the present invention. FIG. 4 is a flow chart illustrating a voiceprint identification method according to another embodiment of the present invention. flowchart.

於一實施例中，如圖3所示，於步驟S1中，藉由音訊接收單元10擷取一音頻資料。 In one embodiment, as shown in FIG. 3 , in step S1 , audio data is captured by the audio receiving unit 10 .

於步驟S2中，聲紋辨識單元20接收音訊接收單元10所傳送的上述音頻資料，且辨識出上述音頻資料內的至少一聲紋特徵。 In step S2, the voiceprint recognition unit 20 receives the audio data transmitted by the audio receiving unit 10, and identifies at least one voiceprint feature in the audio data.

於步驟S3中，運算單元30接收聲紋辨識單元20所傳送的上述至少一聲紋特徵、接收儲存單元40所傳送的第一特徵資料44，用以比對上述聲紋特徵與第一特徵資料44是否相符。 In step S3, the computing unit 30 receives the at least one voiceprint feature sent by the voiceprint recognition unit 20 and the first feature data 44 sent by the storage unit 40 to compare the voiceprint feature with the first feature data. 44 is consistent.

於步驟S4中，若上述至少一聲紋特徵符合第一特徵資料44，則運算單元30將上述至少一聲紋特徵儲存於暫存空間42內，且運算單元30根據暫存空間42內的上述至少一聲紋特徵訓練第一特徵資料44，以產生一調整後第一特徵資料44，以根據調整後第一特徵資料44進行安全性驗證。舉例來說，上述安全性驗證可以在調整後第一特徵資料44建立完當下立即執行，以更進一步達成一快速安全性驗證之效果。 In step S4, if the at least one voiceprint feature matches the first feature data 44, the computing unit 30 stores the at least one voiceprint feature in the temporary storage space 42, and the computing unit 30 stores the at least one voiceprint feature in the temporary storage space 42 according to the above feature data in the temporary storage space 42. At least the voiceprint feature trains the first characteristic data 44 to generate an adjusted first characteristic data 44 for security verification based on the adjusted first characteristic data 44 . For example, the above security The security verification can be executed immediately after the adjusted first characteristic data 44 is created to further achieve the effect of a fast security verification.

於步驟S5中，運算單元30可包含一驗證計算器32，驗證計算器32之計數值於一初始狀態為0單位，若上述聲紋特徵不符第一特徵資料44，則增加驗證計算器32之計數值1單位。若上述計數值小於一預定數值，則運算單元30將傳送一驗證請求給使用者，當上述驗證請求被使用者完成確認後，則運算單元30將上述至少一聲紋特徵儲存於暫存空間42內，但不根據上述至少一聲紋特徵變更第一特徵資料44，以根據第一特徵資料44進行安全性驗證。 In step S5, the computing unit 30 may include a verification calculator 32. The count value of the verification calculator 32 is 0 units in an initial state. If the above-mentioned voiceprint feature does not match the first feature data 44, the verification calculator 32 will be increased. The count value is 1 unit. If the count value is less than a predetermined value, the computing unit 30 will send a verification request to the user. When the verification request is confirmed by the user, the computing unit 30 will store the at least one fingerprint feature in the temporary storage space 42 within, but does not change the first characteristic data 44 based on the at least one voiceprint characteristic, so as to perform security verification based on the first characteristic data 44 .

於步驟S6中，若上述至少一聲紋特徵不符第一特徵資料44，且上述計數值達一預定數值，則運算單元30使暫存空間42內的至少一聲紋特徵與第一特徵資料44產生一第二特徵資料46。 In step S6 , if the at least one voice fingerprint feature does not match the first feature data 44 and the count value reaches a predetermined value, the computing unit 30 compares the at least one voice fingerprint feature in the temporary storage space 42 with the first feature data 44 A second characteristic data 46 is generated.

於步驟S7中，第二特徵資料46產生後，驗證計算器32調整計數值為0單位，且運算單元30修正成根據第一特徵資料44、第二特徵資料46來比對至少一聲紋特徵與第一特徵資料44、第二特徵資料46是否相符，以進行安全性驗證。 In step S7, after the second characteristic data 46 is generated, the verification calculator 32 adjusts the count value to 0 units, and the computing unit 30 is modified to compare at least the voiceprint characteristics according to the first characteristic data 44 and the second characteristic data 46. Whether it is consistent with the first characteristic data 44 and the second characteristic data 46 for security verification.

於一實施例中，如圖4所示，於步驟S8中，運算單元30修正成根據第一特徵資料44、第二特徵資料46來比對至少一聲紋特徵與第一特徵資料44、第二特徵資料46是否相符，而其中運算單元30先根據第二特徵資料46來比對至少一聲紋特徵與第二特徵資料46是否相符。 In one embodiment, as shown in FIG. 4 , in step S8 , the computing unit 30 is modified to compare at least one voice fingerprint feature with the first feature data 44 and the second feature data 46 based on the first feature data 44 and the second feature data 46 . Whether the two feature data 46 are consistent, and the computing unit 30 first compares at least the voice fingerprint feature and the second feature data 46 based on the second feature data 46 to see whether they are consistent.

於步驟S9中，若上述聲紋特徵符合第二特徵資料46，則運算單元30將上述至少一聲紋特徵儲存於暫存空間42內，且儲存單元40根據暫存空間42內的上述至少一聲紋特徵訓練第二特徵資料46，以產生一調整後第二特徵資料46，並根據調整後第二特徵資料46進行安全性驗證。舉例來說，上述安全性驗證可以在調整後第二特徵資料46建立完當下立即執行，以更進一步達成一快速安全性驗證之效果。 In step S9, if the voiceprint feature matches the second feature data 46, the computing unit 30 stores the at least one voiceprint feature in the temporary storage space 42, and the storage unit 40 stores the voiceprint feature according to the at least one feature in the temporary storage space 42. The voiceprint feature trains the second feature data 46 to generate an adjusted second feature data 46, and security verification is performed based on the adjusted second feature data 46. For example, the above security verification can be executed immediately after the adjusted second characteristic data 46 is created to further achieve the effect of a fast security verification.

於步驟S10中，若上述聲紋特徵不符第二特徵資料46，則運算單元30再根據第一特徵資料44來比對上述聲紋特徵與第一特徵資料44是否相符。 In step S10 , if the voiceprint feature does not match the second feature data 46 , the computing unit 30 then compares the voiceprint feature with the first feature data 44 based on the first feature data 44 .

於步驟S11中，若上述聲紋特徵不符第二特徵資料46、但符合第一特徵資料44，則運算單元30移除第二特徵資料46，以根據第一特徵資料44進行安全性驗證。 In step S11 , if the voiceprint feature does not match the second feature data 46 but matches the first feature data 44 , the computing unit 30 removes the second feature data 46 to perform security verification based on the first feature data 44 .

於步驟S12中，驗證計算器32之計數值於一初始狀態為0單位，若上述聲紋特徵不符第二特徵資料46且不符第一特徵資料44，則增加驗證計算器32之計數值1單位。若上述計數值小於一預定數值，則傳送一驗證請求給使用者，上述驗證請求被使用者完成確認後，運算單元30傳送上述聲紋特徵至暫存空間 42內儲存，但儲存單元40不根據上述至少一聲紋特徵變更第一特徵資料44、第二特徵資料46，以根據第一特徵資料44、第二特徵資料46進行安全性驗證。 In step S12, the count value of the verification calculator 32 is 0 units in an initial state. If the voiceprint feature does not match the second feature data 46 and does not match the first feature data 44, then the count value of the verification calculator 32 is increased by 1 unit. . If the count value is less than a predetermined value, a verification request is sent to the user. After the verification request is confirmed by the user, the computing unit 30 sends the voiceprint feature to the temporary storage space. 42, but the storage unit 40 does not change the first characteristic data 44 and the second characteristic data 46 according to the at least one voice fingerprint characteristic, so as to perform security verification based on the first characteristic data 44 and the second characteristic data 46.

於步驟S13中，若上述至少一聲紋特徵不符第二特徵資料46且不符第一特徵資料44，且上述計數值達一預定數值，則運算單元30根據暫存空間42的上述至少一聲紋特徵訓練第二特徵資料46，以產生一調整後第二特徵資料46。 In step S13 , if the at least one voiceprint feature does not match the second feature data 46 and the first feature data 44 , and the count value reaches a predetermined value, the computing unit 30 calculates the at least one voiceprint feature based on the at least one voiceprint feature in the temporary storage space 42 . Feature training second feature data 46 to generate adjusted second feature data 46 .

於步驟S14中，產生調整後第二特徵資料46後，該驗證計算器調整該計數值為0單位，運算單元30修正成根據第一特徵資料44、調整後第二特徵資料46來判斷至少一聲紋特徵與第一特徵資料44、調整後第二特徵資料46是否相符，以進行安全性驗證。舉例來說，上述安全性驗證可以在調整後第二特徵資料46建立完當下立即執行，以更進一步達成一快速安全性驗證之效果。 In step S14, after the adjusted second characteristic data 46 is generated, the verification calculator adjusts the count value to 0 units, and the computing unit 30 is modified to determine at least one based on the first characteristic data 44 and the adjusted second characteristic data 46. Whether the voiceprint characteristics match the first characteristic data 44 and the adjusted second characteristic data 46 is used for security verification. For example, the above security verification can be executed immediately after the adjusted second characteristic data 46 is created to further achieve the effect of a fast security verification.

於一實施例中，聲紋辨識方法300、400可設置於電子金融交易系統，例如為自動櫃員機(ATM)、網路銀行、或其他金融交易系統上。 In one embodiment, the voiceprint recognition methods 300 and 400 can be configured on an electronic financial transaction system, such as an automated teller machine (ATM), online banking, or other financial transaction systems.

於一實施例中，聲紋辨識方法300、400中之聲紋辨識裝置可包含一深度學習模型，用以訓練第一特徵資料44、第二特徵資料46，上述深度學習模型可包括各種類型的神經網路模型中的至少一者，所述各種類型的神經網路模型包括卷積神經網路(convolution neural network，CNN)、區域卷積神經網路(region with convolution neural network，R-CNN)、區域建議網絡(region proposal network，RPN)、遞迴神經網路(recurrent neural network，RNN)、基於堆疊的深度神經網路(stacking-based deep neural network，S-DNN)、狀態空間動態神經網路(state-space dynamic neural network，S-SDNN)、反卷積網路(deconvolution network)、深度置信網路(deep belief network，DBN)、受限玻爾茲曼機(restricted Boltzmann machine，RBM)、全卷積網路(fully convolutional network)、長短期記憶體(long short-term memory，LSTM)網路以及分類網路(classification network)。 In one embodiment, the voiceprint recognition device in the voiceprint recognition methods 300 and 400 may include a deep learning model for training the first feature data 44 and the second feature data 46. The above-mentioned deep learning model may include various types of At least one of the neural network models, the various types of neural network models Types include convolutional neural network (CNN), region with convolutional neural network (R-CNN), region proposal network (RPN), recurrent neural network ( recurrent neural network (RNN), stacking-based deep neural network (S-DNN), state-space dynamic neural network (S-SDNN), deconvolution network deconvolution network, deep belief network (DBN), restricted Boltzmann machine (RBM), fully convolutional network, long short-term memory (long short-term memory) short-term memory, LSTM) network and classification network.

本發明對於電子裝置上的金融交易系統可做為一道輔助認證的功能。使用者於登入系統環境欲進行金融交易等動作時，藉由讀取裝置使用時所記錄的聲紋圖譜，透過深度學習、加入遞歸神經網路來訓練聲紋圖譜模型，藉此改善認證的品質，讓模型可以更準確地辨認使用者本人的指令，進而將此項認證應用於各種金融交易系統中，例如為行動銀行(網路銀行)等。 The present invention can be used as an auxiliary authentication function for financial transaction systems on electronic devices. When users log into the system environment and want to perform financial transactions and other actions, they read the voiceprint pattern recorded when the device is used, and train the voiceprint pattern model through deep learning and adding recursive neural networks to improve the quality of authentication. , so that the model can more accurately identify the user's own instructions, and then apply this authentication to various financial transaction systems, such as mobile banking (online banking), etc.

惟以上所述者，僅為本發明之實施例而已，當不能以此限定本發明實施之範圍，凡是依本發明申請專利範圍及專利說明書內容所作之簡單的等效變化與修飾，皆仍屬本發明專利涵蓋之範圍內。 However, the above are only examples of the present invention, and should not be used to limit the scope of the present invention. All simple equivalent changes and modifications made based on the patent scope of the present invention and the content of the patent specification are still within the scope of the present invention. Within the scope covered by the patent of this invention.

400:聲紋辨識方法 400: Voiceprint identification method

S1~S14:步驟 S1~S14: steps

S1’,S2’:步驟 S1’, S2’: steps

Claims

A low-noise voiceprint recognition device for financial trading systems, including: an audio receiving unit, used to capture an audio data; a voiceprint recognition unit, connected to the audio receiving unit, the voiceprint recognition unit The audio data is received by the audio receiving unit, and the voiceprint recognition unit identifies at least one voiceprint feature of the audio data; a storage unit stores a first feature data; a computing unit is connected to the voiceprint recognition unit and the storage unit, the computing unit receives the at least voiceprint characteristics from the voiceprint recognition unit and the first characteristic data from the storage unit, and the computing unit includes: a verification calculator, the verification calculator The count value is 0 units in an initial state, wherein the computing unit determines whether the at least one voiceprint feature matches the first feature data based on the first feature data; wherein, if the computing unit determines that the at least one voiceprint feature is consistent with the first feature data, If the first characteristic data is met, the computing unit stores the at least one voice fingerprint feature in a temporary storage space of the storage unit, and trains the first feature data based on the at least one voice fingerprint feature in the temporary storage space, To generate an adjusted first characteristic data, and perform security verification based on the adjusted first characteristic data; if the computing unit determines that the at least one voice fingerprint characteristic does not match the first characteristic data, the verification calculator calculates the calculated The value increases by 1 unit, and when the count value is less than a predetermined value, the computing unit sends a verification request, and after the verification request is confirmed, the computing unit stores the at least one voice fingerprint feature in the temporary storage space , but the storage unit does not change the first characteristic data based on the at least one voiceprint characteristic, so as to perform security verification based on the first characteristic data, wherein the verification request includes identification of the user's identity characteristics; If the computing unit determines that the at least one voiceprint feature does not match the first feature data, and the count value reaches the predetermined value, then the calculation unit determines that the at least one voiceprint feature in the temporary storage space and the first feature data Generate a second characteristic data; and after the second characteristic data is generated, the verification calculator adjusts the count value to 0 units, and the computing unit stores the first characteristic data and the second characteristic data in the storage unit, Determining whether the at least one voiceprint feature matches the first feature data and the second feature data based on the first feature data and the second feature data for security verification, including: if the computing unit determines that the at least one voiceprint feature is consistent with the first feature data and the second feature data. If the voiceprint feature matches the second feature data, the computing unit transmits the at least voiceprint feature and stores it in the temporary storage space, and trains the second feature data based on the at least voiceprint feature in the temporary storage space. , to generate an adjusted second characteristic data, and perform security verification based on the adjusted second characteristic data; if the computing unit determines that the at least one voice fingerprint characteristic does not match the second characteristic data, but matches the first characteristic data , then the computing unit removes the second characteristic data from the storage unit to perform security verification based on the first characteristic data; and after the second characteristic data is generated, if the computing unit determines that the at least one voiceprint characteristic does not match the second characteristic data and does not match the first characteristic data, the verification calculator increases the count value by 1 unit, and when the count value is less than a predetermined value, the computing unit sends a verification request, and in the verification After the request is completed and confirmed, the computing unit stores the at least one voiceprint feature in the temporary storage space, but the storage unit does not change the first feature data and the second feature data based on the at least one voiceprint feature. The first characteristic data and the second characteristic data are subject to security verification, wherein identifying the at least one voiceprint feature in the audio data further includes at least one of the following operations: filtering, reducing noise, suppressing background noise, amplifying specific voiceprint features, calculating vector parameters of the audio data, and detecting specific keys words, detect specific sound wave bands, detect specific sound wave waveforms, and detect specific sound wave frequencies.

The low-noise voiceprint recognition device of claim 1, wherein the computing unit determines whether the at least voiceprint feature is consistent with the first feature data and the second feature data further includes: after the second feature data is generated, If the count value reaches the predetermined value, the computing unit trains the second feature data based on the at least one voice fingerprint feature in the temporary storage space to generate an adjusted second feature data; and after generating the adjusted second feature data After obtaining the two characteristic data, the verification calculator adjusts the count value to 0 units, and the computing unit determines the characteristics of the at least voiceprint and the first characteristic data and the adjusted second characteristic data based on the first characteristic data and the adjusted second characteristic data. Check whether the adjusted second characteristic data is consistent for security verification.

The low-noise voiceprint recognition device of claim 1, wherein the voiceprint recognition device further includes: a deep learning algorithm for training the first feature data and the second feature data, and the deep learning algorithm includes a A Deep Neural Network (DNN) model, a Convolutional Neural Network (CNN) model, a Recurrent Neural Network (RNN), and one of their combinations.

A low-noise voiceprint recognition method for financial trading systems, applied to a voiceprint recognition device. The voiceprint recognition device includes an arithmetic unit and a verification calculator. The count value of the verification calculator is in an initial state. is 0 units. The voiceprint recognition method includes controlling the computing unit to perform the following steps: Retrieve an audio data; identify at least one voiceprint feature in the audio data; provide a first feature data, a temporary storage space and a verification request; based on the first feature data, compare the at least one voiceprint feature with Whether the first characteristic data matches; if the at least one voice fingerprint characteristic matches the first characteristic data, the at least one voice fingerprint characteristic is stored in the temporary storage space, and the at least one voice fingerprint characteristic in the temporary storage space is Feature training the first feature data to generate an adjusted first feature data for security verification based on the adjusted first feature data; if the at least one voiceprint feature does not match the first feature data, the verification The count value of the calculator is increased by 1 unit, and when the count value is less than a predetermined value, the verification request is sent. After the verification request is confirmed, the at least one fingerprint feature is stored in the temporary storage space, but no The first feature data is modified based on the at least voiceprint feature to perform security verification based on the first feature data, where the verification request includes identification of user identity features; if the count value reaches the predetermined value, based on the The at least one voiceprint feature and the first feature data in the temporary storage space generate a second feature data; and after the second feature data is generated, the count value of the verification calculator is adjusted to 0 units, and according to the The first characteristic data and the second characteristic data compare whether the at least one voiceprint characteristic is consistent with the first characteristic data and the second characteristic data for security verification, including: If the computing unit determines that the at least one voiceprint feature matches the second feature data, the computing unit transmits the at least one voiceprint feature and stores it in the temporary storage space, and based on the at least one voiceprint feature in the temporary storage space Feature training the second feature data to generate an adjusted second feature data, and perform security verification based on the adjusted second feature data; if the computing unit determines that the at least one voiceprint feature does not match the second feature data, However, if the first characteristic data is met, the computing unit removes the second characteristic data from a storage unit to perform security verification based on the first characteristic data; and after the second characteristic data is generated, if the computing unit If it is determined that the at least one voiceprint feature does not match the second feature data and does not match the first feature data, the verification calculator increases the count value by 1 unit, and when the count value is less than a predetermined value, the computing unit sends a Verification request, and after the verification request is completed and confirmed, the computing unit stores the at least one voice fingerprint feature in the temporary storage space, but the storage unit does not change the first feature data and the first feature data based on the at least one voice fingerprint feature. The second characteristic data is used to perform security verification based on the first characteristic data and the second characteristic data, wherein identifying the at least one fingerprint characteristic in the audio data further includes at least one of the following operations: filtering, noise reduction , suppress background noise, amplify specific voiceprint features, calculate vector parameters of the audio data, detect specific keywords, detect specific sound wave bands, detect specific sound wave waveforms, and detect specific sound wave frequencies.

The low-noise voiceprint identification method of claim 4, wherein comparing whether the at least one voiceprint feature matches the first feature data and the second feature data further includes the following steps: After the second characteristic data is generated and the count value reaches the predetermined value, the second characteristic data is trained according to the at least one voiceprint characteristic in the temporary storage space to generate an adjusted second characteristic data; and After the adjusted second characteristic data is generated, the count value of the verification calculator is adjusted to 0 units, and based on whether the at least one voiceprint characteristic is consistent with the first characteristic data and the adjusted second characteristic data, the process is performed Security verification.

The low-noise voiceprint recognition method of claim 4, wherein the voiceprint recognition device further includes: a deep learning algorithm for training the first feature data and the second feature data, and the deep learning algorithm includes a A Deep Neural Network (DNN) model, a Convolutional Neural Network (CNN) model, a Recurrent Neural Network (RNN), and one of their combinations.