TWI234762B

TWI234762B - Voiceprint identification system for e-commerce

Info

Publication number: TWI234762B
Application number: TW92136456A
Authority: TW
Inventors: Kun-Lang Yu; Andy Cheng; Yen-Chieh Ouyang
Original assignee: Top Dihital Co Ltd
Priority date: 2003-12-22
Filing date: 2003-12-22
Publication date: 2005-06-21
Also published as: TW200521962A

Abstract

An e-commerce method includes steps of: accessing user's login through an electronic communication means; using an identification device to recognize a password; using the identification device to verify whether the user's registration for voiceprint identification; using a voiceprint identification system to identify or register for voiceprint identification; and using an identification device to decide for allowing or rejecting the user for proceeding e-commerce.

Description

12347621234762

【發明所屬之技術領域】本發明係關於一種電子商務交易之聲紋驗證系統，其別有關於進行電子商務交易之聲紋驗證外，择么言ς八佈機率、動態時間校準演算法及隱藏式馬可夫ϋ冋並二用維特比〔Viterbi〕演算法獲得最相似路徑以模型參數之聲紋驗證系統。 "" 【先前技術】[Technical field to which the invention belongs] The present invention relates to a voiceprint verification system for e-commerce transactions. In addition to voiceprint verification for e-commerce transactions, choose a language, eight cloth probability, dynamic time calibration algorithm, and hiding. The Markov Pyramid II uses a Viterbi algorithm to obtain a voiceprint verification system that uses the most similar path to model parameters. " " [Prior art]

習用電子商務交易方法，如中華民國專利公告第38541 6 號「電子商務系統」發明專利，其揭示一種在一網路上之一交易記錄〔transaction 1〇g〕提供存檔安全〔archiving safety〕的電子商務系統〔c〇mmerce system〕，其包括：―對話密鑰產生器〔sessi〇n以又 creator〕用以產生一對話密鑰以加密該交易記錄；一易記錄加密器〔encryptor〕用以加密使用該對話密鑰| 該交易記錄；及一交易記錄發送器用以將該已加密交易記錄發这至該網路上之一存檔伺服器〔server〕。然而，該第3 8 5 41 6唬僅將父易記錄加密以便進行資料傳輸及儲存，未針對使用者加以辨識身分。另一習用電子商務交易方法，如中華民國專利公告第 550477 ^網站帳戶之方法、系統及電腦可讀取媒體及自中央位置之電子商務管理」發明專利，其揭示一種用以管理一使用者在一目的地電子商務網站上進行線上〔中央網站〕金融交易之方法，其包括：登入使用者至一目的地電子商矛/；、，周站，產生泫使用者於該中央網站中的唯一使用者Conventional e-commerce transaction methods, such as the Republic of China Patent Bulletin No. 38541 6 "E-commerce system" invention patent, which reveals an e-commerce that provides archiving safety for a transaction record (transaction 10g) on a network A system [c〇mmerce system], which includes:-a session key generator [session and creator] to generate a session key to encrypt the transaction record; an easy record encryptor [encryptor] for encryption use The session key | the transaction record; and a transaction record sender for sending the encrypted transaction record to an archive server on the network. However, the 3rd, 3rd, 5th, 4th, and 6th encryption only encrypted the parent record for data transmission and storage, and did not identify the user. Another conventional e-commerce transaction method, such as the Republic of China Patent Bulletin No. 550477 ^ website account method, system and computer-readable media and e-commerce management from a central location "invention patent, which discloses a method for managing a user's A method for conducting online [central website] financial transactions on a destination e-commerce website, which includes: logging in users to a destination e-commerce platform; and, weekly stations, generating the sole use of the user on the central website By

第7頁 1234762 五、發明說明（2) 名稱及密碼；利用該唯一使用使用者名稱及密碼在一或多，目的地網站上進行註冊；傳輸一啟用指令至一處理金融 ί易t 2 Ϊ Ϊ用以啟動該使用者的一信用卡或簽帳卡帳戶，备該#用卡或簽帳卡帳戶為啟用狀態時，經由詨地電子商務網站發送該信用卡或簽帳卡帳戶之一 =及傳輸—撤銷指令至該金融機構以撤：該 “用卡或簽帳卡帳戶；其中當該信用卡或簽帳卡帳戶為啟該金融機構只接受及處理從該電子商務網站所二ί :ί求；及其中當該信用卡或簽帳卡帳戶為撤銷 i i二ί融機構拒絕付費請求。然而，該第5 5 〇 4 m虎用使用者之唯一名稱及密碼加以辨識身分，因此豆且有岔碼洩漏的疑慮。八八 12 ΐ i:該第38541 6號及第550477號之電子商務交+需 V加以改良，以便能準確辨識使用者之身分。& 聲ί驗證：法，如中華民國專利公告第490655 J：用聲错資訊辯識使用者的方法與其裝置」發明專 f不同使用者特有的聲错資訊辨識使用者的身伤’以決疋使用者是否經過授權。該方法包含步驟： ”立、A_用者發出語音後，偵剛語音之終點；〔2〕、自 = ” = 特徵；〔3〕、決定是否需要訓 -界阳，〗」「二特徵作為一參考樣本，同時設定符二去#否」則進仃下—步驟；〔4〕、將該語音特樣本進行圖樣比對；〔5〕、依兩者之間距之距離；〔6〕、將該計算結果與設定界限比Page 71234762 V. Description of the invention (2) Name and password; use the unique user name and password to register on one or more destination websites; transmit an activation instruction to a processing finance 易易 2 2 Ϊ Ϊ Used to activate a credit or debit card account for the user. When the #used or debit card account is enabled, send one of the credit or debit card account via the local e-commerce website = and transmit — Revocation order to the financial institution to withdraw: the "used or debit card account; when the credit or debit card account is activated, the financial institution only accepts and processes requests from the e-commerce website; Among them, when the credit or debit card account was revoked, the financial institution refused to pay the request. However, the 5504m tiger used the user's unique name and password to identify him, so the bean and the fork code leaked. Doubt. 88:12 i: The e-commerce transactions No. 38541 6 and No. 550477 need to be improved so that users can be accurately identified. &Amp; Voice verification: law, such as the Republic of China Patent Bulletin 490655 J: Method wrong user identification information with its sound devices "designed f invention different user-specific sound wrong user identification information body injury 'to determine whether the user is authorized Cloth. The method includes the following steps: "Li, A_ Detect the end of the speech just after the user has spoken; [2], since =" = feature; [3], decide whether training is needed-Jieyang, "" and "two features as Take a reference sample and set the symbol two to go to #No ”, then proceed to the next step— [4], pattern comparison of the voice special samples; [5], according to the distance between the two; [6], will Ratio of the calculation result to the set limit

·_ \L(X;〇-5\FIVH Ci)NTINli‘NTS\pj(()35| 第8頁 1234762 五、發明說明（3) 較；〔7〕、依該比較結果決定該使用者是否為一授權使用者。該方法係使用於行動電話，其利用聲譜分析方法將語音之獨特資訊取出，藉此進行辨識使用者之方法。該第 490655號主要利用每一時框〔fraine〕之主要值與使用者設定的界限進行比較，決定語音之始點與終點後，再利用 Princen-Bradley濾波器轉換已偵測的語音訊號，以便取得其對應聲譜圖案。該聲譜圖案與預先儲存之參考聲譜樣本進行比對，以辨識使用者之聲紋。簡言之，該第490655號需要進行圖案的匹配及距離的運算’若該運算距離未超過界限時，使用者即可通過聲紋辨識。然而，該第4 9 0 6 5 5號在進行圖案的匹配及距離的運算時，必須計算在參考樣本及測試樣本之間的距離。事實上’該參考樣本所佔用資料庫的空間相當大，因此其來但需要較大的資料庫空間且需要更長的檔案傳輸時間。若4將忒聲紋驗證技術能應用在電子商務交易時，具有延長交易時間的缺點。因此，該第490655號仍有必要進一步改良其參考樣本之佔用空間的問題，如此能節省儲存參考樣本之資料庫空間’以避免使用者數量的限制。利用減少該參考樣本之位元方法，更能加速聲紋驗證所需時間，且更能提升辨識率’以便將聲紋驗證技術能應用在電子商務交易時，能縮短交易時間。有鑑於此’本發明改良上述之缺點，其在進行電子商務交易時，除了利用聲紋驗證系統進行辨識使用者之身分· _ \ L (X; 〇-5 \ FIVH Ci) NTINli'NTS \ pj (() 35 | Page 81234762 V. Description of the invention (3) comparison; [7], determine whether the user is based on the comparison result It is an authorized user. This method is used in mobile phones. It uses sound spectrum analysis method to extract the unique information of speech to identify the user. This 490655 mainly uses the main of each frame [fraine] The value is compared with the limit set by the user. After determining the start and end points of the speech, the detected speech signal is converted by the Princen-Bradley filter in order to obtain its corresponding sound spectrum pattern. The sound spectrum pattern and the pre-stored The reference sound spectrum samples are compared to identify the user's voiceprint. In short, the 490655 requires pattern matching and distance calculation. 'If the calculation distance does not exceed the limit, the user can pass the voiceprint. Identification. However, when performing pattern matching and distance calculations, the No. 49 0 65 5 must calculate the distance between the reference sample and the test sample. In fact, 'the reference sample occupies a considerable amount of space in the database Large, so it comes but requires a larger database space and longer file transfer time. If 4 can apply the voiceprint verification technology to e-commerce transactions, it has the disadvantage of extending the transaction time. Therefore, this section 490655 It is still necessary to further improve the problem of the space occupied by its reference samples, so as to save the database space for storing the reference samples' to avoid the limitation of the number of users. By reducing the bit method of the reference samples, the voiceprint verification can be accelerated. The time required, and the recognition rate can be improved, so that the voiceprint verification technology can be applied to e-commerce transactions, which can shorten the transaction time. In view of this, the present invention improves the above-mentioned disadvantages. Identify users with voiceprint verification system

c ： \ L(K；(). 5 uq VE C()NT l Nl-NTS\ PK9 3 51. P ul 第9頁 1234762 五、發明說明（4) 外，且該聲紋準演算法及隱最相似路徑，【發明内容】本發明主要統，其在進行識使用者之身本發明次要統，其除了進斯分佈機率、並利用維特比數，使本發明根據本發明戶帳號由一連戶基本對；利識；及本發部、— 進行訓前端處訊；再再進行作為模資料；用一聲該可辨明之聲訓練系練或測理部自利用該運算該型參數驗證系統另結合高斯分佈機率、藏式馬可夫模式，並利用維特比^時間校以便計算模型參數。 /、异法獲得目的係提供一種電子商務交易之電子商務交易肖，利用聲紋驗證系：：證系分，使本發明具有提升辨識率之^力效。仃辨目的係提供一種電子商務交易之聲^么行電子商務交易之聲紋驗證外，其、j证不動態時間校準演算法及隱藏式馬;，高凟算法獲得最相似路徑，以便計算模型= 具有簡化訓練及測試作業之功效。、|多務交:易方☆，該方法包含步驟於客政置進订么錄，利用一可辨識裝置確i 該可辨識裝置進行核對是否已申請聲紋= 紋驗證系統選擇進行聲紋辨識或註冊聲纹辨識裝置決定允許或拒絕進行電子商務交易。紋驗證系統包含一前端處理部、一特徵擷取，及一測試系統，以便對原始輸入語音資料試作業。在訓練語音上，該訓練系統利用該該原始輸入語音資料擷取有效訓練語音資特徵操取部進行擷取該有效訓練語音特徵；有效訓練語音資訊以獲得最相似路徑，以便。同樣在測试語音上，該測試系統利用該前c: \ L (K; (). 5 uq VE C () NT l Nl-NTS \ PK9 3 51. P ul Page 9 1234762 5. The invention is explained (4), and the voiceprint quasi-performance algorithm and hidden The most similar path. [Summary of the Invention] The main system of the present invention is the secondary system of the present invention, which is in addition to identifying the user. In addition to the probability of distribution, and using Viterbi numbers, the present invention makes the account of the user according to the present invention continuous. The basic knowledge of the households; profit knowledge; and the development department,-to conduct training front-end processing; then again as a model data; use a discernible voice to train the training department or the measurement department to use the calculation of this type of parameter verification system In addition, it combines the probability of Gaussian distribution, the Tibetan Markov model, and uses Viterbi ^ time calibration to calculate the model parameters. / 、 The purpose of obtaining an e-commerce transaction is to provide an e-commerce transaction shaw for e-commerce transactions, using the voiceprint verification system :: certificate system This makes the present invention have a powerful effect of improving the recognition rate. The purpose of identification is to provide a voice verification of e-commerce transactions. In addition to the verification of the voiceprint of e-commerce transactions, its dynamic time calibration algorithm and hidden type horse ;, Gao Yan's algorithm to obtain the most similar path, so that the calculation model = has the effect of simplifying training and testing operations., | Multi-service delivery: Yi Fang ☆, this method includes the steps of ordering records in the guest house, using The device verifies whether the recognizable device has applied for voiceprint = print verification system chooses to perform voiceprint recognition or register voiceprint recognition device to decide whether to allow or deny e-commerce transactions. The print verification system includes a front-end processing unit, a feature extraction Fetch, and a test system to test the original input voice data. On the training voice, the training system uses the original input voice data to extract the effective training voice feature extraction unit to extract the effective training voice feature; Efficiently train speech information to obtain the most similar path so that, also on test speech, the test system uses the previous

1234762 五、發明說明（5) t f Γ :自該原始輪入語音資料擷取有效測試語音資訊； —運| ^特徵擷取部進行擷取該有效測試語音特徵；、再進 2辨識結1試語音特徵與模型參數之間相似機率以便輸出【實施方式] 確。ί本和其他目的、特徵、和優點能更明式，作詳細：ΪΠ舉本發明較佳實施例，並配合所附圖本發明較佳實施例電子商務交易之聲紋驗證聲= = 進佳實施例電子商務交易之接裝置進行登錄。該連：；置：含帳二由1 comDuter 1 ^ ^ s 入也細〔persona 1 M ^ 一自動存提款機〔Automated Teller广1234762 V. Description of the invention (5) tf Γ: Extract the valid test voice information from the original turn-by-round voice data; —Run | ^ Feature extraction section to retrieve the valid test voice feature; Similarity between speech features and model parameters in order to output [implementation] This and other objects, features, and advantages can be made more explicit, and detailed: Ϊ 举 citing the preferred embodiment of the present invention, and with the accompanying drawings of the preferred embodiment of the present invention, the voiceprint verification voice of e-commerce transactions = = Jin Jia In the embodiment, an e-commerce transaction receiving device performs login. This company :; Set: Including account 2 by 1 comDuter 1 ^ ^ s into the details [persona 1 M ^ 1 ATM [Automated Teller 广

Mach;ne〕、—特約商店刷卡機〔creducar 4 ，二Γ即可連接進行-般商務交易。 :紋驗證中心，該聲紋驗證中心可選= = 置禮-ΐίϊ: 聲紋驗證中心利用一可辨識带置確 < 客戶基本資料，該可辨識裝置包凌輯電路等。此外，該聲紋驗證中心辨識邏嗜A夂昭锋一 T U具有一聲紋驗證系統。客戶曰ϋ、由一圖所不，接著，該可辨識裝置進行核對兮請聲紋比對’即產生該客戶是否需要進^ ，.文比對之、，，。果。該聲紋驗證中心將該結果傳回該連接裝斗Mach; ne], credit card swiping machine [creducar 4], two Γ can be connected for ordinary business transactions. : Texture verification center, this voiceprint verification center is optional = = Zhili-ΐίϊ: The voiceprint verification center uses an identifiable band to set the confirmation of the customer's basic information, and the identifiable device includes the editing circuit and so on. In addition, the voiceprint verification center identification logic A 夂 ZHAO Feng-TU has a voiceprint verification system. The customer said, as shown in a picture, and then, the recognizable device checks it. Please compare the voiceprint 'to generate whether the customer needs to enter it. fruit. The voiceprint verification center returns the result to the connection bucket

：\L(Xj().5\I-IVF： C()NTINENTS\PKy35l.ptd 第11頁 1234762 、發明說明（6) 置’以便進行後續電子商務交易程序。第一圖揭示本發明較佳實施例之聲紋驗證系統之流程方塊圖。 β月參照第二圖所示，本發明較佳實施例之聲紋驗證系統 ^訓練系統1 〇及一測試系統2 0，以便對原始輸入語 t ^料進行訓練或測試作業。該聲紋驗證系統1另包含一 =端處=里部、一特徵擷取部、一儲存部及一運算部。該前 j f理部及特徵擷取部供該訓練系統1 0及測試系統20進行前，，理及特徵擷取，該儲存部供語音特徵加以儲存，該運^部則將該儲存語音特徵及輸入語音特徵加以運算。，客戶帳號輸入本發明之聲紋驗證系統1時，即可進行確W身刀。接著’該糸統依輸入帳號查詢資料庫，B 輸入帳號屬於已建立。若該輸入帳號未建立時，要以進入該訓練系統10進行語音訓練作業，以便建立健疋否輸入帳號之語音資料。若該輸入帳號已建立時，t存/y亥试系統2 0進行語音測試作業，以便辨識該輸入入°亥測特徵是否符合已儲存該輸入帳號之語音資料。、就之語音 «月再參照第一及一圖所示，接著，當客戶未對時，則進入要求客戶輸入個人密碼。若客戶二請聲紋比個人密碼後，即進入拒絕交易階段。隨客戶輪輪入不正確密碼後，要求是否申請聲紋辨識註冊。當選^入正確個人辨識註冊時，即進入允許交易階段。反之，當=申請聲紋紋辨識註冊時，即進入該聲紋驗證系統1之甽曰選擇申請聲本發明之聲紋辨識註冊操作該訓練系統丨〇之\」、來系統1 〇。砰述如下：: \ L (Xj (). 5 \ I-IVF: C () NTINENTS \ PKy35l.ptd Page 111234762, Description of Invention (6)) for subsequent e-commerce transaction procedures. The first figure reveals that the present invention is preferred The block diagram of the process of the voiceprint verification system of the embodiment. As shown in FIG. 2 with reference to the second figure, the voiceprint verification system of the preferred embodiment of the present invention ^ training system 10 and a test system 20, so that the original input t ^ Material for training or test operations. The voiceprint verification system 1 further includes a = end = inside, a feature extraction section, a storage section and a computing section. The former jf management section and feature extraction section are provided for the Before the training system 10 and the test system 20 are performed, the processing and feature extraction are performed. The storage unit is used to store the voice features, and the operation unit calculates the stored voice features and the input voice features. The customer account number is input into the present invention. When the voiceprint verification system is 1, you can confirm the body knives. Then 'the system will query the database according to the input account, and the B input account belongs to the established account. If the input account has not been established, you must enter the training system 10 Perform voice training assignments to build fitness No input the voice data of the account. If the input account has been established, the t / y test system 20 performs a voice test operation in order to identify whether the input test feature matches the voice data of the input account. For the voice «month, please refer to the first and first pictures, and then, when the customer is not correct, then enter the request for the customer to enter the personal password. If the customer 2 asks for voiceprint than the personal password, it will enter the transaction rejection stage. With the customer After entering the incorrect password in turn, ask whether to apply for voiceprint identification registration. When you choose ^ to enter the correct personal identification registration, you will enter the stage of allowing transactions. Conversely, when = apply for voiceprint identification registration, you will enter the voiceprint verification system. The first one said that he would choose to apply for the voiceprint recognition and registration of the present invention to operate the training system 丨〇 and the system 1 〇. The bang is as follows:

12347621234762

在擷取語音特徵之前，利用該前端處理部將有效語音資訊自原始輸入語音資料擷取，以濾除無效語音資訊。本發明偵測包含短時距能量〔Short-Energy〕及過零率〔Zero-Cross ing Rate〕。本發明採用結合高斯機率分佈的計算方法，其方程式如下： exP 卜 y (卜 _ Σ!-1 〇一 % (1) 其中:^為原始訊號將其分為數個d維的音框、，. 1，…，M，為所屬機率、《7為背景雜訊之期望值4為背景 1 雜訊的變異數。在此，因為中 ^ , ^ T0^D = 256 為一個定佶，故將其省略不予計算，將方程式（1)簡化如下巧個疋值Before capturing voice features, the front-end processing unit is used to extract valid voice information from the original input voice data to filter out invalid voice information. The detection of the present invention includes short-time energy [Short-Energy] and zero-crossing rate [Zero-Crossing Rate]. The present invention uses a calculation method that combines a Gaussian probability distribution, and its equation is as follows: exP BU y (Bu _ Σ! -1 〇一% (1) where: ^ is the original signal divided into several d-dimensional sound frames ,. 1,…, M are the probability of belonging, “7 is the expected value of background noise 4 is the number of variation of background 1 noise. Here, because ^, ^ T0 ^ D = 256 is a fixed value, so it is omitted Without calculation, simplify equation (1) as follows:

上式中的指數運算，在運瞀齡擔μ 士 (2) 取對數後，將方程式（2 )簡化如下·· 可能過大，故將其 :1η il/2The exponential operation in the above formula, after taking the age of μ μ (2) After taking the logarithm, the equation (2) is simplified as follows: · It may be too large, so it is: 1η il / 2

1234762 五、發明說明（8) 祕=(4吨丨4(;-扣ή) (3) 擷取原輸入語音資料前端256點，計算短時距能量及過零率的期望值及變異數，接著將該兩個數及原輸入語音資料代入該方程式（3 )進行運算。利用短時距能量與過零率的分佈機率區分有效語音資訊及無效語音資訊，將無效語音資訊加以濾除，不但減少資料量，亦能正確擷取有效語音資訊。在該特徵擷取部進行擷取特徵上，本發明採用兩個語音識別特徵參數，其包含線性預測倒頻譜係數〔L i n e a r1234762 V. Description of the invention (8) Secret = (4 tons 丨 4 (;-valence) (3) Retrieve the front-end 256 points of the original input voice data, calculate the short-range energy and zero-crossing rate expected value and the number of variations, The two numbers and the original input voice data are substituted into the equation (3) for calculation. The distribution probability of short-distance energy and zero-crossing rate is used to distinguish valid voice information from invalid voice information, and the invalid voice information is filtered, not only reduced The amount of data can also correctly capture effective speech information. In the feature extraction section, the present invention uses two speech recognition feature parameters, which include linear prediction cepstrum coefficients [L inear

Prediction Cepstrum Coefficient，LPCC〕及梅爾頻標倒頻譜參數〔Mel Frequency Cepstrum Coefficient， MFCC〕兩者各 12 個倒頻譜參數（cepstrai coefficients) 及12個一階倒頻譜參數（delta-cepstrai 〜 coefficients)。將倒頻譜參數^對時間做偏微分 di Σ 七2 (4) JU-尤 κ為考慮音框數。Prediction Cepstrum Coefficient (LPCC) and Mel Frequency Standard Cepstrum Coefficient (MFCC) each have 12 cepstrai coefficients and 12 first-order cepstrai coefficients (delta-cepstrai ~ coefficients). Partial differentiation of cepstrum parameter ^ with time di Σ 7 2 (4) JU- especially κ is the number of frames to be considered.

因為一階倒頻譜參數的公式⑷過於複雜，故將其加以下列各式為僅考慮前後各兩個時框日夺，方程式簡化如下：卜[2*C(2，)+C(U)]/5 (5)Because the formula of the first-order cepstrum parameter is too complicated, the following formulas are added to consider only the two time frames before and after. The equation is simplified as follows: [2 * C (2,) + C (U)] / 5 (5)

C:\L(X；().5\HIvr： C〇NTINENTS\PK«)351 .ptd 第14頁 1234762 五、發明說明（9) ACi = [2 ^ C(3,«) + C(2,λ) - C(0,«)] / 6 ( 6 ) AC^ =[2*〇^ + 2,;〇十 C(i+l,^)-C(i_l,»)-2*C(i-2，《)]/10 (7) AC3f"2=[C(£-l,«)-C(I-3,«)-23<cC(Z；-4,«)]/6 ( 8 ) △。广1 =卜 C(Z - 2,λ) - 2 木C(Z - 3，《)]/ 5 (9) 方程式（5 )至（9 )中，Cn為η階特徵值，L為訊號中時框總數，i為時框編號。第三圖揭示本發明較佳實施例之聲紋驗證系統之狀態及音框之關係示意圖。在訓練語音上，語音具有所謂「狀態」的觀念，狀態是發音時嘴型以及聲道的變化。一般而言，每一次說話嘴型一定有變化，故每一個狀態都是一個語音變化的特徵表現。有時一個單音卻有可能含有多個狀態。一個狀態並不像音框一樣具有固定尺寸，通常一個狀態包含數個或數十個音$ ° ϋιι 清參照第二圖所示，第一狀態包含三個音框、第二狀熊包含六個音框及第三狀態包含四個音框。第四圖揭示本發明較佳實施例之聲紋驗證系統之音框與狀態之初始分配模式示意圖。該初始分配模式舉例三個樣本$吾音進行均分動作。在初始模式將語音作均分動作，在均分後可能無法整除’多餘音框則將其平分在第一個及最後一個狀態。請再參照第三圖所示，在分配模式中，樣本語音均分必須考虎二點· 1、第一個音框一定屬於第一個狀態；2、最後一個C: \ L (X; (). 5 \ HIvr: C〇NTINENTS \ PK «) 351 .ptd Page 141234762 V. Description of the invention (9) ACi = [2 ^ C (3,«) + C (2 , λ)-C (0, «)] / 6 (6) AC ^ = [2 * 〇 ^ + 2,; 〇十 C (i + l, ^)-C (i_l,»)-2 * C ( i-2, ")] / 10 (7) AC3f " 2 = [C (£ -l,«)-C (I-3, «)-23 < cC (Z; -4,«)] / 6 ( 8) △. Guang 1 = Bu C (Z-2, λ)-2 C (Z-3, ")] / 5 (9) In equations (5) to (9), Cn is the η order characteristic value, and L is the signal Total number of time frames, i is the time frame number. The third figure illustrates the relationship between the state of the voiceprint verification system and the sound frame in the preferred embodiment of the present invention. In training speech, speech has the concept of the so-called "state". State is the change of mouth shape and vocal tract during pronunciation. Generally speaking, every time the mouth shape changes, each state is a feature of a voice change. Sometimes a single tone may contain multiple states. A state does not have a fixed size like a sound box, usually a state contains several or dozens of sounds. ° Refer to the second figure, the first state contains three sound boxes, and the second bear contains six The sound frame and the third state include four sound frames. The fourth figure shows a schematic diagram of the initial allocation mode of the sound frame and status of the voiceprint verification system of the preferred embodiment of the present invention. This initial allocation pattern exemplifies three samples of $ Goy to perform an equalizing action. In the initial mode, the voice is evenly divided. After the equalization, it may not be able to be removed. The extra frame will be divided into the first and last states. Please refer to the third figure again. In the distribution mode, the sample voice must be divided equally. Two points 1. The first frame must belong to the first state; 2. The last one

1234762 五、發明說明（ίο) 個狀態；3、音框的狀態變化，只有於每個狀態的機率，並斯分配機率計算每個音框屈句。手並且利用維特比演算法獲得最相似路换第ϊ f揭不本發明較佳實施例之聲紋驗證系統之狀態轉換不意圖。艟五圓所示，在三個狀態時，L個音框可能狀態、、、° 將打又音框視為不可能屬於的狀態，箭頭的方向視為可此狀態變化路徑。，二圖揭不本發明較佳實施例之聲紋驗證系統之最相似路徑示意圖。 A請參照第六圖所示，擷取特徵之最相似路徑具有第一狀1234762 V. Description of the invention (ίο) states; 3, The state of the sound box changes only in the probability of each state, and the probability is assigned to calculate the sentence of each frame. The hand and use the Viterbi algorithm to obtain the most similar path ϊf reveals that the state transition of the voiceprint verification system of the preferred embodiment of the present invention is not intended. As shown by Wuyuan, in three states, the possible states of the L sound frames are, ,,, and °. The sound frame is regarded as a state that cannot belong to, and the direction of the arrow is regarded as the path that this state can change. The second figure shows the most similar path diagram of the voiceprint verification system of the preferred embodiment of the present invention. A Please refer to the sixth figure. The most similar path of the extracted features has the first shape.

態包含第1至3音框、第-肿能4人哲」s e L 包含第7至10音框。至5音框及第^。第七圖揭示本發明較佳實施例之聲紋驗證系統之框示意圖。』刀曰請圖所示，三個樣本語音在三個狀態的初始模 t 後之分佈。第—樣本語音之每個樣本語音均为二個音框後，剩餘兩個音框分別分配置第一狀態及狀態。第二樣本語音之每個樣本語音均分四個音框。第三樣本語音之每個樣本語音均分三個音框後，剩餘一個分別分配置第-狀態。在計算後，纟最大相似機率為 215 7。第八圖揭不本發明較佳實施例之聲紋驗證系統之第一次The state includes frames 1 to 3, and the 4th spleen. The e L includes frames 7 to 10. To 5 frame and ^. The seventh figure discloses a block diagram of a voiceprint verification system according to a preferred embodiment of the present invention. Daodao Please show the distribution of the three sample speeches after the initial modes t of the three states. After each sample voice of the first-sample voice has two voice frames, the remaining two voice frames are respectively arranged in the first state and state. Each sample speech of the second sample speech is divided into four frames. After each sample voice of the third sample voice is divided into three frames, the remaining one is divided into the first state. After calculation, the maximum similarity probability of 纟 is 215 7. Figure 8 shows the first time a voiceprint verification system of a preferred embodiment of the present invention

：\ L(X；() - 5 \ FIVE COST INENTS \ PK9 3 51 第16頁 1234762 五、發明說明（11) 重新分配音框示意圖。，參照第八圖所示’在第-次重新分配音相似機率上升至3171。其最大 =圖揭示本發明較佳實施例之聲紋驗證重新分配音框示意圖。〈弟一次請參照第九圖所示，在第二次重新分配立相似機率上升至3571。 -曰 ’其最大第十圖揭示本發明較佳實施例之配音框示意圖。卑、，又驗也系統之最佳分似：十圖所示’在多次重新分配音框後，其最Η 以機率3571不再上升，因此其視其最大相狀態的期望值及變異數作為模都：：佳刀配“匡。計算各存在資料庫。吳數作為杈型參數，該模型參數可供儲練作業時，運曾：J :⑴在,々5亥訓練系統10進行接著利用維；= ;期望值及變異數；為==似；！成；ίγ各狀'態 2音訓練上’其最大相似機率小業。法通過語音訓練且钎、預疋參考值&，無紋驗證系統1 ;反之°，"畀 "，因而必須重新操作該聲時，通過語音訓缝，’、大相似機率大於該預定參考值統1。而將模型參數儲存在該聲紋驗證系睛再參照第一 % — ^ 進入允許交易階段。不田70成申請聲紋辨識註冊時，即 C: \L(X;()- 5 \F 丨 VE CONT丨 NENTS\PKy3 51 · 第17頁 1 1234762 五、發明說明（12) 請再參照第一及二圖所示，該輪入帳號已建立該測試系統20進行語音測試作業。本發明之聲紋操作該測試系統2 0之詳述如下：為謂4 同樣的在進入該測試系統20進行語音測試作方程式（1 )至（9 )獲得有效測試語音特徵。异請再參照第二圖所示，接著，進行運算該測試語音與模型參數之間相似機率以便輸出一辨識結。―五立識上，其最小相似機率大於預定參考值時f通過组^ ^辨，，因而可離開該聲紋驗證系統！，且進入後續電子曰商務父易程序，反之，其最小相似機率小於該預定參考值無法通過語音辨識且結束測試作業，因而必須離開該聲纹驗=系統1，且拒絕進行後續電子商務交易程序。、纹m第一及二圖所示，最後，該可辨識裝置依讎链紋驗證糸統1之測試系統20測試結果決定允許或拒進電子商務交易。 —雖然本發明已以前述較佳實施例揭示，然其並非用以限 =本發明、，任何熟習此技藝者，在不脫離本發明之精神和内，S可作各種之更動與修改，因此本發明之保護範圍备視後附之申請專利範圍所界定者為準。 :\L(X；〇-5\!;IVI： C〇NTINENTS\PKy351 .ptd 第18頁 1234762 圖式簡單說明【圖式簡單說明】第1圖：本發明較佳實施例電子商務交易之聲紋驗證系統之流程圖。第2圖：本發明較佳實施例之聲紋驗證系統之流程方塊圖。第3圖：本發明較佳實施例之聲紋驗證系統之狀態及音框之關係示意圖。第4圖：本發明較佳實施例之聲紋驗證系統之音框與狀態之初始分配模式示意圖。第5圖：本發明較佳實施例之聲紋驗證系統之狀態轉換示意圖。第6圖：本發明較佳實施例之聲紋驗證系統之最相似路徑示意圖。龜第7圖：本發明較佳實施例之聲紋驗證系統之均分示意圖。第8圖：本發明較佳實施例之聲紋驗證系統之第一次重新分配音框示意圖。第9圖：本發明較佳實施例之聲紋驗證系統之第二次重新分配音框示意圖。第1 〇圖：本發明較佳實施例之聲紋驗證系統之最佳分配音框示意圖。圖號說明： 1 聲紋驗證系統 10 訓練系統 20 測試系統: \ L (X; ()-5 \ FIVE COST INENTS \ PK9 3 51 Page 16 1234762 5. Explanation of the invention (11) Schematic diagram of redistribution of sound frames. Refer to the eighth figure, 'Redistribute sound at the first time- The probability of similarity rises to 3171. Its maximum = the figure shows the schematic diagram of the voiceprint verification reallocation of the preferred embodiment of the present invention. <Please refer to the ninth figure once, and the similarity probability of the second reallocation rises to 3571. -Said 'The largest tenth picture thereof shows a schematic diagram of the voice-over frame of the preferred embodiment of the present invention. The best part of the system is similar to that shown in the ten diagrams:' Η Take the probability of 3571 no longer rising, so it regards the expected value of the maximum phase state and the number of mutations as the modulo :: Jiadao with "Kuang. Calculate each existing database. Wu number as a branch type parameter, the model parameters can be stored When practicing homework, Yun Zeng: J: ⑴ 在, 々 5 HAI training system 10 then use the dimension; =; expected value and the number of mutations; = = similar;! Cheng; γ γ various states 'state 2-tone training on' its maximum Similar chances for small businesses. The reference value &, non-texture verification system 1; otherwise °, " 畀 ", so when the sound must be re-operated, the probability of similarity is greater than the predetermined reference value system 1 through voice training, and the model parameters are stored. In this voiceprint verification system, refer to the first% — ^ to enter the allowed transaction phase. When Hada 70% applies for voiceprint identification registration, that is, C: \ L (X; ()-5 \ F 丨 VE CONT 丨 NENTS \ PKy3 51 · Page 17 1 1234762 V. Description of the invention (12) Please refer to the first and second figures again, the test account 20 has been established for the turn-in account for voice test operations. The voiceprint of the present invention operates the test system The details of 2 0 are as follows: For Predicate 4, the same goes into the test system 20 for the voice test. Equations (1) to (9) are obtained to obtain valid test voice features. Please refer to the second figure again, and then perform the calculation. The probability of similarity between the test voice and the model parameters in order to output a recognition knot. ― In the five senses, when the smallest similarity probability is greater than the predetermined reference value, f is identified by the group ^ ^, and thus can leave the voiceprint verification system! Enter the follow-up electronic business The parent program, on the other hand, its minimum similarity probability is less than the predetermined reference value and cannot pass voice recognition and end the test operation. Therefore, it must leave the voiceprint inspection = system 1 and refuse to conduct subsequent e-commerce transaction procedures. As shown in the two figures, finally, the identifiable device decides whether to allow or deny e-commerce transactions based on the test results of the test system 20 of the chain verification system 1. Although the present invention has been disclosed in the foregoing preferred embodiment, it is not Limitation = the present invention. Any person skilled in this art can make various changes and modifications without departing from the spirit and scope of the present invention. Therefore, the scope of protection of the present invention is defined by the scope of the attached patent application. Prevail. : \ L (X; 〇-5 \ !; IVI: C〇NTINENTS \ PKy351 .ptd Page 18 1234762 Brief Description [Schematic Illustration] Figure 1: Voice of E-commerce Transactions in the Preferred Embodiment of the Present Invention Flowchart of the voiceprint verification system. Figure 2: Block diagram of the voiceprint verification system of the preferred embodiment of the present invention. Figure 3: Schematic diagram of the state of the voiceprint verification system and the relationship between the sound boxes of the preferred embodiment of the present invention Figure 4: Schematic diagram of the initial allocation mode of the voice frame and status of the voiceprint verification system of the preferred embodiment of the present invention. Figure 5: State transition of the voiceprint verification system of the preferred embodiment of the present invention. Figure 6 : Schematic diagram of the most similar path of the voiceprint verification system of the preferred embodiment of the present invention. Turtle FIG. 7: The equalization diagram of the voiceprint verification system of the preferred embodiment of the present invention. FIG. 8: of the preferred embodiment of the present invention Schematic diagram of the first reassignment of the voice frame of the voiceprint verification system. Figure 9: Schematic diagram of the second reassignment of the voiceprint of the voiceprint verification system of the preferred embodiment of the present invention. Figure 10: The preferred embodiment of the present invention Best Voiceprint Verification System Schematic diagram of the voice distribution box. Drawing number description: 1 Voiceprint verification system 10 Training system 20 Test system

C:\L(X；〇-5\I;IVi： CONTINUNTS\PK*)351 .pul 第19頁C: \ L (X; 〇-5 \ I; IVi: CONTINUNTS \ PK *) 351 .pul page 19

Claims

1234762 6. Scope of patent application, an electronic customer account using one that can be identified and one that should be recognized; verifying the original input invalid voice at a front end according to the application-specific voiceprint; a feature extraction feature; a business transaction method, which Including steps: login by a connected device; identifying the device to confirm the customer's basic information; checking whether the device has applied for voiceprint comparison; voiceprint identification system for voiceprint identification; and the device determining to allow or deny e-commerce transactions. Lee range first! The item's e-commerce transaction method system includes: Qianghaili, which is used for front-end processing. The voiceprint verification system = voice data, and thus completes the distinction of valid voice information and information, and then retrieves valid voice information; It is used for retrieving the valid voice information. The May 1 storage unit stores the voice feature; 丨 transportation; the computing unit'uses the stored voice feature and input voice feature force according to the second item of the scope of the patent application In the e-commerce transaction method, 1 The voiceprint verification system further includes a training system that uses the front-end feature extraction section to obtain a model of the original input voice data ^. 2. The e-commerce transaction method according to item 4 of the scope of patent application, wherein the training system of the voiceprint verification system uses Viterbi algorithm ^ to obtain ^ similar paths in order to calculate model parameters for storage. Also, according to the e-commerce transaction method according to the scope of the patent application, the voiceprint verification system further includes a test system using the front-end processing unit and

1234762 6. Scope of patent application Feature extraction section to obtain the voice features of the original input voice data. 6. The e-commerce transaction method according to item 1 of the scope of patent application, where the voiceprint verification system enters the personal password when no voiceprint comparison is applied. 7. The e-commerce transaction method according to item 6 of the scope of patent application, When the correct personal password is entered, whether to apply for registration of voiceprint recognition phase is entered. _

: \ L (X; ()-5 \ FIVE CONTINENTS \ PK <; 3 51. p t d p. 21