TWI234762B - Voiceprint identification system for e-commerce - Google Patents

Voiceprint identification system for e-commerce

Info

Publication number
TWI234762B
Authority
TW
Taiwan
Prior art keywords
voiceprint
voice
patent application
verification system
scope
Prior art date
Application number
TW92136456A
Other languages
Chinese (zh)
Other versions
TW200521962A (en)
Inventor
Kun-Lang Yu
Andy Cheng
Yen-Chieh Ouyang
Original Assignee
Top Dihital Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Top Dihital Co Ltd
Priority to TW92136456A
Application granted
Publication of TWI234762B
Publication of TW200521962A

Landscapes

  • Collating Specific Patterns (AREA)
  • Financial Or Insurance-Related Operations Such As Payment And Settlement (AREA)

Abstract

An e-commerce method includes the steps of: accepting a user's login through an electronic communication means; using an identification device to check a password; using the identification device to verify whether the user has registered for voiceprint identification; using a voiceprint identification system to perform voiceprint identification or to register the user for it; and using the identification device to decide whether to allow or reject the user's e-commerce transaction.

Description


[Technical field of the invention]
The present invention relates to a voiceprint verification system for e-commerce transactions. In particular, it relates to a voiceprint verification system that, besides verifying voiceprints for e-commerce transactions, combines Gaussian distribution probability, a dynamic time warping algorithm, and a hidden Markov model, and uses the Viterbi algorithm to obtain the most similar path from which the model parameters are calculated.

[Prior art]

A conventional e-commerce transaction method is disclosed in Republic of China (Taiwan) Patent Publication No. 385416, "E-commerce system". It discloses a commerce system that provides archiving safety for a transaction log on a network, comprising: a session key creator for generating a session key used to encrypt the transaction log; a transaction log encryptor for encrypting the transaction log with the session key; and a transaction log sender for sending the encrypted transaction log to an archive server on the network. However, No. 385416 only encrypts the transaction log for data transmission and storage; it does not identify the user.

Another conventional e-commerce transaction method is disclosed in Republic of China Patent Publication No. 550477, "Method, system and computer-readable medium for website accounts and e-commerce management from a central location". It discloses a method for managing a user's online financial transactions, made from a central website, at a destination e-commerce website, comprising: logging the user in to a destination e-commerce website; generating a unique user name and password for the user at the central website; registering at one or more destination websites with the unique user name and password; transmitting an activation command to a financial institution that processes financial transactions, in order to activate a credit card or debit card account of the user; while the credit card or debit card account is active, sending a payment request against that account through the destination e-commerce website; and transmitting a deactivation command to the financial institution in order to deactivate the credit card or debit card account. While the account is active, the financial institution accepts and processes only payment requests from the e-commerce website; while the account is deactivated, the financial institution rejects payment requests. However, No. 550477 identifies the user only by the unique user name and password, so there is a risk of password leakage.

In short, the e-commerce transactions of Nos. 385416 and 550477 need to be improved so that the identity of the user can be determined accurately.

A conventional voiceprint verification method is disclosed in Republic of China Patent Publication No. 490655, "Method and device for identifying a user by voice spectrum information". It identifies a user by the voice spectrum information that is unique to each user, in order to decide whether the user is authorized. The method comprises the steps of: [1] detecting the end point of the speech after the user has spoken; [2] extracting speech features from the speech; [3] deciding whether training is needed: if yes, storing the speech features as a reference sample and setting a threshold; if no, proceeding to the next step; [4] pattern-matching the speech features against the reference sample; [5] calculating the distance between the two according to the matching result; [6] comparing the calculated result with the set threshold; [7] deciding from the comparison whether the user is an authorized user. The method is used in mobile phones: it uses spectral analysis to extract the distinctive information of the speech and identifies the user from it. No. 490655 mainly compares the main value of each frame with a threshold set by the user to determine the start point and end point of the speech, and then uses a Princen-Bradley filter to transform the detected speech signal into its corresponding spectral pattern. The spectral pattern is compared with a pre-stored reference spectral sample to identify the user's voiceprint.

In short, No. 490655 requires pattern matching and a distance calculation; if the calculated distance does not exceed the threshold, the user passes voiceprint identification. However, to perform the pattern matching and distance calculation, No. 490655 must calculate the distance between the reference sample and the test sample. In practice the reference sample occupies a considerable amount of database space, so the method requires not only more database space but also a longer file transfer time. If this voiceprint verification technique were applied to e-commerce transactions, it would have the drawback of lengthening the transaction time.

No. 490655 therefore still needs improvement with respect to the space occupied by its reference samples, so that the database space for storing reference samples is saved and the number of users is not limited. Reducing the number of bits in the reference sample would also shorten the time needed for voiceprint verification and raise the recognition rate, so that transaction time is shortened when voiceprint verification is applied to e-commerce transactions.

In view of this, the present invention remedies the above drawbacks. When an e-commerce transaction is performed, the user is identified by a voiceprint verification system, and the voiceprint verification system further combines Gaussian distribution probability, a dynamic time warping algorithm, and a hidden Markov model, and uses the Viterbi algorithm to obtain the most similar path in order to calculate the model parameters.

[Summary of the invention]
The main object of the invention is to provide a voiceprint verification system for e-commerce transactions that identifies the user with a voiceprint verification system when an e-commerce transaction is performed, so that the invention raises the recognition rate.

A secondary object of the invention is to provide a voiceprint verification system for e-commerce transactions that, besides verifying voiceprints for e-commerce transactions, combines Gaussian distribution probability, a dynamic time warping algorithm, and a hidden Markov model, and uses the Viterbi algorithm to obtain the most similar path in order to calculate the model parameters, so that the invention simplifies training and testing.

The e-commerce transaction method according to the invention comprises the steps of: logging in with a customer account through a connection device; confirming the customer's basic data with an identification device; checking with the identification device whether voiceprint comparison has been applied for; using a voiceprint verification system to perform either voiceprint identification or voiceprint registration; and deciding with the identification device whether to allow or reject the e-commerce transaction.

The voiceprint verification system of the invention comprises a front-end processing unit, a feature extraction unit, a training system, and a test system, so that training or testing can be performed on original input speech data. For training speech, the training system uses the front-end processing unit to extract the valid training speech information from the original input speech data, uses the feature extraction unit to extract the valid training speech features, and then processes the valid training speech information to obtain the most similar path, which yields the model parameters. Likewise, for test speech, the test system uses the front-end processing unit to extract the valid test speech information from the original input speech data, uses the feature extraction unit to extract the valid test speech features, and then calculates the similarity probability between the test speech features and the model parameters in order to output a recognition result.

[Embodiments]
To make the above and other objects, features, and advantages of the invention clearer, a preferred embodiment is described in detail below with reference to the accompanying drawings.

Figure 1 is a flowchart of the voiceprint verification system for e-commerce transactions according to the preferred embodiment of the invention.

Referring to Figure 1, in the e-commerce transaction method of the preferred embodiment, a customer account logs in through a connection device. The connection device includes a personal computer, an automated teller machine, a merchant credit card reader, and the like, through which ordinary commercial transactions can be carried out. The connection device connects to a voiceprint verification center [...]. The voiceprint verification center uses an identification device to confirm the customer's basic data; the identification device includes identification logic circuits and the like. The voiceprint verification center also has a voiceprint verification system.

Referring again to Figure 1, the identification device next checks whether the customer has applied for voiceprint comparison, which produces a result stating whether the customer needs voiceprint comparison. The voiceprint verification center returns this result to the connection device so that the subsequent e-commerce transaction procedure can proceed.

Figure 2 is a block diagram of the flow of the voiceprint verification system of the preferred embodiment.

Referring to Figure 2, the voiceprint verification system 1 of the preferred embodiment includes a training system 10 and a test system 20 for performing training or testing on original input speech data. The voiceprint verification system 1 further includes a front-end processing unit, a feature extraction unit, a storage unit, and a computation unit. The front-end processing unit and the feature extraction unit are used by the training system 10 and the test system 20 for front-end processing and feature extraction; the storage unit stores speech features; and the computation unit operates on the stored speech features and the input speech features.

When a customer account is entered into the voiceprint verification system 1, identity confirmation can begin. The system queries the database with the entered account to determine whether the account has already been established. If the account has not been established, the training system 10 is entered for speech training in order to create and store the speech data of the account. If the account has been established, the test system 20 is entered for speech testing in order to determine whether the speech features supplied for the account match the speech data stored for that account.

Referring again to Figures 1 and 2, when the customer has not applied for voiceprint comparison, the customer is asked to enter a personal password. If the customer enters an incorrect personal password, the transaction-rejected stage is entered. After entering the correct personal password, the customer is asked whether to apply for voiceprint identification registration. If the customer chooses not to apply, the transaction-allowed stage is entered. If the customer chooses to apply for voiceprint identification registration, the training system 10 of the voiceprint verification system 1 is entered. The operation of the training system 10 during voiceprint identification registration is detailed below.
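The decision flow of Figure 1 described above can be summarized in a short sketch. This is only an illustration under stated assumptions: the names `Customer`, `authorize`, `verify_voice`, and `enroll_voice` are hypothetical and do not appear in the patent, the two callables stand in for the test system 20 and the training system 10, and the `"retry"` outcome is one possible reading of the requirement that a failed training be repeated.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Customer:
    account: str
    password: str
    registered: bool = False  # has the customer applied for voiceprint comparison?


def authorize(
    customer: Customer,
    entered_password: str,
    wants_registration: bool,
    verify_voice: Callable[[str], bool],  # hypothetical stand-in for test system 20
    enroll_voice: Callable[[str], bool],  # hypothetical stand-in for training system 10
) -> str:
    """Return 'allow', 'reject', or 'retry' for one login attempt."""
    if customer.registered:
        # Registered accounts are checked by the test system.
        return "allow" if verify_voice(customer.account) else "reject"
    # Unregistered accounts fall back to the personal password.
    if entered_password != customer.password:
        return "reject"
    if not wants_registration:
        return "allow"
    # Registration runs the training system; a failed training must be repeated.
    if enroll_voice(customer.account):
        customer.registered = True
        return "allow"
    return "retry"
```

A caller would pass functions that actually record and score speech; here they are left abstract so that only the branching is shown.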


Before speech features are extracted, the front-end processing unit extracts the valid speech information from the original input speech data and filters out the invalid speech information. The detection used by the invention covers the short-time energy and the zero-crossing rate. The invention adopts a calculation that incorporates a Gaussian probability distribution, given by:

$$b(x_i) = \frac{1}{(2\pi)^{D/2}\,|\Sigma|^{1/2}} \exp\!\left[-\frac{1}{2}\,(x_i-\mu)^{T}\,\Sigma^{-1}\,(x_i-\mu)\right] \tag{1}$$

where $x$ is the original signal divided into $D$-dimensional frames $x_i$, $i = 1, \dots, M$; $b(x_i)$ is the associated probability; $\mu$ is the expected value of the background noise; and $\Sigma$ is the variance of the background noise. Because $D = 256$ is fixed, the factor $1/(2\pi)^{D/2}$ is a constant and is omitted from the calculation, which simplifies equation (1) to:

$$b(x_i) = \frac{1}{|\Sigma|^{1/2}} \exp\!\left[-\frac{1}{2}\,(x_i-\mu)^{T}\,\Sigma^{-1}\,(x_i-\mu)\right] \tag{2}$$

The value of the exponential in this expression can become too large during computation, so the logarithm is taken, which simplifies equation (2) to:

$$\ln b(x_i) = -\frac{1}{2}\ln|\Sigma| - \frac{1}{2}\,(x_i-\mu)^{T}\,\Sigma^{-1}\,(x_i-\mu) \tag{3}$$

The first 256 points of the original input speech data are taken, and the expected value and the variance of the short-time energy and of the zero-crossing rate are calculated from them. These two quantities and the original input speech data are then substituted into equation (3). The probability distributions of the short-time energy and the zero-crossing rate are used to separate valid speech information from invalid speech information, and the invalid speech information is filtered out. This both reduces the amount of data and allows the valid speech information to be extracted correctly.

For feature extraction, the feature extraction unit of the invention uses two kinds of speech recognition feature parameters: linear prediction cepstrum coefficients (LPCC) and Mel frequency cepstrum coefficients (MFCC), each with 12 cepstral coefficients and 12 first-order (delta) cepstral coefficients.
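The front-end screening built on equation (3) could look roughly like the sketch below. It is not the patented implementation. The frame length of 32 samples, the diagonal covariance, the variance floor, the `margin` threshold, and all function names are assumptions introduced for illustration. The only elements taken from the text are the two features (short-time energy and zero-crossing rate), the use of the first 256 samples as background noise, and the log-probability of equation (3).

```python
import math


def frame_features(samples, frame_len=32):
    """Return (short-time energy, zero-crossing rate) for each full frame."""
    feats = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        energy = sum(s * s for s in frame) / frame_len
        crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
        feats.append((energy, crossings / (frame_len - 1)))
    return feats


def noise_model(noise_feats, floor=1e-9):
    """Mean and floored variance of each feature over the background-noise frames."""
    n = len(noise_feats)
    dims = range(len(noise_feats[0]))
    mean = [sum(f[j] for f in noise_feats) / n for j in dims]
    var = [max(sum((f[j] - mean[j]) ** 2 for f in noise_feats) / n, floor) for j in dims]
    return mean, var


def log_noise_probability(x, mean, var):
    """Equation (3) with a diagonal covariance (an assumption of this sketch)."""
    return sum(-0.5 * math.log(v) - 0.5 * (xi - m) ** 2 / v
               for xi, m, v in zip(x, mean, var))


def speech_frames(samples, frame_len=32, noise_len=256, margin=10.0):
    """Indices of frames that are unlikely to be background noise."""
    feats = frame_features(samples, frame_len)
    n_noise = noise_len // frame_len
    mean, var = noise_model(feats[:n_noise])
    scores = [log_noise_probability(f, mean, var) for f in feats]
    baseline = min(scores[:n_noise])  # least noise-like frame among the noise frames
    return [i for i, s in enumerate(scores) if s < baseline - margin]
```

Frames whose log-probability under the noise model falls well below that of the leading noise frames are kept as valid speech; the rest are discarded, which matches the filtering role described above.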

The first-order (delta) cepstral coefficients are obtained by taking the partial derivative of the cepstral coefficients with respect to time:

$$\Delta C_n^{\,i} = \frac{\sum_{k=-K}^{K} k\,C(i+k,\,n)}{\sum_{k=-K}^{K} k^{2}} \tag{4}$$

where $K$ is the number of frames considered.

Because formula (4) for the first-order cepstral coefficients is too complex, it is simplified. The following expressions consider only the two frames before and the two frames after each frame:

$$\Delta C_n^{\,0} = \bigl[\,2\,C(2,n) + C(1,n)\,\bigr] / 5 \tag{5}$$

$$\Delta C_n^{\,1} = \bigl[\,2\,C(3,n) + C(2,n) - C(0,n)\,\bigr] / 6 \tag{6}$$

$$\Delta C_n^{\,i} = \bigl[\,2\,C(i+2,n) + C(i+1,n) - C(i-1,n) - 2\,C(i-2,n)\,\bigr] / 10 \tag{7}$$

$$\Delta C_n^{\,L-2} = \bigl[\,C(L-1,n) - C(L-3,n) - 2\,C(L-4,n)\,\bigr] / 6 \tag{8}$$

$$\Delta C_n^{\,L-1} = \bigl[\,-C(L-2,n) - 2\,C(L-3,n)\,\bigr] / 5 \tag{9}$$

In equations (5) to (9), $C(i,n)$ denotes the $n$-th order feature value $C_n$ of frame $i$, $L$ is the total number of frames in the signal, and $i$ is the frame index.

Figure 3 is a schematic diagram of the relationship between states and frames in the voiceprint verification system of the preferred embodiment.

In speech training, speech has the notion of a "state": a state is a change of mouth shape and vocal tract during pronunciation. In general the mouth shape changes every time one speaks, so each state is the characteristic expression of one change in the speech. A single sound may sometimes contain several states. Unlike a frame, a state has no fixed size; a state usually contains several or several dozen frames.

Referring to Figure 3, the first state contains three frames, the second state contains six frames, and the third state contains four frames.

Figure 4 is a schematic diagram of the initial allocation of frames to states in the voiceprint verification system of the preferred embodiment. The initial allocation is illustrated by dividing three sample utterances evenly.

In the initial model the speech is divided evenly among the states. If the frames cannot be divided exactly, the leftover frames are shared between the first state and the last state. Referring again to Figure 3, the even division of a sample utterance must observe three points: 1. the first frame must belong to the first state; 2. the last frame must belong to the last state; 3. the state of a frame can change only [...]. A Gaussian distribution probability is used to calculate the probability that each frame belongs to each state, and the Viterbi algorithm is used to obtain the most similar path.
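As an illustration of equations (4) to (9) above, the sketch below computes the first-order cepstral coefficients for a whole utterance. It rests on one observation that can be checked by hand: with K = 2, dropping the out-of-range terms from both sums of equation (4) reproduces the boundary cases (5), (6), (8), and (9) exactly. The function name `delta_cepstrum` and the list-of-lists layout are assumptions made for the example.

```python
def delta_cepstrum(cepstra, K=2):
    """cepstra[i][n] is the n-th cepstral coefficient of frame i.

    Returns the first-order coefficients, truncating the sums of
    equation (4) at the ends of the utterance. Needs at least 2 frames.
    """
    L = len(cepstra)
    deltas = []
    for i in range(L):
        # Keep only the offsets k whose frame i + k exists.
        ks = [k for k in range(-K, K + 1) if k != 0 and 0 <= i + k < L]
        denom = sum(k * k for k in ks)  # 5, 6, or 10 when K = 2
        deltas.append([
            sum(k * cepstra[i + k][n] for k in ks) / denom
            for n in range(len(cepstra[i]))
        ])
    return deltas
```

For an interior frame the denominator is 10 as in equation (7); for the first and last frames it is 5, and for the second and second-to-last frames it is 6.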

Figure 5 is a schematic diagram of the state transitions of the voiceprint verification system of the preferred embodiment.

Referring to Figure 5, the possible states of the L frames are shown for the case of three states. A frame marked with a cross is in a state it cannot belong to, and the direction of each arrow is a possible path of state change.

Figure 6 is a schematic diagram of the most similar path of the voiceprint verification system of the preferred embodiment.

Referring to Figure 6, in the most similar path of the extracted features, the first state contains frames 1 to 3, the second state contains the frames in between, and the third state contains frames 7 to 10.

Figure 7 is a schematic diagram of the even division of frames in the voiceprint verification system of the preferred embodiment.

Referring to Figure 7, the distribution of three sample utterances after the initial even division over three states is shown. In the first sample utterance, each state receives two frames and the two remaining frames are assigned to the first state and the last state respectively. In the second sample utterance, each state receives four frames. In the third sample utterance, each state receives three frames and the one remaining frame is assigned to the first state. After calculation, the maximum similarity probability is 2157.
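The even initial allocation and the search for the most similar path described above could be sketched as follows. Several points are assumptions rather than statements from the patent: leftover frames alternate between the first and the last state (the text only covers the case of three states), a frame may only stay in its state or advance to the next one (suggested by Figures 5 and 6), transition probabilities are ignored, and the names `uniform_allocation` and `viterbi_alignment` are hypothetical.

```python
def uniform_allocation(num_frames, num_states):
    """Initial frame count per state; leftovers go to the first, then the last state."""
    base, extra = divmod(num_frames, num_states)
    counts = [base] * num_states
    for j in range(extra):
        counts[0 if j % 2 == 0 else -1] += 1
    return counts


def viterbi_alignment(logp):
    """logp[t][s] is the log-probability that frame t belongs to state s.

    Returns (best state per frame, total log-probability). Assumes at
    least as many frames as states.
    """
    T, S = len(logp), len(logp[0])
    neg = float("-inf")
    score = [[neg] * S for _ in range(T)]
    back = [[0] * S for _ in range(T)]
    score[0][0] = logp[0][0]  # the first frame must be in the first state
    for t in range(1, T):
        for s in range(S):
            stay = score[t - 1][s]
            move = score[t - 1][s - 1] if s > 0 else neg
            if move > stay:
                score[t][s], back[t][s] = move + logp[t][s], s - 1
            else:
                score[t][s], back[t][s] = stay + logp[t][s], s
    path = [S - 1]  # the last frame must be in the last state
    for t in range(T - 1, 0, -1):
        path.append(back[t][path[-1]])
    return path[::-1], score[T - 1][S - 1]
```

In a training loop, the per-frame log-probabilities would come from each state's current expected value and variance, and the alignment would be repeated until the total stops rising, as Figures 8 to 10 describe.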

Figure 8 is a schematic diagram of the first reallocation of frames in the voiceprint verification system of the preferred embodiment.

Referring to Figure 8, after the first reallocation of frames the maximum similarity probability rises to 3171.

Figure 9 is a schematic diagram of the second reallocation of frames in the voiceprint verification system of the preferred embodiment.

Referring to Figure 9, after the second reallocation of frames the maximum similarity probability rises to 3571.

Figure 10 is a schematic diagram of the optimal allocation of frames in the voiceprint verification system of the preferred embodiment.

Referring to Figure 10, after repeated reallocation of frames the maximum similarity probability of 3571 no longer rises, so this allocation is taken as the optimal allocation of frames. The expected value and the variance of each state are calculated as the model parameters, which can be stored in the database.

In a training operation, the training system 10 calculates the expected value and the variance of each state and then applies the Viterbi algorithm [...]. In speech training, if the maximum similarity probability is smaller than a predetermined reference value, the speech training fails and the training operation ends, so the voiceprint verification system 1 must be operated again. Conversely, if the maximum similarity probability is greater than the predetermined reference value, the speech training passes and the model parameters are stored in the voiceprint verification system 1.

Referring again to Figures 1 and 2, once the application for voiceprint identification registration is complete, the transaction-allowed stage is entered.

Referring again to Figures 1 and 2, if the entered account has already been established, the test system 20 is entered for speech testing. The operation of the test system 20 during voiceprint identification is detailed as follows.

Likewise, on entering the test system 20 for a speech test, equations (1) to (9) are evaluated to obtain the valid test speech features.

Referring again to Figure 2, the similarity probability between the test speech features and the model parameters is then calculated in order to output a recognition result. In speech recognition, if the minimum similarity probability is greater than the predetermined reference value, speech recognition passes, so the customer leaves the voiceprint verification system 1 and proceeds to the subsequent e-commerce transaction procedure. Conversely, if the minimum similarity probability is smaller than the predetermined reference value, speech recognition fails and the test operation ends, so the customer leaves the voiceprint verification system 1 and the subsequent e-commerce transaction procedure is refused.

Referring again to Figures 1 and 2, the identification device finally decides whether to allow or reject the e-commerce transaction according to the test result of the test system 20 of the voiceprint verification system 1.

Although the invention has been disclosed by the foregoing preferred embodiment, the embodiment is not intended to limit the invention. Anyone skilled in the art may make various changes and modifications without departing from the spirit and scope of the invention, so the scope of protection of the invention is defined by the appended claims.

[Brief description of the drawings]
Figure 1: Flowchart of the voiceprint verification system for e-commerce transactions according to the preferred embodiment of the invention.
Figure 2: Block diagram of the flow of the voiceprint verification system of the preferred embodiment.
Figure 3: Schematic diagram of the relationship between states and frames in the voiceprint verification system of the preferred embodiment.
Figure 4: Schematic diagram of the initial allocation of frames to states in the voiceprint verification system of the preferred embodiment.
Figure 5: Schematic diagram of the state transitions of the voiceprint verification system of the preferred embodiment.
Figure 6: Schematic diagram of the most similar path of the voiceprint verification system of the preferred embodiment.
Figure 7: Schematic diagram of the even division of frames in the voiceprint verification system of the preferred embodiment.
Figure 8: Schematic diagram of the first reallocation of frames in the voiceprint verification system of the preferred embodiment.
Figure 9: Schematic diagram of the second reallocation of frames in the voiceprint verification system of the preferred embodiment.
Figure 10: Schematic diagram of the optimal allocation of frames in the voiceprint verification system of the preferred embodiment.

Explanation of reference numerals:
1 Voiceprint verification system
10 Training system
20 Test system


Claims (1)

VI. Scope of Patent Application

1. An e-commerce transaction method, comprising the steps of:
a customer account logging in through a connection device;
using an identification device to confirm the customer's basic information;
the identification device checking whether voiceprint comparison has been applied for;
using a voiceprint verification system to perform voiceprint recognition; and
the identification device deciding whether to allow or deny the e-commerce transaction.

2. The e-commerce transaction method according to item 1 of the scope of patent application, wherein the voiceprint verification system comprises:
a front-end processing unit, which performs front-end processing on the original input voice data of the voiceprint verification system, thereby distinguishing valid voice information from invalid voice information, and then extracts the valid voice information;
a feature extraction unit, which extracts voice features from the valid voice information;
a storage unit, which stores the voice features; and
a computing unit, which takes the stored voice features and the input voice features and [...]

3. The e-commerce transaction method according to item 2 of the scope of patent application, wherein the voiceprint verification system further comprises a training system that uses the front-end processing unit and the feature extraction unit to obtain model parameters of the original input voice data.

4. The e-commerce transaction method according to item 4 of the scope of patent application, wherein the training system of the voiceprint verification system further uses the Viterbi algorithm to obtain the most similar path, in order to calculate the model parameters for storage.

5. The e-commerce transaction method according to item 1 of the scope of patent application, wherein the voiceprint verification system further comprises a test system that uses the front-end processing unit and the feature extraction unit to obtain voice features of the original input voice data.

6. The e-commerce transaction method according to item 1 of the scope of patent application, wherein, when voiceprint comparison has not been applied for, the voiceprint verification system proceeds to personal password entry.

7. The e-commerce transaction method according to item 6 of the scope of patent application, wherein, when the correct personal password is entered, the method proceeds to the stage of deciding whether to apply for voiceprint recognition registration.
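The control flow recited in the claims (login, confirmation of customer data, check for voiceprint registration, then either voiceprint recognition or personal password entry) can be sketched roughly as follows. This is an illustrative Python sketch, not the patented implementation: the names `Outcome`, `handle_transaction_request`, and the record layout are hypothetical, and denying the transaction on an unknown account or a wrong password is an assumption the claims do not state.

```python
from enum import Enum
from typing import Callable, Dict, Optional


class Outcome(Enum):
    ALLOW = "allow"                            # transaction may proceed
    DENY = "deny"                              # transaction is refused
    OFFER_REGISTRATION = "offer_registration"  # ask whether to register a voiceprint


def handle_transaction_request(
    account: str,
    customer_records: Dict[str, dict],
    verify_voiceprint: Callable[[str], bool],
    entered_password: Optional[str] = None,
) -> Outcome:
    """Hypothetical identification-device logic following the claimed steps."""
    # Confirm the customer's basic information for the logged-in account.
    record = customer_records.get(account)
    if record is None:
        return Outcome.DENY  # assumption: unknown accounts are refused

    # Check whether voiceprint comparison has been applied for.
    if record["voiceprint_registered"]:
        # Voiceprint recognition decides whether the transaction is allowed.
        return Outcome.ALLOW if verify_voiceprint(account) else Outcome.DENY

    # Not registered: fall back to personal password entry.
    # Plain-text comparison is for illustration only.
    if entered_password is not None and entered_password == record["password"]:
        return Outcome.OFFER_REGISTRATION
    return Outcome.DENY  # assumption: a wrong or missing password is refused
```

The `verify_voiceprint` callback stands in for the voiceprint verification system, so the sketch stays independent of any particular recognition technique.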
TW92136456A 2003-12-22 2003-12-22 Voiceprint identification system for e-commerce TWI234762B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW92136456A TWI234762B (en) 2003-12-22 2003-12-22 Voiceprint identification system for e-commerce

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW92136456A TWI234762B (en) 2003-12-22 2003-12-22 Voiceprint identification system for e-commerce

Publications (2)

Publication Number Publication Date
TWI234762B true TWI234762B (en) 2005-06-21
TW200521962A TW200521962A (en) 2005-07-01

Family

ID=36597937

Family Applications (1)

Application Number Title Priority Date Filing Date
TW92136456A TWI234762B (en) 2003-12-22 2003-12-22 Voiceprint identification system for e-commerce

Country Status (1)

Country Link
TW (1) TWI234762B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI481774B (en) * 2013-09-18 2015-04-21 Generalplus Technology Inc Method for unlocking door, method for leasing asset and system thereof
TWI641965B (en) * 2017-03-13 2018-11-21 平安科技(深圳)有限公司 Method and system of authentication based on voiceprint recognition

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI633425B (en) * 2016-03-02 2018-08-21 美律實業股份有限公司 Microphone apparatus

Also Published As

Publication number Publication date
TW200521962A (en) 2005-07-01

Similar Documents

Publication Publication Date Title
WO2021120631A1 (en) Intelligent interaction method and apparatus, and electronic device and storage medium
WO2019085575A1 (en) Voiceprint authentication method and apparatus, and account registration method and apparatus
JP5695709B2 (en) Method and system for validating personal account identifiers using biometric authentication and self-learning algorithms.
JP6096333B2 (en) Method, apparatus and system for verifying payment
US8694315B1 (en) System and method for authentication using speaker verification techniques and fraud model
US20150088746A1 (en) Method and system for implementing financial transactions
CN110169014A (en) Device, method and computer program product for certification
KR20160019924A (en) Speech transaction processing
US20220172729A1 (en) System and Method For Achieving Interoperability Through The Use of Interconnected Voice Verification System
JP2007004796A (en) Method, system and program for sequential authentication using one or more error rates, which characterize each security challenge
CN107633627A (en) One kind is without card withdrawal method, apparatus, equipment and storage medium
CN110324314B (en) User registration method and device, storage medium and electronic equipment
US20060229879A1 (en) Voiceprint identification system for e-commerce
CN112417412A (en) Bank account balance inquiry method, device and system
CN112201254A (en) Non-sensitive voice authentication method, device, equipment and storage medium
KR101181060B1 (en) Voice recognition system and method for speaker recognition using thereof
US20130339245A1 (en) Method for Performing Transaction Authorization to an Online System from an Untrusted Computer System
CN111598577B (en) Resource transfer method, device, computer equipment and storage medium
TWI234762B (en) Voiceprint identification system for e-commerce
KR20190142056A (en) Voice recognition otp authentication method using machine learning and system thereof
CN108564374A (en) Payment authentication method, device, equipment and storage medium
US11289080B2 (en) Security tool
CN107454044A (en) A kind of e-book reading protection of usage right method and system
KR20100130809A (en) Electronic commerce system of record by ordering based on sound recognition and user-authentication based on speech or image recognition
KR102722515B1 (en) System and method for achieving interoperability through the use of interconnected voice verification system

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees