US20100017209A1 - Random voiceprint certification system, random voiceprint cipher lock and creating method therefor - Google Patents

Random voiceprint certification system, random voiceprint cipher lock and creating method therefor Download PDF

Info

Publication number
US20100017209A1
US20100017209A1 US12/519,982 US51998207A US2010017209A1 US 20100017209 A1 US20100017209 A1 US 20100017209A1 US 51998207 A US51998207 A US 51998207A US 2010017209 A1 US2010017209 A1 US 2010017209A1
Authority
US
United States
Prior art keywords
voiceprint
random
voice
voice data
training
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US12/519,982
Other languages
English (en)
Inventor
Kun-Lang Yu
Yen-Chieh Ouyang
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Top Digital Co Ltd
Original Assignee
Top Digital Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Top Digital Co Ltd filed Critical Top Digital Co Ltd
Assigned to TOP DIGITAL CO., LTD. reassignment TOP DIGITAL CO., LTD. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: OUYANG, YEN-CHIEH, YU, KUN-LANG
Publication of US20100017209A1 publication Critical patent/US20100017209A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04KSECRET COMMUNICATION; JAMMING OF COMMUNICATION
    • H04K1/00Secret communication
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/30Authentication, i.e. establishing the identity or authorisation of security principals
    • G06F21/31User authentication
    • G06F21/32User authentication using biometric data, e.g. fingerprints, iris scans or voiceprints
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/02Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/06Decision making techniques; Pattern matching strategies
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L9/00Cryptographic mechanisms or cryptographic arrangements for secret or secure communications; Network security protocols
    • H04L9/06Cryptographic mechanisms or cryptographic arrangements for secret or secure communications; Network security protocols the encryption apparatus using shift registers or memories for block-wise or stream coding, e.g. DES systems or RC4; Hash functions; Pseudorandom sequence generators
    • H04L9/065Encryption by serially and continuously modifying data stream elements, e.g. stream cipher systems, RC4, SEAL or A5/3
    • H04L9/0656Pseudorandom key sequence combined element-for-element with data sequence, e.g. one-time-pad [OTP] or Vernam's cipher
    • H04L9/0662Pseudorandom key sequence combined element-for-element with data sequence, e.g. one-time-pad [OTP] or Vernam's cipher with particular pseudorandom sequence generator
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L9/00Cryptographic mechanisms or cryptographic arrangements for secret or secure communications; Network security protocols
    • H04L9/32Cryptographic mechanisms or cryptographic arrangements for secret or secure communications; Network security protocols including means for verifying the identity or authority of a user of the system or for message authentication, e.g. authorization, entity authentication, data integrity or data verification, non-repudiation, key authentication or verification of credentials
    • H04L9/3226Cryptographic mechanisms or cryptographic arrangements for secret or secure communications; Network security protocols including means for verifying the identity or authority of a user of the system or for message authentication, e.g. authorization, entity authentication, data integrity or data verification, non-repudiation, key authentication or verification of credentials using a predetermined code, e.g. password, passphrase or PIN
    • H04L9/3231Biological data, e.g. fingerprint, voice or retina

Definitions

  • This method is used in mobile phones or computer related products and can extract the unique feature of the voice by voice spectrum analysis for identifying the user.
  • the primary value of each frame is compared with the boundary set by the user to decide the starting point and end point of the voice.
  • a Princen-Bradley filter is then used to convert the detected voice signals to retrieve corresponding voice spectrum patterns which are compared with reference voice spectrum samples stored previously, thereby identifying the voiceprint of the user.
  • the identification method disclosed in TWN490655 must calculate degrees of matches and distances of gaps for the patterns of sound spectrums.
  • a user can pass the random voiceprint certification system if the calculated distance of gaps does not exceed in the boundaries.
  • there is only one reference sample used by this system so that it is easy for illegal invasion such as proceeding by playing an illegal pre-record voiceprint data.
  • TWN490655 needs further improvement to solve a problem caused by the said single reference sample so the random voiceprint certification system can avoid illegal invasion and enhance security of the random voiceprint certification system.
  • the primary objective of the present invention is to provide a random voiceprint certification system, random voiceprint cipher lock and creating method therefor.
  • a random voiceprint certification system By the random combination of several voiceprint characteristic units, at least one reference voiceprint password is formed.
  • the random voiceprint certification system operation can be carried out and the reliability of the random voiceprint certification system can be improved.
  • the random voiceprint certification system comprises a training system, a random cipher generator, and a testing system.
  • the input raw voice data can be dealt in the training or testing operation.
  • the training system obtains an appointment voiceprint feature model parameter groups from input raw voice data.
  • a plurality of voiceprint characteristic units can be obtained from the appointment voiceprint feature model parameter groups.
  • at least one reference voiceprint password can be obtained to provide the testing system processing the voice testing operation.
  • the random cipher generator generates randomly at least one reference voiceprint password from the appointment voiceprint feature model parameter groups of the voiceprint characteristic units to build the random voiceprint cipher lock.
  • the testing voice data is relative to the reference voiceprint passwords for the requirement of the testing system so that the voice testing operation can be completed.
  • the random voiceprint certification system further comprises a front-end processing portion, and a feature-retrieving portion.
  • the training system retrieved the effective voice data by the front-end processing portion on the input raw voice data.
  • the feature-retrieving portion retrieves features from the effective voice data. The calculation is carried out on the effective voice data to get the most similar path as the appointment voiceprint feature model parameter groups.
  • the testing voice operation the testing system retrieved the effective voice data by the front-end processing portion on the input raw voice data.
  • the feature-retrieving portion retrieves features from the effective voice data. The calculation is carried out for the similar probability between the testing voice feature and the model parameter to output a verification result.
  • the random voiceprint cipher lock comprises a plurality of the voiceprint characteristic units.
  • voiceprint characteristic units By the random combination of voiceprint characteristic units, one or several reference voiceprint passwords are built.
  • reference voiceprint passwords By one or several reference voiceprint passwords, the random voiceprint cipher lock is set up.
  • the testing voice data required by the random voiceprint cipher lock is relative to the reference voiceprint passwords so that the voice testing operation can be completed.
  • the procedures of the random voiceprint cipher lock and creating method therefor of the present invention includes input an input raw voice data; the appointment voiceprint feature model parameter groups can be obtained from the input raw voice data. From the appointment voiceprint feature model parameter groups, the several voiceprint characteristic units are obtained. By one or the several voiceprint characteristic units to built at least one reference voiceprint password, the random voiceprint cipher lock is provided.
  • FIG. 1 is a flowchart of the random voiceprint cipher lock and creating method therefor in accordance with the present invention
  • FIG. 2 is a processing block diagram of the random voiceprint certification system in accordance with the present invention.
  • FIG. 3A is a processing block diagram of the random voiceprint certification system carrying out the voice training operation in accordance with the present invention
  • FIG. 3B is a processing block diagram of the random voiceprint certification system carrying out the voice training operation in accordance with the present invention.
  • FIG. 4A is a relationship diagram illustrating the energy and the frames of the random voiceprint certification system employing short-time energy and zero-crossing rate carrying out endpoint detection in accordance with the present invention
  • FIG. 4B is a partial magnification diagram in accordance with the present invention.
  • FIG. 5A is a relationship diagram illustrating the energy and the frames of the random voiceprint certification system employing the entropy-based algorithm carrying out endpoint testing in accordance with the present invention
  • FIG. 5B is a partial magnification diagram in accordance with the present invention.
  • FIG. 6A is a schematic diagram illustrating the energy timing order of inputting ten voices of the random voiceprint certification system in accordance with the present invention
  • FIG. 6B is a relationship diagram illustrating the energy and the frames of the random voiceprint certification system employing the entropy-based algorithm carrying out endpoint detection according to FIG. 6A shown input voice in accordance with the present invention
  • FIG. 6C is a frames matrix diagram of the random voiceprint certification system employing FIG. 6B shown by endpoint detection to decide selection of the frames in accordance with the present invention
  • FIG. 7 is a schematic diagram illustrating the energy timing order of the random voiceprint certification system by after the completion of cutting point operation, the recombination is carried out or not in accordance with the present invention
  • FIG. 8 is a relationship diagram illustrating the status and the frames of the random voiceprint certification system in accordance with the present invention.
  • FIG. 9 is a schematic diagram illustrating initial distribution models of the status and the frames of the random voiceprint certification system in accordance with the present invention.
  • FIG. 10 is a schematic diagram illustrating status conversion of the random voiceprint certification system in accordance with the present invention.
  • FIG. 11 is a schematic diagram illustrating a most similar path of the random voiceprint certification system in accordance with the present invention.
  • FIG. 12 is a schematic diagram illustrating the equal distinction of the frames of the random voiceprint certification system in accordance with the present invention.
  • FIG. 13 is a schematic diagram illustrating a first redistribution of the frames of the random voiceprint certification system in accordance with the present invention
  • FIG. 14 is a schematic diagram illustrating a second redistribution of the frames of the random voiceprint certification system in accordance with the present invention.
  • FIG. 15 is a schematic diagram illustrating an optimal distribution of the frames of the random voiceprint certification system in accordance with the present invention.
  • FIG. 16 is a schematic diagram illustrating the random cipher generator generating randomly the reference voiceprint passwords of the random voiceprint certification system in accordance with the present invention
  • a random voiceprint cipher lock and creating method thereof in accordance with the present invention comprises three main procedures to build the random voiceprint cipher lock. Furthermore, in general application, the random voiceprint cipher lock and creating method thereof can be applied on the personal electronic data, bank business transactions, and security system of personal recognition.
  • the said three main procedures for the random voiceprint cipher lock and of the creating method includes: a procedure of obtaining input raw voice data S 1 , a procedure of generating appointment voiceprint feature model parameter groups S 2 , and a procedure of generating the random voiceprint cipher lock S 3 .
  • the random voiceprint certification system in accordance with the present invention comprises a training system 10 , a random cipher generator 20 , and a testing system 30 . Therefore, a training or testing operation can be carried out for the input raw voice data.
  • the random voiceprint certification system in accordance with the present invention further comprises: an A/D converter, a voice detector, a front-end processing portion, and a feature-retrieving portion, which are employed to carry out the voice training operation and the voice testing operation.
  • the most important procedure of the voice training operation and the voice testing operation of the present invention is to search position of the voice signal from the input raw voice data, that is, to identify the starting point and terminal point of the voice signal.
  • the better searching method for correct position of the starting point and the terminal point is “endpoint detection”, which includes algorithm based on short-time energy and zero-crossing rate, and entropy-based algorithm.
  • the endpoint detection is firstly carried out in accordance with the energy of the input raw voice data. Additionally, in accordance with the energy of the input raw voice data still, the endpoint detection can be further processed by the entropy-based algorithm. Furthermore, by a cutting operation, the silence and the noise of the input raw voice data can be removed. Finally, After completing the cutting operation, the cut input raw voice data further go through compression and arrangement to obtain the effective voice data.
  • a result of the endpoint detection completed by the algorithm based on both short-time energy and zero-crossing rate shown by a relationship chart of energy vs. frame is illustrated. From the relationship chart of energy vs. frame, several endpoints can be determined. Finally, the random voiceprint certification system can determine each starting point and each terminal point of the input raw voice data.
  • the random voiceprint certification system can also determine each starting point and each terminal point of the input raw voice data through the entropy-based algorithm.
  • the random voiceprint certification system of the present invention can avoid false rejection and false acceptance and thus provide the random voiceprint certification system with good performance in recognition.
  • the random voiceprint certification system of the present invention can prevent the effective voice data from being determined as silence or noise, so as to avoid false rejection; on the other hand, it can also prevent silence and noise from being determined as the effective voice data, so as to avoid false acceptance.
  • an example shows how to process the operation of the endpoint detection by sequentially pronouncing numbers 0 to 9 as the input raw voice data and inputting the input raw voice data into the random voiceprint certification system of the present invention.
  • ten voice signals can be obtained, with each voice signal presenting a sound of one of the pronounced numbers and having a starting point and a terminal point.
  • ten frames which respectively represent the voice signals, can be decided.
  • the operation of the cutting points and a recombination can be processed after determining these frames.
  • a feature value can be obtained from each frame.
  • the operation of the recombination of the frames can be carried out.
  • the random voiceprint certification system can carry out the voiceprint recognition.
  • the random voiceprint certification system 1 checks up the database whether the input account number has been registered. If the account number has not been registered, the procedure is automatically moved to the training system 10 for training and registering voice data for a new account number. But, if the account number has been registered, the procedure is automatically moved to the testing system 30 for verifying whether the features of the input voice match those stored in the account number. At this time, the random cipher generator 20 can randomly generate one or several reference voiceprint passwords, and thus the random voiceprint certification system 1 is completely set up to form the random voiceprint cipher lock for the testing system 30 to process the voice testing operation.
  • a creating method therefor is provided, which preliminary processes the front part of the voice training operation to carry out a step of obtaining the input raw voice data, designated as “S 1 ”. Therefore, users can directly input voice to a voice detector and then can complete the step S 1 for obtaining the input raw voice data. Referring to FIG. 3A , after complete the step S 1 for obtaining the input raw voice data, the input raw voice data can be employed for the next step.
  • the random voiceprint cipher lock and creating method therefor carry out the a step of generating the appointment voiceprint feature model parameter groups, designated as “S 2 ”.
  • the appointment voiceprint feature model parameter groups will be stored in the random voiceprint certification system 1 .
  • the appointment voiceprint feature model parameter groups can be further employed for the next step.
  • the input raw voice data provided by a user for the random voiceprint certification system 1 need to be corresponding to a specific voice sequence, such as the pronunciations of “0, 1, 2, 3, 4, 5, 6, 7, 8, 9, A, B, C, D, E, F, G, H, I, J, K . . . ” etc. Therefore, the training system 10 can generate the appointment voiceprint feature model parameter groups including a plurality of the voiceprint characteristic units.
  • the voiceprint characteristic units relate with 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, A, B, C, D, E, F, G, H, I, J, K . . . etc. one by one.
  • the random cipher generator 20 of the random voiceprint certification system 1 randomly generates at least one reference voiceprint password.
  • the voice testing operation can be carried out according to the at least one reference voiceprint password, which is selected by the random cipher generator 20 .
  • the front-end processing portion retrieves the effective voice data from the input raw voice data and filters non-effective voice data.
  • the short-time energy and zero-crossing rate are employed in the present invention for the endpoint detection.
  • a calculating method combining Gauss possibility distribution is employed, and the equation is as follows:
  • x is the original signal that is divided into a plurality of frames in D-dimension
  • ⁇ i is the expectation value of the background noise signal
  • Equation (2) is simplified and rewritten into equation (3) after obtaining its logarithm.
  • the first 256 points of the front portion of the raw voice data are extracted to calculate the expectation value, variance of the short-time energy and zero-crossing rate.
  • the two values and the raw voice data are substituted into equation (3) for calculation purposes. Since the distributive possibility area of the short-time energy and zero-crossing rate includes the effective voice data and the non-effective voice data, the non-effective voice data can be removed to reduce the amount of data while allowing correct retrieval of the effective voice data.
  • the feature-retrieving portion retrieves voice features from the input voice data
  • the parameters include linear prediction cepstrum coefficient (LPCC) and Mel frequency cepstrum coefficient (MFCC).
  • LPCC linear prediction cepstrum coefficient
  • MFCC Mel frequency cepstrum coefficient
  • Each of the parameters includes twelve cepstral coefficients and twelve delta-cepstral coefficients. Equation (4) is obtained after carrying out partial differentiation on the cepstral coefficients with respect to time:
  • K is the number of considered frames.
  • Cn is the feature value in n-th order
  • L is the total number of the frames in the signal
  • i is the serial number of the frames.
  • FIG. 4 is a schematic diagram illustrating relationship between statuses and frames of the random voiceprint certification system in accordance with the present invention.
  • the term “status” means the change in the mouth shape and the vocal band. Generally, a speaker's mouth has changes in shape while speaking. Thus, each status is the feature of the change of the voice. In some cases, a single sound contains several statuses. The size of the respective status is not fixed like the frame. A status usually includes several or tens of frames.
  • the first status includes three frames
  • the second status includes six frames
  • the third status includes four frames.
  • FIG. 9 is a schematic diagram illustrating initial distribution models of the statuses and the frames of the random voiceprint certification system in accordance with the present invention. For example, three sample voices are equally divided in an initial distribution model.
  • the voices are equally divided for forming frames
  • the residual frame if any, is equally divided into two groups and the result is added into each of the first status and the last status.
  • three factors must be considered in the distribution model: (1) the first frame must belong to the first status, (2) the last frame must belong to the last status, and (3) the status in the frame either remains unchanged or the change of the status in the frame continues to the next one.
  • Gauss distribution possibility is employed to calculate the possibility of each frame of each state, and Viterbi algorithm is employed to obtain the most similar path.
  • FIG. 10 is a schematic diagram illustrating status conversion of the random voiceprint certification system in accordance with the present invention.
  • FIG. 10 shows the possible conversion of the statuses of frames (the number of which is L) when three statuses are involved.
  • the crossed frame is deemed as an impossible status, and the directions indicated by the arrows are the possible paths of the change of the statuses.
  • FIG. 11 is a schematic diagram illustrating a most similar path of the random voiceprint certification system in accordance with the present invention.
  • the most similar path includes a first status having the first, the second, and the third frames, a second status having the fourth, the fifth, and the sixth frames, and a third status having the seventh, the eight, the ninth, and the tenth frames.
  • FIG. 12 is a schematic diagram illustrating division of the frames of the random voiceprint certification system in accordance with the present invention.
  • FIG. 12 shows initial models of three statuses of three sample voices, which are distributions after equal division.
  • the first sample voice is divided equally into three statuses each having three frames, and the residual two frames are divided equally and added into the first status and the second status respectively.
  • the second sample voice is divided equally into three statuses each having four frames.
  • the third sample voice is divided into three statuses each having three frames, and one residual frame is added into the first status. After calculation, the possibility of most similarity is 2156.
  • FIG. 13 is a schematic diagram illustrating a first redistribution of the frames of the random voiceprint certification system in accordance with the present invention. As illustrated in FIG. 13 , the possibility of most similarity has an increase to reach 3171 after the first redistribution.
  • FIG. 14 is a schematic diagram illustrating a second redistribution of the frames of the random voiceprint certification system in accordance with the present invention. As illustrated in FIG. 14 , the possibility most similarity has increase to reach 3571 after the second redistribution.
  • FIG. 15 is a schematic diagram illustrating an optimal distribution of the frames of the random voiceprint certification system in accordance with the present invention. As illustrated in FIG. 15 , the possibility of most similarity cannot be raised after the third distribution. Thus, it can be deemed as the most optimal frame distribution. The expectation value and the variance of each status are calculated to obtain the model parameters that can be stored in the database.
  • equations (1)-(9) are used to obtain the effective training voice features. Viterbi algorithm is then employed to obtain the most similar path. Next, the expectation value and variance of each status are calculated to obtain the appointment voiceprint feature model parameter groups, thereby completing the voice training operation.
  • the training for a user will be ended and rejected if the possibility of the most similarity is smaller than a predetermined threshold. Accordingly, a new training of the random voiceprint certification system 1 is required. Conversely, the training for a user will be approved and ended when the possibility of the most similarity is greater than the predetermined threshold.
  • the appointment voiceprint feature model parameter groups are stored in the random voiceprint certification system 1 .
  • the random voiceprint cipher lock and creating method therefor of the present invention process the procedure of generating the random voiceprint cipher lock S 3 .
  • the random cipher generator 20 randomly generates one or several sets of the reference voiceprint lock from the appointment voiceprint feature model parameter groups.
  • the random voiceprint certification system completes setting up and forms the random voiceprint cipher lock.
  • the random voiceprint cipher lock can provide the testing system 30 to carry out the voice testing operation.
  • the random cipher generator 20 can generate four columns of the reference voiceprint passwords.
  • the column A of the reference voiceprint passwords includes 1279, 2385, A1B2, 9F5U . . . etc.;
  • the column B of the reference voiceprint passwords includes 1357 . . . etc.;
  • the column C of the reference voiceprint passwords includes ABCD . . . etc.;
  • the column D of the reference voiceprint passwords includes 1234 . . . etc.
  • the random voiceprint certification system 1 processes the voice testing operation according to the selection of the reference voiceprint passwords by the random cipher generator.
  • the possibility of similarity between the testing voice features and the reference voiceprint passwords are calculated to obtain the verification result.
  • voice verification when the possibility of least similarity is greater than a predetermined threshold, the user can pass the testing and enter the random voiceprint certification system 1 . Conversely, when the possibility of least similarity is smaller than the predetermined threshold, the testing of the user is failed and ended to exit the random voiceprint certification system 1 .
  • the random voiceprint certification system 1 of the present invention has the random cipher generator 20 to randomly generate one or several reference voiceprint passwords. Therefore, the random voiceprint certification system 1 completes setting and forms the random voiceprint cipher lock. The effect of preventing easy illegal invasion can be achieved.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Security & Cryptography (AREA)
  • Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Biodiversity & Conservation Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • General Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Computer Hardware Design (AREA)
  • Software Systems (AREA)
  • Game Theory and Decision Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)
  • Lock And Its Accessories (AREA)
  • Collating Specific Patterns (AREA)
US12/519,982 2006-12-07 2007-12-06 Random voiceprint certification system, random voiceprint cipher lock and creating method therefor Abandoned US20100017209A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN200610161138.5A CN101197131B (zh) 2006-12-07 2006-12-07 随机式声纹密码验证系统、随机式声纹密码锁及其产生方法
PCT/CN2007/071200 WO2008083571A1 (fr) 2006-12-07 2007-12-06 Système de certification de cryptage aléatoire d'empreinte vocale, verrou de cryptage aléatoire d'empreinte vocale, et procédé de création associé à ceux-ci

Publications (1)

Publication Number Publication Date
US20100017209A1 true US20100017209A1 (en) 2010-01-21

Family

ID=39547491

Family Applications (1)

Application Number Title Priority Date Filing Date
US12/519,982 Abandoned US20100017209A1 (en) 2006-12-07 2007-12-06 Random voiceprint certification system, random voiceprint cipher lock and creating method therefor

Country Status (5)

Country Link
US (1) US20100017209A1 (de)
EP (1) EP2120232A4 (de)
CN (1) CN101197131B (de)
RU (1) RU2009125695A (de)
WO (1) WO2008083571A1 (de)

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070220708A1 (en) * 2006-03-26 2007-09-27 Chatsworth Products, Inc. Indexing hinge
US20090187405A1 (en) * 2008-01-18 2009-07-23 International Business Machines Corporation Arrangements for Using Voice Biometrics in Internet Based Activities
US20100292988A1 (en) * 2009-05-13 2010-11-18 Hon Hai Precision Industry Co., Ltd. System and method for speech recognition
US20120226495A1 (en) * 2011-03-03 2012-09-06 Hon Hai Precision Industry Co., Ltd. Device and method for filtering out noise from speech of caller
US20130166296A1 (en) * 2011-12-21 2013-06-27 Nicolas Scheffer Method and apparatus for generating speaker-specific spoken passwords
US8843369B1 (en) * 2013-12-27 2014-09-23 Google Inc. Speech endpointing based on voice profile
CN105225664A (zh) * 2015-09-24 2016-01-06 百度在线网络技术(北京)有限公司 信息验证方法和装置及声音样本的生成方法和装置
JP2017010511A (ja) * 2015-06-25 2017-01-12 バイドゥ オンライン ネットワーク テクノロジー (ベイジン) カンパニー リミテッド 声紋認証方法および装置
US9564134B2 (en) * 2011-12-21 2017-02-07 Sri International Method and apparatus for speaker-calibrated speaker detection
US9607613B2 (en) 2014-04-23 2017-03-28 Google Inc. Speech endpointing based on word comparisons
US20170244701A1 (en) * 2014-11-07 2017-08-24 Baidu Online Network Technology (Beijing) Co., Ltd. Voiceprint verification method, apparatus, storage medium and device
US10269341B2 (en) 2015-10-19 2019-04-23 Google Llc Speech endpointing
US20190122669A1 (en) * 2016-06-01 2019-04-25 Baidu Online Network Technology (Beijing) Co., Ltd. Methods and devices for registering voiceprint and for authenticating voiceprint
US20190130084A1 (en) * 2017-10-31 2019-05-02 Baidu Usa Llc Authentication method, electronic device, and computer-readable program medium
TWI678635B (zh) * 2018-01-19 2019-12-01 遠傳電信股份有限公司 聲紋認證方法及其電子裝置
US10593352B2 (en) 2017-06-06 2020-03-17 Google Llc End of query detection
US10629209B2 (en) * 2017-02-16 2020-04-21 Ping An Technology (Shenzhen) Co., Ltd. Voiceprint recognition method, device, storage medium and background server
US10929754B2 (en) 2017-06-06 2021-02-23 Google Llc Unified endpointer using multitask and multidomain learning
US11062696B2 (en) 2015-10-19 2021-07-13 Google Llc Speech endpointing
US20210256104A1 (en) * 2018-09-12 2021-08-19 Maxell, Ltd. Information processing apparatus, user authentication network system, and user authentication method
CN114185304A (zh) * 2021-12-07 2022-03-15 城市花园(北京)环境科技有限公司 一种基于声纹控制智能设备开闭系统

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102254559A (zh) * 2010-05-20 2011-11-23 盛乐信息技术(上海)有限公司 基于声纹的身份认证系统及方法
CN102314877A (zh) * 2010-07-08 2012-01-11 盛乐信息技术(上海)有限公司 字符内容提示的声纹识别方法
CN102005070A (zh) * 2010-11-17 2011-04-06 广东中大讯通信息有限公司 一种语音识别门禁系统
CN103366745B (zh) * 2012-03-29 2016-01-20 三星电子(中国)研发中心 基于语音识别保护终端设备的方法及其终端设备
CN102916815A (zh) * 2012-11-07 2013-02-06 华为终端有限公司 用户身份验证的方法和装置
CN104598790A (zh) * 2013-10-30 2015-05-06 鸿富锦精密工业(深圳)有限公司 手持装置解锁系统、方法及手持装置
CN104503677A (zh) * 2014-12-19 2015-04-08 上海电机学院 一种屏幕解锁方法及相应电子设备
CN104616655B (zh) 2015-02-05 2018-01-16 北京得意音通技术有限责任公司 声纹模型自动重建的方法和装置
CN105991288B (zh) * 2015-03-06 2019-07-30 科大讯飞股份有限公司 声纹密码文本生成方法及系统
NO344910B1 (en) * 2016-01-12 2020-06-29 Kk88 No As Device for verifying the identity of a person
CN105632489A (zh) * 2016-01-20 2016-06-01 曾戟 一种语音播放方法和装置

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5995927A (en) * 1997-03-14 1999-11-30 Lucent Technologies Inc. Method for performing stochastic matching for use in speaker verification
US6161090A (en) * 1997-06-11 2000-12-12 International Business Machines Corporation Apparatus and methods for speaker verification/identification/classification employing non-acoustic and/or acoustic models and databases
US6356868B1 (en) * 1999-10-25 2002-03-12 Comverse Network Systems, Inc. Voiceprint identification system
US20020087319A1 (en) * 2001-01-04 2002-07-04 Stephenson Marc C. Portable electronic voice recognition device capable of executing various voice activated commands and calculations associated with aircraft operation by means of synthesized voice response
US20020104027A1 (en) * 2001-01-31 2002-08-01 Valene Skerpac N-dimensional biometric security system
US20050089172A1 (en) * 2003-10-24 2005-04-28 Aruze Corporation Vocal print authentication system and vocal print authentication program
US20060161435A1 (en) * 2004-12-07 2006-07-20 Farsheed Atef System and method for identity verification and management
US7308484B1 (en) * 2000-06-30 2007-12-11 Cisco Technology, Inc. Apparatus and methods for providing an audibly controlled user interface for audio-based communication devices
US20080126097A1 (en) * 2006-11-27 2008-05-29 Ashantiplc Limited Voice confirmation authentication for domain name transactions
US7421387B2 (en) * 2004-02-24 2008-09-02 General Motors Corporation Dynamic N-best algorithm to reduce recognition errors
US20100106503A1 (en) * 2008-10-24 2010-04-29 Nuance Communications, Inc. Speaker verification methods and apparatus

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5216720A (en) * 1989-05-09 1993-06-01 Texas Instruments Incorporated Voice verification circuit for validating the identity of telephone calling card customers
AU744678B2 (en) * 1997-10-15 2002-02-28 British Telecommunications Public Limited Company Pattern recognition using multiple reference models
TW490655B (en) 2000-12-27 2002-06-11 Winbond Electronics Corp Method and device for recognizing authorized users using voice spectrum information
CN1808567A (zh) * 2006-01-26 2006-07-26 覃文华 验证真人在场状态的声纹认证设备和其认证方法

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5995927A (en) * 1997-03-14 1999-11-30 Lucent Technologies Inc. Method for performing stochastic matching for use in speaker verification
US6161090A (en) * 1997-06-11 2000-12-12 International Business Machines Corporation Apparatus and methods for speaker verification/identification/classification employing non-acoustic and/or acoustic models and databases
US6356868B1 (en) * 1999-10-25 2002-03-12 Comverse Network Systems, Inc. Voiceprint identification system
US20020152078A1 (en) * 1999-10-25 2002-10-17 Matt Yuschik Voiceprint identification system
US7308484B1 (en) * 2000-06-30 2007-12-11 Cisco Technology, Inc. Apparatus and methods for providing an audibly controlled user interface for audio-based communication devices
US20020087319A1 (en) * 2001-01-04 2002-07-04 Stephenson Marc C. Portable electronic voice recognition device capable of executing various voice activated commands and calculations associated with aircraft operation by means of synthesized voice response
US20020104027A1 (en) * 2001-01-31 2002-08-01 Valene Skerpac N-dimensional biometric security system
US20050089172A1 (en) * 2003-10-24 2005-04-28 Aruze Corporation Vocal print authentication system and vocal print authentication program
US7421387B2 (en) * 2004-02-24 2008-09-02 General Motors Corporation Dynamic N-best algorithm to reduce recognition errors
US20060161435A1 (en) * 2004-12-07 2006-07-20 Farsheed Atef System and method for identity verification and management
US20080126097A1 (en) * 2006-11-27 2008-05-29 Ashantiplc Limited Voice confirmation authentication for domain name transactions
US20100106503A1 (en) * 2008-10-24 2010-04-29 Nuance Communications, Inc. Speaker verification methods and apparatus

Cited By (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070220708A1 (en) * 2006-03-26 2007-09-27 Chatsworth Products, Inc. Indexing hinge
US20090187405A1 (en) * 2008-01-18 2009-07-23 International Business Machines Corporation Arrangements for Using Voice Biometrics in Internet Based Activities
US8140340B2 (en) * 2008-01-18 2012-03-20 International Business Machines Corporation Using voice biometrics across virtual environments in association with an avatar's movements
US20100292988A1 (en) * 2009-05-13 2010-11-18 Hon Hai Precision Industry Co., Ltd. System and method for speech recognition
US20120226495A1 (en) * 2011-03-03 2012-09-06 Hon Hai Precision Industry Co., Ltd. Device and method for filtering out noise from speech of caller
US9564134B2 (en) * 2011-12-21 2017-02-07 Sri International Method and apparatus for speaker-calibrated speaker detection
US9147400B2 (en) * 2011-12-21 2015-09-29 Sri International Method and apparatus for generating speaker-specific spoken passwords
US20130166296A1 (en) * 2011-12-21 2013-06-27 Nicolas Scheffer Method and apparatus for generating speaker-specific spoken passwords
US8843369B1 (en) * 2013-12-27 2014-09-23 Google Inc. Speech endpointing based on voice profile
US12051402B2 (en) 2014-04-23 2024-07-30 Google Llc Speech endpointing based on word comparisons
US11004441B2 (en) 2014-04-23 2021-05-11 Google Llc Speech endpointing based on word comparisons
US10546576B2 (en) 2014-04-23 2020-01-28 Google Llc Speech endpointing based on word comparisons
US9607613B2 (en) 2014-04-23 2017-03-28 Google Inc. Speech endpointing based on word comparisons
US11636846B2 (en) 2014-04-23 2023-04-25 Google Llc Speech endpointing based on word comparisons
US10140975B2 (en) 2014-04-23 2018-11-27 Google Llc Speech endpointing based on word comparisons
US20170244701A1 (en) * 2014-11-07 2017-08-24 Baidu Online Network Technology (Beijing) Co., Ltd. Voiceprint verification method, apparatus, storage medium and device
US10277589B2 (en) * 2014-11-07 2019-04-30 Baidu Online Network Technology (Beijing) Co., Ltd. Voiceprint verification method, apparatus, storage medium and device
US9792913B2 (en) 2015-06-25 2017-10-17 Baidu Online Network Technology (Beijing) Co., Ltd. Voiceprint authentication method and apparatus
JP2017010511A (ja) * 2015-06-25 2017-01-12 バイドゥ オンライン ネットワーク テクノロジー (ベイジン) カンパニー リミテッド 声紋認証方法および装置
CN105225664A (zh) * 2015-09-24 2016-01-06 百度在线网络技术(北京)有限公司 信息验证方法和装置及声音样本的生成方法和装置
US11062696B2 (en) 2015-10-19 2021-07-13 Google Llc Speech endpointing
US10269341B2 (en) 2015-10-19 2019-04-23 Google Llc Speech endpointing
US11710477B2 (en) 2015-10-19 2023-07-25 Google Llc Speech endpointing
US11348590B2 (en) * 2016-06-01 2022-05-31 Baidu Online Network Technology (Beijing) Co., Ltd. Methods and devices for registering voiceprint and for authenticating voiceprint
US20190122669A1 (en) * 2016-06-01 2019-04-25 Baidu Online Network Technology (Beijing) Co., Ltd. Methods and devices for registering voiceprint and for authenticating voiceprint
US10629209B2 (en) * 2017-02-16 2020-04-21 Ping An Technology (Shenzhen) Co., Ltd. Voiceprint recognition method, device, storage medium and background server
US10593352B2 (en) 2017-06-06 2020-03-17 Google Llc End of query detection
US10929754B2 (en) 2017-06-06 2021-02-23 Google Llc Unified endpointer using multitask and multidomain learning
US11676625B2 (en) 2017-06-06 2023-06-13 Google Llc Unified endpointer using multitask and multidomain learning
US11551709B2 (en) 2017-06-06 2023-01-10 Google Llc End of query detection
US10936705B2 (en) * 2017-10-31 2021-03-02 Baidu Usa Llc Authentication method, electronic device, and computer-readable program medium
US20190130084A1 (en) * 2017-10-31 2019-05-02 Baidu Usa Llc Authentication method, electronic device, and computer-readable program medium
CN109726536A (zh) * 2017-10-31 2019-05-07 百度(美国)有限责任公司 鉴权方法、电子设备和计算机可读程序介质
TWI678635B (zh) * 2018-01-19 2019-12-01 遠傳電信股份有限公司 聲紋認證方法及其電子裝置
EP3851985A4 (de) * 2018-09-12 2022-04-20 Maxell, Ltd. Informationsverarbeitungsvorrichtung, netzwerksystem für benutzerauthentifizierung und benutzerauthentifizierungsverfahren
US20210256104A1 (en) * 2018-09-12 2021-08-19 Maxell, Ltd. Information processing apparatus, user authentication network system, and user authentication method
CN114185304A (zh) * 2021-12-07 2022-03-15 城市花园(北京)环境科技有限公司 一种基于声纹控制智能设备开闭系统

Also Published As

Publication number Publication date
WO2008083571A1 (fr) 2008-07-17
CN101197131A (zh) 2008-06-11
RU2009125695A (ru) 2011-01-20
EP2120232A4 (de) 2011-01-19
EP2120232A1 (de) 2009-11-18
CN101197131B (zh) 2011-03-30

Similar Documents

Publication Publication Date Title
US20100017209A1 (en) Random voiceprint certification system, random voiceprint cipher lock and creating method therefor
US7447632B2 (en) Voice authentication system
US10950245B2 (en) Generating prompts for user vocalisation for biometric speaker recognition
Naik Speaker verification: A tutorial
US9218809B2 (en) Fast, language-independent method for user authentication by voice
US9792912B2 (en) Method for verifying the identity of a speaker, system therefore and computer readable medium
US20060222210A1 (en) System, method and computer program product for determining whether to accept a subject for enrollment
US6496800B1 (en) Speaker verification system and method using spoken continuous, random length digit string
US6697779B1 (en) Combined dual spectral and temporal alignment method for user authentication by voice
JPH06175680A (ja) 最も近い隣接距離を使用した発声者確認装置
US20070038868A1 (en) Voiceprint-lock system for electronic data
US20060229879A1 (en) Voiceprint identification system for e-commerce
CN110767239A (zh) 一种基于深度学习的声纹识别方法、装置及设备
GB2388947A (en) Method of voice authentication
EP1760566A1 (de) Verschlüsserung von Daten mit Hilfe eines Stimmmusters
JP4440414B2 (ja) 話者照合装置及び方法
Jha et al. Analysis of Human Voice for Speaker Recognition: Concepts and Advancement
Londhe et al. [Retracted] Extracting Behavior Identification Features for Monitoring and Managing Speech‐Dependent Smart Mental Illness Healthcare Systems
Gupta et al. Text dependent voice based biometric authentication system using spectrum analysis and image acquisition
US7162641B1 (en) Weight based background discriminant functions in authentication systems
JP2001350494A (ja) 照合装置及び照合方法
Hajipour et al. Listening to sounds of silence for audio replay attack detection
Mandalapu et al. Smartphone audio replay attacks dataset
TW200816166A (en) Randomized voiceprint identification system, random voiceprint lock and creating method therefor
Aloufi et al. On-Device Voice Authentication with Paralinguistic Privacy

Legal Events

Date Code Title Description
AS Assignment

Owner name: TOP DIGITAL CO., LTD.,TAIWAN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:YU, KUN-LANG;OUYANG, YEN-CHIEH;REEL/FRAME:022847/0200

Effective date: 20090508

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION