CN109243465A

CN109243465A - Voiceprint authentication method, device, computer equipment and storage medium

Info

Publication number: CN109243465A
Application number: CN201811487395.7A
Authority: CN
Inventors: 彭俊清; 王健宗; 肖京
Original assignee: Ping An Technology Shenzhen Co Ltd
Current assignee: Ping An Technology Shenzhen Co Ltd
Priority date: 2018-12-06
Filing date: 2018-12-06
Publication date: 2019-01-18

Abstract

The embodiment of the invention discloses a kind of voiceprint authentication method, device, computer equipment and storage mediums, wherein the described method includes: pre-establishing sound-groove model library；The voice print database to be certified of user's input is obtained, and extracts corresponding vocal print feature to be certified from the voice print database to be certified；Target sound-groove model is determined from the sound-groove model library according to the vocal print feature to be certified；Calculate the vocal print similarity between the vocal print feature to be certified and target sound-groove model；It obtains the vocal print similarity and judges whether the vocal print similarity is greater than preset threshold；If the vocal print similarity is greater than the preset threshold, determine that the corresponding user identity of the voice print database to be certified matches with target user's identity, and determine that the user identity authentication passes through.The present invention provides a kind of sound groove recognition technology in e, can be improved authentication efficiency, shortens the response time of certification.

Description

Voiceprint authentication method, device, computer equipment and storage medium

Technical field

The present invention relates to field of computer technology more particularly to a kind of voiceprint authentication method, device, computer equipment and Storage medium.

Background technique

Application on Voiceprint Recognition (Voiceprint Recognize) is one and is spoken human physiology and row according to reflection in speech waveform The speech parameter being characterized, the technology of automatic identification speaker's identity.With the development of sound groove recognition technology in e, more and more set Standby that authentication is carried out using Application on Voiceprint Recognition, traditional technology that authentication is carried out using Application on Voiceprint Recognition is a small amount of in processing Voice print database when can guarantee the efficiency of certification, but for a certain number of voice print databases, low, response that there are authentication efficiencies The problem of time length.

Summary of the invention

It is situated between in view of this, the embodiment of the present invention provides a kind of voiceprint authentication method, device, computer equipment and storage Matter can be improved authentication efficiency, shorten the response time of certification.

On the one hand, the embodiment of the invention provides a kind of voiceprint authentication methods, this method comprises:

Pre-establish sound-groove model library；

The voice print database to be certified of user's input is obtained, and is extracted from the voice print database to be certified corresponding to be certified Vocal print feature；

Target sound-groove model is determined from the sound-groove model library according to the vocal print feature to be certified；

Calculate the vocal print similarity between the vocal print feature to be certified and target sound-groove model；

It obtains the vocal print similarity and judges whether the vocal print similarity is greater than preset threshold；

If the vocal print similarity is greater than the preset threshold, the corresponding user identity of the voice print database to be certified is determined Match with target user's identity, and determines that the user identity authentication passes through.

On the other hand, the embodiment of the invention provides a kind of voiceprint authentication apparatus, described device includes:

Unit is established, for pre-establishing sound-groove model library；

First extraction unit, for obtaining the voice print database to be certified of user's input, and from the voice print database to be certified It is middle to extract corresponding vocal print feature to be certified；

Determination unit, for determining target vocal print mould from the sound-groove model library according to the vocal print feature to be certified Type；

Computing unit, for calculating the vocal print similarity between the vocal print feature to be certified and target sound-groove model；

Judging unit, for obtaining the vocal print similarity and judging whether the vocal print similarity is greater than preset threshold；

Judging unit determines the voice print database to be certified if being greater than the preset threshold for the vocal print similarity Corresponding user identity matches with target user's identity, and determines that the user identity authentication passes through.

Another aspect the embodiment of the invention also provides a kind of computer equipment, including memory, processor and is stored in On the memory and the computer program that can run on the processor, when the processor executes the computer program Realize voiceprint authentication method as described above.

It is described computer-readable to deposit in another aspect, the embodiment of the invention also provides a kind of computer readable storage medium Storage media is stored with one or more than one computer program, and the one or more computer program can be by one Or more than one processor executes, to realize voiceprint authentication method as described above.

The embodiment of the present invention provides a kind of voiceprint authentication method, device, computer equipment and storage medium, wherein method It include: to pre-establish sound-groove model library；The voice print database to be certified of user's input is obtained, and from the voice print database to be certified Extract corresponding vocal print feature to be certified；Target vocal print is determined from the sound-groove model library according to the vocal print feature to be certified Model；Calculate the vocal print similarity between the vocal print feature to be certified and target sound-groove model；Obtain the vocal print similarity And judge whether the vocal print similarity is greater than preset threshold；If the vocal print similarity is greater than the preset threshold, institute is determined It states the corresponding user identity of voice print database to be certified to match with target user's identity, and determines that the user identity authentication is logical It crosses.The present invention provides a kind of sound groove recognition technology in e, can be improved authentication efficiency, shortens the response time of certification.

Detailed description of the invention

Technical solution in order to illustrate the embodiments of the present invention more clearly, below will be to needed in embodiment description Attached drawing is briefly described, it should be apparent that, drawings in the following description are some embodiments of the invention, general for this field For logical technical staff, without creative efforts, it is also possible to obtain other drawings based on these drawings.

Fig. 1 is a kind of application scenarios schematic diagram of voiceprint authentication method provided in an embodiment of the present invention；

Fig. 2 is a kind of schematic flow diagram of voiceprint authentication method provided in an embodiment of the present invention；

Fig. 3 is a kind of another schematic flow diagram of voiceprint authentication method provided in an embodiment of the present invention；

Fig. 4 is a kind of another schematic flow diagram of voiceprint authentication method provided in an embodiment of the present invention；

Fig. 5 is a kind of schematic block diagram of voiceprint authentication apparatus provided in an embodiment of the present invention；

Fig. 6 is a kind of another schematic block diagram of voiceprint authentication apparatus provided in an embodiment of the present invention；

Fig. 7 is a kind of another schematic block diagram of voiceprint authentication apparatus provided in an embodiment of the present invention；

Fig. 8 is a kind of structure composition schematic diagram of computer equipment provided in an embodiment of the present invention.

Specific embodiment

Following will be combined with the drawings in the embodiments of the present invention, and technical solution in the embodiment of the present invention carries out clear, complete Site preparation description, it is clear that described embodiments are some of the embodiments of the present invention, instead of all the embodiments.Based on this hair Embodiment in bright, every other implementation obtained by those of ordinary skill in the art without making creative efforts Example, shall fall within the protection scope of the present invention.

It should be appreciated that ought use in this specification and in the appended claims, term " includes " and "comprising" instruction Described feature, entirety, step, operation, the presence of element and/or component, but one or more of the other feature, whole is not precluded Body, step, operation, the presence or addition of element, component and/or its set.

It is also understood that mesh of the term used in this description of the invention merely for the sake of description specific embodiment And be not intended to limit the present invention.As description of the invention and it is used in the attached claims, unless on Other situations are hereafter clearly indicated, otherwise " one " of singular, "one" and "the" are intended to include plural form.

It will be further appreciated that the term "and/or" used in description of the invention and the appended claims is Refer to any combination and all possible combinations of one or more of associated item listed, and including these combinations.

Fig. 1 and Fig. 2 are please referred to, Fig. 1 is a kind of application scenarios signal of voiceprint authentication method provided in an embodiment of the present invention Figure, Fig. 2 are a kind of flow diagram of voiceprint authentication method provided in an embodiment of the present invention.The voiceprint authentication method is applied to clothes It is engaged in device or terminal, wherein terminal can be smart phone, tablet computer, laptop, desktop computer, personal digital assistant Electronic equipment with wearable device etc. with communication function.As an application, as shown in Figure 1, the voiceprint authentication method application In server 10, which can be a server in Distributed Services platform, which executes vocal print Identification/certification instruction, and by implementing result feedback in terminal 20.

It should be noted that only illustrate a terminal 20 in Fig. 1, in the actual operation process, server 10 can be with Voiceprint result is fed back in more terminals 20.

Referring to Fig. 2, Fig. 2 is a kind of schematic flow diagram of voiceprint authentication method provided in an embodiment of the present invention.Such as Fig. 2 institute Show, this approach includes the following steps S101~S106.

S101 pre-establishes sound-groove model library.

In embodiments of the present invention, different particular by collecting several sections of voice data in advance as training speech samples Voice data correspond to different user identity, utilize GMM model (Gaussian Mixture Model, gauss hybrid models) Training speech samples are trained, obtain the sound-groove model for different user identity, and by obtained different vocal print moulds Type forms default sound-groove model library；More specifically, it presets and preserves trained speech samples in sound-groove model library and correspond to vocal print at this Model, the corresponding vocal print feature of training speech samples and different user identity corresponding from training speech samples.Wherein, it uses Family identity can be identified by identity ID, the user that different identity ID can be different with unique identification.

Further, as shown in figure 3, the step S101 includes step S202~S208.

S202, at least two training speech samples of the acquisition for model training.

In embodiments of the present invention, user can be passed through by the microphone typing voice data in terminal 20, server 10 With terminal 20 and obtain voice data that user inputted as training speech samples, specifically, at least need two it is different User's typing two different voice data are as training speech samples.

S204 pre-processes each each trained speech samples.

In embodiments of the present invention, it before extracting each trained speech samples, needs to carry out each trained speech samples pre- Processing, because of mankind's phonatory organ itself and the aliasing as brought by the equipment of acquisition voice data, higher hamonic wave distortion, height Frequency etc. factor, the influence to quality of speech signal, by pre-processing the voice signal that can be obtained more evenly, smoothly, and The low-frequency disturbance in voice data can be removed, good characteristic parameter is provided for the extraction of vocal print feature, improves speech processes Quality.

It is divided into sample quantization specifically, pre-process to each trained speech samples, goes drift, preemphasis and adding window Four steps.Wherein, sample quantization is handled, and refers to being filtered with voice signal of the sharp filter to training speech samples Wave makes its nyquist frequency FN 4KHZ；Drift is gone, refers to the average value for calculating the amplitude sequence of quantization, and will be each Amplitude subtracts average value, obtains the amplitude sequence that average value is 0 after drift；Preemphasis refers to setting digital filter Transmission function in pre emphasis factor, and the high, medium and low comparable vibration of frequency amplitude of voice signal is obtained by digital filter Width sequence；Adding window is referred to carrying out each speech signal frame using Hamming window function plus hamming code window is handled.

S206 extracts the vocal print feature of pretreated each trained speech samples.

In embodiments of the present invention, so-called vocal print (Voiceprint) is the carrying speech letter that electricity consumption acoustic instrument is shown The sound wave spectrum of breath.The generation of human language is a complicated physiology physics mistake between Body Languages maincenter and vocal organs Journey, the phonatory organ (such as: tongue, tooth, larynx, lung, nasal cavity) that people uses in speech in terms of size and form everyone It is widely different, so the voiceprint map of any two people is all variant.Different speak can be recognized and confirmed by vocal print People.The vocal print feature includes one of acoustic feature, lexical characteristics, prosodic features and language feature or a variety of；This reality Example is applied and realizes based on the vocal print feature of different trained speech samples the training of different sound-groove models, and based on different vocal prints The determination of trained objectives sound-groove model also may be implemented in feature, it is therefore desirable to extract from training speech samples in advance Out the vocal print feature of different trained speech samples and correspondence saved；Specifically, being extracted from different trained speech samples It corresponding vocal print feature, in particular to is extracted from sound and selects there is the vocal print of speaker that separability is strong, stability is high Etc. characteristics acoustics, morphology, the rhythm or language feature, extraction in the prior art can be passed through for the extraction of vocal print feature Mode realizes that this is no longer described in detail in the embodiment of the present invention.

It should be noted that the vocal print feature extracted to training speech samples may be one, it is also possible to multiple.

Each vocal print feature of extraction is carried out gauss hybrid models training by S208, obtains different sound-groove models, and by The different sound-groove model forms the sound-groove model library.

In embodiments of the present invention, GMM model is carried out to the vocal print feature for belonging to a trained speech samples (Gaussian Mixture Model, gauss hybrid models) training is to establish corresponding sound-groove model.More specifically, can first really Surely belong to the corresponding speech signal frame of all vocal print features of the same trained speech samples, it later, can be to belonging to the same instruction All speech signal frames for practicing speech samples are trained to obtain corresponding GMM model, and by different training speech samples The different sound-groove models that are trained of vocal print feature form the sound-groove model library.

S102, obtains the voice print database to be certified of user's input, and extracts from the voice print database to be certified corresponding Vocal print feature to be certified.

In embodiments of the present invention, the voice print database to be certified of user's input refers to that unverified user passes through terminal 20 Microphone input for certification voice data.After detecting the voice print database to be certified of active user's input, it can mention Take the vocal print feature to be certified of current voice print database, wherein the vocal print feature to be certified include acoustic feature, lexical characteristics, One of prosodic features and language feature are a variety of.Vocal print feature, the i.e. anatomical structure with the pronunciation mechanism of the mankind have The feature of pass, including such as nasal sound, band deep breathing sound, hoarse sound, laugh, thus analysis is available such as frequency spectrum, cepstrum, resonance The features such as peak, fundamental tone, reflection coefficient；Lexical characteristics, i.e., the language influenced by socioeconomic status, education level, birthplace etc. The features such as justice, rhetoric, pronunciation, speech habit；Prosodic features, that is, the features such as the rhythm spoken, rhythm, speed, intonation, volume. The features such as language feature, i.e. languages, dialect, accent.

S103 determines target sound-groove model according to the vocal print feature to be certified from the sound-groove model library.

In embodiments of the present invention, using in extracted vocal print feature to be certified and the default sound-groove model library established The corresponding vocal print feature parameter of each sound-groove model compares, and the target sound-groove model is determined according to comparing result, specific During realization, it is contemplated that the sound of each individual subject may change within a certain period of time, in order to improve contrast effect, It is a predetermined threshold that comparison degree, which can be preset, when in extracted vocal print feature to be certified and default sound-groove model library When the comparison degree of the corresponding vocal print feature parameter of each sound-groove model is more than institute's scheduled threshold value, it is determined that the vocal print compared is special The corresponding sound-groove model of sign parameter is the vocal print feature extracted in target sound-groove model, such as current voice print database and default vocal print Vocal print feature parameter A matching degree in model library reaches 90% or more, that is, determines the corresponding vocal print mould of vocal print feature parameter A Type is target sound-groove model.

In one embodiment, as shown in figure 4, the step S103 includes step S302~S306.

S302 determines the quantity of extracted vocal print feature to be certified.

In embodiments of the present invention, extracted vocal print feature to be certified may be one in current voice print database, It may be multiple, it is thus necessary to determine that the quantity of extracted vocal print feature to be certified.

S304 determines the priority of each vocal print feature to be certified if extracted vocal print feature to be certified is multiple.

In embodiments of the present invention, when extracted vocal print feature to be certified be it is multiple when, it is thus necessary to determine that comparison wait recognize The priority of vocal print feature is demonstrate,proved, for example, the vocal print feature to be certified currently extracted includes acoustic feature, lexical characteristics, rhythm spy It levies, vocal print feature priority corresponding to acoustic feature is higher than lexical characteristics, vocal print feature priority corresponding to lexical characteristics Higher than prosodic features, select acoustic feature as the vocal print feature to be certified for comparison at this time.Vocal print feature knowledge can be improved Other accuracy.

S306 determines target vocal print mould according to the vocal print feature to be certified of highest priority from the sound-groove model library Type.

In embodiments of the present invention, the vocal print feature to be certified of highest priority and the default sound-groove model established are selected The corresponding vocal print feature parameter of each sound-groove model compares in library, when extracted vocal print feature to be certified and default vocal print mould When the comparison degree of the corresponding vocal print feature parameter of each sound-groove model is more than institute's scheduled threshold value in type library, it is determined that compared The corresponding sound-groove model of vocal print feature parameter is target sound-groove model.The accuracy of vocal print feature identification can be improved.

S104 calculates the vocal print similarity between the vocal print feature to be certified and target sound-groove model.

In embodiments of the present invention, the vocal print similarity between the vocal print feature to be certified and target sound-groove model is calculated Calculation method may include:

If the target sound-groove model has n vocal print feature, respectively to the vocal print feature to be certified and the mesh N vocal print feature for marking sound-groove model carries out vectorization, calculates the vocal print feature to be certified and institute by included angle cosine function The similarity between n vocal print feature of target sound-groove model is stated, and obtains similarity matrix K；Similarity matrix K is asked With to calculate the vocal print similarity between the vocal print feature to be certified and the target sound-groove model:

If K=[sim (x, y_i)]_n, i=1 ..., n, wherein sim (x, y_i) indicate vocal print feature to be certified and the target Similarity between n vocal print feature of sound-groove model, sum formula are as follows:

S=1 ..., n.

In one embodiment, if the vocal print feature to be certified be it is multiple, using K-means clustering algorithm calculate described in Vocal print similarity between the vocal print feature to be certified of highest priority and each vocal print feature of the target sound-groove model.

In one embodiment, if the vocal print feature to be certified is one, using described in the calculating of K-means clustering algorithm Vocal print similarity between vocal print feature to be certified and each vocal print feature of the target sound-groove model.

S105 obtains the vocal print similarity and judges whether the vocal print similarity is greater than preset threshold.

In embodiments of the present invention, the preset threshold for vocal print similarity comparison can be preset, when calculated The vocal print similarity between target sound-groove model in vocal print feature to be certified and default sound-groove model library is more than the default threshold When value, then by the certification of the vocal print feature to be certified, and user identity corresponding to the vocal print feature to be certified is determined Target user's identity corresponding with target sound-groove model matches and passes through authentication.

S106 determines the corresponding use of the voice print database to be certified if the vocal print similarity is greater than the preset threshold Family identity matches with target user's identity, and determines that the user identity authentication passes through.

In embodiments of the present invention, only just allow user's further operating after user identity is by certification, for example, Management is operated by the fund of the user of certification, typing user information or the personal information for modifying user etc. at the terminal.

As seen from the above, the embodiment of the present invention is by pre-establishing sound-groove model library；Obtain the sound to be certified of user's input Line data, and corresponding vocal print feature to be certified is extracted from the voice print database to be certified；It is special according to the vocal print to be certified Sign determines target sound-groove model from the sound-groove model library；It calculates between the vocal print feature to be certified and target sound-groove model Vocal print similarity；It obtains the vocal print similarity and judges whether the vocal print similarity is greater than preset threshold；If the sound Line similarity is greater than the preset threshold, determines the corresponding user identity of the voice print database to be certified and target user's identity phase Matching, and determine that the user identity authentication passes through.The present invention provides a kind of sound groove recognition technology in e, can be improved authentication effect Rate shortens the response time of certification.

Referring to Fig. 5, a kind of corresponding above-mentioned voiceprint authentication method, the embodiment of the present invention also proposes a kind of voiceprint dress It sets, which includes: to establish unit 101, the first extraction unit 102, determination unit 103, computing unit 104, judging unit 105, judging unit 106.

Wherein, described to establish unit 101, for pre-establishing sound-groove model library.

First extraction unit 102, for obtaining the voice print database to be certified of user's input, and from the vocal print number to be certified Corresponding vocal print feature to be certified is extracted according to middle.

Determination unit 103, for determining target vocal print from the sound-groove model library according to the vocal print feature to be certified Model.

Computing unit 104, for calculating the vocal print similarity between the vocal print feature to be certified and target sound-groove model.

Judging unit 105, for obtaining the vocal print similarity and judging whether the vocal print similarity is greater than default threshold Value.

Judging unit 106 determines the vocal print number to be certified if being greater than the preset threshold for the vocal print similarity Match according to corresponding user identity and target user's identity, and determines that the user identity authentication passes through.

In one embodiment, the computing unit 104, if be specifically used for the vocal print feature to be certified be it is multiple, make The vocal print feature to be certified of the highest priority and each vocal print of the target sound-groove model are calculated with K-means clustering algorithm Vocal print similarity between feature.

In one embodiment, the computing unit 104, if being one also particularly useful for the vocal print feature to be certified, It is calculated between the vocal print feature to be certified and each vocal print feature of the target sound-groove model using K-means clustering algorithm Vocal print similarity.

Referring to Fig. 6, described establish unit 101, comprising:

Acquisition unit 101a, for acquiring at least two training voices for being used for model training.

Pretreatment unit 101b, for being pre-processed to each each trained speech samples.

Second extraction unit 101c, for extracting the vocal print feature of pretreated each trained speech samples.

Subelement 101d is established, each vocal print feature for that will extract carries out gauss hybrid models training, obtains difference Sound-groove model, and the sound-groove model library is formed by the different sound-groove model.

Referring to Fig. 7, the determination unit 103, comprising:

First determines subelement 103a, for determining the quantity of extracted vocal print feature to be certified.

Second determines subelement 103b, if be multiple for extracted vocal print feature to be certified, determines each wait recognize Demonstrate,prove the priority of vocal print feature.

Third determines subelement 103c, for according to the vocal print feature to be certified of highest priority from the sound-groove model library Middle determining target sound-groove model.

Above-mentioned voiceprint authentication apparatus and above-mentioned voiceprint authentication method one-to-one correspondence, specific principle and process and above-mentioned reality It is identical to apply the method, repeats no more.

Above-mentioned voiceprint authentication apparatus can be implemented as a kind of form of computer program, and computer program can be in such as Fig. 8 Shown in run in computer equipment.

Fig. 8 is a kind of structure composition schematic diagram of computer equipment of the present invention.The equipment can be terminal, be also possible to take Business device, wherein terminal can be smart phone, tablet computer, laptop, desktop computer, personal digital assistant and wearing Formula device etc. has the electronic device of communication function and speech voice input function.Server can be independent server, can also be with It is the server cluster of multiple server compositions, server can obtain the vocal print to be certified of user's input by communicating with terminal Data.Referring to Fig. 8, which includes the processor 502 connected by system bus 501, non-volatile memories Jie Matter 503, built-in storage 504 and network interface 505.Wherein, the non-volatile memory medium 503 of the computer equipment 500 can be deposited Storage operating system 5031 and computer program 5032, the computer program 5032 are performed, and processor 502 may make to execute one Kind voiceprint authentication method.The processor 502 of the computer equipment 500 supports entire calculate for providing calculating and control ability The operation of machine equipment 500.The built-in storage 504 is that the operation of the computer program 5032 in non-volatile memory medium 503 mentions For environment, when which is executed by processor, processor 502 may make to execute a kind of voiceprint authentication method.Computer The network interface 505 of equipment 500 is for carrying out network communication.It will be understood by those skilled in the art that structure shown in Fig. 8, The only block diagram of part-structure relevant to application scheme, does not constitute the calculating being applied thereon to application scheme The restriction of machine equipment, specific computer equipment may include more certain than more or fewer components as shown in the figure, or combination Component, or with different component layouts.

Wherein, following operation is realized when the processor 502 executes the computer program:

Pre-establish sound-groove model library；

It is in one embodiment, described to pre-establish sound-groove model library, comprising:

At least two training speech samples of the acquisition for model training；

Each each trained speech samples are pre-processed；

Extract the vocal print feature of pretreated each trained speech samples；

Each vocal print feature of extraction is subjected to gauss hybrid models training, obtains different sound-groove models, and by described Different sound-groove models forms the sound-groove model library.

In one embodiment, described to determine target sound from the sound-groove model library according to the vocal print feature to be certified Line model, comprising:

Determine the quantity of extracted vocal print feature to be certified；

If extracted vocal print feature to be certified is multiple, the priority of each vocal print feature to be certified is determined；

Target sound-groove model is determined from the sound-groove model library according to the vocal print feature to be certified of highest priority.

In one embodiment, the calculating vocal print feature to be certified is similar to the vocal print between target sound-groove model Degree, comprising:

If the vocal print feature to be certified be it is multiple, using K-means clustering algorithm calculate the highest priority to Authenticate the vocal print similarity between vocal print feature and each vocal print feature of the target sound-groove model.

If the vocal print feature to be certified is one, the vocal print feature to be certified is calculated using K-means clustering algorithm Vocal print similarity between each vocal print feature of the target sound-groove model.

It will be understood by those skilled in the art that the embodiment of computer equipment shown in Fig. 8 is not constituted to computer The restriction of equipment specific composition, in other embodiments, computer equipment may include components more more or fewer than diagram, or Person combines certain components or different component layouts.For example, in some embodiments, computer equipment only includes memory And processor, in such embodiments, the structure and function of memory and processor are consistent with embodiment illustrated in fig. 8, herein It repeats no more.

The present invention provides a kind of computer readable storage medium, computer-readable recording medium storage has one or one A above computer program, the one or more computer program can be held by one or more than one processor Row, to perform the steps of

Pre-establish sound-groove model library；

At least two training speech samples of the acquisition for model training；

Each each trained speech samples are pre-processed；

Extract the vocal print feature of pretreated each trained speech samples；

Determine the quantity of extracted vocal print feature to be certified；

Present invention storage medium above-mentioned include: magnetic disk, CD, read-only memory (Read-Only Memory, The various media that can store program code such as ROM).

Unit in all embodiments of the invention can pass through universal integrated circuit, such as CPU (Central Processing Unit, central processing unit), or pass through ASIC (Application Specific Integrated Circuit, specific integrated circuit) it realizes.

Step in voiceprint authentication method of the embodiment of the present invention can according to actual needs the adjustment of carry out sequence, merge and delete Subtract.

Unit in voiceprint authentication apparatus of the embodiment of the present invention can be combined, divided and deleted according to actual needs.

The above description is merely a specific embodiment, but scope of protection of the present invention is not limited thereto, any Those familiar with the art in the technical scope disclosed by the present invention, can readily occur in various equivalent modifications or replace It changes, these modifications or substitutions should be covered by the protection scope of the present invention.Therefore, protection scope of the present invention should be with right It is required that protection scope subject to.

Claims

1. a kind of voiceprint authentication method, which is characterized in that the described method includes:

Pre-establish sound-groove model library；

The voice print database to be certified of user's input is obtained, and extracts corresponding vocal print to be certified from the voice print database to be certified Feature；

If the vocal print similarity is greater than the preset threshold, the corresponding user identity of the voice print database to be certified and mesh are determined Mark user identity matches, and determines that the user identity authentication passes through.

2. the method as described in claim 1, which is characterized in that described to pre-establish sound-groove model library, comprising:

At least two training speech samples of the acquisition for model training；

Each each trained speech samples are pre-processed；

Extract the vocal print feature of pretreated each trained speech samples；

Each vocal print feature of extraction is subjected to gauss hybrid models training, obtains different sound-groove models, and by the difference Sound-groove model form the sound-groove model library.

3. the method as described in claim 1, which is characterized in that it is described according to the vocal print feature to be certified from the vocal print mould Target sound-groove model is determined in type library, comprising:

Determine the quantity of extracted vocal print feature to be certified；

4. method as claimed in claim 3, which is characterized in that described to calculate the vocal print feature to be certified and target vocal print mould Vocal print similarity between type, comprising:

If the vocal print feature to be certified be it is multiple, calculate the to be certified of the highest priority using K-means clustering algorithm Vocal print similarity between vocal print feature and each vocal print feature of the target sound-groove model.

5. the method as described in claim 1, which is characterized in that described to calculate the vocal print feature to be certified and target vocal print mould Vocal print similarity between type, comprising:

If the vocal print feature to be certified is one, the vocal print feature to be certified and institute are calculated using K-means clustering algorithm State the vocal print similarity between each vocal print feature of target sound-groove model.

6. a kind of voiceprint authentication apparatus, which is characterized in that described device includes:

Unit is established, for pre-establishing sound-groove model library；

First extraction unit for obtaining the voice print database to be certified of user's input, and is mentioned from the voice print database to be certified Take corresponding vocal print feature to be certified；

Determination unit, for determining target sound-groove model from the sound-groove model library according to the vocal print feature to be certified；

Judging unit determines that the voice print database to be certified is corresponding if being greater than the preset threshold for the vocal print similarity User identity match with target user's identity, and determine that the user identity authentication passes through.

7. device as claimed in claim 6, which is characterized in that described to establish unit, comprising:

Acquisition unit, for acquiring at least two training speech samples for being used for model training；

Pretreatment unit, for being pre-processed to each each trained speech samples；

Second extraction unit, for extracting the vocal print feature of pretreated each trained speech samples；

Subelement is established, each vocal print feature for that will extract carries out gauss hybrid models training, obtains different vocal print moulds Type, and the sound-groove model library is formed by the different sound-groove model.

8. device as claimed in claim 6, which is characterized in that the determination unit, comprising:

First determines subelement, for determining the quantity of extracted vocal print feature to be certified；

Second determines subelement, if be multiple for extracted vocal print feature to be certified, determines that each vocal print to be certified is special The priority of sign；

Third determines subelement, and mesh is determined from the sound-groove model library for the vocal print feature to be certified according to highest priority Mark sound-groove model.

9. a kind of computer equipment, including memory, processor and it is stored on the memory and can be on the processor The computer program of operation, which is characterized in that the processor realizes that claim 1-5 such as appoints when executing the computer program Voiceprint authentication method described in one.

10. a kind of computer readable storage medium, which is characterized in that the computer-readable recording medium storage have one or More than one computer program, the one or more computer program can be by one or more than one processors It executes, to realize voiceprint authentication method as described in any one in claim 1-5.