CN106971729A - A kind of method and system that Application on Voiceprint Recognition speed is improved based on sound characteristic scope - Google Patents
A kind of method and system that Application on Voiceprint Recognition speed is improved based on sound characteristic scope Download PDFInfo
- Publication number
- CN106971729A CN106971729A CN201610025132.9A CN201610025132A CN106971729A CN 106971729 A CN106971729 A CN 106971729A CN 201610025132 A CN201610025132 A CN 201610025132A CN 106971729 A CN106971729 A CN 106971729A
- Authority
- CN
- China
- Prior art keywords
- acoustic model
- voice signal
- sound characteristic
- sound
- characteristic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 29
- 238000012549 training Methods 0.000 claims abstract description 25
- 238000000605 extraction Methods 0.000 claims description 14
- 238000001228 spectrum Methods 0.000 claims description 12
- 239000013598 vector Substances 0.000 claims description 12
- 238000012545 processing Methods 0.000 claims description 7
- 230000003595 spectral effect Effects 0.000 claims description 6
- 230000008447 perception Effects 0.000 claims description 4
- 238000004088 simulation Methods 0.000 claims description 4
- 238000009432 framing Methods 0.000 claims description 3
- 238000005259 measurement Methods 0.000 claims description 3
- 230000009466 transformation Effects 0.000 claims description 3
- 230000000875 corresponding effect Effects 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 4
- 230000009286 beneficial effect Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 230000003044 adaptive effect Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 230000035479 physiological effects, processes and functions Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000009897 systematic effect Effects 0.000 description 1
- 230000001755 vocal effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/02—Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/04—Training, enrolment or model building
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/06—Decision making techniques; Pattern matching strategies
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Business, Economics & Management (AREA)
- Game Theory and Decision Science (AREA)
- Toys (AREA)
Abstract
The invention belongs to field of voice signal, more particularly to a kind of method and system that Application on Voiceprint Recognition speed is improved based on sound characteristic, applied to domestic robot, including:S1:Gather voice signal;S2:Voice signal is pre-processed;S3:Speech characteristic parameter is extracted from pretreated voice signal;S4:Acoustic model is set up for each kinsfolk;S5:Training in advance obtains the sound characteristic scope of correspondence children, adult and the elderly, and acoustic model is divided into by the first acoustic model and the second acoustic model according to sound characteristic scope, wherein, first acoustic model includes the training sentence in the range of the sound characteristic of correspondence adult, second acoustic model includes the training sentence in the range of the sound characteristic of correspondence children and the elderly, and the first acoustic model is loaded onto in caching when being powered;S6:Pattern match is carried out to voice signal to be measured according to the first acoustic model and the second acoustic model, recognition result is obtained.
Description
Technical field
The invention belongs to field of voice signal, more particularly to a kind of method and system that Application on Voiceprint Recognition speed is improved based on sound characteristic scope.
Background technology
Household service robot is one of current most active field of forward position high-tech research, it can complete the services being beneficial to man, housework, amusement and leisure, education, security monitoring service are such as provided, possess extensive potential customers colony and market, the existing widely used speech recognition technology of household service robot realizes man-machine interaction, robot is allowed to understand human speech, to perform corresponding actions, but, existing robot there is no method to accurately identify speaker's identity, it is impossible to meet the demand of user individual.The sound groove recognition technology in e occurred with the development of computer technology and digital signal processing theory, by from one section of voice of speaker, extract and reflect the human physiology of speaking, the speech characteristic parameter of psychology, by carrying out analysis modeling and pattern match to speech characteristic parameter, come the purpose realized identification or confirm unknown speaker identity.But, existing Voiceprint Recognition System is often designed for a specific application scenarios, when systematic difference scene changes, adaptive ability is not strong, it can not realize and man-machine freely exchange, and because the speed of Application on Voiceprint Recognition is excessively slow, poor user experience is caused, this is that those skilled in the art do not expect to see.
The content of the invention
To solve above technical problem there is provided a kind of system that Application on Voiceprint Recognition speed is improved based on sound characteristic scope, the defect of existing recognition methods is solved.
Concrete technical scheme is as follows:
A kind of method that Application on Voiceprint Recognition speed is improved based on sound characteristic scope, wherein, applied to domestic robot, specific works step includes:
S1:Gather voice signal;
S2:The voice signal is pre-processed;
S3:Speech characteristic parameter is extracted from the pretreated voice signal, the Equations of The Second Kind characteristic parameter that the first kind characteristic parameter and simulation human ear that the speech characteristic parameter is obtained including linear prediction are extracted to the perception characteristic of sound frequency;
S4:A code book is set up for each kinsfolk and is stored in sound template in speech database as the kinsfolk, and all code books of the kinsfolk constitute an acoustic model;
S5:Training in advance obtains the sound characteristic scope of correspondence children, adult and the elderly, and the acoustic model is divided into by the first acoustic model and the second acoustic model according to sound characteristic scope, wherein, first acoustic model includes the training sentence in the range of the sound characteristic of correspondence adult, second acoustic model includes the training sentence in the range of the sound characteristic of correspondence children and the elderly, and first acoustic model is loaded onto in caching when being powered;
S6:Pattern match is carried out to voice signal to be measured according to first acoustic model and the second acoustic model, recognition result is obtained.
Include successively in the above-mentioned method that Application on Voiceprint Recognition speed is improved based on sound characteristic scope, the step S2, the step of the pretreatment:
Step S21, is sampled and is quantified to obtain audio digital signals to the pretreated voice signal;
Step S22, the audio digital signals are by a wave filter group to lift the radio-frequency component of the data signal;
Step S23, the voice signal obtained to step S22 carries out framing and adding window, obtains the voice signal after adding window.
The first kind characteristic parameter is extracted in the above-mentioned method that Application on Voiceprint Recognition speed is improved based on sound characteristic scope, the step S3 for linear predictor coefficient, extraction step is as follows:
Step S31a, defines Short Time Speech signal and error signal;
Step S32a, calculates the error sum of squares of the Short Time Speech signal and the error signal;
Step S33a, differentiates to the error sum of squares, and solves the equation group acquisition first kind characteristic parameter.
The step of extracting the Equations of The Second Kind characteristic parameter in the above-mentioned method that Application on Voiceprint Recognition speed is improved based on sound characteristic scope, the step S3 is included:
Step S31b, carries out Fourier transformation to the pretreated voice signal and obtains linear spectral;
Step S32b, corresponding Mel frequency spectrum is obtained to the linear spectral by a triangular band pass wave filter group;
Step S33b, calculates the log spectrum of the Mel frequency spectrum;
Step S34b, carries out discrete cosine transform to the log spectrum and obtains Equations of The Second Kind characteristic parameter.
The above-mentioned method that Application on Voiceprint Recognition speed is improved based on sound characteristic scope, the step S4's comprises the following steps that:
Step S41, N number of characteristic vector is extracted from the voice signal, and the characteristic vector sort out by clustering procedure to obtain M code book;
Step S42, obtains the corresponding codebook vectors of each class;
Step S43, the set for setting up the codebook vectors of each kinsfolk constitutes acoustic model.
The above-mentioned method that Application on Voiceprint Recognition speed is improved based on sound characteristic scope, the step S6 is specific as follows,
Step S61, voice signal to be identified is matched with first acoustic model and the second acoustic model as similitude successively, and is estimated according to weighted euclidean distance and judged;
Step S62, chooses appropriately distance measurement and is used as threshold value;
Step S63, meets the result in the range of threshold value as recognition result.
Also provide, a kind of system that Application on Voiceprint Recognition speed is improved based on sound characteristic scope, including
Voice input module, for capturing voice signal;
Pretreatment module, is connected with the voice input module, for being pre-processed to the voice signal;
Fisrt feature parameter extraction module, is connected with the pretreatment module, for obtaining the fisrt feature parameter in the voice signal;
Second feature parameter extraction module, is connected with the pretreatment module, for obtaining the second feature parameter in the voice signal;
Training module, is connected, the sound template for setting up each kinsfolk with the fisrt feature parameter extraction module and the second feature parameter extraction module, and all code books of the kinsfolk constitute an acoustic model;
Acquisition processing module, training in advance obtains the sound characteristic scope of correspondence children, adult and the elderly, and the acoustic model is divided into by the first acoustic model and the second acoustic model according to sound characteristic scope, first acoustic model includes the training sentence in the range of the sound characteristic of correspondence adult, second acoustic model includes the training sentence in the range of the sound characteristic of correspondence children and the elderly, and first acoustic model is loaded onto in caching when being powered;
Template matches module, is connected with the acquisition processing module, carries out pattern match to voice signal to be measured according to first acoustic model and the second acoustic model, obtains recognition result.
Beneficial effect:Above technical scheme can adaptively realize Application on Voiceprint Recognition, and effectively increase the man-machine communication under the speed of Application on Voiceprint Recognition, reply different application scene, be conducive to lifting Consumer's Experience.
Brief description of the drawings
Fig. 1 is flow chart of the method for the present invention;
Fig. 2 is the method flow diagram of the step 2 of the present invention;
Fig. 3 is system structure diagram of the invention.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is clearly and completely described, it is clear that described embodiment is only a part of embodiment of the invention, rather than whole embodiments.Based on the embodiment in the present invention, the every other embodiment that those of ordinary skill in the art are obtained on the premise of creative work is not made belongs to the scope of protection of the invention.
It should be noted that in the case where not conflicting, the embodiment in the present invention and the feature in embodiment can be mutually combined.
The invention will be further described with specific embodiment below in conjunction with the accompanying drawings, but not as limiting to the invention.
Reference picture 1, a kind of method that Application on Voiceprint Recognition speed is improved based on sound characteristic scope, wherein, applied to domestic robot, specific works step includes:
S1:Gather voice signal;
S2:Voice signal is pre-processed;
S3:Speech characteristic parameter is extracted from pretreated voice signal, the Equations of The Second Kind characteristic parameter that the first kind characteristic parameter and simulation human ear that speech characteristic parameter is obtained including linear prediction are extracted to the perception characteristic of sound frequency;
S4:A code book is set up for each kinsfolk and is stored in sound template in speech database as kinsfolk, and all code books of kinsfolk constitute an acoustic model;
S5:Training in advance obtains the sound characteristic scope (such as frequecy characteristic) of correspondence children, adult and the elderly, and acoustic model is divided into by the first acoustic model and the second acoustic model according to sound characteristic scope, first acoustic model includes the training sentence in the range of the sound characteristic of correspondence adult, second acoustic model includes the training sentence in the range of the sound characteristic of correspondence children and the elderly, and the first acoustic model is loaded onto in caching when being powered, the second acoustic model is remained stored in speech database;
S6:Pattern match is carried out to voice signal to be measured according to the first acoustic model and the second acoustic model, recognition result is obtained.
Everyone can cause articulation type and custom of speaking different due to the differences of Physiological of vocal organs, the Equations of The Second Kind characteristic parameter that the present invention is extracted with reference to the first kind characteristic parameter and simulation human ear that linear prediction is obtained to the perception characteristic of sound frequency, obtain acoustic model, to improve existing Application on Voiceprint Recognition effect, Consumer's Experience is lifted.
Include successively in the above-mentioned method that Application on Voiceprint Recognition speed is improved based on sound characteristic scope, reference picture 2, step S2, the step of pretreatment:
Step S21, is sampled to pretreated voice signal and quantifies to obtain audio digital signals;
Step S22, audio digital signals are by a wave filter group to lift the radio-frequency component of data signal;
Step S23, the voice signal obtained to step S22 carries out framing and adding window, obtains the voice signal after adding window.
It can be linear predictor coefficient that first kind characteristic parameter is extracted in the above-mentioned method that Application on Voiceprint Recognition speed is improved based on sound characteristic scope, step S3, and its extraction step is as follows:
Step S31a, defines Short Time Speech signal and error signal;
Step S32a, calculates the error sum of squares of Short Time Speech signal and error signal;
Step S33a, differentiates to error sum of squares, and solves equation group acquisition first kind characteristic parameter.
Due to having correlation between voice adjacent spots, the mode of linear prediction can be utilized, present or following sample value is predicted according to past voice sample value, i.e., using several voices sampling in the past or their linear combination, to approach the sample value that voice is present.
The step of Equations of The Second Kind characteristic parameter being extracted in the above-mentioned method that Application on Voiceprint Recognition speed is improved based on sound characteristic scope, step S3, including:
Step S31b, carries out Fourier transformation to pretreated voice signal and obtains linear spectral;
Step S32b, corresponding Mel frequency spectrum is obtained to linear spectral by a triangular band pass wave filter group;
Step S33b, calculates the log spectrum of Mel frequency spectrum;
Step S34b, carries out discrete cosine transform to log spectrum and obtains Equations of The Second Kind characteristic parameter.
The above-mentioned method that Application on Voiceprint Recognition speed is improved based on sound characteristic scope, step S4's comprises the following steps that:
Step S41, N number of characteristic vector is extracted from first kind characteristic parameter and Equations of The Second Kind characteristic parameter, and characteristic vector sort out by clustering procedure to obtain M code book;
Step S42, obtains the corresponding codebook vectors of each class;
Step S43, the set for setting up the codebook vectors of each kinsfolk constitutes acoustic model.
The above-mentioned method that Application on Voiceprint Recognition speed is improved based on sound characteristic scope, step S6 is specific as follows,
Step S61, voice signal to be identified is matched with the first acoustic model and the second acoustic model as similitude successively, and is estimated according to weighted euclidean distance and judged;
Step S62, chooses appropriately distance measurement and is used as threshold value;
Step S63, meets the result in the range of threshold value as recognition result.
Also provide, a kind of system that Application on Voiceprint Recognition speed is improved based on sound characteristic scope, reference picture 3, including
Voice input module 1, for capturing voice signal;
Pretreatment module 2, is connected with voice input module 1, for being pre-processed to voice signal;
Fisrt feature parameter extraction module 3, is connected with pretreatment module 2, for obtaining the fisrt feature parameter in voice signal;
Second feature parameter extraction module 4, is connected with pretreatment module 2, for obtaining the second feature parameter in voice signal;
Training module 5, is connected with fisrt feature parameter extraction module and second feature parameter extraction module, the sound template for setting up each kinsfolk, and all code books of kinsfolk constitute an acoustic model;
Acquisition processing module 6, it is connected with training module 5, training in advance obtains the sound characteristic scope of correspondence children, adult and the elderly, and acoustic model is divided into by the first acoustic model and the second acoustic model according to sound characteristic scope, first acoustic model includes the training sentence in the range of the sound characteristic of correspondence adult, second acoustic model includes the training sentence in the range of the sound characteristic of correspondence children and the elderly, and the first acoustic model is loaded onto in caching when being powered;
Template matches module 7, is connected with acquisition processing module 6, carries out pattern match to voice signal to be measured according to the first acoustic model and the second acoustic model successively, obtains recognition result.
It these are only preferred embodiments of the present invention; not thereby embodiments of the present invention and protection domain are limited; to those skilled in the art; the scheme obtained by all utilization description of the invention and the equivalent substitution made by diagramatic content and obvious change should be can appreciate that, should be included in protection scope of the present invention.
Claims (7)
1. a kind of method that Application on Voiceprint Recognition speed is improved based on sound characteristic scope, it is characterised in that application
In domestic robot, specific works step includes:
S1:Gather voice signal;
S2:The voice signal is pre-processed;
S3:Speech characteristic parameter, the speech characteristic parameter are extracted from the pretreated voice signal
The first kind characteristic parameter and simulation human ear obtained including linear prediction is carried to the perception characteristic of sound frequency
The Equations of The Second Kind characteristic parameter taken;
S4:For each kinsfolk set up a code book be stored in speech database as the family into
The sound template of member, all code books of the kinsfolk constitute an acoustic model;
S5:Training in advance obtains the sound characteristic scope of correspondence children, adult and the elderly, and root
The acoustic model is divided into the first acoustic model and the second acoustic model according to sound characteristic scope, it is described
First acoustic model includes the training sentence in the range of the sound characteristic of correspondence adult, second acoustics
Model includes the training sentence in the range of the sound characteristic of correspondence children and the elderly, and when being powered by institute
The first acoustic model is stated to be loaded onto in caching;
S6:Row mode is entered to voice signal to be measured according to first acoustic model and the second acoustic model
Match somebody with somebody, obtain recognition result.
2. the method according to claim 1 that Application on Voiceprint Recognition speed is improved based on sound characteristic scope,
Include successively characterized in that, in the step S2, the step of the pretreatment:
Step S21, is sampled to the pretreated voice signal and quantifies to obtain digital speech letter
Number;
Step S22, the audio digital signals are by a wave filter group to lift the high frequency of the data signal
Composition;
Step S23, the voice signal obtained to step S22 carries out framing and adding window, obtains the language after adding window
Message number.
3. the method according to claim 1 that Application on Voiceprint Recognition speed is improved based on sound characteristic scope,
Characterized in that, extracting the first kind characteristic parameter in the step S3 for linear predictor coefficient, carry
Take step as follows:
Step S31a, defines Short Time Speech signal and error signal;
Step S32a, calculates the error sum of squares of the Short Time Speech signal and the error signal;
Step S33a, differentiates to the error sum of squares, and it is special to solve the equation group acquisition first kind
Levy parameter.
4. the method according to claim 1 that Application on Voiceprint Recognition speed is improved based on sound characteristic scope,
Characterized in that, the step of extracting the Equations of The Second Kind characteristic parameter in the step S3 includes:
Step S31b, carries out Fourier transformation to the pretreated voice signal and obtains linear spectral;
Step S32b, corresponding Mel is obtained to the linear spectral by a triangular band pass wave filter group
Frequency spectrum;
Step S33b, calculates the log spectrum of the Mel frequency spectrum;
Step S34b, carries out discrete cosine transform to the log spectrum and obtains Equations of The Second Kind characteristic parameter.
5. the method according to claim 1 that Application on Voiceprint Recognition speed is improved based on sound characteristic scope,
Characterized in that, the step S4's comprises the following steps that:
Step S41, N number of feature is extracted from the first kind characteristic parameter and the Equations of The Second Kind characteristic parameter
Vector, to the characteristic vector sort out obtaining M code book by clustering procedure;
Step S42, obtains the corresponding codebook vectors of each class;
Step S43, the set for setting up the codebook vectors of each kinsfolk constitutes acoustic model.
6. the method according to claim 1 that Application on Voiceprint Recognition speed is improved based on sound characteristic scope,
Characterized in that, the step S6 is specific as follows,
Step S61, by voice signal to be identified successively with first acoustic model and second acoustics
Model makees similitude matching, and is estimated according to weighted euclidean distance and judged;
Step S62, chooses appropriately distance measurement and is used as threshold value;
Step S63, meets the result in the range of threshold value as recognition result.
7. a kind of system that Application on Voiceprint Recognition speed is improved based on sound characteristic scope, it is characterised in that including
Voice input module, for capturing voice signal;
Pretreatment module, is connected with the voice input module, for being located in advance to the voice signal
Reason;
Fisrt feature parameter extraction module, is connected with the pretreatment module, for obtaining the voice letter
Fisrt feature parameter in number;
Second feature parameter extraction module, is connected with the pretreatment module, for obtaining the voice letter
Second feature parameter in number;
Training module, connects with the fisrt feature parameter extraction module and the second feature parameter extraction module
Connect, the sound template for setting up each kinsfolk, all code books of the kinsfolk constitute a sound
Learn model;
Acquisition processing module, training in advance obtains the sound characteristic of correspondence children, adult and the elderly
Scope, and the acoustic model is divided into by the first acoustic model and the second acoustics according to sound characteristic scope
Model, first acoustic model includes the training sentence in the range of the sound characteristic of correspondence adult, institute
State the second acoustic model include correspondence children and the elderly sound characteristic in the range of training sentence, and
First acoustic model is loaded onto in caching during energization;
Template matches module, is connected with the acquisition processing module, according to first acoustic model and
Two acoustic models carry out pattern match to voice signal to be measured, obtain recognition result.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610025132.9A CN106971729A (en) | 2016-01-14 | 2016-01-14 | A kind of method and system that Application on Voiceprint Recognition speed is improved based on sound characteristic scope |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610025132.9A CN106971729A (en) | 2016-01-14 | 2016-01-14 | A kind of method and system that Application on Voiceprint Recognition speed is improved based on sound characteristic scope |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106971729A true CN106971729A (en) | 2017-07-21 |
Family
ID=59334348
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610025132.9A Pending CN106971729A (en) | 2016-01-14 | 2016-01-14 | A kind of method and system that Application on Voiceprint Recognition speed is improved based on sound characteristic scope |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106971729A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107978311A (en) * | 2017-11-24 | 2018-05-01 | 腾讯科技(深圳)有限公司 | A kind of voice data processing method, device and interactive voice equipment |
CN114038450A (en) * | 2021-12-06 | 2022-02-11 | 深圳市北科瑞声科技股份有限公司 | Dialect identification method, dialect identification device, dialect identification equipment and storage medium |
Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1787077A (en) * | 2005-12-13 | 2006-06-14 | 浙江大学 | Method for fast identifying speeking person based on comparing ordinal number of archor model space projection |
CN101350196A (en) * | 2007-07-19 | 2009-01-21 | 丁玉国 | On-chip system for confirming role related talker identification and confirming method thereof |
CN101661754A (en) * | 2003-10-03 | 2010-03-03 | 旭化成株式会社 | Data processing unit, method and control program |
CN101944359A (en) * | 2010-07-23 | 2011-01-12 | 杭州网豆数字技术有限公司 | Voice recognition method facing specific crowd |
CN102099853A (en) * | 2009-03-16 | 2011-06-15 | 富士通株式会社 | Apparatus and method for recognizing speech emotion change |
CN102509547A (en) * | 2011-12-29 | 2012-06-20 | 辽宁工业大学 | Method and system for voiceprint recognition based on vector quantization based |
CN102800316A (en) * | 2012-08-30 | 2012-11-28 | 重庆大学 | Optimal codebook design method for voiceprint recognition system based on nerve network |
CN102930864A (en) * | 2012-11-26 | 2013-02-13 | 江苏物联网研究发展中心 | Sound networking voice information keyword mining system based on child nodes |
CN104185868A (en) * | 2012-01-24 | 2014-12-03 | 澳尔亚有限公司 | Voice authentication and speech recognition system and method |
CN104392718A (en) * | 2014-11-26 | 2015-03-04 | 河海大学 | Robust voice recognition method based on acoustic model array |
CN104835498A (en) * | 2015-05-25 | 2015-08-12 | 重庆大学 | Voiceprint identification method based on multi-type combination characteristic parameters |
CN105006230A (en) * | 2015-06-10 | 2015-10-28 | 合肥工业大学 | Voice sensitive information detecting and filtering method based on unspecified people |
-
2016
- 2016-01-14 CN CN201610025132.9A patent/CN106971729A/en active Pending
Patent Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101661754A (en) * | 2003-10-03 | 2010-03-03 | 旭化成株式会社 | Data processing unit, method and control program |
CN1787077A (en) * | 2005-12-13 | 2006-06-14 | 浙江大学 | Method for fast identifying speeking person based on comparing ordinal number of archor model space projection |
CN101350196A (en) * | 2007-07-19 | 2009-01-21 | 丁玉国 | On-chip system for confirming role related talker identification and confirming method thereof |
CN102099853A (en) * | 2009-03-16 | 2011-06-15 | 富士通株式会社 | Apparatus and method for recognizing speech emotion change |
CN101944359A (en) * | 2010-07-23 | 2011-01-12 | 杭州网豆数字技术有限公司 | Voice recognition method facing specific crowd |
CN102509547A (en) * | 2011-12-29 | 2012-06-20 | 辽宁工业大学 | Method and system for voiceprint recognition based on vector quantization based |
CN104185868A (en) * | 2012-01-24 | 2014-12-03 | 澳尔亚有限公司 | Voice authentication and speech recognition system and method |
CN102800316A (en) * | 2012-08-30 | 2012-11-28 | 重庆大学 | Optimal codebook design method for voiceprint recognition system based on nerve network |
CN102930864A (en) * | 2012-11-26 | 2013-02-13 | 江苏物联网研究发展中心 | Sound networking voice information keyword mining system based on child nodes |
CN104392718A (en) * | 2014-11-26 | 2015-03-04 | 河海大学 | Robust voice recognition method based on acoustic model array |
CN104835498A (en) * | 2015-05-25 | 2015-08-12 | 重庆大学 | Voiceprint identification method based on multi-type combination characteristic parameters |
CN105006230A (en) * | 2015-06-10 | 2015-10-28 | 合肥工业大学 | Voice sensitive information detecting and filtering method based on unspecified people |
Non-Patent Citations (1)
Title |
---|
谷志新: "基于声纹信息的身份认证模式与算法的研究", 《中国优秀硕士学位论文全文数据库,信息科技辑》 * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107978311A (en) * | 2017-11-24 | 2018-05-01 | 腾讯科技(深圳)有限公司 | A kind of voice data processing method, device and interactive voice equipment |
CN107978311B (en) * | 2017-11-24 | 2020-08-25 | 腾讯科技(深圳)有限公司 | Voice data processing method and device and voice interaction equipment |
CN114038450A (en) * | 2021-12-06 | 2022-02-11 | 深圳市北科瑞声科技股份有限公司 | Dialect identification method, dialect identification device, dialect identification equipment and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105611477B (en) | The voice enhancement algorithm that depth and range neutral net are combined in digital deaf-aid | |
CN109215665A (en) | A kind of method for recognizing sound-groove based on 3D convolutional neural networks | |
CN110415701A (en) | The recognition methods of lip reading and its device | |
CN111785285A (en) | Voiceprint recognition method for home multi-feature parameter fusion | |
CN111489763B (en) | GMM model-based speaker recognition self-adaption method in complex environment | |
CN102968990A (en) | Speaker identifying method and system | |
CN111986679A (en) | Speaker confirmation method, system and storage medium for responding to complex acoustic environment | |
CN103021405A (en) | Voice signal dynamic feature extraction method based on MUSIC and modulation spectrum filter | |
CN112767927A (en) | Method, device, terminal and storage medium for extracting voice features | |
CN110136726A (en) | A kind of estimation method, device, system and the storage medium of voice gender | |
Chauhan et al. | Speech to text converter using Gaussian Mixture Model (GMM) | |
CN105913842A (en) | Method for waking up mobile phone by custom voice | |
Nandyal et al. | MFCC based text-dependent speaker identification using BPNN | |
Hou et al. | Domain adversarial training for speech enhancement | |
CN105679323B (en) | A kind of number discovery method and system | |
CN106796803A (en) | Method and apparatus for separating speech data with background data in voice communication | |
CN106971712A (en) | A kind of adaptive rapid voiceprint recognition methods and system | |
Hepsiba et al. | Enhancement of single channel speech quality and intelligibility in multiple noise conditions using wiener filter and deep CNN | |
CN106971729A (en) | A kind of method and system that Application on Voiceprint Recognition speed is improved based on sound characteristic scope | |
CN106971735B (en) | A kind of method and system regularly updating the Application on Voiceprint Recognition of training sentence in caching | |
CN106875944A (en) | A kind of system of Voice command home intelligent terminal | |
CN106981287A (en) | A kind of method and system for improving Application on Voiceprint Recognition speed | |
CN118230722B (en) | Intelligent voice recognition method and system based on AI | |
Wang et al. | Application of speech recognition technology in IoT smart home | |
CN114758668A (en) | Training method of voice enhancement model and voice enhancement method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20170721 |