CN105719650A

CN105719650A - Speech recognition method and system

Info

Publication number: CN105719650A
Application number: CN201610065010.2A
Authority: CN
Inventors: 谷树森
Original assignee: Shenzhen Erwu Technology Co Ltd
Current assignee: Shenzhen Erwu Technology Co Ltd
Priority date: 2016-01-30
Filing date: 2016-01-30
Publication date: 2016-06-29

Abstract

The invention discloses a speech recognition method and system, and aims at overcoming the disadvantage that a present speech recognition system cannot be applied to intelligent hardware in large scale. The method comprises the following steps that speech data is obtained; a command word recognition module is used to recognize the speech data, if the command word recognition module is capable of recognizing the speech data, a speech data result recognized by the command word recognition module is output, and if not, the speech data is input to a dictation recognition module; and the dictation recognition module recognizes the input speech data, and obtains a final speech data result. According to the speech recognition method and system, a command word is recognized from the input speech at first, dictation recognition is carried out if the speech data is not recognized via command word recognition, and the recognition result is provided; and the scale that the speech recognition system is applied to the intelligent hardware is expanded to certain extent.

Description

A kind of method and system of speech recognition

Technical field

The present invention relates to field of speech recognition, particularly to the method and system of a kind of speech recognition.

Background technology

Speech recognition technology is exactly allow machine by identifying and voice signal is changed into the technology of corresponding word or order by understanding process.Current existing speech recognition system includes dictation and identifies and order word identification, and both technology all existing defects.The deficiency that dictation identifies is in that to require of a relatively high to computer hardware and communication network, and response time is long；Although order word identification need not network still its identification content be restricted, it is impossible to meets a large amount of content aware demand of needs, therefore, also cannot large-scale application speech recognition on current Intelligent hardware.

Summary of the invention

In order to overcome prior art speech recognition system can not the deficiency of large-scale application Intelligent hardware, it is an object of the invention to provide the method and system of a kind of speech recognition being easy to speech recognition system large-scale application.

For solving the problems referred to above, the technical solution adopted in the present invention is as follows: a kind of method providing speech recognition, comprises the following steps:

S101: obtain speech data；

S102: by speech data described in order word identification module identification, if described order word identification module identifies described speech data, then exports the speech data result of described order word identification module identification；If it is not, then input to dictating identification module；

S103: by dictating input described in identification module identification to the speech data dictating identification module, and obtain final speech data result.

Preferably, step S102 comprises the following steps:

Ripple storehouse is built according to order word；

The ripple of the speech data of acquisition being compared with the ripple in ripple storehouse, if having, then exporting the speech data result of order word identification module identification；If nothing, then input to dictating identification module.

Preferably, step S103 comprises the following steps:

From described input to characteristic information extraction the speech data of dictation identification module；

Utilize the speech data result that hidden Markov model processing feature information acquisition is final.

Preferably, described characteristic information is MFCC or PLP.

There is provided the system of a kind of speech recognition, it is characterised in that including acquisition module, order word identification module and dictation identification module, described order word identification module connects described acquisition module, and described dictation identification module connects described order word identification module；Wherein,

Described acquisition module is used for obtaining speech data；

Described order word identification module is used for identifying described speech data, if described order word identification module identifies described speech data, then exports the speech data result of described order word identification module identification；If it is not, then input to described dictation identification module；

Described dictation identification module is for identifying the speech data that described order word identification module inputs, and obtains final speech data result.

Preferably, described order word identification module includes building module and comparing module, described structure module is for building ripple storehouse according to order word, described comparing module is for comparing the ripple of the speech data of acquisition with the ripple in ripple storehouse, if having, then export the speech data result of described order word identification module identification；If it is not, then input to dictating identification module.

Preferably, described dictation identification module includes extraction module and model module, described extraction module is for from described input to characteristic information extraction the speech data of dictation identification module, and described model module is used for the speech data result utilizing hidden Markov model processing feature information acquisition final.

Preferably, described dictation identification module is HTK sound identification module.

Compared to existing technology, the beneficial effects of the present invention is:

First the method and system of this kind of speech recognition by carrying out order word identification after phonetic entry, if order word identifies recognition result, identify, if unidentified go out recognition result; carry out dictation identify, finally provide recognition result, network need not can be relied on not by when identifying content constraints by speech recognition technology in hardware configuration that need not be too high, it still is able to have higher accuracy of identification, meanwhile, speech recognition system application scale on Intelligent hardware is also expanded to a certain extent.

Accompanying drawing explanation

Fig. 1 is the flow chart of the method for a kind of speech recognition of the embodiment of the present invention；

Fig. 2 is the function structure chart of the system of a kind of speech recognition of the embodiment of the present invention.

Identifier declaration in figure:

1001, acquisition module；1002, order word identification module；1003, dictation identification module.

Detailed description of the invention

Below in conjunction with the drawings and specific embodiments, the present invention is described in further detail.

Referring to Fig. 1, Fig. 1 and illustrate the flow chart of a kind of audio recognition method of embodiment provided by the invention, the method for this speech recognition comprises the following steps:

S101: obtain speech data；

Specifically, step S102 comprises the following steps:

Ripple storehouse is built according to order word；

Specifically, step S103 comprises the following steps:

Alternatively, features described above information can be MFCC (Mel-FrequencyCepstralCoefficients, Mel frequency cepstral coefficient) or PLP (PerceptualLinearPrediction, perception linear predictor coefficient).

The embodiment one identification system of a kind of offer of the present invention, it includes acquisition module 1001, order word identification module 1002 and dictation identification module 1003, described order word identification module 1002 connects acquisition module 1001, and described dictation identification module 1003 connects described order word identification module 1002；Wherein,

Described acquisition module 1001 is used for obtaining speech data；

Described order word identification module 1002 is used for identifying described speech data, if described order word identification module 1002 identifies described speech data, then exports the speech data result that described order word identification module 1002 identifies；If it is not, then input to described dictation identification module 1003；

Described dictation identification module 1003 is for identifying the speech data that described order word identification module 1002 inputs, and obtains final speech data result.

Order word identification module 1002 includes building module and comparing module, wherein, builds module for building ripple storehouse according to order word；Comparing module is for comparing the ripple of the speech data of acquisition with the ripple in ripple storehouse, if having, then exports the speech data result that described order word identification module 1002 identifies, if nothing, then input extremely dictation identification module 1003.

Dictation identification module 1003 includes extraction module and model module, and wherein, extraction module is for from described input to characteristic information extraction the speech data of dictation identification module 1003；Model module is used for the speech data result utilizing hidden Markov model processing feature information acquisition final.

Preferably, dictation identification module 1003 is HTK sound identification module.

Compared with prior art, the method have the advantages that

First the method and system of this kind of speech recognition by carrying out order word identification after phonetic entry, if order word identifies result, identify, if unidentified go out recognition result; carry out dictation identify, finally provide recognition result, network need not can be relied on not by when identifying content constraints by speech recognition technology in hardware configuration that need not be too high, it still is able to have higher accuracy of identification, meanwhile, speech recognition system application scale on Intelligent hardware is also expanded to a certain extent.

Above-mentioned embodiment is only the preferred embodiment of the present invention, it is impossible to limit the scope of protection of the invention with this, and the change of any unsubstantiality that those skilled in the art does on the basis of the present invention and replacement belong to present invention scope required for protection.

Claims

1. the method for a speech recognition, it is characterised in that comprise the following steps:

S101: obtain speech data；

2. the method for speech recognition as claimed in claim 1, it is characterised in that step S102 comprises the following steps:

Ripple storehouse is built according to order word；

3. the method for speech recognition as claimed in claim 1, it is characterised in that step S103 comprises the following steps:

4. the method for speech recognition as claimed in claim 3, it is characterised in that described characteristic information is MFCC or PLP.

5. the system of a speech recognition, it is characterised in that including acquisition module, order word identification module and dictation identification module, described order word identification module connects described acquisition module, and described dictation identification module connects described order word identification module；Wherein,

Described acquisition module is used for obtaining speech data；

6. the system of speech recognition as claimed in claim 5, it is characterized in that, described order word identification module includes building module and comparing module, described structure module is for building ripple storehouse according to order word, described comparing module is for comparing the ripple of the speech data of acquisition with the ripple in ripple storehouse, if having, then export the speech data result of described order word identification module identification；If it is not, then input to dictating identification module.

7. the system of speech recognition as claimed in claim 5, it is characterized in that, described dictation identification module includes extraction module and model module, described extraction module is for from described input to characteristic information extraction the speech data of dictation identification module, and described model module is used for the speech data result utilizing hidden Markov model processing feature information acquisition final.

8. the system of speech recognition as claimed in claim 5, it is characterised in that described dictation identification module is HTK sound identification module.