CN105261356A - Voice recognition system and method - Google Patents

Publication number
CN105261356A
Authority
CN
China
Prior art keywords
module
audio information
standard audio
voice information
match
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510728467.2A
Other languages
Chinese (zh)
Inventor
范浩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
GUILIN XINTONG TECHNOLOGY Co Ltd
Original Assignee
GUILIN XINTONG TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2015-10-30
Filing date
2015-10-30
Publication date
2016-01-20
Application filed by GUILIN XINTONG TECHNOLOGY Co Ltd
Priority to CN201510728467.2A
Publication of CN105261356A

Abstract

The invention relates to a voice recognition system and method. The system comprises an acquisition module, a conversion module, an extraction module, a matching module, an execution module, and a voice database. The acquisition module collects the voice information to be recognized. The conversion module converts the voice information to be recognized into first standard audio information that the extraction module can recognize. The extraction module analyzes the first standard audio information and extracts a keyword from it. The matching module retrieves a target command word, matches it against the keyword in the first standard audio information, and, if the match succeeds, sends the corresponding target command word to the execution module. The execution module receives the target command word and executes the corresponding target action. With this system and method, the recognition rate and recognition accuracy of a voice signal are improved, the corresponding target action is executed through the execution module, and voice control is automated. While the quality of voice recognition is maintained, user experience and operating efficiency are improved.

Description

Speech recognition system and method
Technical field
The present invention relates to the technical field of speech recognition, and in particular to a speech recognition system and method.
Background art
Speech recognition technology enables a machine, through recognition and understanding, to convert the sounds, syllables, or phrases uttered by a person into corresponding text or symbols, or to produce a response such as executing a control action or giving an answer. Its applications are very broad and touch almost every area of life, for example computer control, industrial control, and information network queries.
Speech recognition systems can be divided into many kinds according to their requirements. By recognition object, they can be divided into isolated-word recognition, connected-word recognition, and continuous speech recognition. By the range of speakers covered, they can be divided into speaker-dependent and speaker-independent recognition systems. By recognition method, the main types are template matching, probabilistic models, and systems based on artificial neural networks. A speech recognition system usually defines a vocabulary and recognizes only the entries contained in that vocabulary. In the prior art, recognition is essentially semi-automatic: the follow-up action has to be carried out manually, so efficiency is relatively low. In addition, most prior-art systems perform only a single recognition pass, which lowers the recognition rate and also affects recognition accuracy.
Summary of the invention
The technical problem to be solved by the present invention is to provide a speech recognition system and method that address the above deficiencies of the prior art.
The technical solution by which the present invention solves the above technical problem is as follows.
According to one aspect of the present invention, a speech recognition system is provided, comprising an acquisition module, a conversion module, an extraction module, a matching module, an execution module, and a speech database. The acquisition module collects the voice information to be recognized. The conversion module converts the voice information to be recognized into first standard audio information that the extraction module can recognize. The extraction module parses the first standard audio information and extracts the keyword from it. The matching module retrieves the target command words pre-stored in the speech database and matches them against the keyword in the first standard audio information; if the match succeeds, it sends the corresponding target command word to the execution module. The execution module receives the target command word and executes the corresponding target action. The speech database stores the configured target command words.
According to another aspect of the present invention, a speech recognition method is provided, comprising:
Step 1: collect the voice information to be recognized;
Step 2: convert the voice information to be recognized into recognizable first standard audio information;
Step 3: parse the first standard audio information and extract the keyword from it;
Step 4: retrieve the target command words pre-stored in the speech database and match them against the keyword in the first standard audio information; if the match succeeds, send the corresponding target command word to the execution module;
Step 5: the execution module receives the target command word and executes the corresponding target action.
The beneficial effects of the present invention are as follows. By converting the collected voice signal to be recognized and extracting keywords from it, the speech recognition system and method improve the recognition rate and recognition accuracy of the voice signal. The corresponding target action is executed by the execution module, which makes voice control automated and intelligent. While speech recognition quality is maintained, the flexibility of the recognition system is greatly increased and user experience and operating efficiency are improved.
Brief description of the drawings
Fig. 1 is a structural schematic diagram of a speech recognition system of the present invention;
Fig. 2 is a flow chart of a speech recognition method of the present invention.
Detailed description of the embodiments
The principles and features of the present invention are described below with reference to the accompanying drawings. The examples given serve only to explain the present invention and are not intended to limit its scope.
Embodiment 1: a speech recognition system. The speech recognition system of the present invention is described in detail below with reference to Fig. 1.
As shown in Fig. 1, the speech recognition system comprises an acquisition module, a conversion module, an extraction module, a matching module, an execution module, and a speech database.
The acquisition module collects the voice information to be recognized. The conversion module converts the voice information to be recognized into first standard audio information that the extraction module can recognize. The extraction module parses the first standard audio information and extracts the keyword from it. The matching module retrieves the target command words pre-stored in the speech database and matches them against the keyword in the first standard audio information; if the match succeeds, it sends the corresponding target command word to the execution module. The execution module receives the target command word and executes the corresponding target action. The speech database stores the configured target command words.
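The module chain described above can be illustrated with a minimal Python sketch. Everything named here (SpeechDatabase, convert, extract_keywords, match, recognize_and_execute) is hypothetical and not taken from the patent, and the sketch assumes the "voice information" has already been transcribed to text, so the conversion and extraction steps are reduced to simple string handling.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class SpeechDatabase:
    """Speech database stand-in: holds the configured target command words."""
    command_words: List[str] = field(default_factory=list)


def convert(raw_voice: str) -> str:
    """Conversion module stand-in: normalize the input into a 'standard' form.
    Assumption: the input is already text, so this only trims and lowercases."""
    return raw_voice.strip().lower()


def extract_keywords(standard_audio: str) -> List[str]:
    """Extraction module stand-in: split the normalized input into keywords."""
    return standard_audio.split()


def match(keywords: List[str], db: SpeechDatabase) -> Optional[str]:
    """Matching module: return the first stored command word found among the
    keywords, or None if the match fails."""
    for command in db.command_words:
        if command in keywords:
            return command
    return None


def recognize_and_execute(
    raw_voice: str,
    db: SpeechDatabase,
    actions: Dict[str, Callable[[], None]],
) -> bool:
    """Run conversion, extraction and matching, then let the execution step
    run the action registered for the matched command word."""
    keywords = extract_keywords(convert(raw_voice))
    command = match(keywords, db)
    if command is None:
        return False
    actions[command]()  # execution module: perform the target action
    return True
```

A caller would register one callable per target command word in `actions`; a return value of False corresponds to the failed-match case handled by the supplementary acquisition module described later.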
The speech recognition system of this embodiment further comprises a preprocessing module. After the acquisition module has collected the voice information to be recognized, the preprocessing module subjects it to analog-to-digital conversion, "method" [sic], anti-aliasing filtering, and pre-emphasis, and sends the preprocessed signal to the conversion module. The preprocessing module optimizes the voice signal collected by the acquisition module and removes impurity components from it, which makes recognition by the subsequent conversion module easier and improves both recognition efficiency and recognition accuracy.
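The patent does not specify how pre-emphasis is carried out. A common form is the first-order filter y[n] = x[n] - alpha * x[n-1], which boosts high frequencies relative to low ones. The sketch below assumes that form; the function name and the default coefficient of 0.97 are illustrative assumptions, not values from the source.

```python
from typing import List, Sequence


def pre_emphasis(samples: Sequence[float], alpha: float = 0.97) -> List[float]:
    """Apply the first-order pre-emphasis filter y[n] = x[n] - alpha * x[n-1].

    The first sample is passed through unchanged because it has no predecessor.
    alpha = 0.97 is an assumed, commonly used value, not one given in the patent.
    """
    if len(samples) == 0:
        return []
    out = [float(samples[0])]
    for n in range(1, len(samples)):
        out.append(samples[n] - alpha * samples[n - 1])
    return out
```

For a constant input the output settles at (1 - alpha) times the input level, which shows how strongly the filter suppresses the low-frequency content.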
Preferably, the speech recognition system of this embodiment further comprises a supplementary acquisition module, which collects supplementary voice information when the matching module fails to find a match. The preprocessing module preprocesses the supplementary voice information, the conversion module then converts the preprocessed supplementary voice information into second standard audio information that the extraction module can recognize, and the extraction module and the matching module are invoked in turn. The supplementary acquisition module can improve the success rate of speech recognition: unlike a conventional recognition system, the system of this embodiment can perform supplementary recognition when the matching module fails to match, which is of considerable significance in practical applications.
Preferably, if the matching module fails to match the keyword in the second standard audio information against the target command words pre-stored in the speech database, the next round of matching is repeated; when the number of failed matches reaches a preset threshold, the system reports that the input cannot be recognized. This further improves the success rate of speech recognition. In practice, supplementary speech recognition can itself fail; with this arrangement, the recognition success rate for the supplementary voice signal can be greatly improved.
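The supplementary acquisition and failure-threshold behavior can be sketched as a bounded retry loop. The names recognize_with_retries, capture, match_fn, and max_failures are hypothetical; the patent only states that matching is repeated until a preset threshold of failures is reached, and does not give a value for that threshold.

```python
from typing import Callable, Optional


def recognize_with_retries(
    capture: Callable[[int], str],
    match_fn: Callable[[str], Optional[str]],
    max_failures: int = 3,
) -> Optional[str]:
    """Try to match an utterance, collecting supplementary input after each failure.

    capture(0) stands for the initial acquisition; capture(k) for k > 0 stands
    for the k-th supplementary acquisition. match_fn returns the matched target
    command word, or None when the match fails. Returns None once the number of
    failed matches reaches max_failures, at which point the caller would report
    that the input cannot be recognized.
    """
    failures = 0
    while failures < max_failures:
        utterance = capture(failures)
        command = match_fn(utterance)
        if command is not None:
            return command
        failures += 1
    return None
```

Counting failures rather than attempts mirrors the wording of the description, where the prompt is triggered when the failure count reaches the threshold.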
Preferably, the target command words stored in the speech database are sorted in descending order by their number of successful matches. For a given speech recognition system, analysis of earlier recognition data shows that certain target command words are matched successfully more often than others; that is, the user performs certain target actions more frequently. Sorting the stored target command words in descending order of successful matches therefore improves the recognition efficiency of the system, shortens recognition time, and improves the user experience.
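One way to realize this ordering is to keep a success counter per command word and re-sort after every successful match, so that frequently used commands are compared first. The class below is a minimal sketch under that assumption; CommandStore and its methods are hypothetical names, and the patent does not say when or how the re-sorting is triggered.

```python
from typing import Dict, Iterable, List, Optional


class CommandStore:
    """Keeps target command words ordered by descending successful-match count."""

    def __init__(self, command_words: Iterable[str]) -> None:
        self._order: List[str] = list(command_words)
        self._counts: Dict[str, int] = {word: 0 for word in self._order}

    def ordered(self) -> List[str]:
        """Return the command words in their current matching order."""
        return list(self._order)

    def match(self, keywords: Iterable[str]) -> Optional[str]:
        """Scan command words in order; on a hit, count it and re-sort."""
        keyword_set = set(keywords)
        for word in self._order:
            if word in keyword_set:
                self._counts[word] += 1
                # list.sort is stable, so words with equal counts keep
                # their previous relative order.
                self._order.sort(key=lambda w: self._counts[w], reverse=True)
                return word
        return None
```

With a linear scan, the expected number of comparisons falls as the most frequently matched words move to the front, which is the efficiency gain the description refers to.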
Embodiment 2: a speech recognition method. The speech recognition method of the present invention is described in detail below with reference to Fig. 2.
As shown in Fig. 2, the speech recognition method comprises:
Step 1: collect the voice information to be recognized;
Step 2: convert the voice information to be recognized into recognizable first standard audio information;
Step 3: parse the first standard audio information and extract the keyword from it;
Step 4: retrieve the target command words pre-stored in the speech database and match them against the keyword in the first standard audio information; if the match succeeds, send the corresponding target command word to the execution module;
Step 5: the execution module receives the target command word and executes the corresponding target action.
In this embodiment, before step 2 is executed, the voice information to be recognized is also subjected to analog-to-digital conversion, "method" [sic], anti-aliasing filtering, and pre-emphasis. This preprocessing optimizes the collected voice signal to be recognized and removes impurity components from it, which makes the subsequent conversion step easier and improves both recognition efficiency and recognition accuracy.
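The patent does not describe the anti-aliasing filter. In hardware, such a filter normally sits in the analog path ahead of the converter; as a purely illustrative digital stand-in (of the kind used before downsampling), the sketch below designs a Hamming-windowed sinc low-pass FIR filter and applies it by direct convolution. The function names, tap count, and cutoff values are assumptions made for this example.

```python
import math
from typing import List, Sequence


def lowpass_fir(num_taps: int, cutoff_hz: float, sample_rate_hz: float) -> List[float]:
    """Design a Hamming-windowed sinc low-pass filter with unity DC gain."""
    if num_taps < 3 or num_taps % 2 == 0:
        raise ValueError("num_taps must be an odd integer >= 3")
    if not 0.0 < cutoff_hz < sample_rate_hz / 2.0:
        raise ValueError("cutoff must lie between 0 and the Nyquist frequency")
    fc = cutoff_hz / sample_rate_hz  # normalized cutoff in cycles per sample
    mid = (num_taps - 1) // 2
    taps = []
    for n in range(num_taps):
        k = n - mid
        # Ideal low-pass impulse response (sinc), with the k == 0 limit handled
        ideal = 2.0 * fc if k == 0 else math.sin(2.0 * math.pi * fc * k) / (math.pi * k)
        window = 0.54 - 0.46 * math.cos(2.0 * math.pi * n / (num_taps - 1))
        taps.append(ideal * window)
    total = sum(taps)
    return [t / total for t in taps]  # normalize so that the DC gain is 1


def fir_filter(samples: Sequence[float], taps: Sequence[float]) -> List[float]:
    """Filter samples by direct convolution, treating earlier input as zero."""
    out = []
    for n in range(len(samples)):
        acc = 0.0
        for k, tap in enumerate(taps):
            if n - k >= 0:
                acc += tap * samples[n - k]
        out.append(acc)
    return out
```

For example, `lowpass_fir(31, 3400.0, 16000.0)` yields a filter that passes a constant signal unchanged once the filter has filled, while strongly attenuating a component at the Nyquist frequency.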
Preferably, in step 4, when matching the target command words pre-stored in the speech database against the keyword in the first standard audio information fails, supplementary voice information is collected and preprocessed, and the method returns to step 2: the preprocessed supplementary voice information is converted into recognizable second standard audio information, after which the extraction of step 3 and the matching of step 4 are performed in turn. Collecting supplementary voice information can improve the success rate of speech recognition: unlike a conventional recognition system, the method of this embodiment can perform supplementary matching when the first match fails, which is of considerable significance in practical applications.
Preferably, if matching the keyword in the second standard audio information against the target command words pre-stored in the speech database fails, the next round of matching is repeated; when the number of failed matches reaches a preset threshold, a prompt indicates that the input cannot be recognized. This further improves the success rate of speech recognition. In practice, supplementary speech recognition can itself fail; with this arrangement, the recognition success rate for the supplementary voice signal can be greatly improved.
Preferably, the target command words stored in the speech database are sorted in descending order by their number of successful matches. For a given speech recognition system, analysis of earlier recognition data shows that certain target command words are matched successfully more often than others; that is, the user performs certain target actions more frequently. Sorting the stored target command words in descending order of successful matches therefore improves the recognition efficiency of the system, shortens recognition time, and improves the user experience.
By converting the collected voice signal to be recognized and extracting keywords from it, the speech recognition system and method of the present invention improve the recognition rate and recognition accuracy of the voice signal. The corresponding target action is executed by the execution module, which makes voice control automated and intelligent. While speech recognition quality is maintained, the flexibility of the recognition system is greatly increased and user experience and operating efficiency are improved.
The above are merely preferred embodiments of the present invention and are not intended to limit it. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within its scope of protection.

Claims (10)

1. A speech recognition system, characterized in that it comprises an acquisition module, a conversion module, an extraction module, a matching module, an execution module, and a speech database;
the acquisition module is used to collect the voice information to be recognized;
the conversion module is used to convert the voice information to be recognized into first standard audio information that the extraction module can recognize;
the extraction module is used to parse the first standard audio information and extract the keyword from it;
the matching module is used to retrieve the target command words pre-stored in the speech database and match them against the keyword in the first standard audio information and, if the match succeeds, to send the corresponding target command word to the execution module;
the execution module receives the target command word and executes the corresponding target action;
the speech database is used to store the configured target command words.
2. The speech recognition system according to claim 1, characterized in that it further comprises a preprocessing module, which, after the acquisition module has collected the voice information to be recognized, subjects it to analog-to-digital conversion, "method" [sic], anti-aliasing filtering, and pre-emphasis, and sends the preprocessed signal to the conversion module.
3. The speech recognition system according to claim 2, characterized in that it further comprises a supplementary acquisition module, which collects supplementary voice information when the matching module fails to match; the preprocessing module preprocesses the supplementary voice information, the conversion module then converts the preprocessed supplementary voice information into second standard audio information that the extraction module can recognize, and the extraction module and the matching module are invoked in turn.
4. The speech recognition system according to claim 3, characterized in that, if the matching module fails to match the keyword in the second standard audio information against the target command words pre-stored in the speech database, the next round of matching is repeated, and when the number of failed matches reaches a preset threshold, a prompt indicates that the input cannot be recognized.
5. The speech recognition system according to any one of claims 1 to 4, characterized in that the target command words stored in the speech database are sorted in descending order by their number of successful matches.
6. A speech recognition method, characterized in that it comprises:
Step 1: collect the voice information to be recognized;
Step 2: convert the voice information to be recognized into recognizable first standard audio information;
Step 3: parse the first standard audio information and extract the keyword from it;
Step 4: retrieve the target command words pre-stored in the speech database and match them against the keyword in the first standard audio information; if the match succeeds, send the corresponding target command word to the execution module;
Step 5: the execution module receives the target command word and executes the corresponding target action.
7. The speech recognition method according to claim 6, characterized in that, before step 2 is executed, the voice information to be recognized is also subjected to analog-to-digital conversion, "method" [sic], anti-aliasing filtering, and pre-emphasis.
8. The speech recognition method according to claim 7, characterized in that, in step 4, when matching the target command words pre-stored in the speech database against the keyword in the first standard audio information fails, supplementary voice information is collected and preprocessed, and the method returns to step 2, where the preprocessed supplementary voice information is converted into recognizable second standard audio information, after which the extraction of step 3 and the matching of step 4 are performed in turn.
9. The speech recognition method according to claim 8, characterized in that, if matching the keyword in the second standard audio information against the target command words pre-stored in the speech database fails, the next round of matching is repeated, and when the number of failed matches reaches a preset threshold, a prompt indicates that the input cannot be recognized.
10. The speech recognition method according to any one of claims 6 to 9, characterized in that the target command words stored in the speech database are sorted in descending order by their number of successful matches.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510728467.2A CN105261356A (en) 2015-10-30 2015-10-30 Voice recognition system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510728467.2A CN105261356A (en) 2015-10-30 2015-10-30 Voice recognition system and method

Publications (1)

Publication Number Publication Date
CN105261356A true CN105261356A (en) 2016-01-20

Family

ID=55101016

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510728467.2A Pending CN105261356A (en) 2015-10-30 2015-10-30 Voice recognition system and method

Country Status (1)

Country Link
CN (1) CN105261356A (en)

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1012828B1 (en) * 1997-09-18 2001-08-16 Siemens Aktiengesellschaft Method for recognising a keyword in speech
US6249759B1 (en) * 1998-01-16 2001-06-19 Nec Corporation Communication apparatus using speech vector comparison and recognition
CN1365487A (en) * 1999-06-24 2002-08-21 西门子公司 Voice recognition method and device
CN104836925A (en) * 2014-02-11 2015-08-12 携程计算机技术(上海)有限公司 Consultation system and method
CN103888606A (en) * 2014-03-11 2014-06-25 上海乐今通信技术有限公司 Mobile terminal and unlocking method thereof

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
姜薇等: "《大学计算机基础教程》", 30 September 2008 *
王华奎: "《移动通信原理与技术》", 31 October 2009 *
王知津: "《信息检索与处理》", 30 June 2015 *

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106297796A (en) * 2016-03-25 2017-01-04 李克军 A kind of pilot rehearses monitoring method and device
CN105913840A (en) * 2016-06-20 2016-08-31 西可通信技术设备(河源)有限公司 Speech recognition device and mobile terminal
CN106373562A (en) * 2016-08-31 2017-02-01 黄钰 Robot voice recognition method based on natural language processing
CN107256707A (en) * 2017-05-24 2017-10-17 深圳市冠旭电子股份有限公司 A kind of audio recognition method, system and terminal device
CN107393531A (en) * 2017-07-20 2017-11-24 Tcl医疗核磁技术(无锡)有限公司 The phonetic controller and method of a kind of medical detection system
CN107845381A (en) * 2017-10-27 2018-03-27 安徽硕威智能科技有限公司 A kind of method and system of robot semantic processes
CN107909995B (en) * 2017-11-16 2021-08-17 北京小米移动软件有限公司 Voice interaction method and device
CN107909995A (en) * 2017-11-16 2018-04-13 北京小米移动软件有限公司 Voice interactive method and device
CN107958215A (en) * 2017-11-23 2018-04-24 深圳市分期乐网络科技有限公司 A kind of antifraud recognition methods, device, server and storage medium
CN108922267A (en) * 2018-07-12 2018-11-30 河南恩久信息科技有限公司 A kind of intelligent voice system for wisdom classroom
CN109147785A (en) * 2018-09-19 2019-01-04 淄博职业学院 A kind of method and apparatus of Korean audio-frequency information processing
CN109448726A (en) * 2019-01-14 2019-03-08 李庆湧 A kind of method of adjustment and system of voice control accuracy rate
CN110299139A (en) * 2019-06-29 2019-10-01 联想(北京)有限公司 A kind of sound control method, device and electronic equipment
CN110928999A (en) * 2019-12-09 2020-03-27 北京小米智能科技有限公司 Destination determining method and device, electronic equipment and storage medium
CN110928999B (en) * 2019-12-09 2023-02-24 北京小米智能科技有限公司 Destination determining method and device, electronic equipment and storage medium
CN112086155A (en) * 2020-09-11 2020-12-15 北京欧应信息技术有限公司 Diagnosis and treatment information structured collection method based on voice input
CN113096654A (en) * 2021-03-26 2021-07-09 山西三友和智慧信息技术股份有限公司 Computer voice recognition system based on big data

Similar Documents

Publication Publication Date Title
CN105261356A (en) Voice recognition system and method
CN107578769A (en) Speech data mask method and device
CN103456297A (en) Method and device for matching based on voice recognition
CN106448654A (en) Robot speech recognition system and working method thereof
CN106782521A (en) A kind of speech recognition system
CN112735383A (en) Voice signal processing method, device, equipment and storage medium
CN109065051B (en) Voice recognition processing method and device
CN111445898B (en) Language identification method and device, electronic equipment and storage medium
CN107564528B (en) Method and equipment for matching voice recognition text with command word text
CN103594085A (en) Method and system providing speech recognition result
CN111161726B (en) Intelligent voice interaction method, device, medium and system
CN107705791A (en) Caller identity confirmation method, device and Voiceprint Recognition System based on Application on Voiceprint Recognition
CN107845381A (en) A kind of method and system of robot semantic processes
CN111402899B (en) Cross-channel voiceprint recognition method and device
CN111933120A (en) Voice data automatic labeling method and system for voice recognition
CN112270166A (en) Method for quickly making and creating 5G message
CN110288996A (en) A kind of speech recognition equipment and audio recognition method
CN106682642A (en) Multi-language-oriented behavior identification method and multi-language-oriented behavior identification system
CN110688473A (en) Method for robot to dynamically acquire information
CN107180629B (en) Voice acquisition and recognition method and system
CN110619877A (en) Voice recognition man-machine interaction method, device and system applied to laser pen and storage medium
CN112466287B (en) Voice segmentation method, device and computer readable storage medium
CN111128127A (en) Voice recognition processing method and device
CN111312252A (en) Method for inviting address book personnel through AI voice
CN111914777B (en) Method and system for identifying robot instruction in cross-mode manner

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20160120
