CN110910869B - Voice recognition method and device - Google Patents

Info

Publication number
CN110910869B
CN110910869B CN201811072652.0A CN201811072652A CN110910869B CN 110910869 B CN110910869 B CN 110910869B CN 201811072652 A CN201811072652 A CN 201811072652A CN 110910869 B CN110910869 B CN 110910869B
Authority
CN
China
Prior art keywords
recognition
threshold value
command
words
equipment
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201811072652.0A
Other languages
Chinese (zh)
Other versions
CN110910869A (en)
Inventor
何云鹏
高君效
余杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chipintelli Technology Co Ltd
Original Assignee
Chipintelli Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2018-09-14
Filing date
2018-09-14
Publication date
2022-02-18
Application filed by Chipintelli Technology Co Ltd
Priority to CN201811072652.0A
Publication of CN110910869A
Application granted
Publication of CN110910869B
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L2015/088 Word spotting
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223 Execution procedure of a spoken command

Abstract

A speech recognition method comprises: grouping command words that have complementary functions and partly similar pronunciations into pairs; setting, for the two command words of each pair, a first recognition threshold set A1 and B1 and a second recognition threshold set A2 and B2, where A1 is less than A2 and B1 is less than B2; and recognizing a spoken command word by judging whether its recognition score is greater than either threshold of the first recognition threshold set, and then of the second recognition threshold set, of any pair of complementary command words. If the score is not greater than either threshold of the second recognition threshold set but is greater than one of the thresholds of the first recognition threshold set, the current working state of the device is identified and the device is switched to the complementary state. The invention also discloses a voice recognition device suitable for carrying out the method. By grouping complementary commands and assigning them tiered thresholds, the invention achieves effective voice control of the device and improves the recognition accuracy for voice control commands whose pronunciations largely overlap.

Description

Voice recognition method and device
Technical Field
The invention belongs to the field of artificial intelligence, relates to a voice recognition technology, and particularly relates to a voice recognition method and voice recognition equipment.
Background
Many current devices are controlled by speech recognition: the user speaks a command word or sentence that expresses a function, and the device recognizes it, interprets its meaning, and performs the corresponding control function. In practice, many device functions, such as power on and power off, have command words that differ by only one character yet trigger mutually exclusive functions. When the user speaks quickly or pronounces unclearly, the device often misrecognizes the command and performs an action contrary to the user's intent, which greatly degrades the experience of using the device.
Disclosure of Invention
In order to overcome the defects in the prior art, the invention discloses a voice recognition method and voice recognition equipment.
The voice recognition method comprises the following steps:
command words with complementary functions and similar pronunciations are grouped and associated in pairs, where complementary functions means that the device can only be in working state A or working state B, so that not being in A implies being in B; for the two complementary command words within the same group, a first recognition threshold set A1 and B1 and a second recognition threshold set A2 and B2 are set, where A1 is less than A2 and B1 is less than B2;
a command word is recognized, and it is judged whether its recognition score is greater than either threshold of the first recognition threshold set A1 and B1 of any pair of complementary command words; if so, it is further judged whether the score is greater than either threshold of the second recognition threshold set A2 and B2;
if the score is not greater than either threshold of the second recognition threshold set but is greater than one of the thresholds of the first recognition threshold set, the current working state of the device is identified and the device is then switched to the complementary state;
if the score is not greater than either threshold of the first recognition threshold set A1 and B1, comparison and recognition against the other command words continue.
Preferably, the first recognition thresholds A1 and B1 are set for the similar parts of the command words.
The invention also discloses a voice recognition device comprising a command lexicon, a recognition module, and an output module. The command lexicon contains common words and associated words; the associated words are associated in pairs, have similar pronunciations, and correspond to complementary commands. The device further comprises a device state detection module, to which the recognition module is connected by signal.
Preferably, the recognition module comprises a primary recognition module and a secondary recognition module, and the secondary recognition module is connected by signal to the device state detection module.
The method groups complementary commands and assigns them tiered thresholds, and combines the user's control intent with the current state of the device to achieve effective voice control. It thereby improves the recognition accuracy for voice control commands whose pronunciations largely overlap, without increasing hardware cost, and is effective, convenient, and easy to use.
Drawings
FIG. 1 is a flow chart illustrating a speech recognition method according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of an embodiment of the speech recognition device according to the present invention.
Detailed Description
The following provides a more detailed description of the present invention.
The basic principle of speech recognition is as follows: a microphone collects the sound signal of a command word and converts it into an electrical signal; the electrical signal is compared with stored data; and once the sound signal is recognized through this comparison, the instruction corresponding to it is invoked to operate the device. For example, the user says the command "turn on the air conditioner". The microphone of an air conditioner with voice recognition picks up the sound signal and converts it into an electrical signal, the electrical signal is compared with the stored data, and once the sound signal is recognized the air conditioner starts working.
No incoming sound signal ever coincides exactly with a stored one, so a threshold is used to judge whether a spoken command word matches a command word stored in the command lexicon. Speech recognition works by judging the similarity between the pronounced signal and a word in the existing lexicon, and the threshold can be expressed as a similarity. For example, the electrical signal converted from the sound signal is scored by its similarity to a word in the command lexicon; if the score is higher than a certain value, the signal is considered to match that word and the corresponding command is issued. That value is the speech recognition threshold.
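The thresholding idea can be illustrated with a minimal Python sketch. It is not the patented recognizer: text similarity from the standard library's `difflib.SequenceMatcher` stands in for acoustic similarity, and the function names, the sample lexicon, and the threshold of 0.8 are assumptions chosen only for illustration.

```python
from difflib import SequenceMatcher
from typing import List, Optional


def similarity(heard: str, command: str) -> float:
    """Stand-in score in [0, 1]. A real device would score acoustic
    features; plain text similarity is used here only as a placeholder."""
    return SequenceMatcher(None, heard, command).ratio()


def match_command(heard: str, lexicon: List[str],
                  threshold: float = 0.8) -> Optional[str]:
    """Return the best-scoring command if its score exceeds the
    threshold; otherwise return None (no command is issued)."""
    best = max(lexicon, key=lambda cmd: similarity(heard, cmd))
    return best if similarity(heard, best) > threshold else None


# Hypothetical lexicon, for illustration only.
LEXICON = ["air conditioner on", "open the door"]
result = match_command("air conditioner on", LEXICON)
```

With a single threshold, two commands that differ by one character score almost equally, which is the weakness the paired thresholds are meant to address.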
The voice recognition method comprises the following steps:
command words with complementary functions and similar pronunciations are grouped and associated in pairs, where complementary functions means that the device can only be in working state A or working state B, so that not being in A implies being in B; for the two complementary command words within the same group, a first recognition threshold set A1 and B1 and a second recognition threshold set A2 and B2 are set, where A1 is less than A2 and B1 is less than B2;
a command word is recognized, and it is judged whether its recognition score is greater than either threshold of the first recognition threshold set A1 and B1 of any pair of complementary command words; if so, it is further judged whether the score is greater than either threshold of the second recognition threshold set A2 and B2;
if the score is not greater than either threshold of the second recognition threshold set but is greater than one of the thresholds of the first recognition threshold set, the current working state of the device is identified and the device is then switched to the complementary state;
if the score is not greater than either threshold of the first recognition threshold set A1 and B1, comparison and recognition against the other command words continue.
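The steps above can be sketched in Python as a single decision function. This is one possible reading rather than the patentee's implementation: the names `PairThresholds` and `decide`, the numeric scores, and the tie-breaking rule used when both second-stage thresholds are exceeded are assumptions that the text does not specify.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class PairThresholds:
    """Thresholds for one pair of complementary commands A and B."""
    a1: float  # first-stage threshold for command A
    b1: float  # first-stage threshold for command B
    a2: float  # second-stage threshold for command A
    b2: float  # second-stage threshold for command B

    def __post_init__(self) -> None:
        # The method requires A1 < A2 and B1 < B2.
        if not (self.a1 < self.a2 and self.b1 < self.b2):
            raise ValueError("need a1 < a2 and b1 < b2")


def decide(score_a: float, score_b: float, th: PairThresholds,
           current_state: str) -> Optional[str]:
    """Return the state to switch to ("A" or "B"), or None when the
    utterance does not belong to this pair at all."""
    # Stage 1: neither first-stage threshold exceeded, so the caller
    # should continue comparing against other command words.
    if score_a <= th.a1 and score_b <= th.b1:
        return None
    # Stage 2: a second-stage threshold is exceeded, so execute that
    # command. Preferring the higher score on a double hit is an
    # assumption; the text does not cover that case.
    a_hit, b_hit = score_a > th.a2, score_b > th.b2
    if a_hit and (not b_hit or score_a >= score_b):
        return "A"
    if b_hit:
        return "B"
    # Fallback: the pair is known but the member is not, so toggle
    # to the complement of the current working state.
    return "B" if current_state == "A" else "A"


th = PairThresholds(a1=0.5, b1=0.5, a2=0.8, b2=0.8)
```

For instance, `decide(0.7, 0.65, th, "A")` passes the first stage but not the second, so the function returns `"B"`, the complement of the current state.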
Command words with complementary functions and partly similar pronunciations are widespread in practice, for example "air conditioner on" and "air conditioner off", "air-conditioning cooling" and "air-conditioning heating", or "open microwave oven door" and "close microwave oven door". As the meanings of these command words show, the device can only be in one of two working states: if it is not in A, it is in B. Such command words are highly similar in pronunciation and are therefore easily misrecognized.
One specific embodiment of the present invention for recognizing such words is as follows:
For example, the two commands "air-conditioning cooling" and "air-conditioning heating" are first placed in the same group and associated with each other, and two sets of thresholds, A1 and B1, and A2 and B2, are set for them, where A1 is less than A2 and B1 is less than B2. A1 and B1 can be tied to recognizing the part that the two words in the group share. For "air-conditioning cooling" and "air-conditioning heating", the shared features are that the first three characters are pronounced identically and that both command words consist of four characters. The rule can therefore be that when the first three characters are recognized in the same order, followed by a fourth, undetermined character X (that is, the pattern "air-conditioning X"), the utterance is deemed to exceed one of the thresholds of the first set.
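A small Python sketch of this first-stage rule follows. It operates on already-recognized syllable tokens rather than audio. The pinyin tokens are an assumed romanization of the two four-character Chinese commands, and the function names are hypothetical.

```python
from typing import List


def shared_prefix(a: List[str], b: List[str]) -> List[str]:
    """Longest run of leading tokens that the two commands share."""
    prefix = []
    for x, y in zip(a, b):
        if x != y:
            break
        prefix.append(x)
    return prefix


def passes_first_stage(heard: List[str], cmd_a: List[str],
                       cmd_b: List[str]) -> bool:
    """True when the utterance has the same number of tokens as the
    paired commands and starts with their shared part, whatever the
    remaining (undetermined) token is."""
    prefix = shared_prefix(cmd_a, cmd_b)
    return (bool(prefix)
            and len(heard) == len(cmd_a) == len(cmd_b)
            and heard[:len(prefix)] == prefix)


# Assumed romanization of "air-conditioning cooling" / "heating".
COOL = ["kong", "tiao", "zhi", "leng"]
HEAT = ["kong", "tiao", "zhi", "re"]
unclear = ["kong", "tiao", "zhi", "?"]  # fourth syllable not understood
in_group = passes_first_stage(unclear, COOL, HEAT)
```

Here the unclear utterance still passes the first stage, so it is handed on to the second stage instead of being compared with unrelated command words.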
After the user utters a command word, the primary recognition module within the recognition module first determines whether the command word reaches threshold A1 or B1 of the first recognition threshold set of some group of associated words. If not, recognition continues against the ungrouped command words or the associated words of other groups. If so, the secondary recognition module goes on to compare the command word with the two words of the group, assigns probability scores, and checks them against the second recognition threshold set A2 and B2; if the result exceeds either of these final recognition thresholds, the instruction corresponding to that command word is executed. Fig. 1 shows the detailed recognition flow of the present invention.
Two recognition threshold sets are set per group because associated words are highly similar to each other yet clearly different from non-associated words. Two-stage recognition with the two threshold sets first roughly determines whether an utterance belongs to a given associated group, and then applies a raised recognition threshold within that group, which improves recognition precision and avoids issuing wrong instructions.
If the second-stage recognition cannot decide, the device falls back on its current working state. For example, with "air-conditioning cooling" and "air-conditioning heating", the speaker may, for reasons of their own, pronounce "cold" and "hot" indistinctly; the device can recognize that the utterance belongs to the associated group but can hardly tell cooling from heating. In that case the device switches directly to the other state, based on its current working state.
Fig. 2 shows a specific implementation of a speech recognition device capable of carrying out the speech recognition method. The device comprises a command lexicon, a recognition module, and an output module. The command lexicon contains common words and associated words; the associated words are associated in pairs, have similar pronunciations, and correspond to complementary commands. The device further comprises a device state detection module, to which the recognition module is connected by signal. The recognition module may comprise a primary recognition module and a secondary recognition module, the latter connected by signal to the device state detection module.
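The module layout can be mirrored in a short Python sketch that shows why the secondary recognition module needs a connection to the state detection module. All class and method names are hypothetical, and identifying each working state by the command that produces it is an assumption made to keep the example small.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class CommandLexicon:
    """Common words plus associated words stored as complementary pairs."""
    common_words: List[str] = field(default_factory=list)
    associated_pairs: List[Tuple[str, str]] = field(default_factory=list)

    def partner_of(self, word: str) -> Optional[str]:
        """Return the complementary command of `word`, if it has one."""
        for a, b in self.associated_pairs:
            if word == a:
                return b
            if word == b:
                return a
        return None


class DeviceStateDetector:
    """Tracks the working state, named here by the command producing it."""
    def __init__(self, state: str) -> None:
        self._state = state

    def current(self) -> str:
        return self._state

    def update(self, state: str) -> None:
        self._state = state


class OutputModule:
    """Collects issued commands; a real device would drive hardware."""
    def __init__(self) -> None:
        self.sent: List[str] = []

    def send(self, command: str) -> None:
        self.sent.append(command)


class SecondaryRecognizer:
    """Second-stage module, wired to the state detector."""
    def __init__(self, lexicon: CommandLexicon,
                 detector: DeviceStateDetector,
                 output: OutputModule) -> None:
        self.lexicon = lexicon
        self.detector = detector
        self.output = output

    def resolve_ambiguous(self) -> Optional[str]:
        """Pair identified but member unclear: issue the complement of
        the command matching the current state."""
        target = self.lexicon.partner_of(self.detector.current())
        if target is not None:
            self.output.send(target)
            self.detector.update(target)
        return target


lexicon = CommandLexicon(
    common_words=["set timer"],
    associated_pairs=[("air-conditioning cooling",
                       "air-conditioning heating")])
detector = DeviceStateDetector("air-conditioning cooling")
output = OutputModule()
issued = SecondaryRecognizer(lexicon, detector, output).resolve_ambiguous()
```

Because the device starts in the cooling state, the ambiguous utterance resolves to the heating command, and the detector is updated so that a second ambiguous utterance would toggle back.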
The foregoing describes preferred embodiments of the present invention. Unless obviously contradictory or premised on a particular preferred embodiment, the preferred embodiments may be combined in any manner. The specific parameters in the examples and embodiments serve only to illustrate clearly the inventor's verification process and are not intended to limit the scope of patent protection, which remains defined by the claims; equivalent structural changes made on the basis of the description of the present invention likewise fall within the scope of protection.

Claims (2)

1. A speech recognition method, comprising the steps of:
command words with complementary functions and similar pronunciations are grouped and associated in pairs, where complementary functions means that the device can only be in working state A or working state B, so that not being in A implies being in B; for the two complementary command words within the same group, a first recognition threshold set A1 and B1 and a second recognition threshold set A2 and B2 are set, where A1 is less than A2 and B1 is less than B2;
a command word is recognized, and it is judged whether its recognition score is greater than either threshold of the first recognition threshold set A1 and B1 of any pair of complementary command words; if so, it is further judged whether the score is greater than either threshold of the second recognition threshold set A2 and B2;
if the score is not greater than either threshold of the second recognition threshold set but is greater than one of the thresholds of the first recognition threshold set, the device is switched to the complementary state;
if the score is not greater than either threshold of the first recognition threshold set A1 and B1, comparison and recognition against the other command words continue.
2. The speech recognition method of claim 1, wherein the first recognition thresholds A1 and B1 are set for the similar parts of the command words.
CN201811072652.0A 2018-09-14 2018-09-14 Voice recognition method and device Active CN110910869B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811072652.0A CN110910869B (en) 2018-09-14 2018-09-14 Voice recognition method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811072652.0A CN110910869B (en) 2018-09-14 2018-09-14 Voice recognition method and device

Publications (2)

Publication Number Publication Date
CN110910869A (en) 2020-03-24
CN110910869B (en) 2022-02-18

Family

ID=69812360

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811072652.0A Active CN110910869B (en) 2018-09-14 2018-09-14 Voice recognition method and device

Country Status (1)

Country Link
CN (1) CN110910869B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111951793B (en) * 2020-08-13 2021-08-24 北京声智科技有限公司 Method, device and storage medium for awakening word recognition
CN112965687A (en) * 2021-03-19 2021-06-15 成都启英泰伦科技有限公司 Multi-user voice recognition product development platform and development method

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003241790A (en) * 2002-02-13 2003-08-29 Internatl Business Mach Corp <Ibm> Speech command processing system, computer device, speech command processing method, and program
CN103714816A (en) * 2012-09-28 2014-04-09 三星电子株式会社 Electronic appratus, server and control method thereof
CN107591155A (en) * 2017-08-29 2018-01-16 珠海市魅族科技有限公司 Audio recognition method and device, terminal and computer-readable recording medium
CN108055617A (en) * 2017-12-12 2018-05-18 广东小天才科技有限公司 A kind of awakening method of microphone, device, terminal device and storage medium
CN108183844A (en) * 2018-02-06 2018-06-19 四川虹美智能科技有限公司 A kind of intelligent home appliance voice control method, apparatus and system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
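Embedded voice interaction system (嵌入式语音交互系统); 蒙山, 张有为, 毛士艺; Journal of Wuyi University (Natural Science Edition) (《五邑大学学报(自然科学版)》); 2000-04-30; full text *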

Also Published As

Publication number Publication date
CN110910869A (en) 2020-03-24

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant