CN116631397A

CN116631397A - Voice recognition scene awakening method and device

Info

Publication number: CN116631397A
Application number: CN202310675003.4A
Authority: CN
Inventors: 邓汉军; 潘继水; 陈亮; 曾海英; 陈继才; 吴强; 钟伟国
Original assignee: Dongguan Konka Electronics Co Ltd
Current assignee: Dongguan Konka Electronics Co Ltd
Priority date: 2023-06-08
Filing date: 2023-06-08
Publication date: 2023-08-22

Abstract

The invention provides a voice recognition scene awakening method and device, and relates to the technical field of voice recognition. The voice recognition scene wake-up method comprises the following steps: s1, dividing boundaries of a field; s2, numbering the voice modules in the field; s3, limiting the number of simultaneous work of the voice modules in the field; s4, the appointed voice module enters a wake-up word in the scene; s5, sending a wake-up word to an external service terminal, and entering a corresponding scene; s6, a contracted voice module quits a wake-up word of a scene; s7, sending a wake-up word to the external service terminal, and exiting the corresponding scene. Before sending a wake-up word to an external service terminal, temporarily storing signals input by a plurality of voice modules, encoding the signals in sequence, and forming the wake-up word; by temporarily storing and encoding in sequence the signals input by the plurality of speech modules. The voice recognition scene wake-up method and device provided by the invention have the advantage of being beneficial to the external service terminal to receive the voice signal.

Description

Voice recognition scene awakening method and device

Technical Field

The present invention relates to the field of speech recognition technologies, and in particular, to a method and apparatus for waking up speech recognition scenes.

Background

Scene wake experiments are established based on exploring new scene applications in commercial spaces, where scenes can occur in any space, whether restaurants, bars, clothing stores, or vegetable markets, where scenes exist.

At present, when scene wake-up is performed through voice recognition, in order to improve interestingness and participation, a plurality of voice wake-up modules are generally provided, and when the voice wake-up modules work simultaneously, a plurality of voice signals are mutually interfered, so that the voice signals are not received by an external service terminal.

Therefore, it is necessary to provide a new voice recognition scene wake-up method and device to solve the above technical problems.

Disclosure of Invention

In order to solve the technical problems, the invention provides a voice recognition scene wake-up method and a device which are beneficial to an external service terminal to receive voice signals.

The invention provides a voice recognition scene wake-up method, which comprises the following steps:

s1, dividing boundaries of a field;

s2, numbering the voice modules in the field;

s3, limiting the number of simultaneous work of the voice modules in the field;

s4, the appointed voice module enters a wake-up word in the scene;

s5, sending a wake-up word to an external service terminal, and entering a corresponding scene;

s6, a contracted voice module quits a wake-up word of a scene;

s7, sending a wake-up word to the external service terminal, and exiting the corresponding scene.

Preferably, before sending the wake-up word to the external service terminal, temporarily storing signals input by a plurality of voice modules, encoding the signals in sequence, and forming the wake-up word; the signals input by the voice modules are temporarily stored and encoded according to the sequence, so that the orderly input of the voice signals is ensured, and the mutual interference of the voice signals is avoided.

Preferably, when the external service terminal receives the wake-up word, the wake-up word is analyzed through a fuzzy recognition module in the external service terminal.

Preferably, the wake-up word is generated by means of external preset, automatic system generation and the like.

Preferably, the user previews the wake-up scenes of different voice modules through the external display module, and selects the voice module which accords with the preference of the user through the serial number of the voice module.

Preferably, the number of simultaneous operations of the voice modules in the site in S3 is adjustable, and is adjusted according to the data processing capability of the external service terminal, and is generally controlled to 2-4.

A speech recognition scene wake-up device comprising:

the external display module, the voice module, the signal transmission module and the voice information temporary storage and encoding module; the external display module is used for previewing wake-up scenes of different voice modules by a user and introducing the whole device; the voice module is used for receiving voice information; the voice information temporary storage and encoding module is used for temporarily storing the information input by the voice module and encoding the information correspondingly, so that the voice information is received by the external service terminal; the signal transmission module is used for sending the coded voice information to an external service terminal or transmitting an instruction of the external service terminal.

Compared with the related art, the voice recognition scene wake-up method and device provided by the invention have the following steps of

The beneficial effects are that:

1. the signals input by the voice modules are temporarily stored and coded according to the sequence, so that the orderly input of the voice signals is ensured, the mutual interference of the voice signals is avoided, and the voice signals are received by an external service terminal; the signals input by the voice modules are temporarily stored and then sequentially released after being stored, so that the collision of the voice modules in the process of coding is avoided, the sufficient coding time is ensured, and the voice signals are received by an external service terminal;

2. the wake-up words are analyzed through the fuzzy recognition module, so that the user can still accurately recognize the wake-up words when the user has accents, and the using difficulty of the device is reduced.

Drawings

FIG. 1 is a schematic diagram of a control structure of a method and a device for waking up a speech recognition scene according to the present invention;

Detailed Description

The invention will be further described with reference to the drawings and embodiments.

Referring to fig. 1 in combination, the voice recognition scene wake method includes:

s1, dividing boundaries of a field;

s2, numbering the voice modules in the field;

s3, limiting the number of simultaneous work of the voice modules in the field;

s4, the appointed voice module enters a wake-up word in the scene;

s6, a contracted voice module quits a wake-up word of a scene;

Before sending a wake-up word to an external service terminal, temporarily storing signals input by a plurality of voice modules, encoding the signals in sequence, and forming the wake-up word; the signals input by the voice modules are temporarily stored and encoded according to the sequence, so that the orderly input of the voice signals is ensured, and the mutual interference of the voice signals is avoided.

It is necessary to explain that: the signals input by the voice modules are temporarily stored and coded according to the sequence, so that the orderly input of the voice signals is ensured, the mutual interference of the voice signals is avoided, and the voice signals are received by an external service terminal;

it also needs to be stated that: the signals input by the voice modules are temporarily stored and then sequentially released after being stored, so that the collision of the voice modules during encoding is avoided, the sufficient encoding time is ensured, and the voice signals are received by the external service terminal.

When the external service terminal receives the wake-up word, the wake-up word is analyzed through a fuzzy recognition module in the external service terminal.

It is necessary to explain that: the wake-up words are analyzed through the fuzzy recognition module, so that the user can still accurately recognize the wake-up words when the user has accents, and the using difficulty of the device is reduced.

The wake-up words are generated in an external preset mode, a system automatic generation mode and the like.

The user previews the awakening scenes of the different voice modules through the external display module, and selects the voice module which accords with the preference of the user through the serial number of the voice module.

It is necessary to explain that: the voice module which accords with the preference of the user is selected through the number of the voice module, so that the participation degree of the user is improved, and the enthusiasm of the user is further improved; and the number of the voice module is used for selecting the voice module which accords with the preference of the user, so that the interest is better.

The number of the voice modules in the site in the S3 is adjustable, and the voice modules are adjusted according to the data processing capacity of the external service terminal and are generally controlled to be 2-4.

A speech recognition scene wake-up device comprising:

the external display module, the voice module, the signal transmission module and the voice information temporary storage and encoding module; the external display module is used for previewing wake-up scenes of different voice modules by a user and introducing the whole device; the voice module is used for receiving voice information; the voice information temporary storage and encoding module is used for temporarily storing the information input by the voice module and encoding the information correspondingly, so that the voice information is received by the external service terminal; the signal transmission module is used for sending the coded voice information to an external service terminal or transmitting an instruction of the external service terminal

The foregoing description is only illustrative of the present invention and is not intended to limit the scope of the invention, and all equivalent structures or equivalent processes or direct or indirect application in other related technical fields are included in the scope of the present invention.

Claims

1. The voice recognition scene wake-up method is characterized by comprising the following steps of:

s1, dividing boundaries of a field;

s2, numbering the voice modules in the field;

s3, limiting the number of simultaneous work of the voice modules in the field;

s4, the appointed voice module enters a wake-up word in the scene;

s6, a contracted voice module quits a wake-up word of a scene;

2. The voice recognition scene wake-up method of claim 1, wherein signals input by a plurality of voice modules are temporarily stored and sequentially encoded before a wake-up word is transmitted to an external service terminal, and the wake-up word is formed; the signals input by the voice modules are temporarily stored and encoded according to the sequence, so that the orderly input of the voice signals is ensured, and the mutual interference of the voice signals is avoided.

3. The method for waking up a speech recognition scene according to claim 2, wherein the external service terminal analyzes the wake-up word through a fuzzy recognition module built in the external service terminal when receiving the wake-up word.

4. The method for waking up a speech recognition scene according to claim 1, wherein the wake-up words are generated by means of external presets, automatic system generation, etc.

5. The method for waking up a speech recognition scene according to claim 1, wherein the user previews the wake-up scenes of different speech modules through an external display module, and selects a speech module according with his own preference through the number of the speech module.

6. The method for waking up a speech recognition scene according to claim 1, wherein the number of simultaneous operation of the speech modules in the site in S3 is adjustable, and is adjusted according to the data processing capability of the external service terminal, and is generally controlled to 2-4.

7. The voice recognition scene wake-up device is characterized by comprising: