CN112581959B - Intelligent equipment control method, system and voice server - Google Patents

Publication number
CN112581959B
Authority
CN
China
Prior art keywords
text
voice
information
determining
target
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011472759.1A
Other languages
Chinese (zh)
Other versions
CN112581959A (en)
Inventor
张奇
文俊
刘皓
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Changhong Meiling Xinhua Technology Co ltd
Original Assignee
Sichuan Hongmei Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sichuan Hongmei Intelligent Technology Co Ltd filed Critical Sichuan Hongmei Intelligent Technology Co Ltd
Priority to CN202011472759.1A priority Critical patent/CN112581959B/en
Publication of CN112581959A publication Critical patent/CN112581959A/en
Application granted granted Critical
Publication of CN112581959B publication Critical patent/CN112581959B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/02Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Selective Calling Equipment (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The invention provides an intelligent device control method, a system and a voice server, wherein the method is applied to the voice server and comprises the following steps: pre-constructing a mapping relation between at least one keyword set and at least one voice model, wherein each keyword set corresponds to a different voice instruction type; receiving a voice signal sent by external intelligent equipment; processing the voice signal to obtain text information; determining at least one text to be identified according to at least one keyword in the text information; respectively determining a voice model of each text to be recognized according to at least one keyword set and the mapping relation; determining control information of text information according to each text to be identified and the voice model of each text to be identified; and sending the control information to the intelligent device so that the intelligent device executes control actions according to the control information. The scheme can improve the accuracy of voice control.

Description

Intelligent equipment control method, system and voice server
Technical Field
The invention relates to the technical field of intelligent equipment, in particular to an intelligent equipment control method, an intelligent equipment control system and a voice server.
Background
With the development of the Internet of Things, intelligent home appliances are becoming more convenient, intelligent and user-friendly, and a growing number of them offer a voice interaction function through which they can be controlled.
For example, Chinese patent application No. 201610384412.9 discloses a voice control system in which the main control module of an intelligent refrigerator determines a control instruction according to a voice signal transmitted by a voice module and a pre-stored password set, thereby realizing voice control of the intelligent refrigerator.
In the prior art, the voice models used for voice recognition and synthesis are stored locally; that is, intelligent home appliances are voice-controlled by means of offline voice recognition. Because the number of locally stored voice models is limited, the accuracy of voice recognition and synthesis is low, and so is the accuracy of voice control.
Disclosure of Invention
The invention provides an intelligent device control method, an intelligent device control system and a voice server, which can improve the accuracy of voice control.
In a first aspect, an embodiment of the present invention provides a method for controlling an intelligent device, which is applied to a voice server, including:
pre-constructing a mapping relation between at least one keyword set and at least one voice model, wherein each keyword set corresponds to a different voice instruction type;
and further comprising:
receiving a voice signal sent by external intelligent equipment;
processing the voice signal to obtain text information;
determining at least one text to be identified according to at least one keyword in the text information, wherein the keyword is used for describing instruction information in the text to be identified, and each text to be identified corresponds to a different voice instruction type;
respectively determining a voice model of each text to be recognized according to the at least one keyword set and the mapping relation;
determining control information of the text information according to each text to be recognized and a voice model of each text to be recognized;
and sending the control information to the intelligent equipment so that the intelligent equipment executes control actions according to the control information.
In one possible design, the determining at least one text to be recognized according to at least one keyword in the text information includes:
acquiring at least one keyword in the text information according to a preset keyword dictionary;
determining at least one target keyword set corresponding to each keyword from the at least one keyword set;
and dividing the text information into the at least one text to be recognized according to the at least one target keyword set.
In one possible design, the voice instruction types include: at least one of voice interaction, control commands, and device management;
when the voice instruction type corresponding to the text to be recognized comprises voice interaction or control commands,
the determining the control information of the text information according to each text to be recognized and the voice model of each text to be recognized comprises the following steps:
for each text to be recognized, executing:
determining at least one target keyword corresponding to the text to be recognized currently from the at least one keyword;
determining at least one target voice model corresponding to the at least one target keyword from the mapping relation;
determining target control information of the current text to be recognized according to the at least one target voice model;
and taking the target control information of each text to be identified as the control information of the text information.
In one possible design, when the voice command type corresponding to the text to be recognized includes device management,
after determining the at least one target voice model corresponding to the at least one target keyword from the mapping relation, and before determining the control information of the current text to be recognized according to the at least one target voice model, the method further comprises:
sending a query request carrying the at least one target keyword to an external interface server;
receiving query information returned by the interface server according to the at least one target keyword in the query request;
the determining the control information of the current text to be recognized according to the at least one target voice model comprises the following steps:
and generating target control information of the text to be recognized currently according to the query information and the at least one target voice model.
In one possible design, after the sending the control information to the smart device to cause the smart device to perform a control action according to the control information, the method further includes:
receiving a feedback text returned by the intelligent equipment after the intelligent equipment executes control actions according to the control information;
generating feedback voice information corresponding to the feedback text according to a preset voice feedback model;
and sending the feedback voice information to the intelligent equipment so that the intelligent equipment plays the feedback voice information.
In a second aspect, an embodiment of the present invention further provides a voice server based on the first aspect or any one of the possible implementations of the first aspect, the voice server comprising: a construction module, a receiving module, an acquisition module, a first determining module, a second determining module, a third determining module and a sending module;
the construction module is used for pre-constructing a mapping relation between at least one keyword set and at least one voice model, wherein each keyword set corresponds to a different voice instruction type;
the receiving module is used for receiving voice signals sent by external intelligent equipment;
the acquisition module is used for processing the voice signal received by the receiving module to acquire text information;
the first determining module is configured to determine at least one text to be identified according to at least one keyword in the text information acquired by the acquiring module, where the keyword is used to describe instruction information in the text to be identified, and each text to be identified corresponds to a different voice instruction type;
the second determining module is configured to determine, according to the at least one keyword set and the mapping relation constructed by the construction module, the target voice model of each text to be recognized determined by the first determining module;
the third determining module is configured to determine control information of the text information according to each text to be recognized determined by the first determining module and the target voice model of each text to be recognized determined by the second determining module;
the sending module is configured to send the control information determined by the third determining module to the intelligent device, so that the intelligent device performs a control action according to the control information.
In one possible design of the device,
the first determining module is specifically configured to perform the following processing:
acquiring at least one keyword in the text information according to a preset keyword dictionary;
determining at least one target keyword set corresponding to each keyword from the at least one keyword set;
and dividing the text information into the at least one text to be identified according to the at least one target keyword set.
In one possible design, the voice instruction types include at least one of voice interaction, control commands and device management; when the voice instruction type corresponding to the text to be recognized determined by the first determining module includes voice interaction or a control command, the third determining module is specifically configured to perform the following processing:
for each text to be recognized, executing:
determining at least one target keyword corresponding to the text to be recognized currently from the at least one keyword;
determining at least one target voice model corresponding to the at least one target keyword from the mapping relation;
determining target control information of the current text to be recognized according to the at least one target voice model;
and taking the target control information of each text to be identified as the control information of the text information.
In one possible design of the device,
the sending module is further configured to send, to an external interface server, a query request carrying the at least one target keyword determined by the third determining module when the voice command type corresponding to the text to be identified determined by the first determining module includes device management;
the receiving module is further configured to receive query information returned by the interface server according to the at least one target keyword in the query request sent by the sending module;
the third determining module is further configured to generate target control information of the text to be recognized according to the query information received by the receiving module and the at least one target voice model.
In a third aspect, an embodiment of the present invention further provides an intelligent device control system, including: the voice server provided in the second aspect or any one of the possible implementations of the second aspect, and at least one smart device;
the intelligent equipment is used for collecting voice signals, sending the voice signals to the voice server, receiving control information from the voice server, and executing control actions according to the control information.
According to the above technical scheme, a mapping relation between keyword sets and voice models is constructed in advance. After a voice signal collected by the intelligent equipment is received, text information is obtained from it and divided, according to its keywords, into at least one text to be recognized, each corresponding to one voice instruction type. The voice model of each text to be recognized is then determined, the control information corresponding to the text information is derived accordingly, and the control information is finally sent to the intelligent equipment, thereby realizing voice control of the intelligent equipment. Because the voice server converts the collected voice signal into control information using the pre-constructed voice models and mapping relation, the intelligent equipment does not need to perform offline voice recognition on the signal itself, and the accuracy of voice control is improved.
Drawings
To illustrate the embodiments of the present invention or the technical solutions in the prior art more clearly, the drawings required in the description of the embodiments or of the prior art are briefly introduced below. The drawings described below show some embodiments of the present invention; a person skilled in the art may obtain other drawings from them without inventive effort.
FIG. 1 is a flow chart of a method for controlling an intelligent device according to an embodiment of the present invention;
FIG. 2 is a flow chart of another smart device control method provided by an embodiment of the present invention;
FIG. 3 is a schematic diagram of a voice server according to one embodiment of the present invention;
FIG. 4 is a schematic diagram of a smart device control system according to an embodiment of the present invention.
Detailed Description
To make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions of the embodiments are described clearly and completely below with reference to the accompanying drawings. The described embodiments are some, but not all, embodiments of the present invention; all other embodiments obtained by those skilled in the art from these embodiments without inventive effort fall within the scope of protection of the present invention.
As shown in fig. 1, an embodiment of the present invention provides an intelligent device control method, which is applied to a voice server, and the method may include the following steps:
step 101: pre-constructing a mapping relation between at least one keyword set and at least one voice model, wherein each keyword set corresponds to a different voice instruction type;
step 102: receiving a voice signal sent by external intelligent equipment;
step 103: processing the voice signal to obtain text information;
step 104: determining at least one text to be identified according to at least one keyword in the text information, wherein the keyword is used for describing instruction information in the text to be identified, and each text to be identified corresponds to a different voice instruction type;
step 105: respectively determining a voice model of each text to be recognized according to at least one keyword set and the mapping relation;
step 106: determining control information of text information according to each text to be identified and the voice model of each text to be identified;
step 107: and sending the control information to the intelligent device so that the intelligent device executes control actions according to the control information.
In the embodiment of the invention, a mapping relation between a plurality of keyword sets and a plurality of voice models is constructed in advance. After a voice signal collected by the intelligent equipment is received, text information is obtained from it and divided, according to its keywords, into at least one text to be recognized, each corresponding to one voice instruction type. The voice model of each text to be recognized is then determined, the control information corresponding to the text information is derived accordingly, and the control information is finally sent to the intelligent equipment, thereby realizing voice control of the intelligent equipment. Because the voice server converts the collected voice signal into control information using the pre-constructed voice models and mapping relation, the intelligent equipment does not need to perform offline voice recognition on the signal itself, and the accuracy of voice control is improved.
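The flow of steps 102 to 107 can be sketched in Python as a small pipeline. This is an illustrative sketch only, not the patented implementation: the function `handle_voice_signal`, its parameters and the stand-in callables in the usage lines are hypothetical names introduced here, a "voice model" is reduced to a plain function from text to control information, and real speech recognition is replaced by a stub. The `models` dictionary plays the role of the mapping pre-constructed in step 101.

```python
from typing import Callable, Dict, List, Tuple

# All names below are hypothetical; the patent does not prescribe an API.
Segment = Tuple[str, str]        # (voice instruction type, text to be recognized)
Model = Callable[[str], dict]    # a "voice model" reduced to: text -> control info


def handle_voice_signal(
    signal: bytes,
    transcribe: Callable[[bytes], str],
    split_by_type: Callable[[str], List[Segment]],
    models: Dict[str, Model],
    send: Callable[[List[dict]], None],
) -> List[dict]:
    text = transcribe(signal)                   # step 103: voice signal -> text
    segments = split_by_type(text)              # step 104: texts to be recognized
    control_info = []
    for instruction_type, segment in segments:
        model = models[instruction_type]        # step 105: look up the voice model
        control_info.append(model(segment))     # step 106: build control info
    send(control_info)                          # step 107: send to the device
    return control_info


# Usage with stand-in components (assumptions, for illustration only).
sent: List[List[dict]] = []
result = handle_voice_signal(
    b"set refrigerator temperature at 5",
    transcribe=lambda raw: raw.decode("utf-8"),
    split_by_type=lambda text: [("control_command", text)],
    models={"control_command": lambda seg: {"type": "control_command", "text": seg}},
    send=sent.append,
)
```

Passing the recognition, segmentation and sending stages in as callables keeps the sketch independent of any particular speech engine or transport.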
It should be appreciated that voice control based on offline voice recognition generally relies on a local processing unit of the smart device both to collect the voice signal and to recognize it, so the cost of voice control is high. Moreover, the voice models used for offline recognition are stored locally and, being constrained by local resources, are limited in number. On the one hand, this leads to poor recognition accuracy, hence inaccurate voice control instructions and poor voice control accuracy; on the other hand, the voice models are single-purpose and hard to extend, which degrades the user experience.
In the embodiment of the invention, the intelligent device only collects the voice signal input by the user, and the voice server performs voice recognition on that signal to generate the corresponding control information. This lowers the performance required of the device's local processing chip and thus the cost of voice control. In addition, subsequent updates only need to change the content on the voice server; nothing stored locally on the intelligent refrigerator needs to change, which makes updates easier.
In one embodiment of the present invention, based on the smart device control method shown in fig. 1, step 104 may specifically include the following steps:
acquiring at least one keyword in the text information according to a preset keyword dictionary;
determining at least one target keyword set corresponding to each keyword from at least one keyword set;
the text information is segmented into at least one text to be identified according to the at least one target keyword set.
In the embodiment of the invention, one or more keywords in the text information are acquired, a target keyword set corresponding to each keyword is determined from a plurality of preset keyword sets, and the text information is divided into at least one text to be identified according to the target keyword set, so that each text to be identified corresponds to a different voice instruction type. According to the method, the text information is divided into one or more texts to be recognized according to different voice command types, and voice recognition can be performed on each text to be recognized more accurately and efficiently, so that the accuracy of voice control commands is improved, the accuracy of voice control is improved, and meanwhile, the generation efficiency of follow-up control information is improved.
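The first two sub-steps of step 104 (dictionary-based keyword extraction, then finding the keyword sets each keyword belongs to) might look like the following Python sketch. The keyword sets reuse the example words given later in the description; the type labels, the function names, the English tokenization, and the choice to treat the keyword dictionary as the union of all keyword sets are assumptions made here for illustration.

```python
import re
from collections import defaultdict
from typing import Dict, List, Set

# Example keyword sets taken from the description; the labels are hypothetical.
KEYWORD_SETS: Dict[str, Set[str]] = {
    "voice_interaction": {"query", "weather"},
    "control_command": {"set", "temperature", "refrigerator"},
    "device_management": {"temperature", "refrigerator", "query"},
}

# Assumption: the preset keyword dictionary is the union of all keyword sets.
KEYWORD_DICTIONARY: Set[str] = set().union(*KEYWORD_SETS.values())


def extract_keywords(text: str) -> List[str]:
    """Return dictionary keywords in order of first appearance."""
    found: List[str] = []
    for token in re.findall(r"[a-z]+", text.lower()):
        if token in KEYWORD_DICTIONARY and token not in found:
            found.append(token)
    return found


def candidate_sets(keywords: List[str]) -> Dict[str, List[str]]:
    """Map each keyword to the names of the keyword sets that contain it."""
    index = defaultdict(set)
    for name, words in KEYWORD_SETS.items():
        for word in words:
            index[word].add(name)
    return {keyword: sorted(index[keyword]) for keyword in keywords}


keywords = extract_keywords("Set the refrigerator temperature to 5 degrees")
candidates = candidate_sets(keywords)
```

A keyword such as "temperature" belongs to more than one set, which is why the text is split only after all keywords of a segment are considered together.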
In one embodiment of the present invention, based on the smart device control method shown in fig. 1, the voice instruction types include at least one of voice interaction, control commands and device management; when the voice instruction type corresponding to the text to be recognized includes voice interaction or a control command, step 106 may specifically include the following steps:
for each text to be recognized, performing:
step S1: determining at least one target keyword corresponding to the text to be recognized currently from the at least one keyword;
step S2: determining at least one target voice model corresponding to the at least one target keyword from the mapping relation;
step S3: determining target control information of a current text to be recognized according to at least one target voice model;
and taking the target control information of each text to be identified as the control information of the text information.
In the embodiment of the invention, when the voice instruction type corresponding to the text to be recognized comprises voice interaction or control command, a target voice model corresponding to each keyword in the current text to be recognized is determined according to a pre-constructed mapping relation, so that target control information of the current text to be recognized is determined, and a plurality of target control information are used as control information of text information. According to the method, the matched target voice model can be accurately and rapidly locked according to the mapping relation and the keywords of each text to be identified, so that the accuracy and the efficiency of generating control information can be improved, and the experience of a user is further improved.
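Steps S1 to S3 can be sketched as a lookup followed by a merge. The sketch below is hypothetical throughout: the patent does not say what a voice model computes, so each model here is a small Python function that contributes some fields of the control information, and `MAPPING`, the model functions and the output field names are invented for illustration.

```python
import re
from typing import Callable, Dict, List

Model = Callable[[str], dict]


# Hypothetical "voice models": each contributes part of the control information.
def action_model(text: str) -> dict:
    return {"action": "set"}


def device_model(text: str) -> dict:
    return {"device": "refrigerator"}


def temperature_model(text: str) -> dict:
    match = re.search(r"-?\d+", text)
    return {"parameter": "temperature", "value": int(match.group()) if match else None}


# Hypothetical mapping relation from keyword to voice model.
MAPPING: Dict[str, Model] = {
    "set": action_model,
    "refrigerator": device_model,
    "temperature": temperature_model,
}


def target_control_info(text: str, keywords: List[str]) -> dict:
    targets = [k for k in keywords if k in text.lower()]     # S1: target keywords
    models = [MAPPING[k] for k in targets if k in MAPPING]   # S2: target voice models
    info: dict = {}
    for model in models:                                     # S3: target control info
        info.update(model(text))
    return info


def control_info(texts: List[str], keywords: List[str]) -> List[dict]:
    # The control information of the whole text information is the collection
    # of the target control information of each text to be recognized.
    return [target_control_info(text, keywords) for text in texts]


result = control_info(
    ["set refrigerator temperature at 5"],
    ["set", "refrigerator", "temperature"],
)
```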
In one embodiment of the present invention, when the voice command type corresponding to the text to be recognized includes device management, after step S2 and before step S3, the following steps may be further included:
sending a query request carrying at least one target keyword to an external interface server;
receiving query information returned by the interface server according to at least one target keyword in the query request;
the step S3 specifically comprises the following steps: and generating target control information of the text to be recognized currently according to the query information and at least one target voice model.
In the embodiment of the invention, when the voice instruction type corresponding to the text to be recognized comprises equipment management, after determining the target voice model corresponding to each keyword in the current text to be recognized, a query request is sent to an external interface server, query information returned by the interface server according to at least one target keyword in the query request is received, then target control information of the current text to be recognized is determined according to the query information and the target voice model, and further control information of the text information is determined. According to the method, the complex voice interaction can be realized by using the external interface server, and richer voice interaction content is provided, so that the experience of a user is further improved.
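For the device management branch, the extra query to the interface server sits between model lookup and control information generation. The following Python sketch shows that ordering only. The interface server is replaced by a local stub, and the request format, the returned field `refrigerator_temperature_c`, its value and all function names are made up for illustration; the source does not specify any of them.

```python
from typing import Callable, Dict, List

QueryServer = Callable[[dict], dict]
Model = Callable[[str, dict], dict]


def stub_interface_server(request: dict) -> dict:
    """Stand-in for the external interface server; the reply is invented."""
    if {"refrigerator", "temperature"} <= set(request["keywords"]):
        return {"refrigerator_temperature_c": 4}
    return {}


def report_model(text: str, query_info: dict) -> dict:
    """Hypothetical voice model that wraps the query result in a report action."""
    return {"action": "report", "data": query_info}


def device_management_control(
    text: str,
    target_keywords: List[str],
    models: List[Model],
    query_server: QueryServer,
) -> dict:
    # Send a query request carrying the target keywords.
    request = {"keywords": list(target_keywords)}
    # Receive the query information returned for those keywords.
    query_info = query_server(request)
    # Generate target control info from the query info and the target models.
    info: dict = {}
    for model in models:
        info.update(model(text, query_info))
    return info


info = device_management_control(
    "query the current temperature of the refrigerator",
    ["query", "refrigerator", "temperature"],
    [report_model],
    stub_interface_server,
)
```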
In one embodiment of the present invention, based on the smart device control method shown in fig. 1, after step 107, the method may further include the following steps:
receiving a feedback text returned after the intelligent equipment executes the control action according to the control information;
generating feedback voice information corresponding to the feedback text according to a preset voice feedback model;
and sending the feedback voice information to the intelligent equipment so that the intelligent equipment plays the feedback voice information.
In the embodiment of the invention, after the feedback text of the intelligent equipment is received, corresponding feedback voice information is generated according to the preset voice feedback model and is sent to the intelligent equipment for playing, so that richer voice interaction content can be provided, the flexibility of voice interaction is improved, and the use experience of a user is further improved.
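The feedback path is a short relay: feedback text in, feedback voice out. A minimal Python sketch, assuming the preset voice feedback model can be treated as a function from text to audio bytes; `relay_feedback` and `stub_feedback_model` are hypothetical names, and the stub merely encodes the text rather than synthesizing speech.

```python
from typing import Callable, List


def stub_feedback_model(text: str) -> bytes:
    """Stand-in for a real speech synthesis model: just tags and encodes the text."""
    return ("[speech] " + text).encode("utf-8")


def relay_feedback(
    feedback_text: str,
    feedback_model: Callable[[str], bytes],
    send_to_device: Callable[[bytes], None],
) -> bytes:
    voice = feedback_model(feedback_text)   # generate feedback voice information
    send_to_device(voice)                   # device plays it on receipt
    return voice


played: List[bytes] = []
voice = relay_feedback("Temperature set to 5 degrees", stub_feedback_model, played.append)
```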
In order to more clearly illustrate the technical solution of the present invention, the following describes in detail the method for controlling an intelligent device provided in the embodiment of the present invention, as shown in fig. 2, the method may include the following steps:
step 201: a mapping relation between at least one keyword set and at least one voice model is pre-constructed, wherein each keyword set corresponds to a different voice instruction type.
In this step, the voice command types may include, but are not limited to, voice interactions, control commands and device management, wherein the voice interactions may be, for example, inquiring weather, listening to songs, listening to radio stations, etc.; the control command may be, for example, information for controlling the operation state of the smart device; the device management may be, for example, after-sales management of the smart device, operational status parameters of the smart device, and the like.
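One way to represent the mapping relation of step 201 is sketched below in Python. The three instruction types come from the text; everything else (the class and function names, representing voice models by string identifiers, and the example keywords and model names) is an assumption made for illustration. The check in `build_mapping` enforces the stated constraint that each keyword set corresponds to a different voice instruction type.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Dict, FrozenSet, Iterable, List


class InstructionType(Enum):
    VOICE_INTERACTION = "voice interaction"
    CONTROL_COMMAND = "control command"
    DEVICE_MANAGEMENT = "device management"


@dataclass(frozen=True)
class KeywordSet:
    instruction_type: InstructionType
    keywords: FrozenSet[str]


def build_mapping(
    keyword_sets: Iterable[KeywordSet],
    models_by_type: Dict[InstructionType, List[str]],
) -> Dict[KeywordSet, List[str]]:
    """Map each keyword set to the identifiers of its voice models."""
    mapping: Dict[KeywordSet, List[str]] = {}
    seen = set()
    for keyword_set in keyword_sets:
        if keyword_set.instruction_type in seen:
            raise ValueError("each keyword set must have a different instruction type")
        seen.add(keyword_set.instruction_type)
        mapping[keyword_set] = models_by_type[keyword_set.instruction_type]
    return mapping


# Illustrative data; the model identifiers are hypothetical.
weather_set = KeywordSet(InstructionType.VOICE_INTERACTION, frozenset({"query", "weather"}))
control_set = KeywordSet(
    InstructionType.CONTROL_COMMAND, frozenset({"set", "temperature", "refrigerator"})
)
models = {
    InstructionType.VOICE_INTERACTION: ["weather_model"],
    InstructionType.CONTROL_COMMAND: ["temperature_model"],
}
mapping = build_mapping([weather_set, control_set], models)
```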
Step 202: and receiving voice signals sent by external intelligent equipment.
Specifically, the intelligent device starts a voice acquisition function according to a preset voice wake-up command word, and acquires a voice signal input by a user after waking up. In one embodiment of the invention, the smart device may also perform simple processing (e.g., voice noise reduction, etc.) on the collected voice signal.
Step 203: and processing the voice signal to obtain text information.
Step 204: and determining at least one text to be identified according to at least one keyword in the text information, wherein the keyword is used for describing instruction information in the text to be identified, and each text to be identified corresponds to a different voice instruction type.
Specifically, at least one keyword in the text information is obtained according to a preset keyword dictionary, at least one target keyword set corresponding to each keyword is determined from at least one keyword set, and the text information is segmented into at least one text to be recognized according to the at least one target keyword set.
For example, the keyword set a corresponding to the voice interaction type includes "query" and "weather", the keyword set B corresponding to the control command type includes "set", "temperature" and "refrigerator", and the keyword set C corresponding to the device management type includes "temperature", "refrigerator" and "query".
When the text information is "query weather and query the current temperature of the refrigerator", the keywords are "query", "weather", "refrigerator" and "temperature". From these keywords the target keyword sets are determined to be keyword set A and keyword set C, and the text information is divided into two texts to be recognized: "query weather" and "query the current temperature of the refrigerator".
When the text information is "set refrigerator temperature at 5 ℃", the keywords are "set", "refrigerator" and "temperature", the target keyword set is determined to be keyword set B, and the text to be recognized is "set refrigerator temperature at 5 ℃".
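The two examples above can be reproduced with a short Python sketch. The source does not state how the split points or the winning keyword set are chosen, so two assumptions are made here: the text is split on the word "and", and each clause is assigned to the keyword set that shares the most keywords with it (ties go to whichever set is listed first). The function name `split_text` is hypothetical.

```python
import re
from typing import Dict, List, Set, Tuple

# Keyword sets A, B and C from the example in the description.
KEYWORD_SETS: Dict[str, Set[str]] = {
    "A": {"query", "weather"},                    # voice interaction
    "B": {"set", "temperature", "refrigerator"},  # control command
    "C": {"temperature", "refrigerator", "query"},  # device management
}


def split_text(text: str) -> List[Tuple[str, str]]:
    """Return (keyword set name, text to be recognized) pairs."""
    segments: List[Tuple[str, str]] = []
    # Assumption: clauses are separated by the conjunction "and".
    for clause in re.split(r"\band\b", text, flags=re.IGNORECASE):
        clause = clause.strip()
        tokens = set(re.findall(r"[a-z]+", clause.lower()))
        # Assumption: pick the set with the largest keyword overlap.
        overlap = {name: len(tokens & words) for name, words in KEYWORD_SETS.items()}
        best = max(overlap, key=overlap.get)
        if overlap[best] > 0:
            segments.append((best, clause))
    return segments


two_part = split_text("query weather and query the current temperature of the refrigerator")
one_part = split_text("set refrigerator temperature at 5 ℃")
```

Under these assumptions the first sentence yields one clause for set A and one for set C, and the second yields a single clause for set B, matching the description. A real system would need a more robust segmentation rule than splitting on a single conjunction.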
Step 205: and respectively determining a voice model of each text to be recognized according to at least one keyword set and the mapping relation.
Step 206: a target speech model for each text to be identified is determined.
In this step, for each text to be recognized: and determining a target keyword corresponding to the text to be recognized currently from at least one keyword, and determining a target voice model corresponding to each target keyword from voice models corresponding to the current voice model according to the mapping relation and the target keyword.
In this step, following the previous example, the text to be identified "query" and "weather" in "query weather" are both target keywords.
For another example, the text to be recognized "query the current temperature of the refrigerator" is the target keyword.
Step 207: target control information of each text to be recognized is determined.
In the step, when the voice control type corresponding to the text to be recognized comprises voice interaction or control command, determining target control information of the text to be recognized currently according to a plurality of target voice models; when the voice control type corresponding to the text to be recognized comprises equipment management, sending a query request carrying at least one target keyword to an external interface server, receiving query information returned by the interface server according to the at least one target keyword in the query request, and generating target control information of the text to be recognized currently according to the query information and a plurality of target voice models.
For example, when the text to be identified is "query weather", the target control information of the text to be identified is output according to the voice model corresponding to the target keyword.
For another example, when the text to be identified is "query the current temperature of the refrigerator", a query request carrying the target keywords "query", "refrigerator" and "temperature" is sent to the external interface server, the query information returned by the interface server is received, and the target control information of the text to be identified is generated according to the query information and the plurality of target voice models.
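The two branches of step 207 can be sketched in Python as follows. The type labels (`"voice_interaction"`, `"control_command"`, `"device_management"`), the function names, the dictionary layout of the control information and the canned value returned by the stub interface server are all assumptions made for illustration.

```python
def query_interface_server(target_keywords):
    """Stand-in for the external interface server; returns canned query information."""
    # Placeholder value only; a real server would return stored device state.
    if {"refrigerator", "temperature"} <= set(target_keywords):
        return {"refrigerator_temperature_c": 4}
    return {}


def target_control_info(instruction_type, target_keywords, target_models,
                        query=query_interface_server):
    """Build the target control information for one text to be recognized."""
    info = {
        "type": instruction_type,
        "models": sorted(target_models.values()),
    }
    if instruction_type == "device_management":
        # Device management: query the interface server with the target keywords
        # and combine the returned information with the target voice models.
        info["query_info"] = query(target_keywords)
    elif instruction_type not in ("voice_interaction", "control_command"):
        raise ValueError(f"unknown instruction type: {instruction_type}")
    return info
```

Passing the query function as a parameter keeps the sketch testable without a network connection; only the device management branch calls it.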
Step 208: determining control information of the text information and sending the control information to the intelligent equipment, so that the intelligent equipment executes control actions according to the control information.
In this step, target control information of each text to be recognized is used as control information of the text information.
Step 209: receiving feedback text returned after the intelligent equipment executes the control action according to the control information.
For example, when the control information sent to the intelligent refrigerator is to adjust the temperature to 5 ℃, the feedback text returned by the intelligent refrigerator after adjusting the temperature to 5 ℃ according to the control information is received.
Step 210: generating feedback voice information corresponding to the feedback text according to a preset voice feedback model.
Step 211: sending the feedback voice information to the intelligent equipment so that the intelligent equipment plays the feedback voice information.
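Steps 208 to 211 form a round trip between the voice server and the intelligent equipment, which can be sketched in Python as follows. `FakeDevice`, `feedback_voice`, the `set_temperature_c` field and the byte-string audio are hypothetical stand-ins; in particular, the preset voice feedback model is replaced here by a trivial placeholder.

```python
class FakeDevice:
    """Hypothetical stand-in for an intelligent refrigerator."""

    def __init__(self):
        self.temperature_c = None
        self.played = []

    def execute(self, control_info):
        # Perform the control action, then return the feedback text (step 209).
        self.temperature_c = control_info["set_temperature_c"]
        return f"Temperature set to {self.temperature_c} degrees Celsius"

    def play(self, audio):
        # Play the feedback voice information (end of step 211).
        self.played.append(audio)


def feedback_voice(text):
    """Placeholder for the preset voice feedback model (step 210)."""
    return b"AUDIO:" + text.encode("utf-8")


def control_round_trip(device, control_info):
    feedback_text = device.execute(control_info)  # steps 208 and 209
    audio = feedback_voice(feedback_text)         # step 210
    device.play(audio)                            # step 211
    return feedback_text
```

For the 5 ℃ example, the device stores the new temperature, returns a feedback text, and then receives the generated audio for playback.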
As shown in fig. 3, an embodiment of the present invention provides a voice server based on the smart device control method provided in any one of the foregoing embodiments, including: a construction module 301, a receiving module 302, an acquisition module 303, a first determining module 304, a second determining module 305, a third determining module 306 and a sending module 307;
a construction module 301, configured to pre-construct a mapping relationship between at least one keyword set and at least one voice model, where each keyword set corresponds to a different voice command type;
a receiving module 302, configured to receive a voice signal sent by an external smart device;
an acquisition module 303, configured to process the voice signal received by the receiving module 302 and obtain text information;
a first determining module 304, configured to determine at least one text to be identified according to at least one keyword in the text information acquired by the acquisition module 303, where the keyword is used to describe instruction information in the text to be identified, and each text to be identified corresponds to a different voice instruction type;
a second determining module 305, configured to determine, according to the at least one keyword set and the mapping relationship constructed by the construction module 301, the target voice model of each text to be recognized determined by the first determining module 304;
a third determining module 306, configured to determine control information of the text information according to each text to be recognized determined by the first determining module 304 and the target voice model of each text to be recognized determined by the second determining module 305;
and a sending module 307, configured to send the control information determined by the third determining module 306 to the intelligent device, so that the intelligent device performs a control action according to the control information.
In one embodiment of the present invention,
the first determining module 304 is specifically configured to perform the following processing:
acquiring at least one keyword in the text information according to a preset keyword dictionary;
determining at least one target keyword set corresponding to each keyword from at least one keyword set;
the text information is segmented into at least one text to be identified according to the at least one target keyword set.
In one embodiment of the present invention, the voice instruction type includes at least one of voice interaction, control command and device management; when the voice instruction type corresponding to the text to be recognized determined by the first determining module 304 includes voice interaction or a control command, the third determining module 306 is specifically configured to execute the following processing:
For each text to be recognized, performing:
determining at least one target keyword corresponding to the text to be recognized currently from the at least one keyword;
determining at least one target voice model corresponding to the at least one target keyword from the mapping relation;
determining target control information of a current text to be recognized according to at least one target voice model;
and taking the target control information of each text to be identified as the control information of the text information.
In one embodiment of the present invention,
the sending module 307 is further configured to send, to an external interface server, a query request carrying at least one target keyword determined by the third determining module 306 when the voice command type corresponding to the text to be identified determined by the first determining module 304 includes device management;
the receiving module 302 is further configured to receive query information returned by the interface server according to at least one target keyword in the query request sent by the sending module 307;
the third determining module 306 is further configured to generate target control information of the text to be recognized according to the query information received by the receiving module 302 and the at least one target voice model.
In one embodiment of the present invention,
The receiving module 302 is further configured to receive a feedback text returned after the intelligent device performs a control action according to the control information sent by the sending module 307, and generate feedback voice information corresponding to the feedback text according to a preset voice feedback model;
the sending module 307 is further configured to send the feedback voice information generated by the receiving module 302 to the intelligent device, so that the intelligent device plays the feedback voice information.
As shown in fig. 4, an embodiment of the present invention provides an intelligent device control system, including: the voice server 401 provided in any of the embodiments described above and at least one smart device 402;
the intelligent device 402 is configured to collect a voice signal, send the voice signal to the voice server 401, receive control information from the voice server 401, and perform a control action according to the control information.
In one embodiment of the present invention,
the intelligent device 402 is further configured to send a feedback text to the voice server 401 after performing a control action according to the control information;
the voice server 401 is further configured to generate feedback voice information corresponding to the feedback text according to a preset voice feedback model after receiving the feedback text, and send the feedback voice information to the intelligent device 402;
the intelligent device 402 is further configured to receive and play the feedback voice information.
In one embodiment of the present invention, based on the smart device control system shown in fig. 4, the smart device control system further comprises: an interface server;
the interface server is configured to receive a query request carrying at least one target keyword sent from the voice server 401, obtain query information according to the at least one target keyword in the query request, and send the query information to the voice server 401.
In one embodiment of the present invention,
the intelligent device 402 is further configured to send its own state parameters to the interface server when the running state changes;
the interface server is also configured to receive and store status parameters from the smart device 402.
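The interface server's two roles, storing state parameters reported by devices and answering keyword-based queries from the voice server, can be sketched in Python as follows. The class and method names, the `device_id` argument and the rule of matching stored parameter names against target keywords are assumptions; the embodiment does not specify how a device is identified or how keywords are matched.

```python
class InterfaceServer:
    """Hypothetical in-memory interface server."""

    def __init__(self):
        # device id -> latest reported state parameters
        self._state = {}

    def report_state(self, device_id, params):
        """Called by a device when its running state changes."""
        self._state.setdefault(device_id, {}).update(params)

    def query(self, device_id, target_keywords):
        """Return stored parameters whose names match a target keyword."""
        stored = self._state.get(device_id, {})
        return {k: v for k, v in stored.items() if k in target_keywords}
```

A refrigerator that reports `{"temperature": 5, "door": "closed"}` and is later queried with the target keywords "query", "refrigerator" and "temperature" yields only the temperature entry under this matching rule; an unknown device yields an empty result.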
Taking the smart refrigerator as an example, the smart refrigerator 402 includes a voice module, a WiFi module, and a control module. The voice module is used for starting a voice acquisition function after receiving a preset voice wake-up command word, acquiring the voice signal that follows the wake-up word and sending the voice signal to the WiFi module; the WiFi module is configured to send the received voice signal to the voice server 401 through a customized network communication protocol, receive control information from the voice server 401, send the control command to the control module when the control information includes a control command, and send the audio to the voice module when the control information includes audio; the control module is used for executing the corresponding control action when receiving the control command; the voice module is also used for playing the received audio.
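The routing performed by the WiFi module can be sketched in Python as follows, assuming the control information arrives as a dictionary with optional `command` and `audio` entries. The dictionary layout and the callables standing in for the control module and the voice module are hypothetical.

```python
def dispatch(control_info, control_module, voice_module):
    """Route control information as the WiFi module is described to do."""
    if "command" in control_info:
        # A control command goes to the control module for execution.
        control_module(control_info["command"])
    if "audio" in control_info:
        # Audio goes to the voice module for playback.
        voice_module(control_info["audio"])


# Usage: plain lists stand in for the two modules.
executed, played = [], []
dispatch({"command": "set_temperature:5", "audio": b"ok"},
         executed.append, played.append)
```

Control information that carries both entries reaches both modules; control information with neither entry is ignored.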
In a possible implementation, the WiFi module is further configured to send to the voice server 401, after the control module performs the control action according to the control information, a broadcast text to be broadcast by voice, to receive broadcast information from the voice server 401, and to send the broadcast information to the voice module; the voice module is also used for playing the received broadcast information.
In one possible implementation, refrigerator food management and after-sales services of the refrigerator, such as adding or deleting food items, submitting after-sales applications, querying after-sales progress and querying the warranty policy, may also be handled by voice control through the external interface server.
It should be noted that the structure illustrated in the embodiment of the present invention does not constitute a specific limitation on the voice server. In other embodiments of the invention, the voice server may include more or fewer components than shown, or certain components may be combined, or certain components may be split, or different arrangements of components. The illustrated components may be implemented in hardware, software, or a combination of software and hardware.
The information interaction between the modules of the above device and their execution processes are based on the same concept as the method embodiments of the present invention; for details, refer to the description of the method embodiments, which is not repeated here.
The embodiment of the invention also provides a voice server, which comprises: at least one memory and at least one processor;
the at least one memory for storing a machine readable program;
the at least one processor is configured to invoke the machine-readable program to execute the smart device control method according to any of the embodiments of the present invention.
Embodiments of the present invention also provide a computer readable medium storing instructions for causing a computer to perform a smart device control method as described herein. Specifically, a system or apparatus equipped with a storage medium may be provided, on which software program code realizing the functions of any of the above embodiments is stored, and a computer (or CPU or MPU) of the system or apparatus may be caused to read out and execute the program code stored in the storage medium.
In this case, the program code itself read from the storage medium may realize the functions of any of the above-described embodiments, and thus the program code and the storage medium storing the program code form part of the present invention.
Examples of the storage medium for providing the program code include a floppy disk, a hard disk, a magneto-optical disk, an optical disk (e.g., CD-ROM, CD-R, CD-RW, DVD-ROM, DVD-RAM, DVD-RW, DVD+RW), a magnetic tape, a nonvolatile memory card, and a ROM. Alternatively, the program code may be downloaded from a server computer via a communication network.
Further, it should be apparent that the functions of any of the above-described embodiments may be implemented not only by executing the program code read out by the computer, but also by causing an operating system or the like operating on the computer to perform part or all of the actual operations based on the instructions of the program code.
Further, it is understood that the program code read out from the storage medium may be written into a memory provided in an expansion board inserted into a computer or into a memory provided in an expansion module connected to the computer, and a CPU or the like mounted on the expansion board or the expansion module may then be caused to perform part or all of the actual operations based on instructions of the program code, thereby realizing the functions of any of the above embodiments.
It should be noted that not all the steps and modules in the above flowcharts and the system configuration diagrams are necessary, and some steps or modules may be omitted according to actual needs. The execution sequence of the steps is not fixed and can be adjusted as required. The system structure described in the above embodiments may be a physical structure or a logical structure, that is, some modules may be implemented by the same physical entity, or some modules may be implemented by multiple physical entities, or may be implemented jointly by some components in multiple independent devices.
In the above embodiments, a hardware module may be implemented mechanically or electrically. For example, a hardware module may include permanently dedicated circuitry or logic (e.g., a dedicated processor, FPGA, or ASIC) to perform the corresponding operations. A hardware module may also include programmable logic or circuitry (e.g., a general-purpose processor or other programmable processor) that is temporarily configured by software to perform the corresponding operations. The particular implementation (mechanical, permanently dedicated circuitry, or temporarily configured circuitry) may be determined based on cost and time considerations.
While the invention has been illustrated and described in detail in the drawings and in the preferred embodiments, the invention is not limited to the disclosed embodiments, and it will be appreciated by those skilled in the art that the technical means of the various embodiments described above may be combined to produce further embodiments of the invention, which are also within the scope of the invention.

Claims (6)

1. An intelligent equipment control method, characterized in that the method is applied to a voice server, and mapping relations between a plurality of keyword sets and a plurality of models are constructed in advance, wherein each keyword set corresponds to a different voice instruction type;
the method further comprises:
receiving a voice signal sent by external intelligent equipment;
processing the voice signal to obtain text information;
dividing the text information into a plurality of texts to be identified according to a plurality of keywords in the text information, wherein the keywords are used for describing instruction information in the texts to be identified, and each text to be identified corresponds to a different voice instruction type;
respectively determining a model of each text to be identified according to the keyword sets and the mapping relation;
determining control information of the text information according to each text to be identified and each model of the text to be identified;
the control information is sent to the intelligent equipment, so that the intelligent equipment executes control actions according to the control information;
the dividing the text information into a plurality of texts to be identified according to a plurality of keywords in the text information comprises the following steps:
acquiring a plurality of keywords in the text information according to a preset keyword dictionary;
determining at least one target keyword set corresponding to each keyword from the keyword sets;
Dividing the text information into a plurality of texts to be identified according to a plurality of target keyword sets;
the voice instruction types include: at least one of voice interaction, control commands, and device management;
when the voice instruction type corresponding to the text to be recognized comprises voice interaction or control commands,
the determining the control information of the text information according to each text to be identified and each model of the text to be identified comprises the following steps:
for each text to be recognized, executing:
determining at least one target keyword corresponding to the text to be recognized currently from the keywords;
determining at least one target model corresponding to the at least one target keyword from the mapping relation;
determining target control information of the text to be recognized currently according to the at least one target model;
and taking the target control information of each text to be identified as the control information of the text information.
2. The method of claim 1, wherein,
when the voice command type corresponding to the text to be recognized comprises equipment management,
after the determining at least one target model corresponding to the at least one target keyword from the mapping relation and before the determining the control information of the current text to be recognized according to the at least one target model, further comprises:
Sending a query request carrying the at least one target keyword to an external interface server;
receiving query information returned by the interface server according to the at least one target keyword in the query request;
the determining the control information of the text to be recognized according to the at least one target model comprises the following steps:
and generating target control information of the text to be recognized currently according to the query information and the at least one target model.
3. The method according to any one of claims 1-2, further comprising, after said sending said control information to said smart device to cause said smart device to perform a control action in accordance with said control information:
receiving a feedback text returned by the intelligent equipment after the intelligent equipment executes control actions according to the control information;
generating feedback voice information corresponding to the feedback text according to a preset voice feedback model;
and sending the feedback voice information to the intelligent equipment so that the intelligent equipment plays the feedback voice information.
4. A voice server based on the intelligent device control method of any one of claims 1 to 3, comprising: a construction module, a receiving module, an acquisition module, a first determination module, a second determination module, a third determination module and a sending module;
The construction module is used for pre-constructing mapping relations between a plurality of keyword sets and a plurality of models, wherein each keyword set corresponds to a different voice instruction type;
the receiving module is used for receiving voice signals sent by external intelligent equipment;
the acquisition module is used for processing the voice signal received by the receiving module to acquire text information;
the first determining module is configured to divide the text information into a plurality of texts to be identified according to a plurality of keywords in the text information acquired by the acquiring module, where the keywords are used to describe instruction information in the texts to be identified, and each text to be identified corresponds to a different voice instruction type;
the second determining module is used for respectively determining the target model of each text to be identified determined by the first determining module according to the plurality of keyword sets and the mapping relation constructed by the constructing module;
the third determining module is used for determining control information of the text information according to each text to be identified determined by the first determining module and the target model of each text to be identified determined by the second determining module;
The sending module is used for sending the control information determined by the third determining module to the intelligent equipment so that the intelligent equipment executes a control action according to the control information;
the first determining module is specifically configured to perform the following processing:
acquiring a plurality of keywords in the text information according to a preset keyword dictionary;
determining at least one target keyword set corresponding to each keyword from the keyword sets;
dividing the text information into a plurality of texts to be identified according to a plurality of target keyword sets;
the voice instruction type includes at least one of voice interaction, control command and device management; when the voice instruction type corresponding to the text to be recognized determined by the first determining module includes voice interaction or a control command, the third determining module is specifically configured to execute the following processing:
for each text to be recognized, executing:
determining at least one target keyword corresponding to the text to be recognized currently from the keywords;
determining at least one target model corresponding to the at least one target keyword from the mapping relation;
Determining target control information of the text to be recognized currently according to the at least one target model;
and taking the target control information of each text to be identified as the control information of the text information.
5. The voice server of claim 4, wherein,
the sending module is further configured to send, to an external interface server, a query request carrying the at least one target keyword determined by the third determining module when the voice command type corresponding to the text to be identified determined by the first determining module includes device management;
the receiving module is further used for receiving query information returned by the interface server according to the at least one target keyword in the query request sent by the sending module;
the third determining module is further configured to generate target control information of the text to be identified according to the query information received by the receiving module and the at least one target model.
6. An intelligent device control system, characterized by comprising: the voice server of any one of claims 4 to 5 and at least one smart device;
the intelligent equipment is used for collecting voice signals, sending the voice signals to the voice server, receiving control information from the voice server, and executing control actions according to the control information.
CN202011472759.1A 2020-12-15 2020-12-15 Intelligent equipment control method, system and voice server Active CN112581959B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011472759.1A CN112581959B (en) 2020-12-15 2020-12-15 Intelligent equipment control method, system and voice server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011472759.1A CN112581959B (en) 2020-12-15 2020-12-15 Intelligent equipment control method, system and voice server

Publications (2)

Publication Number Publication Date
CN112581959A CN112581959A (en) 2021-03-30
CN112581959B true CN112581959B (en) 2023-05-09

Family

ID=75135287

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011472759.1A Active CN112581959B (en) 2020-12-15 2020-12-15 Intelligent equipment control method, system and voice server

Country Status (1)

Country Link
CN (1) CN112581959B (en)

Also Published As

Publication number Publication date
CN112581959A (en) 2021-03-30

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20240428

Address after: No.2 Tongji West Road, Nantou Town, Zhongshan City, Guangdong Province

Patentee after: Changhong Meiling Xinhua Technology Co.,Ltd.

Country or region after: China

Address before: 621050 No. 303 Jiuzhou Road, Fucheng District, Mianyang, Sichuan.

Patentee before: SICHUAN HONGMEI INTELLIGENT TECHNOLOGY Co.,Ltd.

Country or region before: China
