CN113223510A - Refrigerator and equipment voice interaction method and computer readable storage medium thereof - Google Patents

Refrigerator and equipment voice interaction method and computer readable storage medium thereof

Info

Publication number
CN113223510A
Authority
CN
China
Prior art keywords
state
semantic
current
equipment
full
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010070740.8A
Other languages
Chinese (zh)
Other versions
CN113223510B (en)
Inventor
王广武
孔令磊
张景瑞
李嘉泓
李桂玺
孙兴凯
张宁
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qingdao Haier Refrigerator Co Ltd
Haier Smart Home Co Ltd
Original Assignee
Qingdao Haier Refrigerator Co Ltd
Haier Smart Home Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qingdao Haier Refrigerator Co Ltd and Haier Smart Home Co Ltd
Priority to CN202010070740.8A
Publication of CN113223510A
Application granted
Publication of CN113223510B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • F MECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
    • F25 REFRIGERATION OR COOLING; COMBINED HEATING AND REFRIGERATION SYSTEMS; HEAT PUMP SYSTEMS; MANUFACTURE OR STORAGE OF ICE; LIQUEFACTION SOLIDIFICATION OF GASES
    • F25D REFRIGERATORS; COLD ROOMS; ICE-BOXES; COOLING OR FREEZING APPARATUS NOT OTHERWISE PROVIDED FOR
    • F25D29/00 Arrangement or mounting of control or safety devices
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/1822 Parsing for meaning understanding
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00 Data switching networks
    • H04L12/28 Data switching networks characterised by path configuration, e.g. LAN [Local Area Networks] or WAN [Wide Area Networks]
    • H04L12/2803 Home automation networks
    • H04L12/2816 Controlling appliance services of a home automation network by calling their functionalities
    • H04L12/282 Controlling appliance services of a home automation network by calling their functionalities based on user interaction within the home
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223 Execution procedure of a spoken command

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Thermal Sciences (AREA)
  • Combustion & Propulsion (AREA)
  • Chemical & Material Sciences (AREA)
  • Mechanical Engineering (AREA)
  • Automation & Control Theory (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Safety Devices In Control Systems (AREA)

Abstract

The invention provides a device voice interaction method comprising the following steps: recognizing the audio signal of the current environment to obtain current voice information; performing semantic recognition on the current voice information to obtain a current semantic state; acquiring the current state of the device and, when the device is in a full-duplex state, judging whether the current semantic state is a preset semantic; and, if the current semantic state is not a preset semantic, not performing function control on the device. The method thus judges both the current state of the device and the semantic information: if the device is in the full-duplex state and the semantic state is not a preset semantic, the voice information is treated as irrelevant and the device performs no corresponding function control. This prevents the device, while in the full-duplex working state, from responding to noise and other irrelevant voice information, which would otherwise reduce its working efficiency.

Description

Refrigerator and equipment voice interaction method and computer readable storage medium thereof
Technical Field
The invention relates to a refrigerator, a voice interaction method of the refrigerator and a computer readable storage medium.
Background
As living standards continue to improve, people expect ever more intelligence from household appliances. In smart homes, and particularly with smart refrigerators, users often want to interact with the refrigerator by voice.
To use the voice interaction function, the user is usually required to first wake the device with a customized command word, which starts the device's voice recognition function so that subsequent instructions can be recognized. In full-duplex natural communication, the device does not close the interaction immediately after executing an instruction; instead, it waits for a period of time for a subsequent instruction and, if one arrives, continues to execute it. If no instruction arrives, the interaction is closed and the device waits for the user's next wake-up.
However, during full-duplex natural communication, other noises or other instructions may be present. A kitchen in particular is a noisy environment where other sounds are hard to avoid. These sounds interfere with the device's voice recognition and feedback and disrupt the interaction between the user and the device, resulting in a poor user experience.
Therefore, it is necessary to design a new refrigerator, a device voice interaction method thereof, and a computer readable storage medium.
Disclosure of Invention
In order to solve the above problem, the present invention provides a device voice interaction method, where the device voice interaction method includes:
identifying the audio signal of the current environment to obtain current voice information;
performing semantic recognition on the current voice information to obtain a current semantic state;
acquiring the current state of the equipment, and judging whether the current semantic state is a preset semantic or not under the condition that the equipment is in a full-duplex state;
and if the current semantic state is not the preset semantic, not performing function control on the equipment.
As a further improvement of the present invention, the step of "judging whether the current semantic state is the preset semantic state under the condition that the device is in the full duplex state" includes:
if the current semantic state is the preset semantic, judging whether the current semantic state is the common semantic;
and if the current semantic state is the common semantic, recognizing the voice information to obtain an operation instruction, and performing function control on the equipment according to the operation instruction.
As a further improvement of the present invention, the step of "judging whether the current semantic state is the preset semantic state under the condition that the device is in the full duplex state" includes:
if the current semantic state is the preset semantic, judging whether the current semantic state is the full duplex semantic;
if the current semantic state is full duplex semantic, judging whether the current semantic state is consistent with the full duplex state;
if they are consistent, recognizing the voice information to obtain an operation instruction, and performing function control on the equipment according to the operation instruction;
and if they are not consistent, not performing function control on the equipment.
As a further improvement of the present invention, in the case where the device is in the full-duplex state, if the operation instruction is not obtained within the time T, the full-duplex state is exited.
As a further improvement of the present invention, the step "not performing function control on the device" specifically includes:
and displaying the voice information on the screen in real time.
As a further improvement of the present invention, the step of "acquiring the current state of the device" includes:
under the condition that the equipment is not in a full duplex state, judging whether the current semantic state is a preset semantic or not;
and if the current semantic state is not the preset semantic, ignoring the current voice information and not performing function control on the equipment.
As a further improvement of the present invention, the step of "acquiring the current state of the device" includes:
under the condition that the equipment is not in a full duplex state, judging whether the current semantic state is a preset semantic or not;
if the current semantic state is not the preset semantic, recognizing the voice information and judging whether an operation instruction is obtained, and if the operation instruction is obtained, performing function control on the equipment according to the operation instruction.
As a further improvement of the present invention, the step of "acquiring the current state of the device" includes:
under the condition that the equipment is not in a full duplex state, judging whether the current semantic state is a preset semantic or not;
if the current semantic state is the preset semantic, judging whether the current semantic state is the common semantic;
and if the current semantic state is the common semantic, recognizing the voice information to obtain an operation instruction, and performing function control on the equipment according to the operation instruction.
As a further improvement of the present invention, the step of "acquiring the current state of the device" includes:
under the condition that the equipment is not in a full duplex state, judging whether the current semantic state is a preset semantic or not;
if the current semantic state is the preset semantic, judging whether the current semantic state is the full duplex semantic;
and if the current semantic state is full-duplex semantic, setting the current state of the equipment to be a full-duplex state, recognizing the voice information to obtain an operation instruction, and performing function control on the equipment according to the operation instruction.
As a further improvement of the present invention, the "full duplex state" includes: a food material management scene state, a recipe management scene state, an audio-visual scene state, a takeout scene state and a function adjustment scene state.
In order to solve the above problem, the present invention provides a refrigerator, which includes a memory and a processor, wherein the memory stores a computer program operable on the processor, and the processor executes the computer program to implement the steps in the voice interaction method of the device.
In order to solve the above problem, the present invention proposes a computer-readable storage medium, on which a computer program is stored, wherein the computer program, when executed by a processor, implements the steps in the device voice interaction method as described above.
The invention has the following beneficial effects: the current state of the equipment and the semantic information are both judged; when the equipment is in a full duplex state and the semantic state is not a preset semantic, the voice information is irrelevant information and the equipment does not perform the corresponding function control. This prevents the equipment, while in the full-duplex working state, from responding to noise and other irrelevant voice information, which would otherwise reduce the working efficiency of the equipment.
Drawings
Fig. 1 is a flow chart of a voice interaction method of the device according to the present invention.
Detailed Description
In order to make those skilled in the art better understand the technical solution of the present invention, the technical solution in the embodiment of the present invention will be clearly and completely described below with reference to the drawings in the embodiment of the present invention, and it is obvious that the described embodiment is only a part of the embodiment of the present invention, and not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
In addition, a refrigerator is used as the specific embodiment of the invention. The voice interaction method of the invention can of course also be applied to other electrical appliances or equipment, and all such applications fall within the protection scope of the invention.
As shown in fig. 1, the present invention provides a device voice interaction method, where the device voice interaction method includes:
identifying the audio signal of the current environment to obtain current voice information;
performing semantic recognition on the current voice information to obtain a current semantic state;
acquiring the current state of the equipment, and judging whether the current semantic state is a preset semantic or not under the condition that the equipment is in a full-duplex state;
and if the current semantic state is not the preset semantic, not performing function control on the equipment.
Therefore, in the present invention, the current state of the device and the semantic information are both judged. If the device is in the full duplex state and the semantic state is not the preset semantic, the voice information is irrelevant information and the device does not perform the corresponding function control. This prevents the device, while in the full-duplex working state, from responding to noise and other irrelevant voice information, which would otherwise reduce its working efficiency.
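The core judgment described above can be sketched as follows. This is a minimal illustration rather than the patented implementation: the names `Device`, `PRESET_SEMANTICS` and `should_block`, and the string labels for semantic states, are assumptions introduced here and do not appear in the patent.

```python
from dataclasses import dataclass

# Hypothetical labels for the two kinds of preset semantics.
PRESET_SEMANTICS = {"common", "full_duplex"}


@dataclass
class Device:
    # True while the device is in a full-duplex state.
    full_duplex: bool = False


def should_block(device: Device, semantic_state: str) -> bool:
    """Return True when function control must be withheld.

    Covers only the core step: in a full-duplex state, voice information
    whose semantic state is not a preset semantic is treated as irrelevant.
    """
    return device.full_duplex and semantic_state not in PRESET_SEMANTICS
```

The function deliberately says nothing about the case where the device is not in the full-duplex state, which the description treats separately.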
In a specific embodiment of the present invention, the device is a refrigerator, and the "full-duplex state" includes a food material management scene state, a recipe management scene state, an audio-visual scene state, a takeout scene state, a function adjustment scene state, and the like. That is, the refrigerator is preset with the scenes listed above. In each of them, the user and the refrigerator can interact in natural language in the full-duplex state: after one interaction is completed, the device does not exit the interaction but waits directly for the user's next voice information, so the user does not need to wake it up repeatedly. In the full-duplex working state, if the voice recognition device of the refrigerator recognizes voice information that does not belong to the preset semantics, the device is not controlled.
In the step of "not performing function control on the equipment", the voice information is displayed on the screen in real time. If the current semantic state is not the preset semantic, the device cannot recognize the semantic information and therefore cannot obtain an operation instruction; the voice information is nevertheless displayed on the screen in real time to tell the user that it cannot be recognized in the full-duplex state.
The preset semantics are common semantics or full duplex semantics. The full duplex semantics refers to semantics corresponding to the full duplex state. For example, the "full-duplex status" includes a food material management scene status, a recipe management scene status, an audio-visual scene status, a takeout scene status, a function adjustment scene status, and the like, and the "full-duplex semantics" includes a food material management scene semantic, a recipe management scene semantic, an audio-visual scene semantic, a takeout scene semantic, a function adjustment scene semantic, and the like. The common semantics refers to semantics that can be recognized by the device in all full duplex states, such as adjusting volume or tone semantics, adjusting screen brightness semantics, adjusting font or font size semantics, and the like.
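One possible way to model this taxonomy is shown below. It is a sketch under assumptions: the type names `Scene`, `SemanticKind` and `SemanticState` are hypothetical, and the patent does not prescribe any data structure.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Scene(Enum):
    """The full-duplex scene states listed in the description."""
    FOOD_MATERIAL = "food material management"
    RECIPE = "recipe management"
    AUDIO_VISUAL = "audio-visual"
    TAKEOUT = "takeout"
    FUNCTION_ADJUSTMENT = "function adjustment"


class SemanticKind(Enum):
    COMMON = "common"            # recognized in every full-duplex state
    FULL_DUPLEX = "full duplex"  # tied to one particular scene
    NOT_PRESET = "not preset"    # noise or irrelevant speech


@dataclass(frozen=True)
class SemanticState:
    kind: SemanticKind
    # Only meaningful when kind is FULL_DUPLEX.
    scene: Optional[Scene] = None

    def is_preset(self) -> bool:
        # Preset semantics are the common and the full-duplex semantics.
        return self.kind is not SemanticKind.NOT_PRESET
```

Keeping the scene on the semantic state makes the later consistency check a plain comparison between the scene of the utterance and the scene the device is currently in.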
The above describes the case where the current semantic is not the preset semantic, and the following specifically analyzes the case where the current semantic is the preset semantic. Specifically, on the one hand, the step of "judging whether the current semantic state is the preset semantic state or not when the device is in the full duplex state" includes:
if the current semantic state is the preset semantic, judging whether the current semantic state is the common semantic;
and if the current semantic state is the common semantic, recognizing the voice information to obtain an operation instruction, and performing function control on the equipment according to the operation instruction.
That is, when the semantic state is determined to be a preset semantic, its type is further determined; if it is a common semantic, the voice information is recognized to obtain the operation instruction and function control is performed. For example, in the recipe management scene state, the voice information "increase the volume" has a common semantic, so it can be recognized, the operation instruction "increase the volume" is obtained, and the volume of the device is increased accordingly.
On the other hand, the step of judging whether the current semantic state is the preset semantic state or not under the condition that the equipment is in the full duplex state comprises the following steps:
if the current semantic state is the preset semantic, judging whether the current semantic state is the full duplex semantic;
if the current semantic state is full duplex semantic, judging whether the current semantic state is consistent with the full duplex state;
if they are consistent, recognizing the voice information to obtain an operation instruction, and performing function control on the equipment according to the operation instruction;
and if they are not consistent, not performing function control on the equipment.
As described above, there is more than one full-duplex state. Therefore, after the current semantic state is determined to be a full-duplex semantic, it is necessary to determine whether it is consistent with the current full-duplex state. If the two are consistent, the user has provided new voice information related to the current full-duplex state during the ongoing natural voice interaction, and the device is controlled according to the corresponding operation instruction. If the two are not consistent, the voice information is irrelevant to the current full-duplex state, and the device does not perform function control.
As in the case above, in the step "not performing function control on the device", the voice information is displayed on the screen in real time. If the current semantic state is inconsistent with the full-duplex state, the device cannot recognize the semantic information and therefore cannot obtain an operation instruction; the voice information is nevertheless displayed on the screen in real time to tell the user that it cannot be recognized because it does not match the current full-duplex state.
For example, if the current full-duplex state is the food material management scene state and the current voice information is "view today's recommended recipes", it can be determined that the current voice information cannot be used in the food material management scene state, and no function control is performed on the device. If the current voice information is "check the number of remaining potatoes", it is determined that the current voice information can be used in the food material management scene state. If the current voice information is "volume up", its semantic state can be determined to be a common semantic, so it can be recognized and the refrigerator is controlled to turn the volume up.
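The three cases in this example can be expressed as a small decision function. The sketch below assumes string labels for semantic kinds and scenes and a hypothetical function name `decide_in_full_duplex`; the return value "display_only" reflects the description's statement that unrecognized voice information is shown on the screen without any function control.

```python
from typing import Optional


def decide_in_full_duplex(
    device_scene: str,
    kind: str,
    semantic_scene: Optional[str] = None,
) -> str:
    """Decide what a device in a full-duplex scene does with an utterance.

    kind is "common", "full_duplex", or anything else (not preset).
    Returns "execute" or "display_only".
    """
    if kind == "common":
        # Common semantics are accepted in every full-duplex state.
        return "execute"
    if kind == "full_duplex" and semantic_scene == device_scene:
        # The full-duplex semantic is consistent with the current scene.
        return "execute"
    # Not preset, or a full-duplex semantic belonging to another scene.
    return "display_only"


# The device is in the food material management scene state.
print(decide_in_full_duplex("food_material", "full_duplex", "recipe"))
print(decide_in_full_duplex("food_material", "full_duplex", "food_material"))
print(decide_in_full_duplex("food_material", "common"))
```

The three calls correspond, in order, to "view today's recommended recipes", "check the number of remaining potatoes" and "volume up".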
Therefore, in this embodiment, different full-duplex states are set and a continuous conversation is possible only within the same full-duplex state. The device can thus quickly recognize voice information and perform function control, which greatly reduces its recognition and feedback time, improves interaction efficiency, and solves the problem of overly long waiting times when the user uses voice interaction.
Of course, in another embodiment of the present invention, the full-duplex state need not be divided into categories; as long as the device is in the full-duplex state and the semantic state is a full-duplex semantic, the device can recognize the voice information and give feedback, which also achieves the purpose of the present invention.
In addition, when the device is in the full-duplex state, if the operation command is not obtained within the time T, the device exits the full-duplex state. Generally, in a full-duplex state, a device usually has a certain waiting time T, and if there is still no common semantics or full-duplex semantics consistent with a current full-duplex state within the waiting time T, the device will not recognize and obtain an operation instruction, and exit from the full-duplex state. Or, of course, if the full duplex state is exited by other methods, for example, a voice command or a key command for exiting the full duplex state is provided, the object of the present invention can also be achieved.
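The waiting time T can be sketched as a simple session timer. The class name `FullDuplexSession`, the injectable `clock` parameter and the choice to restart the wait whenever an operation instruction is obtained are assumptions made for illustration; the patent only states that the full-duplex state is exited if no operation instruction is obtained within T.

```python
import time
from typing import Callable


class FullDuplexSession:
    """Tracks whether the device should remain in the full-duplex state."""

    def __init__(self, timeout_s: float,
                 clock: Callable[[], float] = time.monotonic):
        self.timeout_s = timeout_s      # the waiting time T, in seconds
        self._clock = clock
        self.active = True
        self._last_instruction = clock()

    def on_instruction(self) -> None:
        # Assumption: each obtained operation instruction restarts the wait.
        self._last_instruction = self._clock()

    def poll(self) -> bool:
        """Exit the full-duplex state once T has elapsed; return active."""
        elapsed = self._clock() - self._last_instruction
        if self.active and elapsed >= self.timeout_s:
            self.active = False
        return self.active
```

Passing the clock in as a parameter keeps the timer testable without real waiting. An explicit voice or key command to exit, which the description also allows, could simply set `active` to False.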
It should be noted that, in this embodiment, the current semantic state of the voice information can be determined by recognizing the corresponding keywords in it. For example, if keywords such as "food material", "food", "potato" or "onion" are recognized in the voice information, the current semantic state is determined to be a full-duplex semantic, specifically one matching the food material management scene state. If keywords such as "volume" or "voice" are recognized in the voice information, the current semantic state can be determined to be a common semantic. Of course, the current semantic state may also be determined in other ways, which likewise achieves the purpose of the invention.
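A minimal keyword matcher along these lines might look as follows. The keyword lists are the ones quoted in the description; the function name `classify`, the plain substring matching, the returned labels and the decision to check common keywords first are assumptions of this sketch.

```python
from typing import Optional, Tuple

# Keywords quoted in the description.
FOOD_MATERIAL_KEYWORDS = ("food material", "food", "potato", "onion")
COMMON_KEYWORDS = ("volume", "voice")


def classify(text: str) -> Tuple[str, Optional[str]]:
    """Map recognized text to (semantic kind, scene or None).

    Uses plain substring matching, so "potatoes" matches "potato".
    """
    lowered = text.lower()
    # Assumption: common keywords take precedence over scene keywords.
    if any(keyword in lowered for keyword in COMMON_KEYWORDS):
        return ("common", None)
    if any(keyword in lowered for keyword in FOOD_MATERIAL_KEYWORDS):
        return ("full_duplex", "food_material")
    return ("not_preset", None)
```

Substring matching is crude and would misfire on unrelated words that happen to contain a keyword. Only the food material scene is covered because the description gives no keywords for the other scenes.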
The above description is made for the case where the device is in the full-duplex state, and the following description is made for the case where the device is not in the full-duplex state. Similarly, when the device is not in the full-duplex state, the current semantics also include two cases of "being the preset semantics" and "not being the preset semantics". Specifically, the following description will be made in detail.
On one hand, if the current semantic is not the preset semantic, the step of "acquiring the current state of the device" includes:
under the condition that the equipment is not in a full duplex state, judging whether the current semantic state is a preset semantic or not;
and if the current semantic state is not the preset semantic, ignoring the current voice information and not performing function control on the equipment.
That is, when the device is not in the full duplex state and the current semantic is not the preset semantic, the device does not respond and directly ignores the current voice information. In this case, noise, extraneous information, and the like can be avoided.
As described above, the full-duplex state is a state in which the user and the device interact by natural voice without repeated wake-ups. The present invention also provides an ordinary, non-natural voice interaction state in which the user must wake the device each time before sending voice information or a user instruction. This non-natural voice interaction state suits simple voice interaction for which no separate full-duplex state is set.
Then, in another embodiment, the step of "obtaining the current state of the device" comprises:
under the condition that the equipment is not in a full duplex state, judging whether the current semantic state is a preset semantic or not;
if the current semantic state is not the preset semantic, recognizing the voice information and judging whether an operation instruction is obtained, and if the operation instruction is obtained, performing function control on the equipment according to the operation instruction.
Obviously, if no operation instruction can be obtained after the voice information is recognized, the voice information is pure noise, and the device neither responds nor performs function control.
In this case the device is not in the full-duplex state: after the user wakes the device, sends voice information, and the device performs the corresponding function control, the device exits the interaction and waits for the user's next wake-up.
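The two embodiments for non-preset voice information outside the full-duplex state differ only in whether recognition is attempted at all. The sketch below makes that choice a flag; `handle_non_preset_when_idle`, `ignore_non_preset` and `toy_parser` are hypothetical names, and the toy parser stands in for whatever instruction recognition the device actually uses.

```python
from typing import Callable, Optional


def handle_non_preset_when_idle(
    text: str,
    ignore_non_preset: bool,
    parse_instruction: Callable[[str], Optional[str]],
) -> Optional[str]:
    """Return the operation instruction to execute, or None for no control.

    Applies when the device is not in the full-duplex state and the
    current semantic state is not a preset semantic.
    """
    if ignore_non_preset:
        # First embodiment: ignore the voice information outright.
        return None
    # Second embodiment: try to recognize an instruction anyway;
    # a result of None means the voice information was only noise.
    return parse_instruction(text)


def toy_parser(text: str) -> Optional[str]:
    # Hypothetical stand-in for real instruction recognition.
    return "light_on" if "light" in text.lower() else None
```

For instance, `handle_non_preset_when_idle("turn on the light", False, toy_parser)` yields an instruction, while the same call with `ignore_non_preset=True` yields none.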
On the other hand, if the current semantic is the preset semantic, it is also necessary to determine whether the current semantic is the common semantic or the full duplex semantic.
Specifically, in an embodiment, the step "acquiring the current state of the device" includes:
under the condition that the equipment is not in a full duplex state, judging whether the current semantic state is a preset semantic or not;
if the current semantic state is the preset semantic, judging whether the current semantic state is the common semantic;
and if the current semantic state is the common semantic, recognizing the voice information to obtain an operation instruction, and performing function control on the equipment according to the operation instruction.
In the state that the equipment is not in full duplex, if the voice information sent by the user is in common semantics, the equipment can still recognize the voice information and obtain an operation instruction. However, since the full-duplex state is not available, the user can only perform a single voice interaction and then quit the interaction to wait for the next wake-up.
In another embodiment, the step of "obtaining the current state of the device" includes:
under the condition that the equipment is not in a full duplex state, judging whether the current semantic state is a preset semantic or not;
if the current semantic state is the preset semantic, judging whether the current semantic state is the full duplex semantic;
and if the current semantic state is full-duplex semantic, setting the current state of the equipment to be a full-duplex state, recognizing the voice information to obtain an operation instruction, and performing function control on the equipment according to the operation instruction.
That is, if the device is not in the full-duplex state and the current semantic state is a full-duplex semantic, the current state of the device can be set to the full-duplex state and the device can be controlled according to the operation instruction. This is the case, for example, when the user sends voice information to the device for the first time after waking it: the device is then necessarily not yet in the full-duplex state. The current state of the device is therefore set to the full-duplex state, and more specifically to a particular full-duplex state, such as the food material management scene state described above, so that subsequent voice interaction can take place.
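The transition into the full-duplex state can be sketched as below. `DeviceState` and `on_utterance_when_idle` are hypothetical names, scenes are plain strings, and non-preset voice information is handled per the embodiment that ignores it.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class DeviceState:
    # None means the device is not in any full-duplex state.
    scene: Optional[str] = None

    @property
    def full_duplex(self) -> bool:
        return self.scene is not None


def on_utterance_when_idle(
    state: DeviceState,
    kind: str,
    semantic_scene: Optional[str] = None,
) -> bool:
    """Handle voice information received outside the full-duplex state.

    Returns True if the operation instruction should be executed.
    """
    if kind == "full_duplex":
        # Enter the full-duplex state matching the utterance's scene.
        state.scene = semantic_scene
        return True
    if kind == "common":
        # Single interaction only: execute, but stay out of full duplex.
        return True
    # Not a preset semantic: ignored in this sketch.
    return False
```

After a full-duplex utterance the device remembers the scene, so later utterances can be checked for consistency against it.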
In summary, the present invention provides a device voice interaction method in which the current state of the device and the semantic information are both judged. When the device is in a full duplex state and the semantic state is not a preset semantic, the voice information is irrelevant information and the device does not perform the corresponding function control. This prevents the device, while in the full-duplex working state, from responding to noise and other irrelevant voice information, which would otherwise reduce its working efficiency.
The invention further provides a refrigerator, which comprises a memory and a processor, wherein the memory stores a computer program capable of running on the processor, and the processor executes the program to realize the steps in the device voice interaction method, namely, the processor executes the program to realize the steps in any technical scheme of the device voice interaction method.
The present invention further provides a computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the steps in the device voice interaction method as described above, that is, the processor implements the steps in any one of the technical solutions of the device voice interaction method as described above when executing the computer program.
It should be understood that although this specification is described in terms of embodiments, not every embodiment contains only a single independent technical solution. The specification is written this way for clarity only. Those skilled in the art should treat the specification as a whole, and the technical solutions in the embodiments may be appropriately combined to form other embodiments that those skilled in the art can understand.
The above detailed description is merely illustrative of possible embodiments of the present invention and is not intended to limit the scope of the invention, which is intended to include all equivalent embodiments or modifications within the scope of the present invention without departing from the technical spirit of the present invention.

Claims (12)

1. A device voice interaction method is characterized by comprising the following steps:
identifying the audio signal of the current environment to obtain current voice information;
performing semantic recognition on the current voice information to obtain a current semantic state;
acquiring the current state of the equipment, and judging whether the current semantic state is a preset semantic or not under the condition that the equipment is in a full-duplex state;
and if the current semantic state is not the preset semantic, not performing function control on the equipment.
2. The device voice interaction method of claim 1, wherein the step of judging whether the current semantic state is a preset semantic under the condition that the equipment is in a full-duplex state comprises:
if the current semantic state is the preset semantic, judging whether the current semantic state is the common semantic;
and if the current semantic state is the common semantic, recognizing the voice information to obtain an operation instruction, and performing function control on the equipment according to the operation instruction.
3. The device voice interaction method of claim 1, wherein the step of judging whether the current semantic state is a preset semantic under the condition that the equipment is in a full-duplex state comprises:
if the current semantic state is the preset semantic, judging whether the current semantic state is the full duplex semantic;
if the current semantic state is full duplex semantic, judging whether the current semantic state is consistent with the full duplex state;
if they are consistent, recognizing the voice information to obtain an operation instruction, and performing function control on the equipment according to the operation instruction;
and if they are not consistent, not performing function control on the equipment.
4. The device voice interaction method according to claim 2 or 3, wherein in the case that the device is in the full-duplex state, if no operation instruction is obtained within the time T, the full-duplex state is exited.
5. The device voice interaction method according to claim 1 or 3, wherein the step of not performing function control on the device specifically comprises:
and displaying the voice information on the screen in real time.
6. The device voice interaction method of claim 1, wherein the step of obtaining the current state of the device comprises:
under the condition that the equipment is not in a full duplex state, judging whether the current semantic state is a preset semantic or not;
and if the current semantic state is not the preset semantic, ignoring the current voice information and not performing function control on the equipment.
7. The device voice interaction method of claim 1, wherein the step of obtaining the current state of the device comprises:
under the condition that the equipment is not in a full duplex state, judging whether the current semantic state is a preset semantic or not;
if the current semantic state is not the preset semantic, recognizing the voice information and judging whether an operation instruction is obtained, and if the operation instruction is obtained, performing function control on the equipment according to the operation instruction.
8. The device voice interaction method of claim 1, wherein the step of obtaining the current state of the device comprises:
under the condition that the equipment is not in a full duplex state, judging whether the current semantic state is a preset semantic or not;
if the current semantic state is the preset semantic, judging whether the current semantic state is the common semantic;
and if the current semantic state is the common semantic, recognizing the voice information to obtain an operation instruction, and performing function control on the equipment according to the operation instruction.
9. The device voice interaction method of claim 1, wherein the step of obtaining the current state of the device comprises:
under the condition that the equipment is not in a full duplex state, judging whether the current semantic state is a preset semantic or not;
if the current semantic state is the preset semantic, judging whether the current semantic state is the full duplex semantic;
and if the current semantic state is full-duplex semantic, setting the current state of the equipment to be a full-duplex state, recognizing the voice information to obtain an operation instruction, and performing function control on the equipment according to the operation instruction.
10. The device voice interaction method of claim 1, wherein the "full duplex state" comprises: a food material management scene state, a menu management scene state, an audio-visual scene state, a takeout scene state and a function adjustment scene state.
11. A refrigerator comprising a memory and a processor, the memory storing a computer program operable on the processor, wherein the processor when executing the program implements the steps in the device voice interaction method of any of claims 1-10.
12. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the steps of the method for voice interaction of a device according to any one of claims 1 to 10.
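
By way of a non-limiting illustration, the decision flow recited in claims 1 to 6 and 8 to 10 can be sketched in Python as follows. The `Semantic` record, the `Device` class, the injectable `clock`, and the default value of the time T are assumptions made for this sketch and do not appear in the claims. The sketch follows claim 6 rather than the alternative of claim 7 for non-preset speech outside the full-duplex state, and it checks the time T lazily when the next utterance arrives rather than with a background timer.

```python
import time
from dataclasses import dataclass
from enum import Enum
from typing import Callable, Optional


class Scene(Enum):
    """Full-duplex scene states listed in claim 10."""
    FOOD_MANAGEMENT = "food material management"
    MENU_MANAGEMENT = "menu management"
    AUDIO_VISUAL = "audio-visual"
    TAKEOUT = "takeout"
    FUNCTION_ADJUSTMENT = "function adjustment"


@dataclass
class Semantic:
    """Result of semantic recognition (this structure is an assumption)."""
    preset: bool                   # True if the speech matches a preset semantic
    scene: Optional[Scene] = None  # None: common semantic; else full-duplex semantic
    instruction: str = ""          # operation instruction recognized from the speech


class Device:
    def __init__(self, timeout_s: float = 30.0,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.scene: Optional[Scene] = None  # None: not in the full-duplex state
        self.timeout_s = timeout_s          # the time T of claim 4 (value assumed)
        self._clock = clock
        self._last_instruction_at = clock()

    def _expire_if_idle(self) -> None:
        # Claim 4: leave the full-duplex state if no instruction arrived within T.
        idle = self._clock() - self._last_instruction_at
        if self.scene is not None and idle >= self.timeout_s:
            self.scene = None

    def _accept(self, sem: Semantic) -> str:
        # Record when the instruction was obtained and hand it to function control.
        self._last_instruction_at = self._clock()
        return sem.instruction

    def handle(self, sem: Semantic) -> Optional[str]:
        """Return the instruction to execute, or None for no function control."""
        self._expire_if_idle()
        if not sem.preset:
            return None                # claims 1 and 6: irrelevant speech
        if sem.scene is None:
            return self._accept(sem)   # claims 2 and 8: common semantic
        if self.scene is None:
            self.scene = sem.scene     # claim 9: enter the full-duplex state
            return self._accept(sem)
        if sem.scene is self.scene:
            return self._accept(sem)   # claim 3: consistent with current state
        return None                    # claim 3: inconsistent, no control
```

In use, a caller would construct one `Device`, convert each recognized utterance into a `Semantic`, and pass it to `handle`; passing a fake `clock` makes the time-T behaviour testable without waiting.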

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010070740.8A CN113223510B (en) 2020-01-21 2020-01-21 Refrigerator and equipment voice interaction method and computer readable storage medium thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010070740.8A CN113223510B (en) 2020-01-21 2020-01-21 Refrigerator and equipment voice interaction method and computer readable storage medium thereof

Publications (2)

Publication Number Publication Date
CN113223510A true CN113223510A (en) 2021-08-06
CN113223510B CN113223510B (en) 2022-09-20

Family

ID=77085451

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010070740.8A Active CN113223510B (en) 2020-01-21 2020-01-21 Refrigerator and equipment voice interaction method and computer readable storage medium thereof

Country Status (1)

Country Link
CN (1) CN113223510B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113223510B (en) * 2020-01-21 2022-09-20 青岛海尔电冰箱有限公司 Refrigerator and equipment voice interaction method and computer readable storage medium thereof

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2467220A1 (en) * 2003-05-29 2004-11-29 Microsoft Corporation Semantic object synchronous understanding implemented with speech application language tags
US8060371B1 (en) * 2007-05-09 2011-11-15 Nextel Communications Inc. System and method for voice interaction with non-voice enabled web pages
CN105719649A (en) * 2016-01-19 2016-06-29 百度在线网络技术(北京)有限公司 Voice recognition method and device
CN107316643A (en) * 2017-07-04 2017-11-03 科大讯飞股份有限公司 Voice interactive method and device
CN108093350A (en) * 2017-12-21 2018-05-29 广东小天才科技有限公司 Microphone control method and microphone
CN108337362A (en) * 2017-12-26 2018-07-27 百度在线网络技术(北京)有限公司 Voice interactive method, device, equipment and storage medium
WO2019015435A1 (en) * 2017-07-19 2019-01-24 腾讯科技(深圳)有限公司 Speech recognition method and apparatus, and storage medium
US20190134812A1 (en) * 2017-11-09 2019-05-09 Samsung Electronics Co., Ltd. Electronic device capable of moving and operating method thereof
CN109920413A (en) * 2018-12-28 2019-06-21 广州索答信息科技有限公司 A kind of implementation method and storage medium of kitchen scene touch screen voice dialogue
CN110634486A (en) * 2018-06-21 2019-12-31 阿里巴巴集团控股有限公司 Voice processing method and device

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113223510B (en) * 2020-01-21 2022-09-20 青岛海尔电冰箱有限公司 Refrigerator and equipment voice interaction method and computer readable storage medium thereof

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
陈杰 (Chen Jie): "《电脑医院:电脑软硬件故障排除实例》" (Computer Hospital: Examples of Computer Software and Hardware Troubleshooting), 30 November 1999 *

Also Published As

Publication number Publication date
CN113223510B (en) 2022-09-20

Similar Documents

Publication Publication Date Title
US20190027129A1 (en) Method, apparatus, device and storage medium for switching voice role
CN112201246B (en) Intelligent control method and device based on voice, electronic equipment and storage medium
CN111045639B (en) Voice input method, device, electronic equipment and storage medium
CN109584860B (en) Voice wake-up word definition method and system
WO2020048222A1 (en) Sound effect adjustment method and apparatus, electronic device and storage medium
EP3611724A1 (en) Voice response method and device, and smart device
CN107424607A (en) Voice control mode switching method and device and equipment with device
CN103197571A (en) Control method, device and system
JP2003526120A (en) Dialogue processing method with consumer electronic equipment system
US20200265843A1 (en) Speech broadcast method, device and terminal
KR20160132748A (en) Electronic apparatus and the controlling method thereof
WO2021196617A1 (en) Voice interaction method and apparatus, electronic device and storage medium
CN107799115A (en) A kind of audio recognition method and device
CN115327932A (en) Scene creation method and device, electronic equipment and storage medium
CN113223510B (en) Refrigerator and equipment voice interaction method and computer readable storage medium thereof
CN109274825A (en) A kind of message prompt method and device
CN109658924B (en) Session message processing method and device and intelligent equipment
CN112420043A (en) Intelligent awakening method and device based on voice, electronic equipment and storage medium
WO2019242415A1 (en) Position prompt method, device, storage medium and electronic device
CN112786047B (en) Voice processing method, device, equipment, storage medium and intelligent sound box
CN112787899B (en) Equipment voice interaction method, computer readable storage medium and refrigerator
CN110164426A (en) Sound control method and computer storage medium
US11580974B2 (en) Method for exiting a voice skill, apparatus, device and storage medium
CN111414760B (en) Natural language processing method, related equipment, system and storage device
CN114627859A (en) Method and system for recognizing electronic photo frame in offline semantic manner

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant