CN108962235A - Voice interactive method and device - Google Patents


Publication number
CN108962235A
Authority
CN
China
Prior art keywords
acquisition instruction
content acquisition
content
instruction
psychomotor domain
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201711446766.2A
Other languages
Chinese (zh)
Other versions
CN108962235B (en)
Inventor
高慧湍
韩伟
李茂全
李宝祥
修铭徽
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Orion Star Technology Co Ltd
Original Assignee
Beijing Orion Star Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Orion Star Technology Co Ltd
Priority to CN201711446766.2A
Publication of CN108962235A
Application granted
Publication of CN108962235B
Legal status: Active
Anticipated expiration

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/22: Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223: Execution procedure of a spoken command

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Telephonic Communication Services (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The present invention proposes a voice interaction method and device. The method includes: receiving a first content acquisition instruction and obtaining content according to the first content acquisition instruction; if a second content acquisition instruction is received within a preset time period, judging whether the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain or to related skill domains; and, if it is determined that they do, obtaining content according to the second content acquisition instruction. This embodiment allows a user to express an intention through multiple content acquisition instructions. Moreover, because the domain of noise and other sounds in the surrounding environment is generally unrelated to the skill domain of the user's content acquisition instructions, this embodiment prevents the voice device from mistakenly executing voice instructions from the surrounding environment, which improves voice interaction efficiency and the user's experience of the voice device.

Description

Voice interactive method and device
Technical field
The present invention relates to the technical field of voice devices, and in particular to a voice interaction method and device.
Background
There are currently two main voice interaction methods. In the first, a voice instruction is executed exactly once after each wake-up. In the second, voice instructions received within a specific time period after each wake-up are allowed to execute. However, the first scheme requires the user to wake the voice device frequently; in particular, when the user cannot express an intention with a single voice instruction, the first scheme makes effective interaction between the user and the voice device difficult. In the second scheme, because voice devices are generally used in open environments and face considerable noise and background sound, the voice device is prone to mistakenly executing voice instructions from the surrounding environment. This also makes effective interaction between the user and the voice device difficult, lowers voice interaction efficiency, and degrades the user's experience of the voice device.
Summary of the invention
The present invention aims to solve, at least to some extent, one of the technical problems in the related art.
To this end, a first object of the present invention is to propose a voice interaction method that solves the prior-art problem of poor voice interaction efficiency, which degrades the user's experience of voice devices.
A second object of the present invention is to propose a voice interaction device.
A third object of the present invention is to propose an electronic device.
A fourth object of the present invention is to propose a non-transitory computer-readable storage medium.
A fifth object of the present invention is to propose a computer program product.
To achieve the above objects, an embodiment of the first aspect of the present invention proposes a voice interaction method, comprising:
receiving a first content acquisition instruction, and obtaining content according to the first content acquisition instruction;
if a second content acquisition instruction is received within a preset time period, judging whether the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain or to related skill domains;
if it is determined that the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain or to related skill domains, obtaining content according to the second content acquisition instruction.
Further, the method also includes:
if it is determined that the second content acquisition instruction and the first content acquisition instruction belong neither to the same skill domain nor to related skill domains, not responding to the second content acquisition instruction.
Further, if it is determined that the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain, obtaining content according to the second content acquisition instruction specifically includes:
obtaining content according to the parsing result of the second content acquisition instruction in combination with the parsing result of the first content acquisition instruction;
if it is determined that the second content acquisition instruction and the first content acquisition instruction belong to related skill domains, obtaining content according to the second content acquisition instruction specifically includes:
obtaining content according to the parsing result of the second content acquisition instruction.
Further, the preset time period is determined according to the skill domain to which the first content acquisition instruction belongs.
Further, judging whether the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain or to related skill domains specifically includes:
determining, according to the instruction parsing results, a first skill domain to which the first content acquisition instruction belongs and a second skill domain to which the second content acquisition instruction belongs;
if the first skill domain is the same as the second skill domain, determining that the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain;
if the first skill domain differs from the second skill domain, querying a preset related-domain mapping rule to determine the preset related skill domains corresponding to the first skill domain;
if the preset related skill domains include the second skill domain, determining that the second content acquisition instruction and the first content acquisition instruction belong to related skill domains.
Further, before judging whether the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain or to related skill domains, the method also includes:
determining that the second content acquisition instruction is not a wake-up instruction.
Further, the method also includes: if the second content acquisition instruction is a wake-up instruction, responding to the wake-up instruction.
The voice interaction method provided in this embodiment receives a first content acquisition instruction and obtains content according to it; if a second content acquisition instruction is received within a preset time period, it judges whether the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain or to related skill domains; and, if they do, it obtains content according to the second content acquisition instruction. This embodiment allows a user to express an intention through multiple content acquisition instructions. Moreover, because the domain of noise and other sounds in the surrounding environment is generally unrelated to the skill domain of the user's content acquisition instructions, this embodiment prevents the voice device from mistakenly executing voice instructions from the surrounding environment, which improves voice interaction efficiency and the user's experience of the voice device.
To achieve the above objects, an embodiment of the second aspect of the present invention proposes a voice interaction device, comprising:
an acquisition module, configured to receive a first content acquisition instruction and obtain content according to the first content acquisition instruction;
a judgment module, configured to judge, when a second content acquisition instruction is received within a preset time period, whether the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain or to related skill domains;
the acquisition module being further configured to obtain content according to the second content acquisition instruction when it is determined that the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain or to related skill domains.
Further, the device also includes:
a processing module, configured not to respond to the second content acquisition instruction when it is determined that the second content acquisition instruction and the first content acquisition instruction belong neither to the same skill domain nor to related skill domains.
Further, the acquisition module is specifically configured to:
when it is determined that the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain, obtain content according to the parsing result of the second content acquisition instruction in combination with the parsing result of the first content acquisition instruction;
when it is determined that the second content acquisition instruction and the first content acquisition instruction belong to related skill domains, obtain content according to the parsing result of the second content acquisition instruction.
Further, the preset time period is determined according to the skill domain to which the first content acquisition instruction belongs.
Further, the judgment module is specifically configured to:
determine, according to the instruction parsing results, a first skill domain to which the first content acquisition instruction belongs and a second skill domain to which the second content acquisition instruction belongs;
if the first skill domain is the same as the second skill domain, determine that the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain;
if the first skill domain differs from the second skill domain, query a preset related-domain mapping rule to determine the preset related skill domains corresponding to the first skill domain;
if the preset related skill domains include the second skill domain, determine that the second content acquisition instruction and the first content acquisition instruction belong to related skill domains.
Further, the judgment module is also configured to determine, before judging whether the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain or to related skill domains, that the second content acquisition instruction is not a wake-up instruction.
Further, the device also includes:
a response module, configured to respond to the wake-up instruction when the second content acquisition instruction is a wake-up instruction.
The voice interaction device provided in this embodiment receives a first content acquisition instruction and obtains content according to it; if a second content acquisition instruction is received within a preset time period, it judges whether the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain or to related skill domains; and, if they do, it obtains content according to the second content acquisition instruction. This embodiment allows a user to express an intention through multiple content acquisition instructions. Moreover, because the domain of noise and other sounds in the surrounding environment is generally unrelated to the skill domain of the user's content acquisition instructions, this embodiment prevents the voice device from mistakenly executing voice instructions from the surrounding environment, which improves voice interaction efficiency and the user's experience of the voice device.
To achieve the above objects, an embodiment of the third aspect of the present invention proposes an electronic device, comprising: a memory, a processor, and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the voice interaction method described above when executing the program.
To achieve the above objects, an embodiment of the fourth aspect of the present invention proposes a computer-readable storage medium on which a computer program is stored, the program implementing the voice interaction method described above when executed by a processor.
To achieve the above objects, an embodiment of the fifth aspect of the present invention proposes a computer program product, wherein the voice interaction method described above is implemented when instructions in the computer program product are executed by a processor.
Additional aspects and advantages of the present invention will be set forth in part in the following description, and in part will become apparent from that description or be learned through practice of the invention.
Brief description of the drawings
The above and/or additional aspects and advantages of the present invention will become apparent and readily understood from the following description of the embodiments taken in conjunction with the accompanying drawings, in which:
Fig. 1 is a schematic flowchart of a voice interaction method provided by an embodiment of the present invention;
Fig. 2 is a schematic structural diagram of a voice interaction device provided by an embodiment of the present invention;
Fig. 3 is a schematic structural diagram of an electronic device provided by an embodiment of the present invention.
Detailed description
Embodiments of the present invention are described in detail below, and examples of the embodiments are shown in the accompanying drawings, in which the same or similar reference numerals throughout denote the same or similar elements or elements having the same or similar functions. The embodiments described below with reference to the drawings are exemplary; they are intended to explain the present invention and should not be construed as limiting it.
The voice interaction method and device of the embodiments of the present invention are described below with reference to the drawings.
Fig. 1 is a schematic flowchart of a voice interaction method provided by an embodiment of the present invention. As shown in Fig. 1, the voice interaction method includes the following steps:
S101: Receive a first content acquisition instruction, and obtain content according to the first content acquisition instruction.
The voice interaction method provided by the present invention is executed by a voice interaction device, which may be a voice device or the backend server corresponding to a voice device. The voice device may be, for example, a smart speaker, smart air conditioner, smart washing machine, smart TV, or any other device that can interact with a user by voice and perform operations according to the user's instructions.
In this embodiment, when the voice interaction device is the backend server corresponding to the voice device, the first content acquisition instruction may be obtained as follows: while interacting with the user, the voice device picks up the user's voice instruction and sends it directly to the backend server. After obtaining the first content acquisition instruction, the backend server may perform speech recognition on it to obtain its parsing result, and then obtain content according to that parsing result.
In this embodiment, when the voice interaction device is the voice device itself, the first content acquisition instruction may be the user's voice instruction picked up while the voice device is interacting with the user. After obtaining the first content acquisition instruction, the voice interaction device may perform speech recognition on it to obtain its parsing result, and then obtain content according to that parsing result.
It should be noted that, in this embodiment, the content may be the response to the first content acquisition instruction. For example, when the first content acquisition instruction is "I want to listen to Wang Qing Shui", the corresponding content may be "the version of Wang Qing Shui by performer A"; as another example, when the first content acquisition instruction is "I want to listen to Luoji Siwei", the corresponding content may be "episode 12 of Luoji Siwei"; when the first content acquisition instruction is "check the weather", the corresponding content may be "rainy".
S102: If a second content acquisition instruction is received within a preset time period, judge whether the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain or to related skill domains.
The preset time period is determined according to the skill domain to which the first content acquisition instruction belongs. In this embodiment, before step S102 and after receiving the first content acquisition instruction, the voice interaction device may determine, from the parsing result of the first content acquisition instruction, the first skill domain to which that instruction belongs, determine the preset time period corresponding to the first skill domain, start timing, and judge whether a second content acquisition instruction is received within the preset time period. If no second content acquisition instruction is received within the preset time period, the current voice interaction ends.
In this embodiment, when the voice interaction device is the backend server corresponding to the voice device, after the voice interaction ends the voice interaction device may send a stop-interaction instruction to the voice device so that the voice device no longer accepts voice instructions; only after the voice device receives a wake-up instruction from the user and performs the wake-up operation does it again receive voice instructions and send them to the voice interaction device. When the voice interaction device is the voice device itself, after the voice interaction ends it no longer accepts voice instructions until it receives a wake-up instruction from the user and performs the wake-up operation, after which it resumes receiving the user's voice instructions.
In this embodiment, when the voice interaction device is the backend server corresponding to the voice device, one way for the voice interaction device to determine the first skill domain from the parsing result of the first content acquisition instruction is as follows: input the parsing result of the first content acquisition instruction into a preset skill domain model to obtain the probability that the parsing result belongs to each skill domain; then determine the first skill domain to which the first content acquisition instruction belongs according to those probabilities. The preset skill domain model may be a model trained on a large number of sentences or words corresponding to each skill domain.
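As a minimal sketch of the last step above, assuming the preset skill domain model has already produced one probability per skill domain, the first skill domain could simply be taken as the most probable one. The function name `pick_skill_domain`, the domain labels, and the highest-probability rule are illustrative assumptions; the text does not specify how the probabilities are turned into a decision.

```python
from typing import Dict


def pick_skill_domain(probabilities: Dict[str, float]) -> str:
    """Return the skill domain with the highest probability (assumed rule)."""
    if not probabilities:
        raise ValueError("the model returned no skill domains")
    # max() iterates over the keys and ranks them by their probability values.
    return max(probabilities, key=probabilities.get)


# Hypothetical model output for one parsed instruction.
scores = {"music": 0.86, "weather": 0.04, "taxi": 0.10}
print(pick_skill_domain(scores))
```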
When the voice interaction device is the backend server corresponding to the voice device, another way for the voice interaction device to determine the first skill domain from the parsing result of the first content acquisition instruction is as follows: segment the parsing result of the first content acquisition instruction into words to obtain a word segmentation result; compare each word in the segmentation result with the words of each skill domain to determine how many words in the segmentation result belong to each skill domain; then determine the first skill domain to which the first content acquisition instruction belongs according to those counts.
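The word-counting approach above might look like the following sketch. The vocabularies in `DOMAIN_WORDS`, the function names, and the rule of picking the domain with the largest count are assumptions made for illustration, and whitespace splitting stands in for a real word segmenter.

```python
from typing import Dict, List, Optional, Set

# Hypothetical vocabularies; a real system would hold far larger word lists.
DOMAIN_WORDS: Dict[str, Set[str]] = {
    "music": {"listen", "song", "singer"},
    "weather": {"weather", "rain", "temperature"},
}


def count_domain_words(tokens: List[str]) -> Dict[str, int]:
    """Count how many segmented words fall in each skill domain's vocabulary."""
    return {
        domain: sum(1 for token in tokens if token in words)
        for domain, words in DOMAIN_WORDS.items()
    }


def classify_by_word_count(tokens: List[str]) -> Optional[str]:
    """Pick the domain with the most matching words, or None if nothing matches."""
    counts = count_domain_words(tokens)
    best = max(counts, key=counts.get)
    return best if counts[best] > 0 else None


# Whitespace splitting stands in for a real word segmenter.
tokens = "i want to listen to a song".split()
print(classify_by_word_count(tokens))
```

Returning `None` when no word matches is a design choice of this sketch; the text does not say what happens in that case.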
Of course, other ways of determining the first skill domain to which the first content acquisition instruction belongs may also be used; they are not enumerated here.
The second skill domain to which the second content acquisition instruction belongs may be determined in the same way as the first skill domain to which the first content acquisition instruction belongs, so the details are not repeated here.
In this embodiment, the preset time period corresponding to each skill domain may be set according to actual needs and is not specifically limited here.
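To illustrate a per-domain preset time period, the sketch below looks up a follow-up window by skill domain and checks whether a second instruction arrived inside it. The table `PRESET_PERIODS`, its values in seconds, the fallback `DEFAULT_PERIOD`, and the function names are all hypothetical; the text leaves the actual periods open.

```python
from typing import Dict

# Hypothetical follow-up windows in seconds, one per skill domain.
PRESET_PERIODS: Dict[str, float] = {"music": 10.0, "audiobook": 15.0, "weather": 5.0}
DEFAULT_PERIOD = 8.0  # assumed fallback for domains without an entry


def preset_period(first_domain: str) -> float:
    """Return the preset time period for the first instruction's skill domain."""
    return PRESET_PERIODS.get(first_domain, DEFAULT_PERIOD)


def within_preset_period(first_time: float, second_time: float, first_domain: str) -> bool:
    """Check whether the second instruction arrived inside the preset period."""
    elapsed = second_time - first_time
    return 0.0 <= elapsed <= preset_period(first_domain)


# A follow-up 4 seconds after a weather query is inside the 5-second window.
print(within_preset_period(100.0, 104.0, "weather"))
```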
If a second content acquisition instruction is received within the preset time period, a first way for the voice interaction device to judge whether the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain or to related skill domains is as follows: determine, according to the instruction parsing results, the first skill domain to which the first content acquisition instruction belongs and the second skill domain to which the second content acquisition instruction belongs; if the first skill domain is the same as the second skill domain, determine that the two instructions belong to the same skill domain; if the first skill domain differs from the second skill domain, query the preset related-domain mapping rule to determine the preset related skill domains corresponding to the first skill domain; if the preset related skill domains include the second skill domain, determine that the two instructions belong to related skill domains. If the preset related skill domains do not include the second skill domain, determine that the two instructions belong neither to the same skill domain nor to related skill domains.
The preset related-domain mapping rule stores the preset related skill domains corresponding to each skill domain.
If a second content acquisition instruction is received within the preset time period, a second way for the voice interaction device to judge whether the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain or to related skill domains is as follows: determine, according to the instruction parsing results, the first skill domain to which the first content acquisition instruction belongs and the second skill domain to which the second content acquisition instruction belongs; query the preset related-domain mapping rule to determine the preset related skill domains corresponding to the first skill domain; if the preset related skill domains include the second skill domain, determine that the two instructions belong to related skill domains; if the preset related skill domains do not include the second skill domain, judge whether the first skill domain is the same as the second skill domain, and if it is, determine that the two instructions belong to the same skill domain; if the first skill domain differs from the second skill domain, determine that the two instructions belong neither to the same skill domain nor to related skill domains.
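The two judging orders described above can be sketched with a single lookup table. The mapping `RELATED_DOMAINS`, the domain labels, and the three return values are assumptions for illustration; the sketch follows the first order (equality check, then mapping rule).

```python
from typing import Dict, Set

# Hypothetical mapping rule: each skill domain lists its preset related domains.
RELATED_DOMAINS: Dict[str, Set[str]] = {
    "weather": {"taxi"},
    "music": {"audiobook"},
}


def classify_relation(first_domain: str, second_domain: str) -> str:
    """Return "same", "related", or "unrelated" for the two skill domains."""
    if first_domain == second_domain:
        return "same"
    # Domains without an entry in the mapping rule have no related domains.
    if second_domain in RELATED_DOMAINS.get(first_domain, set()):
        return "related"
    return "unrelated"


print(classify_relation("weather", "taxi"))
```

Checking the mapping rule first and equality second, as in the second order, gives the same answers as long as no skill domain is listed as related to itself.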
Further, on the basis of the above embodiments, before judging whether the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain or to related skill domains, the voice interaction device may first judge whether the second content acquisition instruction is a wake-up instruction. If the second content acquisition instruction is not a wake-up instruction, the device judges whether the two instructions belong to the same skill domain or to related skill domains; if the second content acquisition instruction is a wake-up instruction, the device responds to the wake-up instruction.
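The ordering described above, wake-up check first and domain comparison second, could be sketched as follows. The phrase set `WAKE_PHRASES`, the exact-match test, and the callback `compare_domains` are hypothetical stand-ins; a real device would use its own wake-word detection.

```python
from typing import Callable

# Hypothetical wake-up phrases; a real device would use its own detector.
WAKE_PHRASES = {"hello speaker"}


def is_wake_up(text: str) -> bool:
    """Treat the instruction as a wake-up instruction on an exact phrase match."""
    return text.strip().lower() in WAKE_PHRASES


def handle_second_instruction(text: str, compare_domains: Callable[[str], str]) -> str:
    """Respond to a wake-up instruction, otherwise fall through to domain judging."""
    if is_wake_up(text):
        return "wake"
    return compare_domains(text)


# The lambda stands in for the same/related skill domain judgment.
print(handle_second_instruction("Hello Speaker", lambda text: "same"))
```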
S103: If it is determined that the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain or to related skill domains, obtain content according to the second content acquisition instruction.
In this embodiment, if it is determined that the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain, the voice interaction device may obtain content according to the parsing result of the second content acquisition instruction in combination with the parsing result of the first content acquisition instruction. If it is determined that the two instructions belong to related skill domains, the voice interaction device may obtain content according to the parsing result of the second content acquisition instruction alone.
For example, when the first content acquisition instruction is "I want to listen to Wang Qing Shui" and the second content acquisition instruction is "I want to listen to Liu Dehua", the two instructions belong to the same skill domain, and the corresponding content may be "Wang Qing Shui by Liu Dehua". When the first content acquisition instruction is "I want to listen to Luoji Siwei" and the second content acquisition instruction is "episode 9", the two instructions belong to the same skill domain, and the corresponding content may be "episode 9 of Luoji Siwei".
As another example, when the first content acquisition instruction is "check the weather" and the second content acquisition instruction is "call me a car, I want to take a taxi to the company", the two instructions belong to related skill domains, and the corresponding content may be "start the ride-hailing function", for example launching ride-hailing software according to the location, automatically entering the company address, booking the trip, and so on.
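The difference between the two cases above can be sketched by treating each parsing result as a dictionary of slots. The slot names, the placeholder values, and the rule that the second instruction's slots override the first's are assumptions; the text only says that the two parsing results are combined in the same-domain case.

```python
from typing import Dict


def build_query(first: Dict[str, str], second: Dict[str, str], relation: str) -> Dict[str, str]:
    """Combine parsing results for "same"; use only the second for "related"."""
    if relation == "same":
        merged = dict(first)
        merged.update(second)  # assumed: the later instruction overrides on conflict
        return merged
    if relation == "related":
        return dict(second)
    raise ValueError("unrelated instructions are not answered")


# Placeholder slot values: a title from the first instruction, an episode from the second.
print(build_query({"title": "Program P"}, {"episode": "9"}, "same"))
# Related domains: only the second instruction's slots are used.
print(build_query({"query": "weather"}, {"destination": "company"}, "related"))
```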
It should also be noted that the method further includes: if it is determined that the second content acquisition instruction and the first content acquisition instruction belong neither to the same skill domain nor to related skill domains, the voice interaction device does not respond to the second content acquisition instruction and continues timing, judging whether a third content acquisition instruction is received before the preset time period expires; if no third content acquisition instruction is received, the current voice interaction ends.
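The behavior described above, where unrelated instructions are skipped while the timer keeps running, might be sketched as a loop over timestamped follow-ups. The `Instruction` tuple, the function names, and the choice to measure the preset time period from the first instruction without restarting it are assumptions of this sketch.

```python
from typing import Callable, List, Tuple

Instruction = Tuple[float, str]  # (arrival time in seconds, skill domain)


def same_or_unrelated(first_domain: str, second_domain: str) -> str:
    """Simplified stand-in for the same/related skill domain judgment."""
    return "same" if first_domain == second_domain else "unrelated"


def answered_follow_ups(
    first: Instruction,
    follow_ups: List[Instruction],
    period: float,
    classify: Callable[[str, str], str] = same_or_unrelated,
) -> List[Instruction]:
    """Return the follow-ups that would be answered within the preset period."""
    first_time, first_domain = first
    answered = []
    for arrival, domain in sorted(follow_ups):
        if arrival - first_time > period:
            break  # the preset time period has expired; the interaction ends
        if classify(first_domain, domain) == "unrelated":
            continue  # not answered, but timing continues
        answered.append((arrival, domain))
    return answered


follow_ups = [(2.0, "news"), (4.0, "music"), (20.0, "music")]
print(answered_follow_ups((0.0, "music"), follow_ups, period=10.0))
```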
The voice interaction method provided in this embodiment receives a first content acquisition instruction and obtains content according to it; if a second content acquisition instruction is received within a preset time period, it judges whether the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain or to related skill domains; and, if they do, it obtains content according to the second content acquisition instruction. This embodiment allows a user to express an intention through multiple content acquisition instructions. Moreover, because the domain of noise and other sounds in the surrounding environment is generally unrelated to the skill domain of the user's content acquisition instructions, this embodiment prevents the voice device from mistakenly executing voice instructions from the surrounding environment, which improves voice interaction efficiency and the user's experience of the voice device.
Fig. 2 is a schematic structural diagram of a voice interaction device provided by an embodiment of the present invention. As shown in Fig. 2, the device includes an acquisition module 21 and a judgment module 22.
The acquisition module 21 is configured to receive a first content acquisition instruction and obtain content according to the first content acquisition instruction;
the judgment module 22 is configured to, when a second content acquisition instruction is received within a preset time period, judge whether the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain or to related skill domains;
the acquisition module 21 is further configured to obtain content according to the second content acquisition instruction when it is determined that the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain or to related skill domains.
The voice interaction device provided by the present invention may specifically be a voice device or the background server corresponding to a voice device. The voice device may be, for example, a smart speaker, a smart air conditioner, a smart washing machine, a smart television, or any other device that can interact with a user by voice and perform corresponding operations according to the user's instructions.
In this embodiment, when the voice interaction device is a voice device, the first content acquisition instruction may be obtained as a voice instruction of the user that the voice device picks up while interacting with the user. After obtaining the first content acquisition instruction, the voice interaction device may perform speech recognition on it to obtain its parsing result, and obtain content according to that parsing result.
When the voice interaction device is the background server corresponding to a voice device, the first content acquisition instruction may be obtained as follows: while interacting with the user, the voice device picks up the user's voice instruction and sends it directly to the background server. After obtaining the first content acquisition instruction, the background server may perform speech recognition on it to obtain its parsing result, and obtain content according to that parsing result.
The preset time period is determined according to the skill domain to which the first content acquisition instruction belongs. In this embodiment, before step 102 and after receiving the first content acquisition instruction, the voice interaction device may determine, from the parsing result of the first content acquisition instruction, the first skill domain to which that instruction belongs, determine the preset time period corresponding to the first skill domain, start timing, and judge whether a second content acquisition instruction is received within the preset time period; if no second content acquisition instruction is received within the preset time period, the current voice interaction ends.
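The per-domain time window described above could be sketched as follows. This is only an illustration under stated assumptions: the domain names, the window lengths, the fallback value, and the function names `preset_window` and `within_window` are all hypothetical, since the specification gives no concrete values.

```python
from typing import Optional

# Hypothetical window lengths in seconds, keyed by skill domain.
DOMAIN_WINDOWS = {"music": 10.0, "weather": 5.0}
DEFAULT_WINDOW = 8.0  # assumed fallback for domains not listed above


def preset_window(first_domain: str) -> float:
    """Return the follow-up window for the first instruction's skill domain."""
    return DOMAIN_WINDOWS.get(first_domain, DEFAULT_WINDOW)


def within_window(first_time: float, second_time: Optional[float],
                  first_domain: str) -> bool:
    """True if a second instruction arrived before the window expired.

    second_time is None when no second instruction was received, in which
    case the current voice interaction simply ends.
    """
    if second_time is None:
        return False
    return second_time - first_time <= preset_window(first_domain)
```

Keying the window on the first instruction's domain lets a domain that invites follow-ups keep the session open longer than one that rarely does.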
When the voice interaction device is a voice device, after the voice interaction ends the voice interaction device no longer receives voice instructions until it receives a wake-up instruction from the user; after performing the wake-up operation, it resumes receiving the user's voice instructions. When the voice interaction device is the background server corresponding to a voice device, after the voice interaction ends the voice interaction device may send a stop-interaction instruction to the voice device so that the voice device no longer receives voice instructions; once the voice device receives a wake-up instruction from the user and performs the wake-up operation, it again receives voice instructions and sends them to the voice interaction device.
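The listening behaviour after an interaction ends can be modelled as a two-state device. The sketch below is a toy model, not the patented implementation; the class name `VoiceDevice`, the placeholder wake word, and the exact string matching are assumptions made for illustration.

```python
class VoiceDevice:
    """Toy model of a voice device that accepts input only while a session is open."""

    def __init__(self, wake_word: str = "hello device"):  # placeholder wake word
        self.wake_word = wake_word
        self.listening = False
        self.accepted = []  # instructions accepted while the session is open

    def end_interaction(self) -> None:
        """Called when the interaction ends, locally or on a server's stop instruction."""
        self.listening = False

    def hear(self, utterance: str) -> bool:
        """Return True if the utterance was acted on, False if it was ignored."""
        if not self.listening:
            if utterance == self.wake_word:
                self.listening = True  # wake-up operation reopens the session
                return True
            return False  # ignored until the user wakes the device
        self.accepted.append(utterance)
        return True
```

In the background-server case, the server would trigger `end_interaction` remotely by sending its stop-interaction instruction.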
Further, the judgment module 22 may specifically be configured to: determine, according to the instruction parsing results, the first skill domain to which the first content acquisition instruction belongs and the second skill domain to which the second content acquisition instruction belongs; if the first skill domain is the same as the second skill domain, determine that the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain; if the first skill domain differs from the second skill domain, query a preset related-domain mapping rule and determine the preset related skill domains corresponding to the first skill domain; if the preset related skill domains include the second skill domain, determine that the second content acquisition instruction and the first content acquisition instruction belong to related skill domains; if the preset related skill domains do not include the second skill domain, determine that the second content acquisition instruction and the first content acquisition instruction belong to neither the same skill domain nor related skill domains.
Alternatively, the judgment module 22 may specifically be configured to: determine, according to the instruction parsing results, the first skill domain to which the first content acquisition instruction belongs and the second skill domain to which the second content acquisition instruction belongs; query the preset related-domain mapping rule and determine the preset related skill domains corresponding to the first skill domain; if the preset related skill domains include the second skill domain, determine that the second content acquisition instruction and the first content acquisition instruction belong to related skill domains; if the preset related skill domains do not include the second skill domain, judge whether the first skill domain is the same as the second skill domain; if they are the same, determine that the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain; if they differ, determine that the second content acquisition instruction and the first content acquisition instruction belong to neither the same skill domain nor related skill domains.
The preset related-domain mapping rule stores the preset related skill domains corresponding to each skill domain.
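The judgment just described amounts to an equality test followed by a table lookup. The following is a minimal sketch, assuming the mapping rule is held as a dictionary from each skill domain to a set of related domains; the domain names, the table contents, and the names `Relation` and `judge` are hypothetical.

```python
from enum import Enum


class Relation(Enum):
    SAME = "same"
    RELATED = "related"
    UNRELATED = "unrelated"


# Hypothetical related-domain mapping rule:
# each skill domain maps to its preset related skill domains.
RELATED_DOMAINS = {
    "weather": {"taxi", "travel"},
    "music": {"radio"},
}


def judge(first_domain: str, second_domain: str,
          mapping: dict = RELATED_DOMAINS) -> Relation:
    """Classify the second instruction's domain relative to the first."""
    if first_domain == second_domain:
        return Relation.SAME
    # The lookup is keyed on the first domain only, so it is directional.
    if second_domain in mapping.get(first_domain, set()):
        return Relation.RELATED
    return Relation.UNRELATED
```

This sketch checks for the same domain first. Checking the mapping rule first, as in the alternative order, gives the same accept-or-ignore decision, because an instruction is accepted whenever either test succeeds.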
Further, the acquisition module 21 is specifically configured to: when it is determined that the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain, obtain content according to the parsing result of the second content acquisition instruction in combination with the parsing result of the first content acquisition instruction; and when it is determined that the second content acquisition instruction and the first content acquisition instruction belong to related skill domains, obtain content according to the parsing result of the second content acquisition instruction.
For example, where the first content acquisition instruction is "I want to listen to Wang Qing Shui" and the second content acquisition instruction is "I want to listen to Liu Dehua", the two instructions belong to the same skill domain, and the corresponding content may be "Wang Qing Shui by Liu Dehua". Where the first content acquisition instruction is "I want to listen to Logical Thinking" and the second content acquisition instruction is "episode 9", the two instructions belong to the same skill domain, and the corresponding content may be "episode 9 of Logical Thinking". Where the first content acquisition instruction is "check the weather" and the second content acquisition instruction is "call me a car, I want to take a taxi to the company", the two instructions belong to related skill domains, and the corresponding content may be "start the ride-hailing function": for example, launching a ride-hailing app based on the current location, automatically entering the company address, and booking the trip.
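One way to picture the difference between the two cases is to treat each parsing result as a dictionary of slots. The sketch below rests on that assumption; the slot names, the rule that the second instruction's slots override the first's on conflict, and the function name `resolve_request` are illustrative choices that the specification does not prescribe.

```python
def resolve_request(first_parse: dict, second_parse: dict,
                    same_domain: bool) -> dict:
    """Build the request used to obtain content for the second instruction.

    same_domain=True:  combine both parsing results; on a conflicting slot
                       the second instruction wins (an assumed policy).
    same_domain=False: related domain, so only the second parsing result
                       is used.
    """
    if same_domain:
        return {**first_parse, **second_parse}
    return dict(second_parse)


# Same skill domain: the follow-up only adds the missing episode slot.
program = {"domain": "audio", "program": "Logical Thinking"}
episode = {"domain": "audio", "episode": 9}
combined = resolve_request(program, episode, same_domain=True)

# Related skill domain: the weather query contributes nothing to the ride request.
weather = {"domain": "weather"}
ride = {"domain": "taxi", "destination": "company"}
standalone = resolve_request(weather, ride, same_domain=False)
```

Here `combined` carries both the program title and the episode number, while `standalone` equals the ride request alone.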
Further, on the basis of the above embodiments, the judgment module 22 is further configured to, before judging whether the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain or to related skill domains, first judge whether the second content acquisition instruction is a wake-up instruction; if the second content acquisition instruction is not a wake-up instruction, the judgment module judges whether the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain or to related skill domains. In addition, the voice interaction device further includes a response module, configured to respond to the wake-up instruction when the second content acquisition instruction is a wake-up instruction.
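The ordering of the two checks can be sketched as a small routing function. Everything named here is an assumption for illustration: the function `route_second_instruction`, the placeholder wake word, the exact-match wake test, and the `is_related` callback standing in for the related-domain mapping rule.

```python
from typing import Callable, Tuple


def route_second_instruction(
    text: str,
    first_domain: str,
    second_domain: str,
    is_related: Callable[[str, str], bool],
    wake_words: Tuple[str, ...] = ("hello device",),  # placeholder wake word
) -> str:
    """Return 'wake', 'fetch', or 'ignore' for a second instruction."""
    # Step 1: the wake-up check comes before any domain comparison.
    if text.strip().lower() in wake_words:
        return "wake"
    # Step 2: only non-wake instructions are compared by skill domain.
    if second_domain == first_domain or is_related(first_domain, second_domain):
        return "fetch"
    return "ignore"
```

Checking for the wake-up instruction first means a wake word is always handled by the response module and is never discarded for belonging to an unrelated domain.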
Further, on the basis of the above embodiments, the device may also include a processing module, configured to, when it is determined that the second content acquisition instruction and the first content acquisition instruction belong to neither the same skill domain nor related skill domains, not respond to the second content acquisition instruction, continue timing, and judge whether a third content acquisition instruction is received before the preset time period expires; if no third content acquisition instruction is received, the current voice interaction ends.
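The "do not respond, keep timing" behaviour can be sketched as a scan over time-stamped instructions. This is a simplified model under assumptions: instructions arrive as `(timestamp, text)` pairs in arrival order, and the `accept` callback (hypothetical) stands in for the same-or-related skill domain judgment.

```python
from typing import Callable, Iterable, Optional, Tuple


def first_accepted(
    events: Iterable[Tuple[float, str]],
    deadline: float,
    accept: Callable[[str], bool],
) -> Optional[str]:
    """Return the first acceptable instruction received before the deadline.

    Returns None when the preset time period expires without one, which
    corresponds to the voice interaction ending.
    """
    for timestamp, instruction in events:
        if timestamp > deadline:
            break  # preset time period expired
        if accept(instruction):
            return instruction
        # Unrelated instruction: do not respond, and keep timing.
    return None
```

An unrelated utterance, such as speech from a television in the room, therefore neither triggers a response nor closes the window for a later instruction from the user.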
The voice interaction device provided by this embodiment receives a first content acquisition instruction and obtains content according to it; if a second content acquisition instruction is received within a preset time period, the device judges whether the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain or to related skill domains; if they do, the device obtains content according to the second content acquisition instruction. This embodiment thus allows a user to express an intention through multiple content acquisition instructions. Moreover, because the domain of noise and other sounds in the surrounding environment is generally unrelated to the skill domain of the user's content acquisition instructions, this embodiment prevents the voice device from mistakenly executing voice instructions picked up from the surrounding environment, which improves voice interaction efficiency and the user's experience with the voice device.
Fig. 3 is a schematic structural diagram of an electronic device provided by an embodiment of the present invention. The electronic device includes:
a memory 1001, a processor 1002, and a computer program stored in the memory 1001 and executable on the processor 1002.
When executing the program, the processor 1002 implements the voice interaction method provided in the above embodiments.
Further, the electronic device also includes:
a communication interface 1003, for communication between the memory 1001 and the processor 1002.
The memory 1001 is configured to store the computer program executable on the processor 1002.
The memory 1001 may include high-speed RAM, and may also include non-volatile memory, for example at least one magnetic disk memory.
The processor 1002 is configured to implement the voice interaction method described in the above embodiments when executing the program.
If the memory 1001, the processor 1002, and the communication interface 1003 are implemented independently, they may be connected to one another by a bus and communicate with one another over it. The bus may be an Industry Standard Architecture (ISA) bus, a Peripheral Component Interconnect (PCI) bus, an Extended Industry Standard Architecture (EISA) bus, or the like. The bus may be divided into an address bus, a data bus, a control bus, and so on. For ease of illustration, the bus is shown as a single thick line in Fig. 3, but this does not mean that there is only one bus or only one type of bus.
Optionally, in a specific implementation, if the memory 1001, the processor 1002, and the communication interface 1003 are integrated on a single chip, they may communicate with one another through internal interfaces.
The processor 1002 may be a central processing unit (CPU), an application-specific integrated circuit (ASIC), or one or more integrated circuits configured to implement the embodiments of the present invention.
The present invention also provides a computer-readable storage medium on which a computer program is stored; when the program is executed by a processor, the voice interaction method described above is implemented.
The present invention also provides a computer program product; when the instructions in the computer program product are executed by a processor, the voice interaction method described above is implemented.
In this specification, a description that refers to the terms "one embodiment", "some embodiments", "example", "specific example", or "some examples" means that a specific feature, structure, material, or characteristic described in connection with that embodiment or example is included in at least one embodiment or example of the present invention. Schematic uses of these terms in this specification do not necessarily refer to the same embodiment or example. Moreover, the specific features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples. In addition, provided they do not conflict, those skilled in the art may combine the different embodiments or examples described in this specification and the features of those embodiments or examples.
In addition, the terms "first" and "second" are used for descriptive purposes only and are not to be understood as indicating or implying relative importance or as implicitly indicating the number of the technical features referred to. Thus, a feature qualified by "first" or "second" may explicitly or implicitly include at least one such feature. In the description of the present invention, "a plurality of" means at least two, for example two or three, unless specifically defined otherwise.
Any process or method description in a flowchart or otherwise described herein may be understood as representing a module, segment, or portion of code that includes one or more executable instructions for implementing the steps of a custom logic function or process. The scope of the preferred embodiments of the present invention includes other implementations in which functions may be performed out of the order shown or discussed, including substantially concurrently or in the reverse order depending on the functions involved, as should be understood by those skilled in the art to which the embodiments of the present invention belong.
The logic and/or steps represented in a flowchart or otherwise described herein, for example an ordered list of executable instructions that may be regarded as implementing logic functions, may be embodied in any computer-readable medium for use by, or in connection with, an instruction execution system, apparatus, or device (such as a computer-based system, a system including a processor, or another system that can fetch instructions from an instruction execution system, apparatus, or device and execute them). For the purposes of this specification, a "computer-readable medium" may be any means that can contain, store, communicate, propagate, or transport a program for use by, or in connection with, an instruction execution system, apparatus, or device. More specific examples (a non-exhaustive list) of computer-readable media include: an electrical connection (electronic device) having one or more wires, a portable computer diskette (magnetic device), random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), an optical fiber device, and a portable compact disc read-only memory (CD-ROM). In addition, the computer-readable medium may even be paper or another suitable medium on which the program is printed, since the program can be obtained electronically, for example by optically scanning the paper or other medium and then editing, interpreting, or otherwise processing it in a suitable manner if necessary, and then stored in a computer memory.
It should be understood that each part of the present invention may be implemented in hardware, software, firmware, or a combination thereof. In the above embodiments, multiple steps or methods may be implemented in software or firmware that is stored in a memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, any one or a combination of the following techniques well known in the art may be used: discrete logic circuits having logic gates for implementing logic functions on data signals, application-specific integrated circuits having suitable combinational logic gates, programmable gate arrays (PGA), field-programmable gate arrays (FPGA), and so on.
Those of ordinary skill in the art will understand that all or part of the steps carried by the methods of the above embodiments may be completed by a program instructing the relevant hardware. The program may be stored in a computer-readable storage medium and, when executed, performs one of the steps of the method embodiments or a combination of them.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing module, each unit may exist physically on its own, or two or more units may be integrated into one module. The integrated module may be implemented in the form of hardware or in the form of a software functional module. If the integrated module is implemented in the form of a software functional module and is sold or used as an independent product, it may also be stored in a computer-readable storage medium.
The storage medium mentioned above may be a read-only memory, a magnetic disk, an optical disc, or the like. Although embodiments of the present invention have been shown and described above, it is to be understood that the above embodiments are exemplary and are not to be construed as limiting the present invention; those of ordinary skill in the art may change, modify, replace, and vary the above embodiments within the scope of the present invention.

Claims (10)

1. A voice interaction method, characterized by comprising:
receiving a first content acquisition instruction, and obtaining content according to the first content acquisition instruction;
if a second content acquisition instruction is received within a preset time period, judging whether the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain or to related skill domains;
if it is determined that the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain or to related skill domains, obtaining content according to the second content acquisition instruction.
2. The method according to claim 1, characterized by further comprising:
if it is determined that the second content acquisition instruction and the first content acquisition instruction belong to neither the same skill domain nor related skill domains, not responding to the second content acquisition instruction.
3. The method according to claim 1, characterized in that:
if it is determined that the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain, obtaining content according to the second content acquisition instruction specifically comprises:
obtaining content according to the parsing result of the second content acquisition instruction in combination with the parsing result of the first content acquisition instruction;
if it is determined that the second content acquisition instruction and the first content acquisition instruction belong to related skill domains, obtaining content according to the second content acquisition instruction specifically comprises:
obtaining content according to the parsing result of the second content acquisition instruction.
4. The method according to claim 1, characterized in that the preset time period is determined according to the skill domain to which the first content acquisition instruction belongs.
5. The method according to claim 1, characterized in that
judging whether the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain or to related skill domains specifically comprises:
determining, according to the instruction parsing results, the first skill domain to which the first content acquisition instruction belongs and the second skill domain to which the second content acquisition instruction belongs;
if the first skill domain is the same as the second skill domain, determining that the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain;
if the first skill domain differs from the second skill domain, querying a preset related-domain mapping rule and determining the preset related skill domains corresponding to the first skill domain;
if the preset related skill domains include the second skill domain, determining that the second content acquisition instruction and the first content acquisition instruction belong to related skill domains.
6. The method according to claim 1, characterized in that, before judging whether the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain or to related skill domains, the method further comprises:
determining that the second content acquisition instruction is not a wake-up instruction.
7. A voice interaction device, characterized by comprising:
an acquisition module, configured to receive a first content acquisition instruction and obtain content according to the first content acquisition instruction;
a judgment module, configured to, when a second content acquisition instruction is received within a preset time period, judge whether the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain or to related skill domains;
the acquisition module being further configured to obtain content according to the second content acquisition instruction when it is determined that the second content acquisition instruction and the first content acquisition instruction belong to the same skill domain or to related skill domains.
8. An electronic device, characterized by comprising: a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor, when executing the program, implements the voice interaction method according to any one of claims 1 to 6.
9. A non-transitory computer-readable storage medium on which a computer program is stored, characterized in that the program, when executed by a processor, implements the voice interaction method according to any one of claims 1 to 6.
10. A computer program product, characterized in that, when the instructions in the computer program product are executed by a processor, the voice interaction method according to any one of claims 1 to 6 is performed.
CN201711446766.2A 2017-12-27 2017-12-27 Voice interaction method and device Active CN108962235B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711446766.2A CN108962235B (en) 2017-12-27 2017-12-27 Voice interaction method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711446766.2A CN108962235B (en) 2017-12-27 2017-12-27 Voice interaction method and device

Publications (2)

Publication Number Publication Date
CN108962235A true CN108962235A (en) 2018-12-07
CN108962235B CN108962235B (en) 2021-09-17

Family

ID=64495731

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711446766.2A Active CN108962235B (en) 2017-12-27 2017-12-27 Voice interaction method and device

Country Status (1)

Country Link
CN (1) CN108962235B (en)

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040138885A1 (en) * 2003-01-09 2004-07-15 Xiaofan Lin Commercial automatic speech recognition engine combinations
US9098467B1 (en) * 2012-12-19 2015-08-04 Rawles Llc Accepting voice commands based on user identity
CN103594089A (en) * 2013-11-18 2014-02-19 联想(北京)有限公司 Voice recognition method and electronic device
CN105448293A (en) * 2014-08-27 2016-03-30 北京羽扇智信息科技有限公司 Voice monitoring and processing method and voice monitoring and processing device
CN105404161A (en) * 2015-11-02 2016-03-16 百度在线网络技术(北京)有限公司 Intelligent voice interaction method and device
CN105810194A (en) * 2016-05-11 2016-07-27 北京奇虎科技有限公司 Voice control information acquisition method under standby state and intelligent terminal
CN106648530A (en) * 2016-11-21 2017-05-10 海信集团有限公司 Voice control method and terminal
CN107293293A (en) * 2017-05-22 2017-10-24 深圳市搜果科技发展有限公司 A kind of voice instruction recognition method, system and robot

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109960754A (en) * 2019-03-21 2019-07-02 珠海格力电器股份有限公司 A kind of speech ciphering equipment and its voice interactive method, device and storage medium
CN113327609A (en) * 2019-04-23 2021-08-31 百度在线网络技术(北京)有限公司 Method and apparatus for speech recognition
CN113327609B (en) * 2019-04-23 2022-06-28 百度在线网络技术(北京)有限公司 Method and apparatus for speech recognition
CN110838292A (en) * 2019-09-29 2020-02-25 广东美的白色家电技术创新中心有限公司 Voice interaction method, electronic equipment and computer storage medium

Also Published As

Publication number Publication date
CN108962235B (en) 2021-09-17

Similar Documents

Publication Publication Date Title
CN107680591A (en) Voice interactive method, device and its equipment based on car-mounted terminal
US11080337B2 (en) Storage edge controller with a metadata computational engine
CN108538305A (en) Audio recognition method, device, equipment and computer readable storage medium
CN107591151A (en) Far field voice awakening method, device and terminal device
CN107977415B (en) Automatic question-answering method and device
TWI709866B (en) Equipment model identification method, device and processing equipment
JP6811755B2 (en) Voice wake-up method by reading, equipment, equipment and computer-readable media, programs
US11106896B2 (en) Methods and apparatus for multi-task recognition using neural networks
CN108962235A (en) Voice interactive method and device
CN107704275A (en) Smart machine awakening method, device, server and smart machine
CN107315772B (en) The problem of based on deep learning matching process and device
CN106210545A (en) Video shooting method and device and electronic equipment
CN108181992A (en) Voice awakening method, device, equipment and computer-readable medium based on gesture
US11398228B2 (en) Voice recognition method, device and server
CN107729210A (en) The abnormality diagnostic method and device of Distributed Services cluster
CN109040471A (en) Emotive advisory method, apparatus, mobile terminal and storage medium
CN106021403B (en) Client service method and device
CN107103906A (en) It is a kind of to wake up method, smart machine and medium that smart machine carries out speech recognition
CN109359196A (en) Text Multimodal presentation method and device
CN109461448A (en) Voice interactive method and device
CN110111789A (en) Voice interactive method, calculates equipment and computer-readable medium at device
CN108960836A (en) Voice payment method, apparatus and system
CN104361311A (en) Multi-modal online incremental access recognition system and recognition method thereof
CN109979437A (en) Audio recognition method, device, equipment and storage medium
CN112541450A (en) Context awareness function control method and related device

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant