WO2018157388A1 - Wake-up method and device for robot, and robot - Google Patents

Wake-up method and device for robot, and robot Download PDF

Info

Publication number
WO2018157388A1
WO2018157388A1 PCT/CN2017/075588 CN2017075588W WO2018157388A1 WO 2018157388 A1 WO2018157388 A1 WO 2018157388A1 CN 2017075588 W CN2017075588 W CN 2017075588W WO 2018157388 A1 WO2018157388 A1 WO 2018157388A1
Authority
WO
WIPO (PCT)
Prior art keywords
wake
command
robot
word
voice
Prior art date
Application number
PCT/CN2017/075588
Other languages
French (fr)
Chinese (zh)
Inventor
骆磊
Original Assignee
深圳前海达闼云端智能科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳前海达闼云端智能科技有限公司 filed Critical 深圳前海达闼云端智能科技有限公司
Priority to CN201780000607.1A priority Critical patent/CN107223280B/en
Priority to PCT/CN2017/075588 priority patent/WO2018157388A1/en
Publication of WO2018157388A1 publication Critical patent/WO2018157388A1/en

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B25HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
    • B25JMANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
    • B25J11/00Manipulators not otherwise provided for
    • B25J11/0005Manipulators having means for high-level communication with users, e.g. speech generator, face recognition means
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/19Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/226Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics

Definitions

  • Embodiments of the present invention relate to the field of artificial intelligence automatic control, for example, to a robot wake-up method, apparatus, and robot.
  • robots bring a lot of convenience to human production and life.
  • the robot can pre-set the wake-up words.
  • the robot hears a specific wake-up word, it knows that the user is calling himself.
  • the user can issue a voice command to the robot in the form of a wake-up word plus command content, for example, "Mike (awake word), what is the weather today (command content)?", the robot that hears the voice command will parse the voice command.
  • the wake-up word is parsed.
  • the robot will be able to discriminate that it is calling itself, and the content after the wake-up word in the voice command is used as the command content, that is, the user is identified today.
  • the weather, so you can directly answer, as shown in Figure 1 shows the user to control a robot.
  • the inventors have found that at least the following problems exist in the related art: when a same user owns a plurality of robots, and a certain task requires multiple robots to complete together, the existing method will no longer be applicable.
  • the user command "Mike, Tom, Jerry, help me clean up the room”
  • the command content of the wake-up word for Mike's robot is "Tom, Jerry, help me clean up the room.” Since the command content cannot be parsed correctly, the robot will not be able to complete the task that the user scholares.
  • An object of the embodiments of the present invention is to provide a new robot wake-up method, device, and robot.
  • the robot can correctly parse the command content, thereby correctly completing the task that the user scholares. .
  • an embodiment of the present invention provides a robot wake-up method, where the wake-up method is applied to a robot, and the method includes:
  • the wake-up word library includes at least two wake-up words, and the wake-up words in the wake-up word library are used for Wake up at least two robots corresponding to one or more wake words;
  • an embodiment of the present invention further provides a robot wake-up device, where the wake-up device should For a robot, the device includes:
  • a voice command acquisition module configured to acquire a voice command
  • a voice command parsing module configured to parse an awakening word and a command content in the voice command according to the voice command and a preset wake-up term library, where the wake-up word library includes at least two wake-up words, the wake-up word The wake-up words in the library are used to wake up at least two robots, the robot corresponding to one or more wake-up words;
  • An execution module is configured to perform an operation according to the parsed wake-up words and command content.
  • an embodiment of the present invention further provides a robot, including:
  • At least one processor and,
  • the memory stores instructions executable by the at least one processor, the instructions being executed by the at least one processor to enable the at least one processor to perform the method as described above.
  • the wake-up method and device provided by the embodiment of the present invention can preset the wake-up words corresponding to the plurality of robots to the wake-up words by preset a wake-up word library including at least two wake-up words in the robot, for example, the user owns multiple robots.
  • a wake-up word library including at least two wake-up words in the robot, for example, the user owns multiple robots.
  • the robot can correctly parse the wake-up words contained in the voice command according to the preset wake-up vocabulary, thereby correctly parsing the command content in the voice command, and thus correct. Complete the task that the user admirees.
  • FIG. 1 is a schematic diagram of a user controlling a robot in the prior art
  • FIG. 2 is a schematic diagram of an application scenario of the method and apparatus of the present invention.
  • FIG. 3 is a flow chart of one embodiment of a wake-up method of the present invention.
  • 4a is a flow chart showing steps of acquiring a voice command in an embodiment of the wake-up method of the present invention
  • 4b is a flowchart of a step of acquiring a voice command in an embodiment of the wake-up method of the present invention
  • FIG. 5 is a flow chart showing steps of parsing a voice command in an embodiment of the wake-up method of the present invention
  • Figure 6a is a flow chart showing the steps of executing a voice command in one embodiment of the wake-up method of the present invention
  • Figure 6b is a flow chart showing the steps of executing a voice command in one embodiment of the wake-up method of the present invention
  • Figure 7a is a flow chart of one embodiment of a wake-up method of the present invention.
  • Figure 7b is a flow chart of one embodiment of the wake-up method of the present invention.
  • Figure 8 is a schematic structural view of an embodiment of the wake-up device of the present invention.
  • FIG. 9 is a schematic structural diagram of a voice command acquiring module in an embodiment of the wake-up device of the present invention.
  • Figure 10 is a schematic structural view of an embodiment of the wake-up device of the present invention.
  • Figure 11 is a block diagram showing the structure of an embodiment of the wake-up device of the present invention.
  • FIG. 12 is a schematic diagram showing the hardware structure of a waking method according to an embodiment of the present invention.
  • the robot wake-up method and apparatus provided by the present invention are applicable to an application scenario as shown in FIG. 2, and include a plurality of robots 20, which can communicate with each other through a network 30, wherein the network 30 can be, for example, a home or a company. LAN, or a specific network, etc.
  • the bot 20 has at least one network interface that establishes a communication connection with the network 30 to retrieve data or instructions from the network 30.
  • the user 10 can set or issue commands to the plurality of robots 20.
  • Each robot has its corresponding wake-up word for waking itself from the sleep state or responding to the user's call (the wake-up word is generally one or more).
  • the wake-up word may be a name, an identifier or any other vocabulary of the robot, and the wake-up word may be set by the user or may be provided at the factory.
  • Each robot has a wake-up vocabulary for placing wake-up words, which can be shipped from the factory or set by the user.
  • the same wake-up vocabulary can be shared between the robots of the same user.
  • the wake-up words of the three robots are Mike, Tom, and Jerry (here Each robot corresponds to a wake-up word as an example, but it is not limited to this.
  • Each robot can also associate more than two wake-up words. Then, the wake-up vocabularies of the three robots can be set to include Mike, Tom, and Jerry.
  • the acoustic model can be used to resolve the wake-up words in the voice command.
  • the wake-up words in the wake-up lexicon use the phoneme sequence corresponding to the wake-up word, and the phoneme sequence is decoded according to the voice and the preset acoustic model. The phoneme sequence is then matched with the wake-up word phoneme sequence to resolve the wake-up word. After the wake-up word is parsed, the content behind the wake-up word in the voice command is used as the command content.
  • the semantic analysis of the command content in the voice command requires the command lexical method file to be constructed in advance. The command content sent by the user needs to exist in the pre-built command lexical file, and the semantics of the command content is parsed according to the voice and command lexical file sent by the user.
  • the update of the wakeup vocabulary can be set by the user or can be done by the robot itself.
  • the current wake-up vocabulary includes three wake-up words: Mike, Tom, and Jerry.
  • the newly added robot will The other wake-up words John are broadcasted to other robots through the network.
  • other robots After receiving the wake-up words, other robots will add the wake-up words to the wake-up vocabulary and send the updated wake-up vocabulary to the newly added robot.
  • An embodiment of the present invention provides a robot wake-up method, which may be performed by any of the robots shown in FIG. 2, as shown in FIG. 3, which is a flowchart of an embodiment of the wake-up method, and the wake-up method.
  • Step 101 Acquire a voice command.
  • a microphone can be placed on the robot for receiving voice signals in real time.
  • the voice command may be a voice signal received in real time.
  • the user makes a voice, but it is not necessarily a voice command for the robot. Therefore, it is necessary to further judge the voice information. If the voice message is a command for the robot corresponding to any wake-up word in the wake-up dictionary, Voice information as a voice command.
  • Step 102 Parsing an awakening word and a command content in the voice command according to the voice command and a preset wake-up term library, where the wake-up word library includes a plurality of wake-up words, and the wake-up words in the wake-up word library Corresponding to at least two robots;
  • the command content may be a voice corresponding to the command content intercepted from the voice command sent by the user, or may be a result of semantically parsing the voice. If it is the former, the robot needs to perform semantic analysis when performing the operation corresponding to the command content.
  • the plurality of ones include one or more, that is, the wake-up vocabulary may include one or more wake-up words.
  • Step 103 Perform an operation according to the parsed wake-up word and the command content.
  • the robot parses the wake-up word and the command content, if the wake-up word in the parsing includes the corresponding wake-up word, the operation corresponding to the command content is performed, and if the corresponding wake-up word is not included, no operation is performed.
  • the command content is sent to other robots corresponding to the wake-up word, and the robot corresponding to the wake-up word completes the task that the user scholares.
  • the robot may also decompose the task that the user scholares into several subtasks according to the command content, notify the subtasks of the other robots corresponding to the wakeup words, and coordinate the tasks corresponding to the user by the robot corresponding to the wakeup words.
  • steps 101, 102, and 103 are not necessarily performed by each robot, and it is possible for any robot to perform all steps or only one or two of them.
  • Figure 2 The application scenario shown is an example.
  • the robots Mike, Tom, and Jerry can respectively obtain the voice commands issued by the user, and after parsing the wake-up words and the command content, if they find that they are the command objects of the user, the operations are performed according to the parsed command content. In this case, all three robots perform steps 101, 102, and 103. It is also possible that only the robot Mike hears the voice command issued by the user.
  • Step 102 may be performed by only one robot or several robots, and the executed robot will share the parsed wake-up words and command content to other robots through the network.
  • the wake-up method provided by the embodiment of the present invention can preset the wake-up words corresponding to the plurality of robots in the wake-up word library by preset a wake-up word library including a plurality of wake-up words in the robot, for example, the user owns multiple robots.
  • the robot can correctly parse the wake-up words contained in the voice command according to the preset wake-up vocabulary, thereby correctly parsing the command content in the voice command, thereby completing the user's account. task.
  • the acquiring a voice command includes the following steps;
  • Step 1011 Listening to voice information
  • Each robot will listen to the voice information sent by the user in real time
  • Step 1012 Confirm whether the voice information is a command for a robot corresponding to the wakeup vocabulary
  • the voice information is a command issued by the user for the robot corresponding to any wake-up word in the wake-up dictionary, and if yes, the voice information is a voice command issued by the user.
  • the voice information is a command for the robot corresponding to the wakeup vocabulary
  • the word continues to determine whether the appearance of the wake-up word is a user's command or call to the robot, rather than the user mentioning the robot in a call with other people.
  • the acquiring a voice command includes:
  • Step 1014 Listening to voice information
  • Each robot will listen to the voice information sent by the user in real time
  • Step 1015 Confirm whether the voice information is a command for a robot corresponding to the wakeup vocabulary
  • Step 1016 If the voice information is a command for the robot corresponding to the wakeup vocabulary, record the voice information and the start time of the voice information, and add a preset temporary command record group;
  • the robot monitors the voice information sent by the user, it further determines whether the voice information is a command issued by the user for the robot corresponding to any wake-up word in the wake-up dictionary, and if so, records the voice information and the robot hears The moment of the voice message, and join the preset temporary command record group.
  • the robot can store the voice information and the moment when the voice information is heard into the cache of the robot.
  • the purpose of establishing a temporary command group is to obtain relatively complete and clear user commands. Take the three robots shown in Figure 2 as an example. Suppose the voice command issued by the user is “Mike, Tom, Jerry, help me clean up the room”.
  • whether the voice information is a command for the robot corresponding to the wakeup vocabulary first confirm whether the voice information includes any wake-up words in the preset wake-up vocabulary, if any wake-up in the wake-up vocabulary is included The word continues to determine whether the appearance of the wake-up word is a user's command or call to the robot, rather than the user mentioning the robot in a call with other people. Determining whether the voice information sent by the user is a command or a call to the robot, and determining whether the time interval between the wake-up word and the following voice content is more than a preset time. If the preset time is exceeded, the voice information is Command or call to the robot. Or by judging whether there is other voice content in front of the first wake-up word, if there is no other voice content, the voice information is an order or call to the robot.
  • Step 1017 Confirm the earliest time in the start time of the robot record in the temporary command record group, and determine the start time period according to the earliest time;
  • the starting time of the robot record in the temporary command record group is compared, and the earliest time is determined because the voice information recorded at the earliest time is relatively complete.
  • the earliest time is t1
  • the earliest time t1 is taken as the starting point
  • the starting time period is determined from t1 to t1+t0 according to an empirical setting threshold t0.
  • the setting of t0 can be set according to the performance and experience of the robot, for example, 0.1s.
  • the response of different robots is fast and slow, so it is necessary to set a time difference.
  • the set time difference should be less than this interval. In case of missing a wake-up word.
  • Step 1018 Obtain the voice information with the highest resolution among the voice information whose starting time is located in the starting time period as a voice command.
  • the robot in the temporary command recording group will judge the clarity of the voice information buffered by itself, and obtain the score value x, and take the highest-resolution voice information in the starting time period (t1 to t1+t0). As a voice command.
  • steps 1014, 1015, 1016, 1017, and 1018 are not necessarily performed by each robot.
  • the robot that hears the voice information performs steps 1014, 1015, and 1016, but steps 1017 and 1018 may only Executed by one robot or several robots, each robot can broadcast its own working state to other robots, which is executed by the most idle robot, and then the executed robot will share the execution results to other robots through the network.
  • each robot records the start time of the voice information, and joins the temporary command record group. By comparing the start time of the robot record in the temporary command record group, the earliest time when the voice information is heard can be determined. Get a relatively complete voice command. By determining the speech information with the highest resolution in the initial time period as the voice command, relatively complete and clear voice commands can be obtained, and the reliability of acquiring the user voice command is enhanced. The clarity of the voice command facilitates the correct interpretation of the voice command, and the integrity of the voice command can ensure that the wake-up words sent by the user can be parsed as completely as possible, and the cooperation between the robots corresponding to the wake-up words can be facilitated.
  • the wakeup word and the command content in the voice command are parsed according to the voice command and the preset wakeup vocabulary.
  • Step 1021 Parse the wake-up words in the voice command according to the voice command and the wake-up vocabulary
  • the grammatical format of the user's voice command to the robot is generally as follows:
  • ⁇ name> is the wake-up word of a certain robot, and the number of occurrences of ⁇ name> may be one or more;
  • the user voice command is "Zhang San, Li Si, and Wang Wu, help me prepare lunch before twelve o'clock.”
  • This sentence can be applied according to the above grammatical format, then it can be judged “Zhang San, Li Si, Wang Wu "The awakening words for the three robots, "also” is a conjunction or a spoken word, "help me prepare lunch before twelve o'clock” as the command content.
  • the wake-up words in the voice command can be parsed.
  • Step 1022 Parse the command content in the voice command according to the voice command and the wake-up word.
  • the content behind the wake-up word in the voice command is the actual command content of the user, and the semantic meaning of the command content can be used to know the user's true intention.
  • steps 1021 and 1022 are not necessarily performed by each robot, and may be performed by only one robot or several robots. Each robot can broadcast its own working state to other robots, and the most idle robots. To execute.
  • the performing operations according to the parsed wake-up words and command content includes:
  • Step 1031 Notifying the robot corresponding to the wake-up word with the command content, so that the robot corresponding to the wake-up word performs an operation corresponding to the command content.
  • the performing operations according to the parsed wake-up words and command content including:
  • Step 1032 According to the command content decomposition task, respectively notify the decomposed task to the robot corresponding to the wake-up word, so that the robot corresponding to the wake-up word cooperates to perform the operation corresponding to the command content.
  • the robot that parses the wake-up word and the command content may also be another robot that acquires the wake-up word and the command content through the network, and may notify other robots corresponding to the wake-up word to wait.
  • Tasks so that each robot establishes a task group, and the task group members start to synchronize (such as sharing location information, unique identifier, own ability, etc.), and then wait for the task.
  • the robot that parses the voice command combines the position of the robot and its own capabilities according to the command content, disassembles the task that the user scholares, divides it into several subtasks, and then sends the disassembled subtask to other members in the task group, the task group.
  • the robots within the collaboration cooperate to complete the tasks that the user scholares. For example, taking Figure 2 as an example, assuming that the robot that parses the voice command is Mike, Mike divides the task of cleaning up the room into three subtasks: cleaning up the living room and bedroom, cleaning up the bathroom, and cleaning up the kitchen. Mike can clean up the bathroom and clean up the kitchen. The missions were sent to Tom and Jerry to clean up the living room and bedroom.
  • the main body of the task is no longer a separate robot, but a plurality of robots are combined to perform tasks, and the execution efficiency is higher and the user experience is good.
  • the waking method includes:
  • Step 201 Listening to voice information
  • Each robot will listen to the voice information sent by the user in real time
  • Step 202 Confirm whether the voice information is a command for a robot corresponding to the wakeup vocabulary
  • Step 203 If the voice information is a command for the robot corresponding to the wakeup vocabulary, the voice information is recorded as a voice command.
  • Each robot that listens to the voice information confirms whether the voice information is a command issued by the user for the robot corresponding to any wake-up word in the wake-up vocabulary, and if so, the voice information is used as a voice command.
  • Step 204 Parse the wake-up word and the command content in the voice command according to the voice command and the preset wake-up dictionary.
  • each robot that listens to a voice command parses the voice command and parses the wake word and the command content.
  • Step 205 Notifying the robot corresponding to the wake-up word with the command content, so that the robot corresponding to the wake-up word performs an operation corresponding to the command content.
  • the robot that parses the voice command if the parsed wake-up word does not include itself, sends the command content to the robot corresponding to the other wake-up words; if the parsed wake-up word includes itself, starts executing the operation corresponding to the command content, and the command The content is sent to other related robots. If the robot parses the command content itself, the operation is performed according to the command content that is parsed by itself, and if the command content is not parsed by itself, the operation is performed according to the command content sent by the other robot.
  • the robot can correctly parse the wake-up words contained in the voice command according to the preset wake-up vocabulary by preset a wake-up vocabulary including a plurality of wake-up words in the robot, thereby correctly analyzing The content of the command in the voice command is completed, and the task that the user scholares is correctly completed.
  • the user can issue commands to multiple robots at the same time, and multiple robots can jointly perform the tasks issued by the user, and the execution efficiency is higher and the user experience is good.
  • the waking method includes:
  • Step 301 Listening to voice information
  • Each robot will listen to the voice information sent by the user in real time
  • Step 302 Confirm whether the voice information is a command for a robot corresponding to the wakeup vocabulary
  • Step 303 If the voice information is a command for the robot corresponding to the wakeup vocabulary, record the voice information and the start time of the voice information, and add a preset temporary command record group;
  • Each robot that listens to the voice information confirms whether the voice information is a command issued by the user for the robot corresponding to any wake-up word in the wake-up dictionary, and if so, records the voice information and listens to the start of the voice information. At the moment, and join the preset temporary command record group. Then, the robot that joins the temporary command record group will broadcast the temporary command record group to other robots to Make the members of the temporary command record group aware of the existence of other group members.
  • Step 304 Confirm the earliest time in the start time of the robot record in the temporary command record group, and determine the start time period according to the earliest time;
  • the robot in the temporary command record group broadcasts the start time of its own record to other robots in the group, and a certain robot determines the earliest time in each start time and determines the start time period based on the earliest time.
  • Step 305 Obtain voice information with the highest resolution among the voice information whose starting time is located in the starting time period as a voice command.
  • the robot in the temporary command recording group After the voice information recording is finished, the robot in the temporary command recording group performs the resolution determination on the voice information recorded by itself, obtains the sharpness score value, and then broadcasts the sharpness score value to other robots in the temporary command record group, a certain robot It will find the highest-resolution voice information in the starting time period.
  • Step 306 Parsing the wake-up words and the command content in the voice command according to the voice command and the preset wake-up term library
  • Step 307 According to the command content decomposition task, respectively notify the decomposed task to the robot corresponding to the wake-up word, so that the robot corresponding to the wake-up word cooperates to perform the operation corresponding to the command content.
  • steps 304, 305, 306, and 307 are not necessarily performed by each robot, and each robot can broadcast its own working state to other robots, and is executed by the most idle robot.
  • the robot can correctly parse the wake-up words contained in the voice command according to the preset wake-up vocabulary by preset a wake-up vocabulary including a plurality of wake-up words in the robot, thereby correctly analyzing The content of the command in the voice command is completed, and the task that the user scholares is correctly completed.
  • the robot By recording the start time of the voice information monitored by each robot and adding a temporary command record group, by comparing the start time of the robot record in the temporary command record group, the earliest time to hear the voice information can be determined, and a relatively complete can be obtained. Voice command.
  • the speech information with the highest resolution in the initial time period as the voice command, relatively complete and clear voice commands can be obtained, and the reliability of acquiring the user voice command is enhanced.
  • the user can issue commands to multiple robots at the same time, and multiple robots can complete the tasks issued by the users in a coordinated manner, and the execution efficiency is higher and the user experience is good.
  • the method further includes:
  • the newly added robot When the user purchases another robot and establishes a communication connection with other robots through the network, the newly added robot will broadcast its wake-up words to other robots through the network, and the robot receiving the wake-up words will synchronize the wake-up words to The own wake-up vocabulary, and send the synchronized corpus of wake-up vocabulary back to the newly added robot.
  • the method further includes:
  • each robot After the task assignment is completed, each robot starts to perform the task. At this time, the temporary command record group can be dismissed, the memory is released, and the resource utilization rate is improved.
  • the embodiment of the present invention further provides a robot wake-up device, which is disposed in any of the robots shown in FIG. 2.
  • the wake-up device 400 includes:
  • the voice command obtaining module 401 is configured to acquire a voice command.
  • the voice command parsing module 402 is configured to parse the wake-up word and the command content in the voice command according to the voice command and the preset wake-up vocabulary, where the wake-up vocabulary includes a plurality of wake-up words, the wake-up word
  • the wake-up words in the library correspond to at least two robots;
  • the executing module 403 is configured to perform an operation according to the parsed wake-up word and the command content.
  • the wake-up device provided by the embodiment of the present invention can correct the wake-up words contained in the voice command according to the preset wake-up vocabulary by presetting a wake-up vocabulary including a plurality of wake-up words in the robot, thereby correctly analyzing The content of the command in the voice command is completed, and the task that the user sufferes is correctly completed.
  • the voice command obtaining module 401 includes:
  • a voice command confirmation submodule configured to confirm whether the voice information is a command for a robot corresponding to the wakeup vocabulary
  • the first voice command acquisition submodule is configured to record the voice information as a voice command if the voice information is a command for a robot corresponding to the wakeup vocabulary.
  • the voice command obtaining module 401 includes:
  • a voice information monitoring sub-module 4011 configured to monitor voice information
  • a voice command confirmation sub-module 4012 configured to confirm whether the voice information is a command for a robot corresponding to the wake-up vocabulary
  • the voice command recording sub-module 4013 is configured to record the voice information and the start time of the voice information if the voice information is a command for the robot corresponding to the wake-up vocabulary, and add a preset temporary command record group. ;
  • a start time period confirmation sub-module 4014 configured to confirm an earliest time in a start time of the robot record in the temporary command record group, and determine a start time period according to the earliest time;
  • the second voice command acquisition sub-module 4015 is configured to obtain voice information with the highest resolution among the voice information whose starting time is located in the start time period as a voice command.
  • the voice command confirmation sub-module includes:
  • a voice command confirmation subunit configured to: if the voice information includes any one of a preset wakeup vocabulary The wake-up word and the appearance of the wake-up word are called, and the voice information is a command for the robot corresponding to the wake-up vocabulary.
  • the awake device 500 includes: a voice command obtaining module 501, a voice command parsing module 502, and an executing module 503,
  • the wakeup vocabulary update module 504 is configured to update the wakeup vocabulary.
  • the wakeup vocabulary update module includes:
  • the first wakeup vocabulary update submodule is configured to set a wakeup word according to the wakeup word setting instruction, and broadcast the wakeup word.
  • the wakeup vocabulary update module further includes:
  • the second wake-up vocabulary update sub-module is configured to receive the broadcasted wake-up word, add the wake-up word to the preset wake-up vocabulary, and send the updated wake-up vocabulary to the robot that broadcasts the wake-up word.
  • the voice command parsing module includes:
  • the wake-up word parsing sub-module is configured to parse the wake-up words in the voice command according to the voice command and the wake-up vocabulary;
  • the command content parsing sub-module is configured to parse the command content in the voice command according to the voice command and the wake-up word.
  • the execution module includes:
  • the first execution submodule is configured to notify the robot corresponding to the wakeup word of the command content, so that the robot corresponding to the wakeup word performs an operation corresponding to the command content.
  • the execution module includes:
  • the second execution sub-module is configured to respectively notify the robot corresponding to the wake-up word according to the command content decomposition task, so that the robot corresponding to the wake-up word cooperates to perform the operation corresponding to the command content.
  • FIG. 11 is a schematic structural diagram of an embodiment of the wake-up device.
  • the wake-up device 600 includes:
  • the voice command obtaining module 601 is configured to obtain a voice command.
  • the voice command acquiring module 601 includes:
  • a voice information monitoring sub-module 6011 configured to monitor voice information
  • a voice command confirmation sub-module 6012 configured to confirm whether the voice information is a command for a robot corresponding to the wake-up vocabulary
  • the voice command recording sub-module 6013 is configured to record the voice information and the start time of the voice information if the voice information is a command for the robot corresponding to the wake-up vocabulary, and add a preset temporary command record group. ;
  • a start time period confirmation sub-module 6014 for confirming the robot record in the temporary command record group The earliest time in the starting time, determining the starting time period according to the earliest time;
  • the second voice command acquisition sub-module 6015 is configured to obtain voice information with the highest resolution among the voice information whose starting time is located in the start time period as a voice command.
  • the voice command parsing module 602 is configured to parse the wake-up word and the command content in the voice command according to the voice command and the preset wake-up term library, where the wake-up word is used to wake up the robot; wherein the voice command is
  • the parsing module 602 includes:
  • the wake-up word parsing sub-module 6021 is configured to parse the wake-up words in the voice command according to the voice command and the wake-up vocabulary;
  • the command content parsing sub-module 6022 is configured to parse the command content in the voice command according to the voice command and the wake-up word.
  • the executing module 603 is configured to perform an operation according to the parsed wake-up word and the command content, where the executing module 603 includes:
  • the second execution sub-module 6031 is configured to separately notify the robot corresponding to the wake-up word according to the command content decomposition task, so that the robot corresponding to the wake-up word cooperates to perform the operation corresponding to the command content.
  • the wakeup vocabulary update module 604 is configured to update the wakeup vocabulary.
  • the voice information monitoring sub-module 6011 monitors the voice sent by the user in real time
  • the voice command confirmation sub-module 6012 confirms whether the voice information monitored by the voice information monitoring sub-module 6011 is a command of the user for the robot corresponding to the wake-up vocabulary, if the voice information is For the command of the robot corresponding to the vocabulary
  • the voice command recording sub-module 6013 records the voice information and the start time of the voice information, and adds a preset temporary command record group.
  • the robot in the temporary command record group broadcasts the start time of the record to other robots
  • the start time period confirmation sub-module 6014 confirms the earliest time to listen to the user voice command according to each start time, and determines the start according to the earliest time. period.
  • the robot in the temporary command recording group performs the resolution determination on the voice information recorded by itself, obtains the sharpness score value, and then broadcasts the sharpness score value to the other robots in the temporary command record group, the second voice.
  • the command acquisition sub-module 6015 determines the voice information with the highest resolution among the voice information whose start time is located in the start time period as a voice command.
  • the wake-up word analysis sub-module 6021 parses the wake-up word according to the voice command
  • the command content analysis sub-module 6022 parses the command content according to the voice command and the wake-up word.
  • the second execution sub-module 6031 causes the robot corresponding to the wake-up word to cooperate to perform an operation corresponding to the command content according to the command content decomposition task.
  • the wake-up device can correct the wake-up words contained in the voice command according to the preset wake-up vocabulary by presetting a wake-up vocabulary including a plurality of wake-up words in the robot, thereby correctly analyzing The content of the command in the voice command is completed, and the task that the user scholares is correctly completed.
  • the start time of the voice information is monitored by each robot, and the temporary command record group is added, and the voice information can be determined by comparing the start time of the robot record in the temporary command record group. At the earliest moment, you can get relatively complete voice commands. By determining the speech information with the highest resolution in the initial time period as the voice command, relatively complete and clear voice commands can be obtained, and the reliability of acquiring the user voice command is enhanced.
  • the user can issue commands to multiple robots at the same time, and multiple robots can complete the tasks issued by the users in a coordinated manner, and the execution efficiency is higher and the user experience is good.
  • the above-mentioned wake-up device can perform the wake-up method provided by the embodiment of the present invention, and has the corresponding functional modules and beneficial effects of the execution method.
  • the wake-up method provided by the embodiment of the present invention.
  • FIG. 12 is a schematic diagram showing the hardware structure of the robot 700 for the robot wake-up method according to the embodiment of the present invention. As shown in FIG. 12, the robot 700 includes:
  • One or more processors 701 and memory 702, one processor 701 is taken as an example in FIG.
  • the processor 701 and the memory 702 may be connected by a bus or other means, as exemplified by a bus connection in FIG.
  • the memory 702 is a non-volatile computer readable storage medium, and can be used to store non-volatile software programs, non-volatile computer-executable programs, and modules, such as program instructions corresponding to the wake-up method in the embodiment of the present invention.
  • the module (for example, the voice command acquisition module 401, the voice command analysis module 402, and the execution module 403 shown in FIG. 8).
  • the processor 701 executes various functional applications of the server and data processing by executing non-volatile software programs, instructions, and modules stored in the memory 702, that is, implementing the wake-up method of the above method embodiments.
  • the memory 702 may include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application required for at least one function; the storage data area may store data created according to the use of the wake-up device, and the like.
  • memory 702 can include high speed random access memory, and can also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid state storage device.
  • memory 702 can optionally include memory remotely located relative to processor 701 that can be connected to the wake-up device over a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
  • the one or more modules are stored in the memory 702, and when executed by the one or more processors 701, perform the wake-up method in any of the above method embodiments, for example, performing the above described FIG. Method step 101 to step 103, method step 1011 to step 1013 in Fig. 4a, method step 1014 to step 1018 in Fig. 4b, method step 1021 to step 1022 in Fig. 5, method step 1031 in Fig. 6a, Fig. 6b
  • Method step 1032 method steps 201-205 in FIG. 7a, method steps 301-307 in FIG. 7b; implementing modules 401-403 in FIG. 8, sub-modules 4011 and 4015 in FIG. 9, module 501 in FIG. - 504, the functions of the modules 601-604, the sub-module 6011-6015, the sub-module 6021-6022, and the sub-module 6031 in FIG.
  • the above product can perform the method provided by the embodiment of the present invention, and has the corresponding functional modules and beneficial effects of the execution method.
  • the method provided can perform the method provided by the embodiment of the present invention, and has the corresponding functional modules and beneficial effects of the execution method.
  • Embodiments of the present invention provide a non-transitory computer readable storage medium storing computer-executable instructions that are executed by one or more processors, such as in FIG.
  • the processor 701 is configured to enable the one or more processors to perform the wake-up method in any of the foregoing method embodiments, for example, to perform the method steps 101 to 103 in FIG. 3 described above, the method steps in FIG. 4a 1011 to step 1013, method step 1014 to step 1018 in Fig. 4b, method step 1021 to step 1022 in Fig. 5, method step 1031 in Fig. 6a, method step 1032 in Fig. 6b, method step 201 in Fig. 7a - 205, method steps 301-307 in FIG.
  • modules 401-403 in FIG. 8 sub-modules 4011 and 4015 in FIG. 9, modules 501-504 in FIG. 10, modules 601-604, sub-modules in FIG. Functions of 6011-6015, submodule 6021-6022, and submodule 6031.
  • the device embodiments described above are merely illustrative, wherein the units described as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units, ie may be located A place, or it can be distributed to multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • the storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), or a random access memory (RAM).

Landscapes

  • Engineering & Computer Science (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • Robotics (AREA)
  • Mechanical Engineering (AREA)
  • Manipulator (AREA)

Abstract

A wake-up method for a robot, comprising: acquiring a voice command (101); parsing the wake-up words and command content in the voice command according to the voice command and a preconfigured wake-up word database (102), the wake-up word database comprising at least two wake-up words, and the wake-up words in the wake-up word database corresponding to at least two robots; and executing an operation according to the parsed wake-up words and command content (103). The method can correctly parse the command content in the voice command, thereby correctly completing a task assigned by a user.

Description

机器人唤醒方法、装置和机器人Robot wake-up method, device and robot 技术领域Technical field
本发明实施例涉及人工智能自动控制领域,例如涉及一种机器人唤醒方法、装置和机器人。Embodiments of the present invention relate to the field of artificial intelligence automatic control, for example, to a robot wake-up method, apparatus, and robot.
背景技术Background technique
随着人工智能技术的发展,机器人为人类的生产生活带来了很多便利。当前用户对机器人进行控制时,可对机器人预先设置唤醒词,当机器人听到一个特定的唤醒词时,就知道是用户在呼唤自己。用户可以以唤醒词加命令内容的形式向机器人发布语音命令,例如,“Mike(唤醒词),今天天气怎么样(命令内容)?”,听到语音命令的机器人将对上述语音命令进行解析,解析出唤醒词,如果唤醒词与该机器人内设置的唤醒词一致,机器人将能够判别出是在叫自己,并将语音命令中唤醒词后的内容作为命令内容,即识别出用户是在询问今天的天气,于是便可直接做出回答,如图1示出了用户对一个机器人进行控制的场景。With the development of artificial intelligence technology, robots bring a lot of convenience to human production and life. When the current user controls the robot, the robot can pre-set the wake-up words. When the robot hears a specific wake-up word, it knows that the user is calling himself. The user can issue a voice command to the robot in the form of a wake-up word plus command content, for example, "Mike (awake word), what is the weather today (command content)?", the robot that hears the voice command will parse the voice command. The wake-up word is parsed. If the wake-up word is consistent with the wake-up word set in the robot, the robot will be able to discriminate that it is calling itself, and the content after the wake-up word in the voice command is used as the command content, that is, the user is identified today. The weather, so you can directly answer, as shown in Figure 1 shows the user to control a robot.
在实现本发明过程中,发明人发现相关技术中至少存在如下问题:当同一用户拥有多个机器人,且某项任务需要多个机器人共同完成时,现有方法将不再适用。例如,用户命令:“Mike,Tom,Jerry,帮我把房间收拾一下”,按照上述解析方法,唤醒词为Mike的机器人解析出的命令内容是“Tom,Jerry,帮我把房间收拾一下”。由于不能正确的解析出命令内容,机器人将无法完成用户交代的任务。In the process of implementing the present invention, the inventors have found that at least the following problems exist in the related art: when a same user owns a plurality of robots, and a certain task requires multiple robots to complete together, the existing method will no longer be applicable. For example, the user command: "Mike, Tom, Jerry, help me clean up the room", according to the above analysis method, the command content of the wake-up word for Mike's robot is "Tom, Jerry, help me clean up the room." Since the command content cannot be parsed correctly, the robot will not be able to complete the task that the user confesses.
发明内容Summary of the invention
本发明实施例的一个目的是提供一种新的机器人唤醒方法、装置和机器人,当用户同时对多个机器人发布语音命令时,机器人能正确的解析出命令内容,从而正确的完成用户交代的任务。An object of the embodiments of the present invention is to provide a new robot wake-up method, device, and robot. When a user simultaneously issues a voice command to multiple robots, the robot can correctly parse the command content, thereby correctly completing the task that the user confesses. .
第一方面,本发明实施例提供了一种机器人唤醒方法,所述唤醒方法应用于机器人,所述方法包括:In a first aspect, an embodiment of the present invention provides a robot wake-up method, where the wake-up method is applied to a robot, and the method includes:
获取语音命令;Get a voice command;
根据所述语音命令和预设的唤醒词库,解析出所述语音命令中的唤醒词以及命令内容,所述唤醒词库包括至少两个唤醒词,所述唤醒词库中的唤醒词用于唤醒至少两个机器人,所述机器人对应一个或一个以上的唤醒词;And parsing the wake-up words and the command content in the voice command according to the voice command and the preset wake-up word library, the wake-up word library includes at least two wake-up words, and the wake-up words in the wake-up word library are used for Wake up at least two robots corresponding to one or more wake words;
根据解析出的唤醒词和命令内容执行操作。Perform operations based on the parsed wakeup words and command content.
第二方面,本发明实施例还提供了一种机器人唤醒装置,所述唤醒装置应 用于机器人,所述装置包括:In a second aspect, an embodiment of the present invention further provides a robot wake-up device, where the wake-up device should For a robot, the device includes:
语音命令获取模块,用于获取语音命令;a voice command acquisition module, configured to acquire a voice command;
语音命令解析模块,用于根据所述语音命令和预设的唤醒词库,解析出所述语音命令中的唤醒词以及命令内容,所述唤醒词库包括至少两个唤醒词,所述唤醒词库中的唤醒词用于唤醒至少两个机器人,所述机器人对应一个或一个以上的唤醒词;a voice command parsing module, configured to parse an awakening word and a command content in the voice command according to the voice command and a preset wake-up term library, where the wake-up word library includes at least two wake-up words, the wake-up word The wake-up words in the library are used to wake up at least two robots, the robot corresponding to one or more wake-up words;
执行模块,用于根据解析出的唤醒词和命令内容执行操作。An execution module is configured to perform an operation according to the parsed wake-up words and command content.
第三方面,本发明实施例还提供了一种机器人,包括:In a third aspect, an embodiment of the present invention further provides a robot, including:
至少一个处理器;以及,At least one processor; and,
与所述至少一个处理器通信连接的存储器;其中,a memory communicatively coupled to the at least one processor; wherein
所述存储器存储有可被所述至少一个处理器执行的指令,所述指令被所述至少一个处理器执行,以使所述至少一个处理器能够执行如上所述的方法。The memory stores instructions executable by the at least one processor, the instructions being executed by the at least one processor to enable the at least one processor to perform the method as described above.
本发明实施例提供的唤醒方法和装置,通过在机器人内预设包括至少两个唤醒词的唤醒词库,例如用户拥有多个机器人,可以将该多个机器人对应的唤醒词预先设置于唤醒词库内,当用户同时对多个机器人发布语音命令时,机器人可以根据预设的唤醒词库正确的解析出语音命令中含有的唤醒词,从而正确的解析出语音命令中的命令内容,进而正确的完成用户交代的任务。The wake-up method and device provided by the embodiment of the present invention can preset the wake-up words corresponding to the plurality of robots to the wake-up words by preset a wake-up word library including at least two wake-up words in the robot, for example, the user owns multiple robots. In the library, when the user issues a voice command to multiple robots at the same time, the robot can correctly parse the wake-up words contained in the voice command according to the preset wake-up vocabulary, thereby correctly parsing the command content in the voice command, and thus correct. Complete the task that the user confesses.
附图说明DRAWINGS
一个或多个实施例通过与之对应的附图中的图片进行示例性说明,这些示例性说明并不构成对实施例的限定,附图中具有相同参考数字标号的元件表示为类似的元件,除非有特别申明,附图中的图不构成比例限制。The one or more embodiments are exemplified by the accompanying drawings in the accompanying drawings, and FIG. The figures in the drawings do not constitute a scale limitation unless otherwise stated.
图1是现有技术中用户对一个机器人进行控制的示意图;1 is a schematic diagram of a user controlling a robot in the prior art;
图2是本发明方法和装置的应用场景示意图;2 is a schematic diagram of an application scenario of the method and apparatus of the present invention;
图3是本发明唤醒方法的一个实施例的流程图;3 is a flow chart of one embodiment of a wake-up method of the present invention;
图4a是本发明唤醒方法的一个实施例中获取语音命令步骤的流程图;4a is a flow chart showing steps of acquiring a voice command in an embodiment of the wake-up method of the present invention;
图4b是本发明唤醒方法的一个实施例中获取语音命令步骤的流程图;4b is a flowchart of a step of acquiring a voice command in an embodiment of the wake-up method of the present invention;
图5是本发明唤醒方法的一个实施例中解析语音命令步骤的流程图;5 is a flow chart showing steps of parsing a voice command in an embodiment of the wake-up method of the present invention;
图6a是本发明唤醒方法的一个实施例中执行语音命令步骤的流程图;Figure 6a is a flow chart showing the steps of executing a voice command in one embodiment of the wake-up method of the present invention;
图6b是本发明唤醒方法的一个实施例中执行语音命令步骤的流程图;Figure 6b is a flow chart showing the steps of executing a voice command in one embodiment of the wake-up method of the present invention;
图7a是本发明唤醒方法的一个实施例的流程图;Figure 7a is a flow chart of one embodiment of a wake-up method of the present invention;
图7b是本发明唤醒方法的一个实施例的流程图;Figure 7b is a flow chart of one embodiment of the wake-up method of the present invention;
图8是本发明唤醒装置的一个实施例的结构示意图;Figure 8 is a schematic structural view of an embodiment of the wake-up device of the present invention;
图9是本发明唤醒装置的一个实施例中语音命令获取模块的结构示意图;9 is a schematic structural diagram of a voice command acquiring module in an embodiment of the wake-up device of the present invention;
图10是本发明唤醒装置的一个实施例的结构示意图; Figure 10 is a schematic structural view of an embodiment of the wake-up device of the present invention;
图11是本发明唤醒装置的一个实施例的结构示意图;以及Figure 11 is a block diagram showing the structure of an embodiment of the wake-up device of the present invention;
图12是本发明实施例提供的唤醒方法的机器人的硬件结构示意图。FIG. 12 is a schematic diagram showing the hardware structure of a waking method according to an embodiment of the present invention.
具体实施方式detailed description
为使本发明实施例的目的、技术方案和优点更加清楚,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The technical solutions in the embodiments of the present invention will be clearly and completely described in conjunction with the drawings in the embodiments of the present invention. It is a partial embodiment of the invention, and not all of the embodiments. All other embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative efforts are within the scope of the present invention.
本发明提供的机器人唤醒方法和装置适用于如图2所示的应用场景,包括多个机器人20,多个机器人20之间可以通过网络30互相通信,其中,网络30可以是例如家庭或公司的局域网,或一个特定网络等。机器人20具有至少一个网络接口,与网络30建立通信连接,从网络30获取数据或者指令。用户10可以对多个机器人20进行设置或者发布命令。The robot wake-up method and apparatus provided by the present invention are applicable to an application scenario as shown in FIG. 2, and include a plurality of robots 20, which can communicate with each other through a network 30, wherein the network 30 can be, for example, a home or a company. LAN, or a specific network, etc. The bot 20 has at least one network interface that establishes a communication connection with the network 30 to retrieve data or instructions from the network 30. The user 10 can set or issue commands to the plurality of robots 20.
每个机器人都具有与其对应的唤醒词用于将自身从休眠状态中唤醒或者响应用户的呼唤(唤醒词一般为一个,也可以为多个)。其中,所述唤醒词可以为机器人的名字,识别码或者其他任意词汇,所述唤醒词可以由用户进行设置,也可以在出厂时自带。Each robot has its corresponding wake-up word for waking itself from the sleep state or responding to the user's call (the wake-up word is generally one or more). The wake-up word may be a name, an identifier or any other vocabulary of the robot, and the wake-up word may be set by the user or may be provided at the factory.
每个机器人都具有用于放置唤醒词的唤醒词库,所述唤醒词库可以在出厂时自带,也可以由用户设置。同一用户的各个机器人之间可以共享相同的唤醒词库,以图2所示的实施例为例,假设用户10拥有三个机器人,三个机器人的唤醒词分别为Mike、Tom和Jerry(此处以每个机器人对应一个唤醒词为例说明,但并不限于此,每个机器人也可以关联两个以上的唤醒词),那么三个机器人的唤醒词库均可以设置成包括Mike、Tom和Jerry。当用户同时向该三个机器人发布语音命令“Mike,Tom,Jerry,帮我把房间收拾一下”时,由于每个机器人的唤醒词库均包括Mike、Tom和Jerry,根据该唤醒词库,机器人可以解析出唤醒词Mike、Tom和Jerry,从而能正确的解析出唤醒词后的命令内容“帮我把房间收拾一下”。Each robot has a wake-up vocabulary for placing wake-up words, which can be shipped from the factory or set by the user. The same wake-up vocabulary can be shared between the robots of the same user. Taking the embodiment shown in Figure 2 as an example, assume that the user 10 has three robots, and the wake-up words of the three robots are Mike, Tom, and Jerry (here Each robot corresponds to a wake-up word as an example, but it is not limited to this. Each robot can also associate more than two wake-up words. Then, the wake-up vocabularies of the three robots can be set to include Mike, Tom, and Jerry. When the user simultaneously issues a voice command "Mike, Tom, Jerry, help me clean up the room" to the three robots, since each robot's wake-up vocabulary includes Mike, Tom, and Jerry, according to the wake-up vocabulary, the robot You can parse out the wake-up words Mike, Tom, and Jerry so that you can correctly parse the command content after the wake-up word "help me clean up the room."
在实际使用中,对语音命令中唤醒词的解析可以采用声学模型,唤醒词库中的唤醒词采用与唤醒词对应的音素序列,根据用户发出的语音和预设的声学模型解码出音素序列,再将该音素序列与唤醒词音素序列进行匹配,从而解析出唤醒词。解析出唤醒词后,语音命令中唤醒词后面的内容作为命令内容。语音命令中命令内容的语义解析需要事先构建命令词语法文件,用户发出的命令内容需要存在于事先构建的命令词语法文件中,根据用户发出的语音和命令词语法文件解析出命令内容的语义。In actual use, the acoustic model can be used to resolve the wake-up words in the voice command. The wake-up words in the wake-up lexicon use the phoneme sequence corresponding to the wake-up word, and the phoneme sequence is decoded according to the voice and the preset acoustic model. The phoneme sequence is then matched with the wake-up word phoneme sequence to resolve the wake-up word. After the wake-up word is parsed, the content behind the wake-up word in the voice command is used as the command content. The semantic analysis of the command content in the voice command requires the command lexical method file to be constructed in advance. The command content sent by the user needs to exist in the pre-built command lexical file, and the semantics of the command content is parsed according to the voice and command lexical file sent by the user.
唤醒词库的更新可以由用户进行设置,也可以由机器人之间自行完成更新。 例如目前的唤醒词库中包括Mike、Tom和Jerry三个唤醒词,当用户又购入一个机器人(假设唤醒词为John)并通过网络与其他机器人之间建立通信连接后,新加入的机器人会通过网络向其他机器人广播自己的唤醒词John,其他机器人接收到该唤醒词后,会将该唤醒词加入到唤醒词库中,并将更新后的唤醒词库发送给新加入的机器人。The update of the wakeup vocabulary can be set by the user or can be done by the robot itself. For example, the current wake-up vocabulary includes three wake-up words: Mike, Tom, and Jerry. When the user purchases a robot (assuming the wake-up word is John) and establishes a communication connection with other robots through the network, the newly added robot will The other wake-up words John are broadcasted to other robots through the network. After receiving the wake-up words, other robots will add the wake-up words to the wake-up vocabulary and send the updated wake-up vocabulary to the newly added robot.
需要说明的是,虽然在图2中仅显示了1个用户10、3个机器人20。但本领域技术人员可以理解的是,在实际应用过程中,该应用场景还可以包括更多的用户10和机器人20。本发明提供的机器人唤醒方法和装置亦适用于用户对一个或两个机器人进行控制的场合。It should be noted that although only one user 10 and three robots 20 are shown in FIG. 2 . However, those skilled in the art can understand that the application scenario may further include more users 10 and robots 20 in the actual application process. The robot wake-up method and apparatus provided by the present invention are also applicable to a case where a user controls one or two robots.
本发明实施例提供了一种机器人唤醒方法,所述唤醒方法可由图2所示的任一机器人执行,如图3所示,为所述唤醒方法的一个实施例的流程图,所述唤醒方法包括:An embodiment of the present invention provides a robot wake-up method, which may be performed by any of the robots shown in FIG. 2, as shown in FIG. 3, which is a flowchart of an embodiment of the wake-up method, and the wake-up method. include:
步骤101:获取语音命令;Step 101: Acquire a voice command.
在实际应用中,可以在机器人身上设置麦克风用于实时接收语音信号。所述语音命令可以是实时接收的语音信号。但是有时用户虽然发出语音,但不一定是针对机器人的语音命令,因此需要对语音信息进行进一步判断,如果该语音信息是针对唤醒词库中的任一唤醒词对应的机器人的命令,则将该语音信息作为语音命令。In practical applications, a microphone can be placed on the robot for receiving voice signals in real time. The voice command may be a voice signal received in real time. However, sometimes the user makes a voice, but it is not necessarily a voice command for the robot. Therefore, it is necessary to further judge the voice information. If the voice message is a command for the robot corresponding to any wake-up word in the wake-up dictionary, Voice information as a voice command.
步骤102:根据所述语音命令和预设的唤醒词库,解析出所述语音命令中的唤醒词以及命令内容,所述唤醒词库包括若干个唤醒词,所述唤醒词库中的唤醒词对应至少两个机器人;Step 102: Parsing an awakening word and a command content in the voice command according to the voice command and a preset wake-up term library, where the wake-up word library includes a plurality of wake-up words, and the wake-up words in the wake-up word library Corresponding to at least two robots;
其中,所述命令内容可以是从用户发出的语音命令中截取的命令内容对应的语音,也可以是对该语音进行语义解析后的结果。如果是前者,机器人在执行命令内容对应的操作时还需要进行语义解析。所述若干个包括一个或者多个,即所述唤醒词库可以包括一个或多个唤醒词。The command content may be a voice corresponding to the command content intercepted from the voice command sent by the user, or may be a result of semantically parsing the voice. If it is the former, the robot needs to perform semantic analysis when performing the operation corresponding to the command content. The plurality of ones include one or more, that is, the wake-up vocabulary may include one or more wake-up words.
步骤103:根据解析出的唤醒词和命令内容执行操作。Step 103: Perform an operation according to the parsed wake-up word and the command content.
可选的,机器人解析出唤醒词和命令内容后,如果解析中的唤醒词包括自身对应的唤醒词,则执行命令内容对应的操作,如果不包含自身对应的唤醒词,则不执行任何操作。或者,无论解析出的唤醒词是否包含自身对应的唤醒词,都将命令内容发送给唤醒词对应的其他机器人,由唤醒词对应的机器人共同完成用户交代的任务。也可以由机器人根据命令内容将用户交代的任务分解成若干个子任务,将子任务分别通知唤醒词对应的其他机器人,由唤醒词对应的机器人协作完成用户交代的任务。Optionally, after the robot parses the wake-up word and the command content, if the wake-up word in the parsing includes the corresponding wake-up word, the operation corresponding to the command content is performed, and if the corresponding wake-up word is not included, no operation is performed. Alternatively, regardless of whether the parsed wake-up word contains its own corresponding wake-up word, the command content is sent to other robots corresponding to the wake-up word, and the robot corresponding to the wake-up word completes the task that the user confesses. The robot may also decompose the task that the user confesses into several subtasks according to the command content, notify the subtasks of the other robots corresponding to the wakeup words, and coordinate the tasks corresponding to the user by the robot corresponding to the wakeup words.
需要说明的是,上述步骤101、102和103并不必然被每个机器人都执行到,任一机器人有可能执行全部步骤也有可能只执行其中的一或两个步骤。以图2 所示的应用场景为例,当用户发出语音“Mike,Tom,Jerry,帮我把房间收拾一下”时。机器人Mike,Tom,Jerry可以分别获取用户发出的语音命令,解析出唤醒词和命令内容后,如果发现自己是用户的命令对象,则根据解析出的命令内容执行操作。在这种情况下,三个机器人都执行了步骤101、102和103。也可能只有机器人Mike听到了用户发出的语音命令,Mike获取了语音命令后,解析出唤醒词和命令内容,将命令内容发送给Tom和Jerry,三个机器人共同执行任务。在这种情况下只有Mike执行了步骤101、102和103,而Tom和Jerry只执行了步骤103。步骤102可能只被一个机器人或几个机器人执行,执行的机器人会将解析出的唤醒词和命令内容通过网络共享给其他机器人。It should be noted that the above steps 101, 102, and 103 are not necessarily performed by each robot, and it is possible for any robot to perform all steps or only one or two of them. Figure 2 The application scenario shown is an example. When the user makes a voice "Mike, Tom, Jerry, help me clean up the room". The robots Mike, Tom, and Jerry can respectively obtain the voice commands issued by the user, and after parsing the wake-up words and the command content, if they find that they are the command objects of the user, the operations are performed according to the parsed command content. In this case, all three robots perform steps 101, 102, and 103. It is also possible that only the robot Mike hears the voice command issued by the user. After Mike obtains the voice command, Mike parses out the wake-up word and the command content, and sends the command content to Tom and Jerry, and the three robots perform the task together. In this case only Mike performed steps 101, 102 and 103, while Tom and Jerry only performed step 103. Step 102 may be performed by only one robot or several robots, and the executed robot will share the parsed wake-up words and command content to other robots through the network.
本发明实施例提供的唤醒方法,通过在机器人内预设包括若干个唤醒词的唤醒词库,例如用户拥有多个机器人,可以将该多个机器人对应的唤醒词预先设置于唤醒词库内,当用户同时对多个机器人发布语音命令时,机器人可以根据预设的唤醒词库正确的解析出语音命令中含有的唤醒词,从而正确的解析出语音命令中的命令内容,进而完成用户交代的任务。The wake-up method provided by the embodiment of the present invention can preset the wake-up words corresponding to the plurality of robots in the wake-up word library by preset a wake-up word library including a plurality of wake-up words in the robot, for example, the user owns multiple robots. When the user issues a voice command to multiple robots at the same time, the robot can correctly parse the wake-up words contained in the voice command according to the preset wake-up vocabulary, thereby correctly parsing the command content in the voice command, thereby completing the user's account. task.
其中,具体的,在本发明的某些实施例中,如图4a所示,所述获取语音命令包括以下步骤;Specifically, in some embodiments of the present invention, as shown in FIG. 4a, the acquiring a voice command includes the following steps;
步骤1011:监听语音信息;Step 1011: Listening to voice information;
每个机器人都会实时监听用户发出的语音信息;Each robot will listen to the voice information sent by the user in real time;
步骤1012:确认所述语音信息是否为针对唤醒词库对应的机器人的命令;Step 1012: Confirm whether the voice information is a command for a robot corresponding to the wakeup vocabulary;
步骤1013:如果所述语音信息为针对唤醒词库对应的机器人的命令,则记录所述语音信息作为语音命令。Step 1013: If the voice information is a command for the robot corresponding to the wakeup vocabulary, the voice information is recorded as a voice command.
确认上述语音信息是否是用户针对唤醒词库中的任一唤醒词对应的机器人发出的命令,如果是,则该语音信息即为用户发出的语音命令。具体的,确认所述语音信息是否为针对唤醒词库对应的机器人的命令,首先确认所述语音信息是否包括预设唤醒词库中的任一唤醒词,如果包括唤醒词库中的任一唤醒词,则继续判断该唤醒词的出现是否为用户对机器人的命令或者呼唤,而不是用户在与其他人的通话中提到机器人。判断用户发出的语音信息是否是对机器人的命令或者呼唤,可以通过判断唤醒词与后面的语音内容之间停顿的时间间隔是否超过一预设时间,如果超过一预设时间,则该语音信息是对机器人的命令或者呼唤。或者可以通过判断第一个唤醒词前面是否有其他语音内容,如果没有其他语音内容,则该语音信息是对机器人的命令或者呼唤。It is confirmed whether the voice information is a command issued by the user for the robot corresponding to any wake-up word in the wake-up dictionary, and if yes, the voice information is a voice command issued by the user. Specifically, whether the voice information is a command for the robot corresponding to the wakeup vocabulary, first confirm whether the voice information includes any wake-up words in the preset wake-up vocabulary, if any wake-up in the wake-up vocabulary is included The word continues to determine whether the appearance of the wake-up word is a user's command or call to the robot, rather than the user mentioning the robot in a call with other people. Determining whether the voice information sent by the user is a command or a call to the robot, and determining whether the time interval between the wake-up word and the following voice content is more than a preset time. If the preset time is exceeded, the voice information is Command or call to the robot. Or by judging whether there is other voice content in front of the first wake-up word, if there is no other voice content, the voice information is an order or call to the robot.
可选的,在所述唤醒方法的其他实施例中,如图4b所示,所述获取语音命令包括:Optionally, in another embodiment of the waking method, as shown in FIG. 4b, the acquiring a voice command includes:
步骤1014:监听语音信息;Step 1014: Listening to voice information;
每个机器人都会实时监听用户发出的语音信息; Each robot will listen to the voice information sent by the user in real time;
步骤1015:确认所述语音信息是否为针对唤醒词库对应的机器人的命令;Step 1015: Confirm whether the voice information is a command for a robot corresponding to the wakeup vocabulary;
步骤1016:如果所述语音信息为针对唤醒词库对应的机器人的命令,则记录所述语音信息以及监听到语音信息的起始时刻,并加入预设的临时命令记录组;Step 1016: If the voice information is a command for the robot corresponding to the wakeup vocabulary, record the voice information and the start time of the voice information, and add a preset temporary command record group;
如果机器人监听到用户发出的语音信息,会进一步判断所述语音信息是否是用户针对唤醒词库中的任一唤醒词对应的机器人发出的命令,如果是,则记录所述语音信息以及机器人听到语音信息的时刻,并加入预设的临时命令记录组。If the robot monitors the voice information sent by the user, it further determines whether the voice information is a command issued by the user for the robot corresponding to any wake-up word in the wake-up dictionary, and if so, records the voice information and the robot hears The moment of the voice message, and join the preset temporary command record group.
在实际应用中,机器人可以将所述语音信息和听到语音信息的时刻存入机器人的缓存中。建立临时命令组的目的是获取相对完整和清晰的用户命令,以图2所示的三个机器人为例,假设用户发布的语音命令为“Mike,Tom,Jerry,帮我把房间收拾一下”,则听到“Mike”开头、“Tom”开头或“Jerry”开头的每一个机器人记录其听到上述唤醒词时的时间戳t(起始时刻),并不一定每个机器人都能听到完整的命令,可能有的机器人正在远处向用户移动,没有听到最开始的“Mike”,听到的是“Tom,Jerry,帮我把房间收拾一下”,此处记录的时间戳就是为了防止这种半截语句被当作完整命令的情况发生。In practical applications, the robot can store the voice information and the moment when the voice information is heard into the cache of the robot. The purpose of establishing a temporary command group is to obtain relatively complete and clear user commands. Take the three robots shown in Figure 2 as an example. Suppose the voice command issued by the user is “Mike, Tom, Jerry, help me clean up the room”. Then every robot that hears the beginning of "Mike", the beginning of "Tom" or the beginning of "Jerry" records the timestamp t (starting time) when it hears the above-mentioned wake-up words, and not necessarily every robot can hear the complete The order, maybe the robot is moving to the user in the distance, did not hear the initial "Mike", heard "Tom, Jerry, help me clean up the room", the timestamp recorded here is to prevent This half-statement occurs as a complete command.
具体的,确认所述语音信息是否为针对唤醒词库对应的机器人的命令,首先确认所述语音信息是否包括预设唤醒词库中的任一唤醒词,如果包括唤醒词库中的任一唤醒词,则继续判断该唤醒词的出现是否为用户对机器人的命令或者呼唤,而不是用户在与其他人的通话中提到机器人。判断用户发出的语音信息是否是对机器人的命令或者呼唤,可以通过判断唤醒词与后面的语音内容之间停顿的时间间隔是否超过一预设时间,如果超过一预设时间,则该语音信息是对机器人的命令或者呼唤。或者可以通过判断第一个唤醒词前面是否有其他语音内容,如果没有其他语音内容,则该语音信息是对机器人的命令或者呼唤。Specifically, whether the voice information is a command for the robot corresponding to the wakeup vocabulary, first confirm whether the voice information includes any wake-up words in the preset wake-up vocabulary, if any wake-up in the wake-up vocabulary is included The word continues to determine whether the appearance of the wake-up word is a user's command or call to the robot, rather than the user mentioning the robot in a call with other people. Determining whether the voice information sent by the user is a command or a call to the robot, and determining whether the time interval between the wake-up word and the following voice content is more than a preset time. If the preset time is exceeded, the voice information is Command or call to the robot. Or by judging whether there is other voice content in front of the first wake-up word, if there is no other voice content, the voice information is an order or call to the robot.
步骤1017:确认临时命令记录组内的机器人记录的起始时刻中的最早时刻,根据所述最早时刻确定起始时间段;Step 1017: Confirm the earliest time in the start time of the robot record in the temporary command record group, and determine the start time period according to the earliest time;
比较临时命令记录组内的机器人记录的起始时刻,确定最早的时刻,因为最早的时刻记录的语音信息相对最完整。例如最早时刻为t1,以最早时刻t1为起点,并按照一个经验设定阈值t0确定起始时间段为从t1到t1+t0。The starting time of the robot record in the temporary command record group is compared, and the earliest time is determined because the voice information recorded at the earliest time is relatively complete. For example, the earliest time is t1, the earliest time t1 is taken as the starting point, and the starting time period is determined from t1 to t1+t0 according to an empirical setting threshold t0.
t0的设置可以根据机器人的性能和经验来设定,例如0.1s。一方面,不同机器人的反应有快有慢,因此需要设定一个时间差,另一方面,用户发出的两个唤醒词之间会有一个相对固定的间隔,因此设定的时间差应小于此间隔,以防漏掉一个唤醒词。The setting of t0 can be set according to the performance and experience of the robot, for example, 0.1s. On the one hand, the response of different robots is fast and slow, so it is necessary to set a time difference. On the other hand, there is a relatively fixed interval between the two wake-up words sent by the user, so the set time difference should be less than this interval. In case of missing a wake-up word.
步骤1018:获得起始时刻位于所述起始时间段内的语音信息中清晰度最高的语音信息作为语音命令。 Step 1018: Obtain the voice information with the highest resolution among the voice information whose starting time is located in the starting time period as a voice command.
语音信息记录结束后,临时命令记录组内的机器人会对自身缓存的语音信息进行清晰度判定,得到分数值x,取起始时间段(t1到t1+t0)内的清晰度最高的语音信息作为语音命令。After the voice information recording is finished, the robot in the temporary command recording group will judge the clarity of the voice information buffered by itself, and obtain the score value x, and take the highest-resolution voice information in the starting time period (t1 to t1+t0). As a voice command.
需要说明的是,上述步骤1014、1015、1016、1017和1018并不必然被每个机器人都执行到,一般听到语音信息的机器人都会执行步骤1014、1015和1016,但是步骤1017和1018可能只被一个机器人或几个机器人执行,每个机器人可以向其他机器人广播自己的工作状态,由最空闲的机器人来执行,然后执行的机器人会将执行结果通过网络共享给其他机器人。It should be noted that the above steps 1014, 1015, 1016, 1017, and 1018 are not necessarily performed by each robot. Generally, the robot that hears the voice information performs steps 1014, 1015, and 1016, but steps 1017 and 1018 may only Executed by one robot or several robots, each robot can broadcast its own working state to other robots, which is executed by the most idle robot, and then the executed robot will share the execution results to other robots through the network.
本发明实施例通过各机器人记录监听到语音信息的起始时刻,并加入临时命令记录组,通过比较临时命令记录组内的机器人记录的起始时刻,可以确定听到语音信息的最早时刻,能获得相对完整的语音命令。通过确定起始时间段内清晰度最高的语音信息作为语音命令,能获得相对完整和清晰的语音命令,增强了获取用户语音命令的可靠性。语音命令的清晰便于对所述语音命令的正确解析,语音命令的完整,可以保证能尽可能完整的解析出用户发出的唤醒词,便于各唤醒词对应的机器人之间的协同合作。In the embodiment of the present invention, each robot records the start time of the voice information, and joins the temporary command record group. By comparing the start time of the robot record in the temporary command record group, the earliest time when the voice information is heard can be determined. Get a relatively complete voice command. By determining the speech information with the highest resolution in the initial time period as the voice command, relatively complete and clear voice commands can be obtained, and the reliability of acquiring the user voice command is enhanced. The clarity of the voice command facilitates the correct interpretation of the voice command, and the integrity of the voice command can ensure that the wake-up words sent by the user can be parsed as completely as possible, and the cooperation between the robots corresponding to the wake-up words can be facilitated.
具体的,在所述唤醒方法的某些实施例中,如图5所示,所述根据所述语音命令和预设的唤醒词库,解析出所述语音命令中的唤醒词以及命令内容,包括:Specifically, in some embodiments of the waking method, as shown in FIG. 5, the wakeup word and the command content in the voice command are parsed according to the voice command and the preset wakeup vocabulary. include:
步骤1021:根据语音命令和唤醒词库解析出语音命令中的唤醒词;Step 1021: Parse the wake-up words in the voice command according to the voice command and the wake-up vocabulary;
由于用户可能命令一个或者同时命令多个机器人,用户对机器人的语音命令的语法格式一般如下:Since the user may command one or several commands at the same time, the grammatical format of the user's voice command to the robot is generally as follows:
<name>,[与|和|还有|那个|嗯|…]<name>,[与|和|还有|那个|嗯|…]<name>,……,<command>。<name>, [and | and | and | that | um | ...] <name>, [and | and | and | that | ah | ...] <name>, ..., <command>.
其中,<name>为某个机器人的唤醒词,<name>的出现次数可以是一个或多个;Where <name> is the wake-up word of a certain robot, and the number of occurrences of <name> may be one or more;
[与|和|还有|那个|嗯|…]为两个唤醒词间可能出现的连词,如“张三还有李四”,此连词并不是一定出现的,因此用[]表示;[And | and | and | that | ah | ...] are two possible conjunctions between the awakening words, such as "Zhang San and Li Si", this conjunction does not necessarily appear, so use [] to indicate;
<command>为唤醒词后的命令内容。<command> is the content of the command after the wake-up word.
例如,用户语音命令为“张三,李四,还有王五,帮我十二点前准备好午饭”,此句话按照上述语法格式套用,则可判定“张三,李四,王五”为三个机器人的唤醒词,“还有”为连词或口头语,“帮我十二点前准备好午饭”为命令内容。For example, the user voice command is "Zhang San, Li Si, and Wang Wu, help me prepare lunch before twelve o'clock." This sentence can be applied according to the above grammatical format, then it can be judged "Zhang San, Li Si, Wang Wu "The awakening words for the three robots, "also" is a conjunction or a spoken word, "help me prepare lunch before twelve o'clock" as the command content.
通过将语音命令和唤醒词库中的唤醒词逐一进行匹配,就可以解析出语音命令中的唤醒词。By matching the voice command and the wake-up words in the wake-up vocabulary one by one, the wake-up words in the voice command can be parsed.
步骤1022:根据语音命令和唤醒词解析出语音命令中的命令内容。 Step 1022: Parse the command content in the voice command according to the voice command and the wake-up word.
解析出语音命令中的唤醒词后,语音命令中唤醒词后面的内容即为用户实际的命令内容,通过对该命令内容的语义解析,就能获知用户的真实用意。After the wake-up word in the voice command is parsed, the content behind the wake-up word in the voice command is the actual command content of the user, and the semantic meaning of the command content can be used to know the user's true intention.
需要说明的是,上述步骤1021和1022并不必然被每个机器人都执行到,可能只被一个机器人或几个机器人执行,每个机器人可以向其他机器人广播自己的工作状态,由最空闲的机器人来执行。It should be noted that the above steps 1021 and 1022 are not necessarily performed by each robot, and may be performed by only one robot or several robots. Each robot can broadcast its own working state to other robots, and the most idle robots. To execute.
具体的,在所述唤醒方法的某些实施例中,如图6a所示,所述根据解析出的唤醒词和命令内容执行操作,包括:Specifically, in some embodiments of the waking method, as shown in FIG. 6a, the performing operations according to the parsed wake-up words and command content includes:
步骤1031:将命令内容通知与唤醒词对应的机器人,以使与唤醒词对应的机器人执行命令内容对应的操作。Step 1031: Notifying the robot corresponding to the wake-up word with the command content, so that the robot corresponding to the wake-up word performs an operation corresponding to the command content.
可以是解析出唤醒词和命令内容的机器人也可以是通过网络获取唤醒词和命令内容的其他机器人将命令内容通知与唤醒词对应的机器人。例如,用户发布的语音命令为“Mike,Tom,Jerry,帮我把房间收拾一下”,假设解析语音命令的机器人是Mike,那么Mike会将命令内容“帮我把房间收拾一下”发送给Tom和Jerry,然后Mike、Tom和Jeery一起收拾房间。其中,Mike发送给Tom和Jeery的命令内容可以是进行语义解析后的结果,也可以是语音,需要Tom和Jerry自行对语音进行语义解析。The robot that can parse the wake-up word and the command content may also be a robot that acquires the wake-up word and the command content through the network to notify the command content to the wake-up word. For example, the voice command issued by the user is "Mike, Tom, Jerry, help me clean up the room." If the robot that parses the voice command is Mike, then Mike will send the command content "Help me clean up the room" to Tom and Jerry, then Mike, Tom and Jeery clean up the room together. Among them, the command content that Mike sends to Tom and Jeery can be the result of semantic analysis, or it can be voice, and Tom and Jerry need to perform semantic analysis on the voice.
可选的,在所述唤醒方法的其他实施例中,如图6b所示,所述根据解析出的唤醒词和命令内容执行操作,包括:Optionally, in another embodiment of the waking method, as shown in FIG. 6b, the performing operations according to the parsed wake-up words and command content, including:
步骤1032:根据命令内容分解任务,将分解后的任务分别通知与唤醒词对应的机器人,以使与唤醒词对应的机器人协作执行命令内容对应的操作。Step 1032: According to the command content decomposition task, respectively notify the decomposed task to the robot corresponding to the wake-up word, so that the robot corresponding to the wake-up word cooperates to perform the operation corresponding to the command content.
在实际应用中,机器人解析出唤醒词和命令内容后,可以是解析出唤醒词和命令内容的机器人也可以是通过网络获取唤醒词和命令内容的其他机器人,可以通知唤醒词对应的其他机器人等待任务,使各个机器人建立任务组,任务组成员间开始同步(如共享位置信息,唯一识别码,自身能力等等),然后等待任务。解析语音命令的机器人根据命令内容结合各个机器人的位置和自身能力,将用户交代的任务进行拆解,分成若干个子任务,然后将拆解后的子任务发送给任务组内的其他成员,任务组内的机器人协作完成用户交代的任务。例如,以图2为例说明,假设解析语音命令的机器人为Mike,Mike将收拾房间的任务分成三个子任务:收拾客厅和卧室、收拾卫生间以及收拾厨房,Mike可以将收拾卫生间和收拾厨房的子任务分别发送给Tom和Jerry,自己收拾客厅和卧室。In practical applications, after the robot parses the wake-up word and the command content, the robot that parses the wake-up word and the command content may also be another robot that acquires the wake-up word and the command content through the network, and may notify other robots corresponding to the wake-up word to wait. Tasks, so that each robot establishes a task group, and the task group members start to synchronize (such as sharing location information, unique identifier, own ability, etc.), and then wait for the task. The robot that parses the voice command combines the position of the robot and its own capabilities according to the command content, disassembles the task that the user confesses, divides it into several subtasks, and then sends the disassembled subtask to other members in the task group, the task group. The robots within the collaboration cooperate to complete the tasks that the user confesses. For example, taking Figure 2 as an example, assuming that the robot that parses the voice command is Mike, Mike divides the task of cleaning up the room into three subtasks: cleaning up the living room and bedroom, cleaning up the bathroom, and cleaning up the kitchen. Mike can clean up the bathroom and clean up the kitchen. The missions were sent to Tom and Jerry to clean up the living room and bedroom.
本发明实施例相对于现有技术,执行任务的主体不再是单独的机器人,而是将多个机器人联合起来共同执行任务,执行效率更高,用户体验好。Compared with the prior art, the main body of the task is no longer a separate robot, but a plurality of robots are combined to perform tasks, and the execution efficiency is higher and the user experience is good.
如图7a所示,为所述唤醒方法的一个实施例的流程示意图,在该实施例中,所述唤醒方法,包括: As shown in FIG. 7a, it is a schematic flowchart of an embodiment of the waking method. In this embodiment, the waking method includes:
步骤201:监听语音信息;Step 201: Listening to voice information;
每个机器人都会实时监听用户发出的语音信息;Each robot will listen to the voice information sent by the user in real time;
步骤202:确认所述语音信息是否为针对唤醒词库对应的机器人的命令;Step 202: Confirm whether the voice information is a command for a robot corresponding to the wakeup vocabulary;
步骤203:如果所述语音信息为针对唤醒词库对应的机器人的命令,则记录所述语音信息作为语音命令。Step 203: If the voice information is a command for the robot corresponding to the wakeup vocabulary, the voice information is recorded as a voice command.
每个监听到语音信息的机器人都会确认上述语音信息是否是用户针对唤醒词库中的任一唤醒词对应的机器人发出的命令,如果是,则将该语音信息作为语音命令。Each robot that listens to the voice information confirms whether the voice information is a command issued by the user for the robot corresponding to any wake-up word in the wake-up vocabulary, and if so, the voice information is used as a voice command.
步骤204:根据所述语音命令和预设的唤醒词库,解析出所述语音命令中的唤醒词以及命令内容。Step 204: Parse the wake-up word and the command content in the voice command according to the voice command and the preset wake-up dictionary.
在该实施例中,每个监听到语音命令的机器人会对语音命令进行解析,解析出唤醒词和命令内容。In this embodiment, each robot that listens to a voice command parses the voice command and parses the wake word and the command content.
步骤205:将命令内容通知与唤醒词对应的机器人,以使与唤醒词对应的机器人执行命令内容对应的操作。Step 205: Notifying the robot corresponding to the wake-up word with the command content, so that the robot corresponding to the wake-up word performs an operation corresponding to the command content.
解析语音命令的机器人,如果解析出的唤醒词不包括自己,则将命令内容发送给其他唤醒词对应的机器人;如果解析出的唤醒词包括自己,则开始执行命令内容对应的操作,并将命令内容发送给其他相关的机器人。机器人如果自身解析出命令内容,则按照自身解析出的命令内容执行操作,如果自身没有解析出命令内容,则按照其他机器人发送的命令内容执行操作。The robot that parses the voice command, if the parsed wake-up word does not include itself, sends the command content to the robot corresponding to the other wake-up words; if the parsed wake-up word includes itself, starts executing the operation corresponding to the command content, and the command The content is sent to other related robots. If the robot parses the command content itself, the operation is performed according to the command content that is parsed by itself, and if the command content is not parsed by itself, the operation is performed according to the command content sent by the other robot.
本发明实施例提供的唤醒方法,通过在机器人内预设包括若干个唤醒词的唤醒词库,机器人可以根据预设的唤醒词库正确的解析出语音命令中含有的唤醒词,从而正确的解析出语音命令中的命令内容,进而正确的完成用户交代的任务。用户可以对多个机器人同时发出指令,多个机器人可以联合起来共同执行用户发布的任务,执行效率更高,用户体验好。According to the wake-up method provided by the embodiment of the present invention, the robot can correctly parse the wake-up words contained in the voice command according to the preset wake-up vocabulary by preset a wake-up vocabulary including a plurality of wake-up words in the robot, thereby correctly analyzing The content of the command in the voice command is completed, and the task that the user confesses is correctly completed. The user can issue commands to multiple robots at the same time, and multiple robots can jointly perform the tasks issued by the user, and the execution efficiency is higher and the user experience is good.
如图7b所示,为所述唤醒方法的一个实施例的流程示意图,在该实施例中,所述唤醒方法,包括:As shown in FIG. 7b, it is a schematic flowchart of an embodiment of the waking method. In this embodiment, the waking method includes:
步骤301:监听语音信息;Step 301: Listening to voice information;
每个机器人都会实时监听用户发出的语音信息;Each robot will listen to the voice information sent by the user in real time;
步骤302:确认所述语音信息是否为针对唤醒词库对应的机器人的命令;Step 302: Confirm whether the voice information is a command for a robot corresponding to the wakeup vocabulary;
步骤303:如果所述语音信息为针对唤醒词库对应的机器人的命令,则记录所述语音信息以及监听到语音信息的起始时刻,并加入预设的临时命令记录组;Step 303: If the voice information is a command for the robot corresponding to the wakeup vocabulary, record the voice information and the start time of the voice information, and add a preset temporary command record group;
每个监听到语音信息的机器人都会确认上述语音信息是否是用户针对唤醒词库中的任一唤醒词对应的机器人发出的命令,如果是,则记录所述语音信息以及监听到语音信息的起始时刻,并加入预设的临时命令记录组。然后,加入临时命令记录组的机器人会向其他机器人广播自己加入了临时命令记录组,以 使临时命令记录组内的组员能知晓其他组员的存在。Each robot that listens to the voice information confirms whether the voice information is a command issued by the user for the robot corresponding to any wake-up word in the wake-up dictionary, and if so, records the voice information and listens to the start of the voice information. At the moment, and join the preset temporary command record group. Then, the robot that joins the temporary command record group will broadcast the temporary command record group to other robots to Make the members of the temporary command record group aware of the existence of other group members.
步骤304:确认临时命令记录组内的机器人记录的起始时刻中的最早时刻,根据所述最早时刻确定起始时间段;Step 304: Confirm the earliest time in the start time of the robot record in the temporary command record group, and determine the start time period according to the earliest time;
临时命令记录组中的机器人会向组内的其他机器人广播自己记录的起始时刻,某个机器人会确定各个起始时刻中的最早时刻,并根据该最早时刻确定起始时间段。The robot in the temporary command record group broadcasts the start time of its own record to other robots in the group, and a certain robot determines the earliest time in each start time and determines the start time period based on the earliest time.
步骤305:获得起始时刻位于所述起始时间段内的语音信息中清晰度最高的语音信息作为语音命令Step 305: Obtain voice information with the highest resolution among the voice information whose starting time is located in the starting time period as a voice command.
语音信息记录结束后,临时命令记录组中的机器人对自身记录的语音信息进行清晰度判定,得到清晰度分数值,然后向临时命令记录组中的其他机器人广播该清晰度分数值,某个机器人会找出位于起始时间段内清晰度最高的语音信息。After the voice information recording is finished, the robot in the temporary command recording group performs the resolution determination on the voice information recorded by itself, obtains the sharpness score value, and then broadcasts the sharpness score value to other robots in the temporary command record group, a certain robot It will find the highest-resolution voice information in the starting time period.
步骤306:根据所述语音命令和预设的唤醒词库,解析出所述语音命令中的唤醒词以及命令内容;Step 306: Parsing the wake-up words and the command content in the voice command according to the voice command and the preset wake-up term library;
步骤307:根据命令内容分解任务,将分解后的任务分别通知与唤醒词对应的机器人,以使与唤醒词对应的机器人协作执行命令内容对应的操作。Step 307: According to the command content decomposition task, respectively notify the decomposed task to the robot corresponding to the wake-up word, so that the robot corresponding to the wake-up word cooperates to perform the operation corresponding to the command content.
需要说明的是,上述步骤304、305、306和307并不必然被每个机器人执行,每个机器人可以向其他机器人广播自己的工作状态,由最空闲的机器人来执行。It should be noted that the above steps 304, 305, 306, and 307 are not necessarily performed by each robot, and each robot can broadcast its own working state to other robots, and is executed by the most idle robot.
本发明实施例提供的唤醒方法,通过在机器人内预设包括若干个唤醒词的唤醒词库,机器人可以根据预设的唤醒词库正确的解析出语音命令中含有的唤醒词,从而正确的解析出语音命令中的命令内容,进而正确的完成用户交代的任务。通过各机器人记录监听到语音信息的起始时刻,并加入临时命令记录组,通过比较临时命令记录组内的机器人记录的起始时刻,可以确定听到语音信息的最早时刻,能获得相对完整的语音命令。通过确定起始时间段内清晰度最高的语音信息作为语音命令,能获得相对完整和清晰的语音命令,增强了获取用户语音命令的可靠性。用户可以对多个机器人同时发出指令,多个机器人可以以协作的方式完成用户发布的任务,执行效率更高,用户体验好。According to the wake-up method provided by the embodiment of the present invention, the robot can correctly parse the wake-up words contained in the voice command according to the preset wake-up vocabulary by preset a wake-up vocabulary including a plurality of wake-up words in the robot, thereby correctly analyzing The content of the command in the voice command is completed, and the task that the user confesses is correctly completed. By recording the start time of the voice information monitored by each robot and adding a temporary command record group, by comparing the start time of the robot record in the temporary command record group, the earliest time to hear the voice information can be determined, and a relatively complete can be obtained. Voice command. By determining the speech information with the highest resolution in the initial time period as the voice command, relatively complete and clear voice commands can be obtained, and the reliability of acquiring the user voice command is enhanced. The user can issue commands to multiple robots at the same time, and multiple robots can complete the tasks issued by the users in a coordinated manner, and the execution efficiency is higher and the user experience is good.
可选的,在所述唤醒方法的其他实施例中,所述方法还包括:Optionally, in another embodiment of the waking method, the method further includes:
更新唤醒词库。Update the wakeup vocabulary.
当用户又购入一个机器人,并通过网络与其他机器人之间建立通信连接后,新加入的机器人会通过网络向其他机器人广播自己的唤醒词,收到唤醒词的机器人会将该唤醒词同步到自身的唤醒词库,并将同步后的全集唤醒词库发回给该新加入的机器人。When the user purchases another robot and establishes a communication connection with other robots through the network, the newly added robot will broadcast its wake-up words to other robots through the network, and the robot receiving the wake-up words will synchronize the wake-up words to The own wake-up vocabulary, and send the synchronized corpus of wake-up vocabulary back to the newly added robot.
可选的,在所述唤醒方法的其他实施例中,所述方法还包括: Optionally, in another embodiment of the waking method, the method further includes:
解散临时命令记录组。Dismiss the temporary command record group.
任务分配完成后,各个机器人开始执行任务,此时可以解散临时命令记录组,释放内存,提高资源利用率。After the task assignment is completed, each robot starts to perform the task. At this time, the temporary command record group can be dismissed, the memory is released, and the resource utilization rate is improved.
相应的,本发明实施例还提供了一种机器人唤醒装置,所述唤醒装置设置于图2所示的任一机器人内,如图8所示,所述唤醒装置400包括:Correspondingly, the embodiment of the present invention further provides a robot wake-up device, which is disposed in any of the robots shown in FIG. 2. As shown in FIG. 8, the wake-up device 400 includes:
语音命令获取模块401,用于获取语音命令;The voice command obtaining module 401 is configured to acquire a voice command.
语音命令解析模块402,用于根据所述语音命令和预设的唤醒词库,解析出所述语音命令中的唤醒词以及命令内容,所述唤醒词库包括若干个唤醒词,所述唤醒词库中的唤醒词对应至少两个机器人;The voice command parsing module 402 is configured to parse the wake-up word and the command content in the voice command according to the voice command and the preset wake-up vocabulary, where the wake-up vocabulary includes a plurality of wake-up words, the wake-up word The wake-up words in the library correspond to at least two robots;
执行模块403,用于根据解析出的唤醒词和命令内容执行操作。The executing module 403 is configured to perform an operation according to the parsed wake-up word and the command content.
本发明实施例提供的唤醒装置,通过在机器人内预设包括若干个唤醒词的唤醒词库,机器人可以根据预设的唤醒词库正确的解析出语音命令中含有的唤醒词,从而正确的解析出语音命令中的命令内容,进而正确的完成用户交代的任务。The wake-up device provided by the embodiment of the present invention can correct the wake-up words contained in the voice command according to the preset wake-up vocabulary by presetting a wake-up vocabulary including a plurality of wake-up words in the robot, thereby correctly analyzing The content of the command in the voice command is completed, and the task that the user confesses is correctly completed.
具体的,在所述唤醒装置的某些实施例中,所述语音命令获取模块401包括:Specifically, in some embodiments of the waking device, the voice command obtaining module 401 includes:
语音信息监听子模块,用于监听语音信息;a voice information monitoring sub-module for monitoring voice information;
语音命令确认子模块,用于确认所述语音信息是否为针对唤醒词库对应的机器人的命令;a voice command confirmation submodule, configured to confirm whether the voice information is a command for a robot corresponding to the wakeup vocabulary;
第一语音命令获取子模块,用于如果所述语音信息为针对唤醒词库对应的机器人的命令,则记录所述语音信息作为语音命令。The first voice command acquisition submodule is configured to record the voice information as a voice command if the voice information is a command for a robot corresponding to the wakeup vocabulary.
可选的,在所述唤醒装置的其他实施例中,如图9所示,所述语音命令获取模块401包括:Optionally, in other embodiments of the waking device, as shown in FIG. 9, the voice command obtaining module 401 includes:
语音信息监听子模块4011,用于监听语音信息;a voice information monitoring sub-module 4011, configured to monitor voice information;
语音命令确认子模块4012,用于确认所述语音信息是否为针对唤醒词库对应的机器人的命令;a voice command confirmation sub-module 4012, configured to confirm whether the voice information is a command for a robot corresponding to the wake-up vocabulary;
语音命令记录子模块4013,用于如果所述语音信息为针对唤醒词库对应的机器人的命令,则记录所述语音信息以及监听到语音信息的起始时刻,并加入预设的临时命令记录组;The voice command recording sub-module 4013 is configured to record the voice information and the start time of the voice information if the voice information is a command for the robot corresponding to the wake-up vocabulary, and add a preset temporary command record group. ;
起始时间段确认子模块4014,用于确认临时命令记录组内的机器人记录的起始时刻中的最早时刻,根据所述最早时刻确定起始时间段;a start time period confirmation sub-module 4014, configured to confirm an earliest time in a start time of the robot record in the temporary command record group, and determine a start time period according to the earliest time;
第二语音命令获取子模块4015,用于获得起始时刻位于所述起始时间段内的语音信息中清晰度最高的语音信息作为语音命令。The second voice command acquisition sub-module 4015 is configured to obtain voice information with the highest resolution among the voice information whose starting time is located in the start time period as a voice command.
具体的,在所述唤醒装置的某些实施例中,,所述语音命令确认子模块包括:Specifically, in some embodiments of the wake-up device, the voice command confirmation sub-module includes:
语音命令确认子单元,用于如果所述语音信息包括预设唤醒词库中的任一 唤醒词且该唤醒词的出现为呼唤,则所述语音信息为针对唤醒词库对应的机器人的命令。a voice command confirmation subunit, configured to: if the voice information includes any one of a preset wakeup vocabulary The wake-up word and the appearance of the wake-up word are called, and the voice information is a command for the robot corresponding to the wake-up vocabulary.
可选的,如图10所示,在所述唤醒装置的其他实施例中,所述唤醒装置500除了包括语音命令获取模块501、语音命令解析模块502和执行模块503之外,还包括:Optionally, as shown in FIG. 10, in other embodiments of the awake device, the awake device 500 includes: a voice command obtaining module 501, a voice command parsing module 502, and an executing module 503,
唤醒词库更新模块504,用于更新唤醒词库。The wakeup vocabulary update module 504 is configured to update the wakeup vocabulary.
其中,在一些实施例中,所述唤醒词库更新模块包括:In some embodiments, the wakeup vocabulary update module includes:
第一唤醒词库更新子模块,用于根据唤醒词设置指令设置唤醒词,并广播所述唤醒词。The first wakeup vocabulary update submodule is configured to set a wakeup word according to the wakeup word setting instruction, and broadcast the wakeup word.
在另一些实施例中,所述唤醒词库更新模块还包括:In other embodiments, the wakeup vocabulary update module further includes:
第二唤醒词库更新子模块,用于接收广播的唤醒词,将所述唤醒词加入预设的唤醒词库中,并将更新后的唤醒词库发送给广播唤醒词的机器人。The second wake-up vocabulary update sub-module is configured to receive the broadcasted wake-up word, add the wake-up word to the preset wake-up vocabulary, and send the updated wake-up vocabulary to the robot that broadcasts the wake-up word.
具体的,在所述唤醒装置的某些实施例中,所述语音命令解析模块包括:Specifically, in some embodiments of the waking device, the voice command parsing module includes:
唤醒词解析子模块,用于根据语音命令和唤醒词库解析出语音命令中的唤醒词;The wake-up word parsing sub-module is configured to parse the wake-up words in the voice command according to the voice command and the wake-up vocabulary;
命令内容解析子模块,用于根据语音命令和唤醒词解析出语音命令中的命令内容。The command content parsing sub-module is configured to parse the command content in the voice command according to the voice command and the wake-up word.
具体的,在所述唤醒装置的某些实施例中,所述执行模块包括:Specifically, in some embodiments of the waking device, the execution module includes:
第一执行子模块,用于将命令内容通知与唤醒词对应的机器人,以使与唤醒词对应的机器人执行命令内容对应的操作。The first execution submodule is configured to notify the robot corresponding to the wakeup word of the command content, so that the robot corresponding to the wakeup word performs an operation corresponding to the command content.
在所述唤醒装置的其他实施例中,所述执行模块包括:In other embodiments of the wake-up device, the execution module includes:
第二执行子模块,用于根据命令内容分解任务,将分解后的任务分别通知与唤醒词对应的机器人,以使与唤醒词对应的机器人协作执行命令内容对应的操作。The second execution sub-module is configured to respectively notify the robot corresponding to the wake-up word according to the command content decomposition task, so that the robot corresponding to the wake-up word cooperates to perform the operation corresponding to the command content.
如图11所示,为所述唤醒装置的一个实施例的结构示意图,在该实施例中,所述唤醒装置600包括:FIG. 11 is a schematic structural diagram of an embodiment of the wake-up device. In this embodiment, the wake-up device 600 includes:
语音命令获取模块601,用于获取语音命令;其中,所述语音命令获取模块601包括:The voice command obtaining module 601 is configured to obtain a voice command. The voice command acquiring module 601 includes:
语音信息监听子模块6011,用于监听语音信息;a voice information monitoring sub-module 6011, configured to monitor voice information;
语音命令确认子模块6012,用于确认所述语音信息是否为针对唤醒词库对应的机器人的命令;a voice command confirmation sub-module 6012, configured to confirm whether the voice information is a command for a robot corresponding to the wake-up vocabulary;
语音命令记录子模块6013,用于如果所述语音信息为针对唤醒词库对应的机器人的命令,则记录所述语音信息以及监听到语音信息的起始时刻,并加入预设的临时命令记录组;The voice command recording sub-module 6013 is configured to record the voice information and the start time of the voice information if the voice information is a command for the robot corresponding to the wake-up vocabulary, and add a preset temporary command record group. ;
起始时间段确认子模块6014,用于确认临时命令记录组内的机器人记录的 起始时刻中的最早时刻,根据所述最早时刻确定起始时间段;a start time period confirmation sub-module 6014 for confirming the robot record in the temporary command record group The earliest time in the starting time, determining the starting time period according to the earliest time;
第二语音命令获取子模块6015,用于获得起始时刻位于所述起始时间段内的语音信息中清晰度最高的语音信息作为语音命令。The second voice command acquisition sub-module 6015 is configured to obtain voice information with the highest resolution among the voice information whose starting time is located in the start time period as a voice command.
语音命令解析模块602,用于根据所述语音命令和预设的唤醒词库,解析出所述语音命令中的唤醒词以及命令内容,所述唤醒词用于唤醒机器人;其中,所述语音命令解析模块602包括:The voice command parsing module 602 is configured to parse the wake-up word and the command content in the voice command according to the voice command and the preset wake-up term library, where the wake-up word is used to wake up the robot; wherein the voice command is The parsing module 602 includes:
唤醒词解析子模块6021,用于根据语音命令和唤醒词库解析出语音命令中的唤醒词;The wake-up word parsing sub-module 6021 is configured to parse the wake-up words in the voice command according to the voice command and the wake-up vocabulary;
命令内容解析子模块6022,用于根据语音命令和唤醒词解析出语音命令中的命令内容。The command content parsing sub-module 6022 is configured to parse the command content in the voice command according to the voice command and the wake-up word.
执行模块603,用于根据解析出的唤醒词和命令内容执行操作,其中,所述执行模块603包括:The executing module 603 is configured to perform an operation according to the parsed wake-up word and the command content, where the executing module 603 includes:
第二执行子模块6031,用于根据命令内容分解任务,将分解后的任务分别通知与唤醒词对应的机器人,以使与唤醒词对应的机器人协作执行命令内容对应的操作。The second execution sub-module 6031 is configured to separately notify the robot corresponding to the wake-up word according to the command content decomposition task, so that the robot corresponding to the wake-up word cooperates to perform the operation corresponding to the command content.
唤醒词库更新模块604,用于更新唤醒词库。The wakeup vocabulary update module 604 is configured to update the wakeup vocabulary.
语音信息监听子模块6011实时监听用户发出的语音,语音命令确认子模块6012确认语音信息监听子模块6011监听到的语音信息是否为用户针对唤醒词库对应的机器人的命令,如果所述语音信息为针对唤醒词库对应的机器人的命令,语音命令记录子模块6013对所述语音信息以及监听到语音信息的起始时刻进行记录,并加入预设的临时命令记录组。临时命令记录组中的机器人会向其他机器人广播自己记录的起始时刻,起始时间段确认子模块6014根据各个起始时刻确认监听到用户语音命令的最早时刻,并根据该最早时刻确定起始时间段。语音信息记录结束后,临时命令记录组中的机器人对自身记录的语音信息进行清晰度判定,得到清晰度分数值,然后向临时命令记录组中的其他机器人广播该清晰度分数值,第二语音命令获取子模块6015确定起始时刻位于所述起始时间段内的语音信息中清晰度最高的语音信息作为语音命令。唤醒词解析子模块6021根据该语音命令解析出唤醒词,命令内容解析子模块6022,根据语音命令和上述唤醒词解析出命令内容。第二执行子模块6031,根据上述命令内容分解任务,促使与唤醒词对应的机器人协作执行命令内容对应的操作。The voice information monitoring sub-module 6011 monitors the voice sent by the user in real time, and the voice command confirmation sub-module 6012 confirms whether the voice information monitored by the voice information monitoring sub-module 6011 is a command of the user for the robot corresponding to the wake-up vocabulary, if the voice information is For the command of the robot corresponding to the vocabulary, the voice command recording sub-module 6013 records the voice information and the start time of the voice information, and adds a preset temporary command record group. The robot in the temporary command record group broadcasts the start time of the record to other robots, and the start time period confirmation sub-module 6014 confirms the earliest time to listen to the user voice command according to each start time, and determines the start according to the earliest time. period. After the voice information recording is finished, the robot in the temporary command recording group performs the resolution determination on the voice information recorded by itself, obtains the sharpness score value, and then broadcasts the sharpness score value to the other robots in the temporary command record group, the second voice. The command acquisition sub-module 6015 determines the voice information with the highest resolution among the voice information whose start time is located in the start time period as a voice command. The wake-up word analysis sub-module 6021 parses the wake-up word according to the voice command, and the command content analysis sub-module 6022 parses the command content according to the voice command and the wake-up word. The second execution sub-module 6031 causes the robot corresponding to the wake-up word to cooperate to perform an operation corresponding to the command content according to the command content decomposition task.
本发明实施例提供的唤醒装置,通过在机器人内预设包括若干个唤醒词的唤醒词库,机器人可以根据预设的唤醒词库正确的解析出语音命令中含有的唤醒词,从而正确的解析出语音命令中的命令内容,进而正确的完成用户交代的任务。通过各机器人记录监听到语音信息的起始时刻,并加入临时命令记录组,通过比较临时命令记录组内的机器人记录的起始时刻,可以确定听到语音信息 的最早时刻,能获得相对完整的语音命令。通过确定起始时间段内清晰度最高的语音信息作为语音命令,能获得相对完整和清晰的语音命令,增强了获取用户语音命令的可靠性。用户可以对多个机器人同时发出指令,多个机器人可以以协作的方式完成用户发布的任务,执行效率更高,用户体验好。The wake-up device provided by the embodiment of the present invention can correct the wake-up words contained in the voice command according to the preset wake-up vocabulary by presetting a wake-up vocabulary including a plurality of wake-up words in the robot, thereby correctly analyzing The content of the command in the voice command is completed, and the task that the user confesses is correctly completed. The start time of the voice information is monitored by each robot, and the temporary command record group is added, and the voice information can be determined by comparing the start time of the robot record in the temporary command record group. At the earliest moment, you can get relatively complete voice commands. By determining the speech information with the highest resolution in the initial time period as the voice command, relatively complete and clear voice commands can be obtained, and the reliability of acquiring the user voice command is enhanced. The user can issue commands to multiple robots at the same time, and multiple robots can complete the tasks issued by the users in a coordinated manner, and the execution efficiency is higher and the user experience is good.
需要说明的是,上述唤醒装置可执行本发明实施例所提供的唤醒方法,具备执行方法相应的功能模块和有益效果。未在唤醒装置实施例中详尽描述的技术细节,可参见本发明实施例所提供的唤醒方法。It should be noted that the above-mentioned wake-up device can perform the wake-up method provided by the embodiment of the present invention, and has the corresponding functional modules and beneficial effects of the execution method. For a technical detail that is not described in detail in the wake-up device embodiment, reference may be made to the wake-up method provided by the embodiment of the present invention.
图12是本发明实施例提供的机器人唤醒方法的机器人700的硬件结构示意图,如图12所示,该机器人700包括:FIG. 12 is a schematic diagram showing the hardware structure of the robot 700 for the robot wake-up method according to the embodiment of the present invention. As shown in FIG. 12, the robot 700 includes:
一个或多个处理器701以及存储器702,图12中以一个处理器701为例。One or more processors 701 and memory 702, one processor 701 is taken as an example in FIG.
处理器701和存储器702可以通过总线或者其他方式连接,图12中以通过总线连接为例。The processor 701 and the memory 702 may be connected by a bus or other means, as exemplified by a bus connection in FIG.
存储器702作为一种非易失性计算机可读存储介质,可用于存储非易失性软件程序、非易失性计算机可执行程序以及模块,如本发明实施例中的唤醒方法对应的程序指令/模块(例如,附图8所示的语音命令获取模块401、语音命令解析模块402、执行模块403)。处理器701通过运行存储在存储器702中的非易失性软件程序、指令以及模块,从而执行服务器的各种功能应用以及数据处理,即实现上述方法实施例的唤醒方法。The memory 702 is a non-volatile computer readable storage medium, and can be used to store non-volatile software programs, non-volatile computer-executable programs, and modules, such as program instructions corresponding to the wake-up method in the embodiment of the present invention. The module (for example, the voice command acquisition module 401, the voice command analysis module 402, and the execution module 403 shown in FIG. 8). The processor 701 executes various functional applications of the server and data processing by executing non-volatile software programs, instructions, and modules stored in the memory 702, that is, implementing the wake-up method of the above method embodiments.
存储器702可以包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需要的应用程序;存储数据区可存储根据唤醒装置的使用所创建的数据等。此外,存储器702可以包括高速随机存取存储器,还可以包括非易失性存储器,例如至少一个磁盘存储器件、闪存器件、或其他非易失性固态存储器件。在一些实施例中,存储器702可选包括相对于处理器701远程设置的存储器,这些远程存储器可以通过网络连接至唤醒装置。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。The memory 702 may include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application required for at least one function; the storage data area may store data created according to the use of the wake-up device, and the like. Moreover, memory 702 can include high speed random access memory, and can also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid state storage device. In some embodiments, memory 702 can optionally include memory remotely located relative to processor 701 that can be connected to the wake-up device over a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
所述一个或者多个模块存储在所述存储器702中,当被所述一个或者多个处理器701执行时,执行上述任意方法实施例中的唤醒方法,例如,执行以上描述的图3中的方法步骤101至步骤103,图4a中的方法步骤1011至步骤1013,图4b中的方法步骤1014至步骤1018,图5中的方法步骤1021至步骤1022,图6a中的方法步骤1031,图6b中的方法步骤1032,图7a中的方法步骤201-205,图7b中的方法步骤301-307;实现图8中的模块401-403、图9中子模块4011和4015,图10中模块501-504,图11中模块601-604、子模块6011-6015、子模块6021-6022、子模块6031的功能。The one or more modules are stored in the memory 702, and when executed by the one or more processors 701, perform the wake-up method in any of the above method embodiments, for example, performing the above described FIG. Method step 101 to step 103, method step 1011 to step 1013 in Fig. 4a, method step 1014 to step 1018 in Fig. 4b, method step 1021 to step 1022 in Fig. 5, method step 1031 in Fig. 6a, Fig. 6b Method step 1032, method steps 201-205 in FIG. 7a, method steps 301-307 in FIG. 7b; implementing modules 401-403 in FIG. 8, sub-modules 4011 and 4015 in FIG. 9, module 501 in FIG. - 504, the functions of the modules 601-604, the sub-module 6011-6015, the sub-module 6021-6022, and the sub-module 6031 in FIG.
上述产品可执行本发明实施例所提供的方法,具备执行方法相应的功能模块和有益效果。未在本实施例中详尽描述的技术细节,可参见本发明实施例所 提供的方法。The above product can perform the method provided by the embodiment of the present invention, and has the corresponding functional modules and beneficial effects of the execution method. For technical details that are not described in detail in this embodiment, reference may be made to the embodiments of the present invention. The method provided.
本发明实施例提供了一种非易失性计算机可读存储介质,所述计算机可读存储介质存储有计算机可执行指令,该计算机可执行指令被一个或多个处理器执行,例如图12中的一个处理器701,可使得上述一个或多个处理器可执行上述任意方法实施例中的唤醒方法,例如,执行以上描述的图3中的方法步骤101至步骤103,图4a中的方法步骤1011至步骤1013,图4b中的方法步骤1014至步骤1018,图5中的方法步骤1021至步骤1022,图6a中的方法步骤1031,图6b中的方法步骤1032,图7a中的方法步骤201-205,图7b中的方法步骤301-307;实现图8中的模块401-403、图9中子模块4011和4015,图10中模块501-504,图11中模块601-604、子模块6011-6015、子模块6021-6022、子模块6031的功能。Embodiments of the present invention provide a non-transitory computer readable storage medium storing computer-executable instructions that are executed by one or more processors, such as in FIG. The processor 701 is configured to enable the one or more processors to perform the wake-up method in any of the foregoing method embodiments, for example, to perform the method steps 101 to 103 in FIG. 3 described above, the method steps in FIG. 4a 1011 to step 1013, method step 1014 to step 1018 in Fig. 4b, method step 1021 to step 1022 in Fig. 5, method step 1031 in Fig. 6a, method step 1032 in Fig. 6b, method step 201 in Fig. 7a - 205, method steps 301-307 in FIG. 7b; implementing modules 401-403 in FIG. 8, sub-modules 4011 and 4015 in FIG. 9, modules 501-504 in FIG. 10, modules 601-604, sub-modules in FIG. Functions of 6011-6015, submodule 6021-6022, and submodule 6031.
以上所描述的装置实施例仅仅是示意性的,其中所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案的目的。The device embodiments described above are merely illustrative, wherein the units described as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units, ie may be located A place, or it can be distributed to multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
通过以上的实施方式的描述,本领域普通技术人员可以清楚地了解到各实施方式可借助软件加通用硬件平台的方式来实现,当然也可以通过硬件。本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程是可以通过计算机程序来指令相关的硬件来完成,所述的程序可存储于一计算机可读取存储介质中,该程序在执行时,可包括如上述各方法的实施例的流程。其中,所述的存储介质可为磁碟、光盘、只读存储记忆体(Read-Only Memory,ROM)或随机存储记忆体(Random Access Memory,RAM)等。Through the description of the above embodiments, those skilled in the art can clearly understand that the various embodiments can be implemented by means of software plus a general hardware platform, and of course, by hardware. A person skilled in the art can understand that all or part of the process of implementing the above embodiments can be completed by a computer program to instruct related hardware, and the program can be stored in a computer readable storage medium. When executed, the flow of an embodiment of the methods as described above may be included. The storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), or a random access memory (RAM).
最后应说明的是:以上实施例仅用以说明本发明的技术方案,而非对其限制;在本发明的思路下,以上实施例或者不同实施例中的技术特征之间也可以进行组合,步骤可以以任意顺序实现,并存在如上所述的本发明的不同方面的许多其它变化,为了简明,它们没有在细节中提供;尽管参照前述实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本发明各实施例技术方案的范围。 Finally, it should be noted that the above embodiments are only used to illustrate the technical solutions of the present invention, and are not limited thereto; in the idea of the present invention, the technical features in the above embodiments or different embodiments may also be combined. The steps may be carried out in any order, and there are many other variations of the various aspects of the invention as described above, which are not provided in the details for the sake of brevity; although the invention has been described in detail with reference to the foregoing embodiments, It should be understood by those skilled in the art that the technical solutions described in the foregoing embodiments may be modified or equivalently substituted for some of the technical features; and the modifications or substitutions do not deviate from the embodiments of the present invention. The scope of the technical solution.

Claims (19)

  1. 一种机器人唤醒方法,所述唤醒方法应用于机器人,其特征在于,所述方法包括:A robot wake-up method, the wake-up method applied to a robot, wherein the method comprises:
    获取语音命令;Get a voice command;
    根据所述语音命令和预设的唤醒词库,解析出所述语音命令中的唤醒词以及命令内容,所述唤醒词库包括至少两个唤醒词,所述唤醒词库中的唤醒词用于唤醒至少两个机器人,所述机器人对应一个或一个以上的唤醒词;And parsing the wake-up words and the command content in the voice command according to the voice command and the preset wake-up word library, the wake-up word library includes at least two wake-up words, and the wake-up words in the wake-up word library are used for Wake up at least two robots corresponding to one or more wake words;
    根据解析出的唤醒词和命令内容执行操作。Perform operations based on the parsed wakeup words and command content.
  2. 根据权利要求1所述的方法,其特征在于,所述获取语音命令包括:The method according to claim 1, wherein the obtaining the voice command comprises:
    监听语音信息;Monitor voice information;
    确认所述语音信息是否为针对唤醒词库对应的机器人的命令;Determining whether the voice information is a command for a robot corresponding to the wakeup vocabulary;
    如果所述语音信息为针对唤醒词库对应的机器人的命令,则记录所述语音信息作为语音命令。If the voice information is a command for a robot corresponding to the wakeup vocabulary, the voice information is recorded as a voice command.
  3. 根据权利要求1所述的方法,其特征在于,所述获取语音命令包括:The method according to claim 1, wherein the obtaining the voice command comprises:
    监听语音信息;Monitor voice information;
    确认所述语音信息是否为针对唤醒词库对应的机器人的命令;Determining whether the voice information is a command for a robot corresponding to the wakeup vocabulary;
    如果所述语音信息为针对唤醒词库对应的机器人的命令,则记录所述语音信息以及监听到语音信息的起始时刻,并加入预设的临时命令记录组;If the voice information is a command for the robot corresponding to the wakeup vocabulary, recording the voice information and monitoring the start time of the voice information, and adding a preset temporary command record group;
    确认临时命令记录组内的机器人记录的起始时刻中的最早时刻,根据所述最早时刻确定起始时间段;Confirming the earliest time in the start time of the robot record in the temporary command record group, and determining the start time period according to the earliest time;
    获得起始时刻位于所述起始时间段内的语音信息中清晰度最高的语音信息作为语音命令。The speech information with the highest resolution among the speech information whose starting time is located in the initial time period is obtained as a voice command.
  4. 根据权利要求2或3所述的方法,其特征在于,所述确认所述语音信息是否为针对唤醒词库对应的机器人的命令,包括:The method according to claim 2 or 3, wherein the confirming whether the voice information is a command for a robot corresponding to the wakeup vocabulary comprises:
    如果所述语音信息包括预设唤醒词库中的任一唤醒词且该唤醒词的出现为呼唤,则所述语音信息为针对唤醒词库对应的机器人的命令。If the voice information includes any wake-up word in the preset wake-up vocabulary and the appearance of the wake-up word is a call, the voice information is a command for the robot corresponding to the wake-up vocabulary.
  5. 根据权利要求1-4的任意一项所述的方法,其特征在于,所述方法还包括:The method according to any one of claims 1 to 4, further comprising:
    更新唤醒词库;Update the wakeup vocabulary;
    所述更新唤醒词库,包括:The update wake-up vocabulary includes:
    根据唤醒词设置指令设置唤醒词,并广播所述唤醒词。The wake-up word is set according to the wake-up word setting instruction, and the wake-up word is broadcast.
  6. 根据权利要求5所述的方法,其特征在于,所述更新唤醒词库,还包括:The method of claim 5, wherein the updating the wake-up vocabulary further comprises:
    接收广播的唤醒词,将所述唤醒词加入预设的唤醒词库中,并将更新后的 唤醒词库发送给广播唤醒词的机器人。Receiving a wake-up word of the broadcast, adding the wake-up word to the preset wake-up word library, and updating the Wake up the thesaurus to the robot that broadcasts the wake-up words.
  7. 根据权利要求1-4的任意一项所述的方法,其特征在于,所述根据解析出的唤醒词和命令内容执行操作,包括:The method according to any one of claims 1 to 4, wherein the performing the operation according to the parsed wake-up word and the command content comprises:
    将命令内容通知与唤醒词对应的机器人,以使与唤醒词对应的机器人执行命令内容对应的操作。The robot corresponding to the wake-up word is notified of the command content so that the robot corresponding to the wake-up word performs an operation corresponding to the command content.
  8. 根据权利要求1-4的任意一项所述的方法,其特征在于,所述根据解析出的唤醒词和命令内容执行操作,包括:The method according to any one of claims 1 to 4, wherein the performing the operation according to the parsed wake-up word and the command content comprises:
    根据命令内容分解任务,将分解后的任务分别通知与唤醒词对应的机器人,以使与唤醒词对应的机器人协作执行命令内容对应的操作。According to the command content decomposition task, the decomposed task is separately notified to the robot corresponding to the wake-up word, so that the robot corresponding to the wake-up word cooperates to perform the operation corresponding to the command content.
  9. 一种机器人唤醒装置,所述唤醒装置应用于机器人,其特征在于,所述装置包括:A robot wake-up device, the wake-up device being applied to a robot, characterized in that the device comprises:
    语音命令获取模块,用于获取语音命令;a voice command acquisition module, configured to acquire a voice command;
    语音命令解析模块,用于根据所述语音命令和预设的唤醒词库,解析出所述语音命令中的唤醒词以及命令内容,所述唤醒词库包括至少两个唤醒词,所述唤醒词库中的唤醒词用于唤醒至少两个机器人,所述机器人对应一个或一个以上的唤醒词;a voice command parsing module, configured to parse an awakening word and a command content in the voice command according to the voice command and a preset wake-up term library, where the wake-up word library includes at least two wake-up words, the wake-up word The wake-up words in the library are used to wake up at least two robots, the robot corresponding to one or more wake-up words;
    执行模块,用于根据解析出的唤醒词和命令内容执行操作。An execution module is configured to perform an operation according to the parsed wake-up words and command content.
  10. 根据权利要求9所述的装置,其特征在于,所述语音命令获取模块包括:The device according to claim 9, wherein the voice command acquisition module comprises:
    语音信息监听子模块,用于监听语音信息;a voice information monitoring sub-module for monitoring voice information;
    语音命令确认子模块,用于确认所述语音信息是否为针对唤醒词库对应的机器人的命令;a voice command confirmation submodule, configured to confirm whether the voice information is a command for a robot corresponding to the wakeup vocabulary;
    第一语音命令获取子模块,用于如果所述语音信息为针对唤醒词库对应的机器人的命令,则记录所述语音信息作为语音命令。The first voice command acquisition submodule is configured to record the voice information as a voice command if the voice information is a command for a robot corresponding to the wakeup vocabulary.
  11. 根据权利要求9所述的装置,其特征在于,所述语音命令获取模块包括:The device according to claim 9, wherein the voice command acquisition module comprises:
    语音信息监听子模块,用于监听语音信息;a voice information monitoring sub-module for monitoring voice information;
    语音命令确认子模块,用于确认所述语音信息是否为针对唤醒词库对应的机器人的命令;a voice command confirmation submodule, configured to confirm whether the voice information is a command for a robot corresponding to the wakeup vocabulary;
    语音命令记录子模块,用于如果所述语音信息为针对唤醒词库对应的机器人的命令,则记录所述语音信息以及监听到语音信息的起始时刻,并加入预设的临时命令记录组;a voice command recording submodule, configured to record the voice information and the start time of the voice information if the voice information is a command for the robot corresponding to the wakeup vocabulary, and add a preset temporary command record group;
    起始时间段确认子模块,用于确认临时命令记录组内的机器人记录的起始时刻中的最早时刻,根据所述最早时刻确定起始时间段;a start time period confirmation submodule, configured to confirm an earliest time in a start time of the robot record in the temporary command record group, and determine a start time period according to the earliest time;
    第二语音命令获取子模块,用于获得起始时刻位于所述起始时间段内的语 音信息中清晰度最高的语音信息作为语音命令。a second voice command acquisition submodule, configured to obtain a language whose starting time is within the starting time period The highest-resolution voice information in the voice information is used as a voice command.
  12. 根据权利要求10或11所述的装置,其特征在于,所述语音命令确认子模块包括:The device according to claim 10 or 11, wherein the voice command confirmation submodule comprises:
    语音命令确认子单元,用于如果所述语音信息包括预设唤醒词库中的任一唤醒词且该唤醒词的出现为呼唤,则所述语音信息为针对唤醒词库对应的机器人的命令。The voice command confirmation sub-unit is configured to: if the voice information includes any wake-up words in the preset wake-up vocabulary and the appearance of the wake-up word is a call, the voice information is a command for the robot corresponding to the wake-up vocabulary.
  13. 根据权利要求9-12的任意一项所述的装置,其特征在于,所述装置还包括:The device according to any one of claims 9 to 12, wherein the device further comprises:
    唤醒词库更新模块,用于更新唤醒词库;The wakeup vocabulary update module is used to update the wakeup vocabulary;
    所述唤醒词库更新模块包括:The wakeup vocabulary update module includes:
    第一唤醒词库更新子模块,用于根据唤醒词设置指令设置唤醒词,并广播所述唤醒词。The first wakeup vocabulary update submodule is configured to set a wakeup word according to the wakeup word setting instruction, and broadcast the wakeup word.
  14. 根据权利要求13所述的装置,其特征在于,所述唤醒词库更新模块还包括:The device according to claim 13, wherein the wake-up thesaurus update module further comprises:
    第二唤醒词库更新子模块,用于接收广播的唤醒词,将所述唤醒词加入预设的唤醒词库中,并将更新后的唤醒词库发送给广播唤醒词的机器人。The second wake-up vocabulary update sub-module is configured to receive the broadcasted wake-up word, add the wake-up word to the preset wake-up vocabulary, and send the updated wake-up vocabulary to the robot that broadcasts the wake-up word.
  15. 根据权利要求9-12的任意一项所述的装置,其特征在于,所述执行模块包括:The apparatus according to any one of claims 9 to 12, wherein the execution module comprises:
    第一执行子模块,用于将命令内容通知与唤醒词对应的机器人,以使与唤醒词对应的机器人执行命令内容对应的操作。The first execution submodule is configured to notify the robot corresponding to the wakeup word of the command content, so that the robot corresponding to the wakeup word performs an operation corresponding to the command content.
  16. 根据权利要求9-12的任意一项所述的装置,其特征在于,所述执行模块包括:The apparatus according to any one of claims 9 to 12, wherein the execution module comprises:
    第二执行子模块,用于根据命令内容分解任务,将分解后的任务分别通知与唤醒词对应的机器人,以使与唤醒词对应的机器人协作执行命令内容对应的操作。The second execution sub-module is configured to respectively notify the robot corresponding to the wake-up word according to the command content decomposition task, so that the robot corresponding to the wake-up word cooperates to perform the operation corresponding to the command content.
  17. 一种机器人,其特征在于,包括:A robot characterized by comprising:
    至少一个处理器;以及,At least one processor; and,
    与所述至少一个处理器通信连接的存储器;其中,a memory communicatively coupled to the at least one processor; wherein
    所述存储器存储有可被所述至少一个处理器执行的指令,所述指令被所述至少一个处理器执行,以使所述至少一个处理器能够执行权利要求1-8任一项所述的方法。The memory stores instructions executable by the at least one processor, the instructions being executed by the at least one processor to enable the at least one processor to perform the method of any of claims 1-8 method.
  18. 一种非易失性计算机可读存储介质,其特征在于,所述计算机可读存储介质存储有计算机可执行指令,当所述计算机可执行指令被机器人执行时,使所述机器人执行执行权利要求1-8任一项所述的方法。A non-transitory computer readable storage medium, wherein the computer readable storage medium stores computer executable instructions that, when executed by a robot, cause the robot to execute an execution claim The method of any of 1-8.
  19. 一种计算机程序产品,其特征在于,所述计算机程序产品包括存储在 非易失性计算机可读存储介质上的计算机程序,所述计算机程序包括程序指令,当所述程序指令被机器人执行时,使所述机器人执行权利要求1-8任一项所述的方法。 A computer program product, characterized in that the computer program product comprises A computer program on a non-transitory computer readable storage medium, the computer program comprising program instructions that, when executed by a robot, cause the robot to perform the method of any of claims 1-8.
PCT/CN2017/075588 2017-03-03 2017-03-03 Wake-up method and device for robot, and robot WO2018157388A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201780000607.1A CN107223280B (en) 2017-03-03 2017-03-03 Robot awakening method and device and robot
PCT/CN2017/075588 WO2018157388A1 (en) 2017-03-03 2017-03-03 Wake-up method and device for robot, and robot

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/075588 WO2018157388A1 (en) 2017-03-03 2017-03-03 Wake-up method and device for robot, and robot

Publications (1)

Publication Number Publication Date
WO2018157388A1 true WO2018157388A1 (en) 2018-09-07

Family

ID=59955075

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/075588 WO2018157388A1 (en) 2017-03-03 2017-03-03 Wake-up method and device for robot, and robot

Country Status (2)

Country Link
CN (1) CN107223280B (en)
WO (1) WO2018157388A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109003611A (en) * 2018-09-29 2018-12-14 百度在线网络技术(北京)有限公司 Method, apparatus, equipment and medium for vehicle audio control

Families Citing this family (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109725798B (en) * 2017-10-25 2021-07-27 腾讯科技(北京)有限公司 Intelligent role switching method and related device
WO2019087546A1 (en) * 2017-10-30 2019-05-09 ソニー株式会社 Information processing device and information processing method
CN107895578B (en) * 2017-11-15 2021-07-20 百度在线网络技术(北京)有限公司 Voice interaction method and device
CN108177151A (en) * 2017-11-28 2018-06-19 上海魔龙机器人科技有限公司 A kind of robot and its ontology voice interactive system
CN108320733B (en) * 2017-12-18 2022-01-04 上海科大讯飞信息科技有限公司 Voice data processing method and device, storage medium and electronic equipment
CN109963233B (en) * 2017-12-22 2021-03-02 深圳市优必选科技有限公司 Method and device for updating robot wake-up word and terminal equipment
CN109977364B (en) * 2017-12-27 2023-05-26 深圳市优必选科技有限公司 Method and device for obtaining robot wake-up word
CN110097878A (en) * 2018-01-30 2019-08-06 阿拉的(深圳)人工智能有限公司 Polygonal color phonetic prompt method, cloud device, prompt system and storage medium
CN110134360A (en) * 2018-02-09 2019-08-16 阿拉的(深圳)人工智能有限公司 Intelligent voice broadcasting method, broadcast device, storage medium and intelligent sound box
CN108536668B (en) * 2018-02-26 2022-06-07 科大讯飞股份有限公司 Wake-up word evaluation method and device, storage medium and electronic equipment
CN109358751A (en) * 2018-10-23 2019-02-19 北京猎户星空科技有限公司 A kind of wake-up control method of robot, device and equipment
CN109949447A (en) * 2018-12-08 2019-06-28 浙江国自机器人技术有限公司 Identity identifying method for IDC crusing robot
CN110797015B (en) * 2018-12-17 2020-09-29 北京嘀嘀无限科技发展有限公司 Voice wake-up method and device, electronic equipment and storage medium
CN110428821A (en) * 2019-07-26 2019-11-08 广州市申迪计算机系统有限公司 A kind of voice command control method and device for crusing robot
CN111091814A (en) * 2019-12-13 2020-05-01 晶晨半导体(深圳)有限公司 Method for constructing multi-voice assistant
CN111429901B (en) * 2020-03-16 2023-03-21 云知声智能科技股份有限公司 IoT chip-oriented multi-stage voice intelligent awakening method and system
CN114343483B (en) * 2020-10-12 2023-08-18 百度在线网络技术(北京)有限公司 Control method, device, equipment and storage medium for movable object
CN112382281B (en) * 2020-11-05 2023-11-21 北京百度网讯科技有限公司 Voice recognition method, device, electronic equipment and readable storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003305674A (en) * 2002-04-11 2003-10-28 Sony Corp Robot device, robot control method, recording medium and program
CN101894553A (en) * 2010-07-23 2010-11-24 四川长虹电器股份有限公司 Realization method of television voice control
CN105206271A (en) * 2015-08-25 2015-12-30 北京宇音天下科技有限公司 Intelligent equipment voice wake-up method and system for realizing method
CN105632493A (en) * 2016-02-05 2016-06-01 深圳前海勇艺达机器人有限公司 Method for controlling and wakening robot through voice
CN105869637A (en) * 2016-05-26 2016-08-17 百度在线网络技术(北京)有限公司 Voice wake-up method and device

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1215658A3 (en) * 2000-12-05 2002-08-14 Hewlett-Packard Company Visual activation of voice controlled apparatus
US9705736B2 (en) * 2014-03-14 2017-07-11 Ray Wang Method and system for a personal network
JP2015184563A (en) * 2014-03-25 2015-10-22 シャープ株式会社 Interactive household electrical system, server device, interactive household electrical appliance, method for household electrical system to interact, and program for realizing the same by computer
CN104538030A (en) * 2014-12-11 2015-04-22 科大讯飞股份有限公司 Control system and method for controlling household appliances through voice
CN105563484B (en) * 2015-12-08 2018-04-10 深圳达闼科技控股有限公司 Cloud robot system, robot and robot cloud platform
CN105553799A (en) * 2016-02-29 2016-05-04 深圳市广佳乐新智能科技有限公司 Intelligent housing system based on voice recognition

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003305674A (en) * 2002-04-11 2003-10-28 Sony Corp Robot device, robot control method, recording medium and program
CN101894553A (en) * 2010-07-23 2010-11-24 四川长虹电器股份有限公司 Realization method of television voice control
CN105206271A (en) * 2015-08-25 2015-12-30 北京宇音天下科技有限公司 Intelligent equipment voice wake-up method and system for realizing method
CN105632493A (en) * 2016-02-05 2016-06-01 深圳前海勇艺达机器人有限公司 Method for controlling and wakening robot through voice
CN105869637A (en) * 2016-05-26 2016-08-17 百度在线网络技术(北京)有限公司 Voice wake-up method and device

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109003611A (en) * 2018-09-29 2018-12-14 百度在线网络技术(北京)有限公司 Method, apparatus, equipment and medium for vehicle audio control

Also Published As

Publication number Publication date
CN107223280A (en) 2017-09-29
CN107223280B (en) 2021-01-08

Similar Documents

Publication Publication Date Title
WO2018157388A1 (en) Wake-up method and device for robot, and robot
AU2019246868B2 (en) Method and system for voice activation
CN112272819B (en) Method and system for passively waking up user interaction device
WO2018205083A1 (en) Robot wakeup method and device, and robot
EP3669355B1 (en) Voice-activated selective memory for voice-capturing devices
CN106297781B (en) Control method and controller
WO2017071645A1 (en) Voice control method, device and system
CN108962262B (en) Voice data processing method and device
US20170330566A1 (en) Distributed Volume Control for Speech Recognition
WO2016127550A1 (en) Method and device for human-machine voice interaction
JP6730994B2 (en) Question/answer information processing method, device, storage medium, and device
EP3611724A1 (en) Voice response method and device, and smart device
CN109147779A (en) Voice data processing method and device
KR20210008521A (en) Dynamic and/or context-specific hot words to invoke automated assistants
WO2016112644A1 (en) Voice control method, apparatus, and terminal
WO2016107362A1 (en) Method and system for setting alarm clock of wireless music system
US11289090B2 (en) Systems and methods for addressing possible interruption during interaction with digital assistant
US11790901B2 (en) Task-oriented dialog suitable for a standalone device
WO2018107389A1 (en) Method and apparatus for joint assistance by means of voice, and robot
CN111933149A (en) Voice interaction method, wearable device, terminal and voice interaction system
WO2013071738A1 (en) Personal dedicated living auxiliary equipment and method
US20210329047A1 (en) Method, apparatus, electronic device and storage medium for acquiring programs in live streaming room
CN112185394A (en) Playing method, device and playing system of equipment group
JP7288885B2 (en) Voice interaction method, device, equipment and storage medium
CN112447177B (en) Full duplex voice conversation method and system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17898709

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 13/01/2020)

122 Ep: pct application non-entry in european phase

Ref document number: 17898709

Country of ref document: EP

Kind code of ref document: A1