WO2021004236A1 - Voice control method and system, device and computer-readable storage medium - Google Patents

Info

Publication number
WO2021004236A1
WO2021004236A1 PCT/CN2020/096267 CN2020096267W WO2021004236A1 WO 2021004236 A1 WO2021004236 A1 WO 2021004236A1 CN 2020096267 W CN2020096267 W CN 2020096267W WO 2021004236 A1 WO2021004236 A1 WO 2021004236A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice
target
storage space
preset
duration
Prior art date
Application number
PCT/CN2020/096267
Other languages
French (fr)
Chinese (zh)
Inventor
庄健春
Original Assignee
深圳开立生物医疗科技股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳开立生物医疗科技股份有限公司
Publication of WO2021004236A1

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/22: Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223: Execution procedure of a spoken command

Definitions

  • This application relates to the field of communication technology, and more specifically, to a voice control method, system, device, and computer-readable storage medium.
  • With the development of communication technology, more and more smart devices have entered users' lives and attracted their attention.
  • One feature of smart devices is that they can recognize and respond to users' voices.
  • Taking a mobile phone as an example of a smart device: after the user wakes up the phone's voice recognition function with a specific utterance, the phone collects the voice input by the user over a period of time, processes it, enters a sleep state once processing is complete, and waits to be woken by the user again.
  • As a result, a user of the voice interaction function of a smart device such as a mobile phone has to wake the device repeatedly, and if voice input is not completed within a specific time after waking, the device still goes to sleep.
  • The user experience of such smart devices is poor, and they process voice inefficiently.
  • The portability of a mobile phone can partly make up for the shortcomings of voice triggering (long-pressing the menu button, etc.), but for relatively large smart devices that are not portable, the operation is time-consuming and laborious.
  • The purpose of this application is to provide a voice control method that can, to a certain extent, improve the efficiency with which smart devices process voice.
  • This application also provides a voice control system, equipment, and computer-readable storage medium.
  • a voice control method applied to smart devices including:
  • the target speech is composed of speech units
  • the continuously collecting voice to obtain the target voice includes:
  • before the judging whether the current moment belongs to a preset voice collection moment, the method further includes:
  • determining the voice collection moments and the preset duration according to the principle that the duration between adjacent voice collection moments is less than the preset duration, and the preset duration is greater than or equal to the voice duration of the target command.
  • the determining of the voice collection moments and the preset duration according to that principle includes:
  • determining the voice collection moments and the preset duration according to a duration relationship formula, following the principle that the duration between adjacent voice collection moments is less than the preset duration, and the preset duration is greater than or equal to the voice duration of the target command;
  • the duration relationship formula includes: $X \le (N-1)L/N$; $L = NP$;
  • X represents the voice duration of the target command
  • N represents a positive integer greater than 1
  • L represents the preset duration
  • P represents the duration between adjacent voice collection moments.
  • the collecting a voice of a preset duration from the current moment as the voice unit includes:
  • the duration of the voice that can be stored in the storage space is the preset duration.
  • the selecting a free storage space for storing voice as the target storage space includes:
  • a free storage space is selected as the target storage space.
  • the voices collected from the current moment are stored in the target storage space until the target storage space is full, and after the voice unit is obtained, the method further includes:
  • the recognizing the target command in the target voice includes:
  • the target voice is matched with a preset grammar, and if the matching is successful, the preset grammar that matches the target voice is mapped to the target command.
  • the smart device includes an ultrasound device
  • the recognizing the target command in the target voice and responding to the target command includes:
  • a voice control system applied to smart devices including:
  • the first collection module is used to continuously collect voices to obtain the target voice when it is determined to perform the voice interaction function
  • the first recognition module is used to recognize the target command in the target voice and respond to the target command.
  • An ultrasound device including:
  • Memory used to store computer programs
  • the processor is used to implement the steps of any of the above voice control methods when executing the computer program.
  • a computer-readable storage medium is applied to a smart device.
  • the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the steps of any of the above voice control methods are realized.
  • the voice control method provided by the present application is applied to a smart device.
  • the voice is continuously collected to obtain the target voice; the target command in the target voice is recognized, and the target command is responded to.
  • With the voice control method provided by this application, when the smart device determines that the voice interaction function is to be performed, it continuously collects voice to obtain the target voice, recognizes the target command in the target voice, and responds to the target command. Because voice is collected continuously, the user does not need to keep waking the smart device in order to keep inputting voice, and the smart device never goes to sleep before the voice has been received. This improves the efficiency with which the smart device collects voice and, in turn, the efficiency of voice processing.
  • the voice control system, equipment, and computer-readable storage medium provided by this application also solve the corresponding technical problems.
  • FIG. 1 is a first flowchart of a voice control method provided by an embodiment of this application
  • FIG. 2 is a second flowchart of a voice control method provided by an embodiment of the application.
  • Figure 3 is a schematic diagram of the relationship between the voice duration of the target command, the preset duration, and the duration between adjacent voice collection moments;
  • FIG. 4 is a schematic structural diagram of a voice control system provided by an embodiment of this application.
  • FIG. 5 is a schematic structural diagram of a voice control device provided by an embodiment of this application.
  • FIG. 6 is another schematic structural diagram of a voice control device provided by an embodiment of the application.
  • the voice control method can improve the convenience and voice processing efficiency of a user when using a smart device.
  • FIG. 1 is a first flowchart of a voice control method according to an embodiment of the application.
  • A voice control method provided by an embodiment of the present application, applied to a smart device, may include the following steps:
  • Step S101: When it is determined that the voice interaction function is to be performed, continuously collect voice to obtain the target voice.
  • When the smart device determines that the voice interaction function is to be performed, it keeps collecting voice and obtains the corresponding target voice.
  • the type of smart device can be determined according to actual needs, for example, it can be a mobile phone, a tablet, an ultrasound device, etc.
  • How the smart device determines that the voice interaction function is to be performed can also be chosen flexibly according to actual needs. For example, the smart device can make this determination after receiving a specific trigger command, when a specific button of the device is triggered, or when a button of the device is triggered in a specific trigger mode.
  • Step S102: Recognize the target command in the target voice, and respond to the target command.
  • The smart device can recognize the target command in the target voice and respond to the target command accordingly.
  • a grammar recognition network can be built in the smart device in advance, and the target voice can be matched with the grammar recognition network to obtain the corresponding target command.
  • When recognizing the target command in the target voice, the target voice can also be matched directly against a preset grammar; if the matching succeeds, the preset grammar that matches the target voice is mapped to the target command.
  • The smart device may be an ultrasound device.
  • In that case, recognizing the target command in the target voice and responding to it means recognizing the ultrasound command in the target voice and responding to that command.
  • the process of whether the smart device turns off the voice interaction function can be controlled by the outside world.
  • If the outside world controls whether the smart device turns off the voice interaction function through instructions or the like, then after responding to the target command the smart device can also determine whether it has received a voice interaction function close instruction.
  • If the voice interaction function close instruction has been received, voice collection stops; if it has not been received, voice collection continues.
  • the voice interaction function closing instruction may be an instruction input by the user's voice, or an instruction generated after the user triggers a button on the smart device.
  • the voice control method provided by the present application is applied to a smart device.
  • the voice is continuously collected to obtain the target voice; the target command in the target voice is recognized, and the target command is responded to.
  • With the voice control method provided by this application, when the smart device determines that the voice interaction function is to be performed, it continuously collects voice to obtain the target voice, recognizes the target command in the target voice, and responds to the target command. Because voice is collected continuously, the user does not need to keep waking the smart device in order to keep inputting voice, and the smart device never goes to sleep before the voice has been received. This improves the efficiency with which the smart device collects voice and, in turn, the efficiency of voice processing. Because repeated waking is unnecessary, operation is simple and convenient, which suits large smart devices.
  • FIG. 2 is a second flowchart of a voice control method provided by an embodiment of this application.
  • the target voice in this application may be composed of multiple voice units, and a voice control method provided in an embodiment of this application may include the following steps:
  • Step S201: When it is determined that the voice interaction function is to be performed, determine whether the current moment belongs to the preset voice collection moments; if it does, perform step S202; if not, return to step S201.
  • Step S202: Starting from the current moment, collect a voice of the preset duration as a voice unit, and perform step S203.
  • the power consumption of the smart device will be large.
  • Command recognition and processing are performed on the collected voice in units of voice units; that is, while the next voice unit is being collected, an already collected voice unit can be processed. Compared with processing the voice only after the complete target voice has been collected, this improves the efficiency of command recognition and processing. Note that the voice collection moments in this application form a set of moments; that is, there is more than one voice collection moment, and their number can be determined by the voice collection duration in the specific application scenario.
  • the target commands in the target voice may be stored in a voice unit.
  • the command is recognized for each voice unit.
  • the target voice collected by the smart device will be incomplete, and the smart device may not be able to recognize the instructions in the target voice.
  • The duration between adjacent voice collection moments can be made less than the preset duration, with the preset duration greater than or equal to the voice duration of the target command.
  • In this way, a complete target command tends to be captured within a single voice unit, so the smart device can obtain the complete target command from one voice unit, does not need to stitch together partially recognized commands, and processes voice more efficiently.
  • Other methods may also be used to determine the voice collection moments and the preset duration; this application does not specifically limit them.
  • The voice collection moments and the preset duration can be determined according to a duration relationship formula, following the principle that the duration between adjacent voice collection moments is less than the preset duration, and the preset duration is greater than or equal to the voice duration of the target command;
  • the duration relationship formula includes: $X \le (N-1)L/N$; $L = NP$;
  • X represents the voice duration of the target command
  • N represents a positive integer greater than 1
  • L represents the preset duration
  • P represents the duration between adjacent voice collection moments.
  • FIG. 3 is a schematic diagram of the relationship between the voice duration of the target command, the preset duration, and the duration between adjacent voice collection moments.
  • $X \le (N-1)P$, that is, $X \le (N-1)L/N$.
  • the voice duration of the target command is 2 seconds
  • the duration between adjacent voice collection moments is 2 seconds
  • the preset duration is 4 seconds.
  • With these values, the target command is completely collected into a single voice unit no matter when it is spoken.
  • Step S203: Recognize the target command in the voice unit, and respond to the target command.
  • different voice storage carriers can be used to distinguish the target voice collected at different voice collection moments.
  • The voice unit can be saved with the help of a storage space whose capacity is exactly the duration of one voice unit. One storage space can then hold only one voice unit, so different voice units can be distinguished by their storage spaces.
  • The number of existing storage spaces may be limited, and if all of them are occupied, storing a voice unit becomes a problem. To avoid this, when selecting a free storage space for storing voice as the target storage space, the smart device can first determine whether a free storage space exists: if none exists, it creates a storage space and uses it as the target storage space; if one exists, it selects a free storage space as the target storage space.
  • Storage spaces can be used not only to distinguish different voice units but also to process them.
  • After the smart device has stored the voice collected from the current moment in the target storage space until the target storage space is full and the voice unit is obtained, it can also move the voice unit from the target storage space to a preset audio queue and release the target storage space. Correspondingly, when recognizing the target command in the voice unit, it can take one voice unit from the preset audio queue for recognition and delete that voice unit from the preset audio queue.
  • After the smart device obtains a voice unit, it stores the voice unit in the preset audio queue and then releases the target storage space so that the space can hold the next voice unit, which reduces the number of storage spaces created and improves storage utilization. Because the smart device takes only one voice unit from the preset audio queue for recognition at a time, it never has to recognize several voice units, and hence several commands, at once; this avoids recognition errors caused by too many commands in a single recognition pass and helps keep voice recognition accurate.
  • the present application also provides a voice control system, which has the corresponding effects of the voice control method provided in the embodiments of the present application.
  • FIG. 4 is a schematic structural diagram of a voice control system provided by an embodiment of the application.
  • a voice control system provided by an embodiment of the present application, applied to a smart device may include:
  • the first collection module 101 is configured to continuously collect voices to obtain target voices when it is determined to perform the voice interaction function;
  • the first recognition module 102 is configured to recognize the target command in the target voice and respond to the target command.
  • a voice control system provided by an embodiment of the present application is applied to a smart device, and the target voice may be composed of voice units;
  • the first collection module may include:
  • the first judgment sub-module is used to judge whether the current moment belongs to the preset voice collection moments; if the current moment belongs to the voice collection moments, to collect, starting from the current moment, a voice of the preset duration as the voice unit; and if the current moment does not belong to the voice collection moments, to return to the step of judging whether the current moment belongs to the preset voice collection moments.
  • a voice control system provided by an embodiment of the present application, applied to a smart device may also include:
  • the first determining sub-module is used to determine, before the first judgment sub-module judges whether the current moment belongs to the preset voice collection moments, the voice collection moments and the preset duration according to the principle that the duration between adjacent voice collection moments is less than the preset duration, and the preset duration is greater than or equal to the voice duration of the target command.
  • a voice control system provided by an embodiment of the present application is applied to a smart device, and the first determining submodule may include:
  • the first determining unit is configured to determine the voice collection moments and the preset duration according to a duration relationship formula, following the principle that the duration between adjacent voice collection moments is less than the preset duration, and the preset duration is greater than or equal to the voice duration of the target command;
  • the duration relationship formula includes: $X \le (N-1)L/N$; $L = NP$;
  • X represents the voice duration of the target command
  • N represents a positive integer greater than 1
  • L represents the preset duration
  • P represents the duration between adjacent voice collection moments.
  • the voice control system provided by the embodiment of the present application is applied to a smart device, and the first judgment submodule may include:
  • the first selection sub-module is used to select a free storage space for storing voice as the target storage space
  • the first storage sub-module is used to store the voices collected from the current moment in the target storage space until the target storage space is filled to obtain the voice unit;
  • the duration of the voice that can be stored in the storage space is the preset duration.
  • the voice control system provided by the embodiment of the present application is applied to a smart device, and the first selection submodule may include:
  • the first judging unit is used to judge whether there is a free storage space; if there is no free storage space, a storage space is created and used as the target storage space; if there is a free storage space, a free storage space is selected as the target storage space.
  • a voice control system provided by an embodiment of the present application, applied to a smart device may also include:
  • the second storage sub-module is used to store the voice unit from the target storage space into the preset audio queue after the first storage sub-module has stored the voice collected from the current moment in the target storage space until the target storage space is full and the voice unit is obtained;
  • the first release submodule is used to release the target storage space
  • the first recognition module may include:
  • the first acquisition sub-module is used to acquire a voice unit from the preset audio queue for recognition
  • the first deletion sub-module is used to delete the selected voice unit from the preset audio queue.
  • the voice control system provided by the embodiment of the present application is applied to a smart device, and the first recognition module may include:
  • the first matching unit is configured to match the target voice with the preset grammar, and if the matching is successful, map the preset grammar matching the target voice to the target command.
  • a voice control system provided by an embodiment of the application is applied to a smart device, and the smart device may include an ultrasound device;
  • the first recognition module may include:
  • the first recognition unit is used for recognizing the ultrasonic instruction in the target voice and responding to the ultrasonic instruction.
  • This application also provides an ultrasound device and a computer-readable storage medium, both of which have the corresponding effects of the voice control method provided in the embodiments of the application.
  • FIG. 5 is a schematic structural diagram of an ultrasound device provided by an embodiment of the application.
  • An ultrasound device provided by an embodiment of the present application is applied to a smart device and includes a memory 201 and a processor 202.
  • A computer program is stored in the memory 201. When the processor 202 executes the computer program stored in the memory 201, the steps of the voice control method described in any of the above embodiments are implemented.
  • Another ultrasound device may further include: an input port 203 connected to the processor 202, used to transmit externally input commands to the processor 202; a display unit 204 connected to the processor 202, used to display the processing result of the processor 202 to the outside; and a communication module 205 connected to the processor 202, used to implement communication between the ultrasound device and the outside.
  • The display unit 204 can be a display panel, a laser scanning display, etc.; the communication modes adopted by the communication module 205 include, but are not limited to, wired connections such as Mobile High-Definition Link (MHL), Universal Serial Bus (USB), and High-Definition Multimedia Interface (HDMI), and wireless connections such as wireless fidelity (WiFi), Bluetooth, Bluetooth Low Energy, and communication technology based on IEEE 802.11s.
  • An embodiment of the present application provides a computer-readable storage medium applied to a smart device, in which a computer program is stored.
  • When the computer program is executed by a processor, the steps of the voice control method described in any of the above embodiments are implemented.
  • Storage media include random access memory (RAM), read-only memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, registers, hard disks, removable disks, CD-ROMs, or any other form of storage medium known in the technical field.
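The storage handling described in the Definitions above (select a free storage space or create one, fill it with one voice unit, move the unit to the preset audio queue, release the space, then recognize one queued unit at a time) can be sketched as follows. This is only an illustration under assumptions: the class name `VoiceUnitStore`, its methods, and the use of byte strings as audio are hypothetical and do not come from the application.

```python
from collections import deque
from typing import Deque, List, Optional


class VoiceUnitStore:
    """Hypothetical pool of storage spaces plus a preset audio queue."""

    def __init__(self, unit_len: int) -> None:
        self.unit_len = unit_len              # capacity of one storage space
        self.free: List[bytearray] = []       # storage spaces that are idle
        self.created = 0                      # how many spaces were ever created
        self.queue: Deque[bytes] = deque()    # preset audio queue

    def acquire(self) -> bytearray:
        """Select a free storage space, creating one only if none is free."""
        if self.free:
            return self.free.pop()
        self.created += 1
        return bytearray()

    def store_unit(self, samples: bytes) -> None:
        """Fill a target storage space, queue the unit, then release the space."""
        space = self.acquire()
        space += samples[: self.unit_len]     # store up to one unit of voice
        self.queue.append(bytes(space))       # move the voice unit to the queue
        space.clear()
        self.free.append(space)               # released space can be reused

    def next_unit(self) -> Optional[bytes]:
        """Take exactly one voice unit out of the queue for recognition."""
        return self.queue.popleft() if self.queue else None


store = VoiceUnitStore(unit_len=4)
store.store_unit(b"abcd")
store.store_unit(b"efgh")
first = store.next_unit()
```

In this sequential sketch a single storage space is reused for both units. With overlapping voice units collected in parallel, several spaces would be in use at once, and the create-if-none-free branch would run more often.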

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Telephone Function (AREA)

Abstract

A voice control method and system, a device and a computer-readable storage medium, applied to a smart device. Said method comprises: when determining to execute a voice interaction function, the smart device continuously acquiring a voice to obtain a target voice (S101); and identifying a target command in the target voice, and responding to the target command (S102). As the voice is continuously acquired, a user can continuously input a voice without continuing to wake up a smart device, and there is no case where the smart device sleeps before receiving the voice audio, so that the efficiency of voice acquisition by the smart device can be improved, and the efficiency of voice processing can further be improved.

Description

Voice control method, system, device and computer-readable storage medium
This application claims priority to Chinese patent application No. 201910611505.4, entitled "Voice control method, system, device and computer-readable storage medium", filed with the Chinese Patent Office on July 8, 2019, the entire contents of which are incorporated herein by reference.
Technical field
This application relates to the field of communication technology, and more specifically, to a voice control method, system, device, and computer-readable storage medium.
Background
With the development of communication technology, more and more smart devices have entered users' lives and attracted their attention. One feature of smart devices is that they can recognize and respond to users' voices. Taking a mobile phone as an example of a smart device: after the user wakes up the phone's voice recognition function with a specific utterance, the phone collects the voice input by the user over a period of time, processes it, enters a sleep state once processing is complete, and waits to be woken by the user again. In other words, a user of the voice interaction function of a smart device such as a mobile phone has to wake the phone repeatedly, and if voice input is not completed within a specific time after waking, the phone still goes to sleep. The user experience is therefore poor, and the smart device processes voice inefficiently. Moreover, the portability of a mobile phone can partly make up for the shortcomings of voice triggering (long-pressing the menu button, etc.), but for relatively large smart devices that are not portable, the operation is time-consuming and laborious.
Summary of the invention
The purpose of this application is to provide a voice control method that can, to a certain extent, solve the problem of how to improve the efficiency with which smart devices process voice. This application also provides a voice control system, a device, and a computer-readable storage medium.
To achieve the above purpose, this application provides the following technical solutions:
A voice control method, applied to a smart device, including:
when it is determined that the voice interaction function is to be performed, continuously collecting voice to obtain a target voice;
recognizing a target command in the target voice, and responding to the target command.
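As a rough illustration of the two steps above, the following sketch keeps consuming collected voice, recognizes a command in each piece, and responds to it without any repeated wake-up. Everything named here is an assumption for illustration: `run_voice_control`, the `recognize` and `respond` callables, the example command strings, and the use of a close command to stop collection are hypothetical, not an implementation taken from the application.

```python
from typing import Callable, Iterable, List, Optional


def run_voice_control(
    voice_source: Iterable[bytes],
    recognize: Callable[[bytes], Optional[str]],
    respond: Callable[[str], None],
    close_command: str = "close voice interaction",
) -> int:
    """Respond to every recognized command and return how many were handled."""
    handled = 0
    for target_voice in voice_source:       # collection continues without re-waking
        command = recognize(target_voice)   # None means no command was recognized
        if command is None:
            continue
        if command == close_command:        # hypothetical instruction to stop collecting
            break
        respond(command)
        handled += 1
    return handled


# Hypothetical preset grammar that maps matched voice to commands.
grammar = {
    b"freeze": "freeze",
    b"save image": "save image",
    b"close voice interaction": "close voice interaction",
}
log: List[str] = []
voice = [b"freeze", b"noise", b"save image", b"close voice interaction", b"freeze"]
count = run_voice_control(voice, grammar.get, log.append)
```

Here the dictionary lookup stands in for speech recognition; a real recognizer would operate on audio rather than on exact byte strings.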
Preferably, the target voice is composed of voice units;
the continuously collecting voice to obtain the target voice includes:
judging whether the current moment belongs to the preset voice collection moments;
if the current moment belongs to the voice collection moments, collecting, starting from the current moment, a voice of a preset duration as the voice unit;
if the current moment does not belong to the voice collection moments, returning to the step of judging whether the current moment belongs to the preset voice collection moments.
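The judging loop above can be sketched over an already recorded sample stream, where the "current moment" is a sample index. This is a simplified sketch under assumptions: the function name `collect_voice_units`, its parameters, and the choice of evenly spaced collection moments are hypothetical, and trailing units shorter than the preset duration are simply dropped.

```python
from typing import List, Sequence


def collect_voice_units(
    samples: Sequence[int],
    sample_rate: int,
    period_s: float,
    unit_s: float,
) -> List[List[int]]:
    """Cut voice units of the preset duration out of a sample stream."""
    step = round(period_s * sample_rate)    # samples between collection moments
    length = round(unit_s * sample_rate)    # samples in one voice unit
    if step <= 0 or length <= 0:
        raise ValueError("period and unit duration must cover at least one sample")
    units: List[List[int]] = []
    for now in range(len(samples)):         # "now" plays the role of the current moment
        if now % step != 0:                 # not a collection moment: check again
            continue
        unit = list(samples[now:now + length])
        if len(unit) == length:             # keep only units of the full preset duration
            units.append(unit)
    return units


units = collect_voice_units(list(range(10)), sample_rate=1, period_s=2, unit_s=4)
```

Because the period is shorter than the unit duration, consecutive units overlap. A live implementation would therefore need to fill several units concurrently, which this offline sketch avoids by slicing a finished recording.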
Preferably, before the determining whether the current moment is a preset voice collection moment, the method further includes:
determining the voice collection moments and the preset duration according to the principle that the duration between adjacent voice collection moments is less than the preset duration, and the preset duration is greater than or equal to the voice duration of the target command.
Preferably, the determining the voice collection moments and the preset duration according to the principle that the duration between adjacent voice collection moments is less than the preset duration, and the preset duration is greater than or equal to the voice duration of the target command, includes:
determining the voice collection moments and the preset duration based on a duration relationship formula, according to the principle that the duration between adjacent voice collection moments is less than the preset duration, and the preset duration is greater than or equal to the voice duration of the target command;
the duration relationship formula includes:
X ≤ (N-1)L/N; L = NP;
where X represents the voice duration of the target command; N represents a positive integer greater than 1; L represents the preset duration; and P represents the duration between adjacent voice collection moments.
Preferably, the collecting, starting from the current moment, voice of a preset duration as a voice unit includes:
selecting a free storage space for storing voice as a target storage space;
storing all voice collected from the current moment onward in the target storage space until the target storage space is full, to obtain the voice unit;
where the duration of voice that the storage space can store is the preset duration.
Preferably, the selecting a free storage space for storing voice as the target storage space includes:
determining whether a free storage space exists;
if no free storage space exists, creating a storage space and using it as the target storage space;
if a free storage space exists, selecting one free storage space as the target storage space.
Preferably, after the storing all voice collected from the current moment onward in the target storage space until the target storage space is full, to obtain the voice unit, the method further includes:
storing the voice unit in the target storage space into a preset audio queue;
releasing the target storage space;
the recognizing the target command in the target voice includes:
obtaining one voice unit from the preset audio queue for command recognition;
and deleting the obtained voice unit from the preset audio queue.
Preferably, the recognizing the target command in the target voice includes:
matching the target voice against preset grammars, and if the matching succeeds, mapping the preset grammar that matches the target voice to the target command.
Preferably, the smart device includes an ultrasound device;
the recognizing the target command in the target voice and responding to the target command includes:
recognizing an ultrasound instruction in the target voice, and responding to the ultrasound instruction.
A voice control system, applied to a smart device, including:
a first collection module, configured to continuously collect voice to obtain a target voice when it is determined that the voice interaction function is to be performed;
a first recognition module, configured to recognize a target command in the target voice and respond to the target command.
An ultrasound device, including:
a memory, configured to store a computer program;
a processor, configured to implement the steps of any of the voice control methods described above when executing the computer program.
A computer-readable storage medium, applied to a smart device, where the computer-readable storage medium stores a computer program, and the computer program, when executed by a processor, implements the steps of any of the voice control methods described above.
The voice control method provided by this application is applied to a smart device: when it is determined that the voice interaction function is to be performed, voice is continuously collected to obtain a target voice; a target command in the target voice is recognized, and the target command is responded to. Because the voice is collected continuously, the user can keep inputting voice without repeatedly waking the smart device, and the smart device does not go to sleep before it has received the complete voice input. This can improve the efficiency with which the smart device collects voice, and in turn the efficiency with which it processes voice. The voice control system, device, and computer-readable storage medium provided by this application solve the corresponding technical problems as well.
Description of the drawings
In order to explain the embodiments of the present invention or the technical solutions in the prior art more clearly, the drawings needed in the description of the embodiments or the prior art are briefly introduced below. Obviously, the drawings described below are merely embodiments of the present invention, and those of ordinary skill in the art can obtain other drawings from the provided drawings without creative work.
FIG. 1 is a first flowchart of a voice control method provided by an embodiment of this application;
FIG. 2 is a second flowchart of a voice control method provided by an embodiment of this application;
FIG. 3 is a schematic diagram of the relationship among the voice duration of the target command, the preset duration, and the duration between adjacent voice collection moments;
FIG. 4 is a schematic structural diagram of a voice control system provided by an embodiment of this application;
FIG. 5 is a schematic structural diagram of a voice control device provided by an embodiment of this application;
FIG. 6 is another schematic structural diagram of a voice control device provided by an embodiment of this application.
Detailed description
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by those of ordinary skill in the art based on the embodiments of the present invention without creative work fall within the protection scope of the present invention.
With the development of communication technology, more and more smart devices have entered users' lives and attracted their attention. One feature of smart devices is that they can recognize and respond to a user's voice. Taking a mobile phone as an example of a smart device, after the user wakes up the phone's voice recognition function with a specific utterance, the phone can collect the voice input by the user over a period of time and process it accordingly; once processing is complete, it enters a sleep state and waits for the next wake-up by the user. In other words, when using the voice interaction function of a smart device such as a mobile phone, the user has to wake the phone repeatedly, and if the user fails to complete the voice input within a specific time after waking the phone, the phone still enters the sleep state. For smart devices that are large and not portable, this mode of voice operation degrades the user experience. The voice control method provided by this application can improve both the convenience of using a smart device and the efficiency of voice processing.
Please refer to FIG. 1, which is a first flowchart of a voice control method provided by an embodiment of this application.
A voice control method provided by an embodiment of this application, applied to a smart device, may include the following steps:
Step S101: When it is determined that the voice interaction function is to be performed, continuously collect voice to obtain a target voice.
In practical applications, once the smart device determines that the voice interaction function is to be performed, it continuously collects voice and obtains the corresponding target voice. The type of smart device can be determined according to actual needs; for example, it may be a mobile phone, a tablet, an ultrasound device, and so on. The way in which the smart device determines that the voice interaction function is to be performed can also be chosen flexibly according to actual needs. For example, the smart device may make this determination after receiving a specific trigger command, when a specific key of the device is triggered, or after a key of the device is triggered in a specific manner.
Step S102: Recognize the target command in the target voice, and respond to the target command.
In practical applications, after the smart device has collected the target voice, it can recognize the target command in the target voice and obtain the target command accordingly. In a specific application scenario, a grammar recognition network can be built in the smart device in advance, and the target voice can be matched against the grammar recognition network to obtain the corresponding target command. Alternatively, the target voice can be matched directly against preset grammars; if the matching succeeds, the preset grammar that matches the target voice is mapped to the target command.
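The direct grammar-matching variant can be sketched as follows. This is only an illustration under stated assumptions: it presumes the voice has already been transcribed to text, and the grammar table `PRESET_GRAMMAR`, the command identifiers, and the function `match_command` are hypothetical names that do not come from this application.

```python
import re
from typing import Optional

# Hypothetical preset grammars: each pattern maps to a command identifier.
PRESET_GRAMMAR = {
    r"\bfreeze( the)? image\b": "FREEZE",
    r"\bsave( the)? image\b": "SAVE_IMAGE",
    r"\bincrease( the)? gain\b": "GAIN_UP",
}


def match_command(transcript: str) -> Optional[str]:
    """Return the command mapped to the first matching grammar, or None."""
    text = transcript.lower()
    for pattern, command in PRESET_GRAMMAR.items():
        if re.search(pattern, text):
            return command  # matching succeeded: map the grammar to a command
    return None  # no grammar matched, so there is nothing to respond to
```

A lookup table keeps the mapping from grammar to command explicit; a real device would more likely match on acoustic or phonetic features than on plain text.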
In a specific application scenario, the smart device may be an ultrasound device. In this case, recognizing the target command in the target voice and responding to the target command may consist of recognizing an ultrasound instruction in the target voice and responding to the ultrasound instruction.
In practical applications, whether the smart device turns off the voice interaction function can be controlled externally, for example through an instruction. After responding to the target command, the smart device can therefore determine whether an instruction to turn off the voice interaction function has been received: if such an instruction has been received, it stops collecting voice; if not, it continues collecting voice. It should be noted that the instruction to turn off the voice interaction function may be an instruction input by the user's voice, or an instruction generated after the user triggers a key on the smart device.
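The collect, recognize, respond, and check-for-shutdown cycle described above can be sketched as a simple loop. The function `run_voice_interaction` and its four callable parameters are hypothetical placeholders for device-specific behavior; the application does not prescribe this interface.

```python
from typing import Callable, Optional


def run_voice_interaction(
    collect_unit: Callable[[], str],
    recognize: Callable[[str], Optional[str]],
    respond: Callable[[str], None],
    shutdown_requested: Callable[[], bool],
) -> int:
    """Keep collecting voice until a shutdown instruction has been received.

    Returns the number of units collected before stopping.
    """
    collected = 0
    while True:
        unit = collect_unit()          # continuously collect voice
        collected += 1
        command = recognize(unit)      # recognize the target command, if any
        if command is not None:
            respond(command)           # respond to the target command
        if shutdown_requested():       # voice or key-generated instruction
            return collected           # stop collecting voice
```

Because the shutdown check is a separate callable, the same loop covers both a spoken shutdown instruction and one generated by a key press.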
The voice control method provided by this application is applied to a smart device: when it is determined that the voice interaction function is to be performed, voice is continuously collected to obtain a target voice; a target command in the target voice is recognized, and the target command is responded to. Because the voice is collected continuously, the user can keep inputting voice without repeatedly waking the smart device, and the smart device does not go to sleep before it has received the complete voice input. This can improve the efficiency with which the smart device collects voice, and in turn the efficiency with which it processes voice. Since repeated wake-ups are not needed, operation is simple and convenient, which makes the method suitable for large smart devices.
Please refer to FIG. 2, which is a second flowchart of a voice control method provided by an embodiment of this application.
In practical applications, the target voice in this application may be composed of multiple voice units, in which case a voice control method provided by an embodiment of this application may include the following steps:
Step S201: When it is determined that the voice interaction function is to be performed, determine whether the current moment is a preset voice collection moment; if so, perform step S202; if not, return to step S201.
Step S202: Collect voice of a preset duration as a voice unit, and perform step S203.
In practical applications, if the smart device collects voice continuously without any interval, its power consumption will be high. To reduce power consumption, the smart device can first determine whether the current moment is a preset voice collection moment and, if so, collect voice of the preset duration as a voice unit of the target voice. Because voice is collected only at voice collection moments, the power consumed in collecting voice can be lower than with uninterrupted collection. In addition, compared with collecting one monolithic target voice without interval, collecting voice of a preset duration at different voice collection moments as voice units amounts to splitting the target voice into multiple voice units, so that command recognition and processing can be performed unit by unit. That is, already collected voice units can be processed while the next voice unit is being collected, which improves command recognition and processing efficiency compared with processing the voice only after the entire target voice has been collected. It should be noted that a voice collection moment in this application is one element of a set of voice collection moments; that is, there is more than one such moment, and their number can be determined by the voice collection duration in the specific application scenario.
In a specific application scenario, when voice of a preset duration is collected as voice units at different voice collection moments, the target command in the target voice may be contained in a single voice unit. In that case, when command recognition is performed on each voice unit, the target command can be responded to directly once it has been recognized. If the target command is spread over multiple voice units, the command recognized from each voice unit is only part of the target command; after the commands in the voice units have been recognized, the recognized parts must be pieced together to recover the target command before it can be responded to.
In a specific application scenario, if the preset duration is less than the duration between two adjacent voice collection moments, the target voice collected by the smart device will be incomplete, so that the smart device may be unable to recognize the instruction in the target voice, which affects the user experience. To avoid this, before determining whether the current moment is a preset voice collection moment, the voice collection moments and the preset duration can be determined according to the principle that the duration between adjacent voice collection moments is less than the preset duration, and the preset duration is greater than or equal to the voice duration of the target command. Under this principle the target command tends to be collected completely within a single voice unit, so the smart device can obtain the complete target command from one voice unit and avoid having to piece together recognized commands, which further improves its voice processing efficiency. Of course, other methods of determining the voice collection moments and the preset duration are also possible, and this application does not specifically limit them.
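To make the timing concrete, the sketch below lists the time window covered by each voice unit when a new unit starts at every collection moment. The function `unit_windows` and its parameter names are hypothetical, and the first collection moment is assumed to be at time zero.

```python
from typing import List, Tuple


def unit_windows(
    session_s: float, preset_s: float, interval_s: float
) -> List[Tuple[float, float]]:
    """Return the (start, end) window of each voice unit begun during the session.

    preset_s is the preset duration of one unit; interval_s is the duration
    between adjacent voice collection moments.
    """
    windows = []
    start = 0.0
    while start < session_s:
        windows.append((start, start + preset_s))
        start += interval_s  # next voice collection moment
    return windows


if __name__ == "__main__":
    # With a 4 s preset duration and a 2 s interval, adjacent units overlap
    # by 2 s, so no stretch of the session is left uncollected.
    for window in unit_windows(6, 4, 2):
        print(window)
```

When the interval is shorter than the preset duration, as the principle above requires, consecutive windows overlap, which is what allows a command that straddles one collection moment to fall entirely inside a single unit.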
In a specific application scenario, the voice collection moments and the preset duration can be determined based on a duration relationship formula, according to the principle that the duration between adjacent voice collection moments is less than the preset duration, and the preset duration is greater than or equal to the voice duration of the target command.
The duration relationship formula includes:
X ≤ (N-1)L/N; L = NP;
where X represents the voice duration of the target command; N represents a positive integer greater than 1; L represents the preset duration; and P represents the duration between adjacent voice collection moments.
The duration relationship formula is derived as follows:
Please refer to FIG. 3, which is a schematic diagram of the relationship among the voice duration of the target command, the preset duration, and the duration between adjacent voice collection moments. To align the data and make it easier to process, assume L = NP, that is, L is an integer multiple of P. A command may begin up to P after the most recent voice collection moment, which leaves at least L - P = (N-1)P of the voice unit started at that moment. A voice unit can therefore contain the entire target command when X ≤ (N-1)P, that is, X ≤ (N-1)L/N. For ease of understanding, assume the voice duration of the target command is 2 seconds and the duration between adjacent voice collection moments is 2 seconds; taking N = 2, the preset duration is 4 seconds. In this case the target command can be collected into a single voice unit no matter when it is spoken.
Choosing the durations according to this formula ensures that the target command can be collected completely within one voice unit.
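The formula can be checked numerically. The sketch below picks the smallest P and L that satisfy X ≤ (N-1)L/N with L = NP, then tests whether a command starting at an arbitrary time lies inside the unit that began at the most recent collection moment. The function names are hypothetical, and collection moments are assumed to fall at integer multiples of P starting from zero.

```python
import math
from typing import Tuple


def choose_durations(command_s: float, n: int) -> Tuple[float, float]:
    """Return the smallest (L, P) such that L = N * P and X <= (N - 1) * P."""
    if n < 2:
        raise ValueError("N must be an integer greater than 1")
    interval_s = command_s / (n - 1)  # P: smallest interval with X <= (N-1)P
    preset_s = n * interval_s         # L = N * P
    return preset_s, interval_s


def fits_in_one_unit(
    start_s: float, command_s: float, preset_s: float, interval_s: float
) -> bool:
    """True if a command spoken at start_s lies entirely inside the unit
    that began at the most recent collection moment."""
    unit_start = math.floor(start_s / interval_s) * interval_s
    return start_s + command_s <= unit_start + preset_s


if __name__ == "__main__":
    # The worked example from the text: X = 2 s and N = 2 give L = 4 s, P = 2 s.
    preset, interval = choose_durations(2, 2)
    print(preset, interval)
    print(fits_in_one_unit(1.9, 2, preset, interval))
```

With L = 3 s and P = 2 s, which violates the formula for a 2 s command, a command starting at 1.9 s would run past the end of the unit that began at 0 s, so `fits_in_one_unit(1.9, 2, 3, 2)` is false.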
Step S203: Recognize the target command in the voice unit, and respond to the target command.
In practical applications, to make it easier for the smart device to process the target voice, different storage carriers can be used to distinguish the voice collected at different voice collection moments. For example, storage spaces can be used to hold voice units, with the duration of voice that a storage space can store set exactly equal to the duration of a voice unit. One storage space can then hold only one voice unit, so the storage spaces serve to distinguish the voice units. Accordingly, when voice of a preset duration is collected from the current moment as a voice unit, a free storage space for storing voice can be selected as the target storage space, and all voice collected from the current moment onward is stored in the target storage space until the target storage space is full, yielding the voice unit; the duration of voice that the storage space can store is the preset duration.
In a specific application scenario, the number of existing storage spaces may be limited, and if they are all occupied, storing a voice unit becomes a problem. To avoid this, when selecting a free storage space for storing voice as the target storage space, the smart device can determine whether a free storage space exists: if no free storage space exists, it creates a storage space and uses it as the target storage space; if a free storage space exists, it selects one free storage space as the target storage space.
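The select-or-create rule for storage spaces can be sketched as a small buffer pool. The class `BufferPool` and its members are hypothetical; in particular, sizing each buffer in bytes is an assumption standing in for "the amount of audio that lasts the preset duration".

```python
from typing import List


class BufferPool:
    """Pool of storage spaces, each sized to hold exactly one voice unit."""

    def __init__(self, capacity_bytes: int) -> None:
        self.capacity_bytes = capacity_bytes  # bytes in one preset duration
        self._free: List[bytearray] = []
        self.created = 0

    def acquire(self) -> bytearray:
        """Select a free storage space, creating one only if none is free."""
        if self._free:
            return self._free.pop()
        self.created += 1
        return bytearray()

    def is_full(self, buf: bytearray) -> bool:
        """A full buffer corresponds to one complete voice unit."""
        return len(buf) >= self.capacity_bytes

    def release(self, buf: bytearray) -> None:
        """Empty the storage space and mark it free for the next unit."""
        buf.clear()
        self._free.append(buf)
```

Creating a buffer only when the free list is empty keeps the number of storage spaces at the peak number of units in flight, rather than growing with the length of the session.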
In a specific application scenario, storage spaces can be used not only to distinguish different voice units but also to process them. To improve the utilization of the storage spaces and to help the smart device process voice units accurately, after the smart device has stored all voice collected from the current moment onward in the target storage space until the target storage space is full and has obtained the voice unit, it can also store the voice unit in the target storage space into a preset audio queue and then release the target storage space. Correspondingly, when recognizing the target command in a voice unit, the smart device can obtain one voice unit from the preset audio queue for recognition and delete the obtained voice unit from the preset audio queue.
That is, after the smart device obtains a voice unit, it stores the voice unit in the preset audio queue and then releases the target storage space so that the target storage space can hold the next voice unit, which reduces the number of storage spaces that have to be created and improves their utilization. In addition, because the smart device takes only one voice unit from the preset audio queue for recognition each time, it avoids recognizing multiple voice units, and therefore multiple commands, at once. This in turn avoids recognition errors caused by too many commands in a single recognition pass and helps ensure the accuracy of voice recognition.
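The hand-off from storage space to audio queue can be sketched with a double-ended queue. The functions `enqueue_unit` and `recognize_next`, and the use of `bytes` objects as voice units, are illustrative assumptions rather than details from this application.

```python
from collections import deque
from typing import Callable, Deque, Optional


def enqueue_unit(audio_queue: Deque[bytes], buffer: bytearray) -> None:
    """Copy a full storage space into the audio queue, then empty it for reuse."""
    audio_queue.append(bytes(buffer))  # store the voice unit in the queue
    buffer.clear()                     # release the target storage space


def recognize_next(
    audio_queue: Deque[bytes], recognize: Callable[[bytes], Optional[str]]
) -> Optional[str]:
    """Take exactly one voice unit off the queue and run recognition on it."""
    if not audio_queue:
        return None
    unit = audio_queue.popleft()  # obtaining the unit also deletes it
    return recognize(unit)
```

Taking one unit per call keeps each recognition pass limited to a single unit, and the first-in, first-out order preserves the order in which the units were collected.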
This application also provides a voice control system, which has the effects corresponding to those of the voice control method provided by the embodiments of this application. Please refer to FIG. 4, which is a schematic structural diagram of a voice control system provided by an embodiment of this application.
A voice control system provided by an embodiment of this application, applied to a smart device, may include:
a first collection module 101, configured to continuously collect voice to obtain a target voice when it is determined that the voice interaction function is to be performed;
a first recognition module 102, configured to recognize a target command in the target voice and respond to the target command.
In a voice control system provided by an embodiment of this application, applied to a smart device, the target voice may be composed of voice units;
the first collection module may include:
a first judgment submodule, configured to determine whether the current moment is a preset voice collection moment; if the current moment is a voice collection moment, to collect, starting from the current moment, voice of a preset duration as a voice unit; and if the current moment is not a voice collection moment, to return to the step of determining whether the current moment is a preset voice collection moment.
A voice control system provided by an embodiment of this application, applied to a smart device, may further include:
a first determining submodule, configured to determine, before the first judgment submodule determines whether the current moment is a preset voice collection moment, the voice collection moments and the preset duration according to the principle that the duration between adjacent voice collection moments is less than the preset duration, and the preset duration is greater than or equal to the voice duration of the target command.
In a voice control system provided by an embodiment of this application, applied to a smart device, the first determining submodule may include:
a first determining unit, configured to determine the voice collection moments and the preset duration based on a duration relationship formula, according to the principle that the duration between adjacent voice collection moments is less than the preset duration, and the preset duration is greater than or equal to the voice duration of the target command;
the duration relationship formula includes:
X ≤ (N-1)L/N; L = NP;
where X represents the voice duration of the target command; N represents a positive integer greater than 1; L represents the preset duration; and P represents the duration between adjacent voice collection moments.
本申请实施例提供的一种语音控制系统,应用于智能设备,第一判断子模块可以包括:The voice control system provided by the embodiment of the present application is applied to a smart device, and the first judgment submodule may include:
第一选取子模块,用于选取一个空闲的用于存储语音的存储空间作为 目标存储空间;The first selection sub-module is used to select a free storage space for storing voice as the target storage space;
第一存储子模块,用于将从当前时刻开始采集的语音均存储在目标存储空间中,直至装满目标存储空间,得到语音单元;The first storage sub-module is used to store the voices collected from the current moment in the target storage space until the target storage space is filled to obtain the voice unit;
其中,存储空间所能存储的语音的时长为预设时长。The duration of the voice that can be stored in the storage space is the preset duration.
本申请实施例提供的一种语音控制系统,应用于智能设备,第一选取子模块可以包括:The voice control system provided by the embodiment of the present application is applied to a smart device, and the first selection submodule may include:
第一判断单元,用于判断是否存在空闲存储空间;若不存在空闲存储空间,则创建一个存储空间并作为目标存储空间;若存在空闲存储空间,则选取一个空闲的存储空间作为目标存储空间。The first judging unit is used to judge whether there is a free storage space; if there is no free storage space, a storage space is created and used as the target storage space; if there is a free storage space, a free storage space is selected as the target storage space.
A voice control system provided by an embodiment of the present application, applied to a smart device, may further include:
A second storage submodule, configured to store the voice unit held in the target storage space into a preset audio queue after the first storage submodule has stored all voice collected from the current moment onward in the target storage space until the target storage space is full and has obtained the voice unit;
A first release submodule, configured to release the target storage space;
The first recognition module may include:
A first acquisition submodule, configured to acquire one voice unit from the preset audio queue for recognition;
A first deletion submodule, configured to delete the acquired voice unit from the preset audio queue.
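The hand-off between collection and recognition through the preset audio queue might look like the following Python sketch. The function names `enqueue_unit` and `take_unit`, the use of `collections.deque` as the queue, and the use of a `bytearray` as the storage space are assumptions made for illustration.

```python
from collections import deque


def enqueue_unit(audio_queue, target_space, free_spaces):
    """Copy the finished voice unit into the queue, then release the space."""
    audio_queue.append(bytes(target_space))   # immutable copy of the unit
    target_space.clear()                      # release the target storage space
    free_spaces.append(target_space)          # make it available for reuse


def take_unit(audio_queue):
    """Fetch one voice unit for recognition and remove it from the queue."""
    if not audio_queue:
        return None
    return audio_queue.popleft()              # acquire and delete in one step


audio_queue = deque()
free_spaces = []
space = bytearray(b"unit-1")

enqueue_unit(audio_queue, space, free_spaces)
first = take_unit(audio_queue)
second = take_unit(audio_queue)
```

Copying the unit before releasing the storage space lets collection continue into the reused space while recognition works on the queued copy. In a multi-threaded design, `queue.Queue` from the standard library would be a natural substitute for `deque`.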
In a voice control system provided by an embodiment of the present application, applied to a smart device, the first recognition module may include:
A first matching unit, configured to match the target voice against preset grammars and, if the match succeeds, map the preset grammar that matches the target voice to the target command.
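Matching against preset grammars and mapping the match to a command can be sketched as a lookup table. The Python example below assumes the target voice has already been transcribed to text, and the grammar entries and command names (`FREEZE`, `SAVE_IMAGE`) are hypothetical; the application does not specify either.

```python
import re

# Hypothetical preset grammars, each mapped to a target command.
GRAMMAR = [
    (re.compile(r"\bfreeze( the)? image\b"), "FREEZE"),
    (re.compile(r"\bsave( the)? image\b"), "SAVE_IMAGE"),
]


def match_command(transcript):
    """Return the command for the first matching grammar, or None."""
    text = transcript.lower()
    for pattern, command in GRAMMAR:
        if pattern.search(text):
            return command
    return None   # no grammar matched, so no command is issued


result = match_command("Please freeze the image now")
```

Returning `None` when nothing matches means speech that does not fit a preset grammar is ignored rather than misread as a command.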
In a voice control system provided by an embodiment of the present application, applied to a smart device, the smart device may include an ultrasound device;
The first recognition module may include:
A first recognition unit, configured to recognize an ultrasound instruction in the target voice and respond to the ultrasound instruction.
The present application also provides an ultrasound device and a computer-readable storage medium, both of which have effects corresponding to those of the voice control method provided by the embodiments of the present application. Refer to FIG. 5, which is a schematic structural diagram of an ultrasound device provided by an embodiment of the present application.
An ultrasound device provided by an embodiment of the present application, applied to a smart device, includes a memory 201 and a processor 202. A computer program is stored in the memory 201, and when the processor 202 executes the computer program stored in the memory 201, the steps of the voice control method described in any of the above embodiments are implemented.
Referring to FIG. 6, another ultrasound device provided by an embodiment of the present application may further include: an input port 203 connected to the processor 202, configured to transmit externally input commands to the processor 202; a display unit 204 connected to the processor 202, configured to display the processing results of the processor 202 to the outside; and a communication module 205 connected to the processor 202, configured to implement communication between the ultrasound device and the outside. The display unit 204 may be a display panel, a laser scanning display, or the like. The communication methods used by the communication module 205 include, but are not limited to, Mobile High-Definition Link (MHL), Universal Serial Bus (USB), High-Definition Multimedia Interface (HDMI), and wireless connections such as Wireless Fidelity (WiFi), Bluetooth, Bluetooth Low Energy, and IEEE802.11s-based communication.
A computer-readable storage medium provided by an embodiment of the present application is applied to a smart device. A computer program is stored in the computer-readable storage medium, and when the computer program is executed by a processor, the steps of the voice control method described in any of the above embodiments are implemented.
The computer-readable storage media involved in the present application include random access memory (RAM), internal memory, read-only memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, registers, hard disks, removable disks, CD-ROMs, or any other form of storage medium known in the art.
For a description of the relevant parts of the voice control system, device, and computer-readable storage medium provided by the embodiments of the present application, refer to the detailed description of the corresponding parts of the voice control method provided by the embodiments of the present application, which is not repeated here. In addition, the parts of the above technical solutions whose implementation principles are consistent with those of corresponding technical solutions in the prior art are not described in detail, to avoid redundancy.
It should also be noted that, in this document, relational terms such as "first" and "second" are used only to distinguish one entity or operation from another, and do not necessarily require or imply any actual relationship or order between those entities or operations. Moreover, the terms "include", "comprise", and any other variants thereof are intended to cover non-exclusive inclusion, so that a process, method, article, or device that includes a series of elements includes not only those elements but also other elements not explicitly listed, or elements inherent to such a process, method, article, or device. Without further limitation, an element defined by the phrase "including a..." does not exclude the presence of additional identical elements in the process, method, article, or device that includes the element.
The above description of the disclosed embodiments enables those skilled in the art to implement or use the present application. Various modifications to these embodiments will be obvious to those skilled in the art, and the general principles defined herein may be implemented in other embodiments without departing from the spirit or scope of the present application. Therefore, the present application is not limited to the embodiments shown herein, but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (12)

  1. A voice control method, applied to a smart device, comprising:
    when it is determined that a voice interaction function is to be performed, continuously collecting voice to obtain a target voice;
    recognizing a target command in the target voice, and responding to the target command.
  2. The method according to claim 1, wherein the target voice is composed of voice units;
    the continuously collecting voice to obtain a target voice comprises:
    determining whether the current moment is a preset voice collection moment;
    if the current moment is a voice collection moment, collecting, starting from the current moment, voice of a preset duration as the voice unit;
    if the current moment is not a voice collection moment, returning to the step of determining whether the current moment is a preset voice collection moment.
  3. The method according to claim 2, wherein before the determining whether the current moment is a preset voice collection moment, the method further comprises:
    determining the voice collection moments and the preset duration according to the principle that the duration between adjacent voice collection moments is less than the preset duration, and the preset duration is greater than or equal to the voice duration of the target command.
  4. The method according to claim 3, wherein the determining the voice collection moments and the preset duration according to the principle that the duration between adjacent voice collection moments is less than the preset duration, and the preset duration is greater than or equal to the voice duration of the target command, comprises:
    determining the voice collection moments and the preset duration according to a duration relationship formula, following the principle that the duration between the adjacent voice collection moments is less than the preset duration, and the preset duration is greater than or equal to the voice duration of the target command;
    the duration relationship formula comprises:
    X ≤ (N-1)L/N; L = NP;
    where X denotes the voice duration of the target command; N denotes a positive integer greater than 1; L denotes the preset duration; and P denotes the duration between the adjacent voice collection moments.
  5. The method according to claim 2, wherein the collecting, starting from the current moment, voice of a preset duration as the voice unit comprises:
    selecting a free storage space for storing voice as a target storage space;
    storing all voice collected from the current moment onward in the target storage space until the target storage space is full, to obtain the voice unit;
    wherein the duration of voice that the storage space can hold is the preset duration.
  6. The method according to claim 5, wherein the selecting a free storage space for storing voice as a target storage space comprises:
    determining whether a free storage space exists;
    if no free storage space exists, creating a storage space and using it as the target storage space;
    if a free storage space exists, selecting one free storage space as the target storage space.
  7. The method according to claim 5, wherein after the storing all voice collected from the current moment onward in the target storage space until the target storage space is full, to obtain the voice unit, the method further comprises:
    storing the voice unit held in the target storage space into a preset audio queue;
    releasing the target storage space;
    the recognizing a target command in the target voice comprises:
    acquiring one voice unit from the preset audio queue for command recognition;
    and deleting the acquired voice unit from the preset audio queue.
  8. The method according to any one of claims 1 to 7, wherein the recognizing a target command in the target voice comprises:
    matching the target voice against preset grammars and, if the match succeeds, mapping the preset grammar that matches the target voice to the target command.
  9. The method according to claim 8, wherein the smart device comprises an ultrasound device;
    the recognizing a target command in the target voice, and responding to the target command, comprises:
    recognizing an ultrasound instruction in the target voice, and responding to the ultrasound instruction.
  10. A voice control system, applied to a smart device, comprising:
    a first collection module, configured to continuously collect voice to obtain a target voice when it is determined that a voice interaction function is to be performed;
    a first recognition module, configured to recognize a target command in the target voice and respond to the target command.
  11. An ultrasound device, comprising:
    a memory, configured to store a computer program;
    a processor, configured to implement the steps of the voice control method according to any one of claims 1 to 9 when executing the computer program.
  12. A computer-readable storage medium, applied to a smart device, wherein a computer program is stored in the computer-readable storage medium, and when the computer program is executed by a processor, the steps of the voice control method according to any one of claims 1 to 9 are implemented.
PCT/CN2020/096267 2019-07-08 2020-06-16 Voice control method and system, device and computer-readable storage medium WO2021004236A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910611505.4A CN110335599B (en) 2019-07-08 2019-07-08 Voice control method, system, equipment and computer readable storage medium
CN201910611505.4 2019-07-08

Publications (1)

Publication Number Publication Date
WO2021004236A1 (en) 2021-01-14

Family

ID=68143362

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/096267 WO2021004236A1 (en) 2019-07-08 2020-06-16 Voice control method and system, device and computer-readable storage medium

Country Status (2)

Country Link
CN (1) CN110335599B (en)
WO (1) WO2021004236A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110335599B (en) * 2019-07-08 2021-12-10 深圳开立生物医疗科技股份有限公司 Voice control method, system, equipment and computer readable storage medium
CN115312051A (en) * 2022-07-07 2022-11-08 青岛海尔科技有限公司 Voice control method and device for equipment, storage medium and electronic device

Citations (6)

Publication number Priority date Publication date Assignee Title
US20150206544A1 (en) * 2014-01-23 2015-07-23 International Business Machines Corporation Adaptive pause detection in speech recognition
CN104917904A (en) * 2014-03-14 2015-09-16 联想(北京)有限公司 Voice information processing method and device and electronic device
CN108847237A (en) * 2018-07-27 2018-11-20 重庆柚瓣家科技有限公司 continuous speech recognition method and system
CN109273005A (en) * 2018-12-11 2019-01-25 胡应章 Sound control output device
CN110335599A (en) * 2019-07-08 2019-10-15 深圳开立生物医疗科技股份有限公司 A kind of sound control method, system, equipment and computer readable storage medium
CN110689877A (en) * 2019-09-17 2020-01-14 华为技术有限公司 Voice end point detection method and device

Family Cites Families (9)

Publication number Priority date Publication date Assignee Title
CN103646646B (en) * 2013-11-27 2018-08-31 联想(北京)有限公司 A kind of sound control method and electronic equipment
CN106128456A (en) * 2016-06-16 2016-11-16 美的集团股份有限公司 The sound control method of intelligent appliance, terminal and system
US10546575B2 (en) * 2016-12-14 2020-01-28 International Business Machines Corporation Using recurrent neural network for partitioning of audio data into segments that each correspond to a speech feature cluster identifier
CN106782554B (en) * 2016-12-19 2020-09-25 百度在线网络技术(北京)有限公司 Voice awakening method and device based on artificial intelligence
CN107919124B (en) * 2017-12-22 2021-07-13 北京小米移动软件有限公司 Equipment awakening method and device
CN108198554A (en) * 2018-01-29 2018-06-22 深圳市共进电子股份有限公司 The control method of domestic robot work system based on interactive voice
CN108647048A (en) * 2018-05-17 2018-10-12 Oppo(重庆)智能科技有限公司 Doze mode regulating methods, device, mobile terminal and storage medium
CN109102806A (en) * 2018-09-29 2018-12-28 百度在线网络技术(北京)有限公司 Method, apparatus, equipment and computer readable storage medium for interactive voice
CN109360570B (en) * 2018-10-19 2022-06-21 歌尔科技有限公司 Voice recognition method of voice device, voice device and readable storage medium

Also Published As

Publication number Publication date
CN110335599B (en) 2021-12-10
CN110335599A (en) 2019-10-15

Similar Documents

Publication Publication Date Title
WO2021004236A1 (en) Voice control method and system, device and computer-readable storage medium
WO2017075965A1 (en) Voice information processing method and device
CN204791954U (en) Voice interaction system of home automation robot
CN106847285B (en) Robot and voice recognition method thereof
US10891945B2 (en) Method and apparatus for judging termination of sound reception and terminal device
CN103442370A (en) ZigBee networking system and networking method
US20170339263A1 (en) Call Processing Method and Device
CN110888844B (en) Data deleting method, system, equipment and computer readable storage medium
CN105100460A (en) Method and system for controlling intelligent terminal by use of sound
CN110347995B (en) File saving method, device, equipment and storage medium
CN105072278A (en) Method and mobile phone for rapidly dialling telephone of appointed contact person in black screen state
CN102821182B (en) Automatic phone directory contact matching method for handheld device
WO2020078206A1 (en) Task scheduling method and device, terminal, and storage medium
CN113657577B (en) Model training method and computing system
WO2019227370A1 (en) Method, apparatus and system for controlling multiple voice assistants, and computer-readable storage medium
CN104683872A (en) Method for managing users by accounts on television equipment by using human face identification technology
CN103377292B (en) Database result set caching method and device
CN105159475A (en) Character input method and device
CN109509468A (en) Method and device for equipment to execute voice broadcast task
CN110808031A (en) Voice recognition method and device and computer equipment
CN106598508A (en) Solid-state hard disc and write-in arbitrating method and system thereof
WO2021134237A1 (en) Video recording method and apparatus, and computer-readable storage medium
WO2021134240A1 (en) Game screen recording method and apparatus, and computer-readable storage medium
CN110502645A (en) Information query method and device
CN103945152A (en) Television set and method for voice control over television set

Legal Events

Date Code Title Description
121 EP: The EPO has been informed by WIPO that EP was designated in this application. Ref document number: 20836400; Country of ref document: EP; Kind code of ref document: A1
122 EP: PCT application non-entry in European phase. Ref document number: 20836400; Country of ref document: EP; Kind code of ref document: A1