WO2020015283A1 - 设备的控制方法及装置、存储介质和电子装置 - Google Patents

设备的控制方法及装置、存储介质和电子装置 Download PDF

Info

Publication number
WO2020015283A1
WO2020015283A1 PCT/CN2018/120355 CN2018120355W WO2020015283A1 WO 2020015283 A1 WO2020015283 A1 WO 2020015283A1 CN 2018120355 W CN2018120355 W CN 2018120355W WO 2020015283 A1 WO2020015283 A1 WO 2020015283A1
Authority
WO
WIPO (PCT)
Prior art keywords
sound information
weight
specified direction
sound
instruction
Prior art date
Application number
PCT/CN2018/120355
Other languages
English (en)
French (fr)
Inventor
邹其琛
毛跃辉
王慧君
王现林
Original Assignee
珠海格力电器股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 珠海格力电器股份有限公司 filed Critical 珠海格力电器股份有限公司
Publication of WO2020015283A1 publication Critical patent/WO2020015283A1/zh

Links

Images

Classifications

    • FMECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
    • F24HEATING; RANGES; VENTILATING
    • F24FAIR-CONDITIONING; AIR-HUMIDIFICATION; VENTILATION; USE OF AIR CURRENTS FOR SCREENING
    • F24F11/00Control or safety arrangements
    • F24F11/62Control or safety arrangements characterised by the type of control or by internal processing, e.g. using fuzzy logic, adaptive control or estimation of values
    • F24F11/63Electronic processing
    • F24F11/64Electronic processing using pre-stored data
    • FMECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
    • F24HEATING; RANGES; VENTILATING
    • F24FAIR-CONDITIONING; AIR-HUMIDIFICATION; VENTILATION; USE OF AIR CURRENTS FOR SCREENING
    • F24F11/00Control or safety arrangements
    • F24F11/70Control systems characterised by their outputs; Constructional details thereof
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/28Data switching networks characterised by path configuration, e.g. LAN [Local Area Networks] or WAN [Wide Area Networks]
    • H04L12/2803Home automation networks
    • H04L12/2816Controlling appliance services of a home automation network by calling their functionalities
    • H04L12/282Controlling appliance services of a home automation network by calling their functionalities based on user interaction within the home
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command

Definitions

  • the present invention relates to the field of computers, and in particular, to a method and device for controlling a device, a storage medium, and an electronic device.
  • Embodiments of the present invention provide a method and device for controlling a device, a storage medium, and an electronic device, so as to at least solve the problem of recognition of a voice instruction in the related art, and the probability of misidentification and misawakening based on only voice is high. The problem.
  • a device control method including: acquiring sound information around a space where the device is located; determining whether a sound source of the obtained sound information falls within a range corresponding to a specified direction; A weight for identifying the sound information as a voice control instruction is determined according to a determination result.
  • a device control device including: an acquisition module configured to acquire sound information around a space where the device is located; and a determination module configured to determine the sound of the acquired sound information. Whether the source falls within a range corresponding to the specified direction; a determining module configured to determine a weight for identifying the sound information as a voice control instruction according to a judgment result.
  • a storage medium stores a computer program, wherein the computer program is configured to execute the steps in the embodiment of the control method of the device when running.
  • an electronic device including a memory and a processor, wherein the computer program is stored in the memory, and the processor is configured to run the computer program to perform control of the device. Steps in a method embodiment.
  • the sound information around the space where the device is located is obtained; it is determined whether the sound source of the obtained sound information falls within the range corresponding to the specified direction, and finally, the sound information used for identifying the sound information as voice control is determined according to the determination result
  • the weight of the instruction that is, the weight used to identify the sound information as a voice control instruction is determined according to whether the source of the sound information is a specified direction, which solves the problem of recognizing only the sound information when recognizing the sound instruction in the related technology. This leads to a higher probability of misidentification and misawakening, which improves the user experience effect.
  • FIG. 1 is a block diagram of a hardware structure of a terminal device in a device control method according to an embodiment of the present invention
  • FIG. 2 is a flowchart of a method for controlling a device according to an embodiment of the present invention
  • FIG. 3 is a schematic diagram of a specified direction of a device according to an embodiment of the present invention.
  • FIG. 4 is a flowchart of a method for identifying and controlling a voice-based air conditioner according to an embodiment of the present invention
  • FIG. 5 is a schematic structural diagram of a device for controlling a device according to an embodiment of the present invention.
  • FIG. 6 is a first schematic diagram of an optional structure of a device control device according to an embodiment of the present invention.
  • FIG. 7 is a second schematic diagram of an optional structure of a device control device according to an embodiment of the present invention.
  • FIG. 8 is a third schematic diagram of an optional structure of a device control device according to an embodiment of the present invention.
  • FIG. 1 is a block diagram of a hardware structure of a terminal device according to a device control method according to an embodiment of the present invention.
  • the terminal device 10 may include one or more (only one shown in FIG.
  • a processor 102 may include, but is not limited to, a processing device such as a microprocessor MCU or a programmable logic device FPGA)
  • a memory 104 configured to store data
  • the above-mentioned terminal device may further include a transmission device 106 and an input-output device 108 configured as a communication function.
  • the structure shown in FIG. 1 is only schematic, and it does not limit the structure of the terminal device.
  • the terminal device 10 may further include more or fewer components than those shown in FIG. 1, or have a configuration different from that shown in FIG. 1.
  • the memory 104 may be used to store a computer program, for example, a software program and module of application software, such as a computer program corresponding to a method for controlling a device in an embodiment of the present invention.
  • the processor 102 executes the computer program stored in the memory 104 to execute the computer program.
  • the memory 104 may include a high-speed random access memory, and may further include a non-volatile memory, such as one or more magnetic storage devices, a flash memory, or other non-volatile solid-state memory.
  • the memory 104 may further include memories remotely set with respect to the processor 102, and these remote memories may be connected to the terminal device 10 through a network. Examples of the above network include, but are not limited to, the Internet, an intranet, a local area network, a mobile communication network, and combinations thereof.
  • the transmission device 106 is configured to receive or transmit data via a network.
  • the above-mentioned specific examples of the network may include a wireless network provided by a communication provider of the terminal device 10.
  • the transmission device 106 includes a network adapter (Network Interface Controller, NIC for short), which can be connected to other network devices through a base station so as to communicate with the Internet.
  • the transmission device 106 may be a radio frequency (RF) module, which is used to communicate with the Internet in a wireless manner.
  • RF radio frequency
  • FIG. 2 is a flowchart of a method for controlling a device according to an embodiment of the present invention. As shown in FIG. 2, the process includes the following steps:
  • Step S202 Acquire sound information around the space where the device is located
  • Step S204 Determine whether the sound source of the acquired sound information falls within a range corresponding to a specified direction
  • Step S206 Determine a weight for identifying the sound information as a voice control instruction according to the determination result.
  • the weight of the voice control instruction that is, the weight used to identify the voice information as a voice control instruction is determined according to whether the source of the voice information is a specified direction, which solves the problem that when the voice instruction is recognized in the related technology, it is based on the voice only. Recognizing the problem that the probability of misidentification and misawakening is relatively high, which improves the user experience effect.
  • the execution subject of the above steps may be a terminal device (such as an air conditioner), but is not limited thereto.
  • the manner in which the specified direction is centered on the device and the direction corresponding to the preset angle range is set to the specified direction may be implemented in the following manners:
  • Step S21 Receive an instruction for setting a specified direction; wherein the instruction carries a preset angle range;
  • Step S22 Set the direction corresponding to the preset angle range to the specified direction with the device as the center according to the instruction;
  • the method of setting the direction corresponding to the preset angle range to the specified direction according to the instruction in step S202 may be as follows: the device is the center according to the instruction, The directions of the multiple preset angle ranges are respectively set as the corresponding multiple specified directions.
  • the device is used as an air conditioner. If the air conditioner is standing against a wall, the divergent direction facing away from the wall is the direction of the available angle range.
  • the user can use the APP (application) sets the angle range of the specified direction in advance, and then sends the set angle range of the specified direction to the air conditioner through instructions.
  • the air conditioner sets the angle range of the specified direction according to the received instruction.
  • Figure 3 is based on this The schematic diagram of the designated direction of the device of the embodiment of the invention is as shown in FIG. 3 to enhance the recognition direction.
  • a method for determining whether the sound source of the acquired sound information comes from a specified direction in step S204 involved in this embodiment may be implemented by the following methods:
  • Step S204-1 determining the position of the sound source of the acquired sound information
  • Step S204-2 Determine whether the position falls within a range corresponding to the specified direction.
  • the manner in which the sound information is determined as the weight of the voice control instruction according to the judgment result involved in step S206 in this embodiment may be implemented as follows:
  • Step S206-1 In a case where the sound of the acquired sound information originates from a specified direction, it is determined that the weight of the sound information as the voice control instruction is the first weight;
  • Step S206-2 when the sound source of the acquired sound information is not from a specified direction, determine that the weight of the sound information is a voice control instruction as a second weight;
  • the first weight is greater than the second weight.
  • the sound emitted by the sound source is identified by the first weight
  • the sound source does not fall into the enhanced recognition direction (Corresponding to the above-mentioned specified direction), for example, position 2
  • the sound emitted by the sound source is identified with a second weight.
  • the specific values of the first weight and the second weight can be set according to actual needs, but the first weight must be greater than the second weight. Only in this way can the sound recognition rate of the sound source falling in the specified direction be guaranteed.
  • the technical solution of the present invention in essence, or a part that contributes to the prior art, can be embodied in the form of a software product, which is stored in a storage medium (such as ROM / RAM, magnetic disk, The optical disc) includes a plurality of instructions for causing a terminal device (which may be a mobile phone, a computer, a server, or a network device, etc.) to execute the methods described in the embodiments of the present invention.
  • a terminal device which may be a mobile phone, a computer, a server, or a network device, etc.
  • This embodiment relates to a method for identifying and controlling a voice-based air conditioner, and the control process includes: drawing an azimuth map of an air conditioner according to a signal received by a camera, an infrared sensor, and a microphone, and completing a direction setting; further, setting an appropriate (app) Commonly used control directions, according to whether the current position comparison matches the control direction preset by the user, determine whether to improve the recognition of the user's voice command, and finally judge based on the voiceprint and the user's position to optimize the false wake-up and error caused by the recognition of the voice command. Identify the problem.
  • FIG. 4 is a flowchart of an identification and control method based on a voice air conditioner according to an embodiment of the present invention. As shown in FIG. 4, the control process may include the following steps:
  • Step S401 The user binds the current smart air-conditioning device through the mobile App account.
  • Step S402 set a common direction
  • the information after setting the common directions, the information will be transmitted to the air-conditioning equipment through wireless fidelity WiFi;
  • Step S403 The air conditioner receives the set direction information and saves it;
  • Step S404 When receiving and saving the configuration information, the air-conditioning device will simultaneously collect the image and other information according to the infrared sensor and the camera, draw an orientation map, and combine the direction settings transmitted by the App to complete the commonly used direction settings. In the common direction set by the user, the weight of all voice control instructions performed by the user in that direction will increase, improving the recognition of all the instructions of the user.
  • Step S405 In the set common control direction, when the air conditioner is controlled by voice instructions, the smart air conditioner will first determine the user's voiceprint and location information to confirm whether the user is in the set direction. If so, perform step S406, otherwise execute Step S407;
  • Step S406 the recognition of the voice control instruction is improved
  • Step S407 It is judged whether the voice instruction is executed according to the normal weight, so as to avoid the problems of mis-wake and mis-recognition caused by common direction recognition.
  • the user sets a commonly used voice control direction on the mobile APP, and then the App synchronizes the data to the voice module, and the voice module saves the configuration of the position.
  • the voice module will compare the user's current position according to the camera, infrared sensor and other devices. If the current user's position is within the set range, the weight of the position command word will be enhanced. Brings better command recognition.
  • a device control device is also provided.
  • the device is used to implement the foregoing embodiments and preferred implementation manners, and the descriptions will not be repeated.
  • the term "module” may implement a combination of software and / or hardware for a predetermined function.
  • the devices described in the following embodiments are preferably implemented in software, implementation in hardware, or a combination of software and hardware is also possible and conceived.
  • FIG. 5 is a schematic structural diagram of a device control device according to an embodiment of the present invention.
  • the device includes: an acquisition module 52 configured to acquire sound information around a space where the device is located; a determination module 54, and The obtaining module 52 is coupled and configured to determine whether the sound source of the acquired sound information falls within a range corresponding to a specified direction; the determining module 56 is coupled to the determining module 54 and is configured to determine a sound for identifying according to a judgment result
  • the information is the weight of the voice control instruction.
  • FIG. 6 is a first schematic diagram of an optional structure of a device control device according to an embodiment of the present invention.
  • the device further includes a setting module 62 coupled to the obtaining module 52 and configured to use the device as The direction corresponding to the center and the preset angle range is set as the specified direction.
  • the setting module 62 includes a receiving unit 622 configured to receive an instruction for setting the specified direction.
  • the instruction carries a preset angle range.
  • a setting unit 624 which is coupled to the receiving unit 622, and is configured to set the direction of the preset angle range to the specified direction with the device as the center according to the instruction.
  • the setting unit 624 includes: a setting sub-unit configured to set the directions of the multiple preset angle ranges to the corresponding multiples according to the instruction, with the device as the center. Specified directions.
  • FIG. 7 is a second schematic diagram of an optional structure of a device control device according to an embodiment of the present invention.
  • the determining module 54 includes a first determining unit 542 configured to determine a position of a sound source of the acquired sound information.
  • a judging unit 544 coupled to the first determining unit 542 and configured to judge whether the position falls within a range corresponding to the specified direction.
  • FIG. 8 is a third schematic diagram of an optional structure of a device control device according to an embodiment of the present invention.
  • the determination module 56 includes a second determination unit 562 configured to obtain the sound information of the acquired sound information from In the case of specifying the direction, it is determined that the weight used to identify the sound information as the voice control instruction is the first weight; the third determining unit 564 is coupled to the second determining unit 562 and is set to be different from the sound source of the acquired sound information. Since the direction is specified, the weight used to identify the voice information as the voice control instruction is determined as the second weight; wherein the first weight is greater than the second weight.
  • the above modules can be implemented by software or hardware. For the latter, they can be implemented in the following ways, but are not limited to the above: the above modules are located in the same processor; or the above modules are arbitrarily combined The forms are located in different processors.
  • An embodiment of the present invention further provides a storage medium.
  • the storage medium stores a computer program, and the computer program is configured to execute the steps in any one of the foregoing method embodiments when running.
  • the foregoing storage medium may be configured to store a computer program for performing the following steps:
  • the storage medium is further configured to store a computer program for performing the following steps:
  • the foregoing storage medium may include, but is not limited to, a U disk, a read-only memory (ROM), a random access memory (Random Access Memory, RAM), A variety of media that can store computer programs, such as mobile hard disks, magnetic disks, or optical disks.
  • ROM read-only memory
  • RAM Random Access Memory
  • An embodiment of the present invention further provides an electronic device including a memory and a processor.
  • the memory stores a computer program
  • the processor is configured to run the computer program to perform the steps in any one of the foregoing method embodiments.
  • the electronic device may further include a transmission device and an input-output device, wherein the transmission device is connected to the processor, and the input-output device is connected to the processor.
  • the foregoing processor may be configured to execute the following steps by a computer program:
  • modules or steps of the present invention can be implemented by a general-purpose computing device, and they can be concentrated on a single computing device or distributed on a network composed of multiple computing devices.
  • they may be implemented with program code executable by a computing device, so that they may be stored in a storage device and executed by the computing device, and in some cases, may be in a different order than here
  • the steps shown or described are performed either by making them into individual integrated circuit modules or by making multiple modules or steps into a single integrated circuit module. As such, the invention is not limited to any particular combination of hardware and software.
  • the sound information around the space where the device is located is obtained; it is determined whether the sound source of the obtained sound information falls within the range corresponding to the specified direction, and finally the sound information used to identify the sound information is determined according to the determination result.
  • the weight of the voice control instruction that is, the weight used to identify the voice information as a voice control instruction is determined according to whether the source of the voice information is a specified direction, which solves the problem of identifying voice instructions in the related technology based on the voice information only.

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Chemical & Material Sciences (AREA)
  • Combustion & Propulsion (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Mechanical Engineering (AREA)
  • Acoustics & Sound (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Automation & Control Theory (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Computational Linguistics (AREA)
  • Fuzzy Systems (AREA)
  • Mathematical Physics (AREA)
  • Selective Calling Equipment (AREA)

Abstract

一种设备的控制方法及装置、存储介质和电子装置,其中该方法步骤包括:获取所述设备所处空间周围的声音信息(S202);判断获取到的声音信息的声源是否落入指定方向所对应的范围内(S204);根据判断结果来确定用于识别所述声音信息为语音控制指令的权重(S206)。该方法解决了相关技术中在进行语音指令的识别时,只根据声音来识别导致误识别和误唤醒的几率比较高的问题,提高了用户的体验效果。

Description

设备的控制方法及装置、存储介质和电子装置 技术领域
本发明涉及计算机领域,具体而言,涉及一种设备的控制方法及装置、存储介质和电子装置。
背景技术
随着语音空调的开发,对于语音的识别率一直是个比较头疼的难题。由于声音方向性、噪音干扰等因素,在进行语音指令的识别时误识别和误唤醒的几率比较高,导致用户体验下降。
针对相关技术中的上述问题,目前尚未存在有效的解决方案。
发明内容
本发明实施例提供了一种设备的控制方法及装置、存储介质和电子装置,以至少解决相关技术中在进行语音指令的识别时,只根据声音来识别导致误识别和误唤醒的几率比较高的问题。
根据本发明的一个实施例,提供一种设备的控制方法,包括:获取所述设备所处空间周围的声音信息;判断获取到的声音信息的声源是否落入指定方向所对应的范围内;根据判断结果来确定用于识别所述声音信息为语音控制指令的权重。
根据本发明的另一个实施例,提供了一种设备的控制装置,包括:获取模块,设置为获取所述设备所处空间周围的声音信息;判断模块,设置为判断获取到的声音信息的声源是否落入指定方向所对应的范围内;确定模块,设置为根据判断结果来确定用于识别所述声音信息为语音控制指令的权重。
根据本发明的又一个实施例,还提供了一种存储介质,所述存储介质中存储有计算机程序,其中,所述计算机程序被设置为运行时执行上述设备的控制方法实施例中的步骤。
根据本发明的又一个实施例,还提供了一种电子装置,包括存储器和处理器,所述存储器中存储有计算机程序,所述处理器被设置为运行所述计算机程序以执行上述设备的控制方法实施例中的步骤。
通过本发明实施例,获取设备所处空间周围的声音信息;判断获取到的声音信息的声源是否落入指定方向所对应的范围内,最后根据判断结果来确定用于识别声音信息为语音控制指令的权重,也就是说,根据声音信息的来源是否为指定方向从而确定用于识别声音信息为语音控制指令的权重,解决了相关技术中在进行语音指令的识别时,只根据声音信息来识别导致误识别和误唤醒的几率比较高的问题,提高了用户的体验效果。
附图说明
此处所说明的附图用来提供对本发明的进一步理解,构成本申请的一部分,本发明的示意性实施例及其说明用于解释本发明,并不构成对本发明的不当限定。在附图中:
图1是本发明实施例的一种设备的控制方法的终端设备的硬件结构框图;
图2是根据本发明实施例的设备的控制方法流程图;
图3是根据本发明实施例的设备的指定方向的示意图;
图4是根据本发明实施例的基于语音空调的识别控制方法流程图;
图5是根据本发明实施例的设备的控制装置的结构示意图;
图6是根据本发明实施例的设备的控制装置的可选结构示意图一;
图7是根据本发明实施例的设备的控制装置的可选结构示意图二;
图8是根据本发明实施例的设备的控制装置的可选结构示意图三。
具体实施方式
下文中将参考附图并结合实施例来详细说明本发明。需要说明的是, 在不冲突的情况下,本申请中的实施例及实施例中的特征可以相互组合。
需要说明的是,本发明的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。
实施例1
本申请实施例一所提供的方法实施例可以在移动终端、计算机终端或者类似的运算装置中执行。以运行在终端设备上为例,图1是本发明实施例的一种设备的控制方法的终端设备的硬件结构框图。如图1所示,终端设备10可以包括一个或多个(图1中仅示出一个)处理器102(处理器102可以包括但不限于微处理器MCU或可编程逻辑器件FPGA等的处理装置)和设置为存储数据的存储器104,可选地,上述终端设备还可以包括设置为通信功能的传输设备106以及输入输出设备108。本领域普通技术人员可以理解,图1所示的结构仅为示意,其并不对上述终端设备的结构造成限定。例如,终端设备10还可包括比图1中所示更多或者更少的组件,或者具有与图1所示不同的配置。
存储器104可用于存储计算机程序,例如,应用软件的软件程序以及模块,如本发明实施例中的设备的控制方法对应的计算机程序,处理器102通过运行存储在存储器104内的计算机程序,从而执行各种功能应用以及数据处理,即实现上述的方法。存储器104可包括高速随机存储器,还可包括非易失性存储器,如一个或者多个磁性存储装置、闪存、或者其他非易失性固态存储器。在一些实例中,存储器104可进一步包括相对于处理器102远程设置的存储器,这些远程存储器可以通过网络连接至终端设备10。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。
传输装置106设置为经由一个网络接收或者发送数据。上述的网络具体实例可包括终端设备10的通信供应商提供的无线网络。在一个实例中,传输装置106包括一个网络适配器(Network Interface Controller,简称为 NIC),其可通过基站与其他网络设备相连从而可与互联网进行通讯。在一个实例中,传输装置106可以为射频(Radio Frequency,简称为RF)模块,其用于通过无线方式与互联网进行通讯。
在本实施例中提供了一种运行于上述终端设备的设备的控制方法,图2是根据本发明实施例的设备的控制方法流程图,如图2所示,该流程包括如下步骤:
步骤S202,获取所述设备所处空间周围的声音信息;
步骤S204,判断获取到的声音信息的声源是否落入指定方向所对应的范围内;
步骤S206,根据判断结果来确定用于识别声音信息为语音控制指令的权重。
通过上述步骤S202至步骤S206,获取设备所处空间周围的声音信息;判断获取到的声音信息的声源是否落入指定方向所对应的范围内,最后根据判断结果来确定用于识别声音信息为语音控制指令的权重,也就是说,根据声音信息的来源是否为指定方向从而确定用于识别声音信息为语音控制指令的权重,解决了相关技术中在进行语音指令的识别时,只根据声音来识别导致误识别和误唤醒的几率比较高的问题,提高了用户的体验效果。
可选地,上述步骤的执行主体可以为终端设备(例如空调)等,但不限于此。
在本实施例的可选实施方式中,通过以下方式来设置该指定方向以设备为中心且预设角度范围所对应的方向设置为指定方向的方式,可以通过如下方式来实现:
步骤S21:接收用于设置指定方向的指令;其中,指令中携带有预设角度范围;
步骤S22:根据指令以设备为中心,将预设角度范围所对应的方向设置为指定方向;
其中,在预设角度范围的数量为多个时,上步骤S202中根据指令以设备为中心,将预设角度范围所对应的方向设置为指定方向的方式可以是:根据指令以设备为中心,将多个预设角度范围的方向分别设置为对应的多个指定方向。
对于上述步骤S22和步骤S21,在具体的应用场景中以设备为空调为例,如果空调是靠墙而立,则以背向墙面的发散方向为可用角度范围方向,用户可以通过移动终端上的APP(应用程序)预先设置指定方向的角度范围,然后将设置好的指定方向的角度范围通过指令发送到空调设备,空调设备根据接收到的指令设置该指定方向的角度范围,图3是根据本发明实施例的设备的指定方向的示意图,如图3中所示的增强识别方向。
在本实施例的另一个可选实施方式中,对于本实施例中涉及到的步骤S204中的判断获取到的声音信息的声源是否来自于指定方向的方式,可以通过如下方式来实现:
步骤S204-1:确定获取到的声音信息的声源的位置;
步骤S204-2:判断位置是否落入与指定方向所对应的范围内。
基于上述步骤S204-1和步骤S204-2,由于发出声音信息的声源位置可以来自设备周围的任意位置,因此需要判断该声源是否落入指定方向所对应的范围,如图3所示,位置1则落入增强识别方向(对应于指定方向)所对应的范围,位置2则没有落入指定方向所对应的范围。
在本实施例的再一个可选实施方式中,对于本实施例中步骤S206中涉及到的根据判断结果来确定声音信息为语音控制指令的权重的方式,可以通过如下方式来实现:
步骤S206-1:在获取到的声音信息的声源自于指定方向的情况下,确定声音信息为语音控制指令的权重为第一权重;
步骤S206-2:在获取到的声音信息的声源不是自于指定方向的情况下, 确定声音信息为语音控制指令的权重为第二权重;
其中,第一权重大于第二权重。
以图3为例,如果声源落入增强识别方向(对应于上述的指定方向),例如位置1,则以第一权重识别该声源发出的声音;如果声源没有落入增强识别方向(对应于上述的指定方向),例如位置2,则以第二权重识别该声源发出的声音。第一权重和第二权重的具体取值可以根据实际需求进行设置,但是第一权重一定是比第二权重大,只有这样才能保证落入指定方向的声源的声音的识别率更高。
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到根据上述实施例的方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质(如ROM/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端设备(可以是手机,计算机,服务器,或者网络设备等)执行本发明各个实施例所述的方法。
下面将以空调为例,对本实施例1进行详细的举例说明;
本实施例涉及的是基于语音空调的识别控制方法,该控制的过程包括:根据摄像头、红外传感器和麦克风接收信号等绘制空调方位图,完成方向的设置;进而通过App(应用程序)设置合适的常用控制方向,根据目前位置比对是否符合用户预设的控制方向,确定是否提升用户语音指令识别度,最后根据声纹和用户位置判断,优化语音指令识别度提升后带来的误唤醒、误识别问题。
图4是根据本发明实施例的基于语音空调的识别控制方法流程图,如图4所示,其控制流程可以包括以下步骤:
步骤S401:用户通过手机App账号绑定当前的智能空调设备。
步骤S402:设置常用方向;
其中,设置完的常用方向后信息会通过无线保真WiFi传输给空调设备;
步骤S403:空调设备会接收设置的方向信息并保存;
步骤S404:在接收并保存配置信息时,空调设备同时会根据红外传感器和摄像头采集图像等信息,绘制方位图,并结合App传送的方向设置,完成常用方向设置。在用户设置的常用方向里,用户在该方向进行的所有语音控制指令的权重都会增加,提升用户所有的指令识别度。
步骤S405:在设置好的常用控制方向里,通过语音指令控制空调时,智能空调设备会先判断用户的声纹和位置信息,确认用户是否在设置方向中,如果是,执行步骤S406,否则执行步骤S407;
步骤S406:才会提升语音控制指令的识别;
步骤S407:将按照正常权重判断该语音指令是否执行,避免设置常用方向识别带来的误唤醒和误识别问题。
基于上述步骤S401至步骤S407,用户在手机APP上设定一个常用的语音控制方向,然后App会将该数据同步给语音模块,语音模块保存该方位的配置。在用户唤醒空调控制时,语音模块会根据摄像头、红外传感器等设备比对用户目前所处的位置,如果目前用户处在的位置是在设定范围,就会增强对该方位命令词的权重,带来更好的命令识别效果。
实施例2
在本实施例中还提供了一种设备的控制装置,该装置用于实现上述实施例及优选实施方式,已经进行过说明的不再赘述。如以下所使用的,术语“模块”可以实现预定功能的软件和/或硬件的组合。尽管以下实施例所描述的装置较佳地以软件来实现,但是硬件,或者软件和硬件的组合的实现也是可能并被构想的。
图5是根据本发明实施例的设备的控制装置的结构示意图,如图5所示,该装置包括:获取模块52,设置为获取所述设备所处空间周围的声音信息;判断模块54,与获取模块52耦合连接,设置为判断获取到的声音 信息的声源是否落入指定方向所对应的范围内;确定模块56,与判断模块54耦合连接,设置为根据判断结果来确定用于识别声音信息为语音控制指令的权重。
图6是根据本发明实施例的设备的控制装置的可选结构示意图一,如图6所示,该装置还包括:设置模块62,与获取模块52耦合连接,设置为将以所述设备为中心且预设角度范围所对应的方向设置为所述指定方向,其中,该设置模块62包括:接收单元622,设置为接收用于设置指定方向的指令;其中,指令中携带有预设角度范围;设置单元624,与接收单元622耦合连接,设置为根据指令以设备为中心,将预设角度范围的方向设置为指定方向。
可选地,在预设角度范围的数量为多个时,该设置单元624包括:设置子单元,设置为根据指令以设备为中心,将多个预设角度范围的方向分别设置为对应的多个指定方向。
图7是根据本发明实施例的设备的控制装置的可选结构示意图二,如图7所示,判断模块54包括:第一确定单元542,设置为确定获取到的声音信息的声源的位置;判断单元544,与第一确定单元542耦合连接,设置为判断位置是否落入与指定方向所对应的范围内。
图8是根据本发明实施例的设备的控制装置的可选结构示意图三,如图8所示,确定模块56包括:第二确定单元562,设置为在获取到的声音信息的声源自于指定方向的情况下,确定用于识别声音信息为语音控制指令的权重为第一权重;第三确定单元564,与第二确定单元562耦合连接,设置为在获取到的声音信息的声源不是自于指定方向的情况下,确定用于识别声音信息为语音控制指令的权重为第二权重;其中,第一权重大于第二权重。
需要说明的是,上述各个模块是可以通过软件或硬件来实现的,对于后者,可以通过以下方式实现,但不限于此:上述模块均位于同一处理器中;或者,上述各个模块以任意组合的形式分别位于不同的处理器中。
本发明的实施例还提供了一种存储介质,该存储介质中存储有计算机程序,其中,该计算机程序被设置为运行时执行上述任一项方法实施例中的步骤。
可选地,在本实施例中,上述存储介质可以被设置为存储用于执行以下步骤的计算机程序:
S1,获取所述设备所处空间周围的声音信息;
S2,判断获取到的声音信息的声源是否落入指定方向所对应的范围内;
S3,根据判断结果来确定用于识别声音信息为语音控制指令的权重。
可选地,存储介质还被设置为存储用于执行以下步骤的计算机程序:
S1,在获取到的声音信息的声源自于指定方向的情况下,确定声音信息为语音控制指令的权重为第一权重;
S2,在获取到的声音信息的声源不是自于指定方向的情况下,确定声音信息为语音控制指令的权重为第二权重;其中,第一权重大于第二权重。
可选地,在本实施例中,上述存储介质可以包括但不限于:U盘、只读存储器(Read-Only Memory,简称为ROM)、随机存取存储器(Random Access Memory,简称为RAM)、移动硬盘、磁碟或者光盘等各种可以存储计算机程序的介质。
本发明的实施例还提供了一种电子装置,包括存储器和处理器,该存储器中存储有计算机程序,该处理器被设置为运行计算机程序以执行上述任一项方法实施例中的步骤。
可选地,上述电子装置还可以包括传输设备以及输入输出设备,其中,该传输设备和上述处理器连接,该输入输出设备和上述处理器连接。
可选地,在本实施例中,上述处理器可以被设置为通过计算机程序执行以下步骤:
S1,获取所述设备所处空间周围的声音信息;
S2,判断获取到的声音信息的声源是否落入指定方向所对应的范围内;
S3,根据判断结果来确定用于识别声音信息为语音控制指令的权重。
可选地,本实施例中的具体示例可以参考上述实施例及可选实施方式中所描述的示例,本实施例在此不再赘述。
显然,本领域的技术人员应该明白,上述的本发明的各模块或各步骤可以用通用的计算装置来实现,它们可以集中在单个的计算装置上,或者分布在多个计算装置所组成的网络上,可选地,它们可以用计算装置可执行的程序代码来实现,从而,可以将它们存储在存储装置中由计算装置来执行,并且在某些情况下,可以以不同于此处的顺序执行所示出或描述的步骤,或者将它们分别制作成各个集成电路模块,或者将它们中的多个模块或步骤制作成单个集成电路模块来实现。这样,本发明不限制于任何特定的硬件和软件结合。
以上所述仅为本发明的优选实施例而已,并不用于限制本发明,对于本领域的技术人员来说,本发明可以有各种更改和变化。凡在本发明的原则之内,所作的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。
工业实用性
在本发明实施例中,通过获取设备所处空间周围的声音信息;判断获取到的声音信息的声源是否落入指定方向所对应的范围内,最后根据判断结果来确定用于识别声音信息为语音控制指令的权重,也就是说,根据声音信息的来源是否为指定方向从而确定用于识别声音信息为语音控制指令的权重,解决了相关技术中在进行语音指令的识别时,只根据声音信息来识别导致误识别和误唤醒的几率比较高的问题,提高了用户的体验效果。

Claims (12)

  1. 一种设备的控制方法,包括:
    获取所述设备所处空间周围的声音信息;
    判断获取到的声音信息的声源是否落入指定方向所对应的范围内;
    根据判断结果来确定用于识别所述声音信息为语音控制指令的权重。
  2. 根据权利要求1所述的方法,其中,所述指定方向通过以下方式设置:
    将以所述设备为中心且预设角度范围所对应的方向设置为所述指定方向。
  3. 根据权利要求2所述的方法,其中,所述将以所述设备为中心且预设角度范围所对应的方向设置为指定方向包括:
    接收用于设置所述指定方向的指令;其中,所述指令中携带有所述预设角度范围;
    根据所述指令以所述设备为中心,将所述预设角度范围所对应的方向设置为指定方向。
  4. 根据权利要求3所述的方法,其中,在所述预设角度范围的数量为多个时,根据所述指令以所述设备为中心,将所述预设角度范围所对应的方向设置为指定方向包括:
    根据所述指令以所述设备为中心,将多个所述预设角度范围所对应的方向分别设置为对应的多个指定方向。
  5. 根据权利要求1所述的方法,其中,所述判断获取到的声音信息的声源是否落入指定方向所对应的范围内包括:
    确定获取到的声音信息的声源的位置;
    判断所述位置是否落入与所述指定方向所对应的范围内。
  6. 根据权利要求1所述的方法,其中,所述根据判断结果来确定用于识别所述声音信息为语音控制指令的权重包括:
    在获取到的声音信息的声源自于所述指定方向的情况下,确定用于识别所述声音信息为语音控制指令的权重为第一权重;
    在获取到的声音信息的声源不是自于所述指定方向的情况下,确定用于识别所述声音信息为语音控制指令的权重为第二权重;
    其中,所述第一权重大于所述第二权重。
  7. 一种设备的控制装置,包括:
    获取模块,设置为获取所述设备所处空间周围的声音信息;
    判断模块,设置为判断获取到的声音信息的声源是否落入指定方向所对应的范围内;
    确定模块,设置为根据判断结果来确定用于识别所述声音信息为语音控制指令的权重。
  8. 根据权利要求7所述的装置,其中,所述装置还包括:
    设置模块,设置为将以所述设备为中心且预设角度范围所对应的方向设置为所述指定方向。
  9. 根据权利要求7所述的装置,其中,所述判断模块包括:
    第一确定单元,设置为确定获取到的声音信息的声源的位置;
    判断单元,设置为判断所述位置是否落入与所述指定方向所对应的范围内。
  10. 根据权利要求7所述的装置,其中,所述确定模块包括:
    第二确定单元,设置为在获取到的声音信息的声源自于所述指定方向的情况下,确定用于识别所述声音信息为语音控制指令的权重为第一权重;
    第三确定单元,设置为在获取到的声音信息的声源不是自于所述 指定方向的情况下,确定用于识别所述声音信息为语音控制指令的权重为第二权重;
    其中,所述第一权重大于所述第二权重。
  11. 一种存储介质,所述存储介质中存储有计算机程序,其中,所述计算机程序被设置为运行时执行所述权利要求1至6任一项中所述的方法。
  12. 一种电子装置,包括存储器和处理器,所述存储器中存储有计算机程序,所述处理器被设置为运行所述计算机程序以执行所述权利要求1至6任一项中所述的方法。
PCT/CN2018/120355 2018-07-20 2018-12-11 设备的控制方法及装置、存储介质和电子装置 WO2020015283A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810804485.8A CN108800473A (zh) 2018-07-20 2018-07-20 设备的控制方法及装置、存储介质和电子装置
CN201810804485.8 2018-07-20

Publications (1)

Publication Number Publication Date
WO2020015283A1 true WO2020015283A1 (zh) 2020-01-23

Family

ID=64077437

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/120355 WO2020015283A1 (zh) 2018-07-20 2018-12-11 设备的控制方法及装置、存储介质和电子装置

Country Status (2)

Country Link
CN (1) CN108800473A (zh)
WO (1) WO2020015283A1 (zh)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109743236A (zh) * 2018-12-06 2019-05-10 珠海格力电器股份有限公司 语音控制方法、装置、设备及计算机可读存储介质
CN110794368B (zh) * 2019-10-28 2021-10-19 星络智能科技有限公司 一种声源定位方法、装置、智能音箱及存储介质
CN114360546A (zh) * 2020-09-30 2022-04-15 华为技术有限公司 电子设备及其唤醒方法
CN112197405B (zh) * 2020-10-30 2021-12-03 佛山市顺德区美的电子科技有限公司 区域规划方法、终端设备及计算机可读存储介质
CN113539258B (zh) * 2021-06-15 2024-07-09 未来穿戴技术有限公司 按摩设备的控制方法、装置、存储介质以及按摩设备

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103529726A (zh) * 2013-09-16 2014-01-22 四川虹微技术有限公司 一种具有语音识别功能的智能开关
CN104965448A (zh) * 2015-07-17 2015-10-07 小米科技有限责任公司 智能设备的控制方法和装置
CN106356061A (zh) * 2016-10-24 2017-01-25 合肥华凌股份有限公司 基于声源定位的语音识别方法和系统、及智能家电设备
US20180048482A1 (en) * 2016-08-11 2018-02-15 Alibaba Group Holding Limited Control system and control processing method and apparatus
CN107863105A (zh) * 2017-11-23 2018-03-30 郑州庭淼软件科技有限公司 一种用于智能家居的语音控制系统

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3085737B2 (ja) * 1991-07-17 2000-09-11 ダイキン工業株式会社 空気調和機
JP4912036B2 (ja) * 2006-05-26 2012-04-04 富士通株式会社 指向性集音装置、指向性集音方法、及びコンピュータプログラム
CN202613678U (zh) * 2012-05-08 2012-12-19 成都众询科技有限公司 一种语音控制空调
CN102967026B (zh) * 2012-12-07 2015-04-01 四川长虹电器股份有限公司 智能空调及其控制方法
CN103994541B (zh) * 2014-04-21 2017-01-04 美的集团股份有限公司 基于语音控制的风向切换方法和系统
CN106023992A (zh) * 2016-07-04 2016-10-12 珠海格力电器股份有限公司 家用电器的语音控制方法及系统
CN106288183B (zh) * 2016-08-15 2021-11-16 珠海格力电器股份有限公司 空调控制方法和装置
CN108088027A (zh) * 2017-11-08 2018-05-29 珠海格力电器股份有限公司 空调辅助设备、空调控制方法及装置

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103529726A (zh) * 2013-09-16 2014-01-22 四川虹微技术有限公司 一种具有语音识别功能的智能开关
CN104965448A (zh) * 2015-07-17 2015-10-07 小米科技有限责任公司 智能设备的控制方法和装置
US20180048482A1 (en) * 2016-08-11 2018-02-15 Alibaba Group Holding Limited Control system and control processing method and apparatus
CN106356061A (zh) * 2016-10-24 2017-01-25 合肥华凌股份有限公司 基于声源定位的语音识别方法和系统、及智能家电设备
CN107863105A (zh) * 2017-11-23 2018-03-30 郑州庭淼软件科技有限公司 一种用于智能家居的语音控制系统

Also Published As

Publication number Publication date
CN108800473A (zh) 2018-11-13

Similar Documents

Publication Publication Date Title
WO2020015283A1 (zh) 设备的控制方法及装置、存储介质和电子装置
CN108667697B (zh) 语音控制冲突解决方法、装置及语音控制系统
EP3274988B1 (en) Controlling electronic device based on direction of speech
US10453457B2 (en) Method for performing voice control on device with microphone array, and device thereof
CN106030699B (zh) 多个设备上的热词检测
US11557291B2 (en) Method for location inference of IoT device, server, and electronic device supporting the same
EP3131316B1 (en) Method of managing geo-fence and electronic device thereof
US20180158460A1 (en) Lamp device for inputting or outputting voice signal and method of driving the same
WO2016101729A1 (zh) 无线网络接入的方法、装置及系统
CN108023934B (zh) 电子装置及其控制方法
JP2019079052A (ja) 音声データ処理方法、装置、設備及びプログラム
US20150194152A1 (en) Far-field speech recognition systems and methods
WO2016101730A1 (zh) 无线网络接入的方法、装置及系统
JP6615227B2 (ja) 音声の発生位置を特定するための方法及び端末デバイス
US20160360332A1 (en) Electronic device and method for controlling input and output by electronic device
US20180122226A1 (en) Method and device for controlling subordinate electronic device or supporting control of subordinate electronic device by learning ir signal
CN109389978B (zh) 一种语音识别方法及装置
EP2905998A1 (en) Electronic device and method of connecting electronic device to network
CN112400346A (zh) 采集其它设备的位置信息的服务器设备和方法
US20180098368A1 (en) Control method for ble communication between host device and peripheral device
US10038984B1 (en) Wireless positioning method and wireless positioning device in indoor environment
CN111653284B (zh) 交互以及识别方法、装置、终端设备及计算机存储介质
EP3342180B1 (en) Method of detecting external devices and electronic device for processing same
WO2019196450A1 (zh) 空调系统的无线组网方法和装置
CN108900959B (zh) 测试语音交互设备的方法、装置、设备和计算机可读介质

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18927082

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18927082

Country of ref document: EP

Kind code of ref document: A1