WO2019061382A1 - Smart speaker-based home appliance voice control method and related products - Google Patents

Smart speaker-based home appliance voice control method and related products

Info

Publication number
WO2019061382A1
WO2019061382A1 PCT/CN2017/104722 CN2017104722W WO2019061382A1 WO 2019061382 A1 WO2019061382 A1 WO 2019061382A1 CN 2017104722 W CN2017104722 W CN 2017104722W WO 2019061382 A1 WO2019061382 A1 WO 2019061382A1
Authority
WO
WIPO (PCT)
Prior art keywords
amplitude
smart
control
waveform signal
voice data
Prior art date
Application number
PCT/CN2017/104722
Other languages
English (en)
French (fr)
Inventor
朱晨露
张黎君
田辉
熊胜峰
Original Assignee
陈银芳
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 陈银芳
Priority to PCT/CN2017/104722
Publication of WO2019061382A1

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/02: Feature extraction for speech recognition; Selection of recognition unit
    • G: PHYSICS
    • G05: CONTROLLING; REGULATING
    • G05B: CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B19/00: Programme-control systems
    • G05B19/02: Programme-control systems electric
    • G05B19/418: Total factory control, i.e. centrally controlling a plurality of machines, e.g. direct or distributed numerical control [DNC], flexible manufacturing systems [FMS], integrated manufacturing systems [IMS] or computer integrated manufacturing [CIM]
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04W: WIRELESS COMMUNICATION NETWORKS
    • H04W4/00: Services specially adapted for wireless communication networks; Facilities therefor

Definitions

  • The invention relates to the technical field of terminal devices, and in particular to a smart speaker-based home appliance voice control method and related products.
  • A smart home (also called home automation) uses the residence as a platform and applies integrated wiring, network communication, security, automatic control, and audio/video technologies to integrate household facilities. It builds an efficient management system for residential facilities and family schedules, improves home safety, convenience, comfort, and aesthetics, and provides an environmentally friendly, energy-saving living environment.
  • The smart speaker is one part of the smart home.
  • Existing smart speakers are generally receiving devices only, that is, other home devices cannot be controlled through the smart speaker, which results in a poor user experience.
  • Embodiments of the invention provide a smart speaker-based home appliance voice control method and related products that can improve the user experience.
  • In a first aspect, an embodiment of the present invention provides a smart speaker-based home appliance voice control method, where the method includes:
  • the smart speaker receives voice data;
  • the smart speaker performs recognition processing on the voice data to obtain the control object and the control instruction corresponding to the voice data;
  • the smart speaker extracts the MAC address of the control object according to the control object, and sends the MAC address and the control instruction to the smart home control center to control the control object.
  • Optionally, after the smart speaker receives the voice data, the method further includes:
  • the smart speaker extracts the waveform signal of the voice data, performs peak clipping on the peaks of the waveform signal whose amplitude is greater than the maximum amplitude, performs compensation on the troughs of the waveform signal whose amplitude is smaller than the minimum amplitude to obtain processed data lying between the maximum amplitude and the minimum amplitude, and sends the processed data to a speech recognition algorithm for semantic recognition.
  • Optionally, the peak clipping is implemented as follows:
  • the peak signal of the waveform signal that is greater than the maximum amplitude is removed and replaced with a straight line.
  • Optionally, the peak clipping is implemented as follows:
  • the waveform signal is divided into n regions in proportion to its maximum amplitude and processed using $Y = k_n \cdot x$, where, over the n regions, $1 > k_1 > k_2 > k_3 > \dots > k_n$, and the region corresponding to $k_n$ is the region of the waveform signal that contains the maximum amplitude;
  • where x is the original speech signal and Y is the signal after peak clipping.
  • In a second aspect, a smart speaker is provided, comprising:
  • a receiving unit configured to receive voice data;
  • a processing unit configured to perform recognition processing on the voice data to obtain the control object and the control instruction corresponding to the voice data, and to extract the MAC address of the control object according to the control object;
  • a sending unit configured to send the MAC address and the control instruction to the smart home control center to control the control object.
  • Optionally, the processing unit is further configured to extract the waveform signal of the voice data, perform peak clipping on the peaks of the waveform signal whose amplitude is greater than the maximum amplitude, perform compensation on the troughs of the waveform signal whose amplitude is smaller than the minimum amplitude to obtain processed data lying between the maximum amplitude and the minimum amplitude, and send the processed data to a speech recognition algorithm for semantic recognition.
  • Optionally, the processing unit is specifically configured to remove the peak signal of the waveform signal that is greater than the maximum amplitude and replace the peak signal with a straight line.
  • Optionally, the processing unit is specifically configured to divide the waveform signal into n regions in proportion to its maximum amplitude and process it using $Y = k_n \cdot x$, where, over the n regions, $1 > k_1 > k_2 > k_3 > \dots > k_n$, and the region corresponding to $k_n$ is the region of the waveform signal that contains the maximum amplitude;
  • where x is the original speech signal and Y is the signal after peak clipping.
  • In a third aspect, a computer-readable storage medium is provided that stores a computer program for electronic data exchange, where the computer program causes a computer to perform the method provided by the first aspect.
  • In a fourth aspect, a computer program product is provided, comprising a non-transitory computer-readable storage medium storing a computer program, the computer program being operable to cause a computer to perform the method provided by the first aspect.
  • It can be seen that the technical solution of the embodiments of the present invention receives voice data through the smart speaker and analyzes the voice data to obtain the corresponding control object and control instruction, which makes it convenient for the user to control and use the smart home.
  • FIG. 1A is a schematic flow chart of a home appliance voice control method based on a smart speaker.
  • FIG. 2a is a schematic diagram of a smart home architecture.
  • FIG. 2b is a schematic flow chart of data transmission in a smart home.
  • FIG. 2c is a schematic diagram of another smart home architecture.
  • FIG. 2d is a schematic diagram of peak clipping.
  • FIG. 3 is a schematic structural diagram of a smart speaker according to an embodiment of the present invention.
  • FIG. 4 is a schematic structural diagram of an intelligent terminal according to an embodiment of the present invention.
  • Reference herein to an "embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment can be included in at least one embodiment of the invention.
  • The appearances of this phrase in various places in the specification do not necessarily all refer to the same embodiment, nor to separate or alternative embodiments that are mutually exclusive of other embodiments. Those skilled in the art understand, explicitly and implicitly, that the embodiments described herein can be combined with other embodiments.
  • Referring to FIG. 1, FIG. 1 provides a smart speaker-based home appliance voice control method, which is performed by a smart speaker.
  • As shown in FIG. 1, the method includes the following steps:
  • Step S101: the smart speaker receives the voice data sent by the user.
  • The voice data sent by the user in step S101 may be received in various ways.
  • For example, the voice data may be received through a microphone.
  • The microphone may be a microphone built into the smart speaker. In practical applications, it may also be a microphone device connected to the smart speaker, such as the microphone of a karaoke device.
  • Step S102: the smart speaker performs recognition processing on the voice data to obtain the control object and the control instruction corresponding to the voice data.
  • The recognition in step S102 may use an existing speech recognition algorithm, such as a natural speech recognition algorithm, or a custom algorithm.
  • The present invention does not limit the recognition algorithm used for the voice data. For a specific custom algorithm, refer to the description below.
  • Step S103: the smart speaker extracts the MAC address of the control object according to the control object, and sends the MAC address and the control instruction to the smart home control center to control the control object.
  • It should be noted that the control object may be of many kinds. For example, it may specifically include a smart light, a smart TV, a smart cleaning device, a smart sleep device, a smart monitoring device, and so on.
  • Each kind may take various forms.
  • For example, a smart light includes, but is not limited to, a smart table lamp, a smart ceiling lamp, or a smart wall lamp.
  • A smart TV may be a Samsung smart TV or a Sharp smart TV.
  • A smart cleaning device may be a smart sweeping robot, a smart vacuum cleaner, or a smart garbage disposal unit.
  • A smart sleep device may be a smart mattress or a smart sofa.
  • A smart monitoring device may be a smart sphygmomanometer, a smart thermometer, or the like.
  • The present invention does not limit the specific form, number, or type of the above-mentioned smart speaker.
  • The technical solution provided by the invention receives voice data through the smart speaker and analyzes the voice data to obtain the corresponding control object and control instruction, which makes it convenient for the user to control and use the smart home.
  • According to one aspect of the present invention, a time-segmented encryption method for data received by a smart home access point (AP) is provided.
  • The method is applied in the home network shown in FIG. 2a or FIG. 2c.
  • As shown in FIG. 2a, the home network includes a smart terminal 10, a smart home access point AP 20, and a gateway 30.
  • The smart terminal may take different forms in different situations.
  • For example, the smart speaker may specifically be a smart terminal, a tablet computer, a computer, or a similar device. It may also be another device with networking capability, such as a smart TV, a smart air conditioner, a smart kettle, or some other smart home terminal device.
  • The smart speaker 10 is connected to the AP 20 wirelessly, and the AP 20 accesses the Internet through the gateway 30 using another mode (that is, a connection mode different from the wireless mode). The wireless mode includes, but is not limited to, Bluetooth and WIFI.
  • The other mode may be LTE or a wired connection.
  • The gateway may specifically be a mobile base station, a mobile relay station, a switch, or a similar device.
  • FIG. 2a uses a wired connection as the example; for ease of representation, it is shown as a single solid line.
  • Depending on the size of the smart home, the gateway 30 may be a single personal computer (PC); in practical applications it may also be multiple PCs, a server, or a server group.
  • The specific embodiments of the present invention do not limit the specific form of the gateway 30.
  • Referring to FIG. 2b, FIG. 2b is a flow chart of data transmission by the smart home AP. As shown in FIG. 2b, the flow includes:
  • Step S201: the smart speaker 10 sends the data packet to be sent to the AP 20 wirelessly;
  • Step S202: the AP 20 forwards the data packet to the gateway 30;
  • Step S203: the gateway 30 transmits the data packet to the control object.
  • Optionally, the following may further be included between step S101 and step S102:
  • the smart speaker extracts the waveform signal of the voice data, performs peak clipping on the peaks of the waveform signal whose amplitude is greater than the maximum amplitude, performs compensation on the troughs of the waveform signal whose amplitude is smaller than the minimum amplitude to obtain processed data lying between the maximum amplitude and the minimum amplitude, and sends the processed data to the speech recognition algorithm for semantic recognition.
  • This technical solution processes the amplitude of the voice data. The amplitude may be of several kinds, for example the frequency of the voice data or the volume of the voice data. The purpose of the processing is to prevent a waveform signal that is too large or too small from causing the speech recognition algorithm to misrecognize the voice data.
  • For a speech recognition algorithm, the better the quality of the input voice data, the higher the recognition accuracy. Applying compensation or peak clipping to the original voice data therefore yields processed data within the set range, and recognizing that data improves accuracy.
  • Optionally, the peak clipping may be performed in several ways. Specifically:
  • the peak clipping may remove the peak signal of the waveform signal that is greater than the maximum amplitude and replace it with a straight line, as illustrated in FIG. 2d.
  • the peak clipping may alternatively divide the waveform signal into n regions in proportion to the amplitude and process it using $Y = k_n \cdot x$, where, over the n regions, $1 > k_1 > k_2 > k_3 > \dots > k_n$, and the region corresponding to $k_n$ is the region of the waveform signal that contains the maximum amplitude; x is the original speech signal and Y is the signal after peak clipping.
  • This approach processes the peak region by region, which makes the voice data smoother and improves its quality.
  • Referring to FIG. 3, FIG. 3 provides a smart speaker, which includes:
  • a receiving unit 301 configured to receive voice data;
  • a processing unit 302 configured to perform recognition processing on the voice data to obtain the control object and the control instruction corresponding to the voice data, and to extract the MAC address of the control object according to the control object;
  • a sending unit 303 configured to send the MAC address and the control instruction to the smart home control center to control the control object.
  • Optionally, the processing unit is further configured to extract the waveform signal of the voice data, perform peak clipping on the peaks of the waveform signal whose amplitude is greater than the maximum amplitude, perform compensation on the troughs of the waveform signal whose amplitude is smaller than the minimum amplitude to obtain processed data lying between the maximum amplitude and the minimum amplitude, and send the processed data to a speech recognition algorithm for semantic recognition.
  • Optionally, the processing unit is specifically configured to remove the peak signal of the waveform signal that is greater than the maximum amplitude and replace the peak signal with a straight line.
  • Optionally, the processing unit is specifically configured to divide the waveform signal into n regions in proportion to its maximum amplitude and process it using $Y = k_n \cdot x$, where, over the n regions, $1 > k_1 > k_2 > k_3 > \dots > k_n$, and the region corresponding to $k_n$ is the region of the waveform signal that contains the maximum amplitude;
  • where x is the original speech signal and Y is the signal after peak clipping.
  • FIG. 4 is a block diagram of part of the structure of a smart terminal related to the mobile terminal provided by an embodiment of the present invention.
  • Referring to FIG. 4, the smart terminal includes a radio frequency (RF) circuit 910, a memory 920, an input unit 930, a sensor 950, an audio circuit 960, a wireless fidelity (WiFi) module 970, an application processor AP 980, a communication module 991, a power supply 990, and other components.
  • The communication module 991 may specifically be an LTE communication module.
  • The communication module may also be another communication module that supports the CSFB function.
  • the input unit 930 can be configured to receive input numeric or character information and to generate key signal inputs related to user settings and function control of the smart terminal.
  • the input unit 930 can include a touch display screen 933, a fingerprint recognition device 931, and other input devices 932.
  • the fingerprint recognition device 931 is coupled to the touch display screen 933.
  • the input unit 930 can also include other input devices 932.
  • other input devices 932 may include, but are not limited to, one or more of physical buttons, function keys (such as volume control buttons, switch buttons, etc.), trackballs, mice, joysticks, and the like.
  • The touch display screen 933 is configured to collect a touch parameter set when it detects that the user performs a sliding operation on the touch display screen 933, to notify the fingerprint recognition device 931 to perform fingerprint collection, and to send the touch parameter set to the AP 980. The fingerprint recognition device 931 is configured to collect a fingerprint image and send the fingerprint image to the AP 980. The AP 980 is configured to verify the touch parameter set and the fingerprint image separately.
  • The AP 980 is the control center of the smart terminal. It connects the various parts of the entire smart terminal using various interfaces and lines, and performs the various functions of the smart terminal and processes data by running or executing the software programs and/or modules stored in the memory 920 and calling the data stored in the memory 920, thereby monitoring the smart terminal as a whole.
  • Optionally, the AP 980 may include one or more processing units.
  • Optionally, the AP 980 may integrate an application processor and a modem processor, where the application processor mainly handles the operating system, user interface, applications, and so on, and the modem processor mainly handles wireless communication. It can be understood that the modem processor may also not be integrated into the AP 980.
  • memory 920 can include high speed random access memory, and can also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid state storage device.
  • the RF circuit 910 can be used for receiving and transmitting information.
  • Generally, the RF circuit 910 includes, but is not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a low noise amplifier (LNA), a duplexer, and the like.
  • In addition, the RF circuit 910 can also communicate with the network and other devices via wireless communication.
  • The wireless communication may use any communication standard or protocol, including but not limited to Global System of Mobile communication (GSM), General Packet Radio Service (GPRS), Code Division Multiple Access (CDMA), Wideband Code Division Multiple Access (WCDMA), Long Term Evolution (LTE), email, Short Messaging Service (SMS), and the like.
  • the smart terminal may also include at least one type of sensor 950, such as a light sensor, motion sensor, and other sensors.
  • Specifically, the light sensor may include an ambient light sensor and a proximity sensor. The ambient light sensor can adjust the brightness of the touch display screen according to the brightness of the ambient light, and the proximity sensor can turn off the touch display screen and/or the backlight when the smart terminal is moved to the ear.
  • As one kind of motion sensor, the accelerometer can detect the magnitude of acceleration in each direction (usually three axes) and, when stationary, the magnitude and direction of gravity.
  • It can be used in applications that identify the posture of the smart terminal (such as switching between landscape and portrait, related games, and magnetometer attitude calibration) and in vibration-recognition functions (such as a pedometer or tap detection).
  • Other sensors that the smart terminal may also be equipped with, such as a gyroscope, barometer, hygrometer, thermometer, and infrared sensor, are not described here.
  • The audio circuit 960, a speaker 961, and a microphone 962 can provide an audio interface between the user and the smart terminal.
  • On the one hand, the audio circuit 960 can convert received audio data into an electrical signal and transmit it to the speaker 961, which converts it into a sound signal for playback. On the other hand, the microphone 962 converts the collected sound signal into an electrical signal, which the audio circuit 960 receives and converts into audio data. The audio data is then output to the AP 980 for processing and sent via the RF circuit 910 to, for example, another smart terminal, or output to the memory 920 for further processing.
  • WiFi is a short-range wireless transmission technology.
  • the smart terminal can help users to send and receive emails, browse web pages and access streaming media through the WiFi module 970, which provides users with wireless broadband Internet access.
  • Although FIG. 4 shows the WiFi module 970, it can be understood that the module is not an essential part of the smart terminal and may be omitted as needed without changing the essence of the invention.
  • The smart terminal also includes a power supply 990 (such as a battery) that supplies power to the various components.
  • Optionally, the power supply can be logically connected to the AP 980 through a power management system, so that charging, discharging, and power consumption management are handled through the power management system.
  • Although not shown, the smart terminal may further include a camera, a Bluetooth module, a fill light device, a light sensor, and the like, which are not described here.
  • In the embodiment shown in FIG. 1, the method flow of each step may be implemented based on the structure of this smart terminal.
  • It can be seen that, through the embodiments of the present invention, the mobile terminal assigns different priorities according to the recognition order of different biometrics. Within a set time, if the second application started by the user is of a different type from the first application, the user must perform the multi-biometric operation again, which avoids the security problem of directly granting the highest priority to applications of different types.
  • An embodiment of the present invention further provides a computer storage medium that stores a computer program for electronic data exchange, the computer program causing a computer to perform some or all of the steps of any smart speaker-based home appliance voice control method described in the above method embodiments.
  • An embodiment of the present invention further provides a computer program product comprising a non-transitory computer-readable storage medium storing a computer program, the computer program being operable to cause a computer to perform some or all of the steps of any smart speaker-based home appliance voice control method described in the above method embodiments.
  • In the several embodiments provided in this application, it should be understood that the disclosed apparatus may be implemented in other ways.
  • For example, the device embodiments described above are merely illustrative.
  • For example, the division into units is only a division by logical function.
  • In actual implementation there may be other ways of dividing; for example, multiple units or components may be combined or integrated into another system, or some features may be ignored or not executed.
  • In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be an indirect coupling or communication connection through some interfaces, devices, or units, and may be electrical or take other forms.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the above integrated unit can be implemented in the form of hardware or in the form of a software program module.
  • The integrated unit, if implemented in the form of a software program module and sold or used as a standalone product, may be stored in a computer-readable memory. Based on this understanding, the technical solution of the present invention in essence, or the part that contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product. The software product is stored in a memory and includes a number of instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to perform all or part of the steps of the methods described in the various embodiments of the present invention.
  • The aforementioned memory includes a USB flash drive, read-only memory (ROM), random access memory (RAM), a removable hard disk, a magnetic disk, an optical disk, and other media that can store program code.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Telephonic Communication Services (AREA)

Abstract

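The present invention provides a smart speaker-based home appliance voice control method and related products. The method includes: a smart speaker receives voice data; the smart speaker performs recognition processing on the voice data to obtain the control object and the control instruction corresponding to the voice data; the smart speaker extracts the MAC address of the control object according to the control object, and sends the MAC address and the control instruction to a smart home control center to control the control object. The technical solution provided by the present invention has the advantage of a good user experience.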

Description

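Smart speaker-based home appliance voice control method and related products
Technical Field
The present invention relates to the technical field of terminal devices, and in particular to a smart speaker-based home appliance voice control method and related products.
Background
A smart home (also called home automation) uses the residence as a platform and applies integrated wiring, network communication, security, automatic control, and audio/video technologies to integrate household facilities. It builds an efficient management system for residential facilities and family schedules, improves home safety, convenience, comfort, and aesthetics, and provides an environmentally friendly, energy-saving living environment.
The smart speaker is one part of the smart home. Existing smart speakers are generally receiving devices only, that is, other home devices cannot be controlled through the smart speaker, which results in a poor user experience.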
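Technical Problem
Embodiments of the present invention provide a smart speaker-based home appliance voice control method and related products that can improve the user experience.
Technical Solution
In a first aspect, an embodiment of the present invention provides a smart speaker-based home appliance voice control method, the method including:
the smart speaker receives voice data;
the smart speaker performs recognition processing on the voice data to obtain the control object and the control instruction corresponding to the voice data;
the smart speaker extracts the MAC address of the control object according to the control object, and sends the MAC address and the control instruction to the smart home control center to control the control object.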
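Optionally, after the smart speaker receives the voice data, the method further includes:
the smart speaker extracts the waveform signal of the voice data, performs peak clipping on the peaks of the waveform signal whose amplitude is greater than the maximum amplitude, performs compensation on the troughs of the waveform signal whose amplitude is smaller than the minimum amplitude to obtain processed data lying between the maximum amplitude and the minimum amplitude, and sends the processed data to a speech recognition algorithm for semantic recognition.
Optionally, the peak clipping is implemented as follows:
the peak signal of the waveform signal that is greater than the maximum amplitude is removed and replaced with a straight line.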
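The straight-line replacement described above can be read as flat-top clipping. The following is a minimal sketch under that reading, not the applicant's implementation: the function name `clip_peaks` is hypothetical, and treating negative excursions symmetrically is an assumption, since the text only mentions peaks above the maximum amplitude.

```python
def clip_peaks(samples, max_amplitude):
    """Flat-top clipping: any sample beyond +/- max_amplitude is replaced by a
    constant (a straight line at the limit); other samples pass through."""
    if max_amplitude <= 0:
        raise ValueError("max_amplitude must be positive")
    clipped = []
    for x in samples:
        if x > max_amplitude:
            clipped.append(max_amplitude)       # peak above the limit
        elif x < -max_amplitude:
            clipped.append(-max_amplitude)      # assumed symmetric handling
        else:
            clipped.append(x)                   # sample already within range
    return clipped
```

A run of consecutive over-range samples becomes a horizontal segment, which matches the "replaced with a straight line" wording.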
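Optionally, the peak clipping is implemented as follows:
the waveform signal is divided into n regions in proportion to its maximum amplitude and processed using $Y = k_n \cdot x$, where, over the n regions, $1 > k_1 > k_2 > k_3 > \dots > k_n$, and the region corresponding to $k_n$ is the region of the waveform signal that contains the maximum amplitude;
where x is the original speech signal and Y is the signal after peak clipping.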
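In a second aspect, a smart speaker is provided, the smart speaker including:
a receiving unit configured to receive voice data;
a processing unit configured to perform recognition processing on the voice data to obtain the control object and the control instruction corresponding to the voice data, and to extract the MAC address of the control object according to the control object;
a sending unit configured to send the MAC address and the control instruction to the smart home control center to control the control object.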
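The division into receiving, processing, and sending units can be sketched as three small cooperating objects. This is only an illustration of the structure: the class and field names are hypothetical, the recognizer and the transport are injected callables because the text does not specify either, and the MAC table is an assumed lookup from control object name to address.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Tuple


@dataclass
class ProcessingUnit:
    # Hypothetical recognizer: voice data -> (control object, control instruction).
    recognize: Callable[[bytes], Tuple[str, str]]
    # Assumed lookup table: control object name -> MAC address.
    mac_table: Dict[str, str]

    def process(self, voice_data: bytes) -> Tuple[str, str]:
        control_object, instruction = self.recognize(voice_data)
        return self.mac_table[control_object], instruction


@dataclass
class SendingUnit:
    # Hypothetical transport towards the smart home control center.
    transmit: Callable[[str, str], None]

    def send(self, mac: str, instruction: str) -> None:
        self.transmit(mac, instruction)


@dataclass
class SmartSpeaker:
    processing_unit: ProcessingUnit
    sending_unit: SendingUnit

    def receive(self, voice_data: bytes) -> None:
        """Receiving unit entry point: hand the data to processing, then sending."""
        mac, instruction = self.processing_unit.process(voice_data)
        self.sending_unit.send(mac, instruction)
```

Injecting the recognizer and the transport keeps the sketch independent of any particular speech engine or network stack.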
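Optionally, the processing unit is further configured to extract the waveform signal of the voice data, perform peak clipping on the peaks of the waveform signal whose amplitude is greater than the maximum amplitude, perform compensation on the troughs of the waveform signal whose amplitude is smaller than the minimum amplitude to obtain processed data lying between the maximum amplitude and the minimum amplitude, and send the processed data to a speech recognition algorithm for semantic recognition.
Optionally, the processing unit is specifically configured to remove the peak signal of the waveform signal that is greater than the maximum amplitude and replace the peak signal with a straight line.
Optionally, the processing unit is specifically configured to divide the waveform signal into n regions in proportion to its maximum amplitude and process it using $Y = k_n \cdot x$, where, over the n regions, $1 > k_1 > k_2 > k_3 > \dots > k_n$, and the region corresponding to $k_n$ is the region of the waveform signal that contains the maximum amplitude;
where x is the original speech signal and Y is the signal after peak clipping.
In a third aspect, a computer-readable storage medium is provided that stores a computer program for electronic data exchange, where the computer program causes a computer to perform the method provided by the first aspect.
In a fourth aspect, a computer program product is provided, the computer program product including a non-transitory computer-readable storage medium storing a computer program, the computer program being operable to cause a computer to perform the method provided by the first aspect.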
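Beneficial Effects
Implementing the embodiments of the present invention has the following beneficial effects:
It can be seen that the technical solution of the embodiments of the present invention receives voice data through the smart speaker and analyzes the voice data to obtain the corresponding control object and control instruction, which makes it convenient for the user to control and use the smart home.
Brief Description of the Drawings
To describe the technical solutions in the embodiments of the present invention more clearly, the drawings needed for describing the embodiments are briefly introduced below. Obviously, the drawings described below show some embodiments of the present invention, and a person of ordinary skill in the art can derive other drawings from them without creative effort.
FIG. 1A is a schematic flow chart of a smart speaker-based home appliance voice control method.
FIG. 2a is a schematic diagram of a smart home architecture.
FIG. 2b is a schematic flow chart of data transmission in a smart home.
FIG. 2c is a schematic diagram of another smart home architecture.
FIG. 2d is a schematic diagram of peak clipping.
FIG. 3 is a schematic structural diagram of a smart speaker according to an embodiment of the present invention.
FIG. 4 is a schematic structural diagram of a smart terminal disclosed in an embodiment of the present invention.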
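Embodiments of the Invention
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings in the embodiments of the present invention. Obviously, the described embodiments are some, not all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative effort fall within the scope of protection of the present invention.
The terms "first", "second", "third", "fourth", and so on in the specification, the claims, and the drawings of the present invention are used to distinguish different objects, not to describe a specific order. In addition, the terms "include" and "have" and any variations of them are intended to cover non-exclusive inclusion. For example, a process, method, system, product, or device that includes a series of steps or units is not limited to the listed steps or units, but optionally also includes steps or units that are not listed, or optionally also includes other steps or units inherent to the process, method, product, or device.
Reference herein to an "embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment can be included in at least one embodiment of the present invention. The appearances of this phrase in various places in the specification do not necessarily all refer to the same embodiment, nor to separate or alternative embodiments that are mutually exclusive of other embodiments. Those skilled in the art understand, explicitly and implicitly, that the embodiments described herein can be combined with other embodiments.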
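Referring to FIG. 1, FIG. 1 provides a smart speaker-based home appliance voice control method. The method is performed by a smart speaker and, as shown in FIG. 1, includes the following steps:
Step S101: the smart speaker receives the voice data sent by the user.
The voice data sent by the user in step S101 may be received in various ways. For example, it may be received through a microphone, which may be a microphone built into the smart speaker. In practical applications, it may also be a microphone device connected to the smart speaker, such as the microphone of a karaoke device.
Step S102: the smart speaker performs recognition processing on the voice data to obtain the control object and the control instruction corresponding to the voice data.
The recognition in step S102 may use an existing speech recognition algorithm, such as a natural speech recognition algorithm, or a custom algorithm; the present invention does not limit the recognition algorithm used for the voice data. For a specific custom algorithm, refer to the description below.
Step S103: the smart speaker extracts the MAC address of the control object according to the control object, and sends the MAC address and the control instruction to the smart home control center to control the control object.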
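Steps S102 and S103 can be illustrated once the speech has already been turned into text. The sketch below assumes a transcript is available (the recognition algorithm itself is out of scope), uses simple keyword matching as a stand-in for semantic recognition, and builds a JSON message for the control center. The registry contents, the MAC addresses, the instruction codes, and the message format are all hypothetical; the source does not define any of them.

```python
import json

# Hypothetical registry: spoken device name -> MAC address (made-up values).
DEVICE_MACS = {
    "ceiling lamp": "00:11:22:33:44:55",
    "television": "66:77:88:99:AA:BB",
}

# Hypothetical mapping from spoken phrase to control instruction code.
INSTRUCTIONS = {"turn on": "ON", "turn off": "OFF"}


def parse_command(transcript):
    """Step S102 (simplified): find the control object and the instruction."""
    text = transcript.lower()
    device = next((name for name in DEVICE_MACS if name in text), None)
    instruction = next(
        (code for phrase, code in INSTRUCTIONS.items() if phrase in text), None
    )
    if device is None or instruction is None:
        raise ValueError("no known control object or instruction in transcript")
    return device, instruction


def build_control_message(transcript):
    """Step S103 (simplified): look up the MAC address and build the message
    that would be sent to the smart home control center."""
    device, instruction = parse_command(transcript)
    return json.dumps(
        {"mac": DEVICE_MACS[device], "instruction": instruction}, sort_keys=True
    )
```

For example, `build_control_message("Turn off the television")` yields a message carrying the television's MAC address and the `OFF` instruction; an unrecognized transcript raises `ValueError` rather than sending anything.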
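It should be noted that the control object may be of many kinds. For example, it may specifically include a smart light, a smart TV, a smart cleaning device, a smart sleep device, a smart monitoring device, and so on, and each kind may take various forms. A smart light includes, but is not limited to, a smart table lamp, a smart ceiling lamp, or a smart wall lamp. A smart TV may be a Samsung smart TV or a Sharp smart TV. A smart cleaning device may be a smart sweeping robot, and may also include a smart vacuum cleaner, a smart garbage disposal unit, and similar devices. A smart sleep device may be a smart mattress, a smart sofa, or a similar device. A smart monitoring device may be a smart sphygmomanometer, a smart thermometer, or the like. The present invention does not limit the specific form, number, or type of the above-mentioned smart speaker.
The technical solution provided by the present invention receives voice data through the smart speaker and analyzes the voice data to obtain the corresponding control object and control instruction, which makes it convenient for the user to control and use the smart home.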
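According to one aspect of the present invention, a time-segmented encryption method for data received by a smart home access point (AP) is provided. The method is applied in the home network shown in FIG. 2a or FIG. 2c. As shown in FIG. 2a, the home network includes a smart terminal 10, a smart home access point AP 20, and a gateway 30. The smart terminal may take different forms in different situations. For example, the smart speaker may specifically be a smart terminal, a tablet computer, a computer, or a similar device; it may also be another device with networking capability, such as a smart TV, a smart air conditioner, a smart kettle, or some other smart home terminal device. The smart speaker 10 is connected to the AP 20 wirelessly, and the AP 20 accesses the Internet through the gateway 30 using another mode (that is, a connection mode different from the wireless mode). The wireless mode includes, but is not limited to, Bluetooth and WIFI. The other mode may be LTE or a wired connection. The gateway may specifically be a mobile base station, a mobile relay station, a switch, or a similar device. FIG. 2a uses a wired connection as the example; for ease of representation, it is shown as a single solid line.
Depending on the size of the smart home, the gateway 30 may be a single personal computer (PC); in practical applications it may also be multiple PCs, a server, or a server group. The specific embodiments of the present invention do not limit the specific form of the gateway 30.
Referring to FIG. 2b, FIG. 2b is a flow chart of data transmission by the smart home AP. As shown in FIG. 2b, the flow includes:
Step S201: the smart speaker 10 sends the data packet to be sent to the AP 20 wirelessly;
Step S202: the AP 20 forwards the data packet to the gateway 30;
Step S203: the gateway 30 transmits the data packet to the control object.
As FIG. 2a and FIG. 2b show, in the actual transmission of a data packet, if a leak occurs between the AP 20 and the gateway 30, the transmitted data packet has not undergone any corresponding encryption, so the data can easily be leaked and security problems can easily arise.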
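The forwarding path in steps S201 to S203 can be sketched as a chain of in-memory objects. This is a toy model, not a network implementation: the class names and the `hops` trace are hypothetical, delivery is a direct method call, and the payload is deliberately left unencrypted so that each hop sees it in the clear, which is the weakness the paragraph above points out.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Packet:
    dest_mac: str                 # MAC address of the control object
    payload: str                  # control instruction, plaintext in this sketch
    hops: List[str] = field(default_factory=list)  # trace of nodes visited


class ControlObject:
    def __init__(self, mac: str) -> None:
        self.mac = mac
        self.received: List[Packet] = []

    def receive(self, packet: Packet) -> None:
        packet.hops.append("control object")
        self.received.append(packet)


class Gateway:
    def __init__(self, devices: List[ControlObject]) -> None:
        self.devices: Dict[str, ControlObject] = {d.mac: d for d in devices}

    def forward(self, packet: Packet) -> None:
        # Step S203: the gateway delivers the packet to the control object.
        packet.hops.append("gateway 30")
        self.devices[packet.dest_mac].receive(packet)


class AccessPoint:
    def __init__(self, gateway: Gateway) -> None:
        self.gateway = gateway

    def forward(self, packet: Packet) -> None:
        # Step S202: the AP forwards the packet to the gateway.
        packet.hops.append("AP 20")
        self.gateway.forward(packet)


class Speaker:
    def __init__(self, access_point: AccessPoint) -> None:
        self.access_point = access_point

    def send(self, dest_mac: str, payload: str) -> Packet:
        # Step S201: the speaker sends the packet to the AP.
        packet = Packet(dest_mac, payload, hops=["smart speaker 10"])
        self.access_point.forward(packet)
        return packet
```

Because `payload` is readable at the AP and the gateway, any protection would have to be added before step S201; the source names a time-segmented encryption method for this but does not detail it here.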
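Optionally, the following may further be included between step S101 and step S102:
the smart speaker extracts the waveform signal of the voice data, performs peak clipping on the peaks of the waveform signal whose amplitude is greater than the maximum amplitude, performs compensation on the troughs of the waveform signal whose amplitude is smaller than the minimum amplitude to obtain processed data lying between the maximum amplitude and the minimum amplitude, and sends the processed data to the speech recognition algorithm for semantic recognition.
This technical solution processes the amplitude of the voice data. The amplitude may be of several kinds, for example the frequency of the voice data or the volume of the voice data. The purpose of the processing is to prevent a waveform signal that is too large or too small from causing the speech recognition algorithm to misrecognize the voice data. For a speech recognition algorithm, the better the quality of the input voice data, the higher the recognition accuracy, so applying compensation or peak clipping to the original voice data yields processed data within the set range, and recognizing that data improves accuracy.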
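The text does not say how compensation is computed, so the following is one possible reading only. It takes "amplitude" as volume, which the paragraph above allows, measures the peak level of each fixed-size frame, and applies a gain: frames louder than the maximum are scaled down to it and quiet (non-silent) frames are scaled up to the minimum. Gain scaling is this sketch's substitute for literal clipping, and the function name, the frame size, and the decision to leave silent frames untouched are all assumptions.

```python
def normalize_frames(samples, frame_size, min_amplitude, max_amplitude):
    """Bring each frame's peak level into [min_amplitude, max_amplitude] by gain."""
    if frame_size <= 0:
        raise ValueError("frame_size must be positive")
    if not 0 < min_amplitude <= max_amplitude:
        raise ValueError("require 0 < min_amplitude <= max_amplitude")
    processed = []
    for start in range(0, len(samples), frame_size):
        frame = samples[start:start + frame_size]
        peak = max(abs(x) for x in frame)
        if peak > max_amplitude:
            gain = max_amplitude / peak     # too loud: attenuate the frame
        elif 0 < peak < min_amplitude:
            gain = min_amplitude / peak     # too quiet: amplify the frame
        else:
            gain = 1.0                      # in range, or pure silence
        processed.extend(x * gain for x in frame)
    return processed
```

Scaling a whole frame preserves the waveform shape inside it, at the cost of gain steps at frame boundaries; a production system would normally smooth the gain over time.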
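Optionally, the peak clipping may be performed in several ways. Specifically:
The peak clipping may remove the peak signal of the waveform signal that is greater than the maximum amplitude and replace it with a straight line, as illustrated in FIG. 2d.
The peak clipping may alternatively divide the waveform signal into n regions in proportion to the amplitude and process it using $Y = k_n \cdot x$, where, over the n regions, $1 > k_1 > k_2 > k_3 > \dots > k_n$, and the region corresponding to $k_n$ is the region of the waveform signal that contains the maximum amplitude; x is the original speech signal and Y is the signal after peak clipping.
This approach processes the peak region by region, which makes the voice data smoother and improves its quality.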
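The region-based variant can be sketched as follows. The source gives only the formula Y = k_n * x and the ordering of the coefficients, so several details here are assumptions: the n regions are taken to be equal-width bands of sample magnitude between zero and the waveform's peak, the coefficient for a sample is chosen by the band its magnitude falls in, and all coefficients are required to be positive. The function name `region_scale` is hypothetical.

```python
def region_scale(samples, coefficients):
    """Apply Y = k_i * x, where i is the amplitude band that |x| falls in.

    coefficients is [k_1, ..., k_n] with 1 > k_1 > k_2 > ... > k_n > 0;
    k_n applies to the band that contains the maximum amplitude.
    """
    n = len(coefficients)
    if n == 0:
        raise ValueError("at least one coefficient is required")
    decreasing = all(a > b for a, b in zip(coefficients, coefficients[1:]))
    if not (coefficients[0] < 1 and coefficients[-1] > 0 and decreasing):
        raise ValueError("require 1 > k_1 > k_2 > ... > k_n > 0")
    peak = max((abs(x) for x in samples), default=0.0)
    if peak == 0:
        return list(samples)               # empty or silent input: nothing to scale
    scaled = []
    for x in samples:
        # Band index 0..n-1; the peak itself is kept in the last band.
        band = min(int(abs(x) / peak * n), n - 1)
        scaled.append(coefficients[band] * x)
    return scaled
```

Larger samples are attenuated more strongly than smaller ones, which compresses the peaks. Taken literally, the piecewise formula is discontinuous at band boundaries, so a real implementation would probably interpolate between coefficients; the source does not address this.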
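Referring to FIG. 3, FIG. 3 provides a smart speaker, the smart speaker including:
a receiving unit 301 configured to receive voice data;
a processing unit 302 configured to perform recognition processing on the voice data to obtain the control object and the control instruction corresponding to the voice data, and to extract the MAC address of the control object according to the control object;
a sending unit 303 configured to send the MAC address and the control instruction to the smart home control center to control the control object.
Optionally, the processing unit is further configured to extract the waveform signal of the voice data, perform peak clipping on the peaks of the waveform signal whose amplitude is greater than the maximum amplitude, perform compensation on the troughs of the waveform signal whose amplitude is smaller than the minimum amplitude to obtain processed data lying between the maximum amplitude and the minimum amplitude, and send the processed data to a speech recognition algorithm for semantic recognition.
Optionally, the processing unit is specifically configured to remove the peak signal of the waveform signal that is greater than the maximum amplitude and replace the peak signal with a straight line.
Optionally, the processing unit is specifically configured to divide the waveform signal into n regions in proportion to its maximum amplitude and process it using $Y = k_n \cdot x$, where, over the n regions, $1 > k_1 > k_2 > k_3 > \dots > k_n$, and the region corresponding to $k_n$ is the region of the waveform signal that contains the maximum amplitude;
where x is the original speech signal and Y is the signal after peak clipping.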
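FIG. 4 is a block diagram of part of the structure of a smart terminal related to the mobile terminal provided by an embodiment of the present invention. Referring to FIG. 4, the smart terminal includes a radio frequency (RF) circuit 910, a memory 920, an input unit 930, a sensor 950, an audio circuit 960, a wireless fidelity (WiFi) module 970, an application processor AP 980, a communication module 991, a power supply 990, and other components. Those skilled in the art can understand that the smart terminal structure shown in FIG. 4 does not limit the smart terminal, which may include more or fewer components than shown, combine certain components, or arrange the components differently.
The components of the smart terminal are described in detail below with reference to FIG. 4:
The communication module 991 may specifically be an LTE communication module; it may also be another communication module that supports the CSFB function.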
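The input unit 930 can be used to receive input numeric or character information and to generate key signal inputs related to user settings and function control of the smart terminal. Specifically, the input unit 930 may include a touch display screen 933, a fingerprint recognition device 931, and other input devices 932. The fingerprint recognition device 931 is coupled to the touch display screen 933. The input unit 930 may also include other input devices 932. Specifically, the other input devices 932 may include, but are not limited to, one or more of physical buttons, function keys (such as volume control buttons and switch buttons), a trackball, a mouse, a joystick, and the like. The touch display screen 933 is configured to collect a touch parameter set when it detects that the user performs a sliding operation on the touch display screen 933, to notify the fingerprint recognition device 931 to perform fingerprint collection, and to send the touch parameter set to the AP 980. The fingerprint recognition device 931 is configured to collect a fingerprint image and send the fingerprint image to the AP 980. The AP 980 is configured to verify the touch parameter set and the fingerprint image separately.
The AP 980 is the control center of the smart terminal. It connects the various parts of the entire smart terminal using various interfaces and lines, and performs the various functions of the smart terminal and processes data by running or executing the software programs and/or modules stored in the memory 920 and calling the data stored in the memory 920, thereby monitoring the smart terminal as a whole. Optionally, the AP 980 may include one or more processing units. Optionally, the AP 980 may integrate an application processor and a modem processor, where the application processor mainly handles the operating system, user interface, applications, and so on, and the modem processor mainly handles wireless communication. It can be understood that the modem processor may also not be integrated into the AP 980.
In addition, the memory 920 may include high-speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, a flash memory device, or another volatile solid-state storage device.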
RF电路910可用于信息的接收和发送。通常,RF电路910包括但不限于天线、至少一个放大器、收发信机、耦合器、低噪声放大器(Low Noise Amplifier,LNA)、双工器等。此外,RF电路910还可以通过无线通信与网络和其他设备通信。上述无线通信可以使用任一通信标准或协议,包括但不限于全球移动通讯系统 (Global System of Mobile communication,GSM)、通用分组无线服务(General Packet Radio Service,GPRS)、码分多址(Code Division Multiple Access,CDMA)、宽带码分多址(Wideband Code Division Multiple Access, WCDMA)、长期演进 (Long Term Evolution,LTE)、电子邮件、短消息服务(Short Messaging Service,SMS)等。
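The smart terminal may further include at least one sensor 950, such as a light sensor, a motion sensor, or another sensor. Specifically, the light sensor may include an ambient light sensor and a proximity sensor, where the ambient light sensor can adjust the brightness of the touch display screen according to the ambient light level, and the proximity sensor can turn off the touch display screen and/or the backlight when the smart terminal is moved to the ear. As one type of motion sensor, an accelerometer can detect the magnitude of acceleration in each direction (generally along three axes) and, when stationary, the magnitude and direction of gravity. It can be used in applications that recognize the attitude of the smart terminal (such as switching between landscape and portrait, related games, and magnetometer attitude calibration) and in vibration recognition functions (such as a pedometer or tap detection). Other sensors that may also be configured in the smart terminal, such as a gyroscope, barometer, hygrometer, thermometer, and infrared sensor, are not described here.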
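The audio circuit 960, a speaker 961, and a microphone 962 may provide an audio interface between the user and the smart terminal. The audio circuit 960 may convert received audio data into an electrical signal and transmit it to the speaker 961, which converts it into a sound signal for playback. Conversely, the microphone 962 converts a collected sound signal into an electrical signal, which the audio circuit 960 receives and converts into audio data; the audio data is then output to the AP980 for processing and sent via the RF circuit 910 to, for example, another smart terminal, or output to the memory 920 for further processing.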
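WiFi is a short-range wireless transmission technology. Through the WiFi module 970, the smart terminal can help the user send and receive email, browse web pages, access streaming media, and so on, providing the user with wireless broadband Internet access. Although FIG. 4 shows the WiFi module 970, it will be understood that it is not an essential part of the smart terminal and may be omitted as needed without changing the essence of the invention.
The smart terminal further includes a power supply 990 (such as a battery) that supplies power to the components. Optionally, the power supply may be logically connected to the AP980 through a power management system, so that charging, discharging, and power consumption management are handled through the power management system.
Although not shown, the smart terminal may further include a camera, a Bluetooth module, a fill light device, a light sensor, and the like, which are not described here.
In the embodiment shown in FIG. 1 above, the method flow of each step may be implemented based on the structure of this smart terminal.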
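As can be seen, with the embodiments of the present invention, the mobile terminal assigns different priorities according to the recognition order of different biometric identifications, and, within a set time, if a second application started by the user is of a different type from the first application, the user is required to perform the multi-biometric identification operation again, which avoids the security problem caused by directly giving the highest priority to applications of different types.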
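An embodiment of the present invention further provides a computer storage medium, where the computer storage medium stores a computer program for electronic data exchange, and the computer program causes a computer to perform some or all of the steps of any smart speaker-based household appliance voice control method described in the above method embodiments.
An embodiment of the present invention further provides a computer program product, the computer program product including a non-transitory computer-readable storage medium storing a computer program, the computer program being operable to cause a computer to perform some or all of the steps of any smart speaker-based household appliance voice control method described in the above method embodiments.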
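It should be noted that, for brevity, the foregoing method embodiments are each described as a series of combined actions, but those skilled in the art should understand that the present invention is not limited by the described order of actions, because according to the present invention certain steps may be performed in another order or simultaneously. Those skilled in the art should also understand that the embodiments described in the specification are all optional embodiments, and the actions and modules involved are not necessarily required by the present invention.
In the above embodiments, the description of each embodiment has its own emphasis; for parts not described in detail in one embodiment, refer to the related descriptions of other embodiments.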
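In the several embodiments provided in this application, it should be understood that the disclosed apparatus may be implemented in other ways. For example, the apparatus embodiments described above are merely illustrative: the division into units is only a division by logical function, and other divisions are possible in actual implementation; for instance, multiple units or components may be combined or integrated into another system, or some features may be ignored or not performed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be an indirect coupling or communication connection through some interfaces, apparatuses, or units, and may be electrical or take other forms.
The units described as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware or in the form of a software program module.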
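If the integrated unit is implemented as a software program module and sold or used as a standalone product, it may be stored in a computer-readable memory. Based on this understanding, the technical solution of the present invention in essence, or the part that contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product. The computer software product is stored in a memory and includes several instructions for causing a computer device (which may be a personal computer, a server, a network device, or the like) to perform all or some of the steps of the methods described in the embodiments of the present invention. The aforementioned memory includes various media that can store program code, such as a USB flash drive, read-only memory (ROM), random access memory (RAM), a removable hard disk, a magnetic disk, or an optical disc.
Those of ordinary skill in the art will understand that all or some of the steps of the methods in the above embodiments may be completed by a program instructing the relevant hardware. The program may be stored in a computer-readable memory, which may include a flash drive, read-only memory (ROM), random access memory (RAM), a magnetic disk, an optical disc, or the like.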
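The embodiments of the present invention have been described in detail above. Specific examples are used herein to explain the principles and implementations of the present invention, and the description of the above embodiments is intended only to help in understanding the method of the present invention and its core idea. At the same time, those of ordinary skill in the art may, following the idea of the present invention, make changes to the specific implementations and the scope of application. In summary, the content of this specification should not be construed as limiting the present invention.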

Claims (10)

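  1. A smart speaker-based household appliance voice control method, characterized in that the method comprises:
    a smart speaker receiving voice data;
    the smart speaker performing recognition processing on the voice data to obtain a control object and a control instruction corresponding to the voice data;
    the smart speaker extracting the MAC address of the control object according to the control object, and sending the MAC address and the control instruction to a smart home control center to control the control object.
  2. The method according to claim 1, characterized in that, after the smart speaker receives the voice data, the method further comprises:
    the smart speaker extracting a waveform signal of the voice data, performing peak clipping on peaks of the waveform signal whose amplitude is greater than a maximum amplitude, performing compensation on troughs of the waveform signal whose amplitude is less than a minimum amplitude to obtain processed data lying between the maximum amplitude and the minimum amplitude, and sending the processed data to a speech recognition algorithm for semantic recognition.
  3. The method according to claim 2, characterized in that the peak clipping is implemented by:
    removing the peak signal of the waveform signal that exceeds the maximum amplitude, and replacing the peak signal with a straight line.
  4. The method according to claim 2, characterized in that the peak clipping is implemented by:
    dividing the waveform signal into n regions in proportion to the maximum amplitude and processing the waveform signal using Y = k_n * x, where the coefficients of the n regions satisfy 1 > k_1 > k_2 > k_3 > … > k_n, and k_n corresponds to the region of the waveform signal that contains the maximum amplitude;
    where x is the original voice signal and Y is the signal after peak clipping.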
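  5. A smart speaker, characterized in that the smart speaker comprises:
    a receiving unit, configured to receive voice data;
    a processing unit, configured to perform recognition processing on the voice data to obtain a control object and a control instruction corresponding to the voice data, and to extract the MAC address of the control object according to the control object;
    a sending unit, configured to send the MAC address and the control instruction to a smart home control center to control the control object.
  6. The smart speaker according to claim 5, characterized in that
    the processing unit is further configured to extract a waveform signal of the voice data, perform peak clipping on peaks of the waveform signal whose amplitude is greater than a maximum amplitude, perform compensation on troughs of the waveform signal whose amplitude is less than a minimum amplitude to obtain processed data lying between the maximum amplitude and the minimum amplitude, and send the processed data to a speech recognition algorithm for semantic recognition.
  7. The smart speaker according to claim 6, characterized in that
    the processing unit is specifically configured to remove the peak signal of the waveform signal that exceeds the maximum amplitude and replace the peak signal with a straight line.
  8. The smart speaker according to claim 6, characterized in that
    the processing unit is specifically configured to divide the waveform signal into n regions in proportion to the maximum amplitude and to process the waveform signal using Y = k_n * x, where the coefficients of the n regions satisfy 1 > k_1 > k_2 > k_3 > … > k_n, and k_n corresponds to the region of the waveform signal that contains the maximum amplitude;
    where x is the original voice signal and Y is the signal after peak clipping.
  9. A computer-readable storage medium, characterized in that it stores a computer program for electronic data exchange, where the computer program causes a computer to perform the method according to any one of claims 1 to 4.
  10. A computer program product, the computer program product including a non-transitory computer-readable storage medium storing a computer program, the computer program being operable to cause a computer to perform the method according to any one of claims 1 to 4.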
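PCT/CN2017/104722 2017-09-30 2017-09-30 Smart speaker-based household appliance voice control method and related products WO2019061382A1 (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/104722 WO2019061382A1 (zh) 2017-09-30 2017-09-30 Smart speaker-based household appliance voice control method and related products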

Publications (1)

Publication Number Publication Date
WO2019061382A1 (zh) 2019-04-04

Family

ID=65900437

Country Status (1)

Country Link
WO (1) WO2019061382A1 (zh)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106385347A (zh) * 2016-09-09 2017-02-08 珠海格力电器股份有限公司 Household appliance control method and apparatus
WO2017071645A1 (zh) * 2015-10-28 2017-05-04 中兴通讯股份有限公司 Voice control method, apparatus, and system
CN106685772A (zh) * 2016-12-23 2017-05-17 北京奇虎科技有限公司 Smart speaker, smart home system, and implementation method thereof
CN106886166A (zh) * 2015-12-11 2017-06-23 美的集团股份有限公司 Method and apparatus for controlling household appliances through a speaker, and speaker

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17927462

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17927462

Country of ref document: EP

Kind code of ref document: A1