WO2023185006A1 - 一种工作模式设置方法及装置 - Google Patents

一种工作模式设置方法及装置 Download PDF

Info

Publication number
WO2023185006A1
WO2023185006A1 PCT/CN2022/132600 CN2022132600W WO2023185006A1 WO 2023185006 A1 WO2023185006 A1 WO 2023185006A1 CN 2022132600 W CN2022132600 W CN 2022132600W WO 2023185006 A1 WO2023185006 A1 WO 2023185006A1
Authority
WO
WIPO (PCT)
Prior art keywords
target
working mode
voice information
user
voiceprint
Prior art date
Application number
PCT/CN2022/132600
Other languages
English (en)
French (fr)
Inventor
张凯月
张桂芳
Original Assignee
青岛海尔空调器有限总公司
青岛海尔空调电子有限公司
海尔智家股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 青岛海尔空调器有限总公司, 青岛海尔空调电子有限公司, 海尔智家股份有限公司 filed Critical 青岛海尔空调器有限总公司
Publication of WO2023185006A1 publication Critical patent/WO2023185006A1/zh

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/02Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
    • FMECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
    • F24HEATING; RANGES; VENTILATING
    • F24FAIR-CONDITIONING; AIR-HUMIDIFICATION; VENTILATION; USE OF AIR CURRENTS FOR SCREENING
    • F24F11/00Control or safety arrangements
    • F24F11/50Control or safety arrangements characterised by user interfaces or communication
    • F24F11/52Indication arrangements, e.g. displays
    • F24F11/526Indication arrangements, e.g. displays giving audible indications
    • FMECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
    • F24HEATING; RANGES; VENTILATING
    • F24FAIR-CONDITIONING; AIR-HUMIDIFICATION; VENTILATION; USE OF AIR CURRENTS FOR SCREENING
    • F24F11/00Control or safety arrangements
    • F24F11/50Control or safety arrangements characterised by user interfaces or communication
    • F24F11/54Control or safety arrangements characterised by user interfaces or communication using one central controller connected to several sub-controllers
    • FMECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
    • F24HEATING; RANGES; VENTILATING
    • F24FAIR-CONDITIONING; AIR-HUMIDIFICATION; VENTILATION; USE OF AIR CURRENTS FOR SCREENING
    • F24F11/00Control or safety arrangements
    • F24F11/62Control or safety arrangements characterised by the type of control or by internal processing, e.g. using fuzzy logic, adaptive control or estimation of values
    • F24F11/63Electronic processing
    • F24F11/64Electronic processing using pre-stored data
    • FMECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
    • F24HEATING; RANGES; VENTILATING
    • F24FAIR-CONDITIONING; AIR-HUMIDIFICATION; VENTILATION; USE OF AIR CURRENTS FOR SCREENING
    • F24F11/00Control or safety arrangements
    • F24F11/62Control or safety arrangements characterised by the type of control or by internal processing, e.g. using fuzzy logic, adaptive control or estimation of values
    • F24F11/63Electronic processing
    • F24F11/65Electronic processing for selecting an operating mode
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/22Interactive procedures; Man-machine interfaces

Definitions

  • the present application relates to the field of artificial intelligence technology, and in particular to a working mode setting method.
  • the existing air conditioning control requires the elderly to enter their voiceprint on the application (Application, APP) before executing voice commands.
  • the present application provides a working mode setting method and device to solve the defects of cumbersome input in the prior art and realize convenient operation of air conditioning settings.
  • This application provides a working mode setting method, which includes: receiving target voice information sent by the target user;
  • the target voice information obtain the confidence that the target user is identified as the target group
  • the working mode corresponding to the target user is set.
  • obtaining the confidence that the target user is identified as a target group based on the target voice information includes:
  • the voiceprint similarity between the target user and the target group is scored to obtain the confidence level.
  • setting the working mode corresponding to the target user according to the confidence includes:
  • the response instruction is input by any user in response to the voice prompt of the working mode.
  • a working mode setting method after obtaining the target voiceprint characteristics, it also includes:
  • a target working mode is set.
  • performing voiceprint analysis on the target voice information to obtain target voiceprint characteristics includes:
  • Voiceprint extraction is performed on the windowed voice information to obtain target voiceprint features of the target voice information.
  • a working mode setting method after receiving the target voice information sent by the target user, it also includes:
  • the target working mode is set and an opening voice prompt is generated.
  • This application also provides a working mode setting device, including: a receiving module for receiving target voice information sent by the target user;
  • An acquisition module configured to acquire the confidence that the target user is identified as the target group based on the target voice information
  • a setting module configured to set the working mode corresponding to the target user according to the confidence level.
  • This application also provides an electronic device, including a memory, a processor, and a computer program stored in the memory and executable on the processor.
  • the processor executes the program, it implements any one of the above working mode settings. method.
  • the present application also provides a non-transitory computer-readable storage medium on which a computer program is stored.
  • the computer program When executed by a processor, it implements any one of the above working mode setting methods.
  • the present application also provides a computer program product, which includes a computer program.
  • a computer program product which includes a computer program.
  • the computer program When executed by a processor, it implements any one of the above working mode setting methods.
  • the working mode setting method and device provided by this application determines whether the user is a specific group through the confidence level of the voice information, and then automatically sets the customized working mode of the group. There is no need to record voiceprints in advance and intermediate operations. It is simple and direct, and for the user friendly.
  • FIG 1 is one of the flow diagrams of the working mode setting method provided by this application.
  • FIG. 2 is the second schematic flow chart of the working mode setting method provided by this application.
  • FIG. 3 is a schematic structural diagram of the working mode setting device provided by this application.
  • Figure 4 is a schematic structural diagram of an electronic device provided by this application.
  • the working mode setting method provided by this application adopts non-registration voiceprint recognition technology.
  • the elderly do not need to enter their own voiceprints on the APP.
  • the smart air conditioner can automatically identify and determine whether the user is an elderly person, and Combined with the elderly model tailored to the physical condition of the elderly.
  • the execution subject may be an electronic device or a software or functional module or functional entity in the electronic device that can implement the working mode setting method.
  • the electronic device includes but is not limited to smart air conditioning equipment. . It should be noted that the above execution entities do not constitute a limitation on this application.
  • Figure 1 is one of the flow diagrams of the working mode setting method provided by this application. As shown in Figure 1, it includes but is not limited to the following steps:
  • step S1 the target voice information sent by the target user is received.
  • the target user who sends the target voice message can be a registered user who has entered a voiceprint, or an unregistered user who has not entered a voiceprint.
  • the target voice message can be a power-on command or an elder care mode setting command.
  • step S2 obtain the confidence that the target user is identified as the target group based on the target voice information.
  • the target group can be the elderly.
  • the target speech information is preprocessed by pre-emphasis, framing, and windowing, and the preprocessed target speech information is converted into a voiceprint feature map.
  • the voiceprint feature map may be a Mel energy spectrum map. Mel energy spectrogram can represent the frequency distribution of sounds that people can hear, which is the deep feature of people identifying things through sound. Using this distribution characteristic in the Mel frequency domain is more suitable for building a speaker recognition system.
  • the speech signal passes through Through such conversion, the speech signal becomes an image carrying voiceprint information.
  • its Mel energy spectrum is black and white and can be understood as a single-channel feature map.
  • the scoring neural network model has been trained with a large amount of sample data.
  • the sample data includes the group label of the sample user and the sample voiceprint feature map. Therefore, after inputting the voiceprint feature map of the target user, the confidence level of the target user can be obtained.
  • step S3 the working mode corresponding to the target user is set according to the confidence level.
  • the target user's confidence is greater than the confidence threshold, it is determined that the target user is an elderly group, and the working mode is set to the elder care mode; when it is determined that the target user's confidence is not greater than the confidence threshold, the target If the user is not an elderly person, a prompt for setting the working mode is generated.
  • the target user can set the air conditioner working mode according to the prompt, which may include temperature, wind speed, wind direction, etc.
  • the working mode setting method provided by this application determines whether the user belongs to a specific group through the confidence level of the voice information, and then automatically sets the customized working mode of the group. There is no need to record voiceprints in advance and intermediate operations. It is simple, direct, and user-friendly.
  • obtaining the confidence that the target user is identified as the target group based on the target voice information includes:
  • the voiceprint similarity between the target user and the target group is scored to obtain the confidence level.
  • the power-on command is executed.
  • voiceprint analysis is performed on the target voice information, the characteristic information of the target voice information is extracted, and input into the voiceprint recognition model, and the output is the target of the target voice information.
  • voiceprint recognition model is a deep neural network model that is trained on a large amount of Chinese corpus and has strong noise resistance and robustness.
  • the scoring model is obtained by training the neural network model with training samples composed of multiple sample voiceprint features and the age label corresponding to each sample voiceprint feature.
  • performing voiceprint analysis on the target voice information to obtain target voiceprint features includes:
  • Voiceprint extraction is performed on the windowed voice information to obtain target voiceprint features of the target voice information.
  • the high-frequency end is attenuated at about 6 decibels/octave (dB/oct) above 800 Hz.
  • Digital filters can be used to pre-emphasize the target speech information.
  • the voiceprint signal is divided into several frames at intervals of 10 to 20 milliseconds (ms), and one frame is a basic unit to achieve the framing of pre-emphasized voice information.
  • the Hamming window function is used to window the framed speech information.
  • the working mode setting method provided by this application, through pre-emphasis, framing and windowing of the target speech information, the aliasing and high-order harmonics caused by the human vocal organs themselves and the equipment for collecting speech signals can be eliminated. Distortion, high frequency and other factors affect the quality of speech signals. Try to ensure that the signal obtained by subsequent speech processing is more uniform and smooth, provide high-quality parameters for signal parameter extraction, and improve the quality of speech processing.
  • the method further includes:
  • a target working mode is set.
  • the characteristic user is the user who sends the target voice message.
  • the age tag can be determined based on the user's registration information. Based on the age tag, the target user can be determined to be an elderly group and the "elderly care mode" can be turned on.
  • the elder care model is the optimal air solution for the elderly obtained through experiments by the Human Comfort Research Institute.
  • PMV human comfort intelligent control system
  • the upper and lower guide plates are in the upward blowing position in summer, and the upper and lower guide plates are in the downward blowing position in winter.
  • the method further includes:
  • the target working mode is set, and an opening voice prompt is generated.
  • setting the working mode corresponding to the target user according to the confidence level includes:
  • the response instruction is input by any user in response to the voice prompt of the working mode.
  • the preset threshold can be 80; the preset confidence interval can be greater than 70 and less than 80.
  • FIG 2 is the second flow diagram of the working mode setting method provided by this application. As shown in Figure 2, it includes:
  • the air conditioner when the air conditioner is turned off, it receives the user's voice wake-up command, such as "turn on the air conditioner", confirms the user's intention to turn on the air conditioner, executes the voice wake-up command, and turns on the air conditioner;
  • the user's voice wake-up command such as "turn on the air conditioner”
  • the confidence level is not less than 80, it is determined that the user is an elderly person, the elderly mode is turned on, and "Elder care mode is turned on, and you can enjoy the air conditioning healthily and comfortably!"
  • the confidence level is not greater than 70, it is determined that the user is not an elderly person. According to the original logic of the smart air conditioner, only the user's settings are executed, and the air conditioner is prompted to turn on, and " ⁇ device Name> is turned on” is broadcast;
  • a setting voice prompt is generated to ask: Do you need to turn on the elder care mode for you?
  • the air conditioner when the air conditioner is turned on and woken up by voice, the user actively expresses the intention to switch to the elder care mode, such as "turn on the elder care mode";
  • the elder care mode is not currently turned on, the elderly mode is turned on and the message "The elder care mode is turned on, and you can blow the air conditioner healthily and comfortably!"
  • Table 1 is the voiceprint trigger mode table for the elderly, including the trigger conditions of Natural Language Generation (NLG), NLG content and entrance corpus. Among them, the identification of the smart air conditioner is the device name.
  • NLG Natural Language Generation
  • users can turn on and off the voiceprint function for the elderly through the APP that is equipped with the smart air conditioner.
  • the switch for the voiceprint function for the elderly is turned off by default and will take effect when the user turns it on.
  • the voice side cloud of the air conditioner is connected to the Voiceprint Application Programming Interface (API).
  • API Voiceprint Application Programming Interface
  • Voiceprint recognition returns the confidence level to the voice side cloud.
  • the confidence threshold and confidence interval can be adjusted according to the actual situation.
  • the working mode setting device provided by the present application is described below.
  • the working mode setting device described below and the working mode setting method described above can be mutually referenced.
  • FIG 3 is a schematic structural diagram of the working mode setting device provided by this application. As shown in Figure 3, it includes:
  • the receiving module 301 is used to receive the target voice information sent by the target user;
  • the acquisition module 302 is configured to obtain the confidence level that the target user is identified as the target group according to the target voice information
  • the setting module 303 is configured to set the working mode corresponding to the target user according to the confidence level.
  • the receiving module 301 receives the target voice information sent by the target user.
  • the target user who sends the target voice message can be a registered user who has entered a voiceprint, or an unregistered user who has not entered a voiceprint.
  • the target voice message can be a power-on command or an elder care mode setting command.
  • the obtaining module 302 obtains the confidence that the target user is identified as the target group based on the target voice information.
  • the target group can be the elderly.
  • the target voice information is preprocessed such as pre-emphasis, framing, and windowing, and the preprocessed target voice information is converted into a voiceprint feature map.
  • the voiceprint feature map may be a Mel energy spectrum map. Mel energy spectrogram can represent the frequency distribution of sounds that people can hear, which is the deep feature of people identifying things through sound. Using this distribution characteristic in the Mel frequency domain is more suitable for building a speaker recognition system.
  • the speech signal passes through Through such conversion, the speech signal becomes an image carrying voiceprint information. For a single signal, its Mel energy spectrum is black and white and can be understood as a single-channel feature map.
  • the scoring neural network model has been trained with a large amount of sample data.
  • the sample data includes the group label of the sample user and the sample voiceprint feature map. Therefore, after inputting the voiceprint feature map of the target user, the confidence level of the target user can be obtained.
  • the setting module 303 sets the working mode corresponding to the target user according to the confidence level.
  • the target user's confidence is greater than the confidence threshold, it is determined that the target user is an elderly group, and the working mode is set to the elder care mode; when it is determined that the target user's confidence is not greater than the confidence threshold, the target If the user is not an elderly person, a prompt for setting the working mode is generated.
  • the target user can set the air conditioner working mode according to the prompt, which may include temperature, wind speed, wind direction, etc.
  • the working mode setting device determines whether the user belongs to a specific group through the confidence level of the voice information, and then automatically sets the customized working mode of the group. There is no need to record voiceprints in advance and intermediate operations. It is simple, direct, and user-friendly.
  • FIG 4 is a schematic structural diagram of an electronic device provided by this application.
  • the electronic device may include: a processor (processor) 410, a communications interface (Communications Interface) 420, a memory (memory) 430 and a communication bus 440.
  • the processor 410, the communication interface 420, and the memory 430 complete communication with each other through the communication bus 440.
  • the processor 410 can call logical instructions in the memory 430 to execute a working mode setting method.
  • the method includes: receiving target voice information sent by a target user; and obtaining, according to the target voice information, the target user identified as a target group. Confidence; according to the confidence, set the working mode corresponding to the target user.
  • the above-mentioned logical instructions in the memory 430 can be implemented in the form of software functional units and can be stored in a computer-readable storage medium when sold or used as an independent product.
  • the technical solution of the present application is essentially or the part that contributes to the existing technology or the part of the technical solution can be embodied in the form of a software product.
  • the computer software product is stored in a storage medium, including Several instructions are used to cause a computer device (which may be a personal computer, a server, or a network device, etc.) to execute all or part of the steps of the methods described in various embodiments of this application.
  • the aforementioned storage media include: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disk or optical disk and other media that can store program code. .
  • the present application also provides a computer program product.
  • the computer program product includes a computer program.
  • the computer program can be stored on a non-transitory computer-readable storage medium.
  • the computer can Execute the working mode setting method provided by each of the above methods.
  • the method includes: receiving target voice information sent by the target user; obtaining the confidence level that the target user is identified as the target group according to the target voice information; degree, and set the working mode corresponding to the target user.
  • the present application also provides a non-transitory computer-readable storage medium on which a computer program is stored.
  • the computer program is implemented when executed by the processor to perform the working mode setting method provided by each of the above methods.
  • the method includes : Receive the target voice information sent by the target user; obtain the confidence level that the target user is identified as the target group based on the target voice information; and set the working mode corresponding to the target user based on the confidence level.
  • the device embodiments described above are only illustrative.
  • the units described as separate components may or may not be physically separated.
  • the components shown as units may or may not be physical units, that is, they may be located in One location, or it can be distributed across multiple network units. Some or all of the modules can be selected according to actual needs to achieve the purpose of the solution of this embodiment. Persons of ordinary skill in the art can understand and implement the method without any creative effort.
  • each embodiment can be implemented by means of software plus the necessary general hardware platform, and of course it can also be implemented by hardware.
  • the computer software product can be stored in a computer-readable storage medium, such as ROM/RAM, magnetic disk, optical disk, etc., including a number of instructions to cause a computer device (which can be a personal computer, a server, or a network device, etc.) to execute the methods described in various embodiments or certain parts of the embodiments.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Combustion & Propulsion (AREA)
  • Mechanical Engineering (AREA)
  • General Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Chemical & Material Sciences (AREA)
  • Acoustics & Sound (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Fuzzy Systems (AREA)
  • Mathematical Physics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Air Conditioning Control Device (AREA)

Abstract

一种工作模式设置方法,包括:接收目标用户发送的目标语音信息(S1);根据目标语音信息,获取目标用户被认定为目标群体的置信度(S2);根据置信度,设置目标用户对应的工作模式(S3)。还涉及一种工作模式设置装置、电子设备、计算机刻度存储介质。提供的工作模式设置方法及装置,通过语音信息的置信度确定用户是否为特定的群体,进而自动设置该群体的定制化工作模式,无需提前录制声纹以及中间操作,简单直接,对于用户友好。

Description

一种工作模式设置方法及装置
相关申请的交叉引用
本申请要求于2022年3月29日提交的申请号为202210324202.6,名称为“一种工作模式设置方法及装置”的中国专利申请的优先权,其通过引用方式全部并入本文。
技术领域
本申请涉及人工智能技术领域,尤其涉及一种工作模式设置方法。
背景技术
老年人因为年龄大了身体机能减弱,比较容易受到环境因素的影响而诱发各种疾病,室内空气污染常常会诱发哮喘,肺、功能减弱等呼吸系统疾病,且温度过低会引发关节疾病。
现有的空调控制需要老人先在应用程序(Application,APP)上录入声纹,再执行语音指令。
然而,声纹录入过程繁琐,老人对网络操作不熟悉不适应,学习困难。
发明内容
本申请提供一种工作模式设置方法及装置,用以解决现有技术中录入繁琐的缺陷,实现空调设置的便捷操作。
本申请提供一种工作模式设置方法,包括:接收目标用户发送的目标语音信息;
根据所述目标语音信息,获取所述目标用户被认定为目标群体的置信度;
根据所述置信度,设置所述目标用户对应的工作模式。
根据本申请提供的一种工作模式设置方法,所述根据所述目标语音信息,获取所述目标用户被认定为目标群体的置信度,包括:
在确定所述目标语音信息为开机指令的情况下,执行开机指令;
对所述目标语音信息进行声纹分析,获取目标声纹特征;
根据所述目标声纹特征,对所述目标用户与所述目标群体之间的声纹 相似度进行打分,获取所述置信度。
根据本申请提供的一种工作模式设置方法,所述根据所述置信度,设置所述目标用户对应的工作模式,包括:
在确定所述置信度不小于预设阈值的情况下,确定所述目标用户为目标群体,以设置目标工作模式;
在确定所述置信度小于所述预设阈值,且处于预设置信度区间的情况下,生成工作模式语音提示;
接收回应指令,以设置所述目标工作模式;
所述回应指令是任一用户响应所述工作模式语音提示后输入的。
根据本申请提供的一种工作模式设置方法,在所述获取目标声纹特征之后,还包括:
比对所述目标声纹特征与所有注册用户的录入声纹特征;
在确定所述目标用户为注册用户的情况下,从注册信息中确定所述目标用户的年龄标签;
在根据所述年龄标签,确定所述目标用户为所述目标群体的情况下,设置目标工作模式。
根据本申请提供的一种工作模式设置方法,所述对所述目标语音信息进行声纹分析,获取目标声纹特征,包括:
对所述目标语音信息进行预加重,确定预加重语音信息;
对所述预加重语音信息进行分帧,确定分帧语音信息;
对所述分帧语音信息进行加窗,获取加窗语音信息;
对所述加窗语音信息进行声纹提取,获取所述目标语音信息的目标声纹特征。
根据本申请提供的一种工作模式设置方法,在所述接收目标用户发送的目标语音信息之后,还包括:
在确定开机状态的情况下,确定所述目标语音信息为打开目标模式指令;
根据所述打开目标模式指令,确定当前工作模式;
在确定所述当前工作模式为目标工作模式的情况下,生成已开启提示;
在确定所述当前工作模式不为所述目标工作模式的情况下,设置目标 工作模式,并生成开启语音提示。
本申请还提供一种工作模式设置装置,包括:接收模块,用于接收目标用户发送的目标语音信息;
获取模块,用于根据所述目标语音信息,获取所述目标用户被认定为目标群体的置信度;
设置模块,用于根据所述置信度,设置所述目标用户对应的工作模式。
本申请还提供一种电子设备,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,所述处理器执行所述程序时实现如上述任一种所述工作模式设置方法。
本申请还提供一种非暂态计算机可读存储介质,其上存储有计算机程序,该计算机程序被处理器执行时实现如上述任一种所述工作模式设置方法。
本申请还提供一种计算机程序产品,包括计算机程序,所述计算机程序被处理器执行时实现如上述任一种所述工作模式设置方法。
本申请提供的工作模式设置方法及装置,通过语音信息的置信度确定用户是否为特定的群体,进而自动设置该群体的定制化工作模式,无需提前录制声纹以及中间操作,简单直接,对于用户友好。
附图说明
为了更清楚地说明本申请或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作一简单地介绍,显而易见地,下面描述中的附图是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。
图1是本申请提供的工作模式设置方法的流程示意图之一;
图2是本申请提供的工作模式设置方法的流程示意图之二;
图3是本申请提供的工作模式设置装置的结构示意图;
图4是本申请提供的电子设备的结构示意图。
具体实施方式
为使本申请的目的、技术方案和优点更加清楚,下面将结合本申请中的附图,对本申请中的技术方案进行清楚、完整地描述,显然,所描述的 实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。
现有智能空调的功能复杂,在录入声纹之后才可进行识别,但是录入的过程很繁琐,老人常常因为操作麻烦而放弃使用该功能;由于智能空调的功能名称纷繁复杂,即使录入了声纹,老人也常常无法叫出适合自己的模式,进而导致适合老人的功能无法使用;或是,老人不知道如何调空调才是最健康,最合适的。
而且现有的空调遥控器按键很小,老人看不清,很多时候都需要借助子女的帮助。
为了解决上述问题,本申请提供的工作模式设置方法,采用了非注册制声纹识别技术,老人无需在APP上录入自己的声纹,智能空调便可自动识别和判断用户是否为老人群体,并结合针对老人身体状况量身定制的老人模式。
下面结合图1至图4描述本申请的实施例所提供的工作模式设置方法及装置。
本申请实施例提供的工作模式设置方法,执行主体可以为电子设备或者电子设备中能够实现该工作模式设置方法的软件或功能模块或功能实体,本申请实施例中电子包括但不限于智能空调设备。需要说明的是,上述执行主体并不构成对本申请的限制。
图1是本申请提供的工作模式设置方法的流程示意图之一,如图1所示,包括但不限于以下步骤:
首先,在步骤S1中,接收目标用户发送的目标语音信息。
发送目标语音信息的目标用户可以是已录入声纹的注册用户,也可以为未录入声纹的非注册用户。
目标语音信息可以为开机指令,也可以为长辈关怀模式设置指令。
进一步地,在步骤S2中,根据所述目标语音信息,获取所述目标用户被认定为目标群体的置信度。
目标群体可以为老人群体。
在获取到目标语音信息之后,将该目标语音信息进行预加重、分帧和 加窗等预处理,将预处理后的目标语音信息转换为声纹特征图。其中声纹特征图可以为梅尔能量谱图。梅尔能量谱图能表征人能听到的声音的频率分布,是人通过声音辨别事物的深层特征,利用这种在梅尔频域的分布特性,更适合构建说话人识别系统,语音信号经过这样的转换,语音信号就变为了携带声纹信息的图像,对于单个信号,其梅尔能量谱图是黑白的,可以理解为单通道的特征图。
将声纹特征图输入至预先训练好的打分神经网络模型,以得到对目标用户与老人群体的声纹相似度的打分,作为置信度。实现了智慧识人,使空调主动为用户服务,极其的便利。
打分神经网络模型经过大量的样本数据训练,样本数据包括样本用户的群体标签和样本声纹特征图,因此在输入目标用户的声纹特征图后,就可以得到目标用户的置信度。
进一步地,在步骤S3中,根据所述置信度,设置所述目标用户对应的工作模式。
在确定目标用户的置信度大于置信度阈值的情况下,确定目标用户为老人群体,则将工作模式设置为长辈关怀模式;在确定目标用户的置信度不大于置信度阈值的情况下,确定目标用户不为老人群体,则生成设置工作模式的提示,目标用户可以根据提示,进行空调工作模式的设置,可以包括温度、风速和风向等。
本申请提供的工作模式设置方法,通过语音信息的置信度确定用户是否为特定的群体,进而自动设置该群体的定制化工作模式,无需提前录制声纹以及中间操作,简单直接,对于用户友好。
可选地,所述根据所述目标语音信息,获取所述目标用户被认定为目标群体的置信度,包括:
在确定所述目标语音信息为开机指令的情况下,执行开机指令;
对所述目标语音信息进行声纹分析,获取目标声纹特征;
根据所述目标声纹特征,对所述目标用户与所述目标群体之间的声纹相似度进行打分,获取所述置信度。
在确定目标语音信息为开机指令的情况下,执行开机指令,开机后对目标语音信息进行声纹分析,提取目标语音信息的特征信息,并输入至声 纹识别模型,输出为目标语音信息的目标声纹特征。声纹识别模型是一个深度神经网络模型,经大量中文语料训练而得,具有很强的抗噪性和鲁棒性。
将目标声纹特征输入至打分模型进行打分,得到目标语音信息为老人群体的置信度。
其中,打分模型是由多个样本声纹特征,以及每个样本声纹特征对应的年龄标签组成的训练样本,对神经网络模型进行训练后得到的。
可选地,所述对所述目标语音信息进行声纹分析,获取目标声纹特征,包括:
对所述目标语音信息进行预加重,确定预加重语音信息;
对所述预加重语音信息进行分帧,确定分帧语音信息;
对所述分帧语音信息进行加窗,获取加窗语音信息;
对所述加窗语音信息进行声纹提取,获取所述目标语音信息的目标声纹特征。
由于语音信号的平均功率谱受声门激励和口鼻辐射的影响,高频端大约在800赫兹(Hz)以上按6分贝/倍频程(dB/oct)衰减,频率越高相应的成分越小,为此要在对语音信号进行分析之前对其高频部分加以提升。可以利用数字滤波器实现对目标语音信息的预加重。
以10至20毫秒(ms)为间隔将声纹信号分为若干帧,一帧为一个基本单位,实现对预加重语音信息的分帧。
采用汉明窗函数对分帧语音信息来进行窗化。
根据本申请提供的工作模式设置方法,经过对目标语音信息的预加重、分帧和加窗,能够消除因为人类发声器官本身和由于采集语音信号的设备所带来的混叠、高次谐波失真、高频等等因素,对语音信号质量的影响。尽可能保证后续语音处理得到的信号更均匀、平滑,为信号参数提取提供优质的参数,提高语音处理质量。
可选地,在所述获取目标声纹特征之后,还包括:
比对所述目标声纹特征与所有注册用户的录入声纹特征;
在确定所述目标用户为注册用户的情况下,从注册信息中确定所述目标用户的年龄标签;
在根据所述年龄标签,确定所述目标用户为所述目标群体的情况下,设置目标工作模式。
将目标语音信息的目标声纹特征与所有注册用户已储存的录入声纹特征进行相似度计算;若得到的最高相似度高于设置的声纹阈值,则判定该最高相似度对应的录入声纹特征用户为目标语音信息的发出用户,可以根据该用户的注册信息确定年龄标签,并根据年龄标签,确定目标用户为老人群体,打开“长辈关怀模式”。长辈关怀模式是经过人体舒适研究院实验得出的针对老人的最优的空气解决方案。
若最高相似度低于设置的声纹阈值,则确定发送所述目标语音信息的对象不为注册用户。
在6月至9月的夏季期间,打开智能空调的人体舒适智能控制系统(PMV),智能空调操作模式初始化(operation Mode=0),目标工作模式具体为:温度为27℃的制冷模式(target Temperature=27℃),风速为最低风(wind Speed=3),上下导板处于最大上吹位置1(wind Direction Vertical=2),空气洁净度为打开健康模式(health Mode=true)。
在12月至2月的冬季期间,打开PMV,智能空调操作模式初始化(operation Mode=0),目标工作模式具体为:温度设为26℃的制热模式(target Temperature=26℃);风速设为最低风(wind Speed=3);上下导板位置处于最大下吹位置4(wind Direction Vertical=6),空气洁净度为打开健康模式(health Mode=true)。调整为目标工作模式后,
在其他月份,打开PMV,智能空调操作模式初始化(operation Mode=0),目标工作模式具体为:温度设为26℃(target Temperature=26℃),在室内温度高于26℃的情况下制冷,在室内温度不高于26℃的情况下制热;风速为最低风(wind Speed=3),空气洁净度为打开健康模式(health Mode=true)。
由于热空气轻容易上浮,冷空气重容易下沉,故夏季的上下导板处于上吹位置,冬季的上下导板处于下吹位置。
在智能空调调整为目标工作模式后,播报:“长辈关怀模式已开启,可以健康又舒服地吹空调啦!”
根据本申请提供的工作模式设置方法,通过声纹识别,对老人群体进 行定制化呵护,提供最合适的空气方案。
可选地,在所述接收目标用户发送的目标语音信息之后,还包括:
在确定开机状态的情况下,确定所述目标语音信息为打开目标模式指令;
根据所述打开目标模式指令,确定当前工作模式;
在确定所述当前工作模式为目标工作模式的情况下,生成已开启提示;
在确定所述当前工作模式不为所述目标工作模式的情况下,设置目标工作模式,并生成开启语音提示。
可选地,所述根据所述置信度,设置所述目标用户对应的工作模式,包括:
在确定所述置信度不小于预设阈值的情况下,确定所述目标用户为目标群体,以设置目标工作模式;
在确定所述置信度小于所述预设阈值,且处于预设置信度区间的情况下,生成工作模式语音提示;
接收回应指令,以设置所述目标工作模式;
所述回应指令是任一用户响应所述工作模式语音提示后输入的。
预设阈值可以为80;预设置信度区间可以为大于70到小于80。
图2是本申请提供的工作模式设置方法的流程示意图之二,如图2所示,包括:
第一方面,在空调关机的状态下,接收用户的语音唤醒指令,如“打开空调”,确认用户发出开机意图,执行语音唤醒指令,开机;
进一步地,在确定APP端老人声纹功能关闭的情况下,按照智能空调的原始逻辑,只执行用户的设置,并提示空调开机,播报“<device Name>打开了”;在确定APP端老人声纹功能开启的情况下,对语音唤醒指令进行声纹识别,获取置信度;
进一步地,在置信度不小于80的情况下,确定用户为老人,开启老人模式,播报“长辈关怀模式已开启,可以健康又舒服地吹空调啦!”
在置信度不大于70的情况下,确定用户不为老人,按照智能空调的原始逻辑,只执行用户的设置,并提示空调开机,播报“<device Name>打开了”;
在置信度大于70且小于80的情况下,生成设置语音提示,追问:是否需要为您打开长辈关怀模式;
进一步地,在用户没有回应的情况下,按照智能空调的原始逻辑,只执行用户的设置,并提示空调开机,播报“<device Name>打开了”;
在用户回应的情况下,确定用户的回应内容;
进一步地,在用户的回应内容不为肯定回答的情况下,按照智能空调的原始逻辑,只执行用户的设置,并提示空调开机,播报“<device Name>打开了”;
在用户的回应内容为肯定回答的情况下,开启老人模式,播报“长辈关怀模式已开启,可以健康又舒服地吹空调啦!”
第二方面,在空调开机的状态下,语音唤醒,用户主动发出切换到长辈关怀模式的意图,如“打开长辈关怀模式”;
进一步地,在当前未打开长辈关怀模式的情况下,开启老人模式,播报“长辈关怀模式已开启,可以健康又舒服地吹空调啦!”
在当前已打开长辈关怀模式的情况下,保持现有逻辑,播报“长辈关怀模式开着呢”。
表1为声纹触发老人模式表,包括自然语言生成(Natural Language Generation,NLG)的触发条件、NLG内容和入口语料。其中,智能空调的标识为设备名(device name)。
表1声纹触发老人模式表
Figure PCTCN2022132600-appb-000001
Figure PCTCN2022132600-appb-000002
其中,用户可以通过与智能空调配套的APP端的开关老人声纹功能,老人声纹功能的开关默认关闭,用户开启则生效。空调的语音侧云端接入声纹应用程序接口(Application Programming Interface,API)。用户可以在主控空调端唤醒小优,并发话“打开空调”。
声纹识别返回置信度至语音侧云端。
其中,置信度阈值和置信度区间可以依实际情况进行调整。
下面对本申请提供的工作模式设置装置进行描述,下文描述的工作模式设置装置与上文描述的工作模式设置方法可相互对应参照。
图3是本申请提供的工作模式设置装置的结构示意图,如图3所示,包括:
接收模块301,用于接收目标用户发送的目标语音信息;
获取模块302,用于根据所述目标语音信息,获取所述目标用户被认定为目标群体的置信度;
设置模块303,用于根据所述置信度,设置所述目标用户对应的工作模式。
首先,接收模块301接收目标用户发送的目标语音信息。
发送目标语音信息的目标用户可以是已录入声纹的注册用户,也可以为未录入声纹的非注册用户。
目标语音信息可以为开机指令,也可以为长辈关怀模式设置指令。
进一步地,获取模块302根据所述目标语音信息,获取所述目标用户被认定为目标群体的置信度。
目标群体可以为老人群体。
在获取到目标语音信息之后,将该目标语音信息进行预加重、分帧和加窗等预处理,将预处理后的目标语音信息转换为声纹特征图。其中声纹特征图可以为梅尔能量谱图。梅尔能量谱图能表征人能听到的声音的频率分布,是人通过声音辨别事物的深层特征,利用这种在梅尔频域的分布特性,更适合构建说话人识别系统,语音信号经过这样的转换,语音信号就变为了携带声纹信息的图像,对于单个信号,其梅尔能量谱图是黑白的,可以理解为单通道的特征图。
将声纹特征图输入至预先训练好的打分神经网络模型,以得到对目标用户与老人群体的声纹相似度的打分,作为置信度。实现了智慧识人,使空调主动为用户服务,极其的便利。
打分神经网络模型经过大量的样本数据训练,样本数据包括样本用户的群体标签和样本声纹特征图,因此在输入目标用户的声纹特征图后,就可以得到目标用户的置信度。
进一步地,设置模块303根据所述置信度,设置所述目标用户对应的工作模式。
在确定目标用户的置信度大于置信度阈值的情况下,确定目标用户为老人群体,则将工作模式设置为长辈关怀模式;在确定目标用户的置信度不大于置信度阈值的情况下,确定目标用户不为老人群体,则生成设置工作模式的提示,目标用户可以根据提示,进行空调工作模式的设置,可以包括温度、风速和风向等。
本申请提供的工作模式设置装置,通过语音信息的置信度确定用户是否为特定的群体,进而自动设置该群体的定制化工作模式,无需提前录制声纹以及中间操作,简单直接,对于用户友好。
图4是本申请提供的电子设备的结构示意图,如图4所示,该电子设备可以包括:处理器(processor)410、通信接口(Communications Interface)420、存储器(memory)430和通信总线440,其中,处理器410,通信接口420,存储器430通过通信总线440完成相互间的通信。处理器410可以调用存储器430中的逻辑指令,以执行工作模式设置方法,该方法包括:接收目标用户发送的目标语音信息;根据所述目标语音信息,获取所述目标用户 被认定为目标群体的置信度;根据所述置信度,设置所述目标用户对应的工作模式。
此外,上述的存储器430中的逻辑指令可以通过软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质。
另一方面,本申请还提供一种计算机程序产品,所述计算机程序产品包括计算机程序,计算机程序可存储在非暂态计算机可读存储介质上,所述计算机程序被处理器执行时,计算机能够执行上述各方法所提供的工作模式设置方法,该方法包括:接收目标用户发送的目标语音信息;根据所述目标语音信息,获取所述目标用户被认定为目标群体的置信度;根据所述置信度,设置所述目标用户对应的工作模式。
又一方面,本申请还提供一种非暂态计算机可读存储介质,其上存储有计算机程序,该计算机程序被处理器执行时实现以执行上述各方法提供的工作模式设置方法,该方法包括:接收目标用户发送的目标语音信息;根据所述目标语音信息,获取所述目标用户被认定为目标群体的置信度;根据所述置信度,设置所述目标用户对应的工作模式。
以上所描述的装置实施例仅仅是示意性的,其中所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案的目的。本领域普通技术人员在不付出创造性的劳动的情况下,即可以理解并实施。
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到各实施方式可借助软件加必需的通用硬件平台的方式来实现,当然也可以通 过硬件。基于这样的理解,上述技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品可以存储在计算机可读存储介质中,如ROM/RAM、磁碟、光盘等,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行各个实施例或者实施例的某些部分所述的方法。
最后应说明的是:以上实施例仅用以说明本申请的技术方案,而非对其限制;尽管参照前述实施例对本申请进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请各实施例技术方案的精神和范围。

Claims (10)

  1. 一种工作模式设置方法,包括:
    接收目标用户发送的目标语音信息;
    根据所述目标语音信息,获取所述目标用户被认定为目标群体的置信度;
    根据所述置信度,设置所述目标用户对应的工作模式。
  2. 根据权利要求1所述的工作模式设置方法,其中,所述根据所述目标语音信息,获取所述目标用户被认定为目标群体的置信度,包括:
    在确定所述目标语音信息为开机指令的情况下,执行开机指令;
    对所述目标语音信息进行声纹分析,获取目标声纹特征;
    根据所述目标声纹特征,对所述目标用户与所述目标群体之间的声纹相似度进行打分,获取所述置信度。
  3. 根据权利要求1所述的工作模式设置方法,其中,所述根据所述置信度,设置所述目标用户对应的工作模式,包括:
    在确定所述置信度不小于预设阈值的情况下,确定所述目标用户为目标群体,以设置目标工作模式;
    在确定所述置信度小于所述预设阈值,且处于预设置信度区间的情况下,生成工作模式语音提示;
    接收回应指令,以设置所述目标工作模式;
    所述回应指令是任一用户响应所述工作模式语音提示后输入的。
  4. 根据权利要求2所述的工作模式设置方法,其中,在所述获取目标声纹特征之后,还包括:
    比对所述目标声纹特征与所有注册用户的录入声纹特征;
    在确定所述目标用户为注册用户的情况下,从注册信息中确定所述目标用户的年龄标签;
    在根据所述年龄标签,确定所述目标用户为所述目标群体的情况下,设置目标工作模式。
  5. 根据权利要求2所述的工作模式设置方法,其中,所述对所述目标语音信息进行声纹分析,获取目标声纹特征,包括:
    对所述目标语音信息进行预加重,确定预加重语音信息;
    对所述预加重语音信息进行分帧,确定分帧语音信息;
    对所述分帧语音信息进行加窗,获取加窗语音信息;
    对所述加窗语音信息进行声纹提取,获取所述目标语音信息的目标声纹特征。
  6. 根据权利要求1所述的工作模式设置方法,其中,在所述接收目标用户发送的目标语音信息之后,还包括:
    在确定开机状态的情况下,确定所述目标语音信息为打开目标模式指令;
    根据所述打开目标模式指令,确定当前工作模式;
    在确定所述当前工作模式为目标工作模式的情况下,生成已开启提示;
    在确定所述当前工作模式不为所述目标工作模式的情况下,设置所述目标工作模式,并生成开启语音提示。
  7. 一种工作模式设置装置,包括:
    接收模块,用于接收目标用户发送的目标语音信息;
    获取模块,用于根据所述目标语音信息,获取所述目标用户被认定为目标群体的置信度;
    设置模块,用于根据所述置信度,设置所述目标用户对应的工作模式。
  8. 一种电子设备,包括存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序,其中,所述处理器执行所述程序时实现如权利要求1至6任一项所述工作模式设置方法。
  9. 一种非暂态计算机可读存储介质,其上存储有计算机程序,其中,所述计算机程序被处理器执行时实现如权利要求1至6任一项所述工作模式设置方法。
  10. 一种计算机程序产品,包括计算机程序,其中,所述计算机程序被处理器执行时实现如权利要求1至6任一项所述工作模式设置方法。
PCT/CN2022/132600 2022-03-29 2022-11-17 一种工作模式设置方法及装置 WO2023185006A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202210324202.6 2022-03-29
CN202210324202.6A CN114863932A (zh) 2022-03-29 2022-03-29 一种工作模式设置方法及装置

Publications (1)

Publication Number Publication Date
WO2023185006A1 true WO2023185006A1 (zh) 2023-10-05

Family

ID=82628587

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/132600 WO2023185006A1 (zh) 2022-03-29 2022-11-17 一种工作模式设置方法及装置

Country Status (2)

Country Link
CN (1) CN114863932A (zh)
WO (1) WO2023185006A1 (zh)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114863932A (zh) * 2022-03-29 2022-08-05 青岛海尔空调器有限总公司 一种工作模式设置方法及装置
CN114999472A (zh) * 2022-04-27 2022-09-02 青岛海尔空调器有限总公司 一种空调控制方法、装置及一种空调

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007052496A (ja) * 2005-08-15 2007-03-01 Advanced Media Inc ユーザ認証システム及びユーザ認証方法
CN108305615A (zh) * 2017-10-23 2018-07-20 腾讯科技(深圳)有限公司 一种对象识别方法及其设备、存储介质、终端
CN112201254A (zh) * 2020-09-28 2021-01-08 中国建设银行股份有限公司 无感语音认证方法、装置、设备及存储介质
CN113349460A (zh) * 2021-05-26 2021-09-07 深圳麦克韦尔科技有限公司 一种声音检测组件以及电子雾化装置
CN113836508A (zh) * 2021-08-30 2021-12-24 青岛海尔科技有限公司 开机模式的确定方法、装置、存储介质及电子装置
CN114863932A (zh) * 2022-03-29 2022-08-05 青岛海尔空调器有限总公司 一种工作模式设置方法及装置
CN114863931A (zh) * 2022-03-29 2022-08-05 青岛海尔空调器有限总公司 一种工作模式切换方法及装置

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007052496A (ja) * 2005-08-15 2007-03-01 Advanced Media Inc ユーザ認証システム及びユーザ認証方法
CN108305615A (zh) * 2017-10-23 2018-07-20 腾讯科技(深圳)有限公司 一种对象识别方法及其设备、存储介质、终端
CN112201254A (zh) * 2020-09-28 2021-01-08 中国建设银行股份有限公司 无感语音认证方法、装置、设备及存储介质
CN113349460A (zh) * 2021-05-26 2021-09-07 深圳麦克韦尔科技有限公司 一种声音检测组件以及电子雾化装置
CN113836508A (zh) * 2021-08-30 2021-12-24 青岛海尔科技有限公司 开机模式的确定方法、装置、存储介质及电子装置
CN114863932A (zh) * 2022-03-29 2022-08-05 青岛海尔空调器有限总公司 一种工作模式设置方法及装置
CN114863931A (zh) * 2022-03-29 2022-08-05 青岛海尔空调器有限总公司 一种工作模式切换方法及装置

Also Published As

Publication number Publication date
CN114863932A (zh) 2022-08-05

Similar Documents

Publication Publication Date Title
WO2023185006A1 (zh) 一种工作模式设置方法及装置
CN107342076B (zh) 一种兼容非常态语音的智能家居控制系统及方法
WO2023185005A1 (zh) 一种工作模式切换方法及装置
CN105374352B (zh) 一种语音激活方法及系统
CN110992932B (zh) 一种自学习的语音控制方法、系统及存储介质
WO2017084197A1 (zh) 一种基于情感识别的智能家居控制方法及其系统
WO2023185004A1 (zh) 一种音色切换方法及装置
CN105206271A (zh) 智能设备的语音唤醒方法及实现所述方法的系统
CN108766441A (zh) 一种基于离线声纹识别和语音识别的语音控制方法及装置
CN105957527A (zh) 一种语音控制电器的方法、装置及语音控制空调
CN110070865A (zh) 一种具有语音和图像识别功能的向导机器人
WO2019233228A1 (zh) 电子设备及设备控制方法
WO2020140840A1 (zh) 用于唤醒可穿戴设备的方法及装置
CN109036395A (zh) 个性化的音箱控制方法、系统、智能音箱及存储介质
WO2020125038A1 (zh) 语音控制方法及装置
CN112099375A (zh) 基于健康策略的智能家居控制方法、装置及系统
TWI839834B (zh) 語音喚醒方法和相關裝置
WO2023272502A1 (zh) 一种人机交互方法及装置、设备及车辆
CN109166584A (zh) 语音控制方法、装置、呼吸机和存储介质
CN107762948A (zh) 一种风扇装置送风方法及风扇装置
CN112233655A (zh) 一种提高语音命令词识别性能的神经网络训练方法
CN113012694A (zh) 一种轻生活语音识别控制系统
CN104188736A (zh) 基于瘘口气流气压信号调控的电子人工喉训练系统及操作方法
WO2022166340A1 (zh) 空调器室内机的控制方法及控制设备
CN114999472A (zh) 一种空调控制方法、装置及一种空调

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22934824

Country of ref document: EP

Kind code of ref document: A1