WO2020181407A1 - Voice recognition control method and device - Google Patents

Voice recognition control method and device

Info

Publication number
WO2020181407A1
WO2020181407A1 PCT/CN2019/077469 CN2019077469W WO2020181407A1 WO 2020181407 A1 WO2020181407 A1 WO 2020181407A1 CN 2019077469 W CN2019077469 W CN 2019077469W WO 2020181407 A1 WO2020181407 A1 WO 2020181407A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice
signal
module
data
control device
Prior art date
Application number
PCT/CN2019/077469
Other languages
French (fr)
Chinese (zh)
Inventor
陈旻宏
Original Assignee
发条橘子云端行销股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 发条橘子云端行销股份有限公司
Priority to PCT/CN2019/077469
Publication of WO2020181407A1

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/28: Constructional details of speech recognition systems
    • G10L15/34: Adaptation of a single recogniser for parallel processing, e.g. by use of multiple processors or cloud computing

Definitions

  • This application relates to voice control, and in particular to a voice recognition control method and device.
  • The related art involves a controller that controls a plurality of electrical devices. The controller is usually a smartphone, a tablet computer, or another remote controller, and it controls the operating state of the electrical devices in a wired or wireless manner.
  • The controller of the related art must be operated by hand to achieve the control effect. Such a cumbersome mode of operation is inconvenient for the elderly and for other users.
  • The main purpose of this application is to provide a voice recognition control method and device for controlling at least one electrical device by voice.
  • This application provides a voice recognition control method, including: receiving a voice signal with a voice transceiver; using a learning module that interacts at least with a cloud search engine server to learn the voice signal, and then converting the voice signal into at least one language data; using a processor to analyze the voice signal according to each of the language data to generate a control signal; and using an infrared transmitter to transmit the control signal for controlling at least one electrical device.
  • This application further provides a voice recognition control device, including: an infrared transmitter for transmitting at least one control signal, the at least one control signal being used to control at least one electrical device; a voice transceiver for receiving a voice signal and transmitting a voice feedback signal generated according to the voice signal; a learning module that at least converts the voice signal into at least one language data; a storage unit that stores at least one of each of the language data, at least one environmental data, and at least one state data; and a processor that analyzes the voice signal according to each of the language data, and reads the at least one environmental data and the at least one state data to convert the voice signal into at least one of the control signal and the voice feedback signal.
  • The learning module is connected to a cloud search engine server, and the learning module is used to obtain the at least one language data from the cloud search engine server according to the voice signal.
  • The processor further includes a voice recognition unit and a semantic recognition unit. The voice recognition unit analyzes the voice signal into at least one text signal, and the semantic recognition unit converts each text signal into at least one of the control signal or the voice feedback signal according to at least one of the at least one language data, the at least one environmental data, and the at least one state data.
  • The storage unit further includes a semantic database, and the semantic database stores each of the language data.
  • The storage unit further includes an environment database and a state database; the environment database stores each of the environmental data, and the state database stores each of the state data.
  • The voice transceiver further includes an echo filtering module; when the voice signal contains an echo signal, the echo filtering module filters out the echo signal.
  • The voice transceiver further includes a noise filtering module; when the voice signal includes a noise signal, the noise filtering module can filter out the noise signal.
  • The voice transceiver includes a sound receiving module and a sound playback module; the sound receiving module receives the voice signal, and the sound playback module transmits the voice feedback signal.
  • The number of each of the at least one language data, the at least one environmental data, and the at least one state data is multiple;
  • the learning module is connected to a cloud search engine server, and the learning module is used to obtain the at least one language data from the cloud search engine server according to the voice signal;
  • the storage unit further includes a semantic database that stores each of the language data;
  • the storage unit further includes an environment database and a state database, the environment database stores each of the environmental data, and the state database stores each of the state data;
  • the voice transceiver further includes an echo filtering module, and when the voice signal contains an echo signal, the echo filtering module filters out the echo signal;
  • the voice transceiver further includes a noise filtering module, and when the voice signal includes a noise signal, the noise filtering module filters out the noise signal;
  • the voice transceiver includes a sound receiving module and a sound playback module, the sound receiving module receives the voice signal, and the sound playback module transmits the voice feedback signal;
  • the voice transceiver further includes a playing module connected to the semantic recognition unit and the sound playback module.
  • Fig. 1 is a flowchart of a preferred embodiment of this application.
  • Fig. 2 is a block diagram of a preferred embodiment of this application.
  • Fig. 3 is a schematic diagram of a preferred embodiment of this application in use.
  • S1 to S4: steps; 1: voice recognition control device; 2: electrical device; 3: cloud search engine server; 4: user; 10: infrared transmitter; 20: voice transceiver; 21: echo filtering module; 22: noise filtering module; 23: sound receiving module; 24: sound playback module; 25: playing module; 30: storage unit; 31: semantic database; 32: environment database; 33: state database; 40: learning module; 50: processor; 51: voice recognition unit; 52: semantic recognition unit.
  • The voice recognition control method of this application includes the following steps. Step S1: receive a voice signal with a voice transceiver. Step S2: use a learning module that interacts at least with a cloud search engine server to learn the voice signal, and then convert the voice signal into at least one language data; more specifically, the learning module can connect to the cloud search engine server in a wired or wireless manner to search for language data such as the semantics and grammar of the voice signal. Step S3: use a processor to analyze the voice signal according to each of the language data to generate a control signal and a voice feedback signal. Step S4: use an infrared transmitter to transmit the control signal to control at least one electrical device.
  • The learning module can also use the voice feedback signal to answer the user's questions or to ask questions of its own, and can learn the semantics and grammar of the language signal through interaction with the user.
  • The voice recognition control device 1 of this application includes an infrared transmitter 10, a voice transceiver 20, a storage unit 30, a learning module 40, and a processor 50.
  • The infrared transmitter 10 can emit at least one control signal; each of the control signals is an infrared signal, and the at least one control signal is used to control at least one electrical device 2.
  • In this embodiment, the at least one electrical device 2 may be an air conditioner, lamp, television, fan, or the like that can receive infrared signals, and may of course also be an electrical device 2 capable of sending infrared signals; in other embodiments, the infrared transmitter 10 can also emit multiple control signals to control multiple electrical devices at the same time.
  • The voice transceiver 20 can receive a voice signal and can transmit a voice feedback signal generated according to the voice signal; the learning module 40 at least converts the voice signal into at least one language data; the storage unit 30 stores at least one of each of the language data, at least one environmental data, and at least one state data; and the processor 50 parses the voice signal according to each of the language data, and reads at least one of the at least one environmental data and the at least one state data to convert the voice signal into at least one of the control signal and the voice feedback signal.
  • The voice recognition control device 1 can thereby interact with the user 4 to learn Chinese grammar, and can control each of the electrical devices 2 through the voice signal, which makes operation more convenient.
  • In this embodiment, the number of each of the at least one language data, the at least one environmental data, and the at least one state data is multiple.
  • The multiple language data may include the vocabulary and grammar of multiple languages such as Chinese, English, Cantonese, Hokkien, and Thai.
  • The multiple environmental data may include multiple environment names.
  • The multiple state data may include the ambient temperature state, the ambient humidity state, the operating state of the at least one electrical device 2, and so on.
  • The learning module 40 can be connected to a cloud search engine server 3 in a wired or wireless manner.
  • The cloud search engine server 3 can be a network information source such as a search engine (for example GOOGLE) or an information database (for example Wikipedia). The learning module 40 can obtain the at least one language data from the cloud search engine server 3 according to the voice signal, so that the learning module 40 can learn through the network.
  • The processor 50 further includes a voice recognition unit 51 and a semantic recognition unit 52.
  • The voice recognition unit 51 analyzes the voice signal into at least one text signal, and the semantic recognition unit 52 converts each text signal into at least one of the control signal or the voice feedback signal according to at least one of the at least one language data, the at least one environmental data, and the at least one state data, so that the voice signal can be clearly analyzed and interpreted.
  • The voice recognition unit 51 can distinguish different pronunciations and intonations in order to match them with similar-sounding characters.
  • The storage unit 30 further includes a semantic database 31, and the semantic database 31 stores each of the language data.
  • The storage unit 30 further includes an environment database 32, and the environment database 32 stores each of the environmental data.
  • The storage unit 30 further includes a state database 33, and the state database 33 stores each of the state data.
  • The voice transceiver 20 further includes an echo filtering module 21.
  • When the voice signal contains an echo signal, the echo filtering module 21 can filter out the echo signal.
  • The voice transceiver 20 further includes a noise filtering module 22.
  • When the voice signal includes a noise signal, the noise filtering module 22 can filter out the noise signal to improve the clarity of the voice signal.
  • The voice transceiver 20 includes a sound receiving module 23 and a sound playback module 24.
  • The sound receiving module 23 can receive the voice signal, and the sound playback module 24 can transmit the voice feedback signal.
  • The sound receiving module 23 may be, for example, a microphone device; the sound playback module 24 may be, for example, a speaker device.
  • The voice transceiver 20 further includes a playing module 25 connected to the semantic recognition unit 52 and the sound playback module 24.
  • The voice recognition control device 1 further includes a display (not shown in the figures). The display is connected to the processor 50 and can display at least one image information; the at least one image information can be multimedia or a remote video image, so as to interact with the user 4 through images.
  • The voice recognition control method and device use a voice transceiver to receive a voice signal; use a learning module that interacts at least with a cloud search engine server to learn the voice signal, and then convert the voice signal into at least one language data; use a processor to analyze the voice signal according to each of the language data to generate a control signal and a voice feedback signal; and use an infrared transmitter to transmit the control signal for controlling at least one electrical device.
  • Voice control of at least one electrical device is thereby achieved, so the application has industrial applicability.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Telephonic Communication Services (AREA)

Abstract

A voice recognition control method and a voice recognition control device (1). The voice recognition control method comprises: receiving a voice signal with a voice transceiver (20) (S1); learning about the voice signal with a learning module (40) interacting at least with a cloud search engine server (3), then converting the voice signal into at least one piece of language data (S2); parsing the voice signal with a processor (50) on the basis of the language data to generate a control signal and a voice feedback signal (S3); and transmitting the control signal with an infrared transmitter (10) to control at least one piece of electrical equipment (2) (S4). The voice recognition control device (1) comprises the voice transceiver (20), the learning module (40), the processor (50), a storage unit (30), and the infrared transmitter (10). Voice control of the at least one piece of electrical equipment (2) is implemented via the voice recognition control method and the voice recognition control device (1).

Description

Voice recognition control method and device
Technical field
This application relates to voice control, and in particular to a voice recognition control method and device.
Background
With the advancement of technology, home equipment is gradually moving toward the smart home concept, in which automated systems regulate the home environment. This addresses the earlier problem that one remote control could only operate one specific electrical device.
The related art involves a controller that controls a plurality of electrical devices. The controller is usually a smartphone, a tablet computer, or another remote controller, and it controls the operating state of the electrical devices in a wired or wireless manner. However, the controller of the related art must be operated by hand to achieve the control effect. Such a cumbersome mode of operation is inconvenient for the elderly and for other users.
Therefore, it is necessary to provide a voice recognition control method and device to solve the above problems.
Summary of the invention
The main purpose of this application is to provide a voice recognition control method and device for controlling at least one electrical device by voice.
To achieve the above purpose, this application provides a voice recognition control method, including: receiving a voice signal with a voice transceiver; using a learning module that interacts at least with a cloud search engine server to learn the voice signal, and then converting the voice signal into at least one language data; using a processor to analyze the voice signal according to each of the language data to generate a control signal; and using an infrared transmitter to transmit the control signal for controlling at least one electrical device.
To achieve the above purpose, this application further provides a voice recognition control device, including: an infrared transmitter for transmitting at least one control signal, the at least one control signal being used to control at least one electrical device; a voice transceiver for receiving a voice signal and transmitting a voice feedback signal generated according to the voice signal; a learning module that at least converts the voice signal into at least one language data; a storage unit that stores at least one of each of the language data, at least one environmental data, and at least one state data; and a processor that analyzes the voice signal according to each of the language data, and reads the at least one environmental data and the at least one state data to convert the voice signal into at least one of the control signal and the voice feedback signal.
Optionally, the learning module is connected to a cloud search engine server, and the learning module is used to obtain the at least one language data from the cloud search engine server according to the voice signal.
Optionally, the processor further includes a voice recognition unit and a semantic recognition unit. The voice recognition unit analyzes the voice signal into at least one text signal, and the semantic recognition unit converts each text signal into at least one of the control signal or the voice feedback signal according to at least one of the at least one language data, the at least one environmental data, and the at least one state data.
Optionally, the storage unit further includes a semantic database, and the semantic database stores each of the language data.
Optionally, the storage unit further includes an environment database and a state database; the environment database stores each of the environmental data, and the state database stores each of the state data.
Optionally, the voice transceiver further includes an echo filtering module; when the voice signal contains an echo signal, the echo filtering module filters out the echo signal.
Optionally, the voice transceiver further includes a noise filtering module; when the voice signal includes a noise signal, the noise filtering module can filter out the noise signal.
Optionally, the voice transceiver includes a sound receiving module and a sound playback module; the sound receiving module receives the voice signal, and the sound playback module transmits the voice feedback signal.
Optionally, the number of each of the at least one language data, the at least one environmental data, and the at least one state data is multiple; the learning module is connected to a cloud search engine server, and the learning module is used to obtain the at least one language data from the cloud search engine server according to the voice signal; the storage unit further includes a semantic database that stores each of the language data; the storage unit further includes an environment database and a state database, the environment database stores each of the environmental data, and the state database stores each of the state data; the voice transceiver further includes an echo filtering module, and when the voice signal contains an echo signal, the echo filtering module filters out the echo signal; the voice transceiver further includes a noise filtering module, and when the voice signal includes a noise signal, the noise filtering module filters out the noise signal; the voice transceiver includes a sound receiving module and a sound playback module, the sound receiving module receives the voice signal, and the sound playback module transmits the voice feedback signal; the voice transceiver further includes a playing module connected to the semantic recognition unit and the sound playback module; and the voice recognition control device further includes a display connected to the processor, the display being able to display at least one image information.
Description of the drawings
Fig. 1 is a flowchart of a preferred embodiment of this application.
Fig. 2 is a block diagram of a preferred embodiment of this application.
Fig. 3 is a schematic diagram of a preferred embodiment of this application in use.
Symbol description: S1 to S4: steps; 1: voice recognition control device; 2: electrical device; 3: cloud search engine server; 4: user; 10: infrared transmitter; 20: voice transceiver; 21: echo filtering module; 22: noise filtering module; 23: sound receiving module; 24: sound playback module; 25: playing module; 30: storage unit; 31: semantic database; 32: environment database; 33: state database; 40: learning module; 50: processor; 51: voice recognition unit; 52: semantic recognition unit.
Detailed description
In order to make the purpose, technical solutions, and advantages of this application clearer, this application is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only intended to explain this application and not to limit it. All other embodiments obtained by a person of ordinary skill in the art on the basis of the embodiments in this application, without creative effort, fall within the scope of protection of this application.
Referring to Fig. 1, which shows a preferred embodiment of this application, the voice recognition control method of this application includes the following steps. Step S1: receive a voice signal with a voice transceiver. Step S2: use a learning module that interacts at least with a cloud search engine server to learn the voice signal, and then convert the voice signal into at least one language data; more specifically, the learning module can connect to the cloud search engine server in a wired or wireless manner to search for language data such as the semantics and grammar of the voice signal. Step S3: use a processor to analyze the voice signal according to each of the language data to generate a control signal and a voice feedback signal. Step S4: use an infrared transmitter to transmit the control signal to control at least one electrical device. In addition, the learning module can also use the voice feedback signal to answer the user's questions or to ask questions of its own, and can learn the semantics and grammar of the language signal through interaction with the user.
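Steps S1 to S4 can be sketched as a four-stage pipeline. The following is only an illustrative sketch, not the applicant's implementation: the names `run_pipeline`, `ParseResult`, and the `fake_*` stand-in stages are hypothetical, and the "voice signal" is represented as plain text so that the example stays self-contained.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class ParseResult:
    control_signal: Optional[str]  # symbolic command name; None if nothing to send
    feedback: str                  # text for the voice feedback signal


def run_pipeline(
    receive: Callable[[], str],
    learn: Callable[[str], Dict[str, str]],
    parse: Callable[[str, Dict[str, str]], ParseResult],
    transmit: Callable[[str], None],
) -> ParseResult:
    voice_signal = receive()                     # S1: voice transceiver
    language_data = learn(voice_signal)          # S2: learning module
    result = parse(voice_signal, language_data)  # S3: processor
    if result.control_signal is not None:
        transmit(result.control_signal)          # S4: infrared transmitter
    return result


# Stand-in stages (all hypothetical); the voice signal is already text here.
def fake_receive() -> str:
    return "turn on the fan"


def fake_learn(signal: str) -> Dict[str, str]:
    # A real learning module would consult a cloud search engine server.
    return {"turn on the fan": "FAN_POWER_ON"}


def fake_parse(signal: str, language_data: Dict[str, str]) -> ParseResult:
    command = language_data.get(signal)
    if command is None:
        return ParseResult(None, "Sorry, I did not understand.")
    return ParseResult(command, "Turning on the fan.")


sent: List[str] = []  # records what the stand-in transmitter was asked to send
result = run_pipeline(fake_receive, fake_learn, fake_parse, sent.append)
```

Passing the stages in as callables keeps the sketch independent of any particular microphone, network, or infrared hardware.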
Referring to Figs. 2 and 3, the voice recognition control device 1 of this application includes an infrared transmitter 10, a voice transceiver 20, a storage unit 30, a learning module 40, and a processor 50.
The infrared transmitter 10 can emit at least one control signal; each of the control signals is an infrared signal, and the at least one control signal is used to control at least one electrical device 2. In this embodiment, the at least one electrical device 2 may be an air conditioner, lamp, television, fan, or the like that can receive infrared signals, and may of course also be an electrical device 2 capable of sending infrared signals; in other embodiments, the infrared transmitter 10 can also emit multiple control signals to control multiple electrical devices at the same time.
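One way to picture the case where several control signals are emitted for several devices is a lookup table from (device, action) pairs to infrared codes. The sketch below is an assumption made for illustration: the table `IR_CODES`, its numeric values, and the function `encode_control_signals` are hypothetical and do not correspond to any real remote-control protocol or to anything specified in the application.

```python
from typing import Dict, List, Tuple

# Placeholder code table: these values are made up for illustration and do
# not correspond to any real remote-control protocol.
IR_CODES: Dict[Tuple[str, str], int] = {
    ("fan", "on"): 0x01,
    ("fan", "off"): 0x02,
    ("lamp", "on"): 0x11,
    ("lamp", "off"): 0x12,
}


def encode_control_signals(commands: List[Tuple[str, str]]) -> List[int]:
    """Translate (device, action) pairs into infrared codes, skipping unknown pairs."""
    return [IR_CODES[c] for c in commands if c in IR_CODES]


# Two known commands and one unknown one; the unknown pair is dropped.
codes = encode_control_signals([("fan", "on"), ("lamp", "off"), ("tv", "on")])
```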
The voice transceiver 20 can receive a voice signal and can transmit a voice feedback signal generated according to the voice signal; the learning module 40 at least converts the voice signal into at least one language data; the storage unit 30 stores at least one of each of the language data, at least one environmental data, and at least one state data; and the processor 50 parses the voice signal according to each of the language data, and reads at least one of the at least one environmental data and the at least one state data to convert the voice signal into at least one of the control signal and the voice feedback signal. The voice recognition control device 1 can thereby interact with the user 4 to learn Chinese grammar, and can control each of the electrical devices 2 through the voice signal, which makes operation more convenient.
In this embodiment, the number of each of the at least one language data, the at least one environmental data, and the at least one state data is multiple. The multiple language data may include the vocabulary and grammar of multiple languages such as Chinese, English, Cantonese, Hokkien, and Thai; the multiple environmental data may include multiple environment names; and the multiple state data may include the ambient temperature state, the ambient humidity state, the operating state of the at least one electrical device 2, and so on.
Preferably, the learning module 40 can be connected to a cloud search engine server 3 in a wired or wireless manner. The cloud search engine server 3 can be a network information source such as a search engine (for example GOOGLE) or an information database (for example Wikipedia). The learning module 40 can obtain the at least one language data from the cloud search engine server 3 according to the voice signal, so that the learning module 40 can learn through the network.
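The learning behavior described here can be sketched as "look up locally, otherwise ask the server and remember the answer". This is a minimal sketch under stated assumptions: the class `LearningModule`, its `fetch` callback, and the stand-in `fake_fetch` are hypothetical, and no real search engine API is called.

```python
from typing import Callable, Dict, List


class LearningModule:
    """Obtains language data for a phrase, remembering what it has learned."""

    def __init__(self, fetch: Callable[[str], List[str]],
                 semantic_db: Dict[str, List[str]]) -> None:
        self.fetch = fetch              # stand-in for the cloud search engine server
        self.semantic_db = semantic_db  # stand-in for the semantic database

    def learn(self, phrase: str) -> List[str]:
        # Only go to the server for phrases that have not been learned yet.
        if phrase not in self.semantic_db:
            self.semantic_db[phrase] = self.fetch(phrase)
        return self.semantic_db[phrase]


lookups: List[str] = []  # records which phrases reached the stand-in server


def fake_fetch(phrase: str) -> List[str]:
    # Hypothetical server reply: just the words of the phrase.
    lookups.append(phrase)
    return phrase.split()


module = LearningModule(fake_fetch, {})
first = module.learn("turn on the lamp")
second = module.learn("turn on the lamp")  # answered locally, no second lookup
```

Injecting `fetch` keeps the network dependency out of the sketch, so a real server client could be substituted without changing `learn`.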
In more detail, the processor 50 further includes a voice recognition unit 51 and a semantic recognition unit 52. The voice recognition unit 51 analyzes the voice signal into at least one text signal, and the semantic recognition unit 52 converts each text signal into at least one of the control signal or the voice feedback signal according to at least one of the at least one language data, the at least one environmental data, and the at least one state data, so that the voice signal can be clearly analyzed and interpreted. In addition, the voice recognition unit 51 can distinguish different pronunciations and intonations in order to match them with similar-sounding characters.
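The role of the semantic recognition unit can be illustrated with a small function that combines a text signal with language, environmental, and state data and returns a control signal, a voice feedback text, or both. The matching rule (simple substring search), the function name `semantic_recognize`, and the sample data are assumptions made for this sketch; the application does not specify how the conversion is performed.

```python
from typing import Dict, Optional, Tuple


def semantic_recognize(
    text: str,
    language_data: Dict[str, str],     # phrase -> action, e.g. "turn on" -> "ON"
    environment_data: Dict[str, str],  # environment name -> device identifier
    state_data: Dict[str, str],        # device identifier -> current state
) -> Tuple[Optional[str], str]:
    """Return (control_signal or None, feedback_text) for one text signal."""
    action = next((a for phrase, a in language_data.items() if phrase in text), None)
    device = next((d for name, d in environment_data.items() if name in text), None)
    if action is None or device is None:
        return None, "Could you repeat that?"
    if state_data.get(device) == action:
        # Nothing to transmit: the device is already in the requested state.
        return None, f"{device} is already {action.lower()}."
    return f"{device}:{action}", f"Setting {device} to {action.lower()}."


language = {"turn on": "ON", "turn off": "OFF"}
environment = {"living room light": "LIGHT_1"}

changed = semantic_recognize("turn on the living room light",
                             language, environment, {"LIGHT_1": "OFF"})
unchanged = semantic_recognize("turn on the living room light",
                               language, environment, {"LIGHT_1": "ON"})
unknown = semantic_recognize("open the window", language, environment, {})
```

The second call shows why state data matters: the same request yields only voice feedback when the device is already on.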
The storage unit 30 further includes a semantic database 31, and the semantic database 31 stores each of the language data. In addition, the storage unit 30 further includes an environment database 32, and the environment database 32 stores each of the environmental data. Further, the storage unit 30 further includes a state database 33, and the state database 33 stores each of the state data.
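The three databases can be modeled, purely for illustration, as three dictionaries grouped in one structure. The class name `StorageUnit`, the field names, and the sample entries are hypothetical; the application does not describe a storage format.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class StorageUnit:
    semantic_db: Dict[str, str] = field(default_factory=dict)     # language data (31)
    environment_db: Dict[str, str] = field(default_factory=dict)  # environmental data (32)
    state_db: Dict[str, str] = field(default_factory=dict)        # state data (33)

    def update_state(self, key: str, value: str) -> None:
        """Record the latest state of a device or environment reading."""
        self.state_db[key] = value


storage = StorageUnit()
storage.semantic_db["turn on"] = "ON"
storage.environment_db["living room"] = "ROOM_1"
storage.update_state("fan", "on")
storage.update_state("fan", "off")  # a later state overwrites the earlier one
```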
The voice transceiver 20 further includes an echo filtering module 21; when the voice signal contains an echo signal, the echo filtering module 21 can filter out the echo signal. In addition, the voice transceiver 20 further includes a noise filtering module 22; when the voice signal includes a noise signal, the noise filtering module 22 can filter out the noise signal, thereby improving the clarity of the voice signal.
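The application does not name a filtering technique, so the sketch below substitutes two deliberately simple ones: echo removal by subtracting a delayed, attenuated copy of a known playback signal, and noise removal by an amplitude gate. The function names, the assumption that the echo delay and gain are known in advance, and the sample values are all hypothetical; practical systems typically use adaptive filters instead.

```python
from typing import List


def filter_echo(mic: List[float], reference: List[float],
                delay: int, gain: float) -> List[float]:
    """Subtract gain * reference[n - delay] from each microphone sample."""
    out = []
    for n, sample in enumerate(mic):
        k = n - delay
        echo = gain * reference[k] if 0 <= k < len(reference) else 0.0
        out.append(sample - echo)
    return out


def filter_noise(samples: List[float], threshold: float) -> List[float]:
    """Amplitude gate: zero out samples whose magnitude is below the threshold."""
    return [s if abs(s) >= threshold else 0.0 for s in samples]


reference = [1.0, 0.0, 0.0, 0.0]  # what the device itself played back
mic = [0.0, 0.5, 0.5, 0.02]       # speech at n=1, echo at n=2, faint noise at n=3

without_echo = filter_echo(mic, reference, delay=2, gain=0.5)
cleaned = filter_noise(without_echo, threshold=0.05)
```

In this toy input the sample at index 2 is pure echo and the sample at index 3 is low-level noise, so only the speech sample at index 1 survives both stages.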
该语音收发器20包括一收音模块23及一放音模块24,该收音模块23可接收该语音信号,该放音模块24可发射该语音回馈信号。该收音模块23可例如为一麦克风装置;该放音模块24可例如为一扬声装置。此外,该语音收发器20另包括一播放模块25,该播放模块25连接该语意识别单元52及该放音模块24。The voice transceiver 20 includes a receiving module 23 and a playing module 24. The receiving module 23 can receive the voice signal, and the playing module 24 can transmit the voice feedback signal. The sound receiving module 23 may be, for example, a microphone device; the sound playback module 24 may be, for example, a speaker device. In addition, the voice transceiver 20 further includes a playing module 25 connected to the semantic recognition unit 52 and the sound playing module 24.
In this embodiment, the voice recognition control device 1 further includes a display (not shown in the figures) connected to the processor 50. The display can show at least one image information, which may be multimedia content or a remote video image, so as to interact visually with the user 4.
The above are only preferred embodiments of the present application and do not thereby limit its patent scope. Any equivalent structure or equivalent process transformation made using the content of the description and drawings of the present application, or any direct or indirect use in other related technical fields, is likewise included in the scope of patent protection of the present application.
Industrial applicability
The embodiments of the present application provide a voice recognition control method and device in which a voice transceiver receives a voice signal; a learning module learns the voice signal by interacting with at least a cloud search engine server and then converts the voice signal into at least one language data; a processor parses the voice signal according to each language data to generate a control signal and a voice feedback signal; and an infrared transmitter transmits the control signal to control at least one electrical device. Voice control of at least one electrical device is thereby achieved, so the application has industrial applicability.
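The four steps summarized above can be sketched as one pass through a pipeline. This Python sketch is only an illustration under stated assumptions: the function `run_voice_control` and its callable parameters are hypothetical placeholders for the voice transceiver, learning module, processor, and infrared transmitter, and no real cloud or infrared API is implied.

```python
from typing import Callable, Dict, Tuple


def run_voice_control(
    receive: Callable[[], bytes],
    learn: Callable[[bytes], Dict[str, str]],
    parse: Callable[[bytes, Dict[str, str]], Tuple[str, str]],
    emit_infrared: Callable[[str], None],
    speak: Callable[[str], None],
) -> Tuple[str, str]:
    """Run one receive -> learn -> parse -> transmit cycle."""
    # Step 1: the voice transceiver receives a voice signal.
    voice_signal = receive()
    # Step 2: the learning module (e.g. by querying a cloud search engine
    # server) converts the voice signal into language data.
    language_data = learn(voice_signal)
    # Step 3: the processor parses the voice signal against the language data
    # and produces a control signal and a voice feedback signal.
    control_signal, feedback = parse(voice_signal, language_data)
    # Step 4: the infrared transmitter sends the control signal to the device.
    emit_infrared(control_signal)
    # The voice transceiver then plays the feedback to the user.
    speak(feedback)
    return control_signal, feedback
```

Passing each component in as a callable keeps the sketch independent of any particular hardware or service, so each stage can be replaced by a stub when testing.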

Claims (10)

  1. A voice recognition control method, comprising:
    receiving a voice signal with a voice transceiver;
    using a learning module to learn the voice signal by interacting with at least a cloud search engine server, and then converting the voice signal into at least one language data;
    parsing the voice signal with a processor according to each language data to generate a control signal and a voice feedback signal; and
    transmitting the control signal with an infrared transmitter to control at least one electrical device.
  2. A voice recognition control device, comprising:
    an infrared transmitter for transmitting at least one control signal, the at least one control signal being used to control at least one electrical device;
    a voice transceiver for receiving a voice signal and emitting a voice feedback signal generated according to the voice signal;
    a learning module that at least converts the voice signal into at least one language data;
    a storage unit that stores at least one of each language data, at least one environmental data, and at least one state data; and
    a processor that parses the voice signal according to each language data, and reads the at least one environmental data and the at least one state data to convert the voice signal into at least one of the control signal and the voice feedback signal.
  3. The voice recognition control device of claim 2, wherein the learning module is connected to a cloud search engine server and is used to obtain the at least one language data from the cloud search engine server according to the voice signal.
  4. The voice recognition control device of claim 2, wherein the processor further comprises a voice recognition unit and a semantic recognition unit, the voice recognition unit analyzes the voice signal into at least one text signal, and the semantic recognition unit converts each text signal into at least one of the control signal or the voice feedback signal according to at least one of the at least one language data, the at least one environmental data, and the at least one state data.
  5. The voice recognition control device of claim 2, wherein the storage unit further comprises a semantic database, the semantic database storing each language data.
  6. The voice recognition control device of claim 2, wherein the storage unit further comprises an environment database and a state database, the environment database storing each environmental data and the state database storing each state data.
  7. The voice recognition control device of claim 2, wherein the voice transceiver further comprises an echo filter module, and when the voice signal contains an echo signal, the echo filter module filters out the echo signal.
  8. The voice recognition control device of claim 2, wherein the voice transceiver further comprises a noise filter module, and when the voice signal includes a noise signal, the noise filter module can filter out the noise signal.
  9. The voice recognition control device of claim 2, wherein the voice transceiver comprises a sound receiving module and a sound playback module, the sound receiving module receives the voice signal, and the sound playback module emits the voice feedback signal.
  10. The voice recognition control device of claim 4, wherein the at least one language data, the at least one environmental data, and the at least one state data are each plural in number; the learning module is connected to a cloud search engine server and is used to obtain the at least one language data from the cloud search engine server according to the voice signal; the storage unit further comprises a semantic database, the semantic database storing each language data; the storage unit further comprises an environment database and a state database, the environment database storing each environmental data and the state database storing each state data; the voice transceiver further comprises an echo filter module, and when the voice signal contains an echo signal, the echo filter module filters out the echo signal; the voice transceiver further comprises a noise filter module, and when the voice signal includes a noise signal, the noise filter module filters out the noise signal; the voice transceiver comprises a sound receiving module and a sound playback module, the sound receiving module receives the voice signal, and the sound playback module emits the voice feedback signal; the voice transceiver further comprises a playing module connected to the semantic recognition unit and the sound playback module; and the voice recognition control device further comprises a display connected to the processor, the display being able to show at least one image information.
PCT/CN2019/077469 2019-03-08 2019-03-08 Voice recognition control method and device WO2020181407A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2019/077469 WO2020181407A1 (en) 2019-03-08 2019-03-08 Voice recognition control method and device

Publications (1)

Publication Number Publication Date
WO2020181407A1 true WO2020181407A1 (en) 2020-09-17

Family

ID=72426030

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/077469 WO2020181407A1 (en) 2019-03-08 2019-03-08 Voice recognition control method and device

Country Status (1)

Country Link
WO (1) WO2020181407A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090018818A1 (en) * 2007-07-10 2009-01-15 Aibelive Co., Ltd. Operating device for natural language input
CN101551998A (en) * 2009-05-12 2009-10-07 上海锦芯电子科技有限公司 A group of voice interaction devices and method of voice interaction with human
CN107016993A (en) * 2017-05-15 2017-08-04 成都铅笔科技有限公司 The voice interactive system and method for a kind of smart home
CN108648752A (en) * 2018-04-17 2018-10-12 重庆物奇科技有限公司 A kind of intelligent sound control system and its control method based on cloud processing
CN109065041A (en) * 2018-08-09 2018-12-21 上海常仁信息科技有限公司 A kind of voice interactive system and method based on robot

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19919142

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19919142

Country of ref document: EP

Kind code of ref document: A1