WO2022041319A1 - Voice control optimization system and method - Google Patents

Voice control optimization system and method Download PDF

Info

Publication number
WO2022041319A1
WO2022041319A1 PCT/CN2020/114184 CN2020114184W WO2022041319A1 WO 2022041319 A1 WO2022041319 A1 WO 2022041319A1 CN 2020114184 W CN2020114184 W CN 2020114184W WO 2022041319 A1 WO2022041319 A1 WO 2022041319A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice
module
information
editing
recognition
Prior art date
Application number
PCT/CN2020/114184
Other languages
French (fr)
Chinese (zh)
Inventor
汤智文
刘胜利
唐韧
叶鑫
Original Assignee
广东奥科伟业科技发展有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 广东奥科伟业科技发展有限公司 filed Critical 广东奥科伟业科技发展有限公司
Publication of WO2022041319A1 publication Critical patent/WO2022041319A1/en

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • AHUMAN NECESSITIES
    • A47FURNITURE; DOMESTIC ARTICLES OR APPLIANCES; COFFEE MILLS; SPICE MILLS; SUCTION CLEANERS IN GENERAL
    • A47HFURNISHINGS FOR WINDOWS OR DOORS
    • A47H5/00Devices for drawing draperies, curtains, or the like
    • A47H5/02Devices for opening and closing curtains
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • G10L15/07Adaptation to the speaker
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command

Definitions

  • the invention relates to the technical field of automatic curtain opening and closing control, in particular to a system and method for optimizing voice control.
  • Electric curtains are widely used in various buildings, and with the development of science and technology, voice-controlled electric curtains have appeared.
  • voice control of electric curtains needs to use a specific language and accent.
  • people's daily language and accent are also different, which leads to people often need to adjust the language and accent several times during voice control.
  • the tone can only be completed in accordance with the specific language and tone of the electric curtain, which greatly causes the inconvenience of voice control and reduces the operating experience of the operator's voice control.
  • the present invention provides a system and method for optimizing voice control.
  • a system for optimizing voice control includes:
  • a voice recognition module which is used for receiving voice information, and performing command recognition on the voice information according to the recognition mode, and converting the voice information recognized as commands into control information;
  • a voice control module connected to the voice recognition module; the voice control module receives control information and performs actions according to the control information;
  • the voice editing module connected with the voice recognition module; the voice editing module is used to edit the recognition mode of the voice recognition module.
  • the present invention further includes a terminal module; the terminal module is connected to the voice editing module; the terminal module is used for sending editing information to the voice editing module, and the voice editing module edits the recognition mode of the voice recognition module according to the editing information.
  • the speech recognition module includes a speech recognition unit and a speech processing unit; the speech recognition unit is connected to the speech processing unit; the speech processing unit stores a recognition mode; the speech recognition unit receives the speech information and transmits the speech information to A voice processing unit; the voice processing unit performs command recognition on the voice information according to the recognition mode, and converts the voice information recognized as commands into control information.
  • the voice control module includes a voice control unit and an action execution unit; the voice control unit is connected to the action execution unit; the voice control unit receives the control information, and forms a control instruction and transmits it to the action execution unit; Instructions perform actions.
  • the voice editing module includes a voice editing unit and a first wireless communication unit; the voice editing unit is connected to the first wireless communication unit; the terminal module includes an input unit and a second wireless communication unit; the input unit is connected to the second wireless communication unit The communication unit is connected; the first wireless communication unit is wirelessly connected with the second wireless communication unit; the editing information input by the input unit is wirelessly transmitted to the voice editing unit through the cooperation of the second wireless communication unit and the first wireless communication unit; Information to change the recognition mode of the speech recognition module.
  • One method of optimizing voice control includes:
  • the speech editing module edits the recognition mode of the speech recognition module
  • the speech recognition module receives the speech information, performs command recognition on the speech information according to the edited recognition mode, and converts the speech information recognized as commands into control information;
  • the voice control module performs actions according to the control information.
  • the speech editing module edits the recognition mode of the speech recognition module, which further includes:
  • the terminal module sends editing information to the voice editing module.
  • the terminal module before the terminal module sends editing information to the voice editing module, it further includes:
  • the terminal module sends confirmation information and verification information for the operator to confirm.
  • the terminal module sends confirmation information and verification information for the operator to confirm, which further includes:
  • the operator inputs editing information through the terminal module.
  • the recognition mode includes language type and voice tone.
  • the voice editing module Through the setting of the voice editing module, the editing of the recognition mode of the voice recognition module is realized, and the optimization of the voice control is realized, so that the voice information used by the operator can be customized as the control information to be recognized, which greatly improves the performance of the voice recognition module. It increases the convenience and operating experience of voice control.
  • FIG. 1 is a schematic structural diagram of a system for optimizing voice control in Embodiment 1;
  • FIG. 2 is a flowchart of a method for optimizing voice control in the second embodiment.
  • FIG. 1 is a schematic structural diagram of a system for optimizing voice control in the first embodiment.
  • the system for optimizing voice control in this embodiment includes a voice recognition module 1 , a voice control module 2 and a voice editing module 3 .
  • the voice recognition module 1 is used for receiving voice information, and performing command recognition on the voice information according to the recognition mode, and converting the voice information recognized as commands into control information.
  • the voice control module 2 is connected to the voice recognition module 1, and the voice control module 2 receives control information and performs actions according to the control information.
  • the voice editing module 3 is connected to the voice recognition module 1 , and the voice editing module 3 is used to edit the recognition mode of the voice recognition module 1 .
  • the editing of the recognition mode of the voice recognition module 1 is realized, and the optimization of the voice control is realized, so that the voice information used by the operator can be customized as the control information to be recognized, which greatly increases the voice control. convenience and operating experience.
  • the system for optimizing voice control in this embodiment further includes a terminal module 4 .
  • the terminal module 4 is connected with the voice editing module 3 .
  • the terminal module 4 is used for sending editing information to the speech editing module 3, and the speech editing module 3 edits the recognition mode of the speech recognition module 1 according to the editing information.
  • the speech editing module 3 edits the recognition mode of the speech recognition module 1 according to the editing information.
  • the speech recognition module 1 includes a speech recognition unit 11 and a speech processing unit 12 .
  • the speech recognition unit 11 is connected to the speech processing unit 12 .
  • a recognition pattern is stored in the speech processing unit 12 .
  • the voice recognition unit 11 receives the voice information and transfers the voice information to the voice processing unit 12 .
  • the voice processing unit 12 performs command recognition on the voice information according to the recognition mode, and converts the voice information recognized as commands into control information.
  • the speech recognition unit 11 can use an existing speech recognizer or speech recognition circuit, such as a microphone, which can recognize and input human voices.
  • the voice information in this embodiment is the speech voices made by the human body.
  • the voice processing unit 12 may use an MCU chip with storage and voice processing functions. After the voice recognition unit 11 transmits the voice information to the voice processing unit 12, the voice processing unit 12 first performs command recognition on the voice information according to the recognition mode, that is, parses and judges the voice information, and only when it is judged that the voice information is a command, Voice information is converted into control information.
  • the recognition mode here is the command voice for which the language type and voice tone have been set.
  • the setting is set by the operator himself through the voice editing module 3.
  • the language type and voice tone are all used by the operator daily and are familiar to them. Yes, used to.
  • the language type can be a national language, such as "Mandarin”, “English”, “German”, etc., or a local language, such as "Cantonese”, “Hokkien”, “Henan dialect”, “Shanghai dialect”, etc.
  • the accent is the different accents of the above-mentioned language types.
  • the command recognition is performed on the voice information in the following manner: the preset command information is stored in the voice processing 21, and if the voice information matches the preset command information, it is judged that the voice information is a command, otherwise, it is not a command .
  • the recognition mode stored in the speech processing unit 12 there are command words such as "open the curtain” or "close the curtain” as the preset command information, that is, the preset command voice.
  • the voice processing unit 12 judges the voice information as a command. If the command voices such as "close the curtains” do not match, the voice processing unit 12 determines that the voice information is not a command.
  • the voice processing unit 12 After judging the voice information as a command, the voice processing unit 12 converts the voice information judged as a command into control information according to the recognition mode. For example, after the voice information of "open the curtains” or “close the curtains” is recognized as a command, the voice processing unit 12 converts the above voice information into control information suitable for "open the curtains” or "close the curtains".
  • the voice control module 2 includes a voice control unit 21 and an action execution unit 22 .
  • the voice control unit 21 is connected to the action execution unit 22 .
  • the voice control unit 21 receives the control information, and forms a control instruction and transmits it to the action execution unit 22 .
  • the action execution unit 22 executes the action according to the control instruction.
  • the voice control unit 21 is connected to the voice processing unit 12 .
  • the voice processing unit 12 judges the voice information as a command, converts the voice information judged as a command into control information according to the recognition mode, and then transmits the control information to the voice control unit 21 .
  • the voice control unit 21 in this embodiment is an MCU chip with a control function, such as a motor control chip.
  • the action execution unit 22 is a device having an action execution function, such as a motor for a rolling shutter. After the voice processing unit 12 transmits the control information to the voice control unit 21, the voice control unit 21 forms control commands such as forward rotation or reverse rotation to the action execution unit 22, and the action execution unit 22 completes the corresponding action, thereby realizing the action of opening and closing the curtains .
  • the voice editing module 3 includes a voice editing unit 31 and a first wireless communication unit 32 .
  • the voice editing unit 31 is connected to the first wireless communication unit 32 .
  • the terminal module 4 includes an input unit 41 and a second wireless communication unit 42 .
  • the input unit 41 is connected to the second wireless communication unit 42 .
  • the first wireless communication unit 32 is wirelessly connected to the second wireless communication unit 42 .
  • the editing information input by the input unit 41 is wirelessly transmitted to the speech editing unit 31 through the cooperation of the second wireless communication unit 42 and the first wireless communication unit 32 .
  • the voice editing unit 31 changes the recognition mode of the voice recognition module 1 according to the editing information.
  • the voice editing unit 31 is connected to the voice processing unit 12 .
  • the speech editing unit 31 edits and changes the recognition mode in the speech processing unit 12 according to the editing information.
  • the input unit 41 may be an APP built in a smartphone, such as a WeChat applet.
  • the input unit 41 is activated.
  • the wireless connection state of the second wireless communication unit 42 and the first wireless communication unit 32 is activated.
  • the first wireless communication unit 42 is a built-in Bluetooth module of a smart phone, and the first wireless communication unit 32 is also a Bluetooth module. After the two are wirelessly connected, the communication between the input unit 41 and the voice editing unit 31 can be realized through Bluetooth signals. Information exchange.
  • the voice editing unit 31 in this embodiment is an MCU chip with an editing function, such as a chip with a burning function, or a burner can also be used, which can change the recognition mode stored in the voice processing unit 12, that is, realize voice Edit the recognized language type and tone of voice.
  • the input unit 41 has an editing input button.
  • the operator presses the editing input button for a long time, and speaks the corresponding language according to his daily language and accent habits. For example, the operator speaks the Cantonese voices of "open the curtains” and "close the curtains”.
  • the input unit 41 communicates the Cantonese voices of "open the curtains” and "close the curtains” with the second wireless communication unit 42.
  • the first wireless communication unit 32 cooperates and transmits it to the speech editing unit 31, and the speech editing unit 31 deletes the recognition pattern originally stored by the speech processing unit 12, and records the new recognition patterns of “open curtains” and “close curtains” in Cantonese into the speech processing unit.
  • a new recognition mode is formed, so that the editing of the recognition mode is completed, and the recognition mode of the speech processing unit 12 is changed.
  • the input unit 41 may further add editing information confirmation and verification functions.
  • Editing information confirmation is to display and play the editing information entered by the operator on the APP of the smartphone for the operator to confirm. Edit the information for matching verification, and complete the verification if they are consistent.
  • the operator cooperates with the voice editing unit 31 through the input unit 41 to complete the editing and modification of the recognition mode of the voice processing unit 12, so that the operator can customize and modify the control voice according to his own habits and preferences, which greatly increases the The convenience and operation experience of voice control are improved.
  • FIG. 2 is a flowchart of a method for optimizing voice control in the second embodiment.
  • the method for optimizing voice control in this embodiment can be implemented based on a system for optimizing voice control in the embodiment, which specifically includes the following steps:
  • the speech editing module 3 edits the recognition mode of the speech recognition module 1 .
  • the speech recognition module 1 receives the speech information, performs command recognition on the speech information according to the edited recognition mode, and converts the speech information recognized as commands into control information.
  • the voice control module 2 performs an action according to the control information.
  • the recognition mode of the voice recognition module 1 is edited by the voice editing module 3, so that the recognition mode in the voice recognition module 1 can be customized and modified according to the operator's habit and preference, which greatly increases the convenience and operation of voice control. experience.
  • step S1 the speech editing module 3 edits the recognition mode of the speech recognition module 1, which also includes the following steps before:
  • the terminal module 4 sends the editing information to the voice editing module 3.
  • the operator sends the editing information to the editing module 3 through the terminal module 4, which greatly increases the convenience of editing.
  • the terminal module 4 sends the editing information to the voice editing module 3, which further includes:
  • the terminal module 4 sends confirmation information and verification information for the operator to confirm.
  • the terminal module 4 ensures the accuracy of the edited information by sending the confirmation information and verifying the information for the operator to confirm.
  • step S00 the terminal module 4 sends confirmation information and verification information for the operator to confirm, which also includes:
  • the recognition mode in this embodiment includes language types, and preferably, the speech recognition mode also includes voice intonation.
  • the editing of the recognition mode of the voice recognition module is realized, and the optimization of voice control is realized, so that the voice information used by the operator can be customized to be recognized as control information, which greatly increases the voice. Convenience of control and operating experience.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • User Interface Of Digital Computer (AREA)
  • Telephone Function (AREA)

Abstract

A voice control optimization system and a voice control optimization method. The voice control optimization system comprises a voice recognition module (1), a voice control module (2) and a voice editing module (3). The voice control module (2) and the voice editing module (3) are respectively connected to the voice recognition module (1). The voice recognition module (1) is used to receive voice information, perform command recognition on the voice information according to a recognition mode, and convert the voice information recognized as a command into control information. The voice control module (2) receives the control information, and executes an action according to the control information. The voice editing module (3) is used to perform editing with respect to the recognition mode of the voice recognition module (1). Provision of the voice editing module (3) enables editing to be performed with respect to the recognition mode of the voice recognition module (1), and realizes optimization of voice control, such that voice information used by an operator in daily life can be used as custom control information to be recognized, thereby improving the convenience of voice control and operation experience.

Description

优化语音控制的系统及方法System and method for optimizing voice control 技术领域technical field
本发明涉及自动开合帘的控制技术领域,具体的涉及一种优化语音控制的系统及方法。The invention relates to the technical field of automatic curtain opening and closing control, in particular to a system and method for optimizing voice control.
背景技术Background technique
电动开合窗帘被广泛的应用在各类建筑当中,而随着科技的发展,目前已经出现了语音控制的电动窗帘。在现有技术中,对电动窗帘进行语音控制需要采用特定的语种和腔调,然而地区不同,人们日常使用的语种和腔调也不同,这就导致人们在语音控制时,往往需要多次调整语种和腔调以符合电动窗帘特定的语种和腔调才能够完成,这极大的引起了语音控制的不便,也降低了操作者语音控制的操作体验。Electric curtains are widely used in various buildings, and with the development of science and technology, voice-controlled electric curtains have appeared. In the prior art, voice control of electric curtains needs to use a specific language and accent. However, in different regions, people's daily language and accent are also different, which leads to people often need to adjust the language and accent several times during voice control. The tone can only be completed in accordance with the specific language and tone of the electric curtain, which greatly causes the inconvenience of voice control and reduces the operating experience of the operator's voice control.
发明内容SUMMARY OF THE INVENTION
针对现有技术的不足,本发明提供一种优化语音控制的系统及方法。Aiming at the deficiencies of the prior art, the present invention provides a system and method for optimizing voice control.
一种优化语音控制的系统包括:A system for optimizing voice control includes:
语音识别模块,其用于接收语音信息,并根据识别模式对语音信息进行命令识别,并将识别为命令的语音信息转化为控制信息;A voice recognition module, which is used for receiving voice information, and performing command recognition on the voice information according to the recognition mode, and converting the voice information recognized as commands into control information;
与语音识别模块连接的语音控制模块;语音控制模块接收控制信息,并根据控制信息执行动作;以及a voice control module connected to the voice recognition module; the voice control module receives control information and performs actions according to the control information; and
与语音识别模块连接的语音编辑模块;语音编辑模块用于对语音识别模块的识别模式进行编辑。The voice editing module connected with the voice recognition module; the voice editing module is used to edit the recognition mode of the voice recognition module.
根据本发明一实施方式,其还包括终端模块;终端模块与语音编辑模块连接;终端模块用于发送编辑信息至语音编辑模块,语音编辑模块根据编辑信息对语音识别模块的识别模式进行编辑。According to an embodiment of the present invention, it further includes a terminal module; the terminal module is connected to the voice editing module; the terminal module is used for sending editing information to the voice editing module, and the voice editing module edits the recognition mode of the voice recognition module according to the editing information.
根据本发明一实施方式,语音识别模块包括语音识别单元以及语音处理 单元;语音识别单元与语音处理单元连接;语音处理单元内存储有识别模式;语音识别单元接收语音信息,并将语音信息传递至语音处理单元;语音处理单元根据识别模式对语音信息进行命令识别,并将识别为命令的语音信息转化为控制信息。According to an embodiment of the present invention, the speech recognition module includes a speech recognition unit and a speech processing unit; the speech recognition unit is connected to the speech processing unit; the speech processing unit stores a recognition mode; the speech recognition unit receives the speech information and transmits the speech information to A voice processing unit; the voice processing unit performs command recognition on the voice information according to the recognition mode, and converts the voice information recognized as commands into control information.
根据本发明一实施方式,语音控制模块包括语音控制单元以及动作执行单元;语音控制单元与动作执行单元连接;语音控制单元接收控制信息,并形成控制指令传递给动作执行单元;动作执行单元根据控制指令执行动作。According to an embodiment of the present invention, the voice control module includes a voice control unit and an action execution unit; the voice control unit is connected to the action execution unit; the voice control unit receives the control information, and forms a control instruction and transmits it to the action execution unit; Instructions perform actions.
根据本发明一实施方式,语音编辑模块包括语音编辑单元以及第一无线通信单元;语音编辑单元与第一无线通信单元连接;终端模块包括输入单元以及第二无线通信单元;输入单元与第二无线通信单元连接;第一无线通信单元与第二无线通信单元无线连接;输入单元输入的编辑信息通过第二无线通信单元以及第一无线通信单元的配合无线传送至语音编辑单元;语音编辑单元根据编辑信息更改语音识别模块的识别模式。According to an embodiment of the present invention, the voice editing module includes a voice editing unit and a first wireless communication unit; the voice editing unit is connected to the first wireless communication unit; the terminal module includes an input unit and a second wireless communication unit; the input unit is connected to the second wireless communication unit The communication unit is connected; the first wireless communication unit is wirelessly connected with the second wireless communication unit; the editing information input by the input unit is wirelessly transmitted to the voice editing unit through the cooperation of the second wireless communication unit and the first wireless communication unit; Information to change the recognition mode of the speech recognition module.
一种优化语音控制的方法包括:One method of optimizing voice control includes:
语音编辑模块对语音识别模块的识别模式进行编辑;The speech editing module edits the recognition mode of the speech recognition module;
语音识别模块接收语音信息,并根据编辑后的识别模式对语音信息进行命令识别,并将识别为命令的语音信息转化为控制信息;The speech recognition module receives the speech information, performs command recognition on the speech information according to the edited recognition mode, and converts the speech information recognized as commands into control information;
语音控制模块根据控制信息执行动作。The voice control module performs actions according to the control information.
根据本发明一实施方式,语音编辑模块对语音识别模块的识别模式进行编辑,之前还包括:According to an embodiment of the present invention, the speech editing module edits the recognition mode of the speech recognition module, which further includes:
终端模块发送编辑信息至语音编辑模块。The terminal module sends editing information to the voice editing module.
根据本发明一实施方式,终端模块发送编辑信息至语音编辑模块,之前还包括:According to an embodiment of the present invention, before the terminal module sends editing information to the voice editing module, it further includes:
终端模块发送确认信息和校验信息供操作者确认。The terminal module sends confirmation information and verification information for the operator to confirm.
根据本发明一实施方式,终端模块发送确认信息和校验信息供操作者确认,之前还包括:According to an embodiment of the present invention, the terminal module sends confirmation information and verification information for the operator to confirm, which further includes:
操作者通过终端模块进行编辑信息的输入。The operator inputs editing information through the terminal module.
根据本发明一实施方式,识别模式包括语种类型和语音腔调。According to an embodiment of the present invention, the recognition mode includes language type and voice tone.
同现有技术相比,通过语音编辑模块的设置,实现对语音识别模块的识别模式的编辑,实现语音控制的优化,使得操作者日常使用的语音信息能够自定义为控制信息被识别,极大的增加了语音控制的便利性和操作体验。Compared with the prior art, through the setting of the voice editing module, the editing of the recognition mode of the voice recognition module is realized, and the optimization of the voice control is realized, so that the voice information used by the operator can be customized as the control information to be recognized, which greatly improves the performance of the voice recognition module. It increases the convenience and operating experience of voice control.
附图说明Description of drawings
此处所说明的附图用来提供对本申请的进一步理解,构成本申请的一部分,本申请的示意性实施例及其说明用于解释本申请,并不构成对本申请的不当限定。在附图中:The drawings described herein are used to provide further understanding of the present application and constitute a part of the present application. The schematic embodiments and descriptions of the present application are used to explain the present application and do not constitute an improper limitation of the present application. In the attached image:
图1为实施例一中优化语音控制的系统的结构示意图;1 is a schematic structural diagram of a system for optimizing voice control in Embodiment 1;
图2为实施例二中优化语音控制的方法的流程图。FIG. 2 is a flowchart of a method for optimizing voice control in the second embodiment.
具体实施方式detailed description
以下将以图式揭露本发明的多个实施方式,为明确说明起见,许多实务上的细节将在以下叙述中一并说明。然而,应了解到,这些实务上的细节不应用以限制本发明。也就是说,在本发明的部分实施方式中,这些实务上的细节是非必要的。此外,为简化图式起见,一些习知惯用的结构与组件在图式中将以简单的示意的方式绘示之。Various embodiments of the present invention will be disclosed in the drawings below, and for the sake of clarity, many practical details will be described together in the following description. It should be understood, however, that these practical details should not be used to limit the invention. That is, in some embodiments of the invention, these practical details are unnecessary. In addition, for the purpose of simplifying the drawings, some well-known structures and components will be shown in a simple schematic manner in the drawings.
需要说明,本发明本实施例中所有方向性指示(诸如上、下、左、右、前、后……)仅用于解释在某一特定姿态(如附图所示)下各部件之间的相对位置关系、移动情况等,如果该特定姿态发生改变时,则该方向性指示也相应地随之改变。It should be noted that all directional indications (such as up, down, left, right, front, back...) in this embodiment of the present invention are only used to explain the difference between the various components under a certain posture (as shown in the accompanying drawings). If the specific posture changes, the directional indication also changes accordingly.
另外,在本发明中如涉及“第一”、“第二”等的描述仅用于描述目的,并非特别指称次序或顺位的意思,亦非用以限定本发明,其仅仅是为了区别以相同技术用语描述的组件或操作而已,而不能理解为指示或暗示其相对重要性或者隐含指明所指示的技术特征的数量。由此,限定有“第一”、“第二” 的特征可以明示或者隐含地包括至少一个该特征。另外,各个实施例之间的技术方案可以相互结合,但是必须是以本领域普通技术人员能够实现为基础,当技术方案的结合出现相互矛盾或无法实现时应当认为这种技术方案的结合不存在,也不在本发明要求的保护范围之内。In addition, descriptions such as “first”, “second”, etc. in the present invention are only for the purpose of description, and do not refer to the meaning of order or sequence, nor are they used to limit the present invention. The components or operations are described by the same technical terms, and should not be construed as indicating or implying their relative importance or implying the quantity of the indicated technical features. Thus, a feature delimited with "first", "second" may expressly or implicitly include at least one of that feature. In addition, the technical solutions between the various embodiments can be combined with each other, but must be based on the realization by those of ordinary skill in the art. When the combination of technical solutions is contradictory or cannot be realized, it should be considered that the combination of such technical solutions does not exist. , is not within the scope of protection required by the present invention.
为能进一步了解本发明的内容、特点及功效,兹例举以下实施例,并配合附图详细说明如下:In order to further understand the content, features and effects of the present invention, the following embodiments are given as examples, and are described in detail as follows in conjunction with the accompanying drawings:
实施例一Example 1
参照图1,图1为实施例一中优化语音控制的系统的结构示意图。本实施例中的优化语音控制的系统包括语音识别模块1、语音控制模块2以及语音编辑模块3。其中,语音识别模块1用于接收语音信息,并根据识别模式对语音信息进行命令识别,并将识别为命令的语音信息转化为控制信息。语音控制模块2与语音识别模块1连接,语音控制模块2接收控制信息,并根据控制信息执行动作。语音编辑模块3与语音识别模块1连接,语音编辑模块3用于对语音识别模块1的识别模式进行编辑。Referring to FIG. 1 , FIG. 1 is a schematic structural diagram of a system for optimizing voice control in the first embodiment. The system for optimizing voice control in this embodiment includes a voice recognition module 1 , a voice control module 2 and a voice editing module 3 . The voice recognition module 1 is used for receiving voice information, and performing command recognition on the voice information according to the recognition mode, and converting the voice information recognized as commands into control information. The voice control module 2 is connected to the voice recognition module 1, and the voice control module 2 receives control information and performs actions according to the control information. The voice editing module 3 is connected to the voice recognition module 1 , and the voice editing module 3 is used to edit the recognition mode of the voice recognition module 1 .
通过语音编辑模块3的设置,实现对语音识别模块1的识别模式的编辑,实现语音控制的优化,使得操作者日常使用的语音信息能够自定义为控制信息被识别,极大的增加了语音控制的便利性和操作体验。Through the setting of the voice editing module 3, the editing of the recognition mode of the voice recognition module 1 is realized, and the optimization of the voice control is realized, so that the voice information used by the operator can be customized as the control information to be recognized, which greatly increases the voice control. convenience and operating experience.
复参照图1,进一步,本实施例中的优化语音控制的系统还还包括终端模块4。终端模块4与语音编辑模块3连接。终端模块4用于发送编辑信息至语音编辑模块3,语音编辑模块3根据编辑信息对语音识别模块1的识别模式进行编辑。通过终端模块4的设置,以便于操作者编辑信息的输入,增加了操作进行语音控制优化的便捷性。Referring back to FIG. 1 , further, the system for optimizing voice control in this embodiment further includes a terminal module 4 . The terminal module 4 is connected with the voice editing module 3 . The terminal module 4 is used for sending editing information to the speech editing module 3, and the speech editing module 3 edits the recognition mode of the speech recognition module 1 according to the editing information. Through the setting of the terminal module 4, it is convenient for the operator to edit the input of information, and the convenience of the operation to optimize the voice control is increased.
复参照图1,更进一步,语音识别模块1包括语音识别单元11以及语音处理单元12。语音识别单元11与语音处理单元12连接。语音处理单元12内存储有识别模式。语音识别单元11接收语音信息,并将语音信息传递至语音处理单元12。语音处理单元12根据识别模式对语音信息进行命令识别, 并将识别为命令的语音信息转化为控制信息。Referring back to FIG. 1 , further, the speech recognition module 1 includes a speech recognition unit 11 and a speech processing unit 12 . The speech recognition unit 11 is connected to the speech processing unit 12 . A recognition pattern is stored in the speech processing unit 12 . The voice recognition unit 11 receives the voice information and transfers the voice information to the voice processing unit 12 . The voice processing unit 12 performs command recognition on the voice information according to the recognition mode, and converts the voice information recognized as commands into control information.
具体的,语音识别单元11可采用现有的语音识别器或语音识别电路,例如话筒,其可以对人发出的声音进行识别输入,本实施例中的语音信息为人体发出的话语声音。语音处理单元12可采用具有存储和语音处理功能的MCU芯片。语音识别单元11将语音信息传递至语音处理单元12之后,语音处理单元12先根据识别模式对语音信息进行命令识别,即对语音信息进行解析和判断,当判断出语音信息为命令时,才将语音信息转化为控制信息。Specifically, the speech recognition unit 11 can use an existing speech recognizer or speech recognition circuit, such as a microphone, which can recognize and input human voices. The voice information in this embodiment is the speech voices made by the human body. The voice processing unit 12 may use an MCU chip with storage and voice processing functions. After the voice recognition unit 11 transmits the voice information to the voice processing unit 12, the voice processing unit 12 first performs command recognition on the voice information according to the recognition mode, that is, parses and judges the voice information, and only when it is judged that the voice information is a command, Voice information is converted into control information.
可以理解的是,只有是操作者发出语音是有效指令进行才有进行后续操作的必要,否则操作发出的语音是误操作,则进行后操作无意义,这是语音处理单元12先根据识别模式对语音信息进行命令识别的意义。此处的识别模式为已经设定好语种类型和语音腔调的命令语音,该设定是操作者自己通过语音编辑模块3进行设定的,语种类型和语音腔调都是操作者日常所用,所熟悉的,习惯的。其中语种类型可为国家语言,例如“普通话”、“英语”、“德语”等,也可为地方语言,例如“粤语”、“闽南语”、“河南话”、“上海话”等,语音腔调则为上述语种类型的不同口音。It can be understood that the follow-up operation is necessary only if the operator's voice is a valid instruction to carry out, otherwise the voice issued by the operation is a misoperation, and the subsequent operation is meaningless. The meaning of voice information for command recognition. The recognition mode here is the command voice for which the language type and voice tone have been set. The setting is set by the operator himself through the voice editing module 3. The language type and voice tone are all used by the operator daily and are familiar to them. Yes, used to. The language type can be a national language, such as "Mandarin", "English", "German", etc., or a local language, such as "Cantonese", "Hokkien", "Henan dialect", "Shanghai dialect", etc. The accent is the different accents of the above-mentioned language types.
本实施例中对语音信息进行命令识别是采用如下方式:在语音处理21内存储预设命令信息,若语音信息与预设命令信息相匹配,则判断语音信息为命令,否则,则为不是命令。例如,在语音处理单元12内存储的识别模式中有“开窗帘”或“关窗帘”等命令词语作为预设命令信息,即预设的命令语音,当语音识别单元11输入的语音信息在解析之后,与“开窗帘”或“关窗帘”等命令语音相匹配时,则语音处理单元12将语音信息判断为命令,若语音识别单元11输入的语音信息在解析之后,与“开窗帘”或“关窗帘”等命令语音不匹配,则语音处理单元12将语音信息判断为非命令。In this embodiment, the command recognition is performed on the voice information in the following manner: the preset command information is stored in the voice processing 21, and if the voice information matches the preset command information, it is judged that the voice information is a command, otherwise, it is not a command . For example, in the recognition mode stored in the speech processing unit 12, there are command words such as "open the curtain" or "close the curtain" as the preset command information, that is, the preset command voice. When the voice information input by the voice recognition unit 11 is parsed After that, when it matches the command voice such as "open the curtain" or "close the curtain", the voice processing unit 12 judges the voice information as a command. If the command voices such as "close the curtains" do not match, the voice processing unit 12 determines that the voice information is not a command.
在将语音信息判断为命令后,语音处理单元12再根据识别模式将判断为命令的语音信息转化为控制信息。例如,“开窗帘”或“关窗帘”的语音信息被识别为命令后,语音处理单元12将上述语音信息转化为与“开窗帘”或“关 窗帘”相适配的控制信息。After judging the voice information as a command, the voice processing unit 12 converts the voice information judged as a command into control information according to the recognition mode. For example, after the voice information of "open the curtains" or "close the curtains" is recognized as a command, the voice processing unit 12 converts the above voice information into control information suitable for "open the curtains" or "close the curtains".
优选的,语音控制模块2包括语音控制单元21以及动作执行单元22。语音控制单元21与动作执行单元22连接。语音控制单元21接收控制信息,并形成控制指令传递给动作执行单元22。动作执行单元22根据控制指令执行动作。Preferably, the voice control module 2 includes a voice control unit 21 and an action execution unit 22 . The voice control unit 21 is connected to the action execution unit 22 . The voice control unit 21 receives the control information, and forms a control instruction and transmits it to the action execution unit 22 . The action execution unit 22 executes the action according to the control instruction.
具体的,语音控制单元21与语音处理单元12连接。语音处理单元12将语音信息判断为命令,并根据识别模式将判断为命令的语音信息转化为控制信息之后,将控制信息传递至语音控制单元21。本实施例中的语音控制单元21为具有控制功能的MCU芯片,例如电机控制芯片。动作执行单元22为具有动作执行功能的装置,例如卷帘用的电机。当语音处理单元12将控制信息传递至语音控制单元21后,语音控制单元21形成正转或反转等控制指令给动作执行单元22,动作执行单元22完成对应的动作,从而实现开关窗帘的动作。Specifically, the voice control unit 21 is connected to the voice processing unit 12 . The voice processing unit 12 judges the voice information as a command, converts the voice information judged as a command into control information according to the recognition mode, and then transmits the control information to the voice control unit 21 . The voice control unit 21 in this embodiment is an MCU chip with a control function, such as a motor control chip. The action execution unit 22 is a device having an action execution function, such as a motor for a rolling shutter. After the voice processing unit 12 transmits the control information to the voice control unit 21, the voice control unit 21 forms control commands such as forward rotation or reverse rotation to the action execution unit 22, and the action execution unit 22 completes the corresponding action, thereby realizing the action of opening and closing the curtains .
复参照图1,更进一步,语音编辑模块3包括语音编辑单元31以及第一无线通信单元32。语音编辑单元31与第一无线通信单元32连接。终端模块4包括输入单元41以及第二无线通信单元42。输入单元41与第二无线通信单元42连接。第一无线通信单元32与第二无线通信单元42无线连接。输入单元41输入的编辑信息通过第二无线通信单元42以及第一无线通信单元32的配合无线传送至语音编辑单元31。语音编辑单元31根据编辑信息更改语音识别模块1的识别模式。Referring back to FIG. 1 , further, the voice editing module 3 includes a voice editing unit 31 and a first wireless communication unit 32 . The voice editing unit 31 is connected to the first wireless communication unit 32 . The terminal module 4 includes an input unit 41 and a second wireless communication unit 42 . The input unit 41 is connected to the second wireless communication unit 42 . The first wireless communication unit 32 is wirelessly connected to the second wireless communication unit 42 . The editing information input by the input unit 41 is wirelessly transmitted to the speech editing unit 31 through the cooperation of the second wireless communication unit 42 and the first wireless communication unit 32 . The voice editing unit 31 changes the recognition mode of the voice recognition module 1 according to the editing information.
语音编辑单元31与语音处理单元12连接。操作者将通过输入单元41将编辑信息输入后,语音编辑单元31根据编辑信息对语音处理单元12内的识别模式进行编辑更改。具体的,输入单元41可为智能手机内置的APP,例如微信小程序。当操作者需要对语音识别模块1的识别模式进行编辑时,即当操作者需要对控制用语音的语种类型和语音腔调进行更改时,则启动输入单元41。在输入单元41启动之后,激活第二无线通信单元42与第一无线通 信单元32的无线连接状态。本实施例中的第一无线通信单元42为智能手机内置的蓝牙模块,第一无线通信单元32也为蓝牙模块,两则无线连接后即可通过蓝牙信号实现输入单元41与语音编辑单元31的信息交互。本实施例中的语音编辑单元31为具有编辑功能的MCU芯片,例如具有烧录功能的芯片,也可采用烧录器,其能够对语音处理单元12存储的识别模式进行更改,即能实现语音识别的语种类型和语音腔调进行编辑更改。例如,输入单元41具有编辑输入按钮,在输入单元41与语音编辑单元31实现信息交互的连接后,操作者长按该编辑输入按钮后,按照自己的日常使用的语种和腔调习惯,说出对应的编辑信息,例如操作者说出“打开窗帘”、“关闭窗帘”的粤语语音,输入完成后,输入单元41将“打开窗帘”、“关闭窗帘”的粤语语音通过第二无线通信单元42与第一无线通信单元32配合传递至语音编辑单元31,语音编辑单元31将语音处理单元12原先存储的识别模式删除,将新的“打开窗帘”、“关闭窗帘”粤语的识别模式录入到语音处理单元12内,形成新的识别模式,从而完成识别模式的编辑,更改了语音处理单元12的识别模式。The voice editing unit 31 is connected to the voice processing unit 12 . After the operator inputs the editing information through the input unit 41, the speech editing unit 31 edits and changes the recognition mode in the speech processing unit 12 according to the editing information. Specifically, the input unit 41 may be an APP built in a smartphone, such as a WeChat applet. When the operator needs to edit the recognition mode of the speech recognition module 1 , that is, when the operator needs to change the language type and tone of the control speech, the input unit 41 is activated. After the input unit 41 is activated, the wireless connection state of the second wireless communication unit 42 and the first wireless communication unit 32 is activated. In this embodiment, the first wireless communication unit 42 is a built-in Bluetooth module of a smart phone, and the first wireless communication unit 32 is also a Bluetooth module. After the two are wirelessly connected, the communication between the input unit 41 and the voice editing unit 31 can be realized through Bluetooth signals. Information exchange. The voice editing unit 31 in this embodiment is an MCU chip with an editing function, such as a chip with a burning function, or a burner can also be used, which can change the recognition mode stored in the voice processing unit 12, that is, realize voice Edit the recognized language type and tone of voice. For example, the input unit 41 has an editing input button. After the input unit 41 and the voice editing unit 31 are connected for information interaction, the operator presses the editing input button for a long time, and speaks the corresponding language according to his daily language and accent habits. For example, the operator speaks the Cantonese voices of "open the curtains" and "close the curtains". After the input is completed, the input unit 41 communicates the Cantonese voices of "open the curtains" and "close the curtains" with the second wireless communication unit 42. The first wireless communication unit 32 cooperates and transmits it to the speech editing unit 31, and the speech editing unit 31 deletes the recognition pattern originally stored by the speech processing unit 12, and records the new recognition patterns of “open curtains” and “close curtains” in Cantonese into the speech processing unit. In the unit 12, a new recognition mode is formed, so that the editing of the recognition mode is completed, and the recognition mode of the speech processing unit 12 is changed.
优选的,为了确保编辑信息输入的准确性,输入单元41还可加入编辑信息确认和校验功能。编辑信息确认是将操作者输入的编辑信息进行在智能手机的APP上进行显示、播放让操作者确认,编辑信息校验时让操作者进行第二次编辑信息输入,并与第一次输入的编辑信息进行匹配性验证,若前后一致则完成验证。Preferably, in order to ensure the accuracy of editing information input, the input unit 41 may further add editing information confirmation and verification functions. Editing information confirmation is to display and play the editing information entered by the operator on the APP of the smartphone for the operator to confirm. Edit the information for matching verification, and complete the verification if they are consistent.
如此,操作者通过输入单元41与语音编辑单元31配合,完成了语音处理单元12的识别模式的编辑更改,使得操作者可以根据自己的习惯和喜好进行控制语音的自定义更改,极大的增加了语音控制的便利性和操作体验。In this way, the operator cooperates with the voice editing unit 31 through the input unit 41 to complete the editing and modification of the recognition mode of the voice processing unit 12, so that the operator can customize and modify the control voice according to his own habits and preferences, which greatly increases the The convenience and operation experience of voice control are improved.
实施例二 Embodiment 2
参照图2,图2为实施例二中优化语音控制的方法的流程图。本实施例中的优化语音控制的方法可基于实施例一种优化语音控制的系统实现,其具体包括以下步骤:Referring to FIG. 2 , FIG. 2 is a flowchart of a method for optimizing voice control in the second embodiment. The method for optimizing voice control in this embodiment can be implemented based on a system for optimizing voice control in the embodiment, which specifically includes the following steps:
S1,语音编辑模块3对语音识别模块1的识别模式进行编辑。S1 , the speech editing module 3 edits the recognition mode of the speech recognition module 1 .
S2,语音识别模块1接收语音信息,并根据编辑后的识别模式对语音信息进行命令识别,并将识别为命令的语音信息转化为控制信息。S2, the speech recognition module 1 receives the speech information, performs command recognition on the speech information according to the edited recognition mode, and converts the speech information recognized as commands into control information.
S3,语音控制模块2根据控制信息执行动作。S3, the voice control module 2 performs an action according to the control information.
通过语音编辑模块3对语音识别模块1的识别模式进行编辑,使得语音识别模块1内识别模式可根据操作者所习惯和喜好进行自定义编辑修改,极大的增加了语音控制的便利性和操作体验。The recognition mode of the voice recognition module 1 is edited by the voice editing module 3, so that the recognition mode in the voice recognition module 1 can be customized and modified according to the operator's habit and preference, which greatly increases the convenience and operation of voice control. experience.
优选的,在步骤S1,语音编辑模块3对语音识别模块1的识别模式进行编辑,之前还包括以下步骤:Preferably, in step S1, the speech editing module 3 edits the recognition mode of the speech recognition module 1, which also includes the following steps before:
S0,终端模块4发送编辑信息至语音编辑模块3。S0, the terminal module 4 sends the editing information to the voice editing module 3.
操作者通过终端模块4将编辑信息发送至编辑模块3,极大了增加了编辑的便捷性。The operator sends the editing information to the editing module 3 through the terminal module 4, which greatly increases the convenience of editing.
优选的,在步骤S0,终端模块4发送编辑信息至语音编辑模块3,之前还包括:Preferably, in step S0, the terminal module 4 sends the editing information to the voice editing module 3, which further includes:
S00,终端模块4发送确认信息和校验信息供操作者确认。终端模块4通过发送确认信息和校验信息工操作者确认,以确保编辑信息的准确性。S00, the terminal module 4 sends confirmation information and verification information for the operator to confirm. The terminal module 4 ensures the accuracy of the edited information by sending the confirmation information and verifying the information for the operator to confirm.
在步骤S00,终端模块4发送确认信息和校验信息供操作者确认,之前还包括:In step S00, the terminal module 4 sends confirmation information and verification information for the operator to confirm, which also includes:
S000,操作者通过终端模块4进行编辑信息的输入。S000 , the operator inputs editing information through the terminal module 4 .
本实施例中的识别模式包括语种类型,优选的,语音识别模式还包括语音腔调。The recognition mode in this embodiment includes language types, and preferably, the speech recognition mode also includes voice intonation.
上述S000、S00、S0、S1、S2、S3各个步骤的实现,可参见实施例一中的优化语音控制的系统,此处不再赘述。For the implementation of the above steps S000, S00, S0, S1, S2, and S3, reference may be made to the system for optimizing voice control in Embodiment 1, and details are not repeated here.
综上,通过语音编辑模块的设置,实现对语音识别模块的识别模式的编辑,实现语音控制的优化,使得操作者日常使用的语音信息能够自定义为控制信息被识别,极大的增加了语音控制的便利性和操作体验。In summary, through the settings of the voice editing module, the editing of the recognition mode of the voice recognition module is realized, and the optimization of voice control is realized, so that the voice information used by the operator can be customized to be recognized as control information, which greatly increases the voice. Convenience of control and operating experience.
上所述仅为本发明的实施方式而已,并不用于限制本发明。对于本领域技术人员来说,本发明可以有各种更改和变化。凡在本发明的精神和原理的内所作的任何修改、等同替换、改进等,均应包括在本发明的权利要求范围之内。The above description is merely an embodiment of the present invention, and is not intended to limit the present invention. Various modifications and variations of the present invention are possible for those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention shall be included within the scope of the claims of the present invention.

Claims (10)

  1. 一种优化语音控制的系统,其特征在于,包括:A system for optimizing voice control, comprising:
    语音识别模块(1),其用于接收语音信息,并根据识别模式对所述语音信息进行命令识别,并将识别为命令的所述语音信息转化为控制信息;A voice recognition module (1), which is used for receiving voice information, and carries out command recognition to the voice information according to a recognition mode, and converts the voice information identified as a command into control information;
    与所述语音识别模块(1)连接的语音控制模块(2);所述语音控制模块(2)接收所述控制信息,并根据所述控制信息执行动作;以及a voice control module (2) connected to the voice recognition module (1); the voice control module (2) receives the control information and performs actions according to the control information; and
    与所述语音识别模块(1)连接的语音编辑模块(3);所述语音编辑模块(3)用于对所述语音识别模块(1)的所述识别模式进行编辑。A voice editing module (3) connected to the voice recognition module (1); the voice editing module (3) is used for editing the recognition mode of the voice recognition module (1).
  2. 根据权利要求1所述的优化语音控制的系统,其特征在于,其还包括终端模块(4);所述终端模块(4)与所述语音编辑模块(3)连接;所述终端模块(4)用于发送编辑信息至所述语音编辑模块(3),所述语音编辑模块(3)根据所述编辑信息对所述语音识别模块(1)的所述识别模式进行编辑。The system for optimizing voice control according to claim 1, characterized in that it further comprises a terminal module (4); the terminal module (4) is connected with the voice editing module (3); the terminal module (4) ) is used to send editing information to the speech editing module (3), and the speech editing module (3) edits the recognition mode of the speech recognition module (1) according to the editing information.
  3. 根据权利要求1所述的优化语音控制的系统,其特征在于,所述语音识别模块(1)包括语音识别单元(11)以及语音处理单元(12);所述语音识别单元(11)与所述语音处理单元(12)连接;所述语音处理单元(12)内存储有所述识别模式;所述语音识别单元(11)接收所述语音信息,并将所述语音信息传递至所述语音处理单元(12);所述语音处理单元(12)根据识别模式对语音信息进行命令识别,并将识别为命令的语音信息转化为控制信息。The system for optimizing voice control according to claim 1, wherein the voice recognition module (1) comprises a voice recognition unit (11) and a voice processing unit (12); the voice processing unit (12) is connected; the voice processing unit (12) stores the recognition pattern; the voice recognition unit (11) receives the voice information and transmits the voice information to the voice A processing unit (12); the voice processing unit (12) performs command recognition on the voice information according to the recognition mode, and converts the voice information recognized as commands into control information.
  4. 根据权利要求1所述的优化语音控制的系统,其特征在于,所述语音控制模块(2)包括语音控制单元(21)以及动作执行单元(22);所述语音控制单元(21)与所述动作执行单元(22)连接;所述语音控制单元(21)接收所述控制信息,并形成控制指令传递给所述动作执行单元(22);所述动作执行单元(22)根据所述控制指令执行动作。The system for optimizing voice control according to claim 1, wherein the voice control module (2) comprises a voice control unit (21) and an action execution unit (22); the voice control unit (21) is connected to the the action execution unit (22) is connected; the voice control unit (21) receives the control information, and forms a control instruction and transmits it to the action execution unit (22); the action execution unit (22) according to the control Instructions perform actions.
  5. 根据权利要求2-4任一所述的优化语音控制的系统,其特征在于,所 述语音编辑模块(3)包括语音编辑单元(31)以及第一无线通信单元(32);所述语音编辑单元(31)与所述第一无线通信单元(32)连接;所述终端模块(4)包括输入单元(41)以及第二无线通信单元(42);所述输入单元(41)与所述第二无线通信单元(42)连接;所述第一无线通信单元(32)与所述第二无线通信单元(42)无线连接;所述输入单元(41)输入的所述编辑信息通过所述第二无线通信单元(42)以及所述第一无线通信单元(32)的配合无线传送至所述语音编辑单元(31);所述语音编辑单元(31)根据所述编辑信息更改所述语音识别模块(1)的所述识别模式。The system for optimizing voice control according to any one of claims 2-4, wherein the voice editing module (3) comprises a voice editing unit (31) and a first wireless communication unit (32); the voice editing The unit (31) is connected with the first wireless communication unit (32); the terminal module (4) includes an input unit (41) and a second wireless communication unit (42); the input unit (41) is connected with the The second wireless communication unit (42) is connected; the first wireless communication unit (32) is wirelessly connected with the second wireless communication unit (42); the editing information input by the input unit (41) is passed through the The cooperation of the second wireless communication unit (42) and the first wireless communication unit (32) is wirelessly transmitted to the voice editing unit (31); the voice editing unit (31) modifies the voice according to the editing information The recognition mode of the recognition module (1).
  6. 一种优化语音控制的方法,包括:A method of optimizing voice control, comprising:
    语音编辑模块(3)对语音识别模块(1)的识别模式进行编辑;The speech editing module (3) edits the recognition mode of the speech recognition module (1);
    所述语音识别模块(1)接收语音信息,并根据编辑后的所述识别模式对所述语音信息进行命令识别,并将识别为命令的所述语音信息转化为控制信息;The voice recognition module (1) receives voice information, and performs command recognition on the voice information according to the edited recognition pattern, and converts the voice information identified as a command into control information;
    所述语音控制模块(2)根据所述控制信息执行动作。The voice control module (2) performs actions according to the control information.
  7. 根据权利要求6所述的优化语音控制的方法,其特征在于,语音编辑模块(3)对语音识别模块(1)的识别模式进行编辑,之前还包括:The method for optimizing voice control according to claim 6, wherein the voice editing module (3) edits the recognition pattern of the voice recognition module (1), and further comprises:
    所述终端模块(4)发送编辑信息至所述语音编辑模块(3)。The terminal module (4) sends editing information to the voice editing module (3).
  8. 根据权利要求7所述的优化语音控制的方法,其特征在于,所述终端模块(4)发送编辑信息至所述语音编辑模块(3),之前还包括:The method for optimizing voice control according to claim 7, wherein the terminal module (4) sends editing information to the voice editing module (3), before further comprising:
    所述终端模块(4)发送确认信息和校验信息供操作者确认。The terminal module (4) sends confirmation information and verification information for the operator to confirm.
  9. 根据权利要求8所述的优化语音控制的方法,其特征在于,所述终端模块(4)发送确认信息和校验信息供操作者确认,之前还包括:The method for optimizing voice control according to claim 8, wherein the terminal module (4) sends confirmation information and verification information for the operator to confirm, before further comprising:
    操作者通过所述终端模块(4)进行所述编辑信息的输入。The operator inputs the editing information through the terminal module (4).
  10. 根据权利要求6所述的优化语音控制的方法,其特征在于,所述识别模式包括语种类型和语音腔调。The method for optimizing voice control according to claim 6, wherein the recognition mode includes language type and voice intonation.
PCT/CN2020/114184 2020-08-28 2020-09-09 Voice control optimization system and method WO2022041319A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010888455.7 2020-08-28
CN202010888455.7A CN111986672A (en) 2020-08-28 2020-08-28 System and method for optimizing voice control

Publications (1)

Publication Number Publication Date
WO2022041319A1 true WO2022041319A1 (en) 2022-03-03

Family

ID=73440889

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/114184 WO2022041319A1 (en) 2020-08-28 2020-09-09 Voice control optimization system and method

Country Status (2)

Country Link
CN (1) CN111986672A (en)
WO (1) WO2022041319A1 (en)

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101236744A (en) * 2008-02-29 2008-08-06 北京联合大学 Speech recognition object response system and method
CN204595492U (en) * 2015-05-07 2015-08-26 河南东骏智能科技有限公司 Wireless instructions load voice controller
KR20150105622A (en) * 2015-08-31 2015-09-17 한지흠 A curtain controller reacting to specific alarm sound unconsciously
CN110432739A (en) * 2019-08-18 2019-11-12 朱大春 A kind of intelligent curtain
CN110575039A (en) * 2019-08-13 2019-12-17 广东省安心加科技有限公司 Intelligent household curtain and automatic light control method
CN210018875U (en) * 2018-12-27 2020-02-07 重庆阿拉丁魔方酒店管理有限公司 Voice-controlled curtain for hotel
WO2020122653A1 (en) * 2018-12-14 2020-06-18 Samsung Electronics Co., Ltd. Electronic apparatus and controlling method thereof
CN111415668A (en) * 2020-04-23 2020-07-14 惠州莫思特科技有限公司 Intelligent language control system and device
CN212907066U (en) * 2020-08-28 2021-04-06 广东奥科伟业科技发展有限公司 System for optimizing voice control

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101236744A (en) * 2008-02-29 2008-08-06 北京联合大学 Speech recognition object response system and method
CN204595492U (en) * 2015-05-07 2015-08-26 河南东骏智能科技有限公司 Wireless instructions load voice controller
KR20150105622A (en) * 2015-08-31 2015-09-17 한지흠 A curtain controller reacting to specific alarm sound unconsciously
WO2020122653A1 (en) * 2018-12-14 2020-06-18 Samsung Electronics Co., Ltd. Electronic apparatus and controlling method thereof
CN210018875U (en) * 2018-12-27 2020-02-07 重庆阿拉丁魔方酒店管理有限公司 Voice-controlled curtain for hotel
CN110575039A (en) * 2019-08-13 2019-12-17 广东省安心加科技有限公司 Intelligent household curtain and automatic light control method
CN110432739A (en) * 2019-08-18 2019-11-12 朱大春 A kind of intelligent curtain
CN111415668A (en) * 2020-04-23 2020-07-14 惠州莫思特科技有限公司 Intelligent language control system and device
CN212907066U (en) * 2020-08-28 2021-04-06 广东奥科伟业科技发展有限公司 System for optimizing voice control

Also Published As

Publication number Publication date
CN111986672A (en) 2020-11-24

Similar Documents

Publication Publication Date Title
WO2020029500A1 (en) Voice command customization method, device, apparatus, and computer storage medium
US20080059191A1 (en) Method, system and apparatus for improved voice recognition
EP1256936B1 (en) Method for the training or the adaptation of a speech recognizer
US7451081B1 (en) System and method of performing speech recognition based on a user identifier
WO2014180218A1 (en) Update method, apparatus and system for voice recognition device
ES2386673T3 (en) Voice conversion device and procedure
US20080167868A1 (en) Systems and methods for intelligent control of microphones for speech recognition applications
CN107929044B (en) Moxibustion instrument capable of listening to speech and control method thereof
JP2002540703A (en) Oral user interface for call facilitator
CN104794834A (en) Intelligent voice doorbell system and implementation method thereof
KR102060775B1 (en) Electronic device for performing operation corresponding to voice input
JP2001005485A (en) Method and device to improve activity of voice control device
EP1851757A1 (en) Selecting an order of elements for a speech synthesis
JP2000089781A (en) Speech recognition method, device therefor, and recording medium stored with speech recognition processing program
CN204791241U (en) Voice interaction formula access control system
WO2022041319A1 (en) Voice control optimization system and method
CN212907066U (en) System for optimizing voice control
US7328159B2 (en) Interactive speech recognition apparatus and method with conditioned voice prompts
CN113643707A (en) Identity verification method and device and electronic equipment
CN110010122B (en) Voice control method for nursing bed
WO2022198365A1 (en) Voice control method and apparatus
JP2001312297A (en) Voice recognition device
JP4487298B2 (en) Voice recognition device
CN109359307B (en) Translation method, device and equipment for automatically identifying languages
KR20220125523A (en) Electronic device and method for processing voice input and recording in the same

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20950988

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20950988

Country of ref document: EP

Kind code of ref document: A1