WO2022247156A1 - Voice interaction system and method, and intelligent device - Google Patents

Publication number
WO2022247156A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice interaction
unit
voice
interaction unit
instruction
Application number
PCT/CN2021/130287
Other languages
French (fr)
Chinese (zh)
Inventor
李刚
董飞
曲国健
赵锬鸿
李响
王伯长
严韶明
陈亚伟
Original Assignee
京东方科技集团股份有限公司
北京京东方显示技术有限公司
Application filed by 京东方科技集团股份有限公司 and 北京京东方显示技术有限公司
Publication of WO2022247156A1

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223 Execution procedure of a spoken command

Definitions

  • the present disclosure relates to the technical field of smart devices, in particular to a voice interaction system and method, and a smart device.
  • the Internet of Things has become an important driving force for a new round of global technological revolution and industrial transformation.
  • "Xiaodu", launched by Baidu, and Siri on Apple mobile phones are both intelligent voice technologies. The core idea is simple: give the machine a near-human ability in spoken dialogue so that it becomes part of people's daily lives.
  • As an important component of a smart voice speaker, the microphone determines its voice interaction capability. In the related art, for a smart voice speaker to interact, the microphone has to give an indication telling the user that voice recognition has started.
  • Embodiments of the present disclosure provide a voice interaction system and method, and a smart device, which allow a smart voice speaker to be turned on by sensing body movement and allow interaction with the smart device as a whole.
  • An embodiment of the present disclosure provides a voice interaction system, including:
  • a voice interaction unit, used to collect and recognize a target voice instruction of a user;
  • a photoelectric sensing unit, connected to the voice interaction unit and used to receive and recognize a target limb instruction of the user and to switch the voice interaction unit on or off according to the target limb instruction;
  • an instruction control unit, connected to the voice interaction unit and used to judge whether the voice interaction unit is in an on state and, if so, to control the smart device to perform the action corresponding to the target voice instruction received and recognized by the voice interaction unit.
  • The instruction control unit is also used to determine, when the voice interaction unit is in the on state, whether the voice interaction unit receives a voice instruction within a predetermined time and, if not, to send a close instruction for the voice interaction unit to the voice interaction unit.
  • The voice interaction unit is further configured to send a close instruction for the photoelectric sensing unit to the photoelectric sensing unit when it receives its own close instruction.
  • The voice interaction system further includes an image acquisition unit. The instruction control unit is further configured to receive and identify the image data collected by the image acquisition unit and, when the image data includes a target gesture, to send an activation instruction for the voice interaction unit to the voice interaction unit;
  • the voice interaction unit is further configured to send an activation instruction for the photoelectric sensing unit to the photoelectric sensing unit when it receives its own activation instruction.
  • The voice interaction unit communicates with the instruction control unit through serial port instructions.
  • the photoelectric sensing unit includes a photoelectric switch
  • the photoelectric switch includes:
  • a housing, the interior of which is hollow; the housing includes a front end and a rear end, the front end is provided with an indication mark, and the rear end is open;
  • a photoelectric sensor arranged in the housing;
  • a circuit board arranged in the housing and connected to the photoelectric sensor;
  • a rear cover fastened to the rear end of the housing;
  • a signal transmission harness, one end of which is connected to the circuit board and the other end of which extends out of the rear cover.
  • The rear end of the housing is connected to the rear cover through a buckle.
  • the front end is provided with a first through hole and a second through hole;
  • the photoelectric sensor is provided with a receiver and an emitter, the receiver is located at the first through hole, and the emitter is located at the second through hole.
  • the indication mark includes an engraved hollow mark.
  • the photoelectric switch further includes: a light source disposed in the housing.
  • the voice interaction unit includes a microphone module
  • the microphone module includes:
  • an upper cover, hollow inside, including a front end and a rear end; the front end is provided with at least two sound-receiving holes, and the rear end is open;
  • at least two sound-receiving microphones arranged inside the upper cover, each corresponding to one of the sound-receiving holes, with a sealing ring provided at each sound-receiving hole;
  • a printed circuit board arranged inside the upper cover and connected to the sound-receiving microphones;
  • a bottom cover arranged at the rear end of the upper cover.
  • The microphone module also includes:
  • a signal transmission wire harness connected to the printed circuit board, and a wire pressing plate arranged on the printed circuit board for pressing the signal transmission wire harness.
  • The bottom cover and the wire pressing plate are fixed to the upper cover through buckles and sealed with a sealant.
  • An embodiment of the present disclosure also provides a voice interaction method, including:
  • controlling the smart device to perform the action corresponding to the user's target voice instruction received and recognized by the voice interaction unit;
  • turning on the voice interaction unit according to the user's target limb instruction received and recognized by the photoelectric sensing unit, and controlling the smart device to perform the action corresponding to the target voice instruction received and recognized by the voice interaction unit.
  • The method also includes:
  • when the voice interaction unit is turned on, judging whether the voice interaction unit has received a voice instruction within a predetermined time and, if not, controlling the voice interaction unit to be turned off.
  • The method also includes:
  • when the voice interaction unit is turned on, judging whether the voice interaction unit has received a voice instruction within a predetermined time and, if not, controlling the photoelectric sensing unit to be turned off.
  • the method also includes:
  • the photoelectric sensing unit and the voice interaction unit are controlled to be turned on.
  • the embodiment of the present disclosure also provides a smart device, on which the voice interaction system provided by the embodiment of the present disclosure is provided.
  • The voice interaction system and method and the smart device provided by the embodiments of the present disclosure combine the intelligent voice interaction unit with the photoelectric sensing unit, so that the voice interaction unit can be switched on and off through body movements and interaction with the whole machine is realized. This solves the technical problems of how the photoelectric sensing unit turns on the voice interaction unit and how the instruction control unit switches the voice interaction unit and the photoelectric sensing unit.
  • FIG. 1 is a structural block diagram of a voice interaction system provided in an embodiment of the present disclosure
  • FIG. 2 is a logical block diagram of a voice interaction system provided in an embodiment of the present disclosure
  • Fig. 3 is a schematic diagram of the communication mode between the command control unit, the voice interaction unit and the photoelectric sensing unit;
  • FIG. 4 is a schematic structural diagram of a photoelectric switch in an embodiment of the present disclosure.
  • FIG. 5 is a schematic diagram of the structure of the front end of the housing of the photoelectric switch in the embodiment of the present disclosure
  • FIG. 6 is an exploded diagram of the structure of the microphone module in the embodiment of the disclosure.
  • the voice interaction system provided by the embodiments of the present disclosure can be applied to various smart devices, for example, smart refrigerators, smart washing machines, and smart TVs.
  • The voice interaction system of the smart device provided by the embodiment of the present disclosure includes:
  • a voice interaction unit 100, used to collect and recognize the user's target voice instruction;
  • a photoelectric sensing unit 200, connected to the voice interaction unit 100 and used to receive and recognize the target limb instruction of the user and to switch the voice interaction unit 100 on or off according to the target limb instruction;
  • an instruction control unit 300, connected to the voice interaction unit 100 and used to determine whether the voice interaction unit 100 is in an on state and, if so, to control the smart device to perform the action corresponding to the target voice instruction received and recognized by the voice interaction unit 100.
  • The photoelectric sensing unit 200, the voice interaction unit 100 and the instruction control unit 300 are connected to form a voice interaction system, which can be applied to smart devices.
  • Combining the voice interaction unit 100 with the photoelectric sensing unit 200 makes it possible to switch the voice interaction unit 100 on and off through body movements and to interact with the whole machine. It solves the technical problems of how the photoelectric sensing unit 200 turns on the voice interaction unit 100 and how the instruction control unit 300 switches the voice interaction unit 100 and the photoelectric sensing unit 200.
  • The instruction control unit 300 is also used to determine, when the voice interaction unit 100 is in the on state, whether the voice interaction unit 100 receives a voice instruction within a predetermined time and, if not, to send the voice interaction unit 100 an instruction to close the voice interaction unit 100.
  • The voice interaction unit 100 is further configured to send a close instruction for the photoelectric sensing unit 200 to the photoelectric sensing unit 200 when it receives its own close instruction.
  • The voice interaction system further includes an image acquisition unit 400 connected to the instruction control unit 300. The instruction control unit 300 is also used to receive and identify the image data collected by the image acquisition unit 400 and, when the image data includes a target gesture, to send an opening instruction for the voice interaction unit 100 to the voice interaction unit 100. The voice interaction unit 100 is also used, on receiving its opening instruction, to send an opening instruction for the photoelectric sensing unit 200 to the photoelectric sensing unit 200.
  • the voice interaction unit 100 may include a microphone module (MIC) and the like.
  • The voice interaction unit can use different types of silicon microphones, and microphone modules can be designed with different numbers of silicon microphones.
  • To collect and recognize the user's target voice command, the microphone module may start recording the user's voice and send the recorded audio to the command control unit 300.
  • The photoelectric sensing unit 200 may include distance-sensing switch devices such as a photoelectric switch, a laser sensor switch, an electromagnetic induction sensor switch, or a capacitive sensor switch.
  • The photoelectric switch is based on infrared sensing technology. It exploits the physical properties of infrared light: the intensity of the infrared signal reflected by an obstacle differs with the obstacle's distance, so the reflected intensity can be used to detect that distance. Combining the infrared sensor with a switch circuit forms a distance-sensing switch.
  • To receive and identify the user's target limb instruction with the photoelectric switch, the user may perform a body movement within the sensing distance of the photoelectric switch, for example approaching the photoelectric switch or waving a hand; the photoelectric switch receives the reflected light signal and recognizes the body movement instruction from it.
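The sensing principle described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the source gives no signal levels or recognition algorithm, so the normalized intensity scale, the `DETECT_THRESHOLD` value and the `detect_wave` function are all hypothetical.

```python
from typing import Sequence

# Hypothetical calibration: normalized reflected-intensity reading expected
# at the edge of the sensing range. The source gives no such value.
DETECT_THRESHOLD = 0.6


def detect_wave(samples: Sequence[float], threshold: float = DETECT_THRESHOLD) -> bool:
    """Return True if the reflected signal rises to the threshold and then
    falls back below it, i.e. an object entered and left the sensing range."""
    entered = False
    for level in samples:
        if not entered and level >= threshold:
            entered = True   # obstacle came within range
        elif entered and level < threshold:
            return True      # obstacle left again: treat it as a wave
    return False


if __name__ == "__main__":
    print(detect_wave([0.1, 0.2, 0.8, 0.9, 0.3]))  # hand passes the sensor
    print(detect_wave([0.1, 0.2, 0.3]))            # nothing within range
```

A real photoelectric switch would likely add debouncing and ambient-light compensation; the sketch only shows how a stronger reflection from a nearer obstacle can be turned into a switching decision.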
  • the instruction control unit 300 may be a computer, an MCU, etc., for example, the instruction control unit 300 may be a PC terminal of a smart device.
  • The voice interaction unit 100 communicates with the instruction control unit 300 through serial port instructions.
  • The specific structures of the voice interaction unit 100, the photoelectric sensing unit 200 and the command control unit 300 are described later; the voice interaction process of the voice interaction system is explained in more detail below.
  • The voice interaction method of the voice interaction system of the smart device may include the following process:
  • The command control unit 300 receives the target voice command sent by the voice interaction unit 100 and, according to the target voice command received and recognized by the voice interaction unit 100, controls the smart device to execute the corresponding action, so as to realize voice interaction.
  • The process in the logic block diagram is serial number B → serial number C.
  • The command control unit 300 judges whether the voice interaction unit 100 receives a voice command within a predetermined time; if not, it sends a close command for the voice interaction unit 100 to the voice interaction unit 100.
  • When the command control unit 300 detects that no voice has been input to the voice interaction unit 100 within a certain period of time, it sends a shutdown command to the voice interaction unit 100 (serial number 2); the voice interaction unit 100 stops recording and no longer sends voice commands (serial number 3) to the command control unit 300. At the same time, the voice interaction unit 100 sends a shutdown command (serial number 4) to the photoelectric sensing unit 200, and the photoelectric sensing unit 200 is turned off.
  • The command control unit 300 sends a shutdown command to the voice interaction unit 100.
  • The voice interaction unit 100 can then stay in a standby state, which not only saves power but also helps protect privacy.
  • In standby, the user's voice is not recorded or saved by the voice interaction unit 100.
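The idle check described above can be sketched as follows. This is a minimal illustration, not the disclosed implementation: the class name `IdleTimeoutController`, the `send_close` callback and the injectable `clock` are assumptions introduced here, and the source does not specify the length of the predetermined time.

```python
import time
from typing import Callable, Optional


class IdleTimeoutController:
    """Sketch of the instruction control unit's idle check (hypothetical API)."""

    def __init__(self, timeout_s: float, send_close: Callable[[], None],
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.timeout_s = timeout_s    # the "predetermined time"; value is an assumption
        self.send_close = send_close  # hypothetical: sends the close instruction
        self.clock = clock
        self.voice_unit_on = False
        self._last_voice: Optional[float] = None

    def on_voice_unit_opened(self) -> None:
        # Start timing from the moment the voice interaction unit turns on.
        self.voice_unit_on = True
        self._last_voice = self.clock()

    def on_voice_instruction(self) -> None:
        # Any received voice instruction restarts the idle timer.
        if self.voice_unit_on:
            self._last_voice = self.clock()

    def poll(self) -> bool:
        """Send the close instruction if the unit has been idle too long."""
        if not self.voice_unit_on or self._last_voice is None:
            return False
        if self.clock() - self._last_voice >= self.timeout_s:
            self.send_close()
            self.voice_unit_on = False
            return True
        return False
```

In the described system the voice interaction unit would forward a matching close instruction to the photoelectric sensing unit once it receives its own; here that step would sit behind `send_close`.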
  • When the voice interaction unit 100 is in the off state and the photoelectric sensing unit 200 senses the user's target limb instruction, the photoelectric sensing unit 200 sends an instruction to the voice interaction unit 100 to turn it on. The command control unit 300 then receives the target voice command sent by the voice interaction unit 100 and, according to the target voice command received and recognized by the voice interaction unit 100, controls the smart device to perform the corresponding action, so as to realize voice interaction.
  • The photoelectric sensing unit 200 sends a start command to the voice interaction unit 100; the voice interaction unit 100 starts recording and sends the recorded data to the command control unit 300, which recognizes text from the recorded audio and acts on the user's voice instruction.
  • The process in the logic block diagram is serial number A → serial number B → serial number C.
  • The command control unit 300 can also autonomously send an activation command (serial number Y) to the voice interaction unit 100; the voice interaction unit 100 turns on and sends the received audio data containing the target voice command to the command control unit 300 (serial number Z). The voice interaction unit 100 also sends an opening command to the photoelectric sensing unit 200, and the photoelectric sensing unit 200 is turned on.
  • The voice interaction system also includes an image acquisition unit 400.
  • Based on the images collected by the image acquisition unit 400, the instruction control unit 300 can judge that the user intends to open the voice interaction unit 100 and send an opening instruction (serial number V) to the voice interaction unit 100; the voice interaction unit 100 in turn sends an opening instruction to the photoelectric sensing unit 200, so that both the voice interaction unit 100 and the photoelectric sensing unit 200 are opened.
  • As a specific embodiment, the command control unit 300 is the host (PC end) of the smart device, the voice interaction unit 100 includes a microphone module (MIC), and the photoelectric sensing unit 200 includes a photoelectric switch with an LED light. The specific workflow is as follows:
  • The photoelectric sensing unit 200 is connected to the voice interaction unit 100, and the voice interaction unit 100 is connected to the command control unit 300. The MIC defaults to the MUTE ON state: it remains in standby, does not output voice commands, and sends the corresponding MUTE ON state to the photoelectric sensing unit 200; the photoelectric switch remains closed (MUTE ON state) and the LED light is off.
  • When the PC sends the MUTE OFF command to the MIC through the UART, the MIC turns on and at the same time sends its current state to the photoelectric sensing unit 200 through the serial port; the photoelectric switch turns on (MUTE OFF state) and the LED light turns on. The photoelectric switch can receive the user's limb instructions only in the MUTE ON state (LED off); in the MUTE OFF state (LED on) it cannot receive them.
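The MUTE ON / MUTE OFF workflow above amounts to a small two-state machine mirrored between the microphone module and the photoelectric switch. The Python sketch below models it in memory; the class and method names are hypothetical, and the real units exchange these states as serial (UART) instructions whose byte format is not given in this text.

```python
from enum import Enum


class Mute(Enum):
    ON = "MUTE ON"    # microphone in standby, LED off
    OFF = "MUTE OFF"  # microphone recording, LED on


class PhotoelectricSwitch:
    def __init__(self) -> None:
        self.state = Mute.ON
        self.led_on = False

    def apply_state(self, state: Mute) -> None:
        # The switch mirrors the state reported by the microphone module.
        self.state = state
        self.led_on = state is Mute.OFF

    def on_gesture(self, mic: "Microphone") -> bool:
        # Limb instructions are only accepted in the MUTE ON state (LED off).
        if self.state is not Mute.ON:
            return False
        mic.handle_command(Mute.OFF)  # wake the microphone module
        return True


class Microphone:
    def __init__(self, switch: PhotoelectricSwitch) -> None:
        self.switch = switch
        self.state = Mute.ON          # default state is MUTE ON (standby)
        self.switch.apply_state(self.state)

    def handle_command(self, command: Mute) -> None:
        # Stands in for a command arriving from the PC or the switch;
        # the microphone then reports its new state to the switch.
        self.state = command
        self.switch.apply_state(command)
```

For example, calling `switch.on_gesture(mic)` on freshly constructed objects moves both units to MUTE OFF and lights the LED, and a later `mic.handle_command(Mute.ON)` (the PC's shutdown) returns both to standby.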
  • control instructions in the voice interaction system are shown in Table 1:
  • The photoelectric sensing unit 200 includes a photoelectric switch.
  • The photoelectric switch includes a housing 210, a photoelectric sensor 220, a circuit board 230, a rear cover 240 and a first signal transmission wire harness (not shown in the figure). The interior of the housing 210 is hollow; the housing 210 includes a front end and a rear end, the front end is provided with an indication mark 211, and the rear end is open. The photoelectric sensor 220 is arranged in the housing 210.
  • The circuit board 230 is arranged in the housing 210 and connected to the photoelectric sensor 220; the rear cover 240 is buckled onto the rear end of the housing 210; one end of the first signal transmission wire harness is connected to the circuit board 230, and the other end protrudes from the rear cover 240.
  • The photoelectric switch also includes a light source arranged in the housing 210.
  • The light source may comprise LED lights.
  • The rear end of the housing 210 is connected to the rear cover 240 by buckling.
  • The rear end of the housing 210 can be provided with a buckle groove.
  • The rear cover 240 is provided with a buckle 241.
  • The buckle 241 on the rear cover 240 snaps into the buckle groove of the housing 210.
  • The front end is provided with a first through hole 213 and a second through hole 214.
  • The photoelectric sensor 220 is provided with a receiver and an emitter; the receiver is located at the first through hole 213 and the emitter at the second through hole 214, so that the receiver and the emitter of the photoelectric sensor 220 are exposed.
  • The first through hole 213 and the second through hole 214 are sized to expose the receiver and the emitter; for example, the diameter of each through hole can be 2.8 mm.
  • the indication mark includes an engraved hollow mark.
  • the front surface of the housing 210 has an engraved hollow logo, which can clearly display the mark on the surface of the photoelectric switch housing 210.
  • The mark is text, such as "Wave Hand To Speak", reminding the user that waving a hand turns on the photoelectric switch and wakes the microphone to record.
  • The depth of the engraved hollow design on the front surface of the housing 210 may be about 0.3 mm.
  • The font is clearer.
  • The engraved font is thicker. If the housing 210 is formed by injection molding and its surface is painted, the housing 210 transmits light poorly: the thicker the font, the darker the display and the worse the contrast. With the intaglio hollow form, the font is thinner, the display brighter and the contrast better, which solves the technical problem that the mark on the housing 210 is not clearly displayed when the photoelectric switch is working.
  • The surface of the housing 210 can be painted.
  • The thickness of the paint can be 0.7-0.12 μm, so that the light of the LED is transmitted evenly and the lettering of the engraved hollow mark is displayed clearly.
  • The photoelectric sensor 220 in the photoelectric switch can be an infrared sensor; the value of the peripheral resistor determines the range of the drive current and the sensing distance of the infrared sensor.
  • The sensing distance of the photoelectric switch may be about 80 mm, but it is not limited thereto.
  • The emitter of the infrared sensor emits infrared light.
  • When the emitted infrared light meets a blocking obstacle, it is reflected back.
  • The receiver of the infrared sensor receives the reflected light signal and converts it into an electrical signal.
  • The photoelectric switch turns on the LED light according to the electrical signal of the infrared sensor and at the same time wakes up the microphone module with a UART command.
  • The circuit board 230 in the photoelectric switch includes a first surface and a second surface opposite to each other. An LED light is provided on the first surface as the light source of the photoelectric switch; after the LED light is turned on, its light passes through the housing 210 to display the engraved hollow mark on the front end of the housing 210.
  • The infrared sensor and the peripheral resistors are also arranged on the first surface.
  • A photoelectric sensor main chip and the first signal transmission wire harness are arranged on the second surface; the photoelectric sensor main chip receives the signal sent by the infrared sensor, processes it, and sends an activation instruction to the microphone module.
  • One end of the first signal transmission wire harness is connected to the circuit board 230, and the other end passes through the through hole provided in the center of the rear cover 240 and is connected to the signal transmission wire harness of the microphone module; the photoelectric switch both receives and sends instructions through this wire harness.
  • The voice interaction unit includes a microphone module.
  • FIG. 6 shows an exploded view of the structure of an embodiment of the microphone module.
  • The microphone module includes an upper cover 110, at least two sound-receiving microphones (not shown in the figure), a printed circuit board 120, a sealing ring 130 and a bottom cover 140. The inside of the upper cover 110 is hollow; the upper cover 110 includes a front end and a rear end, the front end is provided with at least two sound-receiving holes, and the rear end is open. The at least two sound-receiving microphones are arranged inside the upper cover 110, each corresponding to one of the sound-receiving holes, and each sound-receiving hole is provided with a sealing ring 130. The printed circuit board 120 is arranged inside the upper cover 110 and connected to the sound-receiving microphones. The bottom cover 140 is disposed on the rear end of the upper cover 110.
  • The microphone module further includes a second signal transmission wire harness connected to the printed circuit board 120, and a wire pressing plate 150 arranged on the printed circuit board 120 for pressing the second signal transmission wire harness.
  • The bottom cover 140 and the wire pressing plate 150 are fixed on the upper cover 110 by buckles and sealed with a sealant.
  • The upper cover 110 of the microphone module is silk-screened with a black mark (LOGO), which is convenient for users to identify.
  • The location is more conducive to the collection of sound.
  • The printed circuit board 120 includes a front side and a back side. Taking two microphones as an example, the two microphones are arranged at the two ends of the front side. After the ambient noise is sampled, its waveform is analyzed and phase-processed and then superimposed on the sampled waveform of the main microphone so that the noise cancels in phase. One microphone thus keeps a stable, clear recording while the other actively cancels physical noise; after algorithm processing the recorded sound is clearer, which solves the technical problem of poor microphone recording in a noisy environment. Dual microphones improve the signal-to-noise ratio in changing and complex sound environments, keep the recorded sound pure, and make later algorithm processing more accurate.
  • The front of the printed circuit board 120 is also provided with a voice signal main chip, which can perform processing such as noise reduction and algorithm optimization on the sound data captured by the two microphones.
  • The voice signal main chip includes a serial port interface for sending instructions to and receiving instructions from the photoelectric switch and the PC terminal.
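The phase-cancellation idea behind the dual-microphone design can be illustrated with synthetic signals. This is an idealized sketch under strong assumptions: the second microphone is taken to capture exactly the noise that reaches the main microphone, which real hardware does not achieve (practical systems typically use adaptive filtering). The function name, sample rate and tone frequencies are all made up for the example and do not come from the source.

```python
import math
from typing import List, Sequence


def cancel_noise(main: Sequence[float], reference: Sequence[float]) -> List[float]:
    """Superimpose the phase-inverted reference (noise) on the main signal."""
    if len(main) != len(reference):
        raise ValueError("signals must have the same length")
    return [m + (-r) for m, r in zip(main, reference)]


# Synthetic data: a 440 Hz "voice" tone plus 50 Hz hum on the main microphone;
# the reference microphone is assumed to pick up the hum alone.
RATE = 8000
N = RATE // 10
voice = [math.sin(2 * math.pi * 440 * n / RATE) for n in range(N)]
hum = [0.5 * math.sin(2 * math.pi * 50 * n / RATE) for n in range(N)]
main_mic = [v + h for v, h in zip(voice, hum)]

cleaned = cancel_noise(main_mic, hum)
residual = max(abs(c - v) for c, v in zip(cleaned, voice))
```

With these ideal inputs `residual` is at floating-point rounding level, meaning the hum is removed and the voice tone is left intact; with real microphones the reference would need delay and gain alignment before subtraction.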
  • a reset key is provided on the back of the printed circuit board 120 for subsequent software upgrade operations.
  • The back of the printed circuit board 120 is also provided with a connecting line jack for the second signal transmission wire harness, to maintain signal data transmission with the photoelectric switch and the PC terminal.
  • The bottom cover 140 and the wire pressing plate can be connected to the upper cover 110 by buckles and encapsulated with a sealant to prevent damage from external force.
  • An embodiment of the present disclosure also provides a voice interaction method, including:
  • controlling the smart device to perform an action corresponding to the target voice command;
  • turning on the voice interaction unit 100 according to the user's target limb instruction received and recognized by the photoelectric sensing unit 200, and controlling the smart device to perform the action corresponding to the target voice instruction received and recognized by the voice interaction unit 100.
  • The method also includes:
  • when the voice interaction unit 100 is turned on, judging whether the voice interaction unit 100 receives a voice command within a predetermined time and, if not, controlling the voice interaction unit 100 to be turned off.
  • The method also includes:
  • when the voice interaction unit 100 is turned on, judging whether the voice interaction unit 100 receives a voice command within a predetermined time and, if not, controlling the photoelectric sensing unit 200 to be turned off.
  • the method also includes:
  • the photoelectric sensing unit 200 and the voice interaction unit 100 are controlled to be turned on.
  • the specific voice interaction process of the voice interaction method is the same as the specific voice interaction process of the voice interaction system provided in the present disclosure, and will not be repeated here.
  • the embodiment of the present disclosure also provides a smart device, on which the voice interaction system provided by the embodiment of the present disclosure is provided.
  • the smart devices provided by the embodiments of the present disclosure may include smart refrigerators, smart washing machines, smart TVs, etc.
  • the application scenarios are not limited to household appliances, but can also be applied to application scenarios such as shopping guides in clothing shopping malls and self-service convenience stores.

Abstract

A voice interaction system and method, and an intelligent device. The voice interaction system comprises: a voice interaction unit (100) used for acquiring and recognizing a target voice instruction of a user; a photoelectric sensing unit (200) connected to the voice interaction unit (100) and used for receiving and recognizing a target limb instruction of the user and controlling the voice interaction unit (100) to be turned on and off according to the target limb instruction; and an instruction control unit (300) connected to the voice interaction unit (100) and used for determining whether the voice interaction unit (100) is in an on state and, if so, controlling the intelligent device to execute an action corresponding to the target voice instruction of the user received and recognized by the voice interaction unit (100). An intelligent voice speaker can thus be turned on by limb sensing, and interaction with the whole intelligent device can be achieved.

Description

Voice interaction system and method, and smart device
Cross-Reference to Related Applications
This application claims priority to Chinese Patent Application No. 202110594436.8, filed in China on May 28, 2021, the entire contents of which are incorporated herein by reference.
Technical Field
The present disclosure relates to the technical field of smart devices, and in particular to a voice interaction system and method, and a smart device.
Background
The Internet of Things has become an important driving force for a new round of global technological revolution and industrial transformation. Smart speakers of all kinds are now appearing on the market; "Xiaodu" launched by Baidu and Siri on Apple mobile phones are both examples of intelligent voice technology. The core idea is simple: to give machines near-human ability in voice dialogue so that they can become part of people's daily lives. As an important component of a smart voice speaker, the microphone determines its voice interaction capability. In the related art, for a smart voice speaker to interact, the microphone needs to give an indication that tells the consumer that voice recognition has started.
Summary
Embodiments of the present disclosure provide a voice interaction system and method, and a smart device, in which a smart voice speaker can be turned on by limb sensing so that the user can interact with the smart device as a whole.
The technical solutions provided by the embodiments of the present disclosure are as follows:
An embodiment of the present disclosure provides a voice interaction system, including:
a voice interaction unit, configured to collect and recognize a target voice instruction of a user;
a photoelectric sensing unit, connected to the voice interaction unit and configured to receive and recognize a target limb instruction of the user, and to control the voice interaction unit to be turned on and off according to the target limb instruction; and
an instruction control unit, connected to the voice interaction unit and configured to determine whether the voice interaction unit is in an on state and, if so, to control the smart device to perform an action corresponding to the target voice instruction of the user received and recognized by the voice interaction unit.
Exemplarily, the instruction control unit is further configured to determine, when the voice interaction unit is in the on state, whether the voice interaction unit receives a voice instruction within a predetermined time and, if not, to send an off instruction for the voice interaction unit to the voice interaction unit.
Exemplarily, the voice interaction unit is further configured to send an off instruction for the photoelectric sensing unit to the photoelectric sensing unit upon receiving the off instruction for the voice interaction unit.
Exemplarily, the instruction control unit is further configured to receive and recognize image data collected by an image acquisition unit and, when the image data includes a target gesture, to send an on instruction for the voice interaction unit to the voice interaction unit;
the voice interaction unit is further configured to send an on instruction for the photoelectric sensing unit to the photoelectric sensing unit upon receiving the on instruction for the voice interaction unit.
Exemplarily, the voice interaction unit and the instruction control unit communicate with each other through serial port instructions.
Exemplarily, the photoelectric sensing unit includes a photoelectric switch, and the photoelectric switch includes:
a housing that is hollow inside and includes a front end and a rear end, the front end being provided with an indication mark and the rear end being open;
a photoelectric sensor arranged in the housing;
a circuit board arranged in the housing and connected to the photoelectric sensor;
a rear cover snap-fitted onto the rear end of the housing; and
a signal transmission harness, one end of which is connected to the circuit board and the other end of which extends out of the rear cover.
Exemplarily, the rear end of the housing is connected to the rear cover by a snap fit.
Exemplarily, the front end is provided with a first through hole and a second through hole;
the photoelectric sensor is provided with a receiver and an emitter, the receiver being located at the first through hole and the emitter being located at the second through hole.
Exemplarily, the indication mark includes an intaglio hollowed-out mark.
Exemplarily, the photoelectric switch further includes a light source arranged in the housing.
Exemplarily, the voice interaction unit includes a microphone module, and the microphone module includes:
an upper cover that is hollow inside and includes a front end and a rear end, the front end being provided with at least two sound receiving holes and the rear end being open;
at least two sound receiving microphones arranged inside the upper cover, each sound receiving microphone being arranged in correspondence with one of the sound receiving holes, and a sealing ring being arranged at each sound receiving hole;
a printed circuit board arranged inside the upper cover and connected to the sound receiving microphones; and
a bottom cover arranged at the rear end of the upper cover.
Exemplarily, the microphone module further includes:
a signal transmission harness connected to the printed circuit board, and a wire pressing plate arranged on the printed circuit board for pressing down the signal transmission harness.
Exemplarily, the bottom cover and the wire pressing plate are fixed to the upper cover by snap fits and sealed with a sealant.
An embodiment of the present disclosure further provides a voice interaction method, including:
determining whether a voice interaction unit is in an on state;
if so, controlling a smart device to perform an action corresponding to a target voice instruction of a user received and recognized by the voice interaction unit;
if not, controlling the voice interaction unit to be turned on according to a target limb instruction of the user received and recognized by a photoelectric sensing unit, and controlling the smart device to perform an action corresponding to the target voice instruction of the user received and recognized by the voice interaction unit.
Exemplarily, the method further includes:
when the voice interaction unit is in the on state, determining whether the voice interaction unit receives a voice instruction within a predetermined time; if not, controlling the voice interaction unit to be turned off.
Exemplarily, the method further includes:
when the voice interaction unit is in the on state, determining whether the voice interaction unit receives a voice instruction within a predetermined time; if not, controlling the photoelectric sensing unit to be turned off.
Exemplarily, the method further includes:
receiving and recognizing image data collected by an image acquisition unit;
when the image data includes a target gesture, controlling the photoelectric sensing unit and the voice interaction unit to be turned on.
An embodiment of the present disclosure further provides a smart device on which the voice interaction system provided by the embodiments of the present disclosure is arranged.
The beneficial effects brought about by the embodiments of the present disclosure are as follows:
In the voice interaction system and method and the smart device provided by the embodiments of the present disclosure, the voice interaction unit is combined with the photoelectric sensing unit, so that the voice interaction unit can be turned on and off through limb movements and the user can interact with the whole device. This solves the technical problems of how the photoelectric sensing unit controls the voice interaction unit to be turned on, and how the instruction control unit turns the voice interaction unit and the photoelectric sensing unit on and off.
Brief Description of the Drawings
FIG. 1 is a structural block diagram of a voice interaction system provided in an embodiment of the present disclosure;
FIG. 2 is a logic block diagram of the voice interaction system provided in an embodiment of the present disclosure;
FIG. 3 is a schematic diagram of the communication among the instruction control unit, the voice interaction unit and the photoelectric sensing unit;
FIG. 4 is a schematic structural diagram of a photoelectric switch in an embodiment of the present disclosure;
FIG. 5 is a schematic structural diagram of the front end of the housing of the photoelectric switch in an embodiment of the present disclosure;
FIG. 6 is an exploded structural view of a microphone module in an embodiment of the present disclosure.
Detailed Description
In order to make the objectives, technical solutions and advantages of the embodiments of the present disclosure clearer, the technical solutions of the embodiments are described clearly and completely below with reference to the accompanying drawings. Obviously, the described embodiments are some, but not all, of the embodiments of the present disclosure. All other embodiments obtained by persons of ordinary skill in the art based on the described embodiments without creative effort fall within the protection scope of the present disclosure.
Unless otherwise defined, technical or scientific terms used in the present disclosure have the ordinary meaning understood by persons of ordinary skill in the art to which the present disclosure belongs. "First", "second" and similar words used in the present disclosure do not denote any order, quantity or importance, but are used only to distinguish different components. Likewise, words such as "a", "an" or "the" do not denote a limitation of quantity, but indicate the presence of at least one. Words such as "comprising" or "including" mean that the element or item preceding the word covers the elements or items listed after the word and their equivalents, without excluding other elements or items. Words such as "connected" or "coupled" are not limited to physical or mechanical connections, but may include electrical connections, whether direct or indirect. "Up", "down", "left", "right" and the like are used only to indicate relative positional relationships; when the absolute position of the described object changes, the relative positional relationship may change accordingly.
The voice interaction system provided by the embodiments of the present disclosure can be applied to various smart devices, for example, smart refrigerators, smart washing machines and smart TVs.
FIG. 1 is a structural block diagram of the voice interaction system provided in an embodiment of the present disclosure, and FIG. 2 is a logic block diagram of the voice interaction system provided in an embodiment of the present disclosure.
Referring to FIG. 1 and FIG. 2, the voice interaction system of a smart device provided by an embodiment of the present disclosure includes:
a voice interaction unit 100, configured to collect and recognize a target voice instruction of a user;
a photoelectric sensing unit 200, connected to the voice interaction unit 100 and configured to receive and recognize a target limb instruction of the user, and to control the voice interaction unit 100 to be turned on and off according to the target limb instruction; and
an instruction control unit 300, connected to the voice interaction unit 100 and configured to determine whether the voice interaction unit 100 is in an on state and, if so, to control the smart device to perform an action corresponding to the target voice instruction of the user received and recognized by the voice interaction unit 100.
In the above solution, the photoelectric sensing unit 200, the voice interaction unit 100 and the instruction control unit 300 are connected to form a voice interaction system that can be applied to a smart device. Combining the voice interaction unit 100 with the photoelectric sensing unit 200 allows the voice interaction unit 100 to be turned on and off through limb movements and allows the user to interact with the whole device. This solves the technical problems of how the photoelectric sensing unit 200 controls the voice interaction unit 100 to be turned on, and how the instruction control unit 300 turns the voice interaction unit 100 and the photoelectric sensing unit 200 on and off.
In some embodiments, the instruction control unit 300 is further configured to determine, when the voice interaction unit 100 is in the on state, whether the voice interaction unit 100 receives a voice instruction within a predetermined time and, if not, to send an off instruction for the voice interaction unit 100 to the voice interaction unit 100.
In some embodiments, the voice interaction unit 100 is further configured to send an off instruction for the photoelectric sensing unit 200 to the photoelectric sensing unit 200 upon receiving the off instruction for the voice interaction unit 100.
In some embodiments, the voice interaction system further includes an image acquisition unit 400 connected to the instruction control unit 300. The instruction control unit 300 is further configured to receive and recognize image data collected by the image acquisition unit 400 and, when the image data includes a target gesture, to send an on instruction for the voice interaction unit 100 to the voice interaction unit 100. The voice interaction unit 100 is further configured to send an on instruction for the photoelectric sensing unit 200 to the photoelectric sensing unit 200 upon receiving the on instruction for the voice interaction unit 100.
The voice interaction system provided by the embodiments of the present disclosure is described in more detail below:
In the voice interaction system provided by the embodiments of the present disclosure, the voice interaction unit 100 may include a microphone module (MIC) or the like. Depending on the use environment and the sound quality requirements, different models of silicon microphone may be selected for the voice interaction unit, and microphone modules with different numbers of silicon microphones may be designed. Taking the microphone module as an example, it may collect and recognize the target voice instruction of the user as follows: the microphone module is turned on, starts recording the user's voice, and sends the recorded audio to the instruction control unit 300.
The photoelectric sensing unit 200 may include a distance sensing switch device such as a photoelectric switch, a laser sensor switch, an electromagnetic induction sensor switch or a capacitive sensor switch. Taking the photoelectric switch as an example, the photoelectric switch is based on infrared sensing technology: it exploits the fact that an infrared signal is reflected with different intensity depending on the distance to an obstacle, and uses this to detect how near or far the obstacle is. Combining an infrared sensor with a switch circuit yields a distance-sensing switch. The photoelectric switch may receive and recognize the target limb instruction of the user as follows: the user performs a limb movement within the sensing distance of the photoelectric switch, for example approaching the photoelectric switch or waving a hand, and the photoelectric switch recognizes the limb movement instruction from the reflected light signal it receives.
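The reflection-intensity principle described above can be illustrated with a minimal Python sketch. It assumes the sensor delivers normalised reflected-intensity samples between 0 and 1; the function name, the threshold of 0.6 and the requirement of three consecutive samples are hypothetical choices for illustration and are not taken from the disclosure.

```python
def detect_limb_instruction(samples, threshold=0.6, min_consecutive=3):
    """Return True when the reflected intensity stays at or above
    `threshold` for `min_consecutive` samples in a row.

    A stronger reflection is treated as a closer object; requiring several
    consecutive samples is a simple debounce against brief flickers.
    All parameter values are illustrative assumptions.
    """
    run = 0
    for intensity in samples:
        # Extend the run while the object is "near", otherwise reset it.
        run = run + 1 if intensity >= threshold else 0
        if run >= min_consecutive:
            return True
    return False


# A hand approaching the switch: the reflection rises and stays high.
approaching = [0.1, 0.3, 0.7, 0.8, 0.9]
# A brief flicker: only two high samples in a row.
flicker = [0.1, 0.7, 0.2, 0.8, 0.9]
```

In this sketch a sustained strong reflection counts as a limb instruction, while an isolated spike does not; a real switch circuit may well implement the decision in hardware instead.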
The instruction control unit 300 may be a computer, an MCU or the like; for example, the instruction control unit 300 may be the PC side of the smart device.
In the voice interaction system of the embodiments of the present disclosure, the voice interaction unit 100 and the instruction control unit 300 communicate with each other through serial port instructions.
The specific structures of the voice interaction unit 100, the photoelectric sensing unit 200 and the instruction control unit 300 are further described later; the voice interaction process of the voice interaction system is first explained in more detail below.
As shown in FIG. 2, the voice interaction method of the voice interaction system of the smart device provided in the embodiments of the present disclosure may include the following processes:
1) The user turns on the voice interaction unit 100 on his or her own initiative; the instruction control unit 300 receives the target voice instruction sent by the voice interaction unit 100 and, according to the target voice instruction of the user received and recognized by the voice interaction unit 100, controls the smart device to perform the corresponding action, so as to realize voice interaction. In the logic block diagram this process is sequence number B → sequence number C.
2) When the voice interaction unit 100 is in the on state, the instruction control unit 300 determines whether the voice interaction unit 100 receives a voice instruction within a predetermined time; if not, it sends an off instruction for the voice interaction unit 100 to the voice interaction unit 100;
Specifically, for example, when the instruction control unit 300 detects, according to an algorithm, that no voice has been input to the voice interaction unit 100 for a certain period of time, the instruction control unit 300 sends an off instruction to the voice interaction unit 100 (sequence number ②); the voice interaction unit 100 stops recording and no longer sends voice instructions to the instruction control unit 300 (sequence number ③); at the same time, the voice interaction unit 100 sends an off command to the photoelectric sensing unit 200 (sequence number ④), and the photoelectric sensing unit 200 is turned off.
It should be noted that when there is no voice instruction for a period of time after the user has finished giving voice instructions, the instruction control unit 300 sends an off instruction to the voice interaction unit 100, and the voice interaction unit 100 may then be in a standby state. This not only saves power but also protects privacy, because in this state the user's voice is not recorded and saved by the voice interaction unit 100.
3) When the voice interaction unit 100 is in the off state and the photoelectric sensing unit 200 senses the target limb instruction of the user, the photoelectric sensing unit 200 sends an instruction to the voice interaction unit 100 to control it to turn on; the instruction control unit 300 receives the target voice instruction sent by the voice interaction unit 100 and, according to the target voice instruction of the user received and recognized by the voice interaction unit 100, controls the smart device to perform the corresponding action, so as to realize voice interaction;
For example, when the user approaches the photoelectric sensing unit 200 (that is, the distance between the user and the photoelectric sensing unit 200 is less than a predetermined distance) or performs a target limb movement (for example, waving a hand) in front of the sensor of the photoelectric sensing unit 200, the photoelectric sensing unit 200 sends an on instruction to the voice interaction unit 100; the voice interaction unit 100 starts recording and sends the recorded data to the instruction control unit 300; the instruction control unit 300 recognizes text from the recorded audio and performs the corresponding action according to the user's speech. In the logic block diagram this is sequence number A → sequence number B → sequence number C.
4) The instruction control unit 300 may send an on instruction to the voice interaction unit 100 on its own initiative (sequence number Y); the voice interaction unit 100 is turned on and sends the received audio data, which includes the target voice instruction, to the instruction control unit 300 (sequence number Z); the voice interaction unit 100 sends an on command to the photoelectric sensing unit 200, and the photoelectric sensing unit 200 is turned on.
For example, taking a smart refrigerator as the smart device, the voice interaction system further includes an image acquisition unit 400. When the user is farther from the photoelectric sensing unit 200 than a threshold distance, or when sensing is affected or sensitivity is disturbed, it may not be possible to turn on the photoelectric sensing unit 200 and the voice interaction unit 100 by a limb movement. In that case the instruction control unit 300 can rely on the image collected by the image acquisition unit 400: for example, when the collected image includes the target gesture, the instruction control unit 300 determines that the user intends to turn on the voice interaction unit 100 and sends an on instruction to the voice interaction unit 100 (sequence number V); at the same time, the voice interaction unit 100 sends an on instruction to the photoelectric sensing unit 200, so that the voice interaction unit 100 and the photoelectric sensing unit 200 are both turned on.
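The four processes above can be summarised as a small event-driven controller. The following Python sketch is one possible reading under stated assumptions: the class and method names, the use of explicit timestamps, and the 10-second default are hypothetical (the disclosure only speaks of a "predetermined time"), and the real units exchange serial port instructions rather than method calls.

```python
from dataclasses import dataclass, field


@dataclass
class VoiceInteractionSystem:
    """Illustrative model of units 100, 200 and 300 (names are assumptions)."""
    idle_timeout_s: float = 10.0      # assumed value for the "predetermined time"
    voice_on: bool = False            # state of the voice interaction unit 100
    photo_unit_on: bool = False       # state forwarded to the photoelectric sensing unit 200
    last_voice_at: float = 0.0
    log: list = field(default_factory=list)

    def _open(self, now, source):
        # The voice unit turns on and forwards the on state to the photoelectric unit.
        self.voice_on = True
        self.photo_unit_on = True
        self.last_voice_at = now
        self.log.append(f"open:{source}")

    def on_limb_instruction(self, now):
        # Process 3: a limb instruction opens the voice unit only while it is off.
        if not self.voice_on:
            self._open(now, "limb")

    def on_target_gesture(self, now):
        # Process 4: the control unit opens the voice unit after a camera gesture.
        if not self.voice_on:
            self._open(now, "gesture")

    def on_voice_command(self, now, command):
        # Process 1: commands are executed only while the voice unit is on.
        if not self.voice_on:
            return None
        self.last_voice_at = now
        self.log.append(f"execute:{command}")
        return command

    def tick(self, now):
        # Process 2: close both units after the idle timeout elapses.
        if self.voice_on and now - self.last_voice_at >= self.idle_timeout_s:
            self.voice_on = False
            self.photo_unit_on = False
            self.log.append("close:timeout")
```

A typical sequence would be `on_limb_instruction`, then one or more `on_voice_command` calls, then periodic `tick` calls until the idle timeout closes the units again.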
FIG. 3 is a schematic diagram of the communication among the instruction control unit 300, the voice interaction unit and the photoelectric sensing unit 200. Taking as a specific embodiment the case in which the instruction control unit 300 is the host (PC side) of the smart device, the voice interaction unit 100 includes a microphone module (MIC), and the photoelectric sensing unit 200 includes a photoelectric switch with an LED light, the workflow is as follows:
1) The units are connected as shown in FIG. 3: the photoelectric sensing unit 200 is connected to the voice interaction unit 100, and the voice interaction unit 100 is connected to the instruction control unit 300. The MIC defaults to the MUTE ON state, in which it remains on standby and does not output voice instructions; at the same time, the MIC sends the corresponding MUTE ON state to the photoelectric sensing unit 200, the photoelectric switch remains off (MUTE ON state), and the LED light is off;
2) When the PC side sends a MUTE ON instruction to the MIC through the UART (Universal Asynchronous Receiver/Transmitter), the MIC is turned off; at the same time, the MIC sends its current state to the photoelectric sensing unit 200 through the serial port, the photoelectric switch is turned off (MUTE ON state), and the LED light is turned off;
3) When the PC side sends a MUTE OFF instruction to the MIC through the UART, the MIC is turned on; at the same time, the MIC sends its current state to the photoelectric sensing unit 200 through the serial port, the photoelectric switch is turned on (MUTE OFF state), and the LED light is turned on. The photoelectric switch can receive the user's limb instruction only in the MUTE ON state (LED off); in the MUTE OFF state (LED on), it cannot receive the user's limb instruction.
Specifically, the control instructions in the voice interaction system are shown in Table 1:
Table 1: PC-MIC-photoelectric switch control instructions
Figure PCTCN2021130287-appb-000001
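The MUTE ON / MUTE OFF workflow can be sketched in Python as follows. Table 1 is available here only as an image, so the actual instruction codes are not reproduced; the enum values below are placeholder strings, and the class names, method names and direct method calls (standing in for UART and serial port transfers) are all hypothetical.

```python
from enum import Enum


class Mute(Enum):
    ON = "MUTE ON"    # microphone on standby, not outputting voice instructions
    OFF = "MUTE OFF"  # microphone turned on and recording


class PhotoelectricSwitch:
    """Mirrors the state reported by the MIC; the LED follows that state."""

    def __init__(self):
        self.state = Mute.ON

    def apply(self, state):
        self.state = state

    @property
    def led_on(self):
        # LED is lit only in the MUTE OFF state.
        return self.state is Mute.OFF

    def accepts_limb_instruction(self):
        # Limb instructions are accepted only in the MUTE ON state (LED off).
        return self.state is Mute.ON


class MicModule:
    """Receives instructions from the PC side and forwards its state."""

    def __init__(self, switch):
        self.switch = switch
        self.state = Mute.ON            # default state per the workflow
        self.switch.apply(self.state)

    def receive(self, instruction):
        # Stand-in for a UART instruction from the PC side.
        self.state = instruction
        # Stand-in for the serial port status message to the switch.
        self.switch.apply(instruction)

    @property
    def recording(self):
        return self.state is Mute.OFF


switch = PhotoelectricSwitch()
mic = MicModule(switch)
```

Calling `mic.receive(Mute.OFF)` models step 3 of the workflow: the microphone starts recording, the LED turns on, and the switch stops accepting limb instructions until a MUTE ON instruction arrives.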
以上是针对本公开实施例提供的智能设备的语音交互系统的逻辑设计进行的说明,以下对本公开实施例提供的智能设备的语音交互系统中各单元从结构上再进行详细说明。The above is the description of the logical design of the voice interaction system for smart devices provided by the embodiments of the present disclosure. The structure of each unit in the voice interaction system for smart devices provided by the embodiments of the present disclosure will be described in detail below.
在一些实施例中,如图4和图5所示,所述光电感应单元200包括光电开关,所述光电开关包括:外壳210、光电传感器220、线路板230、后盖240和第一信号传输线束(图中未示意),所述外壳210的内部中空,所述外壳210包括前端和后端,前端设有指示标识211,后端开口;所述光电传感器220设置于所述外壳210内;所述线路板230设置于外壳210内,所述线路板230与所述光电传感器220连接;所述后盖240扣装在所述外壳210的后端;所述第一信号传输线束一端与线路板230连接,另一端伸出所述后盖240,所述光电开关还包括设置于所述外壳210内的光源。例如,所述光源可以包括LED灯。In some embodiments, as shown in FIG. 4 and FIG. 5, the photoelectric sensing unit 200 includes a photoelectric switch, and the photoelectric switch includes: a housing 210, a photoelectric sensor 220, a circuit board 230, a rear cover 240 and a first signal transmission Wire harness (not shown in the figure), the interior of the housing 210 is hollow, the housing 210 includes a front end and a rear end, the front end is provided with an indicator mark 211, and the rear end is open; the photoelectric sensor 220 is arranged in the housing 210; The circuit board 230 is arranged in the housing 210, and the circuit board 230 is connected to the photoelectric sensor 220; the rear cover 240 is buckled on the rear end of the housing 210; one end of the first signal transmission harness is connected to the circuit The board 230 is connected, and the other end protrudes from the back cover 240 , and the photoelectric switch also includes a light source arranged in the housing 210 . For example, the light source may comprise LED lights.
在一些实施例中,如图4所示,所述外壳210的后端与所述后盖240通过卡扣连接。例如,所述外壳210的后端可设置卡扣槽,所述后盖240上设卡扣241,通过将后盖240上的卡扣241卡在所述外壳210的卡扣槽内,通过所述后盖240盖住外壳210后,可以防止脱落。In some embodiments, as shown in FIG. 4 , the rear end of the housing 210 is connected to the rear cover 240 through buckling. For example, the rear end of the housing 210 can be provided with a buckle groove, the rear cover 240 is provided with a buckle 241, and the buckle 241 on the back cover 240 is stuck in the buckle groove of the housing 210, and the After the rear cover 240 covers the casing 210, it can prevent it from falling off.
In addition, in some embodiments, as shown in FIG. 5, the front end is provided with a first through hole 213 and a second through hole 214. The photoelectric sensor 220 has a receiver and an emitter; the receiver is located at the first through hole 213 and the emitter at the second through hole 214, so that both are exposed.
In the above solution, the first through hole 213 and the second through hole 214 are sized so that the receiver and the emitter are exposed; for example, the diameter of each through hole may be 2.8 mm.
In addition, in some embodiments of the present disclosure, the indicator mark includes an intaglio (recessed) hollow mark. With the intaglio hollow mark on the front-end surface of the housing 210, the mark can be displayed clearly on the surface of the photoelectric switch housing 210. For example, the mark may be text such as "Wave Hand To Speak", reminding the user that waving a hand activates the photoelectric switch and wakes the microphone for recording.
The recess depth of the intaglio hollow mark on the front-end surface of the housing 210 may be about 0.3 mm. Compared with a relief (raised) hollow mark, an intaglio hollow mark gives clearer lettering. Relief lettering is thicker: if the housing 210 is injection-molded and its surface is painted, the housing 210 transmits light poorly, so the thicker the lettering, the darker it appears and the worse the contrast. With the intaglio hollow form, the lettering is thinner, appears brighter, and has better contrast, which solves the technical problem of the mark on the housing 210 being unclear when the photoelectric switch is operating.
In addition, it should be noted that the surface of the housing 210 may be painted, for example in Pantone 656C, and the paint thickness may be 0.7~0.12 μm, so that the LED light passes through evenly and the lettering of the intaglio hollow mark is displayed clearly.
In addition, in the embodiments of the present disclosure, the photoelectric sensor 220 in the photoelectric switch may be an infrared sensor. The value of the infrared sensor's peripheral resistor determines the range of the drive current, which in turn determines the sensing distance. For example, depending on the actual operating environment, the sensing distance of the photoelectric switch may be about 80 mm, but is not limited thereto.
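The dependence of drive current on the peripheral resistor can be illustrated with Ohm's law for a series resistor. The sketch below is only an illustration: the supply voltage, the emitter forward voltage, and the resistor values are assumptions that do not appear in the disclosure, and the disclosure gives no formula relating current to sensing distance, so none is modeled here.

```python
def emitter_current_ma(supply_v: float, forward_v: float, resistor_ohm: float) -> float:
    """Series-resistor drive current in mA: I = (V_supply - V_forward) / R."""
    return (supply_v - forward_v) / resistor_ohm * 1000.0


# Assumed values for illustration only: 3.3 V supply, 1.2 V forward voltage.
for resistor in (100.0, 220.0, 470.0):
    current = emitter_current_ma(3.3, 1.2, resistor)
    print(f"R = {resistor:.0f} ohm -> I = {current:.1f} mA")
```

A smaller resistor gives a larger emitter current, which is consistent with the qualitative statement that the resistor value sets the current range and therefore the sensing distance.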
The emitter of the infrared sensor emits infrared light. When the user makes a body movement within the sensing distance (for example, approaching the photoelectric switch or waving a hand), the emitted infrared light is reflected back by the obstructing object. The receiver of the infrared sensor then receives the optical signal and converts it into an electrical signal. Based on this electrical signal, the photoelectric switch turns on the LED lamp and, at the same time, wakes the microphone module through a UART instruction.
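The trigger behavior just described can be sketched as a small state model. This is a minimal sketch under stated assumptions, not the firmware of the disclosed switch: the normalized receiver level, the threshold, the `WAKE_MIC_COMMAND` bytes, and the class name `PhotoelectricSwitch` are all hypothetical, and a list stands in for the UART transmit line.

```python
from dataclasses import dataclass, field

# Hypothetical wake-up frame; the disclosure only says a UART instruction is used.
WAKE_MIC_COMMAND = b"WAKE"


@dataclass
class PhotoelectricSwitch:
    """Toy model: reflected infrared light -> LED on + wake instruction."""

    threshold: float = 0.5   # assumed receiver level that counts as a reflection
    led_on: bool = False
    uart_out: list = field(default_factory=list)  # stands in for the UART line

    def on_receiver_sample(self, level: float) -> bool:
        """Process one receiver reading; return True if a wake-up was sent."""
        if level < self.threshold:
            return False     # nothing reflecting within the sensing distance
        if not self.led_on:
            self.led_on = True                      # light the LED behind the mark
            self.uart_out.append(WAKE_MIC_COMMAND)  # wake the microphone module
            return True
        return False         # already triggered; do not resend


switch = PhotoelectricSwitch()
events = [switch.on_receiver_sample(v) for v in (0.1, 0.2, 0.9, 0.95)]
print(events, switch.led_on, switch.uart_out)
```

Sending the wake instruction only on the first reading above the threshold is a design choice of this sketch; it avoids flooding the serial line while the hand stays in front of the sensor.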
In addition, in a specific exemplary embodiment, the circuit board 230 in the photoelectric switch includes a first surface and a second surface facing away from each other. An LED lamp is arranged on the first surface to provide the light source for the photoelectric switch. When the LED lamp is lit, its light passes through the housing 210 and illuminates the intaglio hollow mark on the front end of the housing 210.
The infrared sensor, its peripheral resistor, and related components are also arranged on the first surface.
A photoelectric sensing main chip and the first signal transmission harness are arranged on the second surface. The photoelectric sensing main chip receives the signal sent by the infrared sensor, processes it, and sends a turn-on instruction to the microphone module. One end of the first signal transmission harness is connected to the circuit board 230; the other end passes through a through hole in the center of the rear cover 240 and is connected to the first signal transmission harness of the microphone module. The photoelectric switch both receives and sends instructions through the signal transmission harness.
In addition, in some embodiments, the voice interaction unit includes a microphone module. The figure shows an exploded structural view of one embodiment of the microphone module.
As shown in FIG. 6, the microphone module includes an upper cover 110, at least two sound-receiving microphones (not shown in the figure), a printed circuit board 120, sealing rings 130, and a bottom cover 140. The upper cover 110 is hollow and has a front end and a rear end; the front end is provided with at least two sound-receiving holes, and the rear end is open. The at least two sound-receiving microphones are arranged inside the upper cover 110, each corresponding to one sound-receiving hole, and a sealing ring 130 is provided at each sound-receiving hole. The printed circuit board 120 is arranged inside the upper cover 110 and is connected to the sound-receiving microphones. The bottom cover 140 is arranged at the rear end of the upper cover 110.
In some embodiments, the microphone module further includes a second signal transmission harness connected to the printed circuit board 120, and a wire-pressing plate 150 arranged on the printed circuit board 120 to hold down the second signal transmission harness. The bottom cover 140 and the wire-pressing plate are fixed to the upper cover 110 by snaps and sealed with sealant.
As an exemplary embodiment, a black mark (logo) is silk-screened on the upper cover 110 of the microphone module so that users can identify it easily. The surface of the upper cover 110 has two sound-receiving holes, one at the top and one at the bottom, aligned with the positions of the sealing rings 130, which aids sound collection.
As an exemplary embodiment, the printed circuit board 120 includes a front side and a back side. Taking two microphones as an example, the two microphones are arranged at the two ends of the front side. After the ambient noise is sampled, its waveform is analyzed, phase-processed, and superimposed on the waveform sampled by the main microphone so that the noise is cancelled in phase. One microphone thus maintains stable, clear recording while the other actively cancels physical noise; after further algorithmic processing, the recorded sound is clearer. This solves the technical problem of poor recording quality in noisy environments. In changing and complex acoustic environments, the dual microphones can improve the signal-to-noise ratio and keep the recorded sound clean, so that subsequent algorithmic processing is more accurate.
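The phase-cancellation idea can be shown with a deliberately idealized sketch. It assumes that the second microphone captures only the noise, perfectly time-aligned with the noise in the main microphone; the disclosure does not specify the actual algorithm, and a practical system would need delay and gain estimation (for example, an adaptive filter). The function name `cancel_noise` and the test signals are hypothetical.

```python
import math


def cancel_noise(main, reference):
    """Superimpose the phase-inverted reference samples on the main samples."""
    return [m + (-r) for m, r in zip(main, reference)]


n = 200
speech = [0.5 * math.sin(2 * math.pi * 5 * i / n) for i in range(n)]
noise = [0.3 * math.sin(2 * math.pi * 50 * i / n) for i in range(n)]

main_mic = [s + v for s, v in zip(speech, noise)]  # main microphone: speech plus noise
ref_mic = noise                                    # idealized second microphone: noise only

cleaned = cancel_noise(main_mic, ref_mic)
residual = max(abs(c - s) for c, s in zip(cleaned, speech))
print(f"largest residual error: {residual:.2e}")
```

Under these ideal assumptions the residual is at the level of floating-point rounding. With real microphones the reference also contains some speech and arrives with a different delay, which is why further algorithmic processing is mentioned in the text.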
A voice signal main chip is also arranged on the front side of the printed circuit board 120. It performs noise reduction, algorithmic optimization, and similar processing on the sound data captured by the two microphones. The voice signal main chip also includes a serial port interface for sending and receiving instructions to and from the photoelectric switch and the PC side.
A reset key is provided on the back side of the printed circuit board 120 for subsequent software upgrade operations.
A connector socket is also provided on the back side of the printed circuit board 120 for connecting the second signal transmission harness, which maintains signal and data transmission with the photoelectric switch and the PC side.
The bottom cover 140 and the wire-pressing cover may be connected to the upper cover 110 by snaps and sealed with sealant to prevent damage from external force.
An embodiment of the present disclosure further provides a voice interaction method, including:
determining whether the voice interaction unit 100 is on;
if so, controlling the smart device to perform an action corresponding to the user's target voice instruction received and recognized by the voice interaction unit 100;
if not, turning on the voice interaction unit 100 according to the user's target body instruction received and recognized by the photoelectric sensing unit 200, and then controlling the smart device to perform an action corresponding to the user's target voice instruction received and recognized by the voice interaction unit 100.
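The branching in the method above can be sketched as a single function. This is an illustrative sketch only: the function name `handle_interaction`, its boolean and string inputs, and the `"execute:"` action label are hypothetical stand-ins for the recognition results and device control described in the disclosure.

```python
def handle_interaction(voice_unit_on, body_instruction, voice_instruction):
    """Return (voice_unit_on, action) for one pass through the method.

    body_instruction: True if the photoelectric sensing unit recognized the
        target body instruction.
    voice_instruction: the recognized target voice instruction, or None.
    """
    if not voice_unit_on:
        if not body_instruction:
            return False, None      # nothing turned the voice interaction unit on
        voice_unit_on = True        # photoelectric sensing unit turns it on
    if voice_instruction is None:
        return voice_unit_on, None  # unit is on, but no instruction was recognized
    return voice_unit_on, f"execute:{voice_instruction}"


print(handle_interaction(True, False, "open door"))
print(handle_interaction(False, True, "open door"))
print(handle_interaction(False, False, "open door"))
```

The third call shows the case where the voice interaction unit is off and no body instruction is recognized: the spoken instruction is ignored.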
In some exemplary embodiments, the method further includes:
when the voice interaction unit 100 is on, determining whether the voice interaction unit 100 has received a voice instruction within a predetermined time; if not, turning off the voice interaction unit 100.
In some exemplary embodiments, the method further includes:
when the voice interaction unit 100 is on, determining whether the voice interaction unit 100 has received a voice instruction within a predetermined time; if not, turning off the photoelectric sensing unit 200.
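The idle-timeout behavior can be sketched as follows. The sketch combines the two embodiments above (turning off the voice interaction unit and turning off the photoelectric sensing unit) into one step, which is an assumption made for brevity; the class name `IdleShutdown`, the 10-second value, and the explicit `now` timestamps are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class IdleShutdown:
    timeout_s: float            # the "predetermined time"; the value is an assumption
    last_voice_at: float = 0.0  # time of the most recent voice instruction
    voice_unit_on: bool = True
    photo_unit_on: bool = True

    def on_voice_instruction(self, now: float) -> None:
        """Record that a voice instruction was received at time `now`."""
        self.last_voice_at = now

    def tick(self, now: float) -> bool:
        """Turn both units off once the idle time reaches the timeout."""
        idle = now - self.last_voice_at
        if self.voice_unit_on and idle >= self.timeout_s:
            self.voice_unit_on = False
            self.photo_unit_on = False
        return self.voice_unit_on


timer = IdleShutdown(timeout_s=10.0)
timer.on_voice_instruction(now=3.0)
still_on = timer.tick(now=9.0)   # 6 s idle: stays on
after = timer.tick(now=13.0)     # 10 s idle: both units turned off
print(still_on, after, timer.photo_unit_on)
```

Passing timestamps in explicitly, instead of reading a clock inside the class, keeps the example deterministic.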
In some exemplary embodiments, the method further includes:
receiving and recognizing image data collected by the image acquisition unit 400 in the smart device;
when the image data includes a target gesture, turning on the photoelectric sensing unit 200 and the voice interaction unit 100.
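The gesture-triggered wake-up can be sketched as an ordered list of turn-on instructions. The ordering, in which the instruction control unit turns on the voice interaction unit and the voice interaction unit then turns on the photoelectric sensing unit, follows claim 4; the gesture labels, the tuple layout, and the function name `wake_on_gesture` are hypothetical, and gesture recognition itself is not modeled.

```python
def wake_on_gesture(detected_gestures, target_gesture="wave"):
    """Return the ordered turn-on instructions issued for one image frame.

    Each entry is (sender, receiver, instruction).
    """
    if target_gesture not in detected_gestures:
        return []  # no target gesture: leave both units as they are
    return [
        ("instruction_control_unit", "voice_interaction_unit", "ON"),
        ("voice_interaction_unit", "photoelectric_sensing_unit", "ON"),
    ]


print(wake_on_gesture({"wave", "fist"}))
print(wake_on_gesture({"fist"}))
```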
The specific voice interaction process of this method is the same as that of the voice interaction system provided by the present disclosure and is not repeated here.
An embodiment of the present disclosure further provides a smart device on which the voice interaction system provided by the embodiments of the present disclosure is installed.
The smart device provided by the embodiments of the present disclosure may include a smart refrigerator, a smart washing machine, a smart TV, and the like. Application scenarios are not limited to household appliances; the system may also be used for shopping guidance in clothing stores, self-service convenience stores, and similar scenarios.
The following points should be noted:
(1) The drawings of the embodiments of the present disclosure relate only to the structures involved in those embodiments; for other structures, refer to conventional designs.
(2) For clarity, in the drawings used to describe the embodiments of the present disclosure, the thicknesses of layers or regions are enlarged or reduced; that is, the drawings are not drawn to actual scale. It will be understood that when an element such as a layer, film, region, or substrate is referred to as being "on" or "under" another element, it may be "directly" "on" or "under" the other element, or intervening elements may be present.
(3) Where no conflict arises, the embodiments of the present disclosure and the features in the embodiments may be combined with one another to obtain new embodiments.
The above are only specific embodiments of the present disclosure, but the protection scope of the present disclosure is not limited thereto; the protection scope of the present disclosure shall be defined by the protection scope of the claims.

Claims (18)

  1. A voice interaction system, characterized by comprising:
    a voice interaction unit, configured to collect and recognize a target voice instruction of a user;
    a photoelectric sensing unit, connected to the voice interaction unit and configured to receive and recognize a target body instruction of the user and to switch the voice interaction unit on or off according to the target body instruction;
    an instruction control unit, connected to the voice interaction unit and configured to determine whether the voice interaction unit is on and, if so, to control a smart device to perform an action corresponding to the target voice instruction of the user received and recognized by the voice interaction unit.
  2. The voice interaction system according to claim 1, characterized in that,
    the instruction control unit is further configured to determine, when the voice interaction unit is on, whether the voice interaction unit has received a voice instruction within a predetermined time and, if not, to send a turn-off instruction for the voice interaction unit to the voice interaction unit.
  3. The voice interaction system according to claim 2, characterized in that,
    the voice interaction unit is further configured to send a turn-off instruction for the photoelectric sensing unit to the photoelectric sensing unit upon receiving the turn-off instruction for the voice interaction unit.
  4. The voice interaction system according to claim 1, characterized in that,
    the voice interaction system further comprises an image acquisition unit, and the instruction control unit is connected to the image acquisition unit and configured to receive and recognize image data collected by the image acquisition unit and, when the image data includes a target gesture, to send a turn-on instruction for the voice interaction unit to the voice interaction unit;
    the voice interaction unit is further configured to send a turn-on instruction for the photoelectric sensing unit to the photoelectric sensing unit upon receiving the turn-on instruction for the voice interaction unit.
  5. The voice interaction system according to claim 1, characterized in that,
    the voice interaction unit and the instruction control unit communicate with each other through serial port instructions.
  6. The voice interaction system according to claim 1, characterized in that,
    the photoelectric sensing unit comprises a photoelectric switch, and the photoelectric switch comprises:
    a housing, wherein the housing is hollow and has a front end and a rear end, the front end is provided with an indicator mark, and the rear end is open;
    a photoelectric sensor arranged in the housing;
    a circuit board arranged in the housing and connected to the photoelectric sensor;
    a rear cover snap-fitted onto the rear end of the housing;
    a signal transmission harness, one end of which is connected to the circuit board and the other end of which extends out of the rear cover.
  7. The voice interaction system according to claim 6, characterized in that,
    the rear end of the housing is connected to the rear cover by a snap fit.
  8. The voice interaction system according to claim 6, characterized in that,
    the front end is provided with a first through hole and a second through hole; the photoelectric sensor is provided with a receiver and an emitter, the receiver is located at the first through hole, and the emitter is located at the second through hole.
  9. The voice interaction system according to claim 6, characterized in that,
    the indicator mark comprises an intaglio hollow mark.
  10. The voice interaction system according to claim 6, characterized in that,
    the photoelectric switch further comprises a light source arranged in the housing.
  11. The voice interaction system according to claim 1, characterized in that,
    the voice interaction unit comprises a microphone module, and the microphone module comprises:
    an upper cover, wherein the upper cover is hollow and has a front end and a rear end, the front end is provided with at least two sound-receiving holes, and the rear end is open;
    at least two sound-receiving microphones arranged inside the upper cover, each sound-receiving microphone corresponding to one of the sound-receiving holes, with a sealing ring provided at each sound-receiving hole;
    a printed circuit board arranged inside the upper cover and connected to the sound-receiving microphones;
    a bottom cover arranged at the rear end of the upper cover.
  12. The voice interaction system according to claim 11, characterized in that,
    the microphone module further comprises:
    a signal transmission harness connected to the printed circuit board, and a wire-pressing plate arranged on the printed circuit board to hold down the signal transmission harness.
  13. The voice interaction system according to claim 12, characterized in that,
    the bottom cover and the wire-pressing plate are fixed to the upper cover by snaps and sealed with sealant.
  14. A voice interaction method, characterized by being applied to the voice interaction system according to any one of claims 1 to 13, the method comprising:
    determining whether the voice interaction unit is on;
    if so, controlling the smart device to perform an action corresponding to the target voice instruction of the user received and recognized by the voice interaction unit;
    if not, turning on the voice interaction unit according to the target body instruction of the user received and recognized by the photoelectric sensing unit, and controlling the smart device to perform an action corresponding to the target voice instruction of the user received and recognized by the voice interaction unit.
  15. The voice interaction method according to claim 14, characterized in that,
    the method further comprises:
    when the voice interaction unit is on, determining whether the voice interaction unit has received a voice instruction within a predetermined time; if not, turning off the voice interaction unit.
  16. The voice interaction method according to claim 14, characterized in that,
    the method further comprises:
    when the voice interaction unit is on, determining whether the voice interaction unit has received a voice instruction within a predetermined time; if not, turning off the photoelectric sensing unit.
  17. The voice interaction method according to claim 15, characterized in that,
    the method further comprises:
    receiving and recognizing image data collected by the image acquisition unit;
    when the image data includes a target gesture, turning on the photoelectric sensing unit and the voice interaction unit.
  18. A smart device, characterized in that the voice interaction system according to any one of claims 1 to 13 is provided on the smart device.
PCT/CN2021/130287 2021-05-28 2021-11-12 Voice interaction system and method, and intelligent device WO2022247156A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110594436.8A CN113192509A (en) 2021-05-28 2021-05-28 Voice interaction system and method and intelligent device
CN202110594436.8 2021-05-28

Publications (1)

Publication Number Publication Date
WO2022247156A1 true WO2022247156A1 (en) 2022-12-01

Family

ID=76986329

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/130287 WO2022247156A1 (en) 2021-05-28 2021-11-12 Voice interaction system and method, and intelligent device

Country Status (2)

Country Link
CN (1) CN113192509A (en)
WO (1) WO2022247156A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113192509A (en) * 2021-05-28 2021-07-30 北京京东方显示技术有限公司 Voice interaction system and method and intelligent device
WO2023184535A1 (en) * 2022-04-02 2023-10-05 京东方科技集团股份有限公司 Speech interaction system and method, and smart device

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140173440A1 (en) * 2012-12-13 2014-06-19 Imimtek, Inc. Systems and methods for natural interaction with operating systems and application graphical user interfaces using gestural and vocal input
US20180285062A1 (en) * 2017-03-28 2018-10-04 Wipro Limited Method and system for controlling an internet of things device using multi-modal gesture commands
CN108320742A (en) * 2018-01-31 2018-07-24 广东美的制冷设备有限公司 Voice interactive method, smart machine and storage medium
US20200064458A1 (en) * 2018-08-22 2020-02-27 Google Llc Radar-Based Gesture Enhancement for Voice Interfaces
CN109754801A (en) * 2019-01-15 2019-05-14 东莞松山湖国际机器人研究院有限公司 A kind of voice interactive system and method based on gesture identification
CN111182385A (en) * 2019-11-19 2020-05-19 广东小天才科技有限公司 Voice interaction control method and intelligent sound box
CN113192509A (en) * 2021-05-28 2021-07-30 北京京东方显示技术有限公司 Voice interaction system and method and intelligent device

Also Published As

Publication number Publication date
CN113192509A (en) 2021-07-30

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21942720

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE