WO2020038494A1 - Intelligent speaker and method for using intelligent speaker

Publication number
WO2020038494A1
WO2020038494A1 PCT/CN2019/107869 CN2019107869W WO2020038494A1 WO 2020038494 A1 WO2020038494 A1 WO 2020038494A1 CN 2019107869 W CN2019107869 W CN 2019107869W WO 2020038494 A1 WO2020038494 A1 WO 2020038494A1
Authority
WO
WIPO (PCT)
Prior art keywords
wireless communication
voice information
communication module
smart speaker
sound source
Application number
PCT/CN2019/107869
Other languages
French (fr)
Chinese (zh)
Inventor
邱振青
吴海全
张恩勤
曹磊
师瑞文
Original Assignee
深圳市冠旭电子股份有限公司
Application filed by 深圳市冠旭电子股份有限公司
Publication of WO2020038494A1

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00 Television systems
    • H04N7/14 Systems for two-way working
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00 Television systems
    • H04N7/14 Systems for two-way working
    • H04N7/141 Systems for two-way working between two video terminals, e.g. videophone
    • H04N7/142 Constructional details of the terminal equipment, e.g. arrangements of the camera and the display
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00 Television systems
    • H04N7/14 Systems for two-way working
    • H04N7/15 Conference systems
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00 Details of transducers, loudspeakers or microphones
    • H04R1/20 Arrangements for obtaining desired frequency or directional characteristics

Definitions

  • the invention relates to the technical field of smart homes, and in particular, to a smart speaker, a method for using the smart speaker, and a computer-readable storage medium.
  • embodiments of the present invention provide a smart speaker and a method for using the smart speaker, which can simultaneously display image information of a video conference in multiple directions, and can improve the utilization rate of the smart speaker while being convenient for users.
  • a first aspect of the embodiments of the present invention provides a smart speaker, including:
  • a control module, a microphone array, a wireless communication module, a camera, and at least two screens;
  • the microphone array, the wireless communication module, the camera, and the screens are all connected to the control module;
  • the microphone array is configured to collect voice information and determine a sound source direction according to the voice information;
  • the control module is configured to control, according to the sound source direction, the screen corresponding to the sound source direction to display image information collected by the camera and/or image information received by the wireless communication module, and to control the smart speaker to play voice information collected by the microphone array and/or voice information received by the wireless communication module.
  • a second aspect of the embodiments of the present invention provides a method for using a smart speaker.
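  • the smart speaker includes a control module, a microphone array, a wireless communication module, a camera, and at least two screens, with the microphone array, the wireless communication module, the camera, and the screens all connected to the control module, and the method includes:
  • the microphone array collects voice information and determines a sound source direction according to the voice information; and
  • the control module controls, according to the sound source direction, the screen corresponding to the sound source direction to display image information collected by the camera and/or image information received by the wireless communication module, and controls the smart speaker to play voice information collected by the microphone array and/or voice information received by the wireless communication module.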
  • a third aspect of the embodiments of the present invention provides a computer-readable storage medium, including: the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the method mentioned in the second aspect is implemented.
  • the embodiment of the present invention has the following beneficial effect: in this embodiment, the smart speaker includes a control module, a microphone array, a wireless communication module, a camera, and at least two screens; the microphone array, the wireless communication module, the camera, and the screens are all connected to the control module; and the method includes: the microphone array collects voice information and determines a sound source direction according to the voice information, and the control module controls, according to the sound source direction, the screen corresponding to the sound source direction to display image information collected by the camera and/or image information received by the wireless communication module, and controls the smart speaker to play voice information collected by the microphone array and/or voice information received by the wireless communication module.
  • the participants in all directions can clearly see the video conference picture while hearing the sound, which greatly improves the utilization rate of the smart speaker, and has strong ease of use and practicality.
  • FIG. 1 is a schematic structural diagram of a smart speaker according to a first embodiment of the present invention.
  • FIG. 2 is a schematic diagram of a specific structure of a smart speaker provided in Embodiment 2 of the present invention.
  • FIG. 3 is a schematic flowchart of a method for using a smart speaker according to a third embodiment of the present invention.
  • FIG. 4 is a schematic diagram of a specific implementation process of a method for using a smart speaker according to a fourth embodiment of the present invention.
  • FIG. 5 is a schematic diagram of a specific implementation process of a method for using a smart speaker according to a fifth embodiment of the present invention.
  • FIG. 6 is a schematic structural diagram of a video conference system according to a sixth embodiment of the present invention.
  • the term “if” can be construed as “when” or “once” or “in response to a determination” or “in response to a detection”, depending on the context.
  • the phrase “if determined” or “if [the described condition or event] is detected” can be interpreted, depending on the context, to mean “once determined” or “in response to the determination” or “once [the described condition or event] is detected” or “in response to detecting [the described condition or event]”.
  • the present invention may include any number of smart speakers to enable two or more users to conduct a video conference, wherein the smart speakers include wireless speakers.
  • FIG. 1 is a schematic structural diagram of a smart speaker according to a first embodiment of the present invention.
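  • the smart speaker may include a control module 11, a microphone array 12, a wireless communication module 13, a camera 14, and a screen 15.
  • the microphone array 12, the wireless communication module 13, the camera 14, and the screen 15 are all connected to the control module 11.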
  • the microphone array 12 is configured to collect voice information and determine a sound source direction according to the voice information; the sound source direction may be determined by a positioning algorithm based on the time difference of arrival. It should be understood that the microphone array 12 is a system composed of a certain number of microphones that samples and processes the spatial characteristics of the sound field. Optionally, there are seven microphones, arranged in a ring.
  • the wireless communication module 13 is configured to interact with the server throughout the video conference, sending locally collected voice information and/or image information to the server and receiving voice information and/or image information collected by the peer.
  • the wireless communication module 13 may include a WiFi communication sub-module and a Bluetooth communication sub-module.
  • the voice information received from the server is voice information other than that collected by the microphone array 12, and/or the image information received from the server is image information other than that collected by the camera 14. It should be noted that, because the smart speaker in this application is mainly used in video conference scenarios, voice information collected by the microphone array on the smart speaker also needs to be played by the smart speaker so that local users can hear it.
  • the camera 14 is configured to collect image information of a user. It should be noted that the type and number of the cameras 14 can be flexibly selected according to actual conditions, including, but not limited to, a common camera, a 360-degree panoramic camera, or a camera array.
  • the screen 15 is configured to display image information collected by the camera 14 and/or image information received by the wireless communication module 13.
  • the number of screens is at least two.
  • the control module 11 is configured to control, according to the sound source direction, the screen closest to the sound source direction to display image information collected by the camera 14 and/or image information received by the wireless communication module 13;
  • the control module 11 is further configured to control the smart speaker to play the voice information collected by the microphone array 12 and/or the voice information received by the wireless communication module 13.
  • the control module 11 includes a main control chip, and the main control chip is an APQ8009 chip.
  • voice information is collected through the microphone array, and a sound source direction is determined according to the voice information.
  • according to the sound source direction, the control module controls the display of the screen corresponding to that direction.
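The first embodiment states only that the sound source direction may be determined by a positioning algorithm based on the difference in arrival time. The following is a minimal sketch of that idea for a single pair of microphones, not the patent's implementation: it assumes a far-field source, a speed of sound of 343 m/s, and a brute-force cross-correlation to find the delay. The names `estimate_delay_samples`, `tdoa_to_angle_deg`, and `SPEED_OF_SOUND` are hypothetical. A seven-microphone ring would combine several such pairs.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s; assumed room-temperature value, not from the patent


def estimate_delay_samples(sig_a, sig_b, max_lag):
    """Return the lag (in samples) of sig_b relative to sig_a.

    The lag is the shift that maximizes the cross-correlation of the two
    signals. A positive result means the sound reached microphone A first.
    """
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i, sample in enumerate(sig_a):
            j = i + lag
            if 0 <= j < len(sig_b):
                score += sample * sig_b[j]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def tdoa_to_angle_deg(delay_s, mic_spacing_m):
    """Convert a time difference of arrival into a far-field angle.

    The angle is measured from the broadside direction (perpendicular to the
    line joining the two microphones) and is clamped to [-90, 90] degrees.
    """
    ratio = SPEED_OF_SOUND * delay_s / mic_spacing_m
    ratio = max(-1.0, min(1.0, ratio))  # guard against rounding and noise
    return math.degrees(math.asin(ratio))
```

With a sampling rate `fs`, the delay in seconds is `estimate_delay_samples(a, b, max_lag) / fs`, which can then be passed to `tdoa_to_angle_deg` together with the microphone spacing.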
  • FIG. 2 is a detailed structural diagram of a smart speaker provided in Embodiment 2 of the present invention.
  • the smart speaker may include:
  • the control module 21, the microphone array 22, the wireless communication module 23, the camera 24, the screen 25, the wake-up module 26, the audio processing module 27, and the key module 28.
  • the microphone array 22, the wireless communication module 23, the camera 24, the screen 25, the wake-up module 26, the audio processing module 27, and the key module 28 are all connected to the control module 21. It should be noted that the control module 21, the microphone array 22, the wireless communication module 23, the camera 24, and the screen 25 are the same as the control module 11, the microphone array 12, the wireless communication module 13, the camera 14, and the screen 15 in the first embodiment, and are not described again here.
  • the wake-up module 26 wakes up the smart speaker after detecting a preset wake-up keyword, so that the smart speaker is in a working state.
  • the audio processing module 27 includes a digital signal processor, a power amplifier, and a speaker.
  • the digital signal processor, power amplifier, and speaker are all connected to the control module 21.
  • the output terminal of the digital signal processor is connected to an input terminal of the power amplifier, and an output terminal of the power amplifier is connected to an input terminal of the speaker.
  • because the voice information collected by the microphone array 22 and/or the voice information received by the wireless communication module 23 contains a lot of noise, playing it directly would degrade the final playback effect and the user experience.
  • the voice information collected by the microphone array 22 and/or the voice information received by the wireless communication module 23 is therefore processed by the digital signal processing system of the audio processing module 27.
  • the key module 28 is configured to receive a key instruction from a user and control the volume adjustment of the smart speaker through the control module.
  • the embodiment of the present invention adds a wake-up module, which wakes up the smart speaker so that it enters the working state after a preset wake-up keyword is detected; it also adds an audio processing module, which makes the voice played by the smart speaker more pleasant; in addition, it adds a key module, which works with the control module to adjust the volume of the smart speaker, so as to meet the different needs of users in different application scenarios and improve the user experience.
  • the method for using a smart speaker according to Embodiment 3 of the present invention may include the following steps:
  • the microphone array collects voice information, and determines a sound source direction according to the voice information.
  • the smart speaker includes a control module, a microphone array, a wireless communication module, a camera, and at least two screens.
  • the microphone array, the wireless communication module, the camera, and the screens are all connected to the control module.
  • voice information is collected through the microphone array, the voice information is processed into voice data, and the direction of the sound source corresponding to the voice information is determined according to the voice data.
  • the control module controls the screen corresponding to the direction of the sound source to display image information collected by the camera and/or image information received by the wireless communication module, and controls the smart speaker to play voice information collected by the microphone array and/or voice information received by the wireless communication module.
  • the image displayed on the screen may be only image information collected by the camera, that is, the local party's own image information; or only image information received by the wireless communication module, that is, the other party's image information; or both, that is, the image information of both parties is displayed at the same time. What is displayed can be set flexibly according to actual needs and the size of the screen.
  • the screen simultaneously displays image information collected by the camera and image information received by the wireless communication module at different ratios.
  • the voice played by the smart speaker may be only voice information collected by the microphone array, that is, the local party's own voice information; or only voice information received by the wireless communication module, that is, the other party's voice information; or both, that is, the voice information of both parties is played at the same time. What is played can be set flexibly according to actual needs and the processing effect of the audio processing module.
  • the smart speaker plays voice information received by the wireless communication module.
  • voice information is collected through the microphone array, and a sound source direction is determined according to the voice information.
  • according to the sound source direction, the control module controls the screen corresponding to that direction to display image information collected by the camera and/or image information received by the wireless communication module, and controls the smart speaker to play voice information collected by the microphone array and/or voice information received by the wireless communication module. Users in every direction can therefore not only hear the other party's voice but also see their expressions and actions, so that people in different places can communicate as if they were in the same conference room. This improves the user experience and the utilization rate of the smart speaker, with strong ease of use and practicality.
  • the method for using the smart speaker provided in the fourth embodiment of the present invention further refines and describes steps S301 and S302 in the third embodiment.
  • the method may include the following steps:
  • the microphone array collects voice information.
  • step S401 is basically the same as step S301 in the third embodiment, and details are not described herein again.
  • S402 Detect whether the voice information includes a preset wakeup keyword, and if a preset wakeup keyword is detected, wake up the smart speaker.
  • the wake-up keyword is a predefined word that switches the smart speaker from a standby state to a working state.
  • the preset wakeup keywords are flexibly set according to a user's preference.
  • step S403 is basically the same as step S301 in the third embodiment, and details are not described herein again.
  • the control module controls the screen closest to the direction of the sound source to display image information collected by the camera and/or image information received by the wireless communication module, and controls the smart speaker to play voice information collected by the microphone array and/or voice information received by the wireless communication module.
  • there may be one or more sound source directions; in this embodiment of the present invention, only the case of a single sound source direction is used as an example for explanation.
  • for a description of multiple sound source directions, refer to Embodiment 5.
  • when there is one sound source direction, controlling the screen closest to the direction of the sound source to display the image information collected by the camera and/or the image information received by the wireless communication module ensures, to the greatest extent, that users see a clear video picture.
  • the distance from the sound source to the screen can be obtained by converting the distance from the sound source to the microphone array.
  • the embodiment of the present invention adds a voice wake-up step and a step of judging the direction of the sound source.
  • the voice wake-up step can promptly switch the smart speaker from the standby state to the working state, speeding up data processing; in addition, when there is only one sound source direction, controlling the screen closest to the sound source direction to display the image information collected by the camera and/or the image information received by the wireless communication module gives a better viewing result, which improves the utilization rate of the smart speaker, with strong ease of use and practicality.
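The flow of the fourth embodiment (collect voice, check for the wake-up keyword, determine the direction, drive the closest screen) can be sketched roughly as follows. This is an illustration under several assumptions rather than the patent's implementation: speech is assumed to be already transcribed to text, "closest" is approximated by the smallest angular difference between the sound source direction and a hypothetical center direction for each screen, and the wake-up keyword, class name, and screen names are invented for the example.

```python
def angular_distance(a_deg, b_deg):
    """Smallest absolute difference between two directions, in degrees."""
    diff = abs(a_deg - b_deg) % 360.0
    return min(diff, 360.0 - diff)


def closest_screen(source_deg, screen_centers):
    """Return the name of the screen whose center direction is nearest."""
    return min(
        screen_centers,
        key=lambda name: angular_distance(source_deg, screen_centers[name]),
    )


class SmartSpeaker:
    """Hypothetical standby/working state machine with screen selection."""

    def __init__(self, wake_keyword, screen_centers):
        self.wake_keyword = wake_keyword.lower()
        self.screen_centers = screen_centers  # screen name -> direction in degrees
        self.awake = False                    # False models the standby state
        self.active_screen = None

    def on_voice(self, transcript, source_deg):
        """Handle one utterance; return the screen to use, or None in standby."""
        if not self.awake:
            if self.wake_keyword not in transcript.lower():
                return None                   # stay in standby
            self.awake = True                 # switch to the working state
        self.active_screen = closest_screen(source_deg, self.screen_centers)
        return self.active_screen
```

For example, with screens centered at 60, 180, and 300 degrees, an utterance arriving from 350 degrees after wake-up selects the screen centered at 300 degrees.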
  • the method for using the smart speaker provided in the fifth embodiment of the present invention further refines and describes steps S301 and S302 in the third embodiment.
  • the method may include the following steps:
  • the microphone array collects voice information.
  • S502 Detect whether the voice information includes a preset wakeup keyword, and if a preset wakeup keyword is detected, wake up the smart speaker.
  • the microphone array determines a sound source direction according to the voice information.
  • steps S501-S503 and steps S401-S403 in the fourth embodiment are basically the same, and reference may be made to related descriptions in the foregoing embodiments, which are not described herein again.
  • the control module determines the angle formed by each of the sound source directions and a preset reference direction and, according to that angle, controls the corresponding screen to display the image information collected by the camera and/or the image information received by the wireless communication module, and controls the smart speaker to play the voice information collected by the microphone array and/or the voice information received by the wireless communication module.
  • the preset reference direction is a reference direction set when the microphone array is installed.
  • the viewing angle range refers to a maximum angle range in which a user can clearly observe all content on the screen from different directions. It should be understood that the viewing angle range is related to the number of screens.
  • the viewing angle range corresponding to the first screen is (0°, 120°]
  • the viewing angle range corresponding to the second screen is (120°, 240°]
  • the viewing angle range corresponding to the third screen is (240°, 360°].
  • when the control module determines that the angle formed by the sound source direction and the preset reference direction falls in the (0°, 120°] interval, the first screen is controlled to be in a working state, displaying image information collected by the camera and/or image information received by the wireless communication module; when the control module determines that the angle falls in the (120°, 240°] interval, the second screen is controlled to be in a working state, displaying image information collected by the camera and/or image information received by the wireless communication module; when the control module determines that the angle falls in the (240°, 360°] interval, the third screen is controlled to be in a working state, displaying image information collected by the camera and/or image information received by the wireless communication module.
  • the control module may further control the smart speaker to play voice information collected by the microphone array and/or voice information received by the wireless communication module.
  • the embodiment of the present invention provides a specific implementation for the case of multiple sound source directions, which can better control the working state of the screens and thereby improve the utilization rate of the smart speaker, with strong usability and practicality.
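The angle-to-screen mapping of the fifth embodiment can be illustrated with a small lookup. The sketch below assumes the three-screen example with half-open viewing angle ranges (low, high] measured from the preset reference direction, and treats 0 and 360 degrees as the same direction. The dictionary and function names are hypothetical and are not taken from the patent.

```python
# Assumed three-screen layout: half-open ranges (low, high] in degrees.
VIEWING_RANGES = {
    "screen_1": (0.0, 120.0),
    "screen_2": (120.0, 240.0),
    "screen_3": (240.0, 360.0),
}


def normalize_angle(angle_deg):
    """Map any angle onto (0, 360], so that 0 is reported as 360."""
    wrapped = angle_deg % 360.0
    return 360.0 if wrapped == 0.0 else wrapped


def screen_for_angle(angle_deg, ranges=VIEWING_RANGES):
    """Return the screen whose viewing angle range contains the angle."""
    angle = normalize_angle(angle_deg)
    for name, (low, high) in ranges.items():
        if low < angle <= high:
            return name
    raise ValueError(f"no screen covers {angle_deg} degrees")


def screens_to_activate(source_angles_deg, ranges=VIEWING_RANGES):
    """Return the set of screens to switch on for several sound sources."""
    return {screen_for_angle(angle, ranges) for angle in source_angles_deg}
```

With two speakers at 30 and 200 degrees, `screens_to_activate([30, 200])` selects the first and second screens and leaves the third one off.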
  • FIG. 6 is a schematic structural diagram of a video conference system provided by Embodiment 6 of the present invention.
  • the video conference system may include:
  • the video conference system shown in FIG. 6 includes a first smart speaker 61, a second smart speaker 62, and a server 63.
  • the first smart speaker 61 is used by a local user, and the second smart speaker 62 is used by a remote user at the opposite end.
  • the number of local users and the number of remote users are not limited for the time being, and may be one or more, respectively, and the specific number may depend on circumstances.
  • the first smart speaker 61 collects local image information and voice information through its own camera and microphone array, and sends the collected image information and voice information to the server through the wireless communication module.
  • when the server receives the request message from the second smart speaker 62, it forwards the image information and voice information sent by the first smart speaker 61 to the second smart speaker 62, and receives the image information and voice information sent by the second smart speaker 62.
  • the server forwards the image information and voice information sent by the second smart speaker 62 to the first smart speaker 61.
  • the screen corresponding to the direction of the sound source is controlled to display the image information collected locally and/or at the opposite end, and the first smart speaker 61 is controlled to play the voice information collected locally and/or at the opposite end. In this way, local users can hear the other party's voice while seeing the other party's picture.
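The forwarding role of the server in the sixth embodiment can be sketched with an in-memory relay. This is only an illustration of the message flow described above: the patent does not specify a protocol, so the class `RelayServer`, its methods, the packet fields, and the speaker identifiers are all assumptions, and real media would travel over a network rather than through Python lists.

```python
from collections import defaultdict, deque


class RelayServer:
    """Hypothetical relay that forwards media between two paired speakers."""

    def __init__(self):
        self.peers = {}                    # speaker id -> paired speaker id
        self.outbox = defaultdict(deque)   # speaker id -> packets waiting for it

    def pair(self, first_id, second_id):
        """Register two smart speakers as the two ends of one conference."""
        self.peers[first_id] = second_id
        self.peers[second_id] = first_id

    def send(self, sender_id, image, voice):
        """Accept image and voice information and queue it for the peer."""
        peer_id = self.peers[sender_id]
        self.outbox[peer_id].append(
            {"from": sender_id, "image": image, "voice": voice}
        )

    def request(self, receiver_id):
        """Answer a request message by delivering everything queued so far."""
        packets = list(self.outbox[receiver_id])
        self.outbox[receiver_id].clear()
        return packets
```

In this sketch the first smart speaker calls `send` with its locally collected data, and the second smart speaker receives it the next time it calls `request`; the reverse direction works the same way.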
  • the disclosed terminal device and method may be implemented in other manners.
  • the terminal device embodiments described above are only schematic.
  • the division of the modules or units is only a logical function division.
  • the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, which may be electrical, mechanical or other forms.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, may be located in one place, or may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objective of the solution of this embodiment.
  • when the integrated module is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer-readable storage medium. Based on such an understanding, all or part of the processes in the methods of the above embodiments of the present invention may also be completed by a computer program instructing related hardware.
  • the computer program may be stored in a computer-readable storage medium.
  • when the computer program is executed by a processor, the steps of the foregoing method embodiments can be implemented.
  • the computer program includes computer program code, and the computer program code may be in a source code form, an object code form, an executable file, or some intermediate form.
  • the computer-readable medium may include: any entity or device capable of carrying the computer program code, a recording medium, a USB flash drive, a removable hard disk, a magnetic disk, an optical disk, a computer memory, a read-only memory (ROM), a random access memory (RAM), electrical carrier signals, telecommunication signals, and software distribution media.

Abstract

The present application is applicable to the technical field of intelligent homes, and provides an intelligent speaker and a method for using the intelligent speaker. The method for using the intelligent speaker comprises: a microphone array collecting voice information and determining a sound source direction according to the voice information; and a control module controlling, according to the sound source direction, a screen corresponding to the sound source direction to display image information collected by a camera and/or image information received by a wireless communication module, and controlling an intelligent speaker to play the voice information collected by the microphone array and/or voice information received by the wireless communication module. The present application can be used in multiple application scenarios, thereby increasing a usage rate of the intelligent speaker, and achieving stronger usability and practicability.

Description

Intelligent speaker and method for using intelligent speaker
Technical field
The invention relates to the technical field of smart homes, and in particular to a smart speaker, a method for using the smart speaker, and a computer-readable storage medium.
Background
With the rise of Internet technology, the ways people communicate with one another have been greatly enriched, making it more convenient for people in different regions to communicate. Among these, the video conference system, as an important remote communication technology, has been widely praised for its convenience and efficiency.
However, when smart speaker devices on the market support video calls, they can generally display the current video picture in only one direction, which cannot meet users' needs in group video conference scenarios, so their utilization rate is low.
Technical problem
In view of this, embodiments of the present invention provide a smart speaker and a method for using the smart speaker, which can simultaneously display image information of a video conference in multiple directions, and can improve the utilization rate of the smart speaker while being convenient for users.
Technical solutions
A first aspect of the embodiments of the present invention provides a smart speaker, including:
a control module, a microphone array, a wireless communication module, a camera, and at least two screens;
the microphone array, the wireless communication module, the camera, and the screens are all connected to the control module;
the microphone array is configured to collect voice information and determine a sound source direction according to the voice information;
the control module is configured to control, according to the sound source direction, the screen corresponding to the sound source direction to display image information collected by the camera and/or image information received by the wireless communication module, and to control the smart speaker to play voice information collected by the microphone array and/or voice information received by the wireless communication module.
A second aspect of the embodiments of the present invention provides a method for using a smart speaker. The smart speaker includes:
a control module, a microphone array, a wireless communication module, a camera, and at least two screens, where the microphone array, the wireless communication module, the camera, and the screens are all connected to the control module, and the method includes:
the microphone array collects voice information and determines a sound source direction according to the voice information;
the control module controls, according to the sound source direction, the screen corresponding to the sound source direction to display image information collected by the camera and/or image information received by the wireless communication module, and controls the smart speaker to play voice information collected by the microphone array and/or voice information received by the wireless communication module.
A third aspect of the embodiments of the present invention provides a computer-readable storage medium. The computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the method mentioned in the second aspect is implemented.
Beneficial Effects
Compared with the prior art, the embodiments of the present invention have the following beneficial effects. In this embodiment, the smart speaker includes a control module, a microphone array, a wireless communication module, a camera, and at least two screens, and the microphone array, the wireless communication module, the camera, and the screens are all connected to the control module. The method includes: the microphone array collects voice information and determines a sound source direction according to the voice information; the control module controls, according to the sound source direction, the screen corresponding to the sound source direction to display image information collected by the camera and/or image information received by the wireless communication module, and controls the smart speaker to play voice information collected by the microphone array and/or voice information received by the wireless communication module. With the embodiments of the present invention, participants in every direction can clearly see the video conference picture while hearing the sound, which greatly increases the utilization of the smart speaker and gives it strong ease of use and practicality.
Brief Description of the Drawings
In order to explain the technical solutions in the embodiments of the present invention more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. Obviously, the drawings described below show only some embodiments of the present invention, and a person of ordinary skill in the art can obtain other drawings from them without creative effort.
FIG. 1 is a schematic structural diagram of the smart speaker provided by Embodiment 1 of the present invention;
FIG. 2 is a schematic diagram of the specific structure of the smart speaker provided by Embodiment 2 of the present invention;
FIG. 3 is a schematic flowchart of the method for using a smart speaker provided by Embodiment 3 of the present invention;
FIG. 4 is a schematic diagram of a specific implementation process of the method for using a smart speaker provided by Embodiment 4 of the present invention;
FIG. 5 is a schematic diagram of a specific implementation process of the method for using a smart speaker provided by Embodiment 5 of the present invention;
FIG. 6 is a schematic structural diagram of the video conference system provided by Embodiment 6 of the present invention.
Embodiments of the Invention
In the following description, specific details such as particular system structures and techniques are set forth for the purpose of illustration rather than limitation, so that the embodiments of the present invention can be thoroughly understood. However, it should be clear to a person skilled in the art that the present invention can also be implemented in other embodiments without these specific details. In other cases, detailed descriptions of well-known systems, devices, circuits, and methods are omitted so that unnecessary detail does not obscure the description of the present invention.
It should be understood that, when used in this specification and the appended claims, the term "including" indicates the presence of the described features, integers, steps, operations, elements, and/or components, but does not exclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or collections thereof.
It should also be understood that the terminology used in this specification is for the purpose of describing particular embodiments only and is not intended to limit the present invention. As used in this specification and the appended claims, the singular forms "a", "an", and "the" are intended to include the plural forms unless the context clearly indicates otherwise.
It should be further understood that the term "and/or" used in this specification and the appended claims refers to any combination and all possible combinations of one or more of the associated listed items, and includes these combinations.
As used in this specification and the appended claims, the term "if" may be interpreted, depending on the context, as "when", "once", "in response to determining", or "in response to detecting". Similarly, the phrase "if it is determined" or "if [the described condition or event] is detected" may be interpreted, depending on the context, as "once it is determined", "in response to determining", "once [the described condition or event] is detected", or "in response to detecting [the described condition or event]".
It should be understood that the sequence numbers of the steps in the embodiments do not imply an order of execution. The execution order of each process should be determined by its function and internal logic, and should not constitute any limitation on the implementation of the embodiments of the present invention.
It should be noted that the present invention may include any number of smart speakers so that two or more users can hold a video conference, where the smart speakers include wireless speakers.
In order to explain the technical solutions of the present invention, specific embodiments are described below.
Embodiment 1
FIG. 1 is a schematic structural diagram of the smart speaker provided by Embodiment 1 of the present invention. The smart speaker may include:
a control module 11, a microphone array 12, a wireless communication module 13, a camera 14, and screens 15.
In this embodiment, the microphone array 12, the wireless communication module 13, the camera 14, and the screens 15 are all connected to the control module 11.
The microphone array 12 is configured to collect voice information and to determine a sound source direction according to the voice information; the sound source direction may be determined using a positioning algorithm based on time difference of arrival. It should be understood that the microphone array 12 is a system composed of a certain number of microphones, used to sample and process the spatial characteristics of the sound field. Optionally, there are seven microphones, arranged in a ring.
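The time-difference-of-arrival positioning mentioned above could be sketched as follows. This is only an illustrative sketch and not the patented implementation: it assumes a far-field source, seven microphones evenly spaced on a circle of hypothetical radius 0.04 m, and delays that have already been measured relative to microphone 0. The names `ring_positions`, `expected_delays`, and `estimate_azimuth` are invented for this example.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, approximate value in air at room temperature


def ring_positions(num_mics=7, radius=0.04):
    """Microphone (x, y) coordinates, evenly spaced on a circle (assumed layout)."""
    return [(radius * math.cos(2 * math.pi * i / num_mics),
             radius * math.sin(2 * math.pi * i / num_mics))
            for i in range(num_mics)]


def expected_delays(azimuth_deg, mics):
    """Far-field arrival delay of each microphone relative to microphone 0."""
    a = math.radians(azimuth_deg)
    ux, uy = math.cos(a), math.sin(a)  # unit vector pointing at the source
    # A microphone displaced toward the source hears the wavefront earlier.
    t = [-(x * ux + y * uy) / SPEED_OF_SOUND for x, y in mics]
    return [ti - t[0] for ti in t]


def estimate_azimuth(measured, mics, step_deg=1.0):
    """Grid search for the azimuth whose predicted delays best fit the measurement."""
    best_angle, best_err = 0.0, float("inf")
    for k in range(int(round(360 / step_deg))):
        angle = k * step_deg
        predicted = expected_delays(angle, mics)
        err = sum((m - p) ** 2 for m, p in zip(measured, predicted))
        if best_err > err:
            best_angle, best_err = angle, err
    return best_angle
```

In practice the measured delays would come from cross-correlating the microphone signals; that step is outside the scope of this sketch.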
The wireless communication module 13 is configured to interact with a server, so as to send locally collected voice information and/or image information to the server and to receive the voice information and/or image information collected by the opposite end throughout the video conference. Optionally, the wireless communication module 13 may include a WiFi communication sub-module and a Bluetooth communication sub-module. Further, voice information other than that collected by the microphone array 12 and/or image information other than that collected by the camera 14 is received through the server. It should be noted that, because the smart speaker in this application is mainly used in video conference scenarios, voice information collected by the microphone array on the smart speaker needs to be played through the smart speaker so that local users can hear it.
The camera 14 is configured to collect image information of users. It should be noted that the type and number of cameras 14 can be selected flexibly according to the actual situation, including but not limited to one ordinary camera, one 360-degree panoramic camera, or one camera array.
The screens 15 are configured to display image information collected by the camera 14 and/or image information received by the wireless communication module 13. Optionally, there are at least two screens.
The control module 11 is configured to control, according to the sound source direction, the screen closest to the sound source direction to display image information collected by the camera 14 and/or image information received by the wireless communication module 13. The control module 11 is further configured to control the smart speaker to play voice information collected by the microphone array 12 and/or voice information received by the wireless communication module 13. Optionally, the control module 11 includes a main control chip, and the main control chip is an APQ8009 chip.
In this embodiment, the microphone array collects voice information and determines a sound source direction according to the voice information; the control module controls, according to the sound source direction, the screen corresponding to the sound source direction to display image information collected by the camera and/or image information received by the wireless communication module, and at the same time controls the smart speaker to play voice information collected by the microphone array and/or voice information received by the wireless communication module. This meets the needs of multi-person video conference scenarios, making the smart speaker more practical, more fully featured, and more convenient to use.
Embodiment 2
FIG. 2 is a schematic diagram of the specific structure of the smart speaker provided by Embodiment 2 of the present invention. The smart speaker may include:
a control module 21, a microphone array 22, a wireless communication module 23, a camera 24, screens 25, a wake-up module 26, an audio processing module 27, and a key module 28.
The microphone array 22, the wireless communication module 23, the camera 24, the screens 25, the wake-up module 26, the audio processing module 27, and the key module 28 are all connected to the control module 21. It should be noted that the control module 21, the microphone array 22, the wireless communication module 23, the camera 24, and the screens 25 are the same as the control module 11, the microphone array 12, the wireless communication module 13, the camera 14, and the screens 15 in Embodiment 1, and are not described again here.
The wake-up module 26 wakes up the smart speaker after detecting a preset wake-up keyword, so that the smart speaker enters the working state.
The audio processing module 27 includes a digital signal processor, a power amplifier, and a loudspeaker, all of which are connected to the control module 21. The output of the digital signal processor is connected to the input of the power amplifier, and the output of the power amplifier is connected to the input of the loudspeaker. It should be understood that the voice information collected by the microphone array 22 and/or the voice information received by the wireless communication module 23 contains a lot of noise; playing it directly would degrade the playback and the user experience. Optionally, the voice information collected by the microphone array 22 and/or the voice information received by the wireless communication module 23 is processed by a digital signal processing system that includes the audio processing module 27.
The key module 28 is configured to receive key instructions from the user and to adjust the volume of the smart speaker through the control module.
As can be seen from the above, compared with Embodiment 1, this embodiment adds a wake-up module, which wakes up the smart speaker and puts it into the working state after a preset wake-up keyword is detected. It also adds an audio processing module, which improves the quality of the voice played by the smart speaker, and a key module, which works with the control module to adjust the volume of the smart speaker. This meets users' different needs in different application scenarios and improves the user experience, with strong ease of use and practicality.
Embodiment 3
FIG. 3 is a schematic flowchart of the method for using a smart speaker provided by Embodiment 3 of the present invention. The method may include the following steps:
S301: The microphone array collects voice information and determines a sound source direction according to the voice information.
The smart speaker includes a control module, a microphone array, a wireless communication module, a camera, and at least two screens, and the microphone array, the wireless communication module, the camera, and the screens are all connected to the control module.
Optionally, the microphone array collects the voice information and processes it into voice data, and the sound source direction corresponding to the voice information is determined according to the voice data.
S302: The control module controls, according to the sound source direction, the screen corresponding to the sound source direction to display image information collected by the camera and/or image information received by the wireless communication module, and controls the smart speaker to play voice information collected by the microphone array and/or voice information received by the wireless communication module.
It should be understood that the image displayed on the screen may be only the image information collected by the camera, that is, the local side's image information; or only the image information received by the wireless communication module, that is, the other party's image information; or both, that is, the image information of the local side and of the other party displayed at the same time. What is displayed can be set flexibly according to actual needs and the size of the screen. Optionally, the screen displays the image information collected by the camera and the image information received by the wireless communication module simultaneously, in different proportions.
It should also be understood that the voice played by the smart speaker may be only the voice information collected by the microphone array, that is, the local side's voice information; or only the voice information received by the wireless communication module, that is, the other party's voice information; or both, that is, the voice information of the local side and of the other party played at the same time. What is played can be set flexibly according to actual needs and the processing effect of the audio processing module. Optionally, the smart speaker plays the voice information received by the wireless communication module.
As can be seen from the above, in this embodiment the microphone array collects voice information and determines a sound source direction according to the voice information, and the control module controls, according to the sound source direction, the screen corresponding to the sound source direction to display image information collected by the camera and/or image information received by the wireless communication module, and controls the smart speaker to play voice information collected by the microphone array and/or voice information received by the wireless communication module. Users in every direction can therefore not only hear and talk with the other party but also see the other party's expressions and movements, so that people in different places can communicate as if they were in the same conference room. This improves the user experience and increases the utilization of the smart speaker, with strong ease of use and practicality.
Embodiment 4
FIG. 4 is a schematic diagram of a specific implementation process of the method for using a smart speaker provided by Embodiment 4 of the present invention. It further refines and explains steps S301 and S302 of Embodiment 3. The method may include the following steps:
S401: The microphone array collects voice information.
Step S401 is basically the same as step S301 in Embodiment 3 and is not described again here.
S402: Detect whether the voice information contains a preset wake-up keyword; if the preset wake-up keyword is detected, wake up the smart speaker.
The wake-up keyword is a predefined word that switches the smart speaker from the standby state to the working state. Optionally, the preset wake-up keyword can be set flexibly according to the user's preference.
S403: After the smart speaker is woken up, determine the sound source direction according to the voice information.
Step S403 is basically the same as step S301 in Embodiment 3 and is not described again here.
S404: When one sound source direction is determined, the control module controls the screen closest to the sound source direction to display image information collected by the camera and/or image information received by the wireless communication module, and controls the smart speaker to play voice information collected by the microphone array and/or voice information received by the wireless communication module.
It should be understood that the application scenarios of the present invention include a one-to-one video conference mode, a one-to-many group video conference mode, and a many-to-many group video conference mode, so there may be one or more sound source directions. This embodiment explains only the case of a single sound source direction; for multiple sound source directions, see Embodiment 5.
It should also be understood that, when there is one sound source direction, controlling the screen closest to the sound source direction to display image information collected by the camera and/or image information received by the wireless communication module ensures, as far as possible, that the user sees a clear video picture. The distance from the sound source to a screen can be derived from the distance from the sound source to the microphone array.
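Selecting the screen closest to a single sound source direction might look like the sketch below. It is an illustration under stated assumptions rather than the method of the source: instead of converting distances, it uses the angular gap between the source azimuth and each screen's facing direction as a proxy for "closest", and the screen facing directions are hypothetical values supplied by the caller.

```python
def angular_gap(a_deg, b_deg):
    """Smallest absolute difference between two directions, in degrees (0 to 180)."""
    diff = abs(a_deg - b_deg) % 360.0
    return min(diff, 360.0 - diff)


def nearest_screen(source_deg, screen_centers_deg):
    """Index of the screen whose facing direction is closest to the source."""
    return min(range(len(screen_centers_deg)),
               key=lambda i: angular_gap(source_deg, screen_centers_deg[i]))
```

For example, with three screens assumed to face 60, 180, and 300 degrees, a source at 350 degrees selects the third screen, because the gap wraps around 360 degrees.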
As can be seen from the above, compared with Embodiment 3, this embodiment adds a voice wake-up step and a step of judging the sound source direction. The voice wake-up step switches the smart speaker from the standby state to the working state promptly, which speeds up data processing. In addition, when there is only one sound source direction, controlling the screen closest to the sound source direction to display image information collected by the camera and/or image information received by the wireless communication module gives a better viewing effect and increases the utilization of the smart speaker, with strong ease of use and practicality.
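The standby-to-working transition of steps S402 and S403 can be pictured as a small state machine. The sketch below is hypothetical: it assumes the voice information has already been turned into text, whereas a real keyword spotter would work on the audio itself, and the class and method names are invented for illustration.

```python
from enum import Enum


class State(Enum):
    STANDBY = "standby"
    WORKING = "working"


class WakeGate:
    """Ignores sound source directions until the wake-up keyword has been heard."""

    def __init__(self, wake_keyword):
        self.wake_keyword = wake_keyword.lower()
        self.state = State.STANDBY

    def on_voice(self, text, direction_deg):
        """Return the direction to act on, or None while still in standby."""
        if self.state is State.STANDBY:
            if self.wake_keyword not in text.lower():
                return None
            self.state = State.WORKING  # S402: keyword detected, wake up
        return direction_deg  # S403: direction is used only after wake-up
```

Once the gate is in the working state, every later utterance passes its direction through, which matches the description that the direction is determined after the speaker has been woken up.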
Embodiment 5
FIG. 5 is a schematic diagram of a specific implementation process of the method for using a smart speaker provided by Embodiment 5 of the present invention. It is another refinement and explanation of steps S301 and S302 of Embodiment 3. The method may include the following steps:
S501: The microphone array collects voice information.
S502: Detect whether the voice information contains a preset wake-up keyword; if the preset wake-up keyword is detected, wake up the smart speaker.
S503: After the smart speaker is woken up, the microphone array determines the sound source direction according to the voice information.
Steps S501 to S503 are basically the same as steps S401 to S403 in Embodiment 4; refer to the relevant description in that embodiment, which is not repeated here.
S504: When multiple sound source directions are determined, the control module determines the angle between each sound source direction and a preset reference direction. When the viewing angle range of a screen contains the angle, the control module controls that screen to display image information collected by the camera and/or image information received by the wireless communication module, and controls the smart speaker to play voice information collected by the microphone array and/or voice information received by the wireless communication module.
Optionally, the preset reference direction is the reference direction set when the microphone array is installed.
The viewing angle range is the maximum range of angles from which a user can clearly see all the content on the screen. It should be understood that the viewing angle range depends on the number of screens.
By way of example, in one specific application scenario, if the smart speaker has three screens, the viewing angle range of the first screen is (0°, 120°], that of the second screen is (120°, 240°], and that of the third screen is (240°, 360°]. When the control module determines that the angle between the sound source direction and the preset reference direction is less than or equal to 120°, it puts the first screen into the working state to display image information collected by the camera and/or image information received by the wireless communication module. When the angle falls in the interval (120°, 240°], it puts the second screen into the working state to display that image information. When the angle falls in the interval (240°, 360°], it puts the third screen into the working state to display that image information.
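The three-screen example above amounts to a lookup of each source angle in half-open ranges of the form (low, high]. A minimal sketch, assuming equal-width ranges and angles measured from the preset reference direction, is given below. The function names are invented, and treating an angle of exactly 0° as 360° is an assumption added here, since the ranges in the example exclude 0°.

```python
def screen_ranges(num_screens):
    """Split (0, 360] into equal ranges (low, high], one per screen."""
    width = 360.0 / num_screens
    return [(i * width, (i + 1) * width) for i in range(num_screens)]


def screens_for_sources(source_angles_deg, num_screens=3):
    """Sorted indices of the screens whose range contains at least one source angle."""
    ranges = screen_ranges(num_screens)
    active = set()
    for angle in source_angles_deg:
        a = angle % 360.0
        if a == 0.0:
            a = 360.0  # assumption: 0 and 360 degrees denote the same direction
        for index, (low, high) in enumerate(ranges):
            if a > low and high >= a:
                active.add(index)
                break
    return sorted(active)
```

With several simultaneous speakers, every screen whose range contains at least one source angle is activated, which is the multi-source behavior described in step S504.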
It should also be understood that, in the above application scenario, so that the local end and the remote end can communicate and display in the same way, the control module may, while displaying image information collected by the camera and/or image information received by the wireless communication module, also control the smart speaker to play voice information collected by the microphone array and/or voice information received by the wireless communication module.
As can be seen from the above, compared with Embodiment 3, this embodiment gives a specific implementation for the case of multiple sound source directions, which allows better control of the working state of the screens and thereby increases the utilization of the smart speaker, with strong ease of use and practicality.
实施例六Example Six
图6是本发明实施例六提供的视频会议系统的结构示意图,该视频会议系统可以包括:FIG. 6 is a schematic structural diagram of a video conference system provided by Embodiment 6 of the present invention. The video conference system may include:
两个以上的智能音箱以及分别与所述至少两个智能音箱连接的服务器,其中所述智能音箱在实施例一中已详细说明过,此处不再赘述。Two or more smart speakers and a server respectively connected to the at least two smart speakers, wherein the smart speakers have been described in detail in the first embodiment, and are not repeated here.
The video conference system in this embodiment of the present invention is described below using a specific application scenario as an example. The video conference system shown in FIG. 6 includes a first smart speaker 61, a second smart speaker 62, and a server 63, where the first smart speaker 61 is used by a local user and the second smart speaker 62 is used by a remote user at the opposite end. It should be noted that this application does not limit the number of local users or the number of remote users; each may be one or more, depending on the situation. After the local user and the remote user have each turned on their respective smart speakers, the first smart speaker 61 collects local image information and voice information through its built-in camera and microphone array, respectively, and sends the collected image information and voice information to the server through the wireless communication module. After receiving a request message from the second smart speaker 62, the server forwards the image information and voice information sent by the first smart speaker 61 to the second smart speaker 62, and receives the image information and voice information sent by the second smart speaker 62. After receiving a request message from the first smart speaker 61, the server forwards the image information and voice information sent by the second smart speaker 62 to the first smart speaker 61. After the first smart speaker 61 determines the direction of the sound source from the locally collected voice information, it controls the screen corresponding to the sound source direction to display the image information collected locally and/or at the opposite end, and controls the smart speaker 61 to play the voice information collected locally and/or at the opposite end. In this way, the local user can hear the other party's voice while seeing a picture that includes the other party's image.
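The forwarding behaviour of the server 63 in this scenario can be illustrated with a minimal sketch. It is not the implementation disclosed in this application: the class name `RelayServer`, the methods `upload` and `request`, and the speaker identifiers are hypothetical, and an in-memory dictionary stands in for the wireless transport, which the description does not specify.

```python
class RelayServer:
    """In-memory stand-in for server 63; all names are illustrative assumptions."""

    def __init__(self):
        # Latest (image, voice) payload uploaded by each speaker, keyed by speaker id.
        self.latest = {}

    def upload(self, speaker_id, image, voice):
        """A speaker sends its locally collected image and voice information."""
        self.latest[speaker_id] = (image, voice)

    def request(self, requester_id):
        """On a request message, forward the payloads of all other speakers."""
        return {
            sid: payload
            for sid, payload in self.latest.items()
            if sid != requester_id
        }


server = RelayServer()
server.upload("speaker_61", image="frame-A", voice="audio-A")
server.upload("speaker_62", image="frame-B", voice="audio-B")

# Each speaker requests the information collected at the opposite end.
remote_for_61 = server.request("speaker_61")
remote_for_62 = server.request("speaker_62")
```

Keying the store by speaker identifier means the same sketch would also cover more than two smart speakers, which the system allows.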
Those skilled in the art can clearly understand that, for convenience and brevity of description, only the above division of functional units and modules is used as an example. In practical applications, the above functions may be allocated to different functional units or modules as needed; that is, the internal structure of the apparatus may be divided into different functional units or modules to complete all or part of the functions described above. The functional units and modules in the embodiments may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware or in the form of a software functional unit. In addition, the specific names of the functional units and modules are only for ease of distinguishing them from one another and are not intended to limit the protection scope of this application. For the specific working processes of the units and modules in the above system, reference may be made to the corresponding processes in the foregoing method embodiments; details are not repeated here.
In the above embodiments, the description of each embodiment has its own emphasis. For parts that are not detailed or recorded in one embodiment, refer to the related descriptions of the other embodiments.
Those of ordinary skill in the art will appreciate that the units and algorithm steps of the examples described in connection with the embodiments disclosed herein can be implemented in electronic hardware, or in a combination of computer software and electronic hardware. Whether these functions are performed in hardware or in software depends on the specific application and design constraints of the technical solution. A skilled person may use different methods to implement the described functions for each specific application, but such an implementation should not be considered beyond the scope of the present invention.
In the embodiments provided by the present invention, it should be understood that the disclosed terminal device and method may be implemented in other ways. For example, the terminal device embodiments described above are merely illustrative. The division into modules or units is only a division by logical function, and other divisions are possible in an actual implementation: multiple units or components may be combined or integrated into another system, or some features may be omitted or not performed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be an indirect coupling or communication connection through some interfaces, apparatuses, or units, and may be electrical, mechanical, or of another form.
The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
If the integrated module is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in a computer-readable storage medium. Based on this understanding, all or part of the processes in the methods of the above embodiments may also be completed by a computer program instructing the related hardware. The computer program may be stored in a computer-readable storage medium, and when executed by a processor, the computer program can implement the steps of the foregoing method embodiments. The computer program includes computer program code, which may be in source code form, object code form, an executable file, or some intermediate form. The computer-readable medium may include: any entity or apparatus capable of carrying the computer program code, a recording medium, a USB flash drive, a removable hard disk, a magnetic disk, an optical disc, a computer memory, a read-only memory (ROM), a random access memory (RAM), an electrical carrier signal, a telecommunication signal, a software distribution medium, and the like. It should be noted that the content covered by the computer-readable medium may be appropriately expanded or reduced according to the requirements of legislation and patent practice in a jurisdiction. For example, in some jurisdictions, according to legislation and patent practice, the computer-readable medium does not include electrical carrier signals and telecommunication signals.
The embodiments described above are intended only to illustrate the technical solutions of the present invention, not to limit them. Although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that they may still modify the technical solutions recorded in the foregoing embodiments or make equivalent replacements of some of the technical features. Such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the spirit and scope of the technical solutions of the embodiments of the present invention, and shall all fall within the protection scope of the present invention.

Claims (12)

  1. A smart speaker, characterized by comprising:
    a control module, a microphone array, a wireless communication module, a camera, and at least two screens;
    wherein the microphone array, the wireless communication module, the camera, and the screens are all connected to the control module;
    the microphone array is configured to collect voice information and determine a sound source direction according to the voice information; and
    the control module is configured to control, according to the sound source direction, the screen corresponding to the sound source direction to display image information collected by the camera and/or image information received by the wireless communication module, and to control the smart speaker to play voice information collected by the microphone array and/or voice information received by the wireless communication module.
  2. The smart speaker according to claim 1, characterized in that the smart speaker further comprises:
    a wake-up module;
    wherein the wake-up module is connected to the control module; and
    the wake-up module wakes up the smart speaker after detecting a preset wake-up keyword.
  3. The smart speaker according to claim 1 or 2, characterized in that, when one sound source direction is determined, the control module is specifically configured to control the screen closest to the sound source direction to display the image information collected by the camera and/or the image information received by the wireless communication module, and to control the smart speaker to play the voice information collected by the microphone array and/or the voice information received by the wireless communication module.
  4. The smart speaker according to claim 3, characterized in that the smart speaker further comprises:
    an audio processing module, the audio processing module comprising a digital signal processor, a power amplifier, and a loudspeaker;
    wherein the digital signal processor, the power amplifier, and the loudspeaker are all connected to the control module.
  5. The smart speaker according to claim 1 or 2, characterized in that, when a plurality of sound source directions are determined, the control module is specifically configured to determine the angle formed between each of the sound source directions and a preset reference direction and, when there is a screen whose corresponding viewing-angle range contains the angle, to control that screen to display the image information collected by the camera and/or the image information received by the wireless communication module, and to control the smart speaker to play the voice information collected by the microphone array and/or the voice information received by the wireless communication module.
  6. The smart speaker according to claim 1, characterized in that the smart speaker further comprises:
    a key module;
    wherein the key module is connected to the control module; and
    the control module is configured to control adjustment of the volume of the smart speaker when the key module receives a key instruction.
  7. A method for using a smart speaker, characterized in that the smart speaker comprises a control module, a microphone array, a wireless communication module, a camera, and at least two screens, the microphone array, the wireless communication module, the camera, and the screens all being connected to the control module, and the method comprises:
    collecting, by the microphone array, voice information, and determining a sound source direction according to the voice information; and
    controlling, by the control module according to the sound source direction, the screen corresponding to the sound source direction to display image information collected by the camera and/or image information received by the wireless communication module, and controlling the smart speaker to play voice information collected by the microphone array and/or voice information received by the wireless communication module.
  8. The method according to claim 7, characterized in that controlling the screen corresponding to the sound source direction to display the image information collected by the camera and/or the image information received by the wireless communication module, and controlling the smart speaker to play the voice information collected by the microphone array and/or the voice information received by the wireless communication module, comprises:
    when one sound source direction is determined, controlling, by the control module, the screen closest to the sound source direction to display the image information collected by the camera and/or the image information received by the wireless communication module, and controlling the smart speaker to play the voice information collected by the microphone array and/or the voice information received by the wireless communication module.
  9. The method according to claim 7, characterized in that controlling the screen corresponding to the sound source direction to display the image information collected by the camera and/or the image information received by the wireless communication module, and controlling the smart speaker to play the voice information collected by the microphone array and/or the voice information received by the wireless communication module, further comprises:
    when a plurality of sound source directions are determined, determining, by the control module, the angle formed between each of the sound source directions and a preset reference direction and, when there is a screen whose corresponding viewing-angle range contains the angle, controlling that screen to display the image information collected by the camera and/or the image information received by the wireless communication module, and controlling the smart speaker to play the voice information collected by the microphone array and/or the voice information received by the wireless communication module.
  10. A computer-readable storage medium storing a computer program, characterized in that the computer program, when executed by a processor, implements the steps of the method according to any one of claims 7 to 9.
  11. A video conference system, comprising: at least two smart speakers according to any one of claims 1 to 6.
  12. The video conference system according to claim 11, characterized in that the video conference system further comprises: a server connected to each of the at least two smart speakers.
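
The screen-selection logic recited in claims 3, 5, 8 and 9 can be illustrated with a minimal sketch, offered under stated assumptions rather than as the claimed implementation. The two-screen layout, the `Screen` fields, the function names, and all angle values are hypothetical. The sketch also assumes that "closest" means the smallest angular distance to a screen's centre bearing, and that each viewing-angle range is an arc narrower than a full circle, measured in degrees from the preset reference direction.

```python
from typing import List, NamedTuple


class Screen(NamedTuple):
    name: str
    center: float  # bearing of the screen centre, degrees from the reference direction
    start: float   # start of the viewing-angle range, degrees
    end: float     # end of the viewing-angle range, degrees


# Hypothetical two-screen layout; the claims do not specify these values.
SCREENS = [
    Screen("screen_A", 90.0, 0.0, 180.0),
    Screen("screen_B", 270.0, 180.0, 360.0),
]


def angular_distance(a: float, b: float) -> float:
    """Smallest absolute difference between two bearings, in degrees."""
    diff = abs(a - b) % 360.0
    return min(diff, 360.0 - diff)


def in_range(angle: float, start: float, end: float) -> bool:
    """True if angle lies on the arc running from start up to end (inclusive)."""
    span = (end - start) % 360.0
    return (angle - start) % 360.0 <= span


def select_screens(directions: List[float]) -> List[str]:
    """Pick the screens to drive for the detected sound source directions."""
    if not directions:
        return []
    if len(directions) == 1:
        # Single source: use the screen closest to that direction.
        nearest = min(SCREENS, key=lambda s: angular_distance(directions[0], s.center))
        return [nearest.name]
    # Several sources: use every screen whose viewing range contains a source angle.
    return [
        s.name
        for s in SCREENS
        if any(in_range(d, s.start, s.end) for d in directions)
    ]
```

For example, `select_screens([100.0])` selects only the nearest screen, while `select_screens([45.0, 300.0])` selects both screens because each viewing range contains one of the source angles.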
PCT/CN2019/107869 2018-08-24 2019-09-25 Intelligent speaker and method for using intelligent speaker WO2020038494A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810973579.8A CN110858883A (en) 2018-08-24 2018-08-24 Intelligent sound box and use method thereof
CN201810973579.8 2018-08-24

Publications (1)

Publication Number Publication Date
WO2020038494A1 true WO2020038494A1 (en) 2020-02-27

Family

ID=69592313

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/107869 WO2020038494A1 (en) 2018-08-24 2019-09-25 Intelligent speaker and method for using intelligent speaker

Country Status (2)

Country Link
CN (1) CN110858883A (en)
WO (1) WO2020038494A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113608641B (en) * 2020-06-18 2024-01-16 深圳市冠旭电子股份有限公司 Method and device for adjusting display position of curved screen, intelligent sound box and storage medium
CN114245267B (en) * 2022-02-27 2022-07-08 北京荣耀终端有限公司 Method and system for multi-device cooperative work and electronic device

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101534413A (en) * 2009-04-14 2009-09-16 深圳华为通信技术有限公司 System, method and apparatus for remote representation
CN102036158A (en) * 2009-10-07 2011-04-27 株式会社日立制作所 Sound monitoring system and speech collection system
CN104144315A (en) * 2013-05-06 2014-11-12 华为技术有限公司 Displaying method of multipoint videoconference and multipoint videoconference system
US20150350769A1 (en) * 2014-06-03 2015-12-03 Cisco Technology, Inc. Determination, Display, and Adjustment of Best Sound Source Placement Region Relative to Microphone
CN108366216A (en) * 2018-02-28 2018-08-03 深圳市爱影互联文化传播有限公司 TV news recording, record and transmission method, device and server
CN208862988U (en) * 2018-08-24 2019-05-14 深圳市冠旭电子股份有限公司 A kind of intelligent sound box and video conferencing system

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1705911A1 (en) * 2005-03-24 2006-09-27 Alcatel Video conference system
CN106155200B (en) * 2016-06-30 2020-11-20 联想(北京)有限公司 Electronic equipment and display method
CN206559473U (en) * 2017-02-20 2017-10-13 北京光年无限科技有限公司 A kind of image collecting device and intelligent robot
CN108366319A (en) * 2018-03-30 2018-08-03 京东方科技集团股份有限公司 Intelligent sound box and its sound control method

Also Published As

Publication number Publication date
CN110858883A (en) 2020-03-03

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application (Ref document number: 19852827; Country of ref document: EP; Kind code of ref document: A1)
NENP Non-entry into the national phase (Ref country code: DE)
122 Ep: pct application non-entry in european phase (Ref document number: 19852827; Country of ref document: EP; Kind code of ref document: A1)