Disclosure of Invention
In order to solve the deficiencies of the prior art, the utility model provides a this meeting equipment, the integration has the AI camera, can track, focus the face of speaker, and the speaker in the removal is followed fast to automatic accuracy to and carry out automatically regulated to speaker's face and surrounding environment change.
The utility model discloses the technical problem that will solve realizes through following technical scheme:
the utility model provides a meeting equipment of integrated AI camera, connects between source equipment and display device, including data transmission module, AI camera, microphone module and loudspeaker module, wherein
The input end of the data transmission module is connected to the source equipment, and the output end of the data transmission module is connected to the display equipment, so that a source signal output by the source equipment is transmitted to the display equipment;
the output end of the AI camera is connected with the data transmission module and used for collecting image signals of a speaker and sending the collected image signals to the data transmission module so that the data transmission module can transmit the image signals and the source signals to the display equipment;
the microphone module is used for collecting the voice signal of a speaker;
the loudspeaker module is used for playing the sound signal of the speaker.
Further, the AI camera includes rotatable camera and AI processing chip, the output of rotatable camera is connected to AI processing chip's input, AI processing chip's output is connected to the data transmission module.
Further, the rotation range of the rotatable camera is 0-360 degrees, and the rotatable camera is a USB camera supporting 4K high definition and 106 degrees wide angle.
Further, the microphone module includes microphone array and DSP processing chip, the output of microphone array connect in DSP processing chip.
Further, the data transmission module comprises a Type-C to HDMI transmission channel, a USB3.0to HDMI transmission channel and an HDBaseT extension transmission channel, wherein
The Type-C to HDMI transmission channel is used for converting a source signal output by the source device in a Type-C format into an HDMI format and transmitting the HDMI format to the display device;
the USB 3.0-to-HDMI transmission channel is used for converting a source signal output by the source equipment in a USB3.0 format into an HDMI format and transmitting the HDMI format to the display equipment;
the HDBaseT extended transmission channel is used for encoding the HDMI format source signals converted from the Type-C to HDMI transmission channel and the USB3.0to HDMI transmission channel into network signals and then outputting the network signals.
Further, the Type-C to HDMI transmission channel comprises a Type-C interface connected with the source device, an HDMI interface connected with the display device, and an audio/video conversion chip connected between the Type-C interface and the HDMI interface.
Further, still include with the module fills soon that Type-C interface is connected.
Further, the USB 3.0-to-HDMI transmission channel includes a USB3.0 main interface connected to the source device, an HDMI interface connected to the display device, and an audio/video conversion chip connected between the USB3.0 main interface and the HDMI interface.
Further, the Type-C to HDMI transmission channel and the USB3.0to HDMI transmission channel share the same audio/video conversion chip and the same HDMI interface, and the Type-C to HDMI transmission channel and the USB3.0 main interface are connected to the audio/video conversion chip through a switching chip.
Further, the HDBaseT extended transmission channel includes an HDBaseT chip and a network interface, an input end of the HDBaseT chip is connected to the Type-C to HDMI transmission channel and the audio/video conversion chip of the USB3.0to HDMI transmission channel, and an output end of the HDBaseT chip is connected to the network interface.
The utility model discloses following beneficial effect has: the conference equipment is integrated with the AI camera, the face of a speaker can be automatically identified, tracked and focused by utilizing an AI human shape and a human face identification algorithm carried by the AI camera, the speaker can be automatically, accurately and quickly tracked on a conference site, the face of the speaker and the light change of the ambient environment can be automatically adjusted, the AI camera is matched with the microphone module for use, image signals and sound signals of the speaker are respectively collected, then the image signals and the source signals are transmitted to the display equipment through the data transmission module, and the sound signals are played through the loudspeaker module.
Detailed Description
The present invention will be described in detail with reference to the accompanying drawings and examples.
As shown in fig. 1 and 2, a conference device integrated with an AI camera, connected between a source device and a display device, includes a data transmission module, an AI camera, a microphone module and a speaker module, wherein
The input end of the data transmission module is connected to the source equipment, and the output end of the data transmission module is connected to the display equipment, so that a source signal output by the source equipment is transmitted to the display equipment; and
the output end of the AI camera is connected with the data transmission module and used for collecting image signals of a speaker and sending the collected image signals to the data transmission module so that the data transmission module can transmit the image signals and the source signals to the display equipment;
the microphone module is used for collecting the voice signal of a speaker;
the loudspeaker module is used for playing the sound signal of the speaker.
The AI camera is a shooting device for recognizing, tracking and focusing a human face by utilizing an AI human shape and human face recognition algorithm, can automatically, accurately and quickly move along with the human face and automatically adjust the change of the human face and ambient light.
The conference equipment is integrated with the AI camera, the face of a speaker can be automatically identified, tracked and focused by utilizing an AI human shape and a human face identification algorithm carried by the AI camera, the speaker can be automatically, accurately and quickly tracked on a conference site, the face of the speaker and the light change of the ambient environment can be automatically adjusted, the AI camera is matched with the microphone module for use, image signals and sound signals of the speaker are respectively collected, then the image signals and the source signals are transmitted to the display equipment through the data transmission module, and the sound signals are played through the loudspeaker module.
In this embodiment, since the voice signal of the speaker needs to be synchronized to the display device, the output end of the microphone module is connected to the data transmission module to send the collected voice signal to the data transmission module, so that the data transmission module transmits the voice signal and the source signal to the display device; the input end of the loudspeaker module is connected with the data transmission module so as to indirectly acquire the sound signal from the data transmission module.
In implementation, if the sound signal of the speaker does not need to be synchronized to the display device, the input end of the speaker module may be directly connected to the output end of the microphone module, so as to directly obtain the sound signal from the microphone module (not shown).
The AI camera includes rotatable camera and AI processing chip, the output of rotatable camera is connected to AI processing chip's input, AI processing chip's output is connected to the data transmission module.
The AI processing chip acquires an image signal from the transmission camera, identifies a speaker in the image signal by utilizing an integrated AI human shape and face identification algorithm, controls the rotation camera to rotate and focus so as to track and focus the face of the speaker, and finally outputs the image signal of the speaker to the data transmission module.
The AI processing chip can determine who the speaker is by mouth change or body movement distance, frequency, etc. of each participant in the image signal, for example, identify the participant with the largest mouth change as the speaker, and track and focus the face.
The rotating range of the rotatable camera is 0-360 degrees so as to synchronously rotate along with a moving speaker, and the rotatable camera used in the scheme adopts a USB camera supporting 4K high definition and 106 large wide angles, so that the use scene of a small and medium-sized conference room can be completely covered.
The microphone module comprises a microphone array and a DSP processing chip, wherein the output end of the microphone array is connected with the DSP processing chip, and the output end of the DSP processing chip is connected with the data transmission module or the loudspeaker module.
The DSP chip can utilize a DSP algorithm with a built-in core to perform echo cancellation, background noise reduction, sound enhancement and other processing on the collected sound signals so as to perfectly output the voice of a speaker with high-frequency and low-frequency crystal-like tone quality to the data transmission module or the loudspeaker module, thereby bringing excellent user experience in a meeting room.
The microphone array used in this case includes four microphones, and the four microphones are arranged around the central axis of the box in a 90 ° offset manner at the periphery of the conference device, so as to realize sound reception at 360 °.
The data transmission module comprises a Type-C to HDMI transmission channel, a USB3.0to HDMI transmission channel and an HDBaseT extension transmission channel, wherein
The Type-C to HDMI transmission channel is used for converting a source signal output by the source device in a Type-C format into an HDMI format and transmitting the HDMI format to the display device;
the USB 3.0-to-HDMI transmission channel is used for converting a source signal output by the source equipment in a USB3.0 format into an HDMI format and transmitting the HDMI format to the display equipment;
the HDBaseT extended transmission channel is used for encoding the HDMI format source signals converted from the Type-C to HDMI transmission channel and the USB3.0to HDMI transmission channel into network signals and then outputting the network signals.
The conference equipment is internally integrated with functions of Type-C to HDMI output, USB3.0to HDMI output, HDBaseT extension and the like, source signals (together with sound signals collected by the microphone module) output by the source equipment in a Type-C format or a USB3.0to HDMI format can be converted into HDMI formats together through the Type-C to HDMI transmission channel and the USB3.0to HDMI transmission channel and then output to the display equipment at the near end, and the HDMI format source signals (and sound signals) converted from the Type-C to HDMI transmission channel and the USB3.0to HDMI transmission channel are encoded into network signals through the HDBaseT extension transmission channel and then output to the display equipment at the far end.
The remote display device needs to receive the network signal output by the conference device through the cable Cat5e/6 through an HDBaseT receiver, and decode the network signal into an HDMI format for display.
Type-C changes HDMI transmission channel include with Type-C interface that the source equipment is connected, with the HDMI interface that the display device is connected and connect in audio and video conversion chip between Type-C interface and the HDMI interface, USB3.0 change HDMI transmission channel with USB3.0 main interface that the source equipment is connected, with the HDMI interface that the display device is connected and connect in audio and video conversion chip between USB3.0 main interface and the HDMI interface.
In this embodiment, Type-C changes HDMI transmission channel with USB3.0 changes same audio and video conversion chip of HDMI transmission channel sharing and HDMI interface, Type-C changes the Type-C interface of HDMI transmission channel with USB3.0 changes the USB3.0 main interface of HDMI transmission channel is connected to through a switching chip jointly audio and video conversion chip. At this time, an MCU connected to the switching chip may be added, and the MUC receives a switching instruction through RS23, infrared, serial ports, and the like, and then controls the switching chip to switch.
In implementation, each transmission channel in the data transmission module may also adopt other connection schemes (not shown in the figure), such as: the Type-C to HDMI transmission channel and the USB3.0to HDMI transmission channel can also only share the same HDMI interface, and the two have respective video transmission chips, at this time, the switching chip needs to be respectively connected between the audio and video conversion chip of the Type-C to HDMI transmission channel and the HDMI interface and between the audio and video conversion chip of the USB3.0to HDMI transmission channel and the HDMI interface; for another example, the Type-C to HDMI transmission channel and the USB3.0to HDMI transmission channel do not share any component, and both have their own video transmission chip and HDMI interface.
The conference equipment supports Type-C to HDMI2.0 video output, and maximum support is achieved to 4K resolution, 30Hz refresh rate and 4: 4: 4, the maximum output 1080P resolution and the 60Hz refresh rate can almost meet the requirements of most of the mobile devices such as computers, flat panels and mobile phones supporting the Type-C interface on the market at present, and simultaneously, the USB3.0toHDMI technology can meet the requirements of relatively old PC, pen and power devices and the like, and various source devices can be supported as much as possible.
The conference equipment also comprises a USB3.0 slave interface connected with the audio and video conversion chip for the access of external USB equipment.
The conference equipment further comprises a quick charging module connected with the Type-C interface, wherein the quick charging module comprises a charging interface and a quick charging protocol chip, and the quick charging protocol chip is connected between the Type-C interface and the charging interface.
The conference equipment simultaneously provides 65W Type-C charging, and ensures that the computers of the conference participants can keep sufficient electric quantity to meet the long-time conference requirement.
The HDBaseT extended transmission channel includes an HDBaseT chip and a network cable interface, an input end of the HDBaseT chip is connected to the Type-C to HDMI transmission channel and an audio/video conversion chip of the USB3.0to HDMI transmission channel, and an output end of the HDBaseT chip is connected to the network cable interface.
The network cable interface outputs a network signal to the HDBaseT receiver of the remote display device by connecting a network cable Cat5 e/6.
The output end of the AI camera and the output end of the microphone module are both connected to the input end of the audio and video conversion chip, and the input end of the loudspeaker module is connected to the output end of the audio and video conversion chip.
The above-mentioned embodiments only express the embodiments of the present invention, and the description thereof is specific and detailed, but the invention can not be understood as the limitation of the patent scope of the present invention, but all the technical solutions obtained by adopting the equivalent substitution or equivalent transformation should fall within the protection scope of the present invention.