CN216122672U - Conference system and remote conference platform - Google Patents

Conference system and remote conference platform

Publication number
CN216122672U
CN216122672U CN202121394714.7U CN202121394714U CN216122672U CN 216122672 U CN216122672 U CN 216122672U CN 202121394714 U CN202121394714 U CN 202121394714U CN 216122672 U CN216122672 U CN 216122672U
Authority
CN
China
Prior art keywords
audio signal
sound
processing
conference system
unit
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202121394714.7U
Other languages
Chinese (zh)
Inventor
张知硕
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Taiden Industrial Co ltd
Original Assignee
Shenzhen Taiden Industrial Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Taiden Industrial Co ltd
Priority to CN202121394714.7U
Publication of CN216122672U

Landscapes

  • Telephonic Communication Services (AREA)

Abstract

The utility model provides a conference system and a remote conference platform. The conference system comprises a control unit, and a pickup unit, a sound amplification unit and a recording unit which are respectively connected with the control unit. The pickup unit is used for collecting an original audio signal and performing first audio processing on the original audio signal to generate a first audio signal and performing second audio processing on the original audio signal to generate a second audio signal; the control unit is used for receiving and processing the first audio signal and the second audio signal, and transmitting the processed first audio signal to the sound amplifying unit, and the sound amplifying unit is used for receiving the first audio signal, amplifying and outputting the first audio signal; the control unit is further configured to transmit the processed second audio signal to the recording unit, and the recording unit is configured to receive and analyze the second audio signal to perform recording, archiving and/or voice transcription.

Description

Conference system and remote conference platform
Technical Field
The utility model relates to the technical field of audio processing, in particular to a conference system and a remote conference platform comprising the conference system.
Background
With the development of science and technology, modern conference systems are becoming increasingly common. In some important conferences, the conference site audio is used both for sound amplification and for recording and archiving. In addition, real-time voice transcription of the conference site audio is gradually becoming an indispensable function of modern conference systems, and the demand for high-quality teleconferencing is growing. However, in existing conference systems the conference room audio is often carried as a single audio signal, so these requirements cannot all be met with good quality at the same time.
Summary of the Utility Model
In view of this, the present invention provides a conference system and a teleconference platform, which can perform different audio processing on the same collected original audio signal to generate different multi-channel audio signals, thereby meeting different requirements in a conference.
In order to achieve the above object, in a first aspect, the present invention provides a conference system, including a control unit, and a sound pickup unit, a sound amplification unit, and a recording unit respectively connected to the control unit;
the pickup unit is used for collecting an original audio signal, performing first audio processing on the original audio signal to generate a first audio signal, and performing second audio processing on the original audio signal to generate a second audio signal;
the control unit is used for receiving and processing the first audio signal and the second audio signal, and transmitting the processed first audio signal to the sound amplifying unit, and the sound amplifying unit is used for receiving the first audio signal, amplifying and outputting the first audio signal; the control unit is further configured to transmit the processed second audio signal to the recording unit, and the recording unit is configured to receive and analyze the second audio signal to perform recording, archiving and/or voice transcription.
In a second aspect, the present invention provides a remote conference platform, including the conference system as described above, and at least one remote conference system communicatively connected to the conference system; wherein the remote conference system comprises a terminal device in communication connection with the conference system and/or another conference system in communication connection with the conference system.
Compared with the prior art, the utility model has the beneficial effects that: in the conference system and the teleconference platform provided by the utility model, different audio processing is carried out on the same path of acquired original audio signals, so that different first audio signals and second audio signals can be generated, wherein the first audio signals are used for sound amplification and external reproduction in the conference, and the second audio signals are used for recording and archiving and/or voice transcription in the conference, thereby meeting different requirements in the conference.
Additional aspects and advantages of the utility model will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the utility model.
Drawings
To illustrate the technical solutions of the embodiments of the present invention more clearly, the drawings used in the embodiments are briefly described below. The drawings in the following description show only some embodiments of the present invention, and those skilled in the art can obtain other drawings from them without creative effort.
Fig. 1 is a schematic diagram of a framework of a conference system provided by the present invention.
Fig. 2 is a schematic structural diagram of the conference system shown in fig. 1 in one embodiment.
Fig. 3 is a schematic diagram illustrating the principle of audio processing of the original audio signal by the sound pickup unit shown in fig. 2.
Fig. 4 is a schematic diagram of the conference system host of fig. 2 processing multiple audio signals.
Fig. 5 is a schematic diagram of a framework of a teleconferencing platform provided by the present invention.
Fig. 6 is a flow chart of an audio processing method provided by the utility model.
Detailed Description
Reference will now be made in detail to embodiments of the present invention, examples of which are illustrated in the accompanying drawings, wherein like or similar reference numerals refer to the same or similar elements or elements having the same or similar function throughout. The embodiments described below with reference to the accompanying drawings are illustrative only for the purpose of explaining the present invention, and are not to be construed as limiting the present invention.
In the description of the present invention, it should be noted that the terms "first", "second", etc. are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implying any number of technical features indicated. Thus, features defined as "first," "second," etc. may explicitly or implicitly include one or more of the features.
Referring to fig. 1, the present invention provides a conference system 1, which includes a control unit 10, and a sound pickup unit 20, a sound amplification unit 30, and a recording unit 40 respectively connected to the control unit 10.
The sound pickup unit 20 is configured to collect an original audio signal, and perform first audio processing on the original audio signal to generate a first audio signal, and perform second audio processing on the original audio signal to generate a second audio signal. The control unit 10 is configured to receive and process the first audio signal and the second audio signal, and transmit the processed first audio signal to the sound amplifying unit 30, where the sound amplifying unit 30 is configured to receive the first audio signal, amplify the first audio signal, and output the amplified first audio signal (i.e., sound amplifying and playing the sound). The control unit 10 is further configured to transmit the processed second audio signal to the recording unit 40, and the recording unit 40 is configured to receive and analyze the second audio signal to perform recording, archiving and/or voice transcription.
It is understood that, in the embodiment of the present invention, the second audio signal received and analyzed by the recording unit 40 may be used only for recording, archiving, or only for voice transcription, and may also be used for recording, archiving, and voice transcription together, and the specific use of the second audio signal may be set according to actual needs, which is not limited herein.
In the conference system 1 provided by the utility model, different audio processing is applied to the same captured original audio signal, so that distinct first and second audio signals are generated. The first audio signal is used for sound amplification and playback in the conference, and the second audio signal is used for recording and archiving and/or voice transcription, so that the different requirements of the conference are met.
Preferably, in some embodiments of the present invention, the sound pickup unit 20 is further configured to perform a third audio processing on the acquired original audio signal to generate a third audio signal, the control unit 10 is further configured to receive and process the third audio signal, and transmit the processed third audio signal to the recording unit 40, and the recording unit 40 is further configured to receive and parse the third audio signal to perform recording archiving and/or voice transcription. Similar to the second audio signal, in the embodiment of the present invention, the third audio signal received and analyzed by the recording unit 40 may be used only for recording, archiving, or only for voice transcription, and may also be used for both recording, archiving and voice transcription, that is, the purpose of the third audio signal may also be set according to actual needs, and is not limited thereto.
It is understood that both the second audio signal and the third audio signal may be used for recording archiving and/or voice transcription when both are present, i.e. the second audio signal and the third audio signal may be used for the same or different purposes.
Preferably, in an embodiment, the recording unit 40 may receive and parse the second audio signal for recording and archiving, and receive and parse the third audio signal for voice transcription. In another embodiment, the recording unit 40 may receive and parse the third audio signal for recording and archiving, and receive and parse the second audio signal for voice transcription. In these embodiments, two separate audio signals are used for recording and archiving and for voice transcription, and the two signals do not affect each other. This helps improve the clarity of the audio file produced by recording and archiving, and improves the recognition success rate and speed of voice transcription.
In other embodiments, the second audio signal and the third audio signal may both be used for recording and archiving, or both be used for voice transcription. Each of the two signals may also be used for both recording and archiving and voice transcription. Alternatively, one of the two signals may be used for recording and archiving or for voice transcription, while the other is used for both. It can be understood that when two different audio signals are used simultaneously for recording and archiving or for voice transcription, two different recording files are generated accordingly, which makes the recorded content more comprehensive.
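The alternatives listed above amount to assigning each recording-side signal a set of purposes. The following is a minimal sketch of that idea, not part of the utility model itself; the names `Purpose`, `SPLIT`, `SWAPPED`, `BOTH_FOR_EACH`, and `signals_for` are hypothetical and chosen only for illustration.

```python
from enum import Flag, auto


class Purpose(Flag):
    """Uses the recording unit may make of a signal (illustrative names)."""
    ARCHIVE = auto()      # recording and archiving
    TRANSCRIBE = auto()   # voice transcription


# Hypothetical configurations mirroring embodiments described in the text.
SPLIT = {"second": Purpose.ARCHIVE, "third": Purpose.TRANSCRIBE}
SWAPPED = {"second": Purpose.TRANSCRIBE, "third": Purpose.ARCHIVE}
BOTH_FOR_EACH = {
    "second": Purpose.ARCHIVE | Purpose.TRANSCRIBE,
    "third": Purpose.ARCHIVE | Purpose.TRANSCRIBE,
}


def signals_for(purpose, routing):
    """Return the names of the signals assigned to the given purpose."""
    return sorted(name for name, uses in routing.items() if purpose in uses)
```

With `SPLIT`, only the second signal is archived; with `BOTH_FOR_EACH`, both signals feed both functions, which corresponds to the case where two different recording files are produced.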
In the embodiment of the present invention, the audio processing means performed by the sound pickup unit 20 on the original audio signal includes, but is not limited to, a plurality of audio processing means such as gain processing, noise reduction processing, echo cancellation processing, and equalization processing, and the first audio processing, the second audio processing, and the third audio processing all include at least some of the plurality of audio processing means, but the combination of the audio processing means included in the first audio processing, the second audio processing, and the third audio processing is different, so that the first audio processing, the second audio processing, and the third audio processing become different audio processing technologies, and thus different first audio signals, second audio signals, and third audio signals can be correspondingly generated and used for the same or different purposes according to actual needs.
Specifically, in an embodiment, the first audio processing includes gain processing, noise reduction processing, echo cancellation processing, and equalization processing, the second audio processing includes gain processing and noise reduction processing, and the third audio processing includes gain processing. In another embodiment, the first audio processing includes gain processing, noise reduction processing, echo cancellation processing, and equalization processing, the second audio processing includes gain processing, and the third audio processing includes gain processing and noise reduction processing. In the embodiments of the present invention, after the original audio signal undergoes gain, noise reduction, echo cancellation, and equalization processing, the resulting first audio signal is stable, free of noise, and of good sound quality when played through the sound amplifying unit 30, which improves the listening experience of the user. After the original audio signal undergoes gain processing and noise reduction processing, the resulting second or third audio signal is stable and has its noise removed, which helps produce clear, noise-free audio when that signal is recorded and archived through the recording unit 40. After the original audio signal undergoes gain processing, the resulting third or second audio signal is stable, which helps it to be recognized quickly when it is voice-transcribed through the recording unit 40, thereby improving the efficiency of voice transcription.
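The stage combinations of the first embodiment above can be sketched as three processing chains applied to one raw capture. This is only an illustration under stated assumptions: the stage functions are toy stand-ins (peak normalization for gain, a simple gate for noise reduction, pass-through placeholders for echo cancellation and equalization), the order of stages is assumed, and the names `CHAINS`, `process`, and `pickup_unit` are hypothetical.

```python
def gain(samples, target_peak=0.5):
    """Toy gain stage: scale so the peak magnitude equals target_peak."""
    peak = max((abs(s) for s in samples), default=0.0)
    if peak == 0.0:
        return list(samples)
    return [s * target_peak / peak for s in samples]


def noise_reduce(samples, threshold=0.05):
    """Toy noise gate: zero out samples quieter than the threshold."""
    return [s if abs(s) >= threshold else 0.0 for s in samples]


def echo_cancel(samples):
    """Placeholder: a real canceller needs the loudspeaker reference signal."""
    return list(samples)


def equalize(samples):
    """Placeholder for an equalization filter; passes the signal through."""
    return list(samples)


# Stage combinations from the first embodiment (stage order is an assumption).
CHAINS = {
    "first": [gain, noise_reduce, echo_cancel, equalize],  # sound amplification
    "second": [gain, noise_reduce],                        # e.g. archiving
    "third": [gain],                                       # e.g. transcription
}


def process(raw, chain):
    """Apply each stage in order to a copy of the raw signal."""
    out = list(raw)
    for stage in chain:
        out = stage(out)
    return out


def pickup_unit(raw):
    """Produce all three signals from the same raw capture."""
    return {name: process(raw, chain) for name, chain in CHAINS.items()}
```

The second embodiment would be obtained by swapping the stage lists for `"second"` and `"third"`; the raw capture itself is never modified, so every chain starts from the same original signal.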
In some embodiments of the present invention, there are a plurality of sound pickup units 20, and each sound pickup unit 20 is configured to collect one original audio signal and to generate one first audio signal, one second audio signal, and one third audio signal through the corresponding audio processing. The control unit 10 includes a plurality of sound mixers. After the control unit 10 receives the multiple first audio signals, multiple second audio signals, and multiple third audio signals transmitted by the plurality of sound pickup units 20, the first audio signals are mixed by a first sound mixer, the second audio signals are mixed by a second sound mixer, and the third audio signals are mixed by a third sound mixer.
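The per-kind mixing described above can be sketched as follows. The text does not specify how the mixers combine channels, so sample-wise averaging is an assumption made here, and the dictionary layout and the names `mix` and `control_unit_mix` are hypothetical.

```python
def mix(channels):
    """Toy mixer: average equal-length channels sample by sample."""
    if not channels:
        return []
    n = len(channels)
    return [sum(frame) / n for frame in zip(*channels)]


def control_unit_mix(unit_outputs):
    """Mix the first, second and third signals of all pickup units separately.

    unit_outputs: list of dicts with keys "first", "second", "third",
    one dict per pickup unit (a hypothetical data layout).
    """
    return {
        kind: mix([unit[kind] for unit in unit_outputs])
        for kind in ("first", "second", "third")
    }
```

Keeping one mixer per signal kind means the differently processed signals are never combined with one another, only with signals of the same kind from other pickup units.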
After mixing by the corresponding sound mixer, the first audio signal can be amplified and played through the sound amplifying unit 30 without problems such as poor voice quality, echo, howling, and noise, so that the played sound is clearer and stronger and the listening experience of the user is improved. Similarly, after the second audio signal or the third audio signal is recorded and archived by the recording unit 40, the resulting recording file avoids problems such as poor sound quality and low volume during playback, so that playback reproduces the speaker's voice more faithfully and the listening experience of the user is improved.
Further, in some embodiments of the present invention, the conference system 1 further includes a communication unit 50 connected to the control unit, and the control unit 10 is further configured to transmit the first audio signal as a remote audio signal through the communication unit 50.
Therefore, when there are a plurality of conference systems 1, after a first conference system is communicatively connected to at least one second conference system, the control unit 10 of the first conference system is further configured to send the remote audio signal to the control unit 10 of the at least one second conference system. The control unit 10 of each second conference system is further configured to receive and process the remote audio signal and transmit the processed remote audio signal to the sound amplification unit 30 of that second conference system, which receives, amplifies, and outputs the remote audio signal. Specifically, the control unit 10 of each conference system 1 further includes a fourth sound mixer, which is configured to mix the first audio signal transmitted by the sound pickup unit 20 of the conference system 1; the mixed first audio signal is used as the remote audio signal and is sent out through the communication unit 50 of the conference system 1. Further, the first sound mixer of each conference system 1 is also configured to mix the remote audio signal received by that conference system 1. When the control unit 10 of the second conference system receives the remote audio signal and the sound pickup unit 20 of the second conference system transmits the first audio signal, the remote audio signal and the first audio signal are mixed by the first sound mixer of the second conference system, and the mixed signal is transmitted to the sound amplification unit 30 of the second conference system for amplification and playback.
In the embodiments of the present invention, the conference systems 1 are communicatively connected and exchange remote audio signals through their respective control units 10 and communication units 50, so that a teleconference can be held among the conference systems 1 and the application range of the conference system 1 is widened. Furthermore, because the remote audio signal (i.e., the first audio signal) sent by the first conference system has undergone the echo cancellation of the first audio processing, a remote audio signal sent by any second conference system is prevented from being played at the first conference system, picked up again, and transmitted back to that second conference system. The audio played by the sound amplifying unit 30 of any conference system 1 is therefore clean and free of echo, which improves the listening experience of users.
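The signal routing between two sites can be sketched as below. This models only the electrical routing through the first and fourth mixers; the acoustic path from loudspeaker back to microphone, which the echo cancellation of the first audio processing addresses, is not modeled. The class `ConferenceSystem`, its attributes, and the summing `mix` function are hypothetical.

```python
def mix(channels):
    """Toy mixer: sum equal-length channels sample by sample."""
    return [sum(frame) for frame in zip(*channels)]


class ConferenceSystem:
    """Hypothetical model of one site's audio routing for a teleconference."""

    def __init__(self, name):
        self.name = name
        self.local_first = []   # first audio signals from this site's pickup units
        self.remote_in = []     # remote audio signals received from other sites

    def remote_out(self):
        """Fourth mixer: local first signals only, sent to the other sites."""
        return mix(self.local_first)

    def loudspeaker_feed(self):
        """First mixer: local first signals plus received remote signals."""
        return mix(self.local_first + self.remote_in)


# Two sites exchange their remote audio signals.
site_a = ConferenceSystem("A")
site_b = ConferenceSystem("B")
site_a.local_first = [[0.25, 0.5]]
site_b.local_first = [[0.5, 0.0]]
site_b.remote_in.append(site_a.remote_out())
site_a.remote_in.append(site_b.remote_out())
```

Because `remote_out` draws only on locally captured signals, a site never forwards the remote audio it has received, while `loudspeaker_feed` lets local participants hear both local and remote speakers.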
It should be noted that the first conference system and the second conference system refer to different participants participating in a teleconference, respectively, where the first conference system can be regarded as a party hosting the teleconference, and both the first conference system and the second conference system have all the features and functions of the conference system 1.
Preferably, in an embodiment, the control unit 10 of the first conferencing system comprises a teleconference controller for controlling the number of second conferencing systems communicatively connected to the first conferencing system, i.e. for controlling the number of second conferencing systems participating in a teleconference. The number of the second conference systems is controlled by the teleconference controller, so that the content of the teleconference can be ensured to be listened by only allowed participants, and the safety of the teleconference is improved.
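One way to picture such a controller is as a small admission check on incoming connections. The text only says that the controller limits the number of connected second conference systems; the allow-list used below is an added assumption, and the class `TeleconferenceController` and its methods are hypothetical.

```python
class TeleconferenceController:
    """Hypothetical admission control for remote conference systems."""

    def __init__(self, max_remote, allowed_ids):
        self.max_remote = max_remote          # maximum number of remote systems
        self.allowed_ids = set(allowed_ids)   # assumed allow-list of system ids
        self.connected = set()

    def connect(self, system_id):
        """Admit a remote system only if it is allowed and a slot is free."""
        if system_id in self.connected:
            return True
        if system_id not in self.allowed_ids:
            return False
        if len(self.connected) >= self.max_remote:
            return False
        self.connected.add(system_id)
        return True

    def disconnect(self, system_id):
        """Release the slot held by a remote system, if any."""
        self.connected.discard(system_id)
```

A host created as `TeleconferenceController(1, ["siteB", "siteC"])` would admit whichever of the two allowed sites connects first and refuse the other until the first disconnects.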
Further, in the embodiment of the present invention, the conference system 1 further includes a camera unit 60 and a display unit 70 connected to the control unit 10 thereof. The camera unit 60 is configured to collect a video signal in a conference and transmit the video signal to the control unit 10, the control unit 10 is further configured to process the video signal and transmit the processed video signal to the display unit 70, and the display unit 70 receives and analyzes the video signal to display a related video in the conference, so that a conference audio and a conference video are synchronized, thereby implementing a video conference.
It can be understood that, when a plurality of conference systems 1 are communicatively connected to perform a teleconference, the control unit 10 of any conference system 1 is further configured to use the video signal captured by the camera unit 60 as a remote video signal, and send the remote video signal to the control units 10 of other conference systems 1 along with the remote audio signal, so as to perform a display through the display units 70 of other conference systems, thereby implementing a teleconference.
A specific embodiment of the conference system 1 provided by the present invention will be further described with reference to fig. 1 to 4.
As shown in fig. 1, the conference system 1 provided by the present invention includes a control unit 10, and a sound collecting unit 20, a sound amplifying unit 30, a recording unit 40, a communication unit 50, an image capturing unit 60, and a display unit 70 respectively connected to the control unit 10.
Specifically, referring to fig. 2, in an embodiment of the present invention, the control unit 10 includes a conference system host 12, a network switch 14 connected to the conference system host 12, a remote video conference host 16 connected to the conference system host 12, and a remote conference controller 18 connected to the remote video conference host 16. The conference system host 12, the network switch 14, the remote video conference host 16, and the remote conference controller 18 have the same functions as the existing conference system host, network switch, remote video conference host, and remote conference controller, respectively, in a one-to-one correspondence, and no further description is given here.
There is at least one sound pickup unit 20. Each sound pickup unit 20 includes a microphone 22 and an audio processor 24 connected to the microphone 22. The microphone 22 is configured to collect an original audio signal when a participant speaks and transmit it to the audio processor 24, and the audio processor 24 is configured to perform different audio processing on the received original audio signal, so as to generate different audio signals and transmit them to the conference system host 12. As shown in fig. 3, the audio processor 24 includes an analog-to-digital conversion (A/D) module, an Automatic Gain Control (AGC) module, an Adaptive Noise Reduction (ANR) module, an Adaptive Echo Cancellation (AEC) module, and an Equalization (EQ) module. The audio processor 24 further includes other functional modules (not shown), such as an audio transmitting module for transmitting the processed audio signal to the conference system host 12. It can be understood that, as in a conventional audio processor, the A/D module, the AGC module, the ANR module, the AEC module, and the EQ module of the audio processor 24 are respectively configured to perform analog-to-digital conversion, gain processing, noise reduction processing, echo cancellation processing, and equalization processing on the original audio signal received by the audio processor 24, which is not described again here.
In an embodiment of the present invention, after the microphone 22 of each sound pickup unit 20 collects the original audio signal of a corresponding participant, the audio processor 24 of that sound pickup unit 20 performs analog-to-digital conversion on the original audio signal through its A/D module, and then applies different audio processing to the converted signal through at least some of the AGC module, the ANR module, the AEC module, and the EQ module, so as to generate the corresponding target audio signals. As described above, by combining the AGC module, the ANR module, the AEC module, and the EQ module of the audio processor 24, the original audio signal can be subjected to the first audio processing, the second audio processing, or the third audio processing. The target audio signal generated by the first audio processing is the first audio signal (CUn1 in fig. 3), the target audio signal generated by the second audio processing is the second audio signal (CUn2 in fig. 3), and the target audio signal generated by the third audio processing is the third audio signal (CUn3 in fig. 3). The first audio signal is amplified and played through the sound amplification unit 30, and the second audio signal and the third audio signal may be used for recording and archiving and/or voice transcription through the recording unit 40 according to actual needs.
Preferably, as shown in fig. 2, in an embodiment of the present invention, the number of sound pickup units 20 is equal to the number of participants, and each microphone 22 is configured to separately collect the original audio signal of one corresponding participant. After being received and processed by the audio processor 24, the original audio signal of each participant can be used for sound amplification and playback as well as for recording and archiving and/or voice transcription, so that each file produced by recording and archiving and/or voice transcription corresponds to one participant. In other words, each person is equipped with a microphone so that speakers can be told apart during recording and archiving and/or voice transcription.
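With one microphone per participant, speaker attribution reduces to a lookup from pickup unit to person. The sketch below assumes that transcription results arrive as time-stamped segments tagged with the capturing unit; the `Segment` type, the mapping, and `label_transcript` are hypothetical and not taken from the utility model.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    unit_id: int      # which pickup unit captured the speech
    start_s: float    # start time in seconds
    text: str         # transcription result for the segment


def label_transcript(segments, unit_to_participant):
    """Merge per-microphone segments into one speaker-labelled transcript."""
    lines = []
    for seg in sorted(segments, key=lambda s: s.start_s):
        # Fall back to the unit number when no participant is registered.
        speaker = unit_to_participant.get(seg.unit_id, f"unit {seg.unit_id}")
        lines.append(f"{speaker}: {seg.text}")
    return lines
```

For example, segments from units 1 and 2 with the mapping `{1: "Chair", 2: "Guest"}` yield lines prefixed with "Chair:" and "Guest:" in time order, without any need for voice-based speaker recognition.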
In other embodiments, the number of the sound pickup units 20 may be less than the number of the participants, that is, the sound pickup units 20 do not correspond to the participants one by one, and at least one of the sound pickup units 20 is configured to collect the original audio signals of a plurality of the participants, so that the original audio signal of each of the participants can be collected by one of the sound pickup units 20.
It should be noted that, in the embodiment of the present invention, the sound pickup units 20 may be sequentially connected and then communicatively connected to the conference system host 12, or each sound pickup unit 20 may be connected to the conference system host 12; the connection between the two sound pickup units 20 or the connection between the sound pickup unit 20 and the conference system main unit 12 may be a wired communication connection or a wireless communication connection, which is not limited to this.
Referring to fig. 2 again, in an embodiment of the present invention, the sound amplifying unit 30 includes a power amplifier 32 connected to the conference system host 12, and at least one sound amplifying device 34 (e.g., a loudspeaker) connected to the power amplifier 32. The power amplifier 32 is configured to amplify the power of the first audio signal transmitted by the conference system host 12 and transmit it to the at least one sound amplifying device 34, which receives the amplified first audio signal and plays it back. Preferably, in this embodiment, there are a plurality of sound amplifying devices 34 disposed at different positions in the meeting place, so that the sound of the first audio signal played through them has a stereo effect, which improves the user experience.
As shown in fig. 2, in an embodiment of the present invention, the recording unit 40 includes a recording device 42 and a summary device 44 having a voice transcription function. The recording device 42 is directly connected to the conference system host 12, and the summary device 44 is connected to the network switch 14, through which the conference system host 12 transmits an audio signal to the summary device 44. The recording device 42 and the summary device 44 are configured, respectively, to record and archive and to voice-transcribe the audio signal transmitted by the conference system host 12. In other embodiments, the recording unit 40 may include only a recording device 42 or only a summary device 44. The recording device 42 and the summary device 44 may be any existing recording device and summary device, and are not limited herein.
It should be noted that, in an embodiment of the present invention, the communication unit 50 is disposed in the remote video conference host 16, the conference system 1 is communicatively connected to a cloud server through the communication unit 50, and a plurality of conference systems 1 are communicatively connected through the cloud server to hold a multi-party conference. The communication unit 50 may be one or a combination of several communication modules, such as a Bluetooth module, a Wi-Fi module, and a wired network connection module, which is not limited herein.
As shown in fig. 2, in an embodiment of the present invention, the camera unit 60 includes a plurality of cameras 62 and a video matrix 64, the video matrix 64 is connected to the conference system host 12, and the plurality of cameras 62 are respectively connected to the video matrix 64. The plurality of cameras 62 are used for capturing relevant videos in the conference and generating corresponding video signals, for example, a part of the plurality of cameras 62 is used for capturing the images of the participants, another part is used for capturing the panoramic view or the local view of the conference site, and the video signals generated by each of the cameras 62 are transmitted to the conference system host 12 through the video matrix 64. After receiving the video signal sent by each camera 62, the conference system host 12 transmits the video signal to the display unit 70, and the display unit 70 analyzes the received video signal and displays the analyzed video content through the display screen 72. The conference system host 12 may control the display screen 72, so that the display screen 72 only displays the portrait video corresponding to the speaking participant, and thus when the speaking content (i.e., the audio content corresponding to the first audio signal) corresponding to the speaking participant is played through the sound amplifying unit 30, the portrait corresponding to the speaking participant is simultaneously displayed on the display screen 72, so that the audios and videos of the speaking participant are unified, and it is beneficial for other participants to correspond the listened audio content to the speaking participant. Of course, in other embodiments, the conference system host 12 may control the display screen 72 to display the videos captured by all the cameras 62 or some of the videos captured by the cameras 62, which is not limited herein. 
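The text does not say how the conference system host determines which participant is speaking. One simple possibility, assumed here purely for illustration, is to compare the short-term level of each pickup unit's first audio signal and show the camera associated with the loudest one. The functions `rms` and `select_camera`, the threshold value, and the unit-to-camera mapping are all hypothetical.

```python
import math


def rms(samples):
    """Root-mean-square level of one block of samples."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def select_camera(first_signals, unit_to_camera, threshold=0.01):
    """Pick the camera for the loudest pickup unit, or None during silence.

    first_signals: dict mapping pickup-unit id to its latest block of samples.
    unit_to_camera: dict mapping pickup-unit id to a camera id.
    """
    if not first_signals:
        return None
    unit = max(first_signals, key=lambda u: rms(first_signals[u]))
    if rms(first_signals[unit]) < threshold:
        return None
    return unit_to_camera.get(unit)
```

Returning `None` during silence leaves the choice of fallback view, such as a panorama of the conference site, to the caller.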
In addition, as described above, the video signal generated by any one of the cameras 62 may also be used as a remote video signal and transmitted to another conference system 1 through the conference system host 12, the remote video conference host 16 connected thereto, and the communication unit 50, so as to be used for remote video display.
In other embodiments, the camera unit 60 may include only one camera 62, and the panoramic view of the conference is captured by the camera 62 and displayed on the display screen 72.
Further, referring to fig. 4, in an embodiment of the present invention, the conference system host 12 is provided with a plurality of mixers, which specifically include a first Mixer (Mixer1), a second Mixer (Mixer2), a third Mixer (Mixer3), and a fourth Mixer (Mixer 4).
As shown in fig. 4, after the audio processors 24 of the pickup units 20 generate multiple paths of first audio signals (CU11-CUn1), the conference system host 12 receives the multiple paths of first audio signals and mixes them through the first mixer, and then transmits the mixed first audio signal to the power amplifier 32 (PA in fig. 4) of the sound amplifying unit 30 for amplification and playback through the sound amplifying device 34 (Audio in fig. 4). After the audio processors 24 of the pickup units 20 generate multiple paths of second audio signals (CU12-CUn2), the conference system host 12 receives the multiple paths of second audio signals and mixes them through the second mixer, and the mixed second audio signal is used for recording and archiving (Recording). After the audio processors 24 of the pickup units 20 generate multiple paths of third audio signals (CU13-CUn3), the conference system host 12 receives the multiple paths of third audio signals and mixes them through the third mixer, and the mixed third audio signal is used for voice transcription (ASR). After receiving the multiple paths of first audio signals, the conference system host 12 also mixes them through the fourth mixer, and then transmits the mixed first audio signal to the other conference systems 1 as a remote audio signal (VCS Out), where it is played through the sound amplifying units 30 of the other conference systems 1.
As shown in fig. 4, when the conference system host 12 of each conference system 1 receives remote audio signals sent by other conference systems 1 through the remote video conference host 16 connected thereto, the first mixer of the conference system 1 is further configured to mix the received remote audio signals with the multiple paths of first audio signals, and then transmit the mixed signal to the sound amplifying unit 30 of the conference system 1 for amplification and playback, thereby implementing audio content interaction between different conference systems 1. It should be noted that, in order to distinguish it from the remote audio signal (VCS Out) sent by the conference system host 12 to the other conference systems 1, the remote audio signal received by the conference system host 12 from the other conference systems 1 is denoted VCS In (see fig. 4).
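As a non-limiting illustration, the signal routing of fig. 4 can be sketched in Python as follows. The names `Frame`, `mix` and `route_host` are hypothetical, audio is represented as plain lists of samples, and mixing is assumed to be simple sample-wise addition; the actual mixers of the conference system host 12 are not specified at this level of detail.

```python
from typing import Dict, List, Optional, Sequence

Frame = List[float]  # one block of audio samples (hypothetical representation)


def mix(channels: Sequence[Frame]) -> Frame:
    """Sum equal-length channels sample by sample (a plain additive mix)."""
    if not channels:
        return []
    return [sum(samples) for samples in zip(*channels)]


def route_host(
    first: Sequence[Frame],
    second: Sequence[Frame],
    third: Sequence[Frame],
    vcs_in: Optional[Frame] = None,
) -> Dict[str, Frame]:
    """Route the per-pickup-unit signals through four mixers, following fig. 4."""
    # Mixer1 feeds the local loudspeakers; it also takes VCS In when present.
    mixer1_inputs = list(first) + ([vcs_in] if vcs_in is not None else [])
    return {
        "PA": mix(mixer1_inputs),   # Mixer1 -> power amplifier 32
        "Recording": mix(second),   # Mixer2 -> recording and archiving
        "ASR": mix(third),          # Mixer3 -> voice transcription
        "VCS Out": mix(first),      # Mixer4 -> remote audio signal
    }
```

In this sketch VCS Out is built from the first audio signals only, matching the description of the fourth mixer, while VCS In reaches only the local sound amplification path.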
Preferably, in an embodiment of the present invention, the conference system host 12 is further configured to generate an echo cancellation reference signal (REF) when the mixed audio signal output by the first mixer contains the remote audio signal. The audio processor 24 of each pickup unit 20 connected to the conference system host 12 uses the echo cancellation reference signal to perform echo cancellation processing, so that the original audio signal generated by the microphone 22 of the pickup unit 20 does not retain an echo audio signal (i.e., the audio signal generated when the microphone 22 picks up the remote audio signal after it has been amplified and played). This eliminates the echo that would otherwise occur when the first audio signal derived from the echo audio signal through the first audio processing is played by the sound amplifying unit 30, and prevents the echo audio signal from being transmitted back to the other conference systems 1.
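As a non-limiting illustration of how a reference signal can be used for echo cancellation, the following Python sketch applies a normalized least-mean-squares (NLMS) adaptive filter. NLMS is an assumption chosen for illustration; the embodiment does not specify which algorithm the audio processor 24 uses. The function names, the filter length, the step size and the simulated echo path (a delay of 3 samples scaled by 0.6) are likewise hypothetical.

```python
import random
from typing import List


def nlms_echo_cancel(
    mic: List[float], ref: List[float], taps: int = 8, mu: float = 0.5, eps: float = 1e-8
) -> List[float]:
    """Subtract an adaptive estimate of the echo of `ref` from `mic`."""
    weights = [0.0] * taps
    history = [0.0] * taps  # most recent reference samples, newest first
    residual = []
    for mic_sample, ref_sample in zip(mic, ref):
        history = [ref_sample] + history[:-1]
        echo_estimate = sum(w * x for w, x in zip(weights, history))
        error = mic_sample - echo_estimate  # echo-cancelled output sample
        power = sum(x * x for x in history) + eps
        # NLMS update: step along the reference history, normalized by its power.
        weights = [w + mu * error * x / power for w, x in zip(weights, history)]
        residual.append(error)
    return residual


def energy(samples: List[float]) -> float:
    """Sum of squared samples."""
    return sum(s * s for s in samples)


# Simulated reference signal (REF) and the echo it leaves in the microphone.
rng = random.Random(0)
ref = [rng.uniform(-1.0, 1.0) for _ in range(2000)]
mic = [0.0] * 3 + [0.6 * r for r in ref[:-3]]  # assumed echo path: delay 3, gain 0.6
residual = nlms_echo_cancel(mic, ref)
```

Once the filter has adapted, the residual carries far less energy than the microphone signal, which is the effect the reference signal is intended to achieve.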
Further preferably, in an embodiment of the present invention, each conference system 1 further includes an adaptive feedback suppression (AFC) module, where the adaptive feedback suppression module is configured to increase a sound amplification gain of the audio signal output by the first sound mixer to increase a volume of the audio signal when the audio signal is played out, and the adaptive feedback suppression module is further configured to suppress a howling frame signal in the audio signal output by the first sound mixer, so as to reduce howling generated when the audio signal is played out.
In summary, in the conference system 1 in an embodiment of the present invention, the sound pickup unit 20 collects original audio signals and performs different audio processing on the same collected original audio signal, so as to generate different multiple audio signals, the control unit 10 transmits the multiple audio signals to the sound amplification unit 30 and the recording unit 40, respectively, so that a first audio signal in the multiple audio signals is amplified and played through the sound amplification unit 30, a second audio signal is archived and recorded through the recording device 42 of the recording unit 40, and a third audio signal is subjected to voice transcription through the essential device 44 of the recording unit 40, thereby meeting different requirements in a conference; after the plurality of conference systems 1 are connected in communication, the first audio signal in any conference system 1 can also be used as a remote audio signal and sent to other conference systems 1 together with the video signal shot by the camera unit 60 through the respective control unit 10 and the communication unit 50, so that a video conference among the plurality of conference systems 1 is realized, and the applicability of the conference system 1 is enhanced.
Referring to fig. 5, the present invention further provides a remote conference platform 2, including the conference system 1 according to any of the above embodiments, and at least one remote conference system 3 communicatively connected to the conference system 1. Specifically, the remote conference platform 2 further includes a cloud server (not shown), and the conference system 1 and the at least one remote conference system 3 are in communication connection through the cloud server.
The remote conference system 3 may be a terminal device such as a computer or a tablet, or may be another conference system 1 having all features and functions of the conference system 1. When the number of the remote conference systems 3 is plural, the plural remote conference systems 3 may be any combination of the terminal device and the conference system 1, and the plural remote conference systems 3 may be in communication connection with the conference system 1 through one or more cloud servers. For example, in an embodiment, the conference system 1 and the plurality of remote conference systems 3 may be respectively communicatively connected to a same cloud server, so that the conference system 1 is communicatively connected to the plurality of remote conference systems 3 to implement a teleconference; in another embodiment, the plurality of remote conference systems 3 may be divided into a plurality of remote conference system groups, the plurality of remote conference system groups are in one-to-one correspondence with the plurality of cloud servers, and the plurality of cloud servers are in communication connection with the conference system 1 respectively, so that the conference system 1 can also be in communication connection with the plurality of remote conference systems 3 to implement a teleconference.
The cloud server may be a public cloud server or a private cloud server, which is not limited herein.
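As a non-limiting illustration of the second connection example above, the following Python sketch divides remote conference systems into groups that correspond one-to-one with cloud servers. The function name `assign_groups` and the round-robin assignment rule are assumptions made for illustration; the embodiment does not specify how the groups are formed.

```python
from typing import Dict, List


def assign_groups(remote_systems: List[str], cloud_servers: List[str]) -> Dict[str, List[str]]:
    """Assign each remote conference system to one cloud server, round-robin."""
    if not cloud_servers:
        raise ValueError("at least one cloud server is required")
    groups: Dict[str, List[str]] = {server: [] for server in cloud_servers}
    for index, system in enumerate(remote_systems):
        # Each cloud server ends up with exactly one group of remote systems.
        groups[cloud_servers[index % len(cloud_servers)]].append(system)
    return groups
```

With a single cloud server the same function reduces to the first example, in which all remote conference systems share one server.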
It can be understood that the remote conference platform 2 provided by the present invention has all the functions and features of the conference system 1: it can perform different audio processing on the same collected original audio signal to generate multiple different audio signals, which are used to meet different requirements in a conference. For a more detailed description, refer to the related content of the conference system 1, which is not repeated here.
Further, referring to fig. 6, the present invention further provides an audio processing method for a conference system 1. As shown in fig. 1, the conference system 1 includes a control unit 10, and a sound pickup unit 20, a sound amplification unit 30, and a recording unit 40 respectively connected to the control unit 10, and specific features and functions of each unit included in the conference system 1 may refer to the related contents of the conference system 1, which is not described again. As shown in fig. 6, the audio processing method includes the steps of:
step S1, the pickup unit 20 collects an original audio signal, and performs a first audio processing on the original audio signal to generate a first audio signal, and performs a second audio processing on the original audio signal to generate a second audio signal;
step S2, the control unit 10 receives and processes the first audio signal and the second audio signal, and transmits the processed first audio signal and second audio signal to the sound amplifying unit 30 and the recording unit 40, respectively;
step S3, the sound amplifying unit 30 receives the first audio signal, amplifies the first audio signal, and outputs the amplified first audio signal, and the recording unit 40 receives and analyzes the second audio signal to perform recording, archiving and/or voice transcription.
According to the audio processing method provided by the utility model, different audio processing is carried out on the same path of acquired original audio signals, so that different first audio signals and second audio signals can be generated, the first audio signals are used for sound amplification and external sound reproduction in a conference, and the second audio signals are used for recording and archiving and/or voice transcription in the conference, so that different requirements in the conference are met.
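As a non-limiting illustration, steps S1 to S3 can be sketched in Python as follows. The class names, the callable-based processing and the list-of-samples signal representation are hypothetical; the placeholder processing functions only stand in for the first and second audio processing.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple

Frame = List[float]  # one block of audio samples (hypothetical representation)


@dataclass
class PickupUnit:
    first_processing: Callable[[Frame], Frame]
    second_processing: Callable[[Frame], Frame]

    def collect(self, original: Frame) -> Tuple[Frame, Frame]:
        # Step S1: one original signal, two independently processed outputs.
        return self.first_processing(original), self.second_processing(original)


@dataclass
class SoundAmplificationUnit:
    gain: float = 2.0
    played: List[Frame] = field(default_factory=list)

    def output(self, signal: Frame) -> None:
        # Step S3 (first half): amplify and output the first audio signal.
        self.played.append([self.gain * s for s in signal])


@dataclass
class RecordingUnit:
    archive: List[Frame] = field(default_factory=list)

    def record(self, signal: Frame) -> None:
        # Step S3 (second half): keep the second audio signal for archiving.
        self.archive.append(list(signal))


@dataclass
class ControlUnit:
    amplifier: SoundAmplificationUnit
    recorder: RecordingUnit

    def dispatch(self, first: Frame, second: Frame) -> None:
        # Step S2: forward each signal to its own destination.
        self.amplifier.output(first)
        self.recorder.record(second)


pickup = PickupUnit(
    first_processing=lambda x: [0.5 * s for s in x],  # placeholder first processing
    second_processing=lambda x: list(x),              # placeholder second processing
)
amplifier = SoundAmplificationUnit(gain=2.0)
recorder = RecordingUnit()
control = ControlUnit(amplifier, recorder)
first_signal, second_signal = pickup.collect([0.5, -0.25])
control.dispatch(first_signal, second_signal)
```

Because the two signals are derived separately from the same original signal, changing the processing applied to one path does not affect the other.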
Preferably, in an embodiment of the present invention, the audio processing method further includes step S4: the pickup unit 20 performs third audio processing on the original audio signal to generate a third audio signal; the control unit 10 receives and processes the third audio signal and transmits the processed third audio signal to the recording unit 40; and the recording unit 40 receives and analyzes the third audio signal to perform recording, archiving and/or voice transcription.
In the embodiment of the present invention, when the second audio signal and the third audio signal exist simultaneously, both of them may be used for recording, archiving and/or voice transcription, that is, the purposes of the second audio signal and the third audio signal may be the same or different.
Preferably, in an embodiment, the recording unit 40 may receive and parse the second audio signal for recording and archiving, and receive and parse the third audio signal for voice transcription. In another embodiment, the recording unit 40 may receive and parse the third audio signal for recording and archiving, and receive and parse the second audio signal for voice transcription. In the embodiment of the utility model, two different paths of audio signals are adopted for recording, archiving and voice transcription, and the two paths of audio signals cannot influence each other, thereby being beneficial to improving the sound quality definition of the audio file generated by recording, archiving and improving the recognition success rate and the transcription speed of the voice transcription.
In other embodiments, the second audio signal and the third audio signal may both be used for recording and archiving, or both be used for voice transcription; each of them may also be used for both recording and archiving and voice transcription; or one of them may be used only for recording and archiving or only for voice transcription while the other is used for both. It can be understood that when two different audio signals are used for recording and archiving or for voice transcription at the same time, two different files can be correspondingly generated, so that the recorded content is more comprehensive.
In the embodiment of the present invention, the audio processing means performed by the sound pickup unit 20 on the original audio signal includes, but is not limited to, a plurality of audio processing means such as gain processing, noise reduction processing, echo cancellation processing, and equalization processing, and the first audio processing, the second audio processing, and the third audio processing all include at least some of the plurality of audio processing means, but the combination of the audio processing means included in the first audio processing, the second audio processing, and the third audio processing is different, so that the first audio processing, the second audio processing, and the third audio processing become different audio processing technologies, and thus different first audio signals, second audio signals, and third audio signals can be correspondingly generated and used for the same or different purposes according to actual needs.
Specifically, in an embodiment, the first audio processing includes gain processing, noise reduction processing, echo cancellation processing, and equalization processing, the second audio processing includes gain processing and noise reduction processing, and the third audio processing includes gain processing. In another embodiment, the first audio processing comprises gain processing, noise reduction processing, echo cancellation processing, and equalization processing, the second audio processing comprises gain processing, and the third audio processing comprises gain processing and noise reduction processing. In the embodiment of the present invention, after the original audio signal is subjected to multiple processing such as gain, noise reduction, echo cancellation, and equalization, when the first audio signal generated correspondingly is externally played through the sound amplifying unit 30, the method has the advantages of stable signal, no noise, and good sound quality, and is beneficial to improving the listening experience of the user; after the original audio signal is subjected to gain processing and noise reduction processing, the second audio signal or the third audio signal generated correspondingly by the original audio signal is stable and the noise signal is removed, which is beneficial to forming audio with high sound quality definition and no noise when the second audio signal or the third audio signal completes recording and archiving through the recording unit 40; after the original audio is subjected to gain processing, the third audio signal or the second audio signal generated correspondingly by the original audio is stable, which is beneficial to the rapid recognition of the third audio signal or the second audio signal when the third audio signal or the second audio signal is subjected to voice transcription through the recording unit 40, thereby improving the efficiency of voice transcription.
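As a non-limiting illustration, the first of the two combinations described above can be sketched in Python as three processing chains applied to the same original audio signal. The stage implementations are simplified placeholders (a fixed gain factor, a crude noise gate, and pass-through echo cancellation and equalization), and the order of the stages within each chain is an assumption, since the embodiment lists the processing means without fixing their order.

```python
from typing import Callable, Dict, List

Frame = List[float]  # one block of audio samples (hypothetical representation)
Stage = Callable[[Frame], Frame]


def gain(frame: Frame, factor: float = 2.0) -> Frame:
    """Gain processing: scale every sample by a fixed factor."""
    return [factor * s for s in frame]


def noise_reduction(frame: Frame, threshold: float = 0.05) -> Frame:
    """Noise reduction stand-in: zero out samples below a threshold."""
    return [0.0 if abs(s) < threshold else s for s in frame]


def echo_cancellation(frame: Frame) -> Frame:
    """Placeholder: a real stage would use the echo cancellation reference signal."""
    return list(frame)


def equalization(frame: Frame) -> Frame:
    """Placeholder: a real stage would shape the frequency response."""
    return list(frame)


# Each audio processing is a different combination of the same processing means.
CHAINS: Dict[str, List[Stage]] = {
    "first": [gain, noise_reduction, echo_cancellation, equalization],
    "second": [gain, noise_reduction],
    "third": [gain],
}


def process(original: Frame, chain_name: str) -> Frame:
    """Run one original audio signal through the named processing chain."""
    frame = list(original)
    for stage in CHAINS[chain_name]:
        frame = stage(frame)
    return frame
```

The other combination described above is obtained by swapping the `"second"` and `"third"` entries of `CHAINS`.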
Further, in an embodiment of the present invention, the number of the sound pickup units 20 is multiple, each sound pickup unit 20 is configured to collect one path of the original audio signal and generate one path of the first audio signal, one path of the second audio signal, and one path of the third audio signal through corresponding audio processing, and the control unit 10 includes multiple sound mixers, specifically, a first sound mixer, a second sound mixer, and a third sound mixer.
In the audio processing method, "the control unit 10 receives and processes the first audio signal and the second audio signal" in step S2 and "the control unit 10 receives and processes the third audio signal" in step S4 specifically include: after the control unit 10 receives the multiple paths of first audio signals, multiple paths of second audio signals and multiple paths of third audio signals transmitted by the plurality of pickup units 20, the control unit 10 mixes the multiple paths of first audio signals through the first sound mixer, mixes the multiple paths of second audio signals through the second sound mixer, and mixes the multiple paths of third audio signals through the third sound mixer.
It can be understood that, after the sound mixing processing by the corresponding sound mixer, when the first audio signal is amplified and played by the sound amplifying unit 30, problems such as poor sound quality, echo, howling and noise in the played voice can be avoided, so that the played voice is clearer and stronger and the listening experience of the user is improved. Similarly, after the second audio signal or the third audio signal is recorded and archived by the recording unit 40, the resulting recording file avoids problems such as poor sound quality and low volume during playback, so that the playback better reproduces the speaker's voice and improves the listening experience of the user.
Further, in an embodiment of the present invention, the number of the conference systems 1 is at least two, and the at least two conference systems 1 include a first conference system and at least one second conference system communicatively connected to the first conference system. It should be noted that the first conference system and the second conference system refer to different parties participating in a teleconference, where the first conference system can be regarded as the party hosting the teleconference, and both the first conference system and the second conference system have all the features and functions of the conference system 1.
Each conference system 1 further comprises a communication unit 50 connected to the control unit 10, and the control unit 10 is further configured to send the first audio signal as a remote audio signal through the communication unit 50. Specifically, the control unit 10 of each conference system 1 further includes a fourth sound mixer, and the "control unit is further configured to take the first audio signal as a remote audio signal and send the remote audio signal through the communication unit 50" specifically includes:
after the control unit 10 of the conference system 1 receives the first audio signal transmitted by the sound pickup unit 20 of the conference system 1, the control unit 10 performs sound mixing processing on the first audio signal through the fourth sound mixer;
the control unit 10 takes the first audio signal after the sound mixing processing as a remote audio signal, and sends out the remote audio signal through the communication unit 50.
The audio processing method further comprises the steps of:
after the first conference system is in communication connection with at least one second conference system, the control unit 10 of the first conference system sends the remote audio signal to the control unit 10 of at least one second conference system;
the control unit 10 of the second conference system receives and processes the remote audio signal, and transmits the processed remote audio signal to the sound amplification unit 30 of the second conference system, and the sound amplification unit 30 of the second conference system receives and amplifies the remote audio signal and outputs the amplified remote audio signal.
The first sound mixer included in each conference system 1 is further configured to perform sound mixing processing on a remote audio signal received by the conference system 1, where the "control unit 10 of the second conference system receives and processes the remote audio signal, and transmits the processed remote audio signal to the sound amplification unit 30 of the second conference system, and the sound amplification unit 30 of the second conference system receives and amplifies the remote audio signal and outputs the amplified remote audio signal" specifically includes the following steps:
First step: the control unit 10 of the second conference system receives the remote audio signal and the first audio signal transmitted by the pickup unit 20 of the second conference system, and mixes the remote audio signal and the first audio signal through the first sound mixer.
Second step: the control unit 10 of the second conference system transmits the remote audio signal and the first audio signal after sound mixing processing to the sound amplification unit 30 of the second conference system.
Third step: the sound amplification unit 30 of the second conference system receives the remote audio signal and the first audio signal, amplifies the signals, and outputs the amplified signals.
In an embodiment of the present invention, the conference systems 1 are communicatively connected, and the remote audio signal is received and transmitted through the respective control unit 10 and communication unit 50, so that a teleconference can be held among the conference systems 1, which widens the application range of the conference system 1. Furthermore, the remote audio signal (i.e. the first audio signal) sent by the first conference system has undergone the echo cancellation processing of the first audio processing. This prevents a remote audio signal sent by any second conference system from being received by the first conference system, played, picked up again and transmitted back to that second conference system, so that the audio content played by the sound amplifying unit 30 of any conference system 1 is clean and free of echo, which helps improve the listening experience of users.
It should be noted that, when the audio processing method provided by the present invention is applied to the conference system 1, the conference system 1 further has other features and functions of the conference system 1 of any of the foregoing embodiments, for example, the camera function of the camera unit 60 and the display function of the display unit 70, and for a more detailed description, reference may be made to relevant contents of the conference system 1, which is not described again.
The above embodiments of the present invention are described in detail, and the principle and the implementation of the present invention are explained by applying specific embodiments, and the above description of the embodiments is only used to help understanding the method of the present invention and the core idea thereof; meanwhile, for a person skilled in the art, according to the idea of the present invention, there may be variations in the specific embodiments and the application scope, and in summary, the content of the present specification should not be construed as a limitation to the present invention.

Claims (11)

1. A conference system is characterized by comprising a control unit, and a sound pickup unit, a sound amplification unit and a recording unit which are respectively connected with the control unit;
the pickup unit is used for collecting an original audio signal, performing first audio processing on the original audio signal to generate a first audio signal, and performing second audio processing on the original audio signal to generate a second audio signal;
the control unit is used for receiving and processing the first audio signal and the second audio signal, and transmitting the processed first audio signal to the sound amplifying unit, and the sound amplifying unit is used for receiving the first audio signal, amplifying and outputting the first audio signal; the control unit is further configured to transmit the processed second audio signal to the recording unit, and the recording unit is configured to receive and analyze the second audio signal to perform recording, archiving and/or voice transcription;
wherein, the number of the sound pickup units is equal to or less than the number of the participants.
2. The conferencing system of claim 1, wherein the pickup unit is further configured to perform a third audio processing on the original audio signal to generate a third audio signal, wherein the control unit is further configured to receive and process the third audio signal and transmit the processed third audio signal to the recording unit, and wherein the recording unit is further configured to receive and parse the third audio signal for recording and archiving and/or voice transcription.
3. The conferencing system of claim 2, wherein the recording unit receives and parses the second audio signal for recording archiving, and the recording unit receives and parses the third audio signal for voice transcription.
4. The conferencing system of claim 2, wherein the first audio processing comprises gain processing, noise reduction processing, echo cancellation processing, and equalization processing, the second audio processing comprises gain processing and noise reduction processing, and the third audio processing comprises gain processing;
or, the first audio processing includes gain processing, noise reduction processing, echo cancellation processing, and equalization processing, the second audio processing includes gain processing, and the third audio processing includes gain processing and noise reduction processing.
5. The conference system as claimed in claim 2, wherein the pickup unit is plural, and each pickup unit is configured to collect a path of the original audio signal and generate a path of the first audio signal, a path of the second audio signal and a path of the third audio signal through corresponding audio processing;
the control unit comprises a plurality of sound mixers; after the control unit receives the multiple paths of first audio signals, the multiple paths of second audio signals and the multiple paths of third audio signals transmitted by the plurality of pickup units, the multiple paths of first audio signals are mixed by a first sound mixer, the multiple paths of second audio signals are mixed by a second sound mixer, and the multiple paths of third audio signals are mixed by a third sound mixer.
6. The conferencing system of any of claims 1 to 5, further comprising a communication unit connected to the control unit, the control unit further configured to transmit the first audio signal as a remote audio signal via the communication unit;
after the first conference system is in communication connection with at least one second conference system, the control unit of the first conference system is further configured to send the remote audio signal to the control unit of the at least one second conference system, the control unit of the second conference system is further configured to receive and process the remote audio signal, transmit the processed remote audio signal to the sound amplification unit of the second conference system, and the sound amplification unit of the second conference system receives the remote audio signal, amplifies the remote audio signal and outputs the amplified remote audio signal.
7. The conferencing system of claim 6, wherein the control unit of each of the conferencing systems comprises a first sound mixer;
after the control unit of the second conference system receives the remote audio signal and the first audio signal transmitted by the pickup unit of the second conference system, the remote audio signal and the first audio signal are subjected to sound mixing processing through the first sound mixer, and the remote audio signal and the first audio signal after the sound mixing processing are transmitted to the sound amplification unit of the second conference system.
8. The conference system as claimed in claim 6, wherein the control unit of each conference system comprises a fourth sound mixer, and after the control unit of the conference system receives the first audio signal transmitted by the sound pickup unit of the conference system, the fourth sound mixer performs sound mixing processing on the first audio signal, and the sound mixed processed first audio signal is used as the remote audio signal.
9. The conferencing system of claim 6, wherein the control unit of the first conferencing system comprises a teleconference controller for controlling the number of second conferencing systems communicatively connected to the first conferencing system.
10. A teleconferencing platform, comprising a conferencing system as claimed in any one of claims 1 to 9, and at least one remote conferencing system communicatively connected to the conferencing system;
wherein the remote conference system comprises a terminal device in communication connection with the conference system or another conference system in communication connection with the conference system.
11. The teleconferencing platform of claim 10, further comprising at least one cloud server, wherein the conference system and the at least one remote conference system are communicatively coupled via the at least one cloud server.
CN202121394714.7U 2021-06-22 2021-06-22 Conference system and remote conference platform Active CN216122672U (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202121394714.7U CN216122672U (en) 2021-06-22 2021-06-22 Conference system and remote conference platform

Publications (1)

Publication Number Publication Date
CN216122672U true CN216122672U (en) 2022-03-22

Family

ID=80720737

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202121394714.7U Active CN216122672U (en) 2021-06-22 2021-06-22 Conference system and remote conference platform

Country Status (1)

Country Link
CN (1) CN216122672U (en)

Legal Events

Date Code Title Description
GR01 Patent grant