CN211509180U - Multifunctional audio and video processing equipment - Google Patents

Multifunctional audio and video processing equipment Download PDF

Info

Publication number
CN211509180U
CN211509180U CN201921705101.3U CN201921705101U CN211509180U CN 211509180 U CN211509180 U CN 211509180U CN 201921705101 U CN201921705101 U CN 201921705101U CN 211509180 U CN211509180 U CN 211509180U
Authority
CN
China
Prior art keywords
module
audio
video
board card
coding
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201921705101.3U
Other languages
Chinese (zh)
Inventor
秦碧波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Skyvis Technologies Co ltd
Original Assignee
Beijing Skyvis Technologies Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Skyvis Technologies Co ltd filed Critical Beijing Skyvis Technologies Co ltd
Priority to CN201921705101.3U priority Critical patent/CN211509180U/en
Application granted granted Critical
Publication of CN211509180U publication Critical patent/CN211509180U/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The utility model discloses a multi-functional audio and video processing equipment relates to audio and video signal processing technology field. The apparatus comprises: the case is connected with the back plate; a system motherboard; controlling a board card; the audio board card comprises an audio input module, an audio matrix module and an audio output module; a video input board card; a video output board card; the voice recognition terminal board card, the coding board card, the network interaction board card, the system mainboard, the control board card, the audio board card, the video input board card, the video output board card, the voice recognition terminal board card, the coding board card and the network interaction board card are respectively connected with the connecting back board. The utility model provides a support multi-functional audio and video processing equipment of speech recognition function, with a plurality of audio and video processing module core processing module integration in equipment, handle the audio and video signal who gathers through single equipment and store in the hard disk after the conversion or give the backstage through network output, accomplish the function that many equipment could be accomplished in the past with single equipment.

Description

Multifunctional audio and video processing equipment
Technical Field
The disclosure relates to the technical field of audio and video signal processing, in particular to a multifunctional audio and video processing device.
Background
With the development of AI (artificial intelligence) technology in recent years, speech recognition technology is increasingly applied to occasions requiring on-site character recording, such as digital courtroom, digital conference room, intelligent classroom, and the like. The voices of a plurality of speakers on the spot are collected and automatically recognized into characters, and then the characters are output to character recording software such as court bookmaker software through a software interface, so that the workload of manually inputting the characters is greatly reduced. Because the existing multifunctional audio/video processing equipment is mainly used for recording and storing live audio/video, if a voice recognition function needs to be realized, taking a digital court as an example, in the related art, the solution shown in fig. 1 is generally adopted: a court is internally provided with at least 8 analog microphones, a plurality of cameras are used for video acquisition, a multifunctional audio and video processing device (supporting audio and video acquisition, processing, coding and storage), an 8-channel sound card, a gigabit network switch with 4 ports at the lowest, and local audio and video output devices such as a power amplifier, a sound box and a display. The sound card equipment has the main functions of connecting analog microphones, independently collecting analog audio signals of each microphone, producing digital audio streams, transmitting the digital audio streams to a voice recognition server through a network, returning recognized text information to the sound card in a character string mode, and outputting the recognized text information to writer writing software on a writer computer through a USB (universal serial bus) line or the network by the sound card. In order to transmit the sound of the field microphone to the sound card and to the multifunctional audio/video processing device, a multi-channel audio distributor and an audio matrix are added or the multifunctional audio/video processing device or the sound card is required to have an audio loop-out function, and since the general multifunctional audio/video processing device or the sound card does not have the multi-channel audio loop-out function, the multi-channel audio distributor and the audio matrix are mostly adopted for realization.
For the audio and video processing device in the related art, there are mainly some problems as follows: (1) the integration level of the equipment is poor, and in order to meet the functional requirements, an audio matrix, a sound card, a multifunctional audio and video processing device, audio connecting lines among the devices and the like need to be configured, so that the system deployment is complex, the workload of field device deployment, environment arrangement and the like is large, and the working efficiency is reduced; (2) there are more points of failure. Because a plurality of devices are used and dozens of connecting lines are simultaneously arranged, dozens of fault points exist in single project implementation, the project implementation requirement is high, the possibility of later-stage fault is high, and the later-stage project maintenance cost is increased; (3) the implementation cost is high. External audio switching equipment such as an audio matrix and the like need to be independently equipped for meeting the functional requirements, and meanwhile, the cost of the independent sound card and the multifunctional audio and video processing equipment is higher than that of equipment with higher integration degree. The large implementation workload also causes long implementation period and labor cost improvement.
Since the speech recognition function belongs to a technology which appears relatively late, the multifunctional audio and video processing equipment used in early digital courts, digital conference rooms, intelligent classrooms and the like has no way to be integrated when being developed. With the gradual maturity of the current voice recognition function, the requirement of assisting various word records in the field by voice recognition gradually becomes a standard requirement, and the requirement of integrating the voice recognition with the traditional multifunctional audio and video processing equipment is urgent.
Disclosure of Invention
The purpose of the present disclosure is to provide an audio/video processing device, which processes and converts the acquired audio/video signals through a single device and then stores the converted signals into a hard disk or outputs the converted signals to a background server through a network, thereby overcoming one or more problems caused by the limitations and defects of the related art at least to a certain extent. Additional features and advantages of the disclosure will be set forth in the detailed description which follows, or in part will be obvious from the description, or may be learned by practice of the disclosure.
According to a first aspect of the embodiments of the present disclosure, there is provided a multifunctional audio/video processing device, including a chassis, further including:
connecting the back plate;
a system motherboard;
controlling a board card;
the audio board card comprises an audio input module, an audio matrix module and an audio output module;
a video input board card;
a video output board card;
the voice recognition terminal board is connected with the audio board;
the coding board card is connected with the video input board card; and
the network interaction board card is connected with the voice recognition terminal board card;
the system mainboard, the control board card, the audio board card, the video input board card, the video output board card, the voice recognition terminal board card, the coding board card and the network interaction board card are respectively connected with the connecting back plate.
According to some embodiments, the control board card has a control interface, and the control interface includes a network interface, an infrared control interface, and an RS232 protocol control interface.
According to some embodiments, a high-definition video input module and a picture composition module are integrated in the video input board.
According to some embodiments, the system further comprises a storage module and a power module, wherein the storage module is connected with the system mainboard through a hard disk data line and used for storing the video file output by the system mainboard, and the power module is connected with the connection back plate through a power line.
According to a second aspect of the embodiments of the present disclosure, there is provided a multifunctional audio and video processing device, including:
the audio acquisition module is used for acquiring an external analog audio signal and carrying out audio processing on the analog audio signal;
the video acquisition module is used for acquiring an external high-definition video signal and carrying out video processing on the high-definition video signal;
the audio matrix module is connected with the audio acquisition module and is used for switching the analog audio signals accessed from the audio acquisition module into a plurality of paths of analog audio signals;
the voice recognition module is connected with the audio matrix module and is used for receiving the multi-channel analog audio signals output by the audio matrix module, converting the analog audio signals into digital audio signals and then outputting text information;
the coding module is connected with the audio matrix module and the video acquisition module and is used for converting the analog audio signals and the high-definition video signals output by the audio matrix module and the video acquisition module into audio codes and video codes;
the core processing module is connected with the coding module and the voice recognition module and is used for receiving the audio coding and the video coding output by the coding module, receiving the text information output by the voice recognition module, controlling and configuring the audio coding, the audio coding and the text information and outputting the audio coding, the audio coding and the text information;
and the network interaction module is connected with the core processing module, the voice recognition module and the coding module and is used for assisting data transmission among the core processing module, the voice recognition module and the coding module and providing network connection for the outside.
According to some embodiments, the video acquisition module is further provided with a video matrix module, and the video matrix module is used for switching the high-definition video signals accessed from the video acquisition module into multiple paths of high-definition video signals and then outputting the multiple paths of high-definition video signals to the encoding module.
According to some embodiments, the video matrix module is further connected to a picture synthesis module, and the picture synthesis module is configured to synthesize a plurality of high-definition video signals accessed from the video matrix module into a composite video signal, output the composite video signal back to the video matrix module, and transmit the composite video signal to the encoding module through the video matrix module.
According to some embodiments, the core processing module is further externally connected with a storage module, and the storage module is used for receiving the video file transmitted by the core processing module.
According to some embodiments, the audio capture module is capable of receiving at least 8 analog audio signals and the video capture module is capable of receiving at least 5 high definition video signals.
According to some embodiments, the encoding module is capable of receiving at least a 1-way audio signal input and a 1-way video signal input.
According to the technical scheme, the multifunctional audio and video processing equipment has the advantages and positive effects that:
the audio and video acquisition module, the audio and video matrix module, the voice recognition module, the coding module and the core processing module are integrated in the equipment, acquired audio and video signals are processed and converted by a single device and then stored in a hard disk or output to corresponding word processing software through a network, and the functions which can be completed by a plurality of previous equipment are completed by the single device.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the disclosure.
Drawings
FIG. 1 is a diagram illustrating connections of a plurality of devices in a speech recognition solution of the related art;
fig. 2 is a schematic diagram of data transmission of each module in a multifunctional audio/video processing device according to an embodiment of the present disclosure;
fig. 3 is a schematic diagram illustrating that each board card in a case of a multifunctional audio/video processing device is plugged with a connection backplane according to an embodiment of the disclosure;
FIG. 4 is a schematic view of a connector according to an embodiment of the present disclosure;
fig. 5 shows a back panel schematic of a device according to an embodiment of the present disclosure.
Detailed Description
The principles and spirit of the present invention will be described with reference to a number of exemplary embodiments. It is understood that these embodiments are given solely for the purpose of enabling those skilled in the art to better understand and to practice the invention, and are not intended to limit the scope of the invention in any way. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.
Exemplary embodiments that embody features and advantages of the present disclosure will be described in detail in the following description. It is to be understood that the disclosure is capable of various modifications in various embodiments without departing from the scope of the disclosure, and that the description and drawings are to be regarded as illustrative in nature, and not as restrictive.
The system is designed aiming at the strong requirements of occasions such as a digital conference room, a digital court and the like on a voice recognition function at present and overcoming the defects of increased implementation cost, increased fault points and the like caused by the need of deploying a plurality of devices such as a sound mixer, a sound card, traditional multifunctional audio and video processing equipment and the like in the traditional implementation process.
Specifically, please refer to fig. 2, which is a schematic diagram illustrating data transmission of each module in a multifunctional audio/video processing device according to an exemplary embodiment, where the multifunctional audio/video processing device includes an audio capture module 101, an audio matrix module 102, a speech recognition module 103, a video capture module 201, an audio matrix module 202, a picture synthesis module 203, an encoding module 30, a core processing module 40, a storage module 50, and a network interaction module 60.
The audio acquisition module 101 is configured to acquire an external analog audio signal and perform audio processing on the analog audio signal, and specifically, the audio acquisition module 101 may be connected to at least 8 analog microphones, acquire a microphone audio signal, perform audio processing such as noise reduction and gain adjustment, and output the processed audio signal to the audio matrix module 102.
The audio matrix module 102 is connected to the audio acquisition module 101 and configured to switch an analog audio signal accessed from the audio acquisition module 101 into multiple analog audio signals, the audio matrix module 102 may be combined with the audio acquisition module 101 into one module, and the audio matrix module 102 switches and outputs the input analog audio to the speech recognition module 103 and the encoding module 30, respectively. The audio matrix module 102 may support a minimum of 8 single microphone audio outputs to the speech recognition module 103 and at least 1 mixed audio output to the encoding module 30.
And the speech recognition module 103 is connected to the audio matrix module 102 and configured to receive multiple paths of analog audio signals output by the audio matrix module 102, convert the analog audio signals into digital audio signals, output text information, and transmit the digital audio signals to the core processing module 40 through a network.
The video acquisition module 201 is configured to acquire an external high-definition video signal and perform video processing on the high-definition video signal, and specifically, the video acquisition module 201 supports at least 5-channel input of a high-definition video input signal of a high-definition camera, and can output the video signal to the video matrix module 202.
The video matrix module 202, the video matrix module 202 is configured to switch a high-definition video signal accessed from the video acquisition module 201 into multiple high-definition video signals and output the multiple high-definition video signals to the encoding module 30, the video matrix module 202 may be combined with the video acquisition module 201 into one module, the video matrix module 202 also supports video signal input by the picture composition module 203, and may output the input high-definition video signal to the picture composition module 203 and the encoding module 30.
The picture synthesizing module 203 is configured to synthesize a plurality of high-definition video signals accessed from the video matrix module 202 into a composite video signal, output the composite video signal back to the video matrix module 202, and transmit the composite video signal to the encoding module 30 through the video matrix module 202.
The encoding module 30 is connected to the audio matrix module 102 and the video capture module 201, and is configured to convert the analog audio signals and the high-definition video signals output by the audio matrix module 102 and the video capture module 201 into audio codes and video codes, where the encoding module 30 supports at least 1 channel of audio and 1 channel of high-definition video signal input, generates a video stream, and outputs the video stream to the core processing module 40.
The core processing module 40 is connected with the coding module 30 and the voice recognition module 103, and is configured to receive the audio coding and the video coding output by the coding module 30, and receive the text information output by the voice recognition module 103, and output the audio coding, the audio coding and the text information after controlling and configuring, the core processing module 40 configures and controls the whole device, and outputs the generated video file to the storage module 50 for storage, and outputs the text information output by the voice recognition module 103 to the external word processing software.
And the network interaction module 60 is connected with the core processing module 40, the voice recognition module 103 and the coding module 30, and is configured to assist data transmission among the core processing module 40, the voice recognition module 103 and the coding module 30 and provide network connection for the outside.
The storage module 50 may be configured as a hard disk, and the storage module 50 is connected to the core processing module 40 through a hard disk data line for storing a video file.
Other functional modules, such as a decoding module (not shown), a centralized control module (not shown), and the like, may also be added to the multifunctional audio/video processing device.
Fig. 3 is a schematic diagram of each board card and a connection backplane in a case of a multifunctional audio/video processing device according to an embodiment of the present disclosure, each board card and the connection backplane are of a pluggable structure, a connection backplane a is vertically arranged in the middle of the case (not shown), a control board card B, an audio board card C, a video input board card D, a video output board card E, a power supply F, a voice recognition terminal board card G, a network switch board card H, a coding board card I, a system board J and a hard disk K are plugged in the connection backplane a, and each board card and the connection backplane are connected by one to a plurality of 30PIN connectors.
In one embodiment, the device is a pluggable structure, the middle of the chassis is a vertical connection backplane, and one or more connectors as shown in fig. 4 are used for connecting each board and the connection backplane. Wherein, the connector can select multirow, and the corresponding PIN quantity as required.
In one embodiment, the control board B is configured to receive and provide external control interfaces, including a network interface, an infrared control interface, an RS232 protocol control interface, and the like. The audio board card C comprises an audio input module, a sound mixing matrix module and an audio output module. The video input board card D comprises a high-definition video input module and a picture synthesis module, and provides high-definition video input and picture synthesis functions in various formats. The video output board card E comprises a video matrix module and provides high-definition video output in various formats. The system mainboard J comprises a core processing module and provides overall control and processing for the operation of the whole host. The coding board card I comprises a coding module and provides multi-path audio and video coding. The network interaction board card H comprises a network interaction module and provides a network interaction function for data transmission among a plurality of modules in the equipment. The voice recognition terminal board card G comprises a voice recognition module, converts multi-channel analog audio output by the audio matrix into a digital audio file, transmits the digital audio file to a voice video server at the cloud end, and transmits a recognition result to external application software such as bookmarker software through a network for use. The hard disk K is used as a storage module and is connected with the system mainboard J through a hard disk data line to store the video file. The power module F is connected with the connecting backboard A through a power line to provide power for the whole equipment.
Fig. 5 shows a schematic rear panel of an apparatus employing this configuration in one embodiment.
In summary, the multifunctional audio/video processing device disclosed by the present disclosure is used for local audio/video acquisition, processing, encoding, and storage, and simultaneously supports voice recognition and outputs the recognition result to the background through the network.
The implementing personnel only need according to the operation instruction of this equipment, inserts this equipment with all kinds of audio signal, exports the audio signal of this equipment and just can accomplish the deployment of this equipment for corresponding equipment, greatly reduced engineering implementation complexity and degree of difficulty, also greatly reduced to implementing personnel's quality requirement.
Meanwhile, as the equipment is single, the failure rate is greatly reduced compared with the traditional scheme, and the maintenance cost is also greatly reduced. Compared with the total cost of a plurality of devices in the traditional scheme, the production cost of a single device is greatly reduced, and the waste of resources and the purchase cost of a user are reduced. The implementation difficulty is reduced, so that the implementation period of the equipment is greatly shortened compared with the traditional scheme, the working efficiency is improved, and the implementation cost is reduced.
The single device replaces the stacking of a plurality of devices in the traditional scheme, so that the learning difficulty of a user is reduced, and the using method of the device can be mastered quickly. Meanwhile, the difficulty of finding fault points by a user is reduced by single equipment, and the problem can be quickly solved by the user or through remote guidance of technicians.
Other embodiments of the disclosure will be apparent to those skilled in the art from consideration of the specification and practice of the disclosure disclosed herein. This application is intended to cover any variations, uses, or adaptations of the disclosure following, in general, the principles of the disclosure and including such departures from the present disclosure as come within known or customary practice within the art to which the disclosure pertains. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the disclosure being indicated by the following claims.
While the present disclosure has been described with reference to several exemplary embodiments, it is understood that the terminology used is intended to be in the nature of words of description and illustration, rather than of limitation. As the present disclosure may be embodied in several forms without departing from the spirit or essential characteristics thereof, it should also be understood that the above-described embodiments are not limited by any of the details of the foregoing description, but rather should be construed broadly within its spirit and scope as defined in the appended claims, and therefore all changes and modifications that fall within the meets and bounds of the claims, or equivalences of such meets and bounds are therefore intended to be embraced by the appended claims.

Claims (6)

1. A multifunctional audio-video processing device, comprising:
the audio acquisition module is used for acquiring an external analog audio signal and carrying out audio processing on the analog audio signal;
the video acquisition module is used for acquiring an external high-definition video signal and carrying out video processing on the high-definition video signal;
the audio matrix module is connected with the audio acquisition module and used for switching the analog audio signals accessed from the audio acquisition module into a plurality of paths of analog audio signals;
the voice recognition module is connected with the audio matrix module and used for receiving the multi-channel analog audio signals output by the audio matrix module, converting the analog audio signals into digital audio signals and then outputting text information;
the coding module is connected with the audio matrix module and the video acquisition module and is used for converting the analog audio signals and the high-definition video signals output by the audio matrix module and the video acquisition module into audio codes and video codes;
the core processing module is connected with the coding module and the voice recognition module and is used for receiving the audio coding and the video coding output by the coding module, receiving the text information output by the voice recognition module, controlling and configuring the audio coding, the audio coding and the text information and outputting the audio coding, the audio coding and the text information;
and the network interaction module is connected with the core processing module, the voice recognition module and the coding module and is used for assisting data transmission among the core processing module, the voice recognition module and the coding module and providing network connection for the outside.
2. A multifunctional audio-video processing device according to claim 1, characterized in that: the video acquisition module is also internally provided with a video matrix module which is used for switching the high-definition video signals accessed from the video acquisition module into a plurality of paths of high-definition video signals and then outputting the signals to the encoding module.
3. A multifunctional audio-video processing device according to claim 2, characterized in that: the video matrix module is also connected with a picture synthesis module, and the picture synthesis module is used for synthesizing a plurality of paths of high-definition video signals accessed from the video matrix module into a path of composite video signal, outputting the composite video signal back to the video matrix module, and transmitting the composite video signal to the encoding module by the video matrix module.
4. A multifunctional audio-video processing device according to claim 1, characterized in that: the core processing module is also externally connected with a storage module, and the storage module is used for receiving the video file transmitted by the core processing module.
5. A multifunctional audio-video processing device according to claim 1, characterized in that: the audio acquisition module can receive 8 way analog audio signal at least, video acquisition module can receive 5 way high definition video signal at least.
6. A multifunctional audio-video processing device according to claim 1, characterized in that: the encoding module is capable of receiving at least 1 audio signal input and 1 video signal input.
CN201921705101.3U 2019-10-12 2019-10-12 Multifunctional audio and video processing equipment Active CN211509180U (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201921705101.3U CN211509180U (en) 2019-10-12 2019-10-12 Multifunctional audio and video processing equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201921705101.3U CN211509180U (en) 2019-10-12 2019-10-12 Multifunctional audio and video processing equipment

Publications (1)

Publication Number Publication Date
CN211509180U true CN211509180U (en) 2020-09-15

Family

ID=72406536

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201921705101.3U Active CN211509180U (en) 2019-10-12 2019-10-12 Multifunctional audio and video processing equipment

Country Status (1)

Country Link
CN (1) CN211509180U (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114143478A (en) * 2021-11-25 2022-03-04 广州林电智能科技有限公司 Multifunctional audio and video processing terminal
CN115113573A (en) * 2022-08-16 2022-09-27 广州市保伦电子有限公司 Digital conference host and conference control method

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114143478A (en) * 2021-11-25 2022-03-04 广州林电智能科技有限公司 Multifunctional audio and video processing terminal
CN115113573A (en) * 2022-08-16 2022-09-27 广州市保伦电子有限公司 Digital conference host and conference control method
CN115113573B (en) * 2022-08-16 2024-03-08 广东保伦电子股份有限公司 Conference control method of digital conference host

Similar Documents

Publication Publication Date Title
CN101309390B (en) Visual communication system, apparatus and subtitle displaying method
CN105959613A (en) Digital conference equipment and system
CN211509180U (en) Multifunctional audio and video processing equipment
CN102984496B (en) The processing method of the audiovisual information in video conference, Apparatus and system
CN102006453A (en) Superposition method and device for auxiliary information of video signals
CN205584362U (en) Conference system
CN101742218A (en) Implementation method of videophone system among IPTV set top boxes
CN104301657A (en) Conference television terminal and auxiliary flow data access method thereof
CN201698607U (en) Interactive English teaching examination device for voice classroom
CN101212677B (en) Serial multimedia stream handling device and serial stream transmission method
CN202103781U (en) Multifunctional video conferencing system
CN206402344U (en) One kind is based on 3D scene Video Conference Systems
CN210606151U (en) Multi-functional wisdom education all-in-one
CN205883421U (en) Distance learning intelligent terminal
CN218547286U (en) Teaching centralized control system and device
CN112040195A (en) Digital centralized control equipment and method
CN201409215Y (en) Microphone sound-mixing device
CN103313029A (en) Terminal for video conference
CN112071132B (en) Audio and video teaching equipment and intelligent teaching system
CN203434995U (en) Electronic sound console with multi-output channels
CN102954558A (en) Method and device for controlling air conditioner
CN107040769A (en) Portable minisize intelligent sound projecting apparatus and its control method
CN202918420U (en) Network monitoring and network advertising all-in-one machine capable of realizing remote monitoring
CN211015980U (en) Multimedia classroom teaching auxiliary system
CN101640648A (en) Device and method for sharing television signal through instant communication service

Legal Events

Date Code Title Description
GR01 Patent grant
GR01 Patent grant