CN219627776U - Video synthesis system - Google Patents
Video synthesis system Download PDFInfo
- Publication number
- CN219627776U CN219627776U CN202223297076.4U CN202223297076U CN219627776U CN 219627776 U CN219627776 U CN 219627776U CN 202223297076 U CN202223297076 U CN 202223297076U CN 219627776 U CN219627776 U CN 219627776U
- Authority
- CN
- China
- Prior art keywords
- module
- video
- input module
- electrically connected
- voice
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Landscapes
- Transforming Electric Information Into Light Information (AREA)
Abstract
The utility model discloses a video synthesis system which comprises an input module, a video synthesis module, a video release module and a liquid crystal display screen, wherein the output end of the input module is electrically connected with the input end of the video synthesis module, the output end of the video synthesis module is electrically connected with the input end of the video release module, and the output end of the video release module is electrically connected with the liquid crystal display screen. The beneficial effects of the utility model are as follows: the news event is input through the input module and is sent to the video synthesis module for processing, and is transmitted to the video release module after being processed, and then the news event is sent to the liquid crystal display for display by the video release module, so that complicated video processing by manpower is avoided, and the timeliness of news is improved.
Description
Technical Field
The utility model relates to the technical field of video synthesis, in particular to a video synthesis system.
Background
The intelligent video AI synthesis system is an epoch-making product inoculated in the epoch background of Internet information blowout and media fusion, and aims to create a set of full-intelligent and full-automatic media making system, namely, the full process of media making is enabled by utilizing the robot automation technology, and the making efficiency and the program quality are improved. At present, news production mainly relies on manpower to collect and shoot materials, professional and tedious software is utilized to edit and synthesize videos, the efficiency is quite low, and usually 3-5 persons are required to produce one video, and the production can be completed in two days.
Disclosure of Invention
The utility model aims to overcome the defects of the prior art and provides a video synthesis system.
The aim of the utility model is achieved by the following technical scheme: the video synthesis system comprises an input module, a video synthesis module, a video release module and a liquid crystal display screen, wherein the output end of the input module is electrically connected with the input end of the video synthesis module, the output end of the video synthesis module is electrically connected with the input end of the video release module, and the output end of the video release module is electrically connected with the liquid crystal display screen.
Preferably, the input module comprises a text input module and a voice input module, wherein the text input module and the voice input module are electrically connected with the video synthesis module, the text input module is used for inputting text information, and the voice input module is used for inputting voice information.
Preferably, the video synthesis module comprises an information labeling module, a voice extraction module, a voice recognition module and a lens switching module, and the information labeling module, the voice extraction module, the voice recognition module and the lens switching module are electrically connected with the text input module and the voice input module.
Preferably, the information labeling module comprises a face recognition module, an article detection module and an OCR module, and the face recognition module, the article detection module and the OCR module are electrically connected with the text input module and the voice input module.
Preferably, the liquid crystal display screen is arranged on the front side of the frame-type video synthesis device, an operation console is arranged at the lower end of the front side of the frame-type video synthesis device, the operation console is used for placing a mouse and a keyboard, and the keyboard is electrically connected with the text input module.
Preferably, the input end of the rack-mounted video synthesis equipment is provided with a gigabit network port, and the output end of the rack-mounted video synthesis equipment is provided with a standard high-definition video interface.
Preferably, the rack-mounted video synthesizing device is provided with a power switch, and the power switch is positioned at the front side of the rack-mounted video synthesizing device.
The utility model has the following advantages: according to the utility model, news events are input through the input module and are sent to the video synthesis module for processing, and then are transmitted to the video release module, and then are sent to the liquid crystal display for display by the video release module, so that complicated video processing by manpower is avoided, and the timeliness of news is improved.
Drawings
FIG. 1 is a schematic diagram of the electrical principle of a video compositing system;
FIG. 2 is a schematic diagram of the structure of a front panel of a rack-mounted video compositing apparatus;
FIG. 3 is a schematic view of the structure of a rear panel of a rack-mounted video compositing apparatus;
in the figure, a 1-power switch, a 2-liquid crystal display screen, a 3-operation desk, a 4-gigabit network port, a 5-standard high-definition video interface, a 6-input module, a 7-character input module, an 8-voice input module, a 9-video synthesis module and a 10-video release module.
Detailed Description
For the purpose of making the objects, technical solutions and advantages of the embodiments of the present utility model more apparent, the technical solutions of the embodiments of the present utility model will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present utility model, and it is apparent that the described embodiments are some embodiments of the present utility model, but not all embodiments. The components of the embodiments of the present utility model generally described and illustrated in the figures herein may be arranged and designed in a wide variety of different configurations.
Thus, the following detailed description of the embodiments of the utility model, as presented in the figures, is not intended to limit the scope of the utility model, as claimed, but is merely representative of selected embodiments of the utility model. All other embodiments, based on the embodiments of the utility model, which are apparent to those of ordinary skill in the art without inventive faculty, are intended to be within the scope of the utility model.
In addition, the embodiments of the present utility model and the features of the embodiments may be combined with each other without collision.
It should be noted that: like reference numerals and letters denote like items in the following figures, and thus once an item is defined in one figure, no further definition or explanation thereof is necessary in the following figures.
In the description of the present utility model, it should be noted that, directions or positional relationships indicated by terms such as "center", "upper", "lower", "left", "right", "vertical", "horizontal", "inner", "outer", etc., are directions or positional relationships based on those shown in the drawings, or are directions or positional relationships conventionally put in use of the inventive product, or are directions or positional relationships conventionally understood by those skilled in the art, are merely for convenience of describing the present utility model and for simplifying the description, and are not to indicate or imply that the apparatus or element to be referred to must have a specific direction, be constructed and operated in a specific direction, and thus should not be construed as limiting the present utility model. Furthermore, the terms "first," "second," and the like, are used merely to distinguish between descriptions and should not be construed as indicating or implying relative importance.
In the description of the present utility model, it should also be noted that, unless explicitly specified and limited otherwise, the terms "disposed," "mounted," "connected," and "connected" are to be construed broadly, and may be, for example, fixedly connected, detachably connected, or integrally connected; can be mechanically or electrically connected; can be directly connected or indirectly connected through an intermediate medium, and can be communication between two elements. The specific meaning of the above terms in the present utility model will be understood in specific cases by those of ordinary skill in the art.
In this embodiment, as shown in fig. 1, a video composition system includes an input module 6, a video composition module 9, a video distribution module 10 and a liquid crystal display 2, wherein an output end of the input module 6 is electrically connected with an input end of the video composition module 9, an output end of the video composition module 9 is electrically connected with an input end of the video distribution module 10, and an output end of the video distribution module 10 is electrically connected with the liquid crystal display 2. The news event is input through the input module 6 and is sent to the video synthesis module 9 for processing, and is transmitted to the video release module 10 after being processed, and then the news event is sent to the liquid crystal display 2 for display by the video release module 10, so that complicated video processing by manpower is avoided, and the timeliness of news is improved.
Further, the input module 6 includes a text input module 7 and a voice input module 8, the text input module 7 and the voice input module 8 are electrically connected with the video synthesis module 9, the text input module 7 is used for inputting text information, and the voice input module 8 is used for inputting voice information.
In this embodiment, the video synthesis module 9 includes an information labeling module, a voice extraction module, a voice recognition module, and a lens switching module, which are all electrically connected with the text input module 7 and the voice input module 8. Specifically, the purpose of the voice extraction module is to extract the audio information in the video file for performing the next voice recognition, the video is usually composed of two parts of video stream and audio stream, and is packaged into the formats of MKV, MP4 and the like, the codes of the audio stream in the video are usually wav, MP3 and the like, and the audio in the wav format is extracted from the video by the existing tool for performing the subsequent voice recognition processing flow; after the audio information is extracted, the audio information is transmitted to a voice recognition module for processing, the voice recognition module aims at converting a voice signal into a feature vector through feature extraction based on an audio file, an acoustic model is used for measuring the distance between voice features and texts, a mapping relation from acoustic symbols to character strings is established by using a neural network, chinese characters are directly used as output for Chinese characters, and letters are used as output labels for English; the main function of the shot switching module is that the shot refers to a segment recorded by the camera which is shot uninterruptedly from startup to shutdown, the video comprises a plurality of segments, and the shot switching module cuts the source videos by taking the shot as a unit. In this embodiment, the voice extraction module, the voice recognition module and the lens switching module are all in the prior art, and the processing method thereof is also an existing method, and the processing method is not improved here, and will not be described again here.
Further, the information labeling module comprises a face recognition module, an article detection module and an OCR module, and the face recognition module, the article detection module and the OCR module are electrically connected with the text input module 7 and the voice input module 8. Specifically, after news data is input through a text input module 7 and a voice input module 8, a face recognition module performs face detection, inputs pictures of specific frames of video, outputs coordinate information of face positions, performs face alignment, uniformly calibrates the faces by detecting key points in the faces so as to eliminate errors caused by different gestures, encodes the pictures of the faces to be identified, encodes the pictures to form a list, compares the encoding measurement similarity of a pre-built important person identity library, identifies important person identities and obtains important person labels; the object detection module determines important objects (targets) and position information in the current video through a pre-trained important object (target) model, so that important object (target) labels in the video are obtained; the OCR module detects all possible text line area positions, then carries out a correcting operation on the text lines detected in the pictures through the direction classifier, then carries out text recognition, recognizes the characters of the detected possible text areas, and finally carries out post-processing flow to finish text correction and text structuring, thereby obtaining character information in video content, avoiding complex video processing by manpower and improving the timeliness of news. In this embodiment, the face recognition module, the object detection module, and the OCR module are all in the prior art, and the processing method thereof is also an existing method, which is not improved here, and will not be described here again.
In this embodiment, the liquid crystal display 2 is disposed on the front side of the rack-mounted video synthesis device, and the lower end of the front side of the rack-mounted video synthesis device is provided with an operation console 3, the operation console 3 is used for placing a mouse and a keyboard, and the keyboard is electrically connected with the text input module 7.
Furthermore, the input end of the rack-mounted video synthesis equipment is provided with a gigabit network port 4, and the output end of the rack-mounted video synthesis equipment is provided with a standard high-definition video interface 5. Specifically, the gigabit network port 4 and the standard high-definition video interface 5 can be connected with any third-party input system and any third-party output system, so that system integration is realized.
Still further, the rack-mounted video synthesizing apparatus is provided with a power switch 1, and the power switch 1 is located at the front side of the rack-mounted video synthesizing apparatus.
Although the present utility model has been described with reference to the foregoing embodiments, it will be apparent to those skilled in the art that modifications may be made to the embodiments described, or equivalents may be substituted for elements thereof, and any modifications, equivalents, improvements and changes may be made without departing from the spirit and principles of the present utility model.
Claims (7)
1. A video composition system, characterized by: the video distribution system comprises an input module (6), a video synthesis module (9), a video distribution module (10) and a liquid crystal display (2), wherein the output end of the input module (6) is electrically connected with the input end of the video synthesis module (9), the output end of the video synthesis module (9) is electrically connected with the input end of the video distribution module (10), and the output end of the video distribution module (10) is electrically connected with the liquid crystal display (2).
2. A video compositing system as defined in claim 1, wherein: the input module (6) comprises a text input module (7) and a voice input module (8), wherein the text input module (7) and the voice input module (8) are electrically connected with the video synthesis module (9), the text input module (7) is used for inputting text information, and the voice input module (8) is used for inputting voice information.
3. A video compositing system according to claim 2, wherein: the video synthesis module (9) comprises an information labeling module, a voice extraction module, a voice recognition module and a lens switching module, wherein the information labeling module, the voice extraction module, the voice recognition module and the lens switching module are electrically connected with the text input module (7) and the voice input module (8).
4. A video compositing system according to claim 3, wherein: the information labeling module comprises a face recognition module, an article detection module and an OCR module, wherein the face recognition module, the article detection module and the OCR module are electrically connected with the text input module (7) and the voice input module (8).
5. A video composition system as defined in claim 4 wherein: the liquid crystal display (2) is arranged on the front side of the frame-type video synthesis device, an operation table (3) is arranged at the lower end of the front side of the frame-type video synthesis device, the operation table (3) is used for placing a mouse and a keyboard, and the keyboard is electrically connected with the text input module (7).
6. A video compositing system as defined in claim 5, wherein: the input end of the rack-mounted video synthesis equipment is provided with a gigabit network port (4), and the output end of the rack-mounted video synthesis equipment is provided with a standard high-definition video interface (5).
7. A video composition system as defined in claim 6, wherein: the frame-type video synthesis equipment is provided with a power switch (1), and the power switch (1) is positioned at the front side of the frame-type video synthesis equipment.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202223297076.4U CN219627776U (en) | 2022-12-09 | 2022-12-09 | Video synthesis system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202223297076.4U CN219627776U (en) | 2022-12-09 | 2022-12-09 | Video synthesis system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN219627776U true CN219627776U (en) | 2023-09-01 |
Family
ID=87768981
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202223297076.4U Active CN219627776U (en) | 2022-12-09 | 2022-12-09 | Video synthesis system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN219627776U (en) |
-
2022
- 2022-12-09 CN CN202223297076.4U patent/CN219627776U/en active Active
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102542268B (en) | Method for detecting and positioning text area in video | |
US8676562B2 (en) | Communication support apparatus and method | |
Fragoso et al. | TranslatAR: A mobile augmented reality translator | |
US6441825B1 (en) | Video token tracking system for animation | |
CN104505091A (en) | Human-machine voice interaction method and human-machine voice interaction system | |
CN113052169A (en) | Video subtitle recognition method, device, medium, and electronic device | |
CN104796584A (en) | Prompt device with voice recognition function | |
CN106527945A (en) | text information extraction method and device | |
CN110148418B (en) | Scene record analysis system, method and device | |
CN111402885A (en) | Interactive method and system based on voice and air imaging technology | |
CN106161873A (en) | A kind of video information extracts method for pushing and system | |
CN112257513A (en) | Training method, translation method and system for sign language video translation model | |
CN115988149A (en) | Method for generating video by AI intelligent graphics context | |
CN219627776U (en) | Video synthesis system | |
WO2013152682A1 (en) | Method for tagging news video subtitles | |
Zahedi et al. | Continuous sign language recognition–approaches from speech recognition and available data resources | |
CN117474886A (en) | Ceramic cup defect detection method and system | |
CN115438223B (en) | Video processing method, device, electronic equipment and storage medium | |
CN113591519A (en) | Gesture recognition processing method | |
JP2002259908A (en) | Written data processing system, written data processing server and written data processing device | |
CN115686198A (en) | Convenient high-precision human-computer interaction system | |
CN104680159A (en) | Note prompting system and method for intelligent glasses | |
CN114580429A (en) | Artificial intelligence-based language and image understanding integrated service system | |
CN204559707U (en) | There is the prompter device of speech identifying function | |
KR20230126829A (en) | Apparatus for generating a highlight image using scroll velocity and method thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
GR01 | Patent grant | ||
GR01 | Patent grant |