CN108449649A - A kind of video caption generation method - Google Patents
A kind of video caption generation method
- Publication number
- CN108449649A (application CN201810331983.5A)
- Authority
- CN
- China
- Prior art keywords
- sound
- video
- subtitle
- word
- dialogue
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/439—Processing of audio elementary streams
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/488—Data services, e.g. news ticker
- H04N21/4884—Data services, e.g. news ticker for displaying subtitles
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/222—Studio circuitry; Studio devices; Studio equipment
- H04N5/262—Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
- H04N5/278—Subtitling
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Studio Circuits (AREA)
Abstract
The present invention provides a video caption generation method that obtains video sound dialogue information, converts the obtained video sound dialogue information into text, and generates an SRT subtitle file from the converted dialogue text and time codes. The advantages of the invention are that subtitle production carried out with it offers high safety, saves labor costs, is easy to operate, greatly improves subtitle production efficiency, and reduces the difficulty of subtitle production.
Description
Technical field
The invention relates mainly to the field of multimedia technology, and in particular to a video caption generation method.
Background art
With the continuous development of video and multimedia technology, subtitle production has become increasingly common in television program production; its technical requirements keep rising and its uses keep widening. Subtitle production is most widely used for TV station program subtitles and for film and television drama subtitles; it is also involved in activities such as video surveillance, civilian advertising, and wedding video recording.
In general, any work that adds text, pictures, animation, special effects, or similar information to an original video can be called subtitle production; put simply, wherever subtitles appear, subtitle production is involved. Subtitles are the non-visual content of television, film, and stage works, such as dialogue, displayed in written form; the term also covers text added during the post-production of film and television works. For example, the dialogue subtitles of a film or television work generally appear at the bottom of the screen. The spoken content of a program can also be displayed as subtitles, which helps viewers with weaker hearing understand the program. During subtitle production, the user terminal usually needs to reference a large amount of subtitle material on a server. This material exists in the form of subtitle material files, and when these files are processed, a CompanionFile often has to be generated for each of them so that the user can view and edit it, which involves, for example, parsing and rendering the subtitle material file. However, parsing and rendering subtitle material files takes considerable time.
Traditional subtitle production is done with a subtitle machine (titler); its most widespread uses are program subtitle production at TV stations and subtitle production for films and TV dramas. At present, subtitle production and playout on TV station broadcast lines is either completed by an operator or assisted by simple controls.
This subtitle production method has the following disadvantages:
First, low safety: because operators edit the subtitle list manually, human editing accidents occur easily;
Second, wasted personnel: it requires multiple people working multiple shifts across multiple channels every day;
Third, high specialization: it requires good listening comprehension and is difficult to operate.
Summary of the invention
To solve the above problems, the present invention provides a video caption generation method, characterized by comprising:
Obtaining video sound dialogue information;
Processing the video sound dialogue information, in which the obtained video sound dialogue information is converted into text;
Generating a subtitle file, in which an SRT subtitle file is generated from the converted dialogue text and time codes.
Wherein, obtaining the video sound dialogue information comprises obtaining video data and sound data through decoding; the sound data includes a sound timeline.
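The patent does not name a decoder. As a minimal sketch, assuming the widely used ffmpeg command-line tool does the decoding, the following builds the argument list that extracts a video's audio track as mono WAV. The function name, file names, and the 16 kHz sample rate are illustrative assumptions. Because decoded samples keep their order, sample number n corresponds to time n / sample_rate seconds, which is one way to understand the sound timeline.

```python
def build_audio_extract_command(video_path, wav_path, sample_rate=16000):
    """Return an ffmpeg argument list that decodes the audio track to mono WAV."""
    return [
        "ffmpeg",
        "-y",                     # overwrite the output file if it exists
        "-i", video_path,         # input video container
        "-vn",                    # drop the video stream, keep audio only
        "-ac", "1",               # downmix to a single channel
        "-ar", str(sample_rate),  # resample to the requested rate
        wav_path,
    ]


# Hypothetical file names, used only for illustration.
command = build_audio_extract_command("episode.mp4", "episode.wav")

# To actually run it (requires ffmpeg on PATH):
#   import subprocess
#   subprocess.run(command, check=True)
```

Building the command as a list rather than a shell string avoids quoting problems when file names contain spaces.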
Wherein, processing the video sound dialogue information further comprises optimizing the obtained video sound, converting the sound into dialogue text, and converting the sound timeline into the time codes of the SRT subtitle file.
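SRT time codes have the form HH:MM:SS,mmm (hours, minutes, seconds, then a comma and milliseconds). The patent does not say how the sound timeline is represented; assuming positions on it are given in seconds, the conversion could be sketched as follows. The function name is hypothetical.

```python
def to_srt_timecode(seconds):
    """Format a non-negative time in seconds as an SRT time code HH:MM:SS,mmm."""
    if seconds < 0:
        raise ValueError("SRT time codes cannot be negative")
    total_ms = round(seconds * 1000)          # work in whole milliseconds
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, millis = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


print(to_srt_timecode(3661.5))  # 01:01:01,500
```

Converting to integer milliseconds first keeps the later divisions exact and avoids floating-point drift in the formatted fields.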
Wherein, generating the subtitle file further comprises modification, editing, and optimization.
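The patent does not list which modifications are supported. One plausible editing operation, shown here purely as an assumption, is re-timing all subtitles by a fixed offset when they run early or late relative to the picture. The sketch represents each subtitle as a hypothetical (start_ms, end_ms, text) tuple.

```python
def shift_cues(cues, offset_ms):
    """Return new (start_ms, end_ms, text) cues moved by offset_ms.

    Cues that would end at or before time zero are dropped; start times
    that would become negative are clamped to zero.
    """
    shifted = []
    for start_ms, end_ms, text in cues:
        new_end = end_ms + offset_ms
        if new_end <= 0:
            continue  # the cue would finish before the video starts
        shifted.append((max(0, start_ms + offset_ms), new_end, text))
    return shifted


cues = [(200, 900, "Hi."), (1500, 3000, "How are you?")]
later = shift_cues(cues, 500)      # delay every subtitle by half a second
earlier = shift_cues(cues, -1000)  # advance every subtitle by one second
```

The function returns a new list instead of mutating its input, so an editor could keep the original timing available for undo.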
The present invention also provides a video caption generation system, comprising a sound extraction module, a sound processing module, and a subtitle file editing module. The sound extraction module obtains the video sound dialogue information, including the corresponding sound timeline information; the sound processing module converts the obtained video sound dialogue information into text and time codes; and the subtitle file editing module generates an SRT subtitle file from the converted dialogue text and time codes.
In conclusion compared with prior art, the invention has the advantages that carrying out subtitle making through the invention has
It is safe, save human cost, easy to operate, greatly improve subtitle producing efficiency and reduce the difficulty of subtitle making.
Description of the drawings
Fig. 1 is a flow diagram illustrating a video caption generation method of the present invention.
Detailed description
The present invention is described in detail below with reference to the accompanying drawing and in conjunction with the embodiments. It should be understood that the specific embodiments described here serve only to illustrate and explain the invention and are not intended to limit it.
This embodiment provides a video caption generation method. Fig. 1 is a flowchart of the video caption generation method according to an embodiment of the invention. As shown in Fig. 1, the method comprises the following steps:
Step S101: obtain video sound dialogue information;
Step S102: process the video sound dialogue information, converting the obtained video sound dialogue information into text;
Step S103: generate a subtitle file, in which an SRT subtitle file is generated from the converted dialogue text and time codes.
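The output of step S103 is an SRT file, a plain-text format in which each subtitle consists of a running index, a timing line of the form `start --> end`, the text, and a blank line. A minimal sketch of that serialization follows; the `Cue` class and `render_srt` function are hypothetical names, and the time codes are assumed to be already formatted.

```python
from dataclasses import dataclass


@dataclass
class Cue:
    """One subtitle entry: pre-formatted SRT time codes plus the dialogue text."""
    start: str  # e.g. "00:00:01,000"
    end: str    # e.g. "00:00:03,500"
    text: str


def render_srt(cues):
    """Serialize cues as SRT text: index, 'start --> end', text, blank line."""
    blocks = []
    for index, cue in enumerate(cues, start=1):
        blocks.append(f"{index}\n{cue.start} --> {cue.end}\n{cue.text}\n")
    return "\n".join(blocks)


cues = [
    Cue("00:00:01,000", "00:00:03,500", "Hello."),
    Cue("00:00:04,000", "00:00:06,250", "Welcome back."),
]
srt_text = render_srt(cues)

# Writing the result to disk would then be, for example:
#   with open("episode.srt", "w", encoding="utf-8") as handle:
#       handle.write(srt_text)
```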
Specifically, obtaining the video sound dialogue information in step S101 comprises obtaining video data and sound data through decoding; the sound data includes a sound timeline.
Specifically, processing the video sound dialogue information in step S102 further comprises optimizing the obtained video sound, converting the sound into dialogue text, and converting the sound timeline into the time codes of the SRT subtitle file.
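The patent does not specify what the optimization consists of or how dialogue is located on the timeline. One common and simple approach, shown here only as an assumption, is to split the audio into short frames and treat frames whose average amplitude exceeds a threshold as speech; the boundaries of each loud run then give start and end times. The function name, frame length, and threshold are illustrative.

```python
def find_speech_segments(samples, sample_rate, frame_ms=20, threshold=0.02):
    """Return (start_s, end_s) spans whose mean absolute amplitude exceeds threshold."""
    frame_len = max(1, sample_rate * frame_ms // 1000)
    segments = []
    start = None  # sample index where the current loud run began
    for offset in range(0, len(samples), frame_len):
        frame = samples[offset:offset + frame_len]
        loud = sum(abs(x) for x in frame) / len(frame) > threshold
        if loud and start is None:
            start = offset  # a loud run begins
        elif not loud and start is not None:
            segments.append((start / sample_rate, offset / sample_rate))
            start = None    # the loud run ends
    if start is not None:   # audio ended while still loud
        segments.append((start / sample_rate, len(samples) / sample_rate))
    return segments


# Synthetic signal at 1 kHz: 100 ms of silence, 200 ms of sound, 100 ms of silence.
rate = 1000
signal = [0.0] * 100 + [0.5] * 200 + [0.0] * 100
segments = find_speech_segments(signal, rate)
```

A fixed threshold is fragile on noisy recordings; a real system would more likely use a trained voice activity detector, but the timeline bookkeeping would look similar.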
Specifically, generating the subtitle file in step S103 further comprises modification, editing, and optimization.
The present invention also provides a video caption generation system, comprising a sound extraction module, a sound processing module, and a subtitle file editing module. The sound extraction module obtains the video sound dialogue information, including the corresponding sound timeline information; the sound processing module converts the obtained video sound dialogue information into text and time codes; and the subtitle file editing module generates an SRT subtitle file from the converted dialogue text and time codes.
In conclusion compared with prior art, the invention has the advantages that carrying out subtitle making through the invention has
It is safe, save human cost, easy to operate, greatly improve subtitle producing efficiency and reduce the difficulty of subtitle making.
The embodiment described above expresses only one implementation of the invention. Its description is relatively specific and detailed, but it should not therefore be understood as limiting the scope of the patent. It should be noted that those of ordinary skill in the art can make various modifications and improvements without departing from the inventive concept, and these all fall within the protection scope of the invention. Therefore, the protection scope of this patent shall be determined by the appended claims.
Claims (5)
1. A video caption generation method, characterized by comprising:
Obtaining video sound dialogue information;
Processing the video sound dialogue information, in which the obtained video sound dialogue information is converted into text;
Generating a subtitle file, in which an SRT subtitle file is generated from the converted dialogue text and time codes.
2. The video caption generation method according to claim 1, characterized in that obtaining the video sound dialogue information comprises obtaining video data and sound data through decoding; the sound data includes a sound timeline.
3. The video caption generation method according to claim 1, characterized in that processing the video sound dialogue information further comprises optimizing the obtained video sound, converting the sound into dialogue text, and converting the sound timeline into the time codes of the SRT subtitle file.
4. The video caption generation method according to claim 1, characterized in that generating the subtitle file further comprises modification, editing, and optimization.
5. A video caption generation system, characterized by comprising a sound extraction module, a sound processing module, and a subtitle file editing module, wherein the sound extraction module obtains video sound dialogue information, including the corresponding sound timeline information; the sound processing module converts the obtained video sound dialogue information into text and time codes; and the subtitle file editing module generates an SRT subtitle file from the converted dialogue text and time codes.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810331983.5A CN108449649A (en) | 2018-04-13 | 2018-04-13 | A kind of video caption generation method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810331983.5A CN108449649A (en) | 2018-04-13 | 2018-04-13 | A kind of video caption generation method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108449649A true CN108449649A (en) | 2018-08-24 |
Family
ID=63199896
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810331983.5A Pending CN108449649A (en) | 2018-04-13 | 2018-04-13 | A kind of video caption generation method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108449649A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117440116A (en) * | 2023-12-11 | 2024-01-23 | 深圳麦风科技有限公司 | Video generation method, device, terminal equipment and readable storage medium |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117440116A (en) * | 2023-12-11 | 2024-01-23 | 深圳麦风科技有限公司 | Video generation method, device, terminal equipment and readable storage medium |
CN117440116B (en) * | 2023-12-11 | 2024-03-22 | 深圳麦风科技有限公司 | Video generation method, device, terminal equipment and readable storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101188697A (en) | A method for importing caption in manuscript in non editing status | |
CN109635154B (en) | Method for automatically generating Internet image-text manuscript based on manuscript and news program | |
US11064245B1 (en) | Piecewise hybrid video and audio synchronization | |
CN106340294A (en) | Synchronous translation-based news live streaming subtitle on-line production system | |
CN103905744A (en) | Rendering synthesis method and system | |
CN109637561A (en) | A kind of multi-channel sound video automated intelligent edit methods | |
CN109769142B (en) | Video cutting method and system for urban media wall light show | |
CN202406198U (en) | Caption overlaying system facing to real-time audio/video stream | |
US20230185518A1 (en) | Video playing method and device | |
CN101188698B (en) | A device for importing caption in manuscript in non editing status | |
CN207854084U (en) | A kind of caption display system | |
CN108449649A (en) | A kind of video caption generation method | |
US20070038781A1 (en) | Apparatus and method for converting contents | |
CN108366305A (en) | A kind of code stream without subtitle shows the method and system of subtitle by speech recognition | |
CN102036121A (en) | Digital television browser based mosaic video navigation method | |
CN100558146C (en) | A kind of figure manufacturing-broadcasting system of serving as theme and driving with manuscript | |
CN104168509B (en) | Program editing method applicable to environment with various material sources | |
JP2007202094A (en) | Semi-real-time caption producing and sending system | |
CN115706762A (en) | Method and device for generating subtitles by real-time voice recognition of mobile live broadcast | |
US10334325B2 (en) | Use of a program schedule to modify an electronic dictionary of a closed-captioning generator | |
CN102739990B (en) | Subtitle material editing method and device with independent broadcast characteristics | |
CN101188023B (en) | A method for making drawing in writing manuscript | |
CN117354603A (en) | Video generation method, device, equipment and storage medium | |
CN114915802B (en) | Virtual reality multifunctional live broadcast system and method | |
CN114554246B (en) | UGC mode-based medical science popularization video production method and system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
WD01 | Invention patent application deemed withdrawn after publication | | Application publication date: 20180824 |