CN108449649A

CN108449649A - Video subtitle generation method

Info

Publication number: CN108449649A
Application number: CN201810331983.5A
Authority: CN
Inventors: 郑俊杰
Original assignee: Individual
Current assignee: Individual
Priority date: 2018-04-13
Filing date: 2018-04-13
Publication date: 2018-08-24

Abstract

The present invention provides a video subtitle generation method that obtains the dialogue audio information of a video, converts the obtained dialogue audio information into text, and generates an srt subtitle file from the converted dialogue text and timecodes. The advantages of the invention are that subtitle production carried out with it is safer, saves labor costs, is easy to operate, greatly improves subtitle production efficiency, and reduces the difficulty of subtitle production.

Description

A video subtitle generation method

Technical field

The invention relates mainly to the field of multimedia technology, and in particular to a video subtitle generation method.

Background art

With the continuous development of video and multimedia technology, subtitle production has become increasingly common in television program production; its technical production standards keep rising and its uses keep widening. Subtitle production is most widely used for TV station programs and for film and television dramas, and it is also involved in activities such as video surveillance, civil advertising, and wedding video recording.

In general, any work that adds information such as text, pictures, animation, or special effects to an original video can be called subtitle production; put simply, wherever subtitles appear, subtitle production is involved. Subtitles are the non-visual content of television, film, and stage works, such as dialogue, displayed in written form; the term also covers text added in the post-production of film and television works. For example, the dialogue subtitles of a film or television work generally appear at the bottom of the screen. In addition, the spoken content of a program can be displayed as subtitles, which helps viewers with weaker hearing understand the program. During subtitle production, the user terminal usually needs to reference a large amount of subtitle material on a server, and this material exists in the form of subtitle material files. When a subtitle material file is processed, a companion file often has to be generated for it so that the user can view and edit it, for example by parsing and rendering the subtitle material file. However, parsing and rendering subtitle material files takes considerable time.

Traditional subtitle production is carried out with a character generator (titler), and it is most widely used for TV station program subtitles and for the subtitles of films and television dramas. At present, subtitle production and broadcast on a TV station's broadcast line is either completed by an operator or aided by simple controls.

This subtitle production method has the following disadvantages:

First, safety is low: because operators edit the subtitle list manually, human editing errors are likely;

Second, personnel are wasted: the work requires multiple people, multiple shifts, and multiple channels every day;

Third, it is highly specialized: it requires good listening comprehension, and the operation is difficult.

Summary of the invention

To solve the above problems, the present invention provides a video subtitle generation method, characterized in that it comprises:

Obtaining the dialogue audio information of a video;

Processing the dialogue audio information, in which the obtained dialogue audio information is converted into text;

Generating a subtitle file, in which an srt subtitle file is generated from the converted dialogue text and timecodes.

Wherein obtaining the dialogue audio information comprises obtaining video data and audio data through decoding; the audio data includes an audio timeline.
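The decoding step can be illustrated with a minimal sketch. The patent does not name a decoder, so the use of the ffmpeg command-line tool, the file names, and the 16 kHz mono PCM output are all assumptions made for illustration. In uncompressed PCM audio the timeline is implicit: a sample's position divided by the sample rate gives its time in seconds.

```python
def build_audio_extract_command(video_path: str, wav_path: str,
                                sample_rate: int = 16000) -> list[str]:
    """Build an ffmpeg command that decodes a video and keeps only its audio.

    Assumption: ffmpeg is the decoder; the patent does not specify one.
    """
    return [
        "ffmpeg",
        "-y",                     # overwrite the output file if it exists
        "-i", video_path,         # input video container
        "-vn",                    # drop the video stream
        "-acodec", "pcm_s16le",   # decode audio to 16-bit little-endian PCM
        "-ar", str(sample_rate),  # resample to the assumed rate
        "-ac", "1",               # mix down to a single channel
        wav_path,
    ]


# Hypothetical file names, used only for illustration.
command = build_audio_extract_command("episode.mp4", "episode.wav")
print(" ".join(command))
```

To actually run the command, it could be passed to `subprocess.run(command, check=True)` on a machine where ffmpeg is installed.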

Wherein processing the dialogue audio information further comprises optimizing the obtained video audio, converting the audio into dialogue text, and converting the audio timeline into the timecodes of the srt subtitle file.
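The conversion of the audio timeline into srt timecodes can be sketched as follows. The sketch assumes that positions on the audio timeline are available as seconds (a representation the patent does not specify) and uses the `HH:MM:SS,mmm` form used by srt files; the function name is hypothetical.

```python
def seconds_to_srt_timecode(seconds: float) -> str:
    """Convert a position on the audio timeline (in seconds) to HH:MM:SS,mmm."""
    if seconds < 0:
        raise ValueError("time must be non-negative")
    total_ms = round(seconds * 1000)          # work in whole milliseconds
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, millis = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


print(seconds_to_srt_timecode(3725.5))  # 01:02:05,500
```

Rounding to whole milliseconds before splitting into fields avoids a millisecond field of 1000 when a value such as 59.9996 seconds is formatted.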

Wherein generating the subtitle file further comprises modification, editing, and optimization.

The present invention also provides a video subtitle generation system comprising a sound extraction module, a sound processing module, and a subtitle file editing module. The sound extraction module obtains the dialogue audio information of the video, which includes the corresponding audio timeline information; the sound processing module converts the obtained dialogue audio information into text and timecodes; and the subtitle file editing module generates an srt subtitle file from the converted dialogue text and timecodes.
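One possible way to wire the three modules together is sketched below. The patent names the modules but not their interfaces, so the `Segment` type, the callable signatures, and the stub extractor and recognizer are hypothetical; a real system would substitute an actual decoder and speech recognition engine.

```python
from dataclasses import dataclass
from typing import Callable, Iterable


@dataclass
class Segment:
    start: float  # seconds on the audio timeline
    end: float
    text: str     # recognized dialogue text


# Hypothetical interfaces for the first two modules.
Extractor = Callable[[str], bytes]                 # video path -> audio data
Recognizer = Callable[[bytes], Iterable[Segment]]  # audio data -> timed text


def _timecode(seconds: float) -> str:
    ms = round(seconds * 1000)
    h, ms = divmod(ms, 3_600_000)
    m, ms = divmod(ms, 60_000)
    s, ms = divmod(ms, 1000)
    return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"


def write_srt(segments: Iterable[Segment]) -> str:
    """Subtitle file editing module: number the cues and format them as srt."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n{_timecode(seg.start)} --> {_timecode(seg.end)}\n{seg.text}\n"
        )
    return "\n".join(blocks)  # cues are separated by one blank line


def generate_subtitles(video_path: str, extract: Extractor,
                       recognize: Recognizer) -> str:
    audio = extract(video_path)         # sound extraction module
    return write_srt(recognize(audio))  # sound processing + subtitle editing


# Stubs standing in for a real decoder and speech recognizer.
def fake_extract(video_path: str) -> bytes:
    return b""


def fake_recognize(audio: bytes) -> list[Segment]:
    return [Segment(0.0, 1.5, "Hello."), Segment(2.0, 4.25, "How are you?")]


srt_text = generate_subtitles("episode.mp4", fake_extract, fake_recognize)
print(srt_text)
```

Passing the extractor and recognizer as callables keeps the sketch independent of any particular decoder or recognition engine, neither of which the patent specifies.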

In summary, compared with the prior art, the advantages of the invention are that subtitle production carried out with it is safer, saves labor costs, is easy to operate, greatly improves subtitle production efficiency, and reduces the difficulty of subtitle production.

Brief description of the drawings

Fig. 1 is a flow diagram illustrating the video subtitle generation method of the present invention.

Detailed description

The present invention is described in detail below with reference to the accompanying drawing and in conjunction with an embodiment. It should be understood that the specific embodiment described here serves only to illustrate and explain the invention and is not intended to limit it.

This embodiment provides a video subtitle generation method. Fig. 1 is a flowchart of the video subtitle generation method according to an embodiment of the present invention. As shown in Fig. 1, the method comprises the following steps:

Step S101: obtain the dialogue audio information of the video;

Step S102: process the dialogue audio information, converting the obtained dialogue audio information into text;

Step S103: generate a subtitle file, producing an srt subtitle file from the converted dialogue text and timecodes.

Specifically, obtaining the dialogue audio information in step S101 comprises obtaining video data and audio data through decoding; the audio data includes an audio timeline.

Specifically, processing the dialogue audio information in step S102 further comprises optimizing the obtained video audio, converting the audio into dialogue text, and converting the audio timeline into the timecodes of the srt subtitle file.

Specifically, generating the subtitle file in step S103 further comprises modification, editing, and optimization.
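The patent does not say what modification, editing, or optimization of the subtitle file consists of. As one illustrative assumption, a common edit is to shift every cue by a fixed offset when the subtitles run early or late against the picture. The function name and the sample text below are hypothetical.

```python
import re

# Matches an srt timecode such as 00:01:02,345.
_TIMECODE = re.compile(r"(\d{2,}):(\d{2}):(\d{2}),(\d{3})")


def shift_srt(srt_text: str, offset_ms: int) -> str:
    """Shift every cue time in an srt document by offset_ms, clamping at zero."""

    def shift(match: re.Match) -> str:
        h, m, s, ms = (int(group) for group in match.groups())
        total = max(0, ((h * 60 + m) * 60 + s) * 1000 + ms + offset_ms)
        h, rem = divmod(total, 3_600_000)
        m, rem = divmod(rem, 60_000)
        s, ms = divmod(rem, 1000)
        return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"

    # Only rewrite timing lines, so timecode-like strings in dialogue are kept.
    lines = [
        _TIMECODE.sub(shift, line) if " --> " in line else line
        for line in srt_text.split("\n")
    ]
    return "\n".join(lines)


original = "1\n00:00:01,000 --> 00:00:02,500\nHi\n"
print(shift_srt(original, 750))
```

A negative offset moves cues earlier; times that would fall before the start of the video are clamped to `00:00:00,000` in this sketch.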

The embodiment described above expresses only one implementation of the present invention. Its description is relatively specific and detailed, but it should not therefore be understood as limiting the scope of the patent. It should be noted that a person of ordinary skill in the art can make various modifications and improvements without departing from the inventive concept, and these all fall within the scope of protection of the invention. The scope of protection of this patent shall therefore be determined by the appended claims.

Claims

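1. A video subtitle generation method, characterized in that it comprises:

Obtaining the dialogue audio information of a video;

Processing the dialogue audio information, in which the obtained dialogue audio information is converted into text;

Generating a subtitle file, in which an srt subtitle file is generated from the converted dialogue text and timecodes.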

2. The video subtitle generation method according to claim 1, characterized in that obtaining the dialogue audio information comprises obtaining video data and audio data through decoding; the audio data includes an audio timeline.

3. The video subtitle generation method according to claim 1, characterized in that processing the dialogue audio information further comprises optimizing the obtained video audio, converting the audio into dialogue text, and converting the audio timeline into the timecodes of the srt subtitle file.

4. The video subtitle generation method according to claim 1, characterized in that generating the subtitle file further comprises modification, editing, and optimization.

5. A video subtitle generation system, characterized in that it comprises a sound extraction module, a sound processing module, and a subtitle file editing module; the sound extraction module obtains the dialogue audio information of the video, which includes the corresponding audio timeline information; the sound processing module converts the obtained dialogue audio information into text and timecodes; and the subtitle file editing module generates an srt subtitle file from the converted dialogue text and timecodes.