CN108449649A - A kind of video caption generation method - Google Patents

A kind of video caption generation method Download PDF

Info

Publication number
CN108449649A
Authority
CN
China
Prior art keywords
sound
video
subtitle
word
dialogue
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810331983.5A
Other languages
Chinese (zh)
Inventor
郑俊杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN201810331983.5A priority Critical patent/CN108449649A/en
Publication of CN108449649A publication Critical patent/CN108449649A/en
Pending legal-status Critical Current

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/488Data services, e.g. news ticker
    • H04N21/4884Data services, e.g. news ticker for displaying subtitles
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/222Studio circuitry; Studio devices; Studio equipment
    • H04N5/262Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
    • H04N5/278Subtitling

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Studio Circuits (AREA)

Abstract

The present invention provides a video caption generation method that obtains the dialogue audio information of a video, converts the obtained dialogue audio information into text, and generates an SRT subtitle file from the converted dialogue text and time codes. The advantages of the invention are that subtitle production carried out with it offers high safety, saves labor costs, is easy to operate, greatly improves subtitle production efficiency, and reduces the difficulty of subtitle production.

Description

A video caption generation method
Technical field
The invention relates mainly to the field of multimedia technology, and in particular to a video caption generation method.
Background
With the continuous development of video and multimedia technology, subtitle production has become increasingly widespread in television program production. Its technical requirements keep rising and its uses keep broadening. Subtitle production is most widely used for television station programs and for film and television drama subtitles; it is also involved in activities such as video surveillance, civilian advertising, and wedding video recording.
In general, any work that adds text, pictures, animation, special effects, or similar information to an original video can be called subtitle production. Put simply, wherever subtitles appear, subtitle production is involved. Subtitles are the non-visual content of television, film, and stage works, such as dialogue, displayed in written form; the term also covers text added during the post-production of film and television works. For example, the dialogue subtitles of a film or television work generally appear at the bottom of the screen. The spoken content of a program can also be displayed as subtitles, which helps viewers with weaker hearing understand the program. During subtitle production, a user terminal usually needs to reference a large amount of subtitle material on a server. This material exists in the form of subtitle material files, and when a subtitle material file is processed, a CompanionFile often has to be generated for it so that the user can view and edit it, for example by parsing and rendering the subtitle material file. However, parsing and rendering subtitle material files can take considerable time.
Traditional subtitle production is done with a subtitle machine, most commonly for television station program subtitles and for film and television drama subtitles. At present, subtitle production and playout on a television station's broadcast line is either completed by an operator or assisted by simple control.
This subtitle production method has the following disadvantages:
First, low safety: because operators edit the subtitle list manually, manual editing accidents are likely;
Second, wasted personnel: multiple people are needed every day, working multiple shifts across multiple channels;
Third, high professional demands: good listening comprehension is required, and operation is difficult.
Summary of the invention
To solve the above problems, the present invention provides a video caption generation method, characterized by comprising:
obtaining video dialogue audio information;
processing the video dialogue audio information, converting the obtained video dialogue audio information into text;
generating a subtitle file, producing an SRT subtitle file from the converted dialogue text and time codes.
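The final step above produces an SRT file. As a rough illustration of what that output looks like, the sketch below renders a list of timed dialogue lines as SRT text. The `Cue` type and the `render_srt` function are hypothetical names that do not come from the patent, and the layout (index line, `start --> end` line, text, blank line) follows the commonly used SubRip convention rather than anything the patent specifies.

```python
from dataclasses import dataclass


@dataclass
class Cue:
    """One subtitle entry (hypothetical type, not named in the patent)."""
    start: str  # already formatted as HH:MM:SS,mmm
    end: str    # already formatted as HH:MM:SS,mmm
    text: str   # converted dialogue text


def render_srt(cues: list[Cue]) -> str:
    """Render cues as SRT text: index, time range, text, blank separator."""
    blocks = []
    for index, cue in enumerate(cues, start=1):
        # Each block ends with a newline; joining with "\n" adds the blank line.
        blocks.append(f"{index}\n{cue.start} --> {cue.end}\n{cue.text}\n")
    return "\n".join(blocks)
```

Writing the returned string to a file with a `.srt` extension and UTF-8 encoding is usually enough for common players to load it, although player behavior varies.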
Wherein obtaining the video dialogue audio information comprises obtaining video data and audio data through decoding; the audio data includes an audio timeline.
Wherein processing the video dialogue audio information further comprises optimizing the obtained video audio, converting the audio into dialogue text, and converting the audio timeline into the time codes of the SRT subtitle file.
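The conversion from a position on the audio timeline to an SRT time code is simple arithmetic. The sketch below assumes timeline positions are given in seconds as floating-point numbers, which the patent does not state; the function name `seconds_to_srt_timecode` is likewise hypothetical. The value is rounded to whole milliseconds first so that a position such as 59.9996 s carries over into the next minute instead of producing an invalid field.

```python
def seconds_to_srt_timecode(seconds: float) -> str:
    """Convert a timeline position in seconds to an HH:MM:SS,mmm time code."""
    if seconds < 0:
        raise ValueError("timeline positions must be non-negative")
    # Round once, in integer milliseconds, so carries propagate correctly.
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, millis = divmod(rest, 1_000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"
```

Working in integer milliseconds after a single rounding step avoids the drift that can appear when hours, minutes, and seconds are each derived from the float separately.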
Wherein generating the subtitle file further comprises modification, editing, and optimization.
The present invention also provides a video caption generation system, comprising a sound extraction module, a sound processing module, and a subtitle file editing module. The sound extraction module obtains the video dialogue audio information, including the corresponding audio timeline information; the sound processing module converts the obtained video dialogue audio information into text and time codes; and the subtitle file editing module generates an SRT subtitle file from the converted dialogue text and time codes.
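One way to picture the three modules is as three interchangeable stages wired in sequence. The sketch below is only an assumption about how such a system might be structured: the `SubtitleSystem` class, its field names, and the tuple shapes passed between stages are all hypothetical, and the patent does not describe any programming interface. Each stage is injected as a plain callable so that a real decoder or speech recognizer can be substituted without changing the wiring.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical data shapes passed between the modules.
AudioChunk = tuple[float, float, bytes]  # (start seconds, end seconds, audio)
TimedText = tuple[str, str, str]         # (start time code, end time code, text)


@dataclass
class SubtitleSystem:
    """Chains the three modules named in the text (illustrative only)."""
    extract: Callable[[str], list[AudioChunk]]              # sound extraction
    process: Callable[[list[AudioChunk]], list[TimedText]]  # sound processing
    edit: Callable[[list[TimedText]], str]                  # subtitle file editing

    def run(self, video_path: str) -> str:
        """Return SRT text for the given video by running each stage in order."""
        chunks = self.extract(video_path)
        timed_text = self.process(chunks)
        return self.edit(timed_text)
```

Keeping the stages as injected callables also makes the pipeline easy to exercise with stubs before any audio tooling is available.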
In conclusion compared with prior art, the invention has the advantages that carrying out subtitle making through the invention has It is safe, save human cost, easy to operate, greatly improve subtitle producing efficiency and reduce the difficulty of subtitle making.
Description of the drawings
Fig. 1 is the flow diagram for illustrating a kind of video caption generation method of the present invention.
Specific implementation mode
Come that the present invention will be described in detail below with reference to attached drawing and in conjunction with the embodiments.It should be understood that described herein Specific implementation mode be merely to illustrate and explain the present invention, be not intended to restrict the invention.
A kind of video caption generation method is provided in the present embodiment, and Fig. 1 is video words according to the ... of the embodiment of the present invention The flow chart of curtain generation method, as shown in Figure 1, this method comprises the following steps::
Step S101: obtain video dialogue audio information;
Step S102: process the video dialogue audio information, converting the obtained video dialogue audio information into text;
Step S103: generate a subtitle file, producing an SRT subtitle file from the converted dialogue text and time codes.
Specifically, obtaining the video dialogue audio information in step S101 comprises obtaining video data and audio data through decoding; the audio data includes an audio timeline.
Specifically, processing the video dialogue audio information in step S102 further comprises optimizing the obtained video audio, converting the audio into dialogue text, and converting the audio timeline into the time codes of the SRT subtitle file.
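The patent does not explain how dialogue is located on the audio timeline before it is converted to text. A simple technique sometimes used for this, shown here only as an assumption, is energy thresholding: the audio is split into short frames, and runs of frames whose energy reaches a threshold are treated as speech. The function name `detect_speech_segments` and its parameters are hypothetical.

```python
def detect_speech_segments(
    frame_energies: list[float], frame_ms: int, threshold: float
) -> list[tuple[int, int]]:
    """Return (start_ms, end_ms) spans of consecutive frames at or above threshold."""
    segments = []
    start = None  # start time of the span currently being tracked, if any
    for i, energy in enumerate(frame_energies):
        if energy >= threshold and start is None:
            start = i * frame_ms                    # a loud frame opens a span
        elif energy < threshold and start is not None:
            segments.append((start, i * frame_ms))  # a quiet frame closes it
            start = None
    if start is not None:
        # The audio ended while a span was still open.
        segments.append((start, len(frame_energies) * frame_ms))
    return segments
```

Each returned span gives the start and end of one candidate subtitle cue. Practical systems typically add smoothing and a minimum duration so that brief pauses do not split a sentence.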
Specifically, generating the subtitle file in step S103 further comprises modification, editing, and optimization.
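The patent leaves the modification and editing operations unspecified. A typical example of such an edit, offered here as an assumption, is shifting every cue by a fixed offset when the subtitles run early or late relative to the picture. In the sketch below, cues are hypothetical `(start_ms, end_ms, text)` tuples and `shift_cues` is an illustrative name.

```python
def shift_cues(
    cues: list[tuple[int, int, str]], offset_ms: int
) -> list[tuple[int, int, str]]:
    """Shift every cue by offset_ms, clamping at zero and dropping vanished cues."""
    shifted = []
    for start_ms, end_ms, text in cues:
        new_end = end_ms + offset_ms
        if new_end <= 0:
            continue  # the whole cue now falls before the start of the video
        new_start = max(0, start_ms + offset_ms)
        shifted.append((new_start, new_end, text))
    return shifted
```

A negative offset moves subtitles earlier and a positive offset moves them later. Cues that would end at or before time zero are removed rather than written with negative time codes.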
The present invention also provides a video caption generation system, comprising a sound extraction module, a sound processing module, and a subtitle file editing module. The sound extraction module obtains the video dialogue audio information, including the corresponding audio timeline information; the sound processing module converts the obtained video dialogue audio information into text and time codes; and the subtitle file editing module generates an SRT subtitle file from the converted dialogue text and time codes.
In summary, compared with the prior art, the advantages of the invention are that subtitle production carried out with it offers high safety, saves labor costs, is easy to operate, greatly improves subtitle production efficiency, and reduces the difficulty of subtitle production.
The embodiment described above expresses only one implementation of the present invention. Its description is relatively specific and detailed, but it should not therefore be construed as limiting the scope of the patent. It should be noted that those of ordinary skill in the art can make various modifications and improvements without departing from the inventive concept, and these all fall within the protection scope of the present invention. The protection scope of this patent shall therefore be determined by the appended claims.

Claims (5)

1. A video caption generation method, characterized by comprising:
obtaining video dialogue audio information;
processing the video dialogue audio information, converting the obtained video dialogue audio information into text;
generating a subtitle file, producing an SRT subtitle file from the converted dialogue text and time codes.
2. The video caption generation method according to claim 1, characterized in that obtaining the video dialogue audio information comprises obtaining video data and audio data through decoding; the audio data includes an audio timeline.
3. The video caption generation method according to claim 1, characterized in that processing the video dialogue audio information further comprises optimizing the obtained video audio, converting the audio into dialogue text, and converting the audio timeline into the time codes of the SRT subtitle file.
4. The video caption generation method according to claim 1, characterized in that generating the subtitle file further comprises modification, editing, and optimization.
5. A video caption generation system, characterized by comprising a sound extraction module, a sound processing module, and a subtitle file editing module, wherein the sound extraction module obtains the video dialogue audio information, including the corresponding audio timeline information; the sound processing module converts the obtained video dialogue audio information into text and time codes; and the subtitle file editing module generates an SRT subtitle file from the converted dialogue text and time codes.
CN201810331983.5A 2018-04-13 2018-04-13 A kind of video caption generation method Pending CN108449649A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810331983.5A CN108449649A (en) 2018-04-13 2018-04-13 A kind of video caption generation method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810331983.5A CN108449649A (en) 2018-04-13 2018-04-13 A kind of video caption generation method

Publications (1)

Publication Number Publication Date
CN108449649A true CN108449649A (en) 2018-08-24

Family

ID=63199896

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810331983.5A Pending CN108449649A (en) 2018-04-13 2018-04-13 A kind of video caption generation method

Country Status (1)

Country Link
CN (1) CN108449649A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117440116A (en) * 2023-12-11 2024-01-23 深圳麦风科技有限公司 Video generation method, device, terminal equipment and readable storage medium
CN117440116B (en) * 2023-12-11 2024-03-22 深圳麦风科技有限公司 Video generation method, device, terminal equipment and readable storage medium

Similar Documents

Publication Publication Date Title
CN101188697A (en) A method for importing caption in manuscript in non editing status
CN109635154B (en) Method for automatically generating Internet image-text manuscript based on manuscript and news program
US11064245B1 (en) Piecewise hybrid video and audio synchronization
CN106340294A (en) Synchronous translation-based news live streaming subtitle on-line production system
CN103905744A (en) Rendering synthesis method and system
CN109637561A (en) A kind of multi-channel sound video automated intelligent edit methods
CN109769142B (en) Video cutting method and system for urban media wall light show
CN202406198U (en) Caption overlaying system facing to real-time audio/video stream
US20230185518A1 (en) Video playing method and device
CN101188698B (en) A device for importing caption in manuscript in non editing status
CN207854084U (en) A kind of caption display system
CN108449649A (en) A kind of video caption generation method
US20070038781A1 (en) Apparatus and method for converting contents
CN108366305A (en) A kind of code stream without subtitle shows the method and system of subtitle by speech recognition
CN102036121A (en) Digital television browser based mosaic video navigation method
CN100558146C (en) A kind of figure manufacturing-broadcasting system of serving as theme and driving with manuscript
CN104168509B (en) Program editing method applicable to environment with various material sources
JP2007202094A (en) Semi-real-time caption producing and sending system
CN115706762A (en) Method and device for generating subtitles by real-time voice recognition of mobile live broadcast
US10334325B2 (en) Use of a program schedule to modify an electronic dictionary of a closed-captioning generator
CN102739990B (en) Subtitle material editing method and device with independent broadcast characteristics
CN101188023B (en) A method for making drawing in writing manuscript
CN117354603A (en) Video generation method, device, equipment and storage medium
CN114915802B (en) Virtual reality multifunctional live broadcast system and method
CN114554246B (en) UGC mode-based medical science popularization video production method and system

Legal Events

Date Code Title Description
PB01 Publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20180824